nuclear receptor binding SET domain protein 1 (NSD1) - coding DNA reference sequence

(used for variant description)

(last modified October 25, 2016)


This file was created to facilitate the description of sequence variants on transcript NM_022455.4 in the NSD1 gene based on a coding DNA reference sequence following the HGVS recommendations.

The sequence was taken from NG_009821.1, covering NSD1 transcript NM_022455.4.


Please note that introns are available by clicking on the exon numbers above the sequence.
 (upstream sequence)
                                                   .                g.5771
                                           gacgcggggggagggggg       c.-121

 .         .         .         .         .         .                g.5831
 tgcggcgagcggccccgctctctccccaccgctccgctcgcaccccagtgtaatgagggt       c.-61

 .         .         .         .         .   | 02     .             g.7025
 caccccctccccccagctggcccgggagggggcgcggggcacg | gttgatgccggcccagg    c.-1

          .         .         .         .         .         .       g.7085
 ATGGATCAGACCTGTGAACTACCCAGAAGAAATTGTCTGCTGCCCTTTTCCAATCCAGTG       c.60
 M  D  Q  T  C  E  L  P  R  R  N  C  L  L  P  F  S  N  P  V         p.20

          .         .         .         .         .         .       g.7145
 AATTTAGATGCCCCTGAAGACAAGGACAGCCCTTTCGGTAATGGTCAATCCAATTTTTCT       c.120
 N  L  D  A  P  E  D  K  D  S  P  F  G  N  G  Q  S  N  F  S         p.40

          .         .         .         .         .         .       g.7205
 GAGCCACTTAATGGGTGTACTATGCAGTTATCGACTGTCAGTGGAACATCCCAAAATGCT       c.180
 E  P  L  N  G  C  T  M  Q  L  S  T  V  S  G  T  S  Q  N  A         p.60

          .         .         .         .         .         .       g.7265
 TATGGACAAGATTCTCCATCTTGTTACATTCCACTGCGGAGACTACAGGATTTGGCCTCC       c.240
 Y  G  Q  D  S  P  S  C  Y  I  P  L  R  R  L  Q  D  L  A  S         p.80

          .         .         .         .         .         .       g.7325
 ATGATCAATGTAGAGTATTTAAATGGGTCTGCTGATGGATCAGAATCCTTTCAAGACCCT       c.300
 M  I  N  V  E  Y  L  N  G  S  A  D  G  S  E  S  F  Q  D  P         p.100

          .         .         .         .         .         .       g.7385
 GAAAAAAGTGATTCAAGAGCTCAGACGCCAATTGTTTGCACTTCCTTGAGTCCTGGTGGT       c.360
 E  K  S  D  S  R  A  Q  T  P  I  V  C  T  S  L  S  P  G  G         p.120

          .         .         .         .         .         .       g.7445
 CCTACAGCACTTGCTATGAAACAGGAACCCTCTTGTAATAACTCCCCTGAACTCCAGGTA       c.420
 P  T  A  L  A  M  K  Q  E  P  S  C  N  N  S  P  E  L  Q  V         p.140

          .         .         .         .         .         .       g.7505
 AAAGTAACAAAGACTATCAAGAATGGCTTTCTGCACTTTGAGAATTTTACTTGTGTGGAC       c.480
 K  V  T  K  T  I  K  N  G  F  L  H  F  E  N  F  T  C  V  D         p.160

          .         .         .         .         .         .       g.7565
 GATGCAGATGTAGATTCTGAAATGGACCCAGAACAGCCAGTCACAGAGGATGAGAGTATA       c.540
 D  A  D  V  D  S  E  M  D  P  E  Q  P  V  T  E  D  E  S  I         p.180

          .         .         .         .         .         .       g.7625
 GAGGAGATCTTTGAGGAAACTCAGACCAATGCCACCTGCAATTATGAGACTAAATCAGAG       c.600
 E  E  I  F  E  E  T  Q  T  N  A  T  C  N  Y  E  T  K  S  E         p.200

          .         .         .         .         .         .       g.7685
 AATGGTGTAAAAGTGGCCATGGGAAGTGAACAAGACAGCACACCAGAGAGTAGACACGGT       c.660
 N  G  V  K  V  A  M  G  S  E  Q  D  S  T  P  E  S  R  H  G         p.220

          .         .         .         .         .         .       g.7745
 GCAGTCAAATCGCCATTCTTGCCATTAGCTCCTCAGACTGAAACACAGAAAAATAAGCAA       c.720
 A  V  K  S  P  F  L  P  L  A  P  Q  T  E  T  Q  K  N  K  Q         p.240

          .         .         .         .         .         .       g.7805
 AGAAATGAAGTGGACGGCAGCAATGAAAAAGCAGCCCTTCTCCCAGCCCCCTTTTCACTA       c.780
 R  N  E  V  D  G  S  N  E  K  A  A  L  L  P  A  P  F  S  L         p.260

          .         .         .         .         .         .       g.7865
 GGAGACACAAACATTACAATAGAAGAGCAATTAAACTCAATAAATTTATCTTTTCAGGAT       c.840
 G  D  T  N  I  T  I  E  E  Q  L  N  S  I  N  L  S  F  Q  D         p.280

          .         .         .         .         .         .       g.7925
 GATCCAGATTCCAGTACCAGTACATTAGGAAACATGCTAGAATTACCTGGAACTTCATCA       c.900
 D  P  D  S  S  T  S  T  L  G  N  M  L  E  L  P  G  T  S  S         p.300

          .         .        | 03.         .         .         .    g.63838
 TCATCTACTTCACAGGAATTGCCATTT | TGTCAACCTAAGAAAAAGTCTACGCCACTGAAG    c.960
 S  S  T  S  Q  E  L  P  F   | C  Q  P  K  K  K  S  T  P  L  K      p.320

          .         .         .         .         .         .       g.63898
 TATGAAGTTGGAGATCTCATCTGGGCAAAATTCAAGAGACGCCCATGGTGGCCCTGCAGG       c.1020
 Y  E  V  G  D  L  I  W  A  K  F  K  R  R  P  W  W  P  C  R         p.340

          .         .         .         .    | 04    .         .    g.76058
 ATTTGTTCTGATCCGTTGATTAACACACATTCAAAAATGAAAG | TTTCCAACCGGAGGCCC    c.1080
 I  C  S  D  P  L  I  N  T  H  S  K  M  K  V |   S  N  R  R  P      p.360

          .         .         .         .         .         .       g.76118
 TATCGGCAGTACTACGTGGAGGCTTTTGGAGATCCTTCTGAGAGAGCCTGGGTGGCTGGA       c.1140
 Y  R  Q  Y  Y  V  E  A  F  G  D  P  S  E  R  A  W  V  A  G         p.380

          .         .         .         .         .         .       g.76178
 AAAGCAATCGTCATGTTTGAAGGCAGACATCAATTCGAAGAGCTACCTGTCCTTAGGAGA       c.1200
 K  A  I  V  M  F  E  G  R  H  Q  F  E  E  L  P  V  L  R  R         p.400

          .         .         .       | 05 .         .         .    g.81581
 AGAGGGAAACAGAAAGAAAAAGGATATAGGCATAAG | GTTCCTCAGAAAATTTTGAGTAAA    c.1260
 R  G  K  Q  K  E  K  G  Y  R  H  K   | V  P  Q  K  I  L  S  K      p.420

          .         .         .         .         .         .       g.81641
 TGGGAAGCCAGTGTTGGACTTGCAGAACAGTATGATGTTCCCAAGGGGTCAAAGAACCGA       c.1320
 W  E  A  S  V  G  L  A  E  Q  Y  D  V  P  K  G  S  K  N  R         p.440

          .         .         .         .         .         .       g.81701
 AAATGTATTCCTGGTTCAATCAAGTTGGACAGTGAAGAAGATATGCCATTTGAAGACTGC       c.1380
 K  C  I  P  G  S  I  K  L  D  S  E  E  D  M  P  F  E  D  C         p.460

          .         .         .         .         .         .       g.81761
 ACAAATGATCCTGAGTCAGAACATGACCTGTTGCTTAATGGCTGTTTGAAATCACTGGCT       c.1440
 T  N  D  P  E  S  E  H  D  L  L  L  N  G  C  L  K  S  L  A         p.480

          .         .         .         .         .         .       g.81821
 TTTGATTCTGAACATTCTGCAGATGAGAAGGAAAAGCCTTGCGCTAAATCTCGAGCCAGA       c.1500
 F  D  S  E  H  S  A  D  E  K  E  K  P  C  A  K  S  R  A  R         p.500

          .         .         .         .         .         .       g.81881
 AAGAGCTCTGATAATCCAAAAAGGACTAGTGTGAAAAAGGGCCACATACAATTTGAAGCA       c.1560
 K  S  S  D  N  P  K  R  T  S  V  K  K  G  H  I  Q  F  E  A         p.520

          .         .         .         .         .         .       g.81941
 CATAAAGATGAACGGAGGGGAAAGATTCCAGAGAACCTTGGCCTAAACTTTATCTCTGGG       c.1620
 H  K  D  E  R  R  G  K  I  P  E  N  L  G  L  N  F  I  S  G         p.540

          .         .         .         .         .         .       g.82001
 GATATATCTGATACGCAGGCCTCTAATGAACTTTCCAGGATAGCAAATAGCCTCACAGGG       c.1680
 D  I  S  D  T  Q  A  S  N  E  L  S  R  I  A  N  S  L  T  G         p.560

          .         .         .         .         .         .       g.82061
 TCCAACACTGCCCCAGGAAGTTTTCTGTTTTCTTCCTGTGGAAAAAACACTGCAAAGAAA       c.1740
 S  N  T  A  P  G  S  F  L  F  S  S  C  G  K  N  T  A  K  K         p.580

          .         .         .         .         .         .       g.82121
 GAATTTGAGACTTCAAATGGTGACTCTTTATTGGGCTTGCCTGAGGGTGCTTTGATCTCA       c.1800
 E  F  E  T  S  N  G  D  S  L  L  G  L  P  E  G  A  L  I  S         p.600

          .         .         .         .         .         .       g.82181
 AAGTGTTCTCGAGAGAAGAATAAACCCCAACGAAGCCTGGTGTGTGGTTCAAAAGTGAAG       c.1860
 K  C  S  R  E  K  N  K  P  Q  R  S  L  V  C  G  S  K  V  K         p.620

          .         .         .         .         .         .       g.82241
 CTCTGCTATATTGGAGCAGGTGATGAGGAAAAGCGAAGTGATTCCATTAGTATCTGTACC       c.1920
 L  C  Y  I  G  A  G  D  E  E  K  R  S  D  S  I  S  I  C  T         p.640

          .         .         .         .         .         .       g.82301
 ACTTCTGATGATGGAAGCAGTGACCTGGATCCCATAGAACACAGCTCAGAGTCTGATAAC       c.1980
 T  S  D  D  G  S  S  D  L  D  P  I  E  H  S  S  E  S  D  N         p.660

          .         .         .         .         .         .       g.82361
 AGTGTCCTTGAAATTCCAGATGCTTTCGATAGAACAGAGAACATGTTATCTATGCAGAAA       c.2040
 S  V  L  E  I  P  D  A  F  D  R  T  E  N  M  L  S  M  Q  K         p.680

          .         .         .         .         .         .       g.82421
 AATGAAAAGATAAAGTATTCTAGGTTTGCTGCCACAAACACTAGGGTAAAAGCAAAACAG       c.2100
 N  E  K  I  K  Y  S  R  F  A  A  T  N  T  R  V  K  A  K  Q         p.700

          .         .         .         .         .         .       g.82481
 AAGCCTCTCATTAGTAACTCACATACAGACCACTTAATGGGTTGTACTAAGAGTGCAGAG       c.2160
 K  P  L  I  S  N  S  H  T  D  H  L  M  G  C  T  K  S  A  E         p.720

          .         .         .         .         .         .       g.82541
 CCTGGAACCGAGACGTCTCAGGTTAATCTCTCTGATCTGAAGGCATCTACTCTTGTTCAC       c.2220
 P  G  T  E  T  S  Q  V  N  L  S  D  L  K  A  S  T  L  V  H         p.740

          .         .         .         .         .         .       g.82601
 AAACCCCAGTCAGATTTTACAAATGATGCTCTCTCTCCAAAATTCAACCTGTCATCAAGC       c.2280
 K  P  Q  S  D  F  T  N  D  A  L  S  P  K  F  N  L  S  S  S         p.760

          .         .         .         .         .         .       g.82661
 ATATCCAGTGAGAACTCGTTAATAAAGGGTGGGGCAGCAAATCAAGCTCTATTACATTCG       c.2340
 I  S  S  E  N  S  L  I  K  G  G  A  A  N  Q  A  L  L  H  S         p.780

          .         .         .         .         .         .       g.82721
 AAAAGCAAACAGCCCAAGTTCCGAAGTATAAAGTGCAAACACAAAGAAAATCCAGTTATG       c.2400
 K  S  K  Q  P  K  F  R  S  I  K  C  K  H  K  E  N  P  V  M         p.800

          .         .         .         .         .         .       g.82781
 GCAGAACCCCCAGTTATAAATGAGGAGTGCAGTTTGAAATGCTGCTCTTCTGATACCAAA       c.2460
 A  E  P  P  V  I  N  E  E  C  S  L  K  C  C  S  S  D  T  K         p.820

          .         .         .         .         .         .       g.82841
 GGCTCTCCTTTGGCCAGCATTTCTAAAAGTGGGAAAGTGGATGGTCTAAAACTACTGAAC       c.2520
 G  S  P  L  A  S  I  S  K  S  G  K  V  D  G  L  K  L  L  N         p.840

          .         .         .         .         .         .       g.82901
 AATATGCATGAGAAAACCAGGGATTCAAGTGACATAGAAACAGCAGTGGTGAAACATGTT       c.2580
 N  M  H  E  K  T  R  D  S  S  D  I  E  T  A  V  V  K  H  V         p.860

          .         .         .         .         .         .       g.82961
 TTATCCGAGTTGAAGGAACTCTCTTACAGATCCTTAGGTGAGGATGTCAGTGACTCTGGA       c.2640
 L  S  E  L  K  E  L  S  Y  R  S  L  G  E  D  V  S  D  S  G         p.880

          .         .         .         .         .         .       g.83021
 ACATCAAAGCCATCAAAACCATTACTTTTCTCTTCTGCTTCTAGTCAGAATCACATACCT       c.2700
 T  S  K  P  S  K  P  L  L  F  S  S  A  S  S  Q  N  H  I  P         p.900

          .         .         .         .         .         .       g.83081
 ATTGAACCAGACTACAAATTCAGTACATTGCTAATGATGTTGAAAGATATGCATGATAGT       c.2760
 I  E  P  D  Y  K  F  S  T  L  L  M  M  L  K  D  M  H  D  S         p.920

          .         .         .         .         .         .       g.83141
 AAGACGAAGGAGCAGCGGTTGATGACTGCTCAAAACCTGGTCTCTTACCGGAGTCCTGGT       c.2820
 K  T  K  E  Q  R  L  M  T  A  Q  N  L  V  S  Y  R  S  P  G         p.940

          .         .         .         .         .         .       g.83201
 CGTGGGGACTGTTCTACTAATAGTCCTGTAGGAGTCTCTAAGGTTTTGGTTTCAGGAGGC       c.2880
 R  G  D  C  S  T  N  S  P  V  G  V  S  K  V  L  V  S  G  G         p.960

          .         .         .         .         .         .       g.83261
 TCCACACACAATTCAGAGAAAAAGGGAGATGGCACTCAGAACTCCGCCAATCCTAGCCCT       c.2940
 S  T  H  N  S  E  K  K  G  D  G  T  Q  N  S  A  N  P  S  P         p.980

          .         .         .         .         .         .       g.83321
 AGTGGGGGTGACTCTGCATTATCTGGCGAGTTGTCTGCTTCCCTACCTGGCTTACTGTCC       c.3000
 S  G  G  D  S  A  L  S  G  E  L  S  A  S  L  P  G  L  L  S         p.1000

          .         .         .         .         .         .       g.83381
 GACAAGAGAGACCTCCCTGCTTCTGGTAAAAGTCGTTCAGACTGTGTTACTAGGCGCAAC       c.3060
 D  K  R  D  L  P  A  S  G  K  S  R  S  D  C  V  T  R  R  N         p.1020

          .         .         .         .         .         .       g.83441
 TGTGGACGATCAAAGCCTTCATCCAAATTGCGAGATGCTTTTTCAGCCCAAATGGTAAAG       c.3120
 C  G  R  S  K  P  S  S  K  L  R  D  A  F  S  A  Q  M  V  K         p.1040

          .         .         .         .         .         .       g.83501
 AACACAGTGAACCGTAAAGCCTTAAAGACCGAGCGCAAAAGAAAACTGAATCAGCTTCCA       c.3180
 N  T  V  N  R  K  A  L  K  T  E  R  K  R  K  L  N  Q  L  P         p.1060

          .         .         .         .         .         .       g.83561
 AGTGTGACTCTTGATGCTGTACTGCAGGGAGACCGAGAACGTGGAGGTTCATTGAGAGGT       c.3240
 S  V  T  L  D  A  V  L  Q  G  D  R  E  R  G  G  S  L  R  G         p.1080

          .         .         .         .         .         .       g.83621
 GGGGCAGAAGATCCTAGTAAAGAGGATCCCCTTCAGATAATGGGCCACTTAACAAGTGAA       c.3300
 G  A  E  D  P  S  K  E  D  P  L  Q  I  M  G  H  L  T  S  E         p.1100

          .         .         .         .         .         .       g.83681
 GATGGTGACCATTTTTCTGATGTGCATTTCGATAGCAAGGTTAAGCAATCTGATCCTGGT       c.3360
 D  G  D  H  F  S  D  V  H  F  D  S  K  V  K  Q  S  D  P  G         p.1120

          .         .         .         .         .         .       g.83741
 AAAATTTCTGAAAAAGGACTCTCTTTTGAAAACGGAAAAGGCCCAGAGCTGGACTCTGTA       c.3420
 K  I  S  E  K  G  L  S  F  E  N  G  K  G  P  E  L  D  S  V         p.1140

          .         .         .         .         .         .       g.83801
 ATGAACAGTGAGAATGATGAACTCAATGGTGTAAATCAAGTGGTGCCTAAAAAGCGGTGG       c.3480
 M  N  S  E  N  D  E  L  N  G  V  N  Q  V  V  P  K  K  R  W         p.1160

          .         .         .         .         .         .       g.83861
 CAGCGTTTAAACCAAAGGCGCACTAAACCTCGTAAGCGCATGAACAGATTTAAAGAGAAA       c.3540
 Q  R  L  N  Q  R  R  T  K  P  R  K  R  M  N  R  F  K  E  K         p.1180

          .         .         .         .         .         .       g.83921
 GAAAACTCTGAGTGTGCCTTTAGGGTCTTACTTCCTAGTGACCCTGTGCAGGAGGGGCGG       c.3600
 E  N  S  E  C  A  F  R  V  L  L  P  S  D  P  V  Q  E  G  R         p.1200

          .         .         .         .         .         .       g.83981
 GATGAGTTTCCAGAGCATAGAACTCCTTCAGCAAGCATACTTGAGGAACCACTGACAGAG       c.3660
 D  E  F  P  E  H  R  T  P  S  A  S  I  L  E  E  P  L  T  E         p.1220

          .         .         .         .         .         .       g.84041
 CAAAATCATGCTGACTGCTTAGATTCAGCTGGGCCACGGTTAAATGTTTGTGATAAATCC       c.3720
 Q  N  H  A  D  C  L  D  S  A  G  P  R  L  N  V  C  D  K  S         p.1240

          .         .         .         .         .         .       g.84101
 AGTGCCAGCATTGGTGACATGGAAAAGGAGCCAGGAATTCCCAGTTTGACACCACAGGCT       c.3780
 S  A  S  I  G  D  M  E  K  E  P  G  I  P  S  L  T  P  Q  A         p.1260

          .       | 06 .         .         .         .         .    g.107786
 GAGCTCCCTGAACCAG | CTGTGCGGTCAGAGAAGAAACGCCTTAGGAAGCCAAGCAAGTGG    c.3840
 E  L  P  E  P  A |   V  R  S  E  K  K  R  L  R  K  P  S  K  W      p.1280

          .         .         .         .         .         .       g.107846
 CTTTTGGAATATACAGAAGAATATGATCAGATATTTGCTCCTAAGAAAAAACAAAAGAAG       c.3900
 L  L  E  Y  T  E  E  Y  D  Q  I  F  A  P  K  K  K  Q  K  K         p.1300

          .         .  | 07      .         .         .         .    g.110197
 GTACAGGAGCAGGTGCACAAG | GTAAGTTCCCGCTGTGAAGAGGAAAGCCTTCTAGCCCGA    c.3960
 V  Q  E  Q  V  H  K   | V  S  S  R  C  E  E  E  S  L  L  A  R      p.1320

          .         .         .         .         .         .       g.110257
 GGTCGATCTAGTGCTCAGAACAAGCAGGTGGACGAGAATTCTTTGATTTCAACCAAAGAA       c.4020
 G  R  S  S  A  Q  N  K  Q  V  D  E  N  S  L  I  S  T  K  E         p.1340

          .         .         .         .         .         .       g.110317
 GAGCCTCCAGTTCTTGAAAGGGAGGCTCCGTTTTTGGAGGGCCCCTTGGCTCAGTCAGAA       c.4080
 E  P  P  V  L  E  R  E  A  P  F  L  E  G  P  L  A  Q  S  E         p.1360

          .         .         .         .         .         .       g.110377
 CTTGGAGGTGGACATGCTGAGTTGCCGCAGCTGACCTTGTCTGTGCCTGTGGCTCCGGAA       c.4140
 L  G  G  G  H  A  E  L  P  Q  L  T  L  S  V  P  V  A  P  E         p.1380

          .         .         .         .         .   | 08     .    g.111685
 GTCTCTCCACGGCCTGCCCTTGAGTCTGAGGAATTGCTAGTTAAAACGCCAG | GAAATTAT    c.4200
 V  S  P  R  P  A  L  E  S  E  E  L  L  V  K  T  P  G |   N  Y      p.1400

          .         .         .         .         .         .       g.111745
 GAAAGTAAACGTCAAAGAAAACCAACTAAGAAACTTCTTGAATCCAATGATTTAGACCCT       c.4260
 E  S  K  R  Q  R  K  P  T  K  K  L  L  E  S  N  D  L  D  P         p.1420

          .         .         .         .   | 09     .         .    g.116134
 GGATTTATGCCCAAGAAGGGGGACCTTGGCCTTTCTAAAAAG | TGCTATGAAGCTGGTCAC    c.4320
 G  F  M  P  K  K  G  D  L  G  L  S  K  K   | C  Y  E  A  G  H      p.1440

          .         .         .         .         .         | 10    g.118601
 CTGGAGAATGGCATAACTGAATCTTGTGCCACATCTTATTCAAAAGATTTTGGTGGAG | GC    c.4380
 L  E  N  G  I  T  E  S  C  A  T  S  Y  S  K  D  F  G  G  G |       p.1460

          .         .         .         .         .         .       g.118661
 ACTACCAAGATATTTGACAAGCCAAGGAAGCGAAAACGACAGAGGCATGCTGCAGCCAAG       c.4440
 T  T  K  I  F  D  K  P  R  K  R  K  R  Q  R  H  A  A  A  K         p.1480

          .         .         .         .         .        | 11.    g.120105
 ATGCAGTGTAAAAAAGTGAAAAATGATGACTCGTCAAAAGAGATTCCAGGCTCAGAG | GGA    c.4500
 M  Q  C  K  K  V  K  N  D  D  S  S  K  E  I  P  G  S  E   | G      p.1500

          .         .         .         .         .         .       g.120165
 GAACTAATGCCTCACAGGACGGCCACAAGCCCCAAGGAGACTGTTGAGGAAGGTGTAGAA       c.4560
 E  L  M  P  H  R  T  A  T  S  P  K  E  T  V  E  E  G  V  E         p.1520

          .         .         .         .         .         .       g.120225
 CACGATCCCGGGATGCCTGCCTCTAAAAAAATGCAGGGTGAACGCGGTGGAGGAGCTGCA       c.4620
 H  D  P  G  M  P  A  S  K  K  M  Q  G  E  R  G  G  G  A  A         p.1540

          .         .  | 12      .         .         .         .    g.123690
 CTCAAGGAGAATGTCTGTCAG | AATTGTGAAAAATTGGGTGAGCTGCTGTTATGTGAGGCT    c.4680
 L  K  E  N  V  C  Q   | N  C  E  K  L  G  E  L  L  L  C  E  A      p.1560

          .         .         .         .         .         .       g.123750
 CAGTGCTGTGGGGCTTTCCACCTGGAGTGCCTTGGATTGACTGAGATGCCAAGAGGAAAA       c.4740
 Q  C  C  G  A  F  H  L  E  C  L  G  L  T  E  M  P  R  G  K         p.1580

          .         .      | 13  .         .         .         .    g.128907
 TTTATCTGCAATGAATGTCGCACAG | GAATCCATACCTGTTTTGTATGTAAGCAGAGTGGG    c.4800
 F  I  C  N  E  C  R  T  G |   I  H  T  C  F  V  C  K  Q  S  G      p.1600

          .         .         .         .         .         .       g.128967
 GAAGATGTTAAAAGGTGCCTTCTACCCTTGTGTGGAAAGTTTTACCATGAAGAGTGTGTC       c.4860
 E  D  V  K  R  C  L  L  P  L  C  G  K  F  Y  H  E  E  C  V         p.1620

          .         .         .         .         .         .       g.129027
 CAGAAGTACCCACCCACTGTTATGCAGAACAAGGGCTTCCGGTGCTCCCTCCACATCTGT       c.4920
 Q  K  Y  P  P  T  V  M  Q  N  K  G  F  R  C  S  L  H  I  C         p.1640

          .         .         .         .       | 14 .         .    g.131924
 ATAACCTGTCATGCTGCTAATCCAGCCAATGTTTCTGCATCTAAAG | GTCGGTTGATGCGC    c.4980
 I  T  C  H  A  A  N  P  A  N  V  S  A  S  K  G |   R  L  M  R      p.1660

          .         .         .         .         .         .       g.131984
 TGTGTCCGCTGTCCTGTGGCATACCACGCCAATGACTTTTGCCTGGCTGCTGGGTCAAAG       c.5040
 C  V  R  C  P  V  A  Y  H  A  N  D  F  C  L  A  A  G  S  K         p.1680

          .         .         .         .         .         .       g.132044
 ATCCTTGCATCTAATAGTATCATCTGCCCTAATCACTTTACCCCTAGGCGGGGCTGCCGA       c.5100
 I  L  A  S  N  S  I  I  C  P  N  H  F  T  P  R  R  G  C  R         p.1700

          .         .         .         .       | 15 .         .    g.139497
 AATCATGAGCATGTTAATGTTAGCTGGTGCTTTGTGTGCTCAGAAG | GAGGCAGCCTTCTG    c.5160
 N  H  E  H  V  N  V  S  W  C  F  V  C  S  E  G |   G  S  L  L      p.1720

          .         .         .         .         .         .       g.139557
 TGCTGTGATTCTTGCCCTGCTGCTTTTCATCGTGAATGCCTGAACATTGATATCCCTGAA       c.5220
 C  C  D  S  C  P  A  A  F  H  R  E  C  L  N  I  D  I  P  E         p.1740

          .         .         .         .         .         .       g.139617
 GGAAACTGGTATTGCAATGACTGTAAAGCAGGCAAAAAGCCACACTACAGGGAGATTGTC       c.5280
 G  N  W  Y  C  N  D  C  K  A  G  K  K  P  H  Y  R  E  I  V         p.1760

          .         .    | 16    .         .         .         .    g.141560
 TGGGTAAAAGTTGGACGATACAG | GTGGTGGCCAGCTGAGATCTGCCATCCTCGAGCTGTT    c.5340
 W  V  K  V  G  R  Y  R  |  W  W  P  A  E  I  C  H  P  R  A  V      p.1780

          .         .         .         .         .         .       g.141620
 CCTTCCAACATTGATAAGATGAGACATGATGTGGGAGAGTTCCCAGTCCTCTTTTTTGGA       c.5400
 P  S  N  I  D  K  M  R  H  D  V  G  E  F  P  V  L  F  F  G         p.1800

          .         .         .         .         .         .       g.141680
 TCTAATGACTATTTGTGGACTCACCAGGCCCGAGTCTTCCCTTACATGGAGGGTGACGTG       c.5460
 S  N  D  Y  L  W  T  H  Q  A  R  V  F  P  Y  M  E  G  D  V         p.1820

          .         .         .         .          | 17        .    g.145604
 AGCAGCAAGGATAAGATGGGCAAAGGAGTGGATGGGACATATAAAAAAG | CTCTTCAGGAA    c.5520
 S  S  K  D  K  M  G  K  G  V  D  G  T  Y  K  K  A |   L  Q  E      p.1840

          .         .         .         .         .         .       g.145664
 GCTGCAGCAAGGTTTGAGGAATTAAAGGCCCAAAAAGAGCTAAGACAGCTGCAGGAAGAC       c.5580
 A  A  A  R  F  E  E  L  K  A  Q  K  E  L  R  Q  L  Q  E  D         p.1860

          .         .         .         .   | 18     .         .    g.152504
 CGAAAGAATGACAAGAAGCCACCACCTTATAAACATATAAAG | GTAAACCGTCCTATTGGC    c.5640
 R  K  N  D  K  K  P  P  P  Y  K  H  I  K   | V  N  R  P  I  G      p.1880

          .         .         .         .         .         .       g.152564
 AGGGTACAGATCTTCACTGCAGACTTATCTGAAATACCCCGTTGCAACTGTAAAGCTACT       c.5700
 R  V  Q  I  F  T  A  D  L  S  E  I  P  R  C  N  C  K  A  T         p.1900

          .         .         .         .         .         .       g.152624
 GATGAGAACCCCTGTGGGATAGACTCTGAATGCATCAACCGCATGCTGCTCTATGAGTGC       c.5760
 D  E  N  P  C  G  I  D  S  E  C  I  N  R  M  L  L  Y  E  C         p.1920

          .         .         .         .         .         .       g.152684
 CACCCCACAGTGTGTCCTGCCGGAGGGCGCTGTCAAAACCAGTGCTTTTCCAAGCGCCAA       c.5820
 H  P  T  V  C  P  A  G  G  R  C  Q  N  Q  C  F  S  K  R  Q         p.1940

          .         .         .         .         .         .       g.152744
 TATCCAGAGGTTGAAATTTTCCGCACATTACAGCGGGGTTGGGGTCTACGGACAAAAACA       c.5880
 Y  P  E  V  E  I  F  R  T  L  Q  R  G  W  G  L  R  T  K  T         p.1960

          .   | 19     .         .         .         .         .    g.154434
 GATATTAAAAAG | GGTGAATTTGTGAATGAGTATGTGGGTGAGCTTATAGATGAAGAAGAA    c.5940
 D  I  K  K   | G  E  F  V  N  E  Y  V  G  E  L  I  D  E  E  E      p.1980

          .         .         .         .         .         .       g.154494
 TGCAGAGCTCGAATTCGCTATGCTCAAGAACATGATATCACTAATTTCTATATGCTCACC       c.6000
 C  R  A  R  I  R  Y  A  Q  E  H  D  I  T  N  F  Y  M  L  T         p.2000

           | 20        .         .         .         .         .    g.155759
 CTAGACAAA | GACCGAATCATTGATGCTGGTCCCAAAGGAAACTATGCTCGGTTCATGAAT    c.6060
 L  D  K   | D  R  I  I  D  A  G  P  K  G  N  Y  A  R  F  M  N      p.2020

          .         .         .         .         .         .       g.155819
 CATTGCTGCCAGCCCAACTGTGAAACACAGAAGTGGTCTGTGAATGGAGATACCCGTGTA       c.6120
 H  C  C  Q  P  N  C  E  T  Q  K  W  S  V  N  G  D  T  R  V         p.2040

          .         .         .  | 21      .         .         .    g.160769
 GGCCTTTTTGCACTAAGTGACATTAAAGCAG | GCACTGAACTTACCTTCAACTACAACCTA    c.6180
 G  L  F  A  L  S  D  I  K  A  G |   T  E  L  T  F  N  Y  N  L      p.2060

          .         .         .         .         .         .       g.160829
 GAATGTCTTGGGAATGGAAAGACTGTTTGCAAATGTGGAGCCCCGAACTGCAGTGGCTTC       c.6240
 E  C  L  G  N  G  K  T  V  C  K  C  G  A  P  N  C  S  G  F         p.2080

          .         | 22         .         .         .         .    g.163917
 TTGGGTGTAAGGCCAAAG | AATCAACCCATTGCCACGGAAGAAAAGTCAAAGAAATTCAAG    c.6300
 L  G  V  R  P  K   | N  Q  P  I  A  T  E  E  K  S  K  K  F  K      p.2100

          .         .         .         .         .         .       g.163977
 AAGAAGCAACAGGGAAAGCGCAGGACCCAGGGTGAAATCACAAAGGAGCGAGAAGATGAG       c.6360
 K  K  Q  Q  G  K  R  R  T  Q  G  E  I  T  K  E  R  E  D  E         p.2120

          .         .         .         .         .         .       g.164037
 TGTTTTAGTTGTGGGGATGCTGGCCAGCTCGTCTCCTGCAAGAAACCAGGCTGCCCAAAA       c.6420
 C  F  S  C  G  D  A  G  Q  L  V  S  C  K  K  P  G  C  P  K         p.2140

          .         .         .         .    | 23    .         .    g.165770
 GTTTACCACGCAGACTGTCTCAATCTGACCAAGCGACCAGCAG | GGAAATGGGAATGTCCG    c.6480
 V  Y  H  A  D  C  L  N  L  T  K  R  P  A  G |   K  W  E  C  P      p.2160

          .         .         .         .         .         .       g.165830
 TGGCATCAGTGTGACATCTGCGGGAAGGAAGCAGCCTCCTTCTGTGAGATGTGCCCCAGC       c.6540
 W  H  Q  C  D  I  C  G  K  E  A  A  S  F  C  E  M  C  P  S         p.2180

          .         .         .         .         .         .       g.165890
 TCCTTTTGTAAGCAGCATCGAGAAGGGATGCTTTTCATTTCCAAACTGGATGGGCGTCTG       c.6600
 S  F  C  K  Q  H  R  E  G  M  L  F  I  S  K  L  D  G  R  L         p.2200

          .         .         .         .         .         .       g.165950
 TCTTGTACTGAGCATGACCCCTGTGGGCCCAATCCTCTGGAACCTGGGGAGATCCGTGAG       c.6660
 S  C  T  E  H  D  P  C  G  P  N  P  L  E  P  G  E  I  R  E         p.2220

          .         .         .         .         .         .       g.166010
 TATGTGCCTCCCCCAGTACCGCTGCCTCCAGGGCCAAGCACTCACCTGGCAGAGCAATCA       c.6720
 Y  V  P  P  P  V  P  L  P  P  G  P  S  T  H  L  A  E  Q  S         p.2240

          .         .         .         .         .         .       g.166070
 ACAGGAATGGCTGCTCAGGCACCCAAAATGTCAGATAAACCTCCTGCTGACACCAACCAG       c.6780
 T  G  M  A  A  Q  A  P  K  M  S  D  K  P  P  A  D  T  N  Q         p.2260

          .         .         .         .         .         .       g.166130
 ATGCTGTCGCTCTCCAAAAAAGCTCTGGCAGGGACTTGTCAGAGGCCATTGCTACCTGAA       c.6840
 M  L  S  L  S  K  K  A  L  A  G  T  C  Q  R  P  L  L  P  E         p.2280

          .         .         .         .         .         .       g.166190
 AGACCTCTTGAGAGAACTGACTCCAGGCCCCAGCCTTTAGATAAGGTCAGAGACCTCGCT       c.6900
 R  P  L  E  R  T  D  S  R  P  Q  P  L  D  K  V  R  D  L  A         p.2300

          .         .         .         .         .         .       g.166250
 GGGTCAGGGACCAAATCCCAATCCTTGGTTTCCAGCCAGAGGCCACTGGACAGGCCACCA       c.6960
 G  S  G  T  K  S  Q  S  L  V  S  S  Q  R  P  L  D  R  P  P         p.2320

          .         .         .         .         .         .       g.166310
 GCAGTGGCAGGACCAAGACCCCAGCTAAGCGACAAACCCTCTCCAGTGACCAGCCCAAGC       c.7020
 A  V  A  G  P  R  P  Q  L  S  D  K  P  S  P  V  T  S  P  S         p.2340

          .         .         .         .         .         .       g.166370
 TCCTCACCCTCAGTCAGGTCCCAACCACTGGAAAGACCTCTGGGGACGGCTGACCCAAGG       c.7080
 S  S  P  S  V  R  S  Q  P  L  E  R  P  L  G  T  A  D  P  R         p.2360

          .         .         .         .         .         .       g.166430
 CTGGATAAATCCATAGGTGCTGCCAGCCCAAGGCCCCAGTCACTGGAGAAAACCTCAGTT       c.7140
 L  D  K  S  I  G  A  A  S  P  R  P  Q  S  L  E  K  T  S  V         p.2380

          .         .         .         .         .         .       g.166490
 CCCACTGGCCTGAGACTTCCGCCGCCAGACAGACTGCTCATTACTAGCAGTCCCAAACCC       c.7200
 P  T  G  L  R  L  P  P  P  D  R  L  L  I  T  S  S  P  K  P         p.2400

          .         .         .         .         .         .       g.166550
 CAGACTTCAGACAGGCCTACTGACAAACCCCATGCCTCTTTGTCCCAGAGACTCCCACCT       c.7260
 Q  T  S  D  R  P  T  D  K  P  H  A  S  L  S  Q  R  L  P  P         p.2420

          .         .         .         .         .         .       g.166610
 CCTGAGAAAGTACTATCAGCTGTGGTCCAGACCCTTGTAGCTAAAGAAAAAGCACTGAGG       c.7320
 P  E  K  V  L  S  A  V  V  Q  T  L  V  A  K  E  K  A  L  R         p.2440

          .         .         .         .         .         .       g.166670
 CCTGTGGACCAGAATACTCAGTCAAAAAATAGAGCTGCTTTGGTGATGGATCTCATAGAC       c.7380
 P  V  D  Q  N  T  Q  S  K  N  R  A  A  L  V  M  D  L  I  D         p.2460

          .         .         .         .         .         .       g.166730
 CTAACTCCTCGCCAGAAGGAGCGGGCAGCTTCACCTCATCAGGTCACACCACAGGCTGAT       c.7440
 L  T  P  R  Q  K  E  R  A  A  S  P  H  Q  V  T  P  Q  A  D         p.2480

          .         .         .         .         .         .       g.166790
 GAGAAGATGCCAGTGTTGGAGTCAAGTTCATGGCCTGCCAGCAAAGGTCTGGGGCATATG       c.7500
 E  K  M  P  V  L  E  S  S  S  W  P  A  S  K  G  L  G  H  M         p.2500

          .         .         .         .         .         .       g.166850
 CCGAGAGCTGTTGAGAAAGGCTGTGTGTCAGATCCTCTTCAGACATCTGGGAAAGCAGCA       c.7560
 P  R  A  V  E  K  G  C  V  S  D  P  L  Q  T  S  G  K  A  A         p.2520

          .         .         .         .         .         .       g.166910
 GCCCCTTCAGAGGACCCCTGGCAAGCTGTTAAATCACTCACCCAGGCCAGACTTCTTTCT       c.7620
 A  P  S  E  D  P  W  Q  A  V  K  S  L  T  Q  A  R  L  L  S         p.2540

          .         .         .         .         .         .       g.166970
 CAGCCTCCTGCCAAGGCCTTTTTATATGAGCCAACAACTCAGGCCTCAGGAAGAGCTTCT       c.7680
 Q  P  P  A  K  A  F  L  Y  E  P  T  T  Q  A  S  G  R  A  S         p.2560

          .         .         .         .         .         .       g.167030
 GCAGGGGCTGAGCAGACCCCAGGGCCTCTTAGCCAATCCCCGGGCCTGGTGAAGCAGGCG       c.7740
 A  G  A  E  Q  T  P  G  P  L  S  Q  S  P  G  L  V  K  Q  A         p.2580

          .         .         .         .         .         .       g.167090
 AAGCAGATGGTCGGAGGCCAGCAACTACCTGCACTTGCCGCCAAGAGTGGGCAATCTTTT       c.7800
 K  Q  M  V  G  G  Q  Q  L  P  A  L  A  A  K  S  G  Q  S  F         p.2600

          .         .         .         .         .         .       g.167150
 AGGTCTCTCGGGAAGGCCCCAGCCTCCCTCCCCACTGAAGAAAAGAAGTTGGTAACCACA       c.7860
 R  S  L  G  K  A  P  A  S  L  P  T  E  E  K  K  L  V  T  T         p.2620

          .         .         .         .         .         .       g.167210
 GAGCAAAGTCCCTGGGCCCTGGGAAAAGCCTCATCACGGGCAGGGCTCTGGCCCATAGTG       c.7920
 E  Q  S  P  W  A  L  G  K  A  S  S  R  A  G  L  W  P  I  V         p.2640

          .         .         .         .         .         .       g.167270
 GCTGGACAGACACTGGCACAGTCTTGCTGGTCTGCTGGGAGCACACAGACATTGGCACAG       c.7980
 A  G  Q  T  L  A  Q  S  C  W  S  A  G  S  T  Q  T  L  A  Q         p.2660

          .         .         .         .         .         .       g.167330
 ACTTGCTGGTCTCTTGGAAGAGGGCAAGACCCCAAACCAGAGCAAAATACACTTCCAGCT       c.8040
 T  C  W  S  L  G  R  G  Q  D  P  K  P  E  Q  N  T  L  P  A         p.2680

          .         .         .         .         .                 g.167381
 CTTAACCAGGCTCCTTCCAGTCACAAGTGTGCAGAATCAGAACAGAAGTAG                c.8091
 L  N  Q  A  P  S  S  H  K  C  A  E  S  E  Q  K  X                  p.2696

          .         .         .         .         .         .       g.167441
 taccaatcaatgtcacatgaacaaacaagctgcccccagggtaccatttggggaggggaa       c.*60

          .         .         .         .         .         .       g.167501
 atcttttctttctttcccccttaaaaaaaaacacatctgccccgaacactttcccactgt       c.*120

          .         .         .         .         .         .       g.167561
 tattctttcctcatatcccaacactcagaactcttgtgacattagccagtgggggcttat       c.*180

          .         .         .         .         .         .       g.167621
 ggttgtgtgaaccatgtatgaaaatccagtgggccccaaccaaggagacagacagacttg       c.*240

          .         .         .         .         .         .       g.167681
 ggtctctttcccccaacttttccacatggtcatcgtgaaataaaaagtccactctggagt       c.*300

          .         .         .         .         .         .       g.167741
 caagtatggaattcaattccgctggtcaggttggaaggtataggggctctcaaagcgatt       c.*360

          .         .         .         .         .         .       g.167801
 tccccaaccagacagagccccattgagggcacctaggaacccttgggaggaaatggtgtt       c.*420

          .         .         .         .         .         .       g.167861
 ctttcaaatcagtggcgatttcctgagcattcacgtgttctaggccgggtgctagtcact       c.*480

          .         .         .         .         .         .       g.167921
 gatgagagatacaggcctcatccctgtgagcctggattccaaggctttcaggaacctttg       c.*540

          .         .         .         .         .         .       g.167981
 accaggaagtaacaggaagttctgaggggccctggggctttagactcattttgaaatgtc       c.*600

          .         .         .         .         .         .       g.168041
 ctttgtggcaccagaagtggttgtgttgaggaagtgtctcttggctgcggtgtgcatggg       c.*660

          .         .         .         .         .         .       g.168101
 tgcgtgtgcatgcgcgcacactcacagaggtctcctctatagatgcaagggtgctgcatt       c.*720

          .         .         .         .         .         .       g.168161
 gaggccagcaaggctgttggctgtggggtcgccgctgctgcttttgtctgggctgtgcag       c.*780

          .         .         .         .         .         .       g.168221
 agtctcaagatcagtccttggaggagcaggtggtcaggggcagtcgggctctgtgcgaat       c.*840

          .         .         .         .         .         .       g.168281
 gtagatttccagcagtggaagaaggcatttggcaagcttctctttctttgcttttgtttc       c.*900

          .         .         .         .         .         .       g.168341
 tacctatttttctctttgtacatgaatccaccccatccctatttccctaaaacactcagg       c.*960

          .         .         .         .         .         .       g.168401
 tgctttcagatttcagagcctcgggcagtggacatagggaatctctggcaagctctgagc       c.*1020

          .         .         .         .         .         .       g.168461
 tagacacaccagcttcaggaagagtaccagatcctgatgggaaatttcttttccccattc       c.*1080

          .         .         .         .         .         .       g.168521
 cttttccctcctgagtggagggagtcctcttcttcgcctccctgagaattgctgtgctct       c.*1140

          .         .         .         .         .         .       g.168581
 gtattgagagcacctgcctgctgacttagctcaaaggcaagccagaacccttccctgaag       c.*1200

          .         .         .         .         .         .       g.168641
 actggcaagaggtggtgtttagagcaacgtccaggctaagagatgactcctattaactgc       c.*1260

          .         .         .         .         .         .       g.168701
 tgattatctgttactgctgccctgagctggggcccaagggctgggaaatctgttggtgct       c.*1320

          .         .         .         .         .         .       g.168761
 accctgccctaccattcacccagctcacagactgccaacaggaagtgctgtttggctagt       c.*1380

          .         .         .         .         .         .       g.168821
 ttcctcccacttgtctacccctcctttgtccttagaccaacatgtttacctctctgcttt       c.*1440

          .         .         .         .         .         .       g.168881
 gccaacttagccagcaggccatccccggccctaacgtctcctggccattatctcttagtt       c.*1500

          .         .         .         .         .         .       g.168941
 atggctttcacgctctcaataggattctgtatttggtcccaatttcctcaagttcttatt       c.*1560

          .         .         .         .         .         .       g.169001
 gaggttactcccatcaattccacggagggaacagtagttattatagaagcatttgcgctt       c.*1620

          .         .         .         .         .         .       g.169061
 tatctaaagattaaaaatagaatctgcttttatttcccaaagtctgtctctgaggttgag       c.*1680

          .         .         .         .         .         .       g.169121
 acacttgaactcaggcagagggacgaggctgggcagggctgtcctgagtttaggggccta       c.*1740

          .         .         .         .         .         .       g.169181
 tccctgcatttcactgagacctcggaatctcctctgtgaattccacctgcctagttctcc       c.*1800

          .         .         .         .         .         .       g.169241
 cctttcatcctctctctcttcccacatcatcaaagaggaaaagctctttgttcaaaagga       c.*1860

          .         .         .         .         .         .       g.169301
 agagaaaacgtaaagcatcttattttcttttaaaagaattttaaaccatgaaaaagatat       c.*1920

          .         .         .         .         .         .       g.169361
 ttttaaagaaattcaccgagaacattaaagttcattatattaagtatttatcatgtgtga       c.*1980

          .         .         .         .         .         .       g.169421
 gaataataaatatataactgcagctagtaggtccctttccctaatcttttaggtcatatg       c.*2040

          .         .         .         .         .         .       g.169481
 agtagggtttgcttggtgccagtcctgtgcccttttctctccagtcatctgtagttgtga       c.*2100

          .         .         .         .         .         .       g.169541
 tcagaaaaaggtatctgcactgcactgtcagagtctcctttcactatgttgtgtgttaaa       c.*2160

          .         .         .         .         .         .       g.169601
 ttaccgtagctctttgtttcatgaaataaactgtgaatttggggggggcggggggagggc       c.*2220

          .         .         .         .         .         .       g.169661
 gtgcaggccatgtaaaaattttccgtggagaagtttgattctaaagtagcttctctaaag       c.*2280

          .         .         .         .         .         .       g.169721
 taggctttggtaggtaatcaacttgacagcagtctagatgtctcacaggacaggagggag       c.*2340

          .         .         .         .         .         .       g.169781
 tgagggaaaggggccatgattggctgctttgtggttttattttggttctttccattctcc       c.*2400

          .         .         .         .         .         .       g.169841
 gccattcattggaggcttcgttccagacctgcctgggaaaacagcttctgagccattttg       c.*2460

          .         .         .         .         .         .       g.169901
 gggagcagttcttcatctgaatggatggacatctgggcttccttcaagggccattgaatg       c.*2520

          .         .         .         .         .         .       g.169961
 ggaactagaaaaccactggaaactagaaatttgagctattgggcccaccagtagcagcat       c.*2580

          .         .         .         .         .         .       g.170021
 gtgatactagatggttaaaatcatgaaagcagtcactatccaattagaagcagagtcaca       c.*2640

          .         .         .         .         .         .       g.170081
 acaactgttgggaaatgtgactcttggaggaaggtggggagggagtggccttgccagccc       c.*2700

          .         .         .         .         .         .       g.170141
 tgtgggacgtcccctgaagtttgtaataagaccccttttccaaagggatgtgaattggag       c.*2760

          .         .         .         .         .         .       g.170201
 tgaaaaggaaatctttcatcttagaaaacttctggtccttaacgcagggtggtatttggg       c.*2820

          .         .         .         .         .         .       g.170261
 tatgtgcttggaaattgagatctcaagagtgtttgccttggagccagctccccaggaggc       c.*2880

          .         .         .         .         .         .       g.170321
 cttttccagggacaaggcaaaagttgaaattctccatgggtagctagaaagccaatacat       c.*2940

          .         .         .         .         .         .       g.170381
 ctagccctgctaagtcagaaaaagattatgaaaaatgttgaaatttacattcaaagcctc       c.*3000

          .         .         .         .         .         .       g.170441
 atttgcttatcttgctggagccaacccagtctaatagcaaaatagctgtcattgatacag       c.*3060

          .         .         .         .         .         .       g.170501
 aaacatcctcatttttaaatgtctgctttaccctgttactgagtttgagatgacttaaat       c.*3120

          .         .         .         .         .         .       g.170561
 cactgtgttgaccctcttctgaaccaaatctttagcattgatgaaaatagttattttatt       c.*3180

          .         .         .         .         .         .       g.170621
 ctttacatccttcaccccacactatggtcagggcatgaaacaccctgttgatcccttccc       c.*3240

          .         .         .         .         .         .       g.170681
 aggctcggcactgtctgctcactggagccggactcccaggttgtaattctaatgttgcct       c.*3300

          .         .         .         .         .         .       g.170741
 catgagaacagaatggcagaaagtttagtcctgacagattcccccatagggagtaatgag       c.*3360

          .         .         .         .         .         .       g.170801
 gacagcatgaaacttggataggttttacccttagtccctataaggtggattttactaagg       c.*3420

          .         .         .         .         .         .       g.170861
 ttttttaaatgatactgtcatcctcttggggtttatcagccaggttagaggagcccagtg       c.*3480

          .         .         .         .         .         .       g.170921
 tcctaacctctctcagatcatggcagagaaggagctgcctccagcccctttcttgctgag       c.*3540

          .         .         .         .         .         .       g.170981
 tttcatttgagcagttccatgtgtagacattccaagtcactgcttggtagttgctgtggg       c.*3600

          .         .         .         .         .         .       g.171041
 agcctgtcattggctatggccagttagttctcagctgagcttcctagggccagtgcaaca       c.*3660

          .         .         .         .         .         .       g.171101
 gggccagaggctgctatagtgtaaattgaaataagaatagatcattgttttgtacacaca       c.*3720

          .         .         .         .         .         .       g.171161
 cacaataaaatgtaatgatggtgctaatttcacggtataaataagcactgccaagggttg       c.*3780

          .         .         .         .         .         .       g.171221
 agggactggcagctcaagaaacccgggttcctgtttgggaggagattttatgtagaaaag       c.*3840

          .         .         .         .         .         .       g.171281
 tttgaggctttgttaaaagtggggagaaggaagatcctcagtgaagcctgcacccaaccc       c.*3900

          .         .         .         .         .         .       g.171341
 tggagtggcccagtgcaatccagaggtggaagagatcctatatccaggtgaaggtggcca       c.*3960

          .         .         .         .         .         .       g.171401
 ttgagtttctcagggctggggccaccttgtccatagcctccgtccacgctgcctggagca       c.*4020

          .         .         .         .         .         .       g.171461
 ggttgttagagagctctggttgttgggtcttcctcagctcccttctgcccctctctacct       c.*4080

          .         .         .         .         .         .       g.171521
 cttccactcatggaagcccctctactgcttatgaagattaagggtagtattttctaagga       c.*4140

          .         .         .         .         .         .       g.171581
 agtggaaagaattaaactagaaatccacaacctcggaagaagtgtttcgagtttaacatg       c.*4200

          .         .         .         .         .         .       g.171641
 cgctgtttctgcttatgtggttccttctctagagctgctttcccatggctttcaaaacat       c.*4260

          .         .         .         .         .         .       g.171701
 caggttattgtggggcttcaggtgtaaggtcctggaagttcagcaaagtttcgtggacaa       c.*4320

          .         .         .         .         .         .       g.171761
 gacatgggcacagagagtagaagcagaaataaatggttctatgttttcaacttccagggt       c.*4380

          .         .         .         .         .         .       g.171821
 tggggcaggccagagcaaggcggtctcatcgaggtgggtgctacctgtgtgtgtgtagat       c.*4440

          .         .         .         .         .         .       g.171881
 gagtgtgctgaaggtggggagggcagcacacagcagctcatggcagagccgcctcctagg       c.*4500

          .         .         .         .         .         .       g.171941
 tcttggcaaagaggcaagctgacgatagacatctacctatattgttaagaaaggggtcgg       c.*4560

          .         .         .         .         .         .       g.172001
 ggggatcagccaaggtccatcattgcttttttgccgcgcccccccccccccgcccccata       c.*4620

          .         .         .         .         .         .       g.172061
 gattgtcagctgtaagtgaaactcctagtgaaaaagaggggagccctgtgttaggagtcc       c.*4680

          .         .         .         .         .         .       g.172121
 ccataaacatgtactgtaattctttgtatatagaaaaaaaatttactgtaaagtaaagtt       c.*4740

          .                                                         g.172135
 taactttactcata                                                     c.*4754

 (downstream sequence)
Legend:
Nucleotide numbering (following the rules of the HGVS for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The Nuclear receptor binding SET domain protein 1 protein sequence is shown below the coding DNA sequence, with numbering indicated at the right starting with 1 for the translation initiating Methionine. Every 10th amino acid is shown in bold. The position of introns is indicated by a vertical line, splitting the two exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frame shift variants, all stop codons in the +1 frame are shown in bold while all stop codons in the +2 frame are underlined.

Powered by LOVD v.3.0 Build 17
©2004-2016 Leiden University Medical Center