SON DNA binding protein (SON) - coding DNA reference sequence

(used for variant description)

(last modified October 6, 2016)


This file was created to facilitate the description of sequence variants on transcript NM_138927.2 in the SON gene based on a coding DNA reference sequence following the HGVS recommendations.

The sequence was taken from NC_000021.8, covering SON transcript NM_138927.2.


 (upstream sequence)
           .         .         .         .         .                g.5049
      atgctgggagcctggaggactagcgaggaggagttgagagaacggagcggacgcc       c.-1

          .         .         .         .         .         .       g.5109
 ATGGCGACCAACATCGAGCAGATTTTTAGGTCTTTCGTGGTCAGTAAATTCCGGGAAATT       c.60
 M  A  T  N  I  E  Q  I  F  R  S  F  V  V  S  K  F  R  E  I         p.20

          .        | 02.         .         .         .         .    g.8212
 CAACAGGAGCTTTCCAG | TGGAAGGAATGAAGGCCAGCTGAATGGTGAAACAAATACACCC    c.120
 Q  Q  E  L  S  S  |  G  R  N  E  G  Q  L  N  G  E  T  N  T  P      p.40

          .         .         .         .         .         .       g.8272
 ATTGAAGGAAACCAGGCGGGTGATGCAGCTGCCTCTGCCAGGAGTCTACCAAATGAAGAA       c.180
 I  E  G  N  Q  A  G  D  A  A  A  S  A  R  S  L  P  N  E  E         p.60

          .         .         .         .         .         .       g.8332
 ATAGTGCAGAAGATAGAGGAAGTACTTTCTGGGGTCTTAGATACAGAACTACGATATAAG       c.240
 I  V  Q  K  I  E  E  V  L  S  G  V  L  D  T  E  L  R  Y  K         p.80

      | 03   .         .         .         .         .         .    g.11488
 CCAG | ACTTGAAAGAGGGCTCCAGAAAAAGTAGATGCGTATCTGTACAAACAGATCCTACT    c.300
 P  D |   L  K  E  G  S  R  K  S  R  C  V  S  V  Q  T  D  P  T      p.100

          .         .         .         .         .         .       g.11548
 GATGAAATTCCCACTAAAAAGTCAAAGAAGCATAAAAAGCACAAAAACAAAAAGAAGAAA       c.360
 D  E  I  P  T  K  K  S  K  K  H  K  K  H  K  N  K  K  K  K         p.120

          .         .         .         .         .         .       g.11608
 AAGAAGAAAGAAAAGGAAAAAAAATATAAAAGACAGCCAGAAGAATCTGAGTCAAAGACG       c.420
 K  K  K  E  K  E  K  K  Y  K  R  Q  P  E  E  S  E  S  K  T         p.140

          .         .         .         .         .         .       g.11668
 AAATCTCATGATGATGGGAACATAGATTTAGAATCTGATTCCTTTTTAAAGTTTGATTCT       c.480
 K  S  H  D  D  G  N  I  D  L  E  S  D  S  F  L  K  F  D  S         p.160

          .         .         .         .         .         .       g.11728
 GAACCTTCAGCTGTGGCGCTGGAGCTTCCTACAAGAGCATTTGGCCCATCTGAGACCAAT       c.540
 E  P  S  A  V  A  L  E  L  P  T  R  A  F  G  P  S  E  T  N         p.180

          .         .         .         .         .         .       g.11788
 GAATCCCCTGCAGTTGTGCTAGAACCTCCTGTAGTATCAATGGAGGTATCAGAGCCACAC       c.600
 E  S  P  A  V  V  L  E  P  P  V  V  S  M  E  V  S  E  P  H         p.200

          .         .         .         .         .         .       g.11848
 ATCTTAGAAACTCTGAAGCCAGCTACAAAAACTGCAGAACTGTCAGTTGTATCTACATCA       c.660
 I  L  E  T  L  K  P  A  T  K  T  A  E  L  S  V  V  S  T  S         p.220

          .         .         .         .         .         .       g.11908
 GTAATCTCAGAGCAGTCAGAGCAGTCTGTGGCAGTAATGCCAGAACCATCCATGACAAAG       c.720
 V  I  S  E  Q  S  E  Q  S  V  A  V  M  P  E  P  S  M  T  K         p.240

          .         .         .         .         .         .       g.11968
 ATTCTGGATTCCTTTGCAGCAGCACCAGTGCCTACTACAACACTGGTGTTGAAGTCATCT       c.780
 I  L  D  S  F  A  A  A  P  V  P  T  T  T  L  V  L  K  S  S         p.260

          .         .         .         .         .         .       g.12028
 GAGCCAGTTGTAACAATGTCAGTGGAGTATCAGATGAAGTCTGTGCTGAAATCTGTGGAG       c.840
 E  P  V  V  T  M  S  V  E  Y  Q  M  K  S  V  L  K  S  V  E         p.280

          .         .         .         .         .         .       g.12088
 AGCACATCTCCAGAGCCATCAAAGATCATGTTGGTAGAGCCCCCAGTAGCAAAAGTGTTA       c.900
 S  T  S  P  E  P  S  K  I  M  L  V  E  P  P  V  A  K  V  L         p.300

          .         .         .         .         .         .       g.12148
 GAGCCTTCAGAAACCCTTGTGGTATCATCAGAGACACCTACTGAGGTGTACCCTGAGCCA       c.960
 E  P  S  E  T  L  V  V  S  S  E  T  P  T  E  V  Y  P  E  P         p.320

          .         .         .         .         .         .       g.12208
 AGCACATCAACAACAATGGATTTTCCAGAGTCATCTGCAATTGAAGCGCTAAGATTGCCA       c.1020
 S  T  S  T  T  M  D  F  P  E  S  S  A  I  E  A  L  R  L  P         p.340

          .         .         .         .         .         .       g.12268
 GAGCAGCCTGTAGACGTACCATCGGAGATTGCAGATTCATCCATGACAAGACCGCAGGAG       c.1080
 E  Q  P  V  D  V  P  S  E  I  A  D  S  S  M  T  R  P  Q  E         p.360

          .         .         .         .         .         .       g.12328
 TTGCCGGAGCTGCCTAAGACCACAGCGTTGGAGCTGCAGGAGTCGTCGGTGGCCTCAGCG       c.1140
 L  P  E  L  P  K  T  T  A  L  E  L  Q  E  S  S  V  A  S  A         p.380

          .         .         .         .         .         .       g.12388
 ATGGAGTTGCCGGGGCCACCTGCGACCTCCATGCCGGAGTTGCAGGGGCCCCCTGTGACT       c.1200
 M  E  L  P  G  P  P  A  T  S  M  P  E  L  Q  G  P  P  V  T         p.400

          .         .         .         .         .         .       g.12448
 CCAGTGCTGGAGTTACCTGGGCCCTCTGCTACCCCGGTGCCAGAGTTGCCAGGGCCCCTT       c.1260
 P  V  L  E  L  P  G  P  S  A  T  P  V  P  E  L  P  G  P  L         p.420

          .         .         .         .         .         .       g.12508
 TCTACCCCAGTGCCTGAGTTGCCAGGGCCCCCTGCGACAGCAGTGCCTGAGTTGCCAGGG       c.1320
 S  T  P  V  P  E  L  P  G  P  P  A  T  A  V  P  E  L  P  G         p.440

          .         .         .         .         .         .       g.12568
 CCCTCTGTGACACCAGTGCCACAGTTGTCGCAGGAATTGCCAGGGCTTCCAGCACCATCC       c.1380
 P  S  V  T  P  V  P  Q  L  S  Q  E  L  P  G  L  P  A  P  S         p.460

          .         .         .         .         .         .       g.12628
 ATGGGGTTGGAGCCACCACAGGAGGTACCAGAGCCACCTGTGATGGCACAGGAGTTGCCA       c.1440
 M  G  L  E  P  P  Q  E  V  P  E  P  P  V  M  A  Q  E  L  P         p.480

          .         .         .         .         .         .       g.12688
 GGGCTGCCTTTGGTGACAGCAGCAGTAGAGTTGCCAGAGCAGCCTGCGGTAACAGTAGCA       c.1500
 G  L  P  L  V  T  A  A  V  E  L  P  E  Q  P  A  V  T  V  A         p.500

          .         .         .         .         .         .       g.12748
 ATGGAGTTGACCGAACAACCTGTGACGACGACAGAGTTGGAGCAGCCTGTGGGGATGACA       c.1560
 M  E  L  T  E  Q  P  V  T  T  T  E  L  E  Q  P  V  G  M  T         p.520

          .         .         .         .         .         .       g.12808
 ACGGTGGAACATCCTGGGCATCCTGAGGTGACAACGGCAACAGGGTTGCTGGGGCAGCCT       c.1620
 T  V  E  H  P  G  H  P  E  V  T  T  A  T  G  L  L  G  Q  P         p.540

          .         .         .         .         .         .       g.12868
 GAGGCAACGATGGTGCTGGAGTTGCCAGGACAGCCAGTGGCAACGACAGCGCTGGAGTTG       c.1680
 E  A  T  M  V  L  E  L  P  G  Q  P  V  A  T  T  A  L  E  L         p.560

          .         .         .         .         .         .       g.12928
 CCGGGGCAGCCTTCGGTGACTGGGGTGCCAGAGTTGCCAGGGCTGCCTTCGGCAACTAGG       c.1740
 P  G  Q  P  S  V  T  G  V  P  E  L  P  G  L  P  S  A  T  R         p.580

          .         .         .         .         .         .       g.12988
 GCACTGGAGTTGTCGGGGCAGCCTGTGGCAACTGGGGCACTAGAGTTGCCTGGGCCGCTC       c.1800
 A  L  E  L  S  G  Q  P  V  A  T  G  A  L  E  L  P  G  P  L         p.600

          .         .         .         .         .         .       g.13048
 ATGGCAGCTGGGGCACTGGAGTTCTCGGGGCAGTCTGGGGCAGCTGGAGCACTGGAGCTT       c.1860
 M  A  A  G  A  L  E  F  S  G  Q  S  G  A  A  G  A  L  E  L         p.620

          .         .         .         .         .         .       g.13108
 TTGGGGCAGCCTCTGGCAACAGGGGTGCTGGAGTTGCCAGGGCAGCCTGGGGCGCCAGAG       c.1920
 L  G  Q  P  L  A  T  G  V  L  E  L  P  G  Q  P  G  A  P  E         p.640

          .         .         .         .         .         .       g.13168
 TTGCCTGGGCAGCCTGTGGCAACTGTGGCGCTGGAGATCTCTGTTCAGTCTGTGGTGACA       c.1980
 L  P  G  Q  P  V  A  T  V  A  L  E  I  S  V  Q  S  V  V  T         p.660

          .         .         .         .         .         .       g.13228
 ACATCGGAGCTGTCAACGATGACCGTGTCGCAGTCCCTGGAGGTGCCCTCGACGACAGCG       c.2040
 T  S  E  L  S  T  M  T  V  S  Q  S  L  E  V  P  S  T  T  A         p.680

          .         .         .         .         .         .       g.13288
 CTGGAATCCTATAATACGGTAGCACAGGAGCTGCCTACTACATTAGTGGGGGAGACTTCT       c.2100
 L  E  S  Y  N  T  V  A  Q  E  L  P  T  T  L  V  G  E  T  S         p.700

          .         .         .         .         .         .       g.13348
 GTAACAGTAGGAGTGGATCCCTTGATGGCCCCAGAATCCCATATATTAGCTTCTAACACC       c.2160
 V  T  V  G  V  D  P  L  M  A  P  E  S  H  I  L  A  S  N  T         p.720

          .         .         .         .         .         .       g.13408
 ATGGAGACCCATATATTAGCATCCAACACCATGGACTCCCAAATGCTAGCGTCCAACACC       c.2220
 M  E  T  H  I  L  A  S  N  T  M  D  S  Q  M  L  A  S  N  T         p.740

          .         .         .         .         .         .       g.13468
 ATGGACTCCCAGATGCTAGCATCCAACACCATGGACTCCCAGATGTTAGCGTCTAGCACC       c.2280
 M  D  S  Q  M  L  A  S  N  T  M  D  S  Q  M  L  A  S  S  T         p.760

          .         .         .         .         .         .       g.13528
 ATGGACTCCCAGATGTTAGCAACTAGCTCCATGGACTCCCAGATGTTAGCAACTAGCTCC       c.2340
 M  D  S  Q  M  L  A  T  S  S  M  D  S  Q  M  L  A  T  S  S         p.780

          .         .         .         .         .         .       g.13588
 ATGGACTCCCAGATGTTAGCAACTAGCACTATGGACTCCCAGATGTTAGCAACCAGTTCC       c.2400
 M  D  S  Q  M  L  A  T  S  T  M  D  S  Q  M  L  A  T  S  S         p.800

          .         .         .         .         .         .       g.13648
 ATGGACTCCCAGATGTTAGCAACCAGCTCCATGGACTCCCAGATGTTAGCAACCAGCTCC       c.2460
 M  D  S  Q  M  L  A  T  S  S  M  D  S  Q  M  L  A  T  S  S         p.820

          .         .         .         .         .         .       g.13708
 ATGGACTCCCAGATGTTAGCAACCAGCTCCATGGACTCCCAGATGTTAGCAACCAGCACC       c.2520
 M  D  S  Q  M  L  A  T  S  S  M  D  S  Q  M  L  A  T  S  T         p.840

          .         .         .         .         .         .       g.13768
 ATGGATTCTCAGATGTTAGCAACCAGCACCATGGACTCCCAGATGTTAGCAACTAGCTCA       c.2580
 M  D  S  Q  M  L  A  T  S  T  M  D  S  Q  M  L  A  T  S  S         p.860

          .         .         .         .         .         .       g.13828
 ATGGATTCCCAGATGTTAGCATCTGGCACTATGGACTCTCAAATGTTAGCTTCTGGCACC       c.2640
 M  D  S  Q  M  L  A  S  G  T  M  D  S  Q  M  L  A  S  G  T         p.880

          .         .         .         .         .         .       g.13888
 ATGGATGCTCAGATGTTAGCGTCTGGTACCATGGATGCCCAGATGTTAGCGTCTAGTACC       c.2700
 M  D  A  Q  M  L  A  S  G  T  M  D  A  Q  M  L  A  S  S  T         p.900

          .         .         .         .         .         .       g.13948
 CAAGATTCTGCTATGTTGGGTTCAAAATCTCCTGATCCCTATAGGTTAGCTCAGGATCCT       c.2760
 Q  D  S  A  M  L  G  S  K  S  P  D  P  Y  R  L  A  Q  D  P         p.920

          .         .         .         .         .         .       g.14008
 TACAGGTTAGCTCAGGATCCCTATAGGTTGGGCCATGACCCCTATAGATTAGGTCATGAT       c.2820
 Y  R  L  A  Q  D  P  Y  R  L  G  H  D  P  Y  R  L  G  H  D         p.940

          .         .         .         .         .         .       g.14068
 GCTTACAGGTTAGGACAAGACCCTTATAGATTAGGCCATGATCCCTACAGACTAACTCCT       c.2880
 A  Y  R  L  G  Q  D  P  Y  R  L  G  H  D  P  Y  R  L  T  P         p.960

          .         .         .         .         .         .       g.14128
 GATCCCTATAGGATGTCACCTAGACCCTACAGGATAGCACCCAGGTCCTATAGAATAGCA       c.2940
 D  P  Y  R  M  S  P  R  P  Y  R  I  A  P  R  S  Y  R  I  A         p.980

          .         .         .         .         .         .       g.14188
 CCCAGGCCATATAGGTTAGCACCTAGACCCCTGATGTTAGCATCTAGACGTTCTATGATG       c.3000
 P  R  P  Y  R  L  A  P  R  P  L  M  L  A  S  R  R  S  M  M         p.1000

          .         .         .         .         .         .       g.14248
 ATGTCCTATGCTGCAGAACGTTCCATGATGTCATCTTACGAACGCTCTATGATGTCTTAT       c.3060
 M  S  Y  A  A  E  R  S  M  M  S  S  Y  E  R  S  M  M  S  Y         p.1020

          .         .         .         .         .         .       g.14308
 GAGCGGTCTATGATGTCCCCTATGGCTGAACGCTCTATGATGTCAGCCTACGAGCGCTCT       c.3120
 E  R  S  M  M  S  P  M  A  E  R  S  M  M  S  A  Y  E  R  S         p.1040

          .         .         .         .         .         .       g.14368
 ATGATGTCAGCCTACGAGCGCTCTATGATGTCCCCTATGGCTGAGCGCTCTATGATGTCA       c.3180
 M  M  S  A  Y  E  R  S  M  M  S  P  M  A  E  R  S  M  M  S         p.1060

          .         .         .         .         .         .       g.14428
 GCTTATGAACGCTCCATGATGTCAGCTTATGAACGCTCCATGATGTCCCCAATGGCTGAT       c.3240
 A  Y  E  R  S  M  M  S  A  Y  E  R  S  M  M  S  P  M  A  D         p.1080

          .         .         .         .         .         .       g.14488
 CGATCTATGATGTCCATGGGTGCTGACCGGTCTATGATGTCGTCATACTCTGCTGCTGAC       c.3300
 R  S  M  M  S  M  G  A  D  R  S  M  M  S  S  Y  S  A  A  D         p.1100

          .         .         .         .         .         .       g.14548
 CGGTCTATGATGTCATCGTACTCTGCAGCTGACCGATCTATGATGTCATCTTATACTGCT       c.3360
 R  S  M  M  S  S  Y  S  A  A  D  R  S  M  M  S  S  Y  T  A         p.1120

          .         .         .         .         .         .       g.14608
 GATCGTTCAATGATGTCTATGGCTGCTGATTCTTACACCGATTCTTACACTGACACATAT       c.3420
 D  R  S  M  M  S  M  A  A  D  S  Y  T  D  S  Y  T  D  T  Y         p.1140

          .         .         .         .         .         .       g.14668
 ACAGAGGCATATATGGTGCCACCTTTGCCTCCTGAAGAGCCCCCAACAATGCCACCGTTG       c.3480
 T  E  A  Y  M  V  P  P  L  P  P  E  E  P  P  T  M  P  P  L         p.1160

          .         .         .         .         .         .       g.14728
 CCACCTGAGGAGCCACCAATGACACCACCATTGCCTCCTGAGGAACCACCAGAGGGTCCA       c.3540
 P  P  E  E  P  P  M  T  P  P  L  P  P  E  E  P  P  E  G  P         p.1180

          .         .         .         .         .         .       g.14788
 GCATTGCCCACTGAGCAGTCAGCATTAACAGCTGAAAATACTTGGCCTACAGAGGTGCCA       c.3600
 A  L  P  T  E  Q  S  A  L  T  A  E  N  T  W  P  T  E  V  P         p.1200

          .         .         .         .         .         .       g.14848
 TCATCACCATCTGAAGAGTCTGTATCGCAGCCTGAGCCTCCTGTGAGTCAAAGTGAGATT       c.3660
 S  S  P  S  E  E  S  V  S  Q  P  E  P  P  V  S  Q  S  E  I         p.1220

          .         .         .         .         .         .       g.14908
 TCGGAGCCTTCAGCAGTGCCTACTGATTATTCAGTGTCAGCATCAGATCCCTCAGTTTTA       c.3720
 S  E  P  S  A  V  P  T  D  Y  S  V  S  A  S  D  P  S  V  L         p.1240

          .         .         .         .         .         .       g.14968
 GTATCAGAGGCTGCTGTGACTGTTCCAGAACCACCACCAGAGCCAGAATCTTCAATTACG       c.3780
 V  S  E  A  A  V  T  V  P  E  P  P  P  E  P  E  S  S  I  T         p.1260

          .         .         .         .         .         .       g.15028
 TTAACACCTGTAGAGTCTGCAGTAGTAGCAGAAGAACATGAAGTTGTTCCAGAGAGACCA       c.3840
 L  T  P  V  E  S  A  V  V  A  E  E  H  E  V  V  P  E  R  P         p.1280

          .         .         .         .         .         .       g.15088
 GTGACTTGTATGGTATCTGAAACTCCCGCCATGTCAGCTGAACCAACTGTGTTAGCATCA       c.3900
 V  T  C  M  V  S  E  T  P  A  M  S  A  E  P  T  V  L  A  S         p.1300

          .         .         .         .         .         .       g.15148
 GAGCCTCCTGTTATGTCAGAGACAGCAGAAACATTTGATTCCATGAGAGCCTCAGGACAT       c.3960
 E  P  P  V  M  S  E  T  A  E  T  F  D  S  M  R  A  S  G  H         p.1320

          .         .         .         .         .         .       g.15208
 GTTGCCTCAGAAGTATCTACATCCTTGTTGGTTCCAGCAGTAACTACTCCAGTGCTGGCA       c.4020
 V  A  S  E  V  S  T  S  L  L  V  P  A  V  T  T  P  V  L  A         p.1340

          .         .         .         .         .         .       g.15268
 GAGAGCATTCTGGAGCCGCCAGCCATGGCTGCCCCAGAGTCTTCAGCTATGGCTGTCCTG       c.4080
 E  S  I  L  E  P  P  A  M  A  A  P  E  S  S  A  M  A  V  L         p.1360

          .         .         .         .         .         .       g.15328
 GAGTCTTCGGCTGTGACCGTCCTGGAGTCTTCGACTGTGACTGTCCTGGAGTCTTCGACT       c.4140
 E  S  S  A  V  T  V  L  E  S  S  T  V  T  V  L  E  S  S  T         p.1380

          .         .         .         .         .         .       g.15388
 GTAACTGTCCTGGAGCCTTCGGTTGTGACTGTCCCGGAGCCTCCTGTTGTGGCTGAGCCA       c.4200
 V  T  V  L  E  P  S  V  V  T  V  P  E  P  P  V  V  A  E  P         p.1400

          .         .         .         .         .         .       g.15448
 GACTATGTTACCATTCCTGTGCCAGTTGTTTCTGCGCTGGAGCCTTCTGTGCCTGTTCTG       c.4260
 D  Y  V  T  I  P  V  P  V  V  S  A  L  E  P  S  V  P  V  L         p.1420

          .         .         .         .         .         .       g.15508
 GAACCAGCGGTGTCAGTCCTTCAACCTTCTATGATTGTTTCAGAACCATCTGTTTCTGTC       c.4320
 E  P  A  V  S  V  L  Q  P  S  M  I  V  S  E  P  S  V  S  V         p.1440

          .         .         .         .         .         .       g.15568
 CAGGAATCGACTGTGACAGTTTCAGAGCCTGCTGTCACAGTCTCAGAGCAGACTCAAGTA       c.4380
 Q  E  S  T  V  T  V  S  E  P  A  V  T  V  S  E  Q  T  Q  V         p.1460

          .         .         .         .         .         .       g.15628
 ATACCAACTGAGGTGGCTATAGAGTCCACACCAATGATACTGGAATCTAGTATCATGTCA       c.4440
 I  P  T  E  V  A  I  E  S  T  P  M  I  L  E  S  S  I  M  S         p.1480

          .         .         .         .         .         .       g.15688
 TCACATGTTATGAAAGGAATTAATCTATCCTCTGGTGATCAAAATCTTGCTCCAGAGATT       c.4500
 S  H  V  M  K  G  I  N  L  S  S  G  D  Q  N  L  A  P  E  I         p.1500

          .         .         .         .         .         .       g.15748
 GGCATGCAGGAGATTGCATTGCATTCAGGTGAAGAACCACATGCTGAGGAACACCTGAAA       c.4560
 G  M  Q  E  I  A  L  H  S  G  E  E  P  H  A  E  E  H  L  K         p.1520

          .         .         .         .         .         .       g.15808
 GGTGACTTTTACGAAAGTGAACATGGTATAAATATAGACCTTAATATAAATAATCATTTA       c.4620
 G  D  F  Y  E  S  E  H  G  I  N  I  D  L  N  I  N  N  H  L         p.1540

          .         .         .         .         .         .       g.15868
 ATTGCTAAAGAGATGGAACATAATACAGTGTGTGCTGCTGGTACTAGTCCTGTTGGGGAA       c.4680
 I  A  K  E  M  E  H  N  T  V  C  A  A  G  T  S  P  V  G  E         p.1560

          .         .         .         .         .         .       g.15928
 ATTGGTGAAGAGAAAATTTTGCCCACCAGTGAGACTAAACAGCGCACAGTATTGGATACC       c.4740
 I  G  E  E  K  I  L  P  T  S  E  T  K  Q  R  T  V  L  D  T         p.1580

          .         .         .         .         .         .       g.15988
 TACCCTGGTGTTAGTGAAGCTGATGCAGGAGAAACTCTATCTTCTACTGGTCCTTTTGCT       c.4800
 Y  P  G  V  S  E  A  D  A  G  E  T  L  S  S  T  G  P  F  A         p.1600

          .         .         .         .         .         .       g.16048
 CTGGAACCTGATGCAACAGGAACTAGTAAGGGTATTGAATTTACCACAGCATCTACTCTC       c.4860
 L  E  P  D  A  T  G  T  S  K  G  I  E  F  T  T  A  S  T  L         p.1620

          .         .         .         .         .         .       g.16108
 AGTTTAGTTAATAAATATGATGTTGATTTATCTTTAACTACTCAAGATACTGAACATGAC       c.4920
 S  L  V  N  K  Y  D  V  D  L  S  L  T  T  Q  D  T  E  H  D         p.1640

          .         .         .         .         .         .       g.16168
 ATGGTAATTTCCACCAGTCCTAGTGGTGGTAGTGAAGCTGACATTGAAGGGCCTTTGCCT       c.4980
 M  V  I  S  T  S  P  S  G  G  S  E  A  D  I  E  G  P  L  P         p.1660

          .         .         .         .         .         .       g.16228
 GCTAAAGATATTCATCTTGATTTACCATCTAATAATAACCTTGTTAGTAAGGATACAGAA       c.5040
 A  K  D  I  H  L  D  L  P  S  N  N  N  L  V  S  K  D  T  E         p.1680

          .         .         .         .         .         .       g.16288
 GAACCATTACCTGTAAAAGAGAGTGACCAGACATTAGCAGCTCTGCTCAGCCCTAAAGAA       c.5100
 E  P  L  P  V  K  E  S  D  Q  T  L  A  A  L  L  S  P  K  E         p.1700

          .         .         .         .         .         .       g.16348
 AGTAGTGGAGGAGAAAAAGAAGTACCTCCCCCTCCTAAAGAGACACTGCCTGATTCAGGA       c.5160
 S  S  G  G  E  K  E  V  P  P  P  P  K  E  T  L  P  D  S  G         p.1720

          .         .         .         .         .         .       g.16408
 TTTTCTGCCAATATTGAGGATATTAATGAAGCAGATTTAGTGAGACCGTTACTTCCTAAG       c.5220
 F  S  A  N  I  E  D  I  N  E  A  D  L  V  R  P  L  L  P  K         p.1740

          .         .         .         .         .         .       g.16468
 GACATGGAACGTCTTACAAGCCTTAGAGCTGGCATTGAAGGACCTTTACTTGCAAGTGAT       c.5280
 D  M  E  R  L  T  S  L  R  A  G  I  E  G  P  L  L  A  S  D         p.1760

          .         .         .         .         .         .       g.16528
 GTTGGACGTGACAGATCTGCTGCCAGCCCGGTTGTAAGTAGTATGCCAGAAAGAGCTTCA       c.5340
 V  G  R  D  R  S  A  A  S  P  V  V  S  S  M  P  E  R  A  S         p.1780

          .         .         .         .         .         .       g.16588
 GAGTCTTCTTCAGAGGAAAAAGATGATTATGAAATTTTTGTAAAAGTTAAGGACACTCAC       c.5400
 E  S  S  S  E  E  K  D  D  Y  E  I  F  V  K  V  K  D  T  H         p.1800

          .         .         .         .         .         .       g.16648
 GAAAAAAGCAAGAAAAATAAGAACCGTGATAAGGGGGAGAAAGAGAAGAAAAGAGACTCT       c.5460
 E  K  S  K  K  N  K  N  R  D  K  G  E  K  E  K  K  R  D  S         p.1820

          .         .         .         .         .         .       g.16708
 TCATTAAGATCTCGAAGTAAGCGTTCCAAATCTTCTGAACACAAATCACGCAAGCGTACC       c.5520
 S  L  R  S  R  S  K  R  S  K  S  S  E  H  K  S  R  K  R  T         p.1840

          .         .         .         .         .         .       g.16768
 AGTGAATCTCGTTCTAGGGCAAGAAAGAGATCATCTAAGTCCAAGTCTCATCGCTCTCAG       c.5580
 S  E  S  R  S  R  A  R  K  R  S  S  K  S  K  S  H  R  S  Q         p.1860

          .         .         .         .         .         .       g.16828
 ACACGTTCACGGTCACGTTCAAGACGCAGGAGGAGAAGCAGCAGATCAAGATCAAAGTCT       c.5640
 T  R  S  R  S  R  S  R  R  R  R  R  S  S  R  S  R  S  K  S         p.1880

          .         .         .         .         .         .       g.16888
 AGAGGAAGAAGATCTGTATCAAAAGAGAAGCGCAAAAGATCTCCAAAGCACAGATCCAAG       c.5700
 R  G  R  R  S  V  S  K  E  K  R  K  R  S  P  K  H  R  S  K         p.1900

          .         .         .         .         .         .       g.16948
 TCTAGGGAAAGAAAAAGAAAAAGATCAAGCTCCAGGGATAACCGAAAGACAGTTAGAGCT       c.5760
 S  R  E  R  K  R  K  R  S  S  S  R  D  N  R  K  T  V  R  A         p.1920

          .         .         .         .         .         .       g.17008
 CGAAGTCGAACCCCAAGTCGTCGGAGTCGGAGTCATACTCCAAGTCGTCGACGAAGGTCT       c.5820
 R  S  R  T  P  S  R  R  S  R  S  H  T  P  S  R  R  R  R  S         p.1940

          .         .         .         .         .         .       g.17068
 AGATCTGTGGGTAGAAGAAGGAGCTTTAGCATTTCCCCAAGCCGCCGCAGCCGCACCCCC       c.5880
 R  S  V  G  R  R  R  S  F  S  I  S  P  S  R  R  S  R  T  P         p.1960

          .         .         .         .         .         .       g.17128
 AGCCGCCGCAGCCGCACCCCCAGCCGCCGCAGCCGCACCCCCAGCCGCCGCAGCCGCACC       c.5940
 S  R  R  S  R  T  P  S  R  R  S  R  T  P  S  R  R  S  R  T         p.1980

          .         .         .         .         .         .       g.17188
 CCCAGCCGCCGGAGCCGCACCCCTAGCCGTCGGAGCCGCACCCCAAGCCGCCGGAGAAGA       c.6000
 P  S  R  R  S  R  T  P  S  R  R  S  R  T  P  S  R  R  R  R         p.2000

          .         .         .         .         .         .       g.17248
 TCAAGGTCTGTGGTAAGAAGACGAAGCTTCAGTATCTCACCAGTCAGATTAAGGCGATCA       c.6060
 S  R  S  V  V  R  R  R  S  F  S  I  S  P  V  R  L  R  R  S         p.2020

          .         .         .         .         .         .       g.17308
 AGAACACCCTTAAGAAGAAGGTTTAGCAGATCTCCCATCCGTCGTAAAAGATCCAGGTCT       c.6120
 R  T  P  L  R  R  R  F  S  R  S  P  I  R  R  K  R  S  R  S         p.2040

          .         .         .         . | 04       .         .    g.19132
 TCTGAACGAGGCAGATCACCCAAACGTCTGACAGATTTGG | ATAAGGCTCAATTACTTGAA    c.6180
 S  E  R  G  R  S  P  K  R  L  T  D  L  D |   K  A  Q  L  L  E      p.2060

          .         .         .         .         .         .       g.19192
 ATAGCCAAAGCTAATGCAGCTGCCATGTGTGCTAAGGCTGGTGTCCCTTTACCACCAAAC       c.6240
 I  A  K  A  N  A  A  A  M  C  A  K  A  G  V  P  L  P  P  N         p.2080

          .         .         .         .         .         .       g.19252
 CTAAAGCCTGCACCTCCACCTACTATAGAAGAGAAAGTTGCTAAAAAGTCAGGAGGAGCT       c.6300
 L  K  P  A  P  P  P  T  I  E  E  K  V  A  K  K  S  G  G  A         p.2100

          .         .  | 05      .         .         .         .    g.21225
 ACTATAGAAGAACTAACTGAG | AAATGTAAACAGATCGCACAGAGTAAAGAAGATGATGAT    c.6360
 T  I  E  E  L  T  E   | K  C  K  Q  I  A  Q  S  K  E  D  D  D      p.2120

          .         .         .         .         .         .       g.21285
 GTAATAGTGAATAAACCTCATGTTTCGGATGAAGAGGAAGAAGAACCTCCTTTTTATCAT       c.6420
 V  I  V  N  K  P  H  V  S  D  E  E  E  E  E  P  P  F  Y  H         p.2140

          .         .         .         .         | 06         .    g.21555
 CATCCCTTTAAACTCAGTGAACCCAAACCTATTTTTTTCAATCTGAAT | ATTGCTGCAGCA    c.6480
 H  P  F  K  L  S  E  P  K  P  I  F  F  N  L  N   | I  A  A  A      p.2160

          .         .         .         .         .         .       g.21615
 AAACCAACTCCACCAAAAAGCCAGGTAACATTAACAAAAGAATTCCCTGTATCATCTGGA       c.6540
 K  P  T  P  P  K  S  Q  V  T  L  T  K  E  F  P  V  S  S  G         p.2180

          .         .         .         .         .         .       g.21675
 TCTCAACATCGGAAAAAAGAAGCGGATAGTGTTTATGGAGAATGGGTTCCTGTGGAGAAA       c.6600
 S  Q  H  R  K  K  E  A  D  S  V  Y  G  E  W  V  P  V  E  K         p.2200

          .         .         .         .         .        | 07.    g.29116
 AATGGTGAAGAAAACAAAGATGATGATAATGTTTTCAGCAGCAATTTGCCCTCAGAG | CCT    c.6660
 N  G  E  E  N  K  D  D  D  N  V  F  S  S  N  L  P  S  E   | P      p.2220

          .         .         .         .         .         .       g.29176
 GTGGACATCTCTACAGCAATGAGTGAACGGGCACTTGCTCAGAAAAGACTCAGTGAGAAT       c.6720
 V  D  I  S  T  A  M  S  E  R  A  L  A  Q  K  R  L  S  E  N         p.2240

          .         .         .         .         | 08         .    g.30939
 GCATTTGATCTTGAAGCCATGAGCATGTTAAATAGAGCTCAGGAAAGG | ATTGATGCCTGG    c.6780
 A  F  D  L  E  A  M  S  M  L  N  R  A  Q  E  R   | I  D  A  W      p.2260

          .         .         .         .         .         .       g.30999
 GCTCAGCTGAACTCTATTCCTGGCCAGTTCACAGGAAGTACAGGAGTACAGGTTTTGACA       c.6840
 A  Q  L  N  S  I  P  G  Q  F  T  G  S  T  G  V  Q  V  L  T         p.2280

          .         .         .         .      | 09  .         .    g.35279
 CAAGAACAGTTGGCCAATACTGGTGCCCAAGCCTGGATTAAAAAG | GATCAGTTCTTAAGA    c.6900
 Q  E  Q  L  A  N  T  G  A  Q  A  W  I  K  K   | D  Q  F  L  R      p.2300

          .         .         .         .         .         .       g.35339
 GCAGCCCCGGTAACTGGAGGAATGGGAGCCGTTTTGATGAGAAAAATGGGCTGGAGAGAA       c.6960
 A  A  P  V  T  G  G  M  G  A  V  L  M  R  K  M  G  W  R  E         p.2320

          .         .         .         .         .         .       g.35399
 GGAGAAGGATTAGGAAAAAACAAAGAAGGCAATAAGGAACCCATCCTAGTTGATTTTAAG       c.7020
 G  E  G  L  G  K  N  K  E  G  N  K  E  P  I  L  V  D  F  K         p.2340

          .    | 10    .         .         .         .         .    g.37599
 ACAGACCGAAAAG | GTCTTGTTGCAGTAGGAGAAAGAGCACAAAAGAGGTCTGGGAACTTC    c.7080
 T  D  R  K  G |   L  V  A  V  G  E  R  A  Q  K  R  S  G  N  F      p.2360

          .         .      | 11  .         .         .         .    g.37769
 TCTGCTGCAATGAAAGATCTGTCAG | GCAAACATCCTGTGTCTGCTTTGATGGAGATCTGT    c.7140
 S  A  A  M  K  D  L  S  G |   K  H  P  V  S  A  L  M  E  I  C      p.2380

          .         .         .         .         .         .       g.37829
 AATAAAAGAAGGTGGCAACCACCTGAATTTCTATTGGTCCATGATAGTGGCCCTGATCAT       c.7200
 N  K  R  R  W  Q  P  P  E  F  L  L  V  H  D  S  G  P  D  H         p.2400

          .         .  | 12      .         .         .         .    g.38360
 CGCAAACATTTTCTCTTTAGG | GTATTGAGAAATGGAGCCCTTACCAGACCCAATTGTATG    c.7260
 R  K  H  F  L  F  R   | V  L  R  N  G  A  L  T  R  P  N  C  M      p.2420

          .         .                                               g.38381
 TTTTTCTTGAATAGGTATTGA                                              c.7281
 F  F  L  N  R  Y  X                                                p.2426

          .         .         .         .         .         .       g.38441
 taaatggaagcgcttaccagcccagctttgccagccctaataagaagcatgctaaagcca       c.*60

          .         .         .         .         .         .       g.38501
 cagcagctactgtggttcttcaagcaatgggccttgtaccaaaggacctcatggctaatg       c.*120

          .         .         .         .         .         .       g.38561
 ccacttgcttcaggagtgcctcacgtagatagattgaggttttataataatcatttcaga       c.*180

          .         .         .         .         .         .       g.38621
 attttactctgcatcacaatgtatttcctctttaatgttgtaaatatttggcaatttaag       c.*240

          .         .         .         .         .         .       g.38681
 acattgtgtaaaaagcaatctgtaaaaacatctccaggctttgatttttgtaccatggaa       c.*300

          .         .         .         .         .         .       g.38741
 attgtatttaaccatacagggttttggtatgtttatattgtttaccttagtgatgtattt       c.*360

          .         .         .         .         .         .       g.38801
 gtttaagtggctaacatccaaacgactgtttgaaggcatcagagtaatcttcagtgtgga       c.*420

          .         .         .         .         .         .       g.38861
 atgttaaataacgcttttatactgtattttgtactatgatgtaactccccttccttatgg       c.*480

          .         .         .         .         .         .       g.38921
 ctaggctactgtaacacttgcctgtaatcagtgaagggctgtgcaccttgtactatttca       c.*540

          .         .         .         .         .         .       g.38981
 caatgggttctgctggacagataatgggccagtgttattgaggtgatcaagatctgttcc       c.*600

          .         .         .         .         .         .       g.39041
 acagggctaatgccaccatctcccctcaaaattttgtagaggttctaaaaagaaagtggt       c.*660

          .         .         .         .         .         .       g.39101
 atgttgtgtgatgatcagcactaagtcctgcattcctgttaaagccacttgggtcataag       c.*720

          .         .         .         .         .         .       g.39161
 aagggagtaaaaaatgaagtctgactagaattctattgcagaggccagtacatttagtat       c.*780

          .         .         .         .         .         .       g.39221
 ggcattgagttgtgatatagttttactttgatgtgcattttgaatttcagctacacctag       c.*840

          .         .         .         .         .         .       g.39281
 atagacgtaaaatgataattaaaatgctgtaaccaacttatctaataaaattggcaacca       c.*900

          .         .         .         .         .         .       g.39341
 gccactattttgttgactatgagaaagttaaaagtttatgttaatttttagggtctgata       c.*960

          .         .         .         .         .         .       g.39401
 gaatatttcatgtgtattacagtggtattcatatgctatgtctctaaactttattttcaa       c.*1020

          .         .         .         .         .         .       g.39461
 aagcttaaggcccaaatacaaacttctctggaataaacgtggtgttttattttctgggtt       c.*1080

          .                                                         g.39471
 ataaagtgaa                                                         c.*1090

 (downstream sequence)
Legend:
Nucleotide numbering, following the HGVS rules for a 'Coding DNA Reference Sequence', is shown at the right of the sequence and counts the A of the ATG translation initiation codon as position 1. Every 10th nucleotide is marked by a "." above the sequence. The SON DNA binding protein sequence is shown below the coding DNA sequence, with numbering at the right starting at 1 for the translation-initiating methionine. Every 10th amino acid is shown in bold. The position of an intron is indicated by a vertical line separating the two flanking exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is shown above the first nucleotide(s) of the exon. To aid the description of frameshift variants, all stop codons in the +1 frame are shown in bold and all stop codons in the +2 frame are underlined.
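
The numbering rules in the legend can be illustrated with a short Python sketch. It is not part of LOVD; the function names `codon_of` and `stops_in_frame` are hypothetical, and the sketch assumes that "+1 frame" and "+2 frame" mean reading triplets starting one or two nucleotides downstream of c.1. It covers coding positions only (c.1 and later), so upstream (c.-N) and downstream (c.*N) positions are out of scope.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codon_of(c_pos: int) -> tuple[int, int]:
    """Map a coding position (c.1 or later) to (codon number, base 1..3 within the codon)."""
    if c_pos < 1:
        raise ValueError("only positions from c.1 onwards lie in a codon")
    return (c_pos - 1) // 3 + 1, (c_pos - 1) % 3 + 1


def stops_in_frame(cds: str, shift: int) -> list[int]:
    """Return the c. position of the first base of every stop codon
    found when reading triplets starting `shift` bases after c.1."""
    if shift not in (1, 2):
        raise ValueError("shift must be 1 or 2")
    cds = cds.upper()
    return [i + 1 for i in range(shift, len(cds) - 2, 3) if cds[i:i + 3] in STOP_CODONS]


# First 60 coding nucleotides (c.1 to c.60) as printed in the sequence above.
first_row = "ATGGCGACCAACATCGAGCAGATTTTTAGGTCTTTCGTGGTCAGTAAATTCCGGGAAATT"

print(codon_of(60))                  # last base of codon 20, matching the p.20 label
print(codon_of(7281))                # last base of the stop codon
print(stops_in_frame(first_row, 1))  # stop codons in the +1 frame
print(stops_in_frame(first_row, 2))  # stop codons in the +2 frame
```

For the first row of the coding sequence, this reading finds no stop codon in the +1 frame and two in the +2 frame: TAG starting at c.27 and TAA starting at c.45. Position c.7281 falls in codon 2427, which is the stop codon, so the last amino acid of the protein is p.2426.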

©2004-2016 Leiden University Medical Center