survival of motor neuron 2, centromeric (SMN2) - coding DNA reference sequence

(used for variant description)

(last modified May 10, 2019)


This file was created to facilitate the description of sequence variants on transcript NM_017411.3 in the SMN2 gene based on a coding DNA reference sequence following the HGVS recommendations.

The sequence was taken from NG_008728.1, covering SMN2 transcript NM_017411.3.


Please note that introns are available by clicking on the exon numbers above the sequence.
 (upstream sequence)
                     .         .         .         .                g.5043
                  ccacaaatgtgggagggcgataaccactcgtagaaagcgtgag       c.-121

 .         .         .         .         .         .                g.5103
 aagttactacaagcggtcctcccggccaccgtactgttccgctcccagaagccccgggcg       c.-61

 .         .         .         .         .         .                g.5163
 gcggaagtcgtcactcttaagaagggacggggccccacgctgcgcacccgcgggtttgct       c.-1

          .         .         .         .         .         .       g.5223
 ATGGCGATGAGCAGCGGCGGCAGTGGTGGCGGCGTCCCGGAGCAGGAGGATTCCGTGCTG       c.60
 M  A  M  S  S  G  G  S  G  G  G  V  P  E  Q  E  D  S  V  L         p.20

          .         .  | 02      .         .         .         .    g.18931
 TTCCGGCGCGGCACAGGCCAG | AGCGATGATTCTGACATTTGGGATGATACAGCACTGATA    c.120
 F  R  R  G  T  G  Q   | S  D  D  S  D  I  W  D  D  T  A  L  I      p.40

          .         .         .    | 03    .         .         .    g.21469
 AAAGCATATGATAAAGCTGTGGCTTCATTTAAG | CATGCTCTAAAGAATGGTGACATTTGT    c.180
 K  A  Y  D  K  A  V  A  S  F  K   | H  A  L  K  N  G  D  I  C      p.60

          .         .         .         .         .         .       g.21529
 GAAACTTCGGGTAAACCAAAAACCACACCTAAAAGAAAACCTGCTAAGAAGAATAAAAGC       c.240
 E  T  S  G  K  P  K  T  T  P  K  R  K  P  A  K  K  N  K  S         p.80

          .         .         .    | 04    .         .         .    g.22438
 CAAAAGAAGAATACTGCAGCTTCCTTACAACAG | TGGAAAGTTGGGGACAAATGTTCTGCC    c.300
 Q  K  K  N  T  A  A  S  L  Q  Q   | W  K  V  G  D  K  C  S  A      p.100

          .         .         .         .         .         .       g.22498
 ATTTGGTCAGAAGACGGTTGCATTTACCCAGCTACCATTGCTTCAATTGATTTTAAGAGA       c.360
 I  W  S  E  D  G  C  I  Y  P  A  T  I  A  S  I  D  F  K  R         p.120

          .         .         .         .         .         .       g.22558
 GAAACCTGTGTTGTGGTTTACACTGGATATGGAAATAGAGAGGAGCAAAATCTGTCCGAT       c.420
 E  T  C  V  V  V  Y  T  G  Y  G  N  R  E  E  Q  N  L  S  D         p.140

          .         .         .         .         .     | 05   .    g.22777
 CTACTTTCCCCAATCTGTGAAGTAGCTAATAATATAGAACAAAATGCTCAAGAG | AATGAA    c.480
 L  L  S  P  I  C  E  V  A  N  N  I  E  Q  N  A  Q  E   | N  E      p.160

          .         .         .         .         .         .       g.22837
 AATGAAAGCCAAGTTTCAACAGATGAAAGTGAGAACTCCAGGTCTCCTGGAAATAAATCA       c.540
 N  E  S  Q  V  S  T  D  E  S  E  N  S  R  S  P  G  N  K  S         p.180

          .         .         .         .         .         .       g.22897
 GATAACATCAAGCCCAAATCTGCTCCATGGAACTCTTTTCTCCCTCCACCACCCCCCATG       c.600
 D  N  I  K  P  K  S  A  P  W  N  S  F  L  P  P  P  P  P  M         p.200

          .         .        | 06.         .         .         .    g.24745
 CCAGGGCCAAGACTGGGACCAGGAAAG | CCAGGTCTAAAATTCAATGGCCCACCACCGCCA    c.660
 P  G  P  R  L  G  P  G  K   | P  G  L  K  F  N  G  P  P  P  P      p.220

          .         .         .         .         .         .       g.24805
 CCGCCACCACCACCACCCCACTTACTATCATGCTGGCTGCCTCCATTTCCTTCTGGACCA       c.720
 P  P  P  P  P  P  H  L  L  S  C  W  L  P  P  F  P  S  G  P         p.240

     | 07    .         .         .         .         .         .    g.26175
 CCA | ATAATTCCCCCACCACCTCCCATATGTCCAGATTCTCTTGATGATGCTGATGCTTTG    c.780
 P   | I  I  P  P  P  P  P  I  C  P  D  S  L  D  D  A  D  A  L      p.260

          .         .         .         .         .     | 08   .    g.32004
 GGAAGTATGTTAATTTCATGGTACATGAGTGGCTATCATACTGGCTATTATATG | GGTTTT    c.840
 G  S  M  L  I  S  W  Y  M  S  G  Y  H  T  G  Y  Y  M   | G  F      p.280

          .         .         .         .                       g.32049
 AGACAAAATCAAAAAGAAGGAAGGTGCTCACATTCCTTAAATTAA |                   c.886
 R  Q  N  Q  K  E  G  R  C  S  H  S  L  N  X                     p.294

     | 09    .         .         .         .         .         .    g.32553
 gga | gaaatgctggcatagagcagcactaaatgacaccactaaagaaacgatcagacagat    c.*60

          .         .         .         .         .         .       g.32613
 ctggaatgtgaagcgttatagaagataactggcctcatttcttcaaaatatcaagtgttg       c.*120

          .         .         .         .         .         .       g.32673
 ggaaagaaaaaaggaagtggaatgggtaactcttcttgattaaaagttatgtaataacca       c.*180

          .         .         .         .         .         .       g.32733
 aatgcaatgtgaaatattttactggactctattttgaaaaaccatctgtaaaagactgag       c.*240

          .         .         .         .         .         .       g.32793
 gtgggggtgggaggccagcacggtggtgaggcagttgagaaaatttgaatgtggattaga       c.*300

          .         .         .         .         .         .       g.32853
 ttttgaatgatattggataattattggtaattttatgagctgtgagaagggtgttgtagt       c.*360

          .         .         .         .         .         .       g.32913
 ttataaaagactgtcttaatttgcatacttaagcatttaggaatgaagtgttagagtgtc       c.*420

          .         .         .         .         .         .       g.32973
 ttaaaatgtttcaaatggtttaacaaaatgtatgtgaggcgtatgtggcaaaatgttaca       c.*480

          .         .         .         .         .         .       g.33033
 gaatctaactggtggacatggctgttcattgtactgtttttttctatcttctatatgttt       c.*540

          .         .         .         .                           g.33073
 aaaagtatataataaaaatatttaatttttttttaaatta                           c.*580

 (downstream sequence)
Legend:
Nucleotide numbering (following the rules of the HGVS for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The Survival of motor neuron 2, centromeric protein sequence is shown below the coding DNA sequence, with numbering indicated at the right starting with 1 for the translation initiating Methionine. Every 10th amino acid is shown in bold. The position of introns is indicated by a vertical line, splitting the two exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frame shift variants, all stop codons in the +1 frame are shown in bold while all stop codons in the +2 frame are underlined.

Powered by LOVD v.3.0 Build 21c
©2004-2019 Leiden University Medical Center