polymerase (RNA) II (DNA directed) polypeptide A, 220kDa (POLR2A) - coding DNA reference sequence

(used for variant description)

(last modified August 7, 2019)


This file was created to facilitate the description of sequence variants in the POLR2A gene relative to transcript NM_000937.4, using a coding DNA reference sequence as recommended by the HGVS.

The sequence was taken from NG_027747.1, covering POLR2A transcript NM_000937.4.


 (upstream sequence)
                                         .         .                g.5026
                                   gagagcgcggccgggacggttggaga       c.-361

 .         .         .         .         .         .                g.5086
 agaaggcggctcccggaagggggagagacaaactgccgtaacctctgccgttcaggaacc       c.-301

 .         .         .         .         .         .                g.5146
 cggttacttatttattcgttaccctttttcttcttcctcccccaaaaaccttttcctttt       c.-241

 .         .         .         .         .         .                g.5206
 cccttctttttttttcctttttgggagctgaaaaatttccggtaagggaaagaagggctc       c.-181

 .         .         .         .         .         .                g.5266
 ctttcgctccttatttccccgcctccttccctcccccaccttcccctcctccggcttttt       c.-121

 .         .         .         .         .         .                g.5326
 cctcccaactcggggaggtccttcccggtggccgccctgacgaggtctgagcacctaggc       c.-61

 .         .         .         .         .         .                g.5386
 ggaggcggcgcaggctttttgtagtgaggtttgcgcctgcgcagcgcgcctgcctccgcc       c.-1

          .         .         .         .         .         .       g.5446
 ATGCACGGGGGTGGCCCCCCCTCGGGGGACAGCGCATGCCCGCTGCGCACCATCAAGAGA       c.60
 M  H  G  G  G  P  P  S  G  D  S  A  C  P  L  R  T  I  K  R         p.20

          .         .         .    | 02    .         .         .    g.16589
 GTCCAGTTCGGAGTCCTGAGTCCGGATGAACTG | AAGCGAATGTCTGTGACGGAGGGTGGC    c.120
 V  Q  F  G  V  L  S  P  D  E  L   | K  R  M  S  V  T  E  G  G      p.40

          .         .         .         .         .         .       g.16649
 ATCAAATACCCAGAGACGACTGAGGGAGGCCGCCCCAAGCTTGGGGGGCTGATGGACCCG       c.180
 I  K  Y  P  E  T  T  E  G  G  R  P  K  L  G  G  L  M  D  P         p.60

          .         .         .         .       | 03 .         .    g.16845
 AGGCAGGGGGTGATTGAGCGGACTGGCCGCTGCCAAACATGTGCAG | GAAACATGACAGAG    c.240
 R  Q  G  V  I  E  R  T  G  R  C  Q  T  C  A  G |   N  M  T  E      p.80

          .         .         .         .         .         .       g.16905
 TGTCCTGGCCACTTTGGCCACATTGAACTGGCCAAGCCTGTGTTTCACGTGGGCTTCCTG       c.300
 C  P  G  H  F  G  H  I  E  L  A  K  P  V  F  H  V  G  F  L         p.100

          .         .         .         .         .         .       g.16965
 GTGAAGACAATGAAAGTTTTGCGCTGTGTCTGCTTCTTCTGCTCCAAACTGCTTGTGGAC       c.360
 V  K  T  M  K  V  L  R  C  V  C  F  F  C  S  K  L  L  V  D         p.120

     | 04    .         .         .         .         .         .    g.17118
 TCT | AACAACCCAAAGATCAAGGATATCCTGGCTAAGTCCAAGGGACAGCCCAAGAAGCGG    c.420
 S   | N  N  P  K  I  K  D  I  L  A  K  S  K  G  Q  P  K  K  R      p.140

          .         .         .         .         .         .       g.17178
 CTCACACATGTCTACGACCTTTGCAAGGGCAAAAACATATGCGAGGGTGGGGAGGAGATG       c.480
 L  T  H  V  Y  D  L  C  K  G  K  N  I  C  E  G  G  E  E  M         p.160

          .         .         .         .         .        | 05.    g.17388
 GACAACAAGTTCGGTGTGGAACAACCTGAGGGTGACGAGGATCTGACCAAAGAAAAG | GGC    c.540
 D  N  K  F  G  V  E  Q  P  E  G  D  E  D  L  T  K  E  K   | G      p.180

          .         .         .         .         .         .       g.17448
 CATGGTGGCTGTGGGCGGTACCAGCCCAGGATCCGGCGTTCTGGCCTAGAGCTGTATGCG       c.600
 H  G  G  C  G  R  Y  Q  P  R  I  R  R  S  G  L  E  L  Y  A         p.200

          .         .         .         .         .         .       g.17508
 GAATGGAAGCACGTTAATGAGGACTCTCAGGAGAAGAAGATCCTGCTGAGTCCAGAGCGA       c.660
 E  W  K  H  V  N  E  D  S  Q  E  K  K  I  L  L  S  P  E  R         p.220

          .         .         .         .         .         .       g.17568
 GTGCATGAGATCTTCAAACGCATCTCAGATGAGGAGTGTTTTGTGCTGGGCATGGAGCCC       c.720
 V  H  E  I  F  K  R  I  S  D  E  E  C  F  V  L  G  M  E  P         p.240

          .         .         .         .         .         .       g.17628
 CGCTATGCACGGCCAGAGTGGATGATTGTCACAGTGCTGCCTGTGCCCCCGCTCTCCGTG       c.780
 R  Y  A  R  P  E  W  M  I  V  T  V  L  P  V  P  P  L  S  V         p.260

          .         .         .          | 06        .         .    g.17998
 CGGCCTGCTGTTGTGATGCAGGGCTCTGCCCGTAACCAG | GATGACCTGACTCACAAACTG    c.840
 R  P  A  V  V  M  Q  G  S  A  R  N  Q   | D  D  L  T  H  K  L      p.280

          .         .         .         .         .         .       g.18058
 GCTGACATCGTGAAGATCAACAATCAGCTGCGGCGCAATGAGCAGAACGGCGCAGCGGCC       c.900
 A  D  I  V  K  I  N  N  Q  L  R  R  N  E  Q  N  G  A  A  A         p.300

          .         .         .         .         .         .       g.18118
 CATGTCATTGCAGAGGATGTGAAGCTCCTCCAGTTCCATGTGGCCACCATGGTGGACAAT       c.960
 H  V  I  A  E  D  V  K  L  L  Q  F  H  V  A  T  M  V  D  N         p.320

          .         .  | 07      .         .         .         .    g.18310
 GAGCTGCCTGGCTTGCCCCGT | GCCATGCAGAAGTCTGGGCGTCCCCTCAAGTCCCTGAAG    c.1020
 E  L  P  G  L  P  R   | A  M  Q  K  S  G  R  P  L  K  S  L  K      p.340

          .         .         .         .         .         .       g.18370
 CAGCGGTTGAAGGGCAAGGAAGGCCGGGTGCGAGGGAACCTGATGGGCAAAAGAGTGGAC       c.1080
 Q  R  L  K  G  K  E  G  R  V  R  G  N  L  M  G  K  R  V  D         p.360

          .         .         .         .         .         .       g.18430
 TTCTCGGCCCGTACTGTCATCACCCCCGACCCCAACCTCTCCATTGACCAGGTTGGCGTG       c.1140
 F  S  A  R  T  V  I  T  P  D  P  N  L  S  I  D  Q  V  G  V         p.380

          .         .         .         .         .         .       g.18490
 CCCCGCTCCATTGCTGCCAACATGACCTTTGCGGAGATTGTCACCCCCTTCAACATTGAC       c.1200
 P  R  S  I  A  A  N  M  T  F  A  E  I  V  T  P  F  N  I  D         p.400

    | 08     .         .         .         .         .         .    g.18757
 AG | ACTTCAAGAACTAGTGCGCAGGGGGAACAGCCAGTACCCAGGCGCCAAGTACATCATC    c.1260
 R  |  L  Q  E  L  V  R  R  G  N  S  Q  Y  P  G  A  K  Y  I  I      p.420

          .         .         .         .         .         .       g.18817
 CGAGACAATGGTGATCGCATTGACTTGCGTTTCCACCCCAAGCCCAGTGACCTTCACCTG       c.1320
 R  D  N  G  D  R  I  D  L  R  F  H  P  K  P  S  D  L  H  L         p.440

          .      | 09  .         .         .         .         .    g.19705
 CAGACCGGCTATAAG | GTGGAACGGCACATGTGTGATGGGGACATTGTTATCTTCAACCGG    c.1380
 Q  T  G  Y  K   | V  E  R  H  M  C  D  G  D  I  V  I  F  N  R      p.460

          .         .         .         .         .         .       g.19765
 CAGCCAACTCTGCACAAAATGTCCATGATGGGGCATCGGGTCCGCATTCTCCCATGGTCT       c.1440
 Q  P  T  L  H  K  M  S  M  M  G  H  R  V  R  I  L  P  W  S         p.480

          .         . | 10       .         .         .         .    g.19942
 ACCTTTCGCTTGAATCTTAG | TGTGACAACTCCGTACAATGCAGACTTTGACGGGGATGAG    c.1500
 T  F  R  L  N  L  S  |  V  T  T  P  Y  N  A  D  F  D  G  D  E      p.500

          .         .         .         .         .         .       g.20002
 ATGAACTTGCACCTGCCACAGTCTCTGGAGACGCGAGCAGAGATCCAGGAGCTGGCCATG       c.1560
 M  N  L  H  L  P  Q  S  L  E  T  R  A  E  I  Q  E  L  A  M         p.520

          .         .         .         .         .         .       g.20062
 GTTCCTCGCATGATTGTCACCCCCCAGAGCAATCGGCCTGTCATGGGTATTGTGCAGGAC       c.1620
 V  P  R  M  I  V  T  P  Q  S  N  R  P  V  M  G  I  V  Q  D         p.540

          .         .         .         .         .  | 11      .    g.21269
 ACACTCACAGCAGTGCGCAAATTCACCAAGAGAGACGTCTTCCTGGAGCGG | GGTGAAGTG    c.1680
 T  L  T  A  V  R  K  F  T  K  R  D  V  F  L  E  R   | G  E  V      p.560

          .         .         .         .         .         .       g.21329
 ATGAACCTCCTGATGTTCCTGTCGACGTGGGATGGGAAGGTCCCACAGCCGGCCATCCTA       c.1740
 M  N  L  L  M  F  L  S  T  W  D  G  K  V  P  Q  P  A  I  L         p.580

          .         .         .         .         .         .       g.21389
 AAGCCCCGGCCCCTGTGGACAGGCAAGCAAATCTTCTCCCTCATCATACCTGGTCACATC       c.1800
 K  P  R  P  L  W  T  G  K  Q  I  F  S  L  I  I  P  G  H  I         p.600

          .         .         .         .         .         .       g.21449
 AATTGTATCCGTACCCACAGCACCCATCCCGATGATGAAGACAGTGGCCCTTACAAGCAC       c.1860
 N  C  I  R  T  H  S  T  H  P  D  D  E  D  S  G  P  Y  K  H         p.620

          .         .  | 12      .         .         .         .    g.21600
 ATCTCTCCTGGGGACACCAAG | GTGGTGGTGGAGAATGGGGAGCTGATCATGGGCATCCTG    c.1920
 I  S  P  G  D  T  K   | V  V  V  E  N  G  E  L  I  M  G  I  L      p.640

          .         .         .         .         .         .       g.21660
 TGTAAGAAGTCTCTGGGCACGTCAGCTGGCTCCCTGGTCCACATCTCCTACCTAGAGATG       c.1980
 C  K  K  S  L  G  T  S  A  G  S  L  V  H  I  S  Y  L  E  M         p.660

          .         .         .         .         .         .       g.21720
 GGTCATGACATCACTCGCCTCTTCTACTCCAACATTCAGACTGTCATTAACAACTGGCTC       c.2040
 G  H  D  I  T  R  L  F  Y  S  N  I  Q  T  V  I  N  N  W  L         p.680

          . | 13       .         .         .         .         .    g.21960
 CTCATCGAGG | GTCATACTATTGGCATTGGGGACTCCATTGCTGATTCTAAGACTTACCAG    c.2100
 L  I  E  G |   H  T  I  G  I  G  D  S  I  A  D  S  K  T  Y  Q      p.700

          .         .         .         .      | 14  .         .    g.22162
 GACATTCAGAACACTATTAAGAAGGCCAAGCAGGACGTAATAGAG | GTCATCGAGAAGGCA    c.2160
 D  I  Q  N  T  I  K  K  A  K  Q  D  V  I  E   | V  I  E  K  A      p.720

          .         .         .         .         .         .       g.22222
 CACAACAATGAGCTGGAGCCCACCCCAGGGAACACTCTGCGGCAGACGTTTGAGAATCAG       c.2220
 H  N  N  E  L  E  P  T  P  G  N  T  L  R  Q  T  F  E  N  Q         p.740

          .         .         .         .         .         .       g.22282
 GTGAACCGCATTCTTAACGATGCCCGAGACAAGACTGGCTCCTCTGCTCAGAAATCCCTG       c.2280
 V  N  R  I  L  N  D  A  R  D  K  T  G  S  S  A  Q  K  S  L         p.760

          .         .         .         .         .         .       g.22342
 TCTGAATACAACAACTTCAAGTCTATGGTCGTGTCCGGAGCTAAAGGTTCCAAGATTAAC       c.2340
 S  E  Y  N  N  F  K  S  M  V  V  S  G  A  K  G  S  K  I  N         p.780

           | 15        .         .         .         .         .    g.22572
 ATCTCCCAG | GTCATTGCTGTCGTTGGACAGCAGAACGTCGAGGGCAAGCGGATTCCATTT    c.2400
 I  S  Q   | V  I  A  V  V  G  Q  Q  N  V  E  G  K  R  I  P  F      p.800

          .         .         .         .         .         .       g.22632
 GGCTTCAAGCACCGGACTCTGCCTCACTTCATCAAGGATGACTACGGGCCTGAGAGCCGT       c.2460
 G  F  K  H  R  T  L  P  H  F  I  K  D  D  Y  G  P  E  S  R         p.820

          .         .         .         .         .         .       g.22692
 GGCTTTGTGGAGAACTCCTACCTAGCCGGCCTCACACCCACTGAGTTCTTTTTCCACGCC       c.2520
 G  F  V  E  N  S  Y  L  A  G  L  T  P  T  E  F  F  F  H  A         p.840

          .         .         .         .         .   | 16     .    g.23147
 ATGGGGGGTCGTGAGGGGCTCATTGACACGGCTGTCAAGACTGCTGAGACTG | GATACATC    c.2580
 M  G  G  R  E  G  L  I  D  T  A  V  K  T  A  E  T  G |   Y  I      p.860

          .         .         .         .         .         .       g.23207
 CAGCGGCGGCTGATCAAGTCCATGGAGTCAGTGATGGTGAAGTACGACGCGACTGTGCGG       c.2640
 Q  R  R  L  I  K  S  M  E  S  V  M  V  K  Y  D  A  T  V  R         p.880

          .         .         .         .         .         .       g.23267
 AACTCCATCAACCAGGTGGTGCAGCTGCGCTACGGCGAAGACGGCCTGGCAGGCGAGAGC       c.2700
 N  S  I  N  Q  V  V  Q  L  R  Y  G  E  D  G  L  A  G  E  S         p.900

          .         .         .         .         .       | 17 .    g.23746
 GTTGAGTTCCAGAACCTGGCTACGCTTAAGCCTTCCAACAAGGCTTTTGAGAAGAA | GTTC    c.2760
 V  E  F  Q  N  L  A  T  L  K  P  S  N  K  A  F  E  K  K  |  F      p.920

          .         .         .         .         .         .       g.23806
 CGCTTTGATTATACCAATGAGAGGGCCCTGCGGCGCACTCTGCAGGAGGACCTGGTGAAG       c.2820
 R  F  D  Y  T  N  E  R  A  L  R  R  T  L  Q  E  D  L  V  K         p.940

          .         .         .         .         .         .       g.23866
 GACGTGCTGAGCAACGCACACATCCAGAACGAGTTGGAGCGGGAATTTGAGCGGATGCGG       c.2880
 D  V  L  S  N  A  H  I  Q  N  E  L  E  R  E  F  E  R  M  R         p.960

          .         .         .         .         | 18         .    g.24018
 GAGGATCGGGAGGTGCTCAGGGTCATCTTCCCAACTGGAGACAGCAAG | GTCGTCCTCCCC    c.2940
 E  D  R  E  V  L  R  V  I  F  P  T  G  D  S  K   | V  V  L  P      p.980

          .         .         .         .         .         .       g.24078
 TGTAACCTGCTGCGGATGATCTGGAATGCTCAGAAAATCTTCCACATCAACCCACGCCTT       c.3000
 C  N  L  L  R  M  I  W  N  A  Q  K  I  F  H  I  N  P  R  L         p.1000

          .         .         .     | 19   .         .         .    g.24233
 CCCTCCGACCTGCACCCCATCAAAGTGGTGGAGG | GAGTCAAGGAATTGAGCAAGAAGCTG    c.3060
 P  S  D  L  H  P  I  K  V  V  E  G |   V  K  E  L  S  K  K  L      p.1020

          .         .         .         .         .         .       g.24293
 GTGATTGTGAATGGGGATGACCCACTAAGTCGACAGGCCCAGGAAAATGCCACGCTGCTC       c.3120
 V  I  V  N  G  D  D  P  L  S  R  Q  A  Q  E  N  A  T  L  L         p.1040

          .         .         .         .         .         .       g.24353
 TTCAACATCCACCTGCGGTCCACGTTGTGTTCCCGCCGCATGGCAGAGGAGTTTCGGCTC       c.3180
 F  N  I  H  L  R  S  T  L  C  S  R  R  M  A  E  E  F  R  L         p.1060

          .         .         .         .         .         .       g.24413
 AGTGGGGAGGCCTTCGACTGGCTGCTTGGGGAGATTGAGTCCAAGTTCAACCAAGCCATT       c.3240
 S  G  E  A  F  D  W  L  L  G  E  I  E  S  K  F  N  Q  A  I         p.1080

  | 20       .         .         .         .         .         .    g.28932
  | GCGCATCCCGGGGAAATGGTGGGGGCTCTGGCTGCGCAGTCCCTTGGAGAACCTGCCACC    c.3300
  | A  H  P  G  E  M  V  G  A  L  A  A  Q  S  L  G  E  P  A  T      p.1100

          .         .         .         .         .         .       g.28992
 CAGATGACCTTGAATACCTTCCACTATGCTGGTGTGTCTGCCAAGAATGTGACGCTGGGT       c.3360
 Q  M  T  L  N  T  F  H  Y  A  G  V  S  A  K  N  V  T  L  G         p.1120

          .         .         .         .         .         .       g.29052
 GTGCCCCGACTTAAGGAGCTCATCAACATTTCCAAGAAGCCAAAGACTCCTTCGCTTACT       c.3420
 V  P  R  L  K  E  L  I  N  I  S  K  K  P  K  T  P  S  L  T         p.1140

          .         .         .         .      | 21  .         .    g.29580
 GTCTTCCTGTTGGGCCAGTCCGCTCGAGATGCTGAGAGAGCCAAG | GATATTCTGTGCCGT    c.3480
 V  F  L  L  G  Q  S  A  R  D  A  E  R  A  K   | D  I  L  C  R      p.1160

          .         .         .         .         .         .       g.29640
 CTGGAGCATACAACGTTGAGGAAGGTGACTGCCAACACAGCCATCTACTATGACCCCAAC       c.3540
 L  E  H  T  T  L  R  K  V  T  A  N  T  A  I  Y  Y  D  P  N         p.1180

          .         .         .         .         .         .       g.29700
 CCCCAGAGCACGGTGGTGGCAGAGGATCAGGAATGGGTGAATGTCTACTATGAAATGCCT       c.3600
 P  Q  S  T  V  V  A  E  D  Q  E  W  V  N  V  Y  Y  E  M  P         p.1200

          .         .         .         .         .         .       g.29760
 GACTTTGATGTGGCCCGAATCTCCCCCTGGCTGTTGCGGGTGGAGCTGGATCGGAAGCAC       c.3660
 D  F  D  V  A  R  I  S  P  W  L  L  R  V  E  L  D  R  K  H         p.1220

          .         .         .         .         .   | 22     .    g.30161
 ATGACTGACCGGAAGCTCACCATGGAGCAGATTGCTGAAAAGATCAATGCTG | GTTTTGGT    c.3720
 M  T  D  R  K  L  T  M  E  Q  I  A  E  K  I  N  A  G |   F  G      p.1240

          .         .         .         .         .         .       g.30221
 GACGACTTGAACTGCATCTTTAATGATGACAATGCAGAGAAGCTGGTGCTCCGTATTCGC       c.3780
 D  D  L  N  C  I  F  N  D  D  N  A  E  K  L  V  L  R  I  R         p.1260

          .         .         .    | 23    .         .         .    g.31863
 ATCATGAACAGCGATGAGAACAAGATGCAAGAG | GAGGAAGAGGTGGTGGACAAGATGGAT    c.3840
 I  M  N  S  D  E  N  K  M  Q  E   | E  E  E  V  V  D  K  M  D      p.1280

          .         .         .         .         .         .       g.31923
 GATGATGTCTTCCTGCGCTGCATCGAGTCCAACATGCTGACAGATATGACCCTGCAGGGC       c.3900
 D  D  V  F  L  R  C  I  E  S  N  M  L  T  D  M  T  L  Q  G         p.1300

          .         | 24         .         .         .         .    g.32069
 ATCGAGCAGATCAGCAAG | GTGTACATGCACTTGCCACAGACAGACAACAAGAAGAAGATC    c.3960
 I  E  Q  I  S  K   | V  Y  M  H  L  P  Q  T  D  N  K  K  K  I      p.1320

          .         .         .         .         .         .       g.32129
 ATCATCACGGAGGATGGGGAATTCAAGGCCCTGCAGGAGTGGATCCTGGAGACGGACGGC       c.4020
 I  I  T  E  D  G  E  F  K  A  L  Q  E  W  I  L  E  T  D  G         p.1340

          .         .         .         .         .         .       g.32189
 GTGAGCTTGATGCGGGTGCTGAGTGAGAAGGACGTGGACCCCGTACGCACCACGTCCAAT       c.4080
 V  S  L  M  R  V  L  S  E  K  D  V  D  P  V  R  T  T  S  N         p.1360

          .         .  | 25      .         .         .         .    g.32471
 GACATTGTGGAGATCTTCACG | GTGCTGGGCATTGAAGCCGTGCGGAAGGCCCTGGAGCGG    c.4140
 D  I  V  E  I  F  T   | V  L  G  I  E  A  V  R  K  A  L  E  R      p.1380

          .         .         .         .         .         .       g.32531
 GAGCTGTACCACGTCATCTCCTTTGATGGCTCCTATGTCAATTACCGACACTTGGCTCTC       c.4200
 E  L  Y  H  V  I  S  F  D  G  S  Y  V  N  Y  R  H  L  A  L         p.1400

          .         .         .         .         .         .       g.32591
 TTGTGTGATACCATGACCTGTCGTGGCCACTTGATGGCCATCACCCGACACGGAGTCAAC       c.4260
 L  C  D  T  M  T  C  R  G  H  L  M  A  I  T  R  H  G  V  N         p.1420

          .         .         .         .      | 26  .         .    g.32794
 CGCCAGGACACAGGACCACTCATGAAGTGTTCCTTTGAGGAAACG | GTGGACGTGCTTATG    c.4320
 R  Q  D  T  G  P  L  M  K  C  S  F  E  E  T   | V  D  V  L  M      p.1440

          .         .         .         .         .         .       g.32854
 GAAGCAGCCGCACACGGTGAGAGTGACCCCATGAAGGGGGTCTCTGAGAATATCATGCTG       c.4380
 E  A  A  A  H  G  E  S  D  P  M  K  G  V  S  E  N  I  M  L         p.1460

          .         .         .         .         .         .       g.32914
 GGCCAGCTGGCTCCGGCCGGCACTGGCTGCTTTGACCTCCTGCTTGATGCAGAGAAGTGC       c.4440
 G  Q  L  A  P  A  G  T  G  C  F  D  L  L  L  D  A  E  K  C         p.1480

          .         .         .         .         .   | 27     .    g.33106
 AAGTATGGCATGGAGATCCCCACCAATATCCCCGGCCTGGGGGCTGCTGGAC | CCACCGGC    c.4500
 K  Y  G  M  E  I  P  T  N  I  P  G  L  G  A  A  G  P |   T  G      p.1500

          .         .         .         .         .         .       g.33166
 ATGTTCTTTGGTTCAGCACCCAGTCCCATGGGTGGAATCTCTCCTGCCATGACACCTTGG       c.4560
 M  F  F  G  S  A  P  S  P  M  G  G  I  S  P  A  M  T  P  W         p.1520

          .         .         .         .       | 28 .         .    g.33409
 AACCAGGGTGCAACCCCTGCCTATGGCGCCTGGTCCCCCAGTGTTG | GGAGTGGAATGACC    c.4620
 N  Q  G  A  T  P  A  Y  G  A  W  S  P  S  V  G |   S  G  M  T      p.1540

          .         .         .         .         .         .       g.33469
 CCAGGGGCAGCCGGCTTCTCTCCCAGTGCTGCGTCAGATGCCAGCGGCTTCAGCCCAGGT       c.4680
 P  G  A  A  G  F  S  P  S  A  A  S  D  A  S  G  F  S  P  G         p.1560

          .         .         .         .         .         .       g.33529
 TACTCCCCTGCCTGGTCTCCCACACCGGGCTCCCCGGGGTCCCCAGGTCCCTCAAGCCCC       c.4740
 Y  S  P  A  W  S  P  T  P  G  S  P  G  S  P  G  P  S  S  P         p.1580

          .       | 29 .         .         .         .         .    g.33686
 TACATCCCTTCACCAG | GTGGTGCCATGTCTCCCAGCTACTCGCCAACGTCACCTGCCTAC    c.4800
 Y  I  P  S  P  G |   G  A  M  S  P  S  Y  S  P  T  S  P  A  Y      p.1600

          .         .         .         .         .         .       g.33746
 GAGCCCCGCTCTCCTGGGGGCTACACACCCCAGAGTCCCTCTTATTCCCCCACTTCACCC       c.4860
 E  P  R  S  P  G  G  Y  T  P  Q  S  P  S  Y  S  P  T  S  P         p.1620

          .         .         .         .         .         .       g.33806
 TCCTACTCCCCTACCTCTCCATCCTATTCTCCAACCAGTCCCAACTATAGTCCCACATCA       c.4920
 S  Y  S  P  T  S  P  S  Y  S  P  T  S  P  N  Y  S  P  T  S         p.1640

          .         .         .         .         .         .       g.33866
 CCCAGCTATTCGCCAACGTCACCCAGCTACTCACCGACCTCTCCCAGCTACTCACCCACC       c.4980
 P  S  Y  S  P  T  S  P  S  Y  S  P  T  S  P  S  Y  S  P  T         p.1660

          .         .         .         .         .         .       g.33926
 TCTCCCAGCTACTCGCCCACCTCTCCCAGCTACTCGCCCACCTCTCCCAGCTACTCACCC       c.5040
 S  P  S  Y  S  P  T  S  P  S  Y  S  P  T  S  P  S  Y  S  P         p.1680

          .         .         .         .         .         .       g.33986
 ACTTCCCCTAGCTACTCGCCCACTTCCCCTAGCTACTCGCCAACGTCTCCCAGCTACTCG       c.5100
 T  S  P  S  Y  S  P  T  S  P  S  Y  S  P  T  S  P  S  Y  S         p.1700

          .         .         .         .         .         .       g.34046
 CCGACATCTCCCAGCTACTCGCCAACTTCACCCAGCTATTCTCCCACTTCTCCCAGCTAC       c.5160
 P  T  S  P  S  Y  S  P  T  S  P  S  Y  S  P  T  S  P  S  Y         p.1720

          .         .         .         .         .         .       g.34106
 TCACCTACCTCTCCAAGCTATTCACCCACCTCCCCCAGCTACTCACCCACTTCCCCAAGT       c.5220
 S  P  T  S  P  S  Y  S  P  T  S  P  S  Y  S  P  T  S  P  S         p.1740

          .         .         .         .         .         .       g.34166
 TACTCACCCACCAGCCCGAACTATTCTCCAACCAGTCCCAATTACACCCCAACATCACCC       c.5280
 Y  S  P  T  S  P  N  Y  S  P  T  S  P  N  Y  T  P  T  S  P         p.1760

          .         .         .         .         .         .       g.34226
 AGCTACAGCCCGACATCACCCAGCTATTCACCTACTAGTCCCAACTACACACCTACCAGC       c.5340
 S  Y  S  P  T  S  P  S  Y  S  P  T  S  P  N  Y  T  P  T  S         p.1780

          .         .         .         .         .         .       g.34286
 CCTAACTACAGCCCAACCTCTCCAAGCTACTCTCCAACATCACCCAGCTATTCCCCGACC       c.5400
 P  N  Y  S  P  T  S  P  S  Y  S  P  T  S  P  S  Y  S  P  T         p.1800

          .         .         .         .         .         .       g.34346
 TCACCAAGTTACTCCCCTTCCAGCCCACGATACACACCACAGTCTCCAACCTATACCCCA       c.5460
 S  P  S  Y  S  P  S  S  P  R  Y  T  P  Q  S  P  T  Y  T  P         p.1820

          .         .         .         .         .         .       g.34406
 AGCTCACCCAGCTACAGCCCCAGCTCGCCCAGCTACAGCCCAACCTCACCCAAGTACACC       c.5520
 S  S  P  S  Y  S  P  S  S  P  S  Y  S  P  T  S  P  K  Y  T         p.1840

          .         .         .         .         .         .       g.34466
 CCAACCAGTCCTTCTTACAGTCCCAGCTCCCCAGAGTATACCCCAACCTCTCCCAAGTAC       c.5580
 P  T  S  P  S  Y  S  P  S  S  P  E  Y  T  P  T  S  P  K  Y         p.1860

          .         .         .         .         .         .       g.34526
 TCACCTACCAGTCCCAAATATTCACCCACCTCTCCCAAGTACTCGCCTACCAGTCCCACC       c.5640
 S  P  T  S  P  K  Y  S  P  T  S  P  K  Y  S  P  T  S  P  T         p.1880

          .         .         .         .         .         .       g.34586
 TATTCACCCACCACCCCAAAATACTCCCCAACATCTCCTACTTATTCCCCAACCTCTCCA       c.5700
 Y  S  P  T  T  P  K  Y  S  P  T  S  P  T  Y  S  P  T  S  P         p.1900

          .         .         .         .         .         .       g.34646
 GTCTACACCCCAACCTCTCCCAAGTACTCACCTACTAGCCCCACTTACTCGCCCACTTCC       c.5760
 V  Y  T  P  T  S  P  K  Y  S  P  T  S  P  T  Y  S  P  T  S         p.1920

          .         .         .         .         .         .       g.34706
 CCCAAGTACTCGCCCACCAGCCCCACCTACTCGCCCACCTCCCCCAAAGGCTCAACCTAC       c.5820
 P  K  Y  S  P  T  S  P  T  Y  S  P  T  S  P  K  G  S  T  Y         p.1940

          .         .         .         .         .         .       g.34766
 TCTCCCACTTCCCCTGGTTACTCGCCCACCAGCCCCACCTACAGTCTCACAAGCCCGGCT       c.5880
 S  P  T  S  P  G  Y  S  P  T  S  P  T  Y  S  L  T  S  P  A         p.1960

          .         .         .                                     g.34799
 ATCAGCCCGGATGACAGTGACGAGGAGAACTGA                                  c.5913
 I  S  P  D  D  S  D  E  E  N  X                                    p.1970

          .         .         .         .         .         .       g.34859
 gggcacgtggggtgcggcagcgggctagggcccagggcagcttgcccgtgctgctgtgca       c.*60

          .         .         .         .         .         .       g.34919
 gttcttgcctccctcacggggcgtcacccccagcccagctccgttgtacataaatgcctt       c.*120

          .         .         .         .         .         .       g.34979
 gtggcagagctcccggtgaacttctggatcccgtttctgatgcagactcttgtcttgttc       c.*180

          .         .         .         .         .         .       g.35039
 tccacttgtgctgttagaactcactggcccagtggtgttctcactcctaccccacccacc       c.*240

          .         .         .         .         .         .       g.35099
 ccctgcctgtccccaaattgaagatccttccttgcctgtggcttgatgcggggcgggtaa       c.*300

          .         .         .         .         .         .       g.35159
 agggtattttaacttaggggtagttcctgctgtgagtggttacagctgatcctcgggaag       c.*360

          .         .         .         .         .         .       g.35219
 aacaaagctaaagctgccttttgtctgttattttatttttttgaagtttaaataaagttt       c.*420

          .                                                         g.35238
 actaattttgaccaaaagt                                                c.*439

 (downstream sequence)
Legend:
Nucleotide numbering follows the HGVS rules for a 'Coding DNA Reference Sequence' and is shown at the right of the sequence; the A of the ATG codon for the translation-initiating methionine is position 1. Every 10th nucleotide is marked by a "." above the sequence. The protein sequence of polymerase (RNA) II (DNA directed) polypeptide A, 220kDa is shown below the coding DNA sequence, with numbering at the right starting at 1 for the translation-initiating methionine. Every 10th amino acid is shown in bold.

The position of each intron is indicated by a vertical line separating the two flanking exons. The start of the first exon (transcription initiation site) is indicated by a '\', and the end of the last exon (poly-A addition site) by a '/'. The exon number is shown above the first nucleotide(s) of the exon.

To aid the description of frameshift variants, all stop codons in the +1 frame are shown in bold, and all stop codons in the +2 frame are underlined.
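The numbering rules in the legend can be checked against the labels printed in the listing. The following Python sketch is illustrative only and is not part of LOVD. The function names are hypothetical, and the constants are read off this file: c.-1 is printed at g.5386, c.60 at g.5446, and c.5913 at g.34799. Because the listing omits the introns, the two genomic conversions are deliberately limited to the stretches that the listing shows as uninterrupted: the sequence before the first intron marker (c.-386 to c.93) and the sequence after the stop codon (c.*1 to c.*439).

```python
# Constants read off the listing above (assumed to apply to NG_027747.1 /
# NM_000937.4 only; they are not general properties of HGVS numbering).
G_OF_C1 = 5387          # c.-1 is printed at g.5386, so c.1 is g.5387
G_OF_CDS_END = 34799    # g. position printed for c.5913 (last base of TGA)
CDS_LENGTH = 5913       # coding sequence length, stop codon included
FIRST_SHOWN_C = -386    # first upstream base shown (26 bases ending at c.-361)
LAST_C_BEFORE_INTRON1 = 93   # the first "|" marker follows c.93
LAST_SHOWN_UTR3 = 439   # last downstream base shown is c.*439


def c_to_protein_position(c: int) -> int:
    """Return the codon (p.) number containing coding position c.

    The stop codon is counted as well, so c.5913 maps to codon 1971.
    """
    if not 1 <= c <= CDS_LENGTH:
        raise ValueError("c must lie within the coding sequence")
    return (c - 1) // 3 + 1


def c_to_g_before_first_intron(c: int) -> int:
    """Convert c. to g. for the contiguous stretch c.-386 to c.93.

    HGVS coding numbering has no position 0: c.-1 is followed by c.1.
    """
    if c == 0 or not FIRST_SHOWN_C <= c <= LAST_C_BEFORE_INTRON1:
        raise ValueError("c is outside the stretch before the first intron")
    return G_OF_C1 + c - 1 if c > 0 else G_OF_C1 + c


def utr3_to_g(n: int) -> int:
    """Convert a downstream position c.*n to g. (c.*1 to c.*439)."""
    if not 1 <= n <= LAST_SHOWN_UTR3:
        raise ValueError("n is outside the downstream sequence shown")
    return G_OF_CDS_END + n


if __name__ == "__main__":
    print("c.60  -> p.%d" % c_to_protein_position(60))
    print("c.-361 -> g.%d" % c_to_g_before_first_intron(-361))
    print("c.*439 -> g.%d" % utr3_to_g(439))
```

Positions inside later exons would need the exon boundaries and intron lengths, which this sketch does not model.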

Powered by LOVD v.3.0 Build 21c
©2004-2019 Leiden University Medical Center