E1A binding protein p300 (EP300) - coding DNA reference sequence

(used for variant description)

(last modified February 4, 2016)


This file was created to facilitate the description of sequence variants on transcript NM_001429.3 in the EP300 gene based on a coding DNA reference sequence following the HGVS recommendations.

The sequence was taken from NG_009817.1, covering EP300 transcript NM_001429.3.


 (upstream sequence)
                               .         .         .                g.5035
                          gccgaggaggaagaggttgatggcggcggcggagc       c.-361

 .         .         .         .         .         .                g.5095
 tccgagagacctcggctgggcaggggccggccgtggcgggccggggactgcgcctctaga       c.-301

 .         .         .         .         .         .                g.5155
 gccgcgagttctcgggaattcgccgcagcggacgcgctcggcgaatttgtgctcttgtgc       c.-241

 .         .         .         .         .         .                g.5215
 cctcctccgggcttgggcccaggcccggcccctcgcacttgcccttaccttttctatcga       c.-181

 .         .         .         .         .         .                g.5275
 gtccgcatccctctccagccactgcgacccggcgaagagaaaaaggaacttcccccaccc       c.-121

 .         .         .         .         .         .                g.5335
 cctcgggtgccgtcggagccccccagcccacccctgggtgcggcgcggggaccccgggcc       c.-61

 .         .         .         .         .         .                g.5395
 gaagaagagatttcctgaggattctggttttcctcgcttgtatctccgaaagaattaaaa       c.-1

          .         .         .         .         .         .       g.5455
 ATGGCCGAGAATGTGGTGGAACCGGGGCCGCCTTCAGCCAAGCGGCCTAAACTCTCATCT       c.60
 M  A  E  N  V  V  E  P  G  P  P  S  A  K  R  P  K  L  S  S         p.20

          .         .         .     | 02   .         .         .    g.29603
 CCGGCCCTCTCGGCGTCCGCCAGCGATGGCACAG | ATTTTGGCTCTCTATTTGACTTGGAG    c.120
 P  A  L  S  A  S  A  S  D  G  T  D |   F  G  S  L  F  D  L  E      p.40

          .         .         .         .         .         .       g.29663
 CACGACTTACCAGATGAATTAATCAACTCTACAGAATTGGGACTAACCAATGGTGGTGAT       c.180
 H  D  L  P  D  E  L  I  N  S  T  E  L  G  L  T  N  G  G  D         p.60

          .         .         .         .         .         .       g.29723
 ATTAATCAGCTTCAGACAAGTCTTGGCATGGTACAAGATGCAGCTTCTAAACATAAACAG       c.240
 I  N  Q  L  Q  T  S  L  G  M  V  Q  D  A  A  S  K  H  K  Q         p.80

          .         .         .         .         .         .       g.29783
 CTGTCAGAATTGCTGCGATCTGGTAGTTCCCCTAACCTCAATATGGGAGTTGGTGGCCCA       c.300
 L  S  E  L  L  R  S  G  S  S  P  N  L  N  M  G  V  G  G  P         p.100

          .         .         .         .         .         .       g.29843
 GGTCAAGTCATGGCCAGCCAGGCCCAACAGAGCAGTCCTGGATTAGGTTTGATAAATAGC       c.360
 G  Q  V  M  A  S  Q  A  Q  Q  S  S  P  G  L  G  L  I  N  S         p.120

          .         .         .         .         .         .       g.29903
 ATGGTCAAAAGCCCAATGACACAGGCAGGCTTGACTTCTCCCAACATGGGGATGGGCACT       c.420
 M  V  K  S  P  M  T  Q  A  G  L  T  S  P  N  M  G  M  G  T         p.140

          .         .         .         .         .         .       g.29963
 AGTGGACCAAATCAGGGTCCTACGCAGTCAACAGGTATGATGAACAGTCCAGTAAATCAG       c.480
 S  G  P  N  Q  G  P  T  Q  S  T  G  M  M  N  S  P  V  N  Q         p.160

          .         .         .         .         .         .       g.30023
 CCTGCCATGGGAATGAACACAGGGATGAATGCGGGCATGAATCCTGGAATGTTGGCTGCA       c.540
 P  A  M  G  M  N  T  G  M  N  A  G  M  N  P  G  M  L  A  A         p.180

          .         .         .         .         .         .       g.30083
 GGCAATGGACAAGGGATAATGCCTAATCAAGTCATGAACGGTTCAATTGGAGCAGGCCGA       c.600
 G  N  G  Q  G  I  M  P  N  Q  V  M  N  G  S  I  G  A  G  R         p.200

          .         .         .         .         .         .       g.30143
 GGGCGACAGAATATGCAGTACCCAAACCCAGGCATGGGAAGTGCTGGCAACTTACTGACT       c.660
 G  R  Q  N  M  Q  Y  P  N  P  G  M  G  S  A  G  N  L  L  T         p.220

          .         .         .         .         .         .       g.30203
 GAGCCTCTTCAGCAGGGCTCTCCCCAGATGGGAGGACAAACAGGATTGAGAGGCCCCCAG       c.720
 E  P  L  Q  Q  G  S  P  Q  M  G  G  Q  T  G  L  R  G  P  Q         p.240

           | 03        .         .         .         .         .    g.38305
 CCTCTTAAG | ATGGGAATGATGAACAACCCCAATCCTTATGGTTCACCATATACTCAGAAT    c.780
 P  L  K   | M  G  M  M  N  N  P  N  P  Y  G  S  P  Y  T  Q  N      p.260

          .         .         .         .         .         .       g.38365
 CCTGGACAGCAGATTGGAGCCAGTGGCCTTGGTCTCCAGATTCAGACAAAAACTGTACTA       c.840
 P  G  Q  Q  I  G  A  S  G  L  G  L  Q  I  Q  T  K  T  V  L         p.280

          .         .         .         .         .         .       g.38425
 TCAAATAACTTATCTCCATTTGCTATGGACAAAAAGGCAGTTCCTGGTGGAGGAATGCCC       c.900
 S  N  N  L  S  P  F  A  M  D  K  K  A  V  P  G  G  G  M  P         p.300

        | 04 .         .         .         .         .         .    g.39931
 AACATG | GGTCAACAGCCAGCCCCGCAGGTCCAGCAGCCAGGCCTGGTGACTCCAGTTGCC    c.960
 N  M   | G  Q  Q  P  A  P  Q  V  Q  Q  P  G  L  V  T  P  V  A      p.320

          .         .         .         .         .         .       g.39991
 CAAGGGATGGGTTCTGGAGCACATACAGCTGATCCAGAGAAGCGCAAGCTCATCCAGCAG       c.1020
 Q  G  M  G  S  G  A  H  T  A  D  P  E  K  R  K  L  I  Q  Q         p.340

          .         .         .         .         .         .       g.40051
 CAGCTTGTTCTCCTTTTGCATGCTCACAAGTGCCAGCGCCGGGAACAGGCCAATGGGGAA       c.1080
 Q  L  V  L  L  L  H  A  H  K  C  Q  R  R  E  Q  A  N  G  E         p.360

          .         .         .         .         .         .       g.40111
 GTGAGGCAGTGCAACCTTCCCCACTGTCGCACAATGAAGAATGTCCTAAACCACATGACA       c.1140
 V  R  Q  C  N  L  P  H  C  R  T  M  K  N  V  L  N  H  M  T         p.380

          .         .         | 05         .         .         .    g.42312
 CACTGCCAGTCAGGCAAGTCTTGCCAAG | TGGCACACTGTGCATCTTCTCGACAAATCATT    c.1200
 H  C  Q  S  G  K  S  C  Q  V |   A  H  C  A  S  S  R  Q  I  I      p.400

          .         .         .         .         .         .       g.42372
 TCACACTGGAAGAATTGTACAAGACATGATTGTCCTGTGTGTCTCCCCCTCAAAAATGCT       c.1260
 S  H  W  K  N  C  T  R  H  D  C  P  V  C  L  P  L  K  N  A         p.420

          .         .   | 06     .         .         .         .    g.43816
 GGTGATAAGAGAAATCAACAGC | CAATTTTGACTGGAGCACCCGTTGGACTTGGAAATCCT    c.1320
 G  D  K  R  N  Q  Q  P |   I  L  T  G  A  P  V  G  L  G  N  P      p.440

          .         .         .         .         .         .       g.43876
 AGCTCTCTAGGGGTGGGTCAACAGTCTGCCCCCAACCTAAGCACTGTTAGTCAGATTGAT       c.1380
 S  S  L  G  V  G  Q  Q  S  A  P  N  L  S  T  V  S  Q  I  D         p.460

          .         .         .         .         .         .       g.43936
 CCCAGCTCCATAGAAAGAGCCTATGCAGCTCTTGGACTACCCTATCAAGTAAATCAGATG       c.1440
 P  S  S  I  E  R  A  Y  A  A  L  G  L  P  Y  Q  V  N  Q  M         p.480

          .         .         .         .         .         .       g.43996
 CCGACACAACCCCAGGTGCAAGCAAAGAACCAGCAGAATCAGCAGCCTGGGCAGTCTCCC       c.1500
 P  T  Q  P  Q  V  Q  A  K  N  Q  Q  N  Q  Q  P  G  Q  S  P         p.500

          .         .         | 07         .         .         .    g.48235
 CAAGGCATGCGGCCCATGAGCAACATGA | GTGCTAGTCCTATGGGAGTAAATGGAGGTGTA    c.1560
 Q  G  M  R  P  M  S  N  M  S |   A  S  P  M  G  V  N  G  G  V      p.520

          .         .         .         .         .         .       g.48295
 GGAGTTCAAACGCCGAGTCTTCTTTCTGACTCAATGTTGCATTCAGCCATAAATTCTCAA       c.1620
 G  V  Q  T  P  S  L  L  S  D  S  M  L  H  S  A  I  N  S  Q         p.540

    | 08     .         .         .         .         .         .    g.50101
 AA | CCCAATGATGAGTGAAAATGCCAGTGTGCCCTCCCTGGGTCCTATGCCAACAGCAGCT    c.1680
 N  |  P  M  M  S  E  N  A  S  V  P  S  L  G  P  M  P  T  A  A      p.560

          .         .         .         .         .         .       g.50161
 CAACCATCCACTACTGGAATTCGGAAACAGTGGCACGAAGATATTACTCAGGATCTTCGA       c.1740
 Q  P  S  T  T  G  I  R  K  Q  W  H  E  D  I  T  Q  D  L  R         p.580

          .         . | 09       .         .         .         .    g.52570
 AATCATCTTGTTCACAAACT | CGTCCAAGCCATATTTCCTACGCCGGATCCTGCTGCTTTA    c.1800
 N  H  L  V  H  K  L  |  V  Q  A  I  F  P  T  P  D  P  A  A  L      p.600

          .         .         .         .         .         .       g.52630
 AAAGACAGACGGATGGAAAACCTAGTTGCATATGCTCGGAAAGTTGAAGGGGACATGTAT       c.1860
 K  D  R  R  M  E  N  L  V  A  Y  A  R  K  V  E  G  D  M  Y         p.620

          .         | 10         .         .         .         .    g.53480
 GAATCTGCAAACAATCGA | GCGGAATACTACCACCTTCTAGCTGAGAAAATCTATAAGATC    c.1920
 E  S  A  N  N  R   | A  E  Y  Y  H  L  L  A  E  K  I  Y  K  I      p.640

          .         .         .         .         .         .       g.53540
 CAGAAAGAACTAGAAGAAAAACGAAGGACCAGACTACAGAAGCAGAACATGCTACCAAAT       c.1980
 Q  K  E  L  E  E  K  R  R  T  R  L  Q  K  Q  N  M  L  P  N         p.660

          .         .         .         .         .         .       g.53600
 GCTGCAGGCATGGTTCCAGTTTCCATGAATCCAGGGCCTAACATGGGACAGCCGCAACCA       c.2040
 A  A  G  M  V  P  V  S  M  N  P  G  P  N  M  G  Q  P  Q  P         p.680

          .    | 11    .         .         .         .         .    g.59176
 GGAATGACTTCTA | ATGGCCCTCTACCTGACCCAAGTATGATCCGTGGCAGTGTGCCAAAC    c.2100
 G  M  T  S  N |   G  P  L  P  D  P  S  M  I  R  G  S  V  P  N      p.700

          .         .         .  | 12      .         .         .    g.60256
 CAGATGATGCCTCGAATAACTCCACAATCTG | GTTTGAATCAATTTGGCCAGATGAGCATG    c.2160
 Q  M  M  P  R  I  T  P  Q  S  G |   L  N  Q  F  G  Q  M  S  M      p.720

          .         .         .         .         .         .       g.60316
 GCCCAGCCCCCTATTGTACCCCGGCAAACCCCTCCTCTTCAGCACCATGGACAGTTGGCT       c.2220
 A  Q  P  P  I  V  P  R  Q  T  P  P  L  Q  H  H  G  Q  L  A         p.740

          .         .  | 13      .         .         .         .    g.61467
 CAACCTGGAGCTCTCAACCCG | CCTATGGGCTATGGGCCTCGTATGCAACAGCCTTCCAAC    c.2280
 Q  P  G  A  L  N  P   | P  M  G  Y  G  P  R  M  Q  Q  P  S  N      p.760

          .         .         .         .         .         .       g.61527
 CAGGGCCAGTTCCTTCCTCAGACTCAGTTCCCATCACAGGGAATGAATGTAACAAATATC       c.2340
 Q  G  Q  F  L  P  Q  T  Q  F  P  S  Q  G  M  N  V  T  N  I         p.780

          .         .         .          | 14        .         .    g.62172
 CCTTTGGCTCCGTCCAGCGGTCAAGCTCCAGTGTCTCAA | GCACAAATGTCTAGTTCTTCC    c.2400
 P  L  A  P  S  S  G  Q  A  P  V  S  Q   | A  Q  M  S  S  S  S      p.800

          .         .         .         .         .         .       g.62232
 TGCCCGGTGAACTCTCCTATAATGCCTCCAGGGTCTCAGGGGAGCCACATTCACTGTCCC       c.2460
 C  P  V  N  S  P  I  M  P  P  G  S  Q  G  S  H  I  H  C  P         p.820

          .         .         .         .         .         .       g.62292
 CAGCTTCCTCAACCAGCTCTTCATCAGAATTCACCCTCGCCTGTACCTAGTCGTACCCCC       c.2520
 Q  L  P  Q  P  A  L  H  Q  N  S  P  S  P  V  P  S  R  T  P         p.840

          .         .         .         .         .         .       g.62352
 ACCCCTCACCATACTCCCCCAAGCATAGGGGCTCAGCAGCCACCAGCAACAACAATTCCA       c.2580
 T  P  H  H  T  P  P  S  I  G  A  Q  Q  P  P  A  T  T  I  P         p.860

          .         .         .         .         .         .       g.62412
 GCCCCTGTTCCTACACCTCCTGCCATGCCACCTGGGCCACAGTCCCAGGCTCTACATCCC       c.2640
 A  P  V  P  T  P  P  A  M  P  P  G  P  Q  S  Q  A  L  H  P         p.880

          .         .         .         .         .         .       g.62472
 CCTCCAAGGCAGACACCTACACCACCAACAACACAACTTCCCCAACAAGTGCAGCCTTCA       c.2700
 P  P  R  Q  T  P  T  P  P  T  T  Q  L  P  Q  Q  V  Q  P  S         p.900

          .         .         .         .         .         .       g.62532
 CTTCCTGCTGCACCTTCTGCTGACCAGCCCCAGCAGCAGCCTCGCTCACAGCAGAGCACA       c.2760
 L  P  A  A  P  S  A  D  Q  P  Q  Q  Q  P  R  S  Q  Q  S  T         p.920

          .         .         .         .         .        | 15.    g.64226
 GCAGCGTCTGTTCCTACCCCAACAGCACCGCTGCTTCCTCCGCAGCCTGCAACTCCA | CTT    c.2820
 A  A  S  V  P  T  P  T  A  P  L  L  P  P  Q  P  A  T  P   | L      p.940

          .         .         .         .         .         .       g.64286
 TCCCAGCCAGCTGTAAGCATTGAAGGACAGGTATCAAATCCTCCATCTACTAGTAGCACA       c.2880
 S  Q  P  A  V  S  I  E  G  Q  V  S  N  P  P  S  T  S  S  T         p.960

          .         .         .         .         .         .       g.64346
 GAAGTGAATTCTCAGGCCATTGCTGAGAAGCAGCCTTCCCAGGAAGTGAAGATGGAGGCC       c.2940
 E  V  N  S  Q  A  I  A  E  K  Q  P  S  Q  E  V  K  M  E  A         p.980

          .         .         .         .         .        | 16.    g.64599
 AAAATGGAAGTGGATCAACCAGAACCAGCAGATACTCAGCCGGAGGATATTTCAGAG | TCT    c.3000
 K  M  E  V  D  Q  P  E  P  A  D  T  Q  P  E  D  I  S  E   | S      p.1000

          .         .         .         .         .         .       g.64659
 AAAGTGGAAGACTGTAAAATGGAATCTACCGAAACAGAAGAGAGAAGCACTGAGTTAAAA       c.3060
 K  V  E  D  C  K  M  E  S  T  E  T  E  E  R  S  T  E  L  K         p.1020

          .         .         .         .         .         .       g.64719
 ACTGAAATAAAAGAGGAGGAAGACCAGCCAAGTACTTCAGCTACCCAGTCATCTCCGGCT       c.3120
 T  E  I  K  E  E  E  D  Q  P  S  T  S  A  T  Q  S  S  P  A         p.1040

          .         .   | 17     .         .         .         .    g.67423
 CCAGGACAGTCAAAGAAAAAGA | TTTTCAAACCAGAAGAACTACGACAGGCACTGATGCCA    c.3180
 P  G  Q  S  K  K  K  I |   F  K  P  E  E  L  R  Q  A  L  M  P      p.1060

          .         .         .         .         .         .       g.67483
 ACTTTGGAGGCACTTTACCGTCAGGATCCAGAATCCCTTCCCTTTCGTCAACCTGTGGAC       c.3240
 T  L  E  A  L  Y  R  Q  D  P  E  S  L  P  F  R  Q  P  V  D         p.1080

          .         .  | 18      .         .         .         .    g.69598
 CCTCAGCTTTTAGGAATCCCT | GATTACTTTGATATTGTGAAGAGCCCCATGGATCTTTCT    c.3300
 P  Q  L  L  G  I  P   | D  Y  F  D  I  V  K  S  P  M  D  L  S      p.1100

          .         .         .         .         .         .       g.69658
 ACCATTAAGAGGAAGTTAGACACTGGACAGTATCAGGAGCCCTGGCAGTATGTCGATGAT       c.3360
 T  I  K  R  K  L  D  T  G  Q  Y  Q  E  P  W  Q  Y  V  D  D         p.1120

          .         .         .         .         .         .       g.69718
 ATTTGGCTTATGTTCAATAATGCCTGGTTATATAACCGGAAAACATCACGGGTATACAAA       c.3420
 I  W  L  M  F  N  N  A  W  L  Y  N  R  K  T  S  R  V  Y  K         p.1140

          .         .         .         .         .         .       g.69778
 TACTGCTCCAAGCTCTCTGAGGTCTTTGAACAAGAAATTGACCCAGTGATGCAAAGCCTT       c.3480
 Y  C  S  K  L  S  E  V  F  E  Q  E  I  D  P  V  M  Q  S  L         p.1160

          .         .  | 19      .         .         .         .    g.70841
 GGATACTGTTGTGGCAGAAAG | TTGGAGTTCTCTCCACAGACACTGTGTTGCTACGGCAAA    c.3540
 G  Y  C  C  G  R  K   | L  E  F  S  P  Q  T  L  C  C  Y  G  K      p.1180

          .         .         .         .         . | 20       .    g.73042
 CAGTTGTGCACAATACCTCGTGATGCCACTTATTACAGTTACCAGAACAG | GTATCATTTC    c.3600
 Q  L  C  T  I  P  R  D  A  T  Y  Y  S  Y  Q  N  R  |  Y  H  F      p.1200

          .         .         .         .         .         .       g.73102
 TGTGAGAAGTGTTTCAATGAGATCCAAGGGGAGAGCGTTTCTTTGGGGGATGACCCTTCC       c.3660
 C  E  K  C  F  N  E  I  Q  G  E  S  V  S  L  G  D  D  P  S         p.1220

          .  | 21      .         .         .         .         .    g.75162
 CAGCCTCAAAC | TACAATAAATAAAGAACAATTTTCCAAGAGAAAAAATGACACACTGGAT    c.3720
 Q  P  Q  T  |  T  I  N  K  E  Q  F  S  K  R  K  N  D  T  L  D      p.1240

          | 22         .         .         .         .         .    g.76495
 CCTGAACT | GTTTGTTGAATGTACAGAGTGCGGAAGAAAGATGCATCAGATCTGTGTCCTT    c.3780
 P  E  L  |  F  V  E  C  T  E  C  G  R  K  M  H  Q  I  C  V  L      p.1260

          .         .       | 23 .         .         .         .    g.79023
 CACCATGAGATCATCTGGCCTGCTGG | ATTCGTCTGTGATGGCTGTTTAAAGAAAAGTGCA    c.3840
 H  H  E  I  I  W  P  A  G  |  F  V  C  D  G  C  L  K  K  S  A      p.1280

          .         .         .     | 24   .         .         .    g.80865
 CGAACTAGGAAAGAAAATAAGTTTTCTGCTAAAA | GGTTGCCATCTACCAGACTTGGCACC    c.3900
 R  T  R  K  E  N  K  F  S  A  K  R |   L  P  S  T  R  L  G  T      p.1300

          .         .         .         .         .         .       g.80925
 TTTCTAGAGAATCGTGTGAATGACTTTCTGAGGCGACAGAATCACCCTGAGTCAGGAGAG       c.3960
 F  L  E  N  R  V  N  D  F  L  R  R  Q  N  H  P  E  S  G  E         p.1320

          .         .         .         .         .         .       g.80985
 GTCACTGTTAGAGTAGTTCATGCTTCTGACAAAACCGTGGAAGTAAAACCAGGCATGAAA       c.4020
 V  T  V  R  V  V  H  A  S  D  K  T  V  E  V  K  P  G  M  K         p.1340

       | 25  .         .         .         .         .         .    g.81166
 GCAAG | GTTTGTGGACAGTGGAGAGATGGCAGAATCCTTTCCATACCGAACCAAAGCCCTC    c.4080
 A  R  |  F  V  D  S  G  E  M  A  E  S  F  P  Y  R  T  K  A  L      p.1360

          .         .         .         .         .         .       g.81226
 TTTGCCTTTGAAGAAATTGATGGTGTTGACCTGTGCTTCTTTGGCATGCATGTTCAAGAG       c.4140
 F  A  F  E  E  I  D  G  V  D  L  C  F  F  G  M  H  V  Q  E         p.1380

          .         .         .   | 26     .         .         .    g.81921
 TATGGCTCTGACTGCCCTCCACCCAACCAGAG | GAGAGTATACATATCTTACCTCGATAGT    c.4200
 Y  G  S  D  C  P  P  P  N  Q  R  |  R  V  Y  I  S  Y  L  D  S      p.1400

          .         .         .         .         .         .       g.81981
 GTTCATTTCTTCCGTCCTAAATGCTTGAGGACTGCAGTCTATCATGAAATCCTAATTGGA       c.4260
 V  H  F  F  R  P  K  C  L  R  T  A  V  Y  H  E  I  L  I  G         p.1420

          .         .       | 27 .         .         .         .    g.82830
 TATTTAGAATATGTCAAGAAATTAGG | TTACACAACAGGGCATATTTGGGCATGTCCACCA    c.4320
 Y  L  E  Y  V  K  K  L  G  |  Y  T  T  G  H  I  W  A  C  P  P      p.1440

          .         .         .         .         .         .       g.82890
 AGTGAGGGAGATGATTATATCTTCCATTGCCATCCTCCTGACCAGAAGATACCCAAGCCC       c.4380
 S  E  G  D  D  Y  I  F  H  C  H  P  P  D  Q  K  I  P  K  P         p.1460

          .         .         .         .         .         .       g.82950
 AAGCGACTGCAGGAATGGTACAAAAAAATGCTTGACAAGGCTGTATCAGAGCGTATTGTC       c.4440
 K  R  L  Q  E  W  Y  K  K  M  L  D  K  A  V  S  E  R  I  V         p.1480

          .   | 28     .         .         .         .         .    g.84937
 CATGACTACAAG | GATATTTTTAAACAAGCTACTGAAGATAGATTAACAAGTGCAAAGGAA    c.4500
 H  D  Y  K   | D  I  F  K  Q  A  T  E  D  R  L  T  S  A  K  E      p.1500

          .         .         .         .         .         .       g.84997
 TTGCCTTATTTCGAGGGTGATTTCTGGCCCAATGTTCTGGAAGAAAGCATTAAGGAACTG       c.4560
 L  P  Y  F  E  G  D  F  W  P  N  V  L  E  E  S  I  K  E  L         p.1520

          .         .         .         .         .        | 29.    g.86016
 GAACAGGAGGAAGAAGAGAGAAAACGAGAGGAAAACACCAGCAATGAAAGCACAGAT | GTG    c.4620
 E  Q  E  E  E  E  R  K  R  E  E  N  T  S  N  E  S  T  D   | V      p.1540

          .         .         .         .         .         .       g.86076
 ACCAAGGGAGACAGCAAAAATGCTAAAAAGAAGAATAATAAGAAAACCAGCAAAAATAAG       c.4680
 T  K  G  D  S  K  N  A  K  K  K  N  N  K  K  T  S  K  N  K         p.1560

          .         .         .         .         .         .       g.86136
 AGCAGCCTGAGTAGGGGCAACAAGAAGAAACCCGGGATGCCCAATGTATCTAACGACCTC       c.4740
 S  S  L  S  R  G  N  K  K  K  P  G  M  P  N  V  S  N  D  L         p.1580

          .         .         .          | 30        .         .    g.88658
 TCACAGAAACTATATGCCACCATGGAGAAGCATAAAGAG | GTCTTCTTTGTGATCCGCCTC    c.4800
 S  Q  K  L  Y  A  T  M  E  K  H  K  E   | V  F  F  V  I  R  L      p.1600

          .         .         .         .         .         .       g.88718
 ATTGCTGGCCCTGCTGCCAACTCCCTGCCTCCCATTGTTGATCCTGATCCTCTCATCCCC       c.4860
 I  A  G  P  A  A  N  S  L  P  P  I  V  D  P  D  P  L  I  P         p.1620

          .         .         .         .         .         .       g.88778
 TGCGATCTGATGGATGGTCGGGATGCGTTTCTCACGCTGGCAAGGGACAAGCACCTGGAG       c.4920
 C  D  L  M  D  G  R  D  A  F  L  T  L  A  R  D  K  H  L  E         p.1640

          .         .         .         .         .         .       g.88838
 TTCTCTTCACTCCGAAGAGCCCAGTGGTCCACCATGTGCATGCTGGTGGAGCTGCACACG       c.4980
 F  S  S  L  R  R  A  Q  W  S  T  M  C  M  L  V  E  L  H  T         p.1660

          .         .         .         .         .         .       g.88898
 CAGAGCCAGGACCGCTTTGTCTACACCTGCAATGAATGCAAGCACCATGTGGAGACACGC       c.5040
 Q  S  Q  D  R  F  V  Y  T  C  N  E  C  K  H  H  V  E  T  R         p.1680

          .         .  | 31      .         .         .         .    g.89202
 TGGCACTGTACTGTCTGTGAG | GATTATGACTTGTGTATCACCTGCTATAACACTAAAAAC    c.5100
 W  H  C  T  V  C  E   | D  Y  D  L  C  I  T  C  Y  N  T  K  N      p.1700

          .         .         .         .         .         .       g.89262
 CATGACCACAAAATGGAGAAACTAGGCCTTGGCTTAGATGATGAGAGCAACAACCAGCAG       c.5160
 H  D  H  K  M  E  K  L  G  L  G  L  D  D  E  S  N  N  Q  Q         p.1720

          .         .         .         .         .         .       g.89322
 GCTGCAGCCACCCAGAGCCCAGGCGATTCTCGCCGCCTGAGTATCCAGCGCTGCATCCAG       c.5220
 A  A  A  T  Q  S  P  G  D  S  R  R  L  S  I  Q  R  C  I  Q         p.1740

          .         .         .         .         .         .       g.89382
 TCTCTGGTCCATGCTTGCCAGTGTCGGAATGCCAATTGCTCACTGCCATCCTGCCAGAAG       c.5280
 S  L  V  H  A  C  Q  C  R  N  A  N  C  S  L  P  S  C  Q  K         p.1760

          .         .         .         .         .         .       g.89442
 ATGAAGCGGGTTGTGCAGCATACCAAGGGTTGCAAACGGAAAACCAATGGCGGGTGCCCC       c.5340
 M  K  R  V  V  Q  H  T  K  G  C  K  R  K  T  N  G  G  C  P         p.1780

          .         .         .         .         .         .       g.89502
 ATCTGCAAGCAGCTCATTGCCCTCTGCTGCTACCATGCCAAGCACTGCCAGGAGAACAAA       c.5400
 I  C  K  Q  L  I  A  L  C  C  Y  H  A  K  H  C  Q  E  N  K         p.1800

          .         .         .         .         .         .       g.89562
 TGCCCGGTGCCGTTCTGCCTAAACATCAAGCAGAAGCTCCGGCAGCAACAGCTGCAGCAC       c.5460
 C  P  V  P  F  C  L  N  I  K  Q  K  L  R  Q  Q  Q  L  Q  H         p.1820

          .         .         .         .         .         .       g.89622
 CGACTACAGCAGGCCCAAATGCTTCGCAGGAGGATGGCCAGCATGCAGCGGACTGGTGTG       c.5520
 R  L  Q  Q  A  Q  M  L  R  R  R  M  A  S  M  Q  R  T  G  V         p.1840

          .         .         .         .         .         .       g.89682
 GTTGGGCAGCAACAGGGCCTCCCTTCCCCCACTCCTGCCACTCCAACGACACCAACTGGC       c.5580
 V  G  Q  Q  Q  G  L  P  S  P  T  P  A  T  P  T  T  P  T  G         p.1860

          .         .         .         .         .         .       g.89742
 CAACAGCCAACCACCCCGCAGACGCCCCAGCCCACTTCTCAGCCTCAGCCTACCCCTCCC       c.5640
 Q  Q  P  T  T  P  Q  T  P  Q  P  T  S  Q  P  Q  P  T  P  P         p.1880

          .         .         .         .         .         .       g.89802
 AATAGCATGCCACCCTACTTGCCCAGGACTCAAGCTGCTGGCCCTGTGTCCCAGGGTAAG       c.5700
 N  S  M  P  P  Y  L  P  R  T  Q  A  A  G  P  V  S  Q  G  K         p.1900

          .         .         .         .         .         .       g.89862
 GCAGCAGGCCAGGTGACCCCTCCAACCCCTCCTCAGACTGCTCAGCCACCCCTTCCAGGG       c.5760
 A  A  G  Q  V  T  P  P  T  P  P  Q  T  A  Q  P  P  L  P  G         p.1920

          .         .         .         .         .         .       g.89922
 CCCCCACCTGCAGCAGTGGAAATGGCAATGCAGATTCAGAGAGCAGCGGAGACGCAGCGC       c.5820
 P  P  P  A  A  V  E  M  A  M  Q  I  Q  R  A  A  E  T  Q  R         p.1940

          .         .         .         .         .         .       g.89982
 CAGATGGCCCACGTGCAAATTTTTCAAAGGCCAATCCAACACCAGATGCCCCCGATGACT       c.5880
 Q  M  A  H  V  Q  I  F  Q  R  P  I  Q  H  Q  M  P  P  M  T         p.1960

          .         .         .         .         .         .       g.90042
 CCCATGGCCCCCATGGGTATGAACCCACCTCCCATGACCAGAGGTCCCAGTGGGCATTTG       c.5940
 P  M  A  P  M  G  M  N  P  P  P  M  T  R  G  P  S  G  H  L         p.1980

          .         .         .         .         .         .       g.90102
 GAGCCAGGGATGGGACCGACAGGGATGCAGCAACAGCCACCCTGGAGCCAAGGAGGATTG       c.6000
 E  P  G  M  G  P  T  G  M  Q  Q  Q  P  P  W  S  Q  G  G  L         p.2000

          .         .         .         .         .         .       g.90162
 CCTCAGCCCCAGCAACTACAGTCTGGGATGCCAAGGCCAGCCATGATGTCAGTGGCCCAG       c.6060
 P  Q  P  Q  Q  L  Q  S  G  M  P  R  P  A  M  M  S  V  A  Q         p.2020

          .         .         .         .         .         .       g.90222
 CATGGTCAACCTTTGAACATGGCTCCACAACCAGGATTGGGCCAGGTAGGTATCAGCCCA       c.6120
 H  G  Q  P  L  N  M  A  P  Q  P  G  L  G  Q  V  G  I  S  P         p.2040

          .         .         .         .         .         .       g.90282
 CTCAAACCAGGCACTGTGTCTCAACAAGCCTTACAAAACCTTTTGCGGACTCTCAGGTCT       c.6180
 L  K  P  G  T  V  S  Q  Q  A  L  Q  N  L  L  R  T  L  R  S         p.2060

          .         .         .         .         .         .       g.90342
 CCCAGCTCTCCCCTGCAGCAGCAACAGGTGCTTAGTATCCTTCACGCCAACCCCCAGCTG       c.6240
 P  S  S  P  L  Q  Q  Q  Q  V  L  S  I  L  H  A  N  P  Q  L         p.2080

          .         .         .         .         .         .       g.90402
 TTGGCTGCATTCATCAAGCAGCGGGCTGCCAAGTATGCCAACTCTAATCCACAACCCATC       c.6300
 L  A  A  F  I  K  Q  R  A  A  K  Y  A  N  S  N  P  Q  P  I         p.2100

          .         .         .         .         .         .       g.90462
 CCTGGGCAGCCTGGCATGCCCCAGGGGCAGCCAGGGCTACAGCCACCTACCATGCCAGGT       c.6360
 P  G  Q  P  G  M  P  Q  G  Q  P  G  L  Q  P  P  T  M  P  G         p.2120

          .         .         .         .         .         .       g.90522
 CAGCAGGGGGTCCACTCCAATCCAGCCATGCAGAACATGAATCCAATGCAGGCGGGCGTT       c.6420
 Q  Q  G  V  H  S  N  P  A  M  Q  N  M  N  P  M  Q  A  G  V         p.2140

          .         .         .         .         .         .       g.90582
 CAGAGGGCTGGCCTGCCCCAGCAGCAACCACAGCAGCAACTCCAGCCACCCATGGGAGGG       c.6480
 Q  R  A  G  L  P  Q  Q  Q  P  Q  Q  Q  L  Q  P  P  M  G  G         p.2160

          .         .         .         .         .         .       g.90642
 ATGAGCCCCCAGGCTCAGCAGATGAACATGAACCACAACACCATGCCTTCACAATTCCGA       c.6540
 M  S  P  Q  A  Q  Q  M  N  M  N  H  N  T  M  P  S  Q  F  R         p.2180

          .         .         .         .         .         .       g.90702
 GACATCTTGAGACGACAGCAAATGATGCAACAGCAGCAGCAACAGGGAGCAGGGCCAGGA       c.6600
 D  I  L  R  R  Q  Q  M  M  Q  Q  Q  Q  Q  Q  G  A  G  P  G         p.2200

          .         .         .         .         .         .       g.90762
 ATAGGCCCTGGAATGGCCAACCATAACCAGTTCCAGCAACCCCAAGGAGTTGGCTACCCA       c.6660
 I  G  P  G  M  A  N  H  N  Q  F  Q  Q  P  Q  G  V  G  Y  P         p.2220

          .         .         .         .         .         .       g.90822
 CCACAGCAGCAGCAGCGGATGCAGCATCACATGCAACAGATGCAACAAGGAAATATGGGA       c.6720
 P  Q  Q  Q  Q  R  M  Q  H  H  M  Q  Q  M  Q  Q  G  N  M  G         p.2240

          .         .         .         .         .         .       g.90882
 CAGATAGGCCAGCTTCCCCAGGCCTTGGGAGCAGAGGCAGGTGCCAGTCTACAGGCCTAT       c.6780
 Q  I  G  Q  L  P  Q  A  L  G  A  E  A  G  A  S  L  Q  A  Y         p.2260

          .         .         .         .         .         .       g.90942
 CAGCAGCGACTCCTTCAGCAACAGATGGGGTCCCCTGTTCAGCCCAACCCCATGAGCCCC       c.6840
 Q  Q  R  L  L  Q  Q  Q  M  G  S  P  V  Q  P  N  P  M  S  P         p.2280

          .         .         .         .         .         .       g.91002
 CAGCAGCATATGCTCCCAAATCAGGCCCAGTCCCCACACCTACAAGGCCAGCAGATCCCT       c.6900
 Q  Q  H  M  L  P  N  Q  A  Q  S  P  H  L  Q  G  Q  Q  I  P         p.2300

          .         .         .         .         .         .       g.91062
 AATTCTCTCTCCAATCAAGTGCGCTCTCCCCAGCCTGTCCCTTCTCCACGGCCACAGTCC       c.6960
 N  S  L  S  N  Q  V  R  S  P  Q  P  V  P  S  P  R  P  Q  S         p.2320

          .         .         .         .         .         .       g.91122
 CAGCCCCCCCACTCCAGTCCTTCCCCAAGGATGCAGCCTCAGCCTTCTCCACACCACGTT       c.7020
 Q  P  P  H  S  S  P  S  P  R  M  Q  P  Q  P  S  P  H  H  V         p.2340

          .         .         .         .         .         .       g.91182
 TCCCCACAGACAAGTTCCCCACATCCTGGACTGGTAGCTGCCCAGGCCAACCCCATGGAA       c.7080
 S  P  Q  T  S  S  P  H  P  G  L  V  A  A  Q  A  N  P  M  E         p.2360

          .         .         .         .         .         .       g.91242
 CAAGGGCATTTTGCCAGCCCGGACCAGAATTCAATGCTTTCTCAGCTTGCTAGCAATCCA       c.7140
 Q  G  H  F  A  S  P  D  Q  N  S  M  L  S  Q  L  A  S  N  P         p.2380

          .         .         .         .         .         .       g.91302
 GGCATGGCAAACCTCCATGGTGCAAGCGCCACGGACCTGGGACTCAGCACCGATAACTCA       c.7200
 G  M  A  N  L  H  G  A  S  A  T  D  L  G  L  S  T  D  N  S         p.2400

          .         .         .         .                           g.91347
 GACTTGAATTCAAACCTCTCACAGAGTACACTAGACATACACTAG                      c.7245
 D  L  N  S  N  L  S  Q  S  T  L  D  I  H  X                        p.2414

          .         .         .         .         .         .       g.91407
 agacaccttgtagtattttgggagcaaaaaaattattttctcttaacaagactttttgta       c.*60

          .         .         .         .         .         .       g.91467
 ctgaaaacaatttttttgaatctttcgtagcctaaaagacaattttccttggaacacata       c.*120

          .         .         .         .         .         .       g.91527
 agaactgtgcagtagccgtttgtggtttaaagcaaacatgcaagatgaacctgagggatg       c.*180

          .         .         .         .         .         .       g.91587
 atagaatacaaagaatatatttttgttatggctggttaccaccagcctttcttccccttt       c.*240

          .         .         .         .         .         .       g.91647
 gtgtgtgtggttcaagtgtgcactgggaggaggctgaggcctgtgaagccaaacaatatg       c.*300

          .         .         .         .         .         .       g.91707
 ctcctgccttgcacctccaataggttttattattttttttaaattaatgaacatatgtaa       c.*360

          .         .         .         .         .         .       g.91767
 tattaatagttattatttactggtgcagatggttgacatttttccctattttcctcactt       c.*420

          .         .         .         .         .         .       g.91827
 tatggaagagttaaaacatttctaaaccagaggacaaaaggggttaatgttactttaaaa       c.*480

          .         .         .         .         .         .       g.91887
 ttacattctatatatatataaatatatataaatatatattaaaataccagttttttttct       c.*540

          .         .         .         .         .         .       g.91947
 ctgggtgcaaagatgttcattcttttaaaaaatgtttaaaaaaaaaaaaaaactgccttt       c.*600

          .         .         .         .         .         .       g.92007
 cttcccctcaagtcaacttttgtgctccagaaaattttctattctgtaagtctgagcgta       c.*660

          .         .         .         .         .         .       g.92067
 aaacttcaagtattaaaataatttgtacatgtagagagaaaaatgactttttcaaaaata       c.*720

          .         .         .         .         .         .       g.92127
 tacaggggcagctgccaaattgatgtattatatattgtggtttctgtttcttgaaagaat       c.*780

          .         .         .         .         .         .       g.92187
 ttttttcgttatttttacatctaacaaagtaaaaaaattaaaaagagggtaagaaacgat       c.*840

          .         .         .         .         .         .       g.92247
 tccggtgggatgattttaacatgcaaaatgtccctgggggtttcttctttgcttgctttc       c.*900

          .         .         .         .         .         .       g.92307
 ttcctccttaccctaccccccactcacacacacacacacacacacacacacacacacaca       c.*960

          .         .         .         .         .         .       g.92367
 cacacactttctataaaacttgaaaatagcaaaaaccctcaactgttgtaaatcatgcaa       c.*1020

          .         .         .         .         .         .       g.92427
 ttaaagttgattacttataaatatgaactttggatcactgtatagactgttaaatttgat       c.*1080

          .         .         .         .                           g.92468
 ttcttattacctattgttaaataaactgtgtgagacagaca                          c.*1121

 (downstream sequence)
Legend:
Nucleotide numbering (following the HGVS rules for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG codon for the translation-initiating methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The E1A binding protein p300 sequence is shown below the coding DNA sequence, with numbering indicated at the right, starting with 1 for the translation-initiating methionine. Every 10th amino acid is shown in bold. The position of each intron is indicated by a vertical line separating the two flanking exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frameshift variants, all stop codons in the +1 frame are shown in bold, while all stop codons in the +2 frame are underlined.
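
The numbering rules in the legend can be illustrated with a short Python sketch. It is not part of LOVD or of any HGVS tool; the names CDS_END, parse_c and codon_of are hypothetical helpers introduced here for illustration. The sketch assumes that the last coding nucleotide is c.7245 (the end of the stop codon, as read from the listing above) and relies on the HGVS convention that there is no nucleotide c.0, so c.-1 is immediately followed by c.1.

```python
import re

# Assumption read from the listing above: the coding sequence ends at c.7245
# (third base of the stop codon). Positions after it are written c.*1, c.*2, ...
CDS_END = 7245

_POS = re.compile(r"^c\.([-*]?)(\d+)$")


def parse_c(pos: str) -> int:
    """Map a simple coding DNA position onto one continuous axis.

    c.1 -> 0, c.-1 -> -1, c.*1 -> CDS_END. Intronic offsets such as
    c.94+1 are deliberately not handled in this sketch.
    """
    m = _POS.match(pos)
    if m is None:
        raise ValueError(f"unsupported position: {pos!r}")
    prefix, n = m.group(1), int(m.group(2))
    if n == 0:
        raise ValueError("HGVS numbering has no nucleotide 0")
    if prefix == "-":          # 5' of the ATG
        return -n
    if prefix == "*":          # 3' of the stop codon
        return CDS_END + n - 1
    if n > CDS_END:
        raise ValueError(f"{pos} lies beyond the coding sequence")
    return n - 1               # within the coding sequence


def codon_of(pos: str) -> tuple[int, int]:
    """Return (codon number, position within codon) for a coding position."""
    idx = parse_c(pos)
    if not 0 <= idx < CDS_END:
        raise ValueError(f"{pos} is not in the coding sequence")
    return idx // 3 + 1, idx % 3 + 1


if __name__ == "__main__":
    print(codon_of("c.60"))                       # (20, 3)
    print(codon_of("c.95"))                       # (32, 2)
    print(parse_c("c.1") - parse_c("c.-1"))       # 1
    print(parse_c("c.*60") - parse_c("c.7245"))   # 60
```

The results agree with the listing: c.60 closes codon 20 (the line labelled p.20), c.95, the first nucleotide of exon 2, is the second base of codon 32, and the distance from c.7245 to c.*60 equals the difference between g.91347 and g.91407.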

Powered by LOVD v.3.0 Build 14c
©2004-2016 Leiden University Medical Center