E1A binding protein p400 (EP400) - coding DNA reference sequence

(used for variant description)

(last modified January 4, 2025)


This file provides a coding DNA reference sequence for describing sequence variants in transcript NM_015409.4 of the EP400 gene, following the HGVS recommendations.

The sequence was taken from NC_000012.11, covering EP400 transcript NM_015409.4.
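
The c. and p. labels printed at the end of the rows below follow the usual coding DNA numbering: the A of the ATG start codon is c.1, the nucleotide just before it is c.-1, and there is no c.0. As a minimal sketch of that arithmetic (the function names codon_position and offset_between are illustrative only and are not part of this file or of any HGVS tool), the relation between the labels could be checked like this:

```python
from typing import Tuple


def codon_position(c_pos: int) -> Tuple[int, int]:
    """Map a coding position (c.1 or later) to (codon number, position in codon)."""
    if c_pos < 1:
        raise ValueError("only c.1 and later positions fall inside a codon")
    return (c_pos - 1) // 3 + 1, (c_pos - 1) % 3 + 1


def offset_between(c_from: int, c_to: int) -> int:
    """Nucleotides to step from c_from to c_to, accounting for the missing c.0."""
    if c_from == 0 or c_to == 0:
        raise ValueError("c.0 does not exist in coding DNA numbering")
    step = c_to - c_from
    if c_from < 0 < c_to:
        step -= 1  # stepping forward across the c.-1 / c.1 junction
    elif c_to < 0 < c_from:
        step += 1  # stepping backward across the same junction
    return step


if __name__ == "__main__":
    # The row labelled c.60 is paired with p.20 in this file.
    print(codon_position(60))
    # Two 60-nucleotide rows separate the c.-61 label from the c.60 label.
    print(offset_between(-61, 60))
```

The sketch covers only positions in the 5' region and the coding region shown in this file; intronic positions (such as c.1335+1) and positions after the stop codon would need additional handling.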


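Exon boundaries are marked with a vertical bar (|), and the number of the exon that begins there is shown on the line above the sequence. In the interactive version of this file, the intron sequences are available by clicking on these exon numbers.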
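
Each nucleotide row in this file ends with a c. label and may contain a vertical bar where a new exon starts. The following is a rough sketch of how such a row could be read programmatically; the function name parse_sequence_row and the regular expression are assumptions made for illustration, based only on the layout visible in this file, and are not an official parser for this format.

```python
import re
from typing import List, Optional, Tuple

# A nucleotide row: bases (optionally split by " | "), then a "c." label.
ROW = re.compile(r"^\s*([ACGTacgt| ]+?)\s+c\.(-?\d+)\s*$")


def parse_sequence_row(line: str) -> Optional[Tuple[str, int, List[int]]]:
    """Return (bases, c. label of the last base, exon boundary offsets).

    Boundary offsets count the bases that precede each "|" in the row.
    Ruler rows (g. labels) and protein rows (p. labels) return None.
    """
    match = ROW.match(line)
    if match is None:
        return None
    body, label = match.groups()
    segments = [segment.replace(" ", "") for segment in body.split("|")]
    boundaries = []
    seen = 0
    for segment in segments[:-1]:
        seen += len(segment)
        boundaries.append(seen)
    return "".join(segments), int(label), boundaries


if __name__ == "__main__":
    row = (" ccggagcgcagccctgaggcccagg | "
           "gagaacgacacattggatacagaagggaggtgatc    c.-1")
    print(parse_sequence_row(row))
```

The exon number itself sits on the ruler row above the nucleotides, so a fuller reader would have to pair each nucleotide row with the ruler row that precedes it; that step is left out of this sketch.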
 (upstream sequence)
                               .         .         .                g.5032
                             gtagcagccgcgccgccgcttcctcccgccgg       c.-121

 .         .         .         .         .         .                g.5092
 ggccccggatgcactgagcggctgcggcgcggcttccatcctcccgccctcctgacgcgg       c.-61

 .         .         .     | 02   .         .         .             g.15700
 ccggagcgcagccctgaggcccagg | gagaacgacacattggatacagaagggaggtgatc    c.-1

          .         .         .         .         .         .       g.15760
 ATGCACCATGGCACTGGCCCCCAGAACGTCCAGCATCAGCTGCAGAGGTCCAGGGCCTGC       c.60
 M  H  H  G  T  G  P  Q  N  V  Q  H  Q  L  Q  R  S  R  A  C         p.20

          .         .         .         .         .         .       g.15820
 CCTGGCAGCGAGGGTGAGGAGCAGCCGGCCCACCCCAACCCACCCCCGTCCCCCGCAGCT       c.120
 P  G  S  E  G  E  E  Q  P  A  H  P  N  P  P  P  S  P  A  A         p.40

          .         .         .         .         .         .       g.15880
 CCCTTCGCTCCCTCAGCAAGCCCGTCGGCACCCCAGTCTCCCAGTTATCAAATACAGCAG       c.180
 P  F  A  P  S  A  S  P  S  A  P  Q  S  P  S  Y  Q  I  Q  Q         p.60

          .         .         .         .         .         .       g.15940
 CTGATGAATAGGAGCCCTGCAACCGGGCAGAACGTGAACATCACCCTGCAGAGCGTGGGC       c.240
 L  M  N  R  S  P  A  T  G  Q  N  V  N  I  T  L  Q  S  V  G         p.80

          .         .         .         .         .         .       g.16000
 CCTGTCGTCGGGGGAAACCAGCAGATCACACTGGCCCCACTGCCGCTCCCCAGCCCCACC       c.300
 P  V  V  G  G  N  Q  Q  I  T  L  A  P  L  P  L  P  S  P  T         p.100

          .         .         .         .         .         .       g.16060
 TCTCCAGGCTTCCAGTTCAGCGCTCAGCCTCGGCGGTTTGAGCATGGGTCTCCATCATAC       c.360
 S  P  G  F  Q  F  S  A  Q  P  R  R  F  E  H  G  S  P  S  Y         p.120

          .         .         .         .         .         .       g.16120
 ATTCAGGTCACGTCCCCCTTGTCCCAGCAGGTCCAGACCCAGAGTCCCACGCAGCCCAGT       c.420
 I  Q  V  T  S  P  L  S  Q  Q  V  Q  T  Q  S  P  T  Q  P  S         p.140

          .         .         .         .         .         .       g.16180
 CCGGGGCCGGGGCAGGCCTTGCAGAATGTGCGTGCAGGTGCCCCTGGCCCTGGGCTGGGC       c.480
 P  G  P  G  Q  A  L  Q  N  V  R  A  G  A  P  G  P  G  L  G         p.160

          .         .         .         .         .         .       g.16240
 CTCTGCAGCAGCAGCCCTACAGGGGGCTTCGTGGATGCCAGCGTGCTGGTGAGGCAGATC       c.540
 L  C  S  S  S  P  T  G  G  F  V  D  A  S  V  L  V  R  Q  I         p.180

          .         .         .         .         .         .       g.16300
 AGCTTGAGCCCCTCCAGTGGTGGACACTTTGTGTTTCAGGATGGGTCAGGGCTCACCCAG       c.600
 S  L  S  P  S  S  G  G  H  F  V  F  Q  D  G  S  G  L  T  Q         p.200

          .         .         .         .         .         .       g.16360
 ATCGCCCAGGGAGCCCAGGTTCAGCTCCAGCACCCGGGTACGCCCATCACAGTCCGAGAG       c.660
 I  A  Q  G  A  Q  V  Q  L  Q  H  P  G  T  P  I  T  V  R  E         p.220

          .         .         .         .         .         .       g.16420
 CGGAGACCCTCCCAGCCCCACACACAGTCAGGGGGCACCATCCACCACCTGGGACCCCAG       c.720
 R  R  P  S  Q  P  H  T  Q  S  G  G  T  I  H  H  L  G  P  Q         p.240

          .         .         .         .         .         .       g.16480
 AGCCCTGCAGCCGCGGGTGGGGCCGGCCTGCAGCCCCTGGCCAGCCCAAGCCACATCACC       c.780
 S  P  A  A  A  G  G  A  G  L  Q  P  L  A  S  P  S  H  I  T         p.260

          .         .         .         .         .         .       g.16540
 ACGGCTAACTTGCCACCGCAGATCAGCAGCATCATCCAGGGCCAGCTGGTTCAGCAGCAG       c.840
 T  A  N  L  P  P  Q  I  S  S  I  I  Q  G  Q  L  V  Q  Q  Q         p.280

          .         .         .         .         .         .       g.16600
 CAGGTGCTGCAGGGGCCGCCGCTGCCCCGGCCCCTGGGCTTCGAGAGGACACCCGGCGTG       c.900
 Q  V  L  Q  G  P  P  L  P  R  P  L  G  F  E  R  T  P  G  V         p.300

          .         .         .         .         .         .       g.16660
 CTGCTCCCCGGGGCTGGGGGCGCAGCGGGGTTTGGGATGACGTCCCCACCCCCGCCCACC       c.960
 L  L  P  G  A  G  G  A  A  G  F  G  M  T  S  P  P  P  P  T         p.320

          .         .         .         .         .         .       g.16720
 AGCCCTTCCAGGACTGCCGTGCCCCCAGGCCTTTCCAGCCTCCCACTCACGTCTGTGGGG       c.1020
 S  P  S  R  T  A  V  P  P  G  L  S  S  L  P  L  T  S  V  G         p.340

          .         .         .         .         .         .       g.16780
 AACACGGGAATGAAGAAGGTTCCCAAGAAGTTAGAGGAGATTCCCCCAGCCTCTCCGGAG       c.1080
 N  T  G  M  K  K  V  P  K  K  L  E  E  I  P  P  A  S  P  E         p.360

          .         .         .         .         .         .       g.16840
 ATGGCACAGATGAGGAAGCAGTGCCTGGACTATCATTACCAGGAGATGCAGGCTCTGAAG       c.1140
 M  A  Q  M  R  K  Q  C  L  D  Y  H  Y  Q  E  M  Q  A  L  K         p.380

          .         .         .         .         .         .       g.16900
 GAGGTCTTCAAGGAGTATTTGATTGAACTGTTTTTCTTGCAACACTTTCAAGGGAACATG       c.1200
 E  V  F  K  E  Y  L  I  E  L  F  F  L  Q  H  F  Q  G  N  M         p.400

          .         .         .         .         .         .       g.16960
 ATGGATTTCTTAGCTTTCAAGAAGAAACATTATGCCCCATTACAAGCATATCTTAGGCAG       c.1260
 M  D  F  L  A  F  K  K  K  H  Y  A  P  L  Q  A  Y  L  R  Q         p.420

          .         .         .         .         .         .       g.17020
 AATGATTTGGACATTGAAGAAGAGGAGGAGGAGGAGGAAGAGGAGGAAGAAAAATCTGAG       c.1320
 N  D  L  D  I  E  E  E  E  E  E  E  E  E  E  E  E  K  S  E         p.440

          .      | 03  .         .         .         .         .    g.34819
 GTTATCAATGACGAG | CAGCAAGCCCTCGCAGGGAGCCTGGTAGCAGGGGCCGGAAGCACA    c.1380
 V  I  N  D  E   | Q  Q  A  L  A  G  S  L  V  A  G  A  G  S  T      p.460

          .         .         .         .         .      | 04  .    g.36574
 GTAGAGACGGACCTGTTTAAGAGGCAGCAGGCGATGCCCTCCACAGGTATGGCAG | AGCAG    c.1440
 V  E  T  D  L  F  K  R  Q  Q  A  M  P  S  T  G  M  A  E |   Q      p.480

          .         .         .         .         .         .       g.36634
 TCTAAGAGGCCTCGCCTTGAAGTGGGTCACCAAGGGGTAGTTTTCCAGCACCCAGGGGCG       c.1500
 S  K  R  P  R  L  E  V  G  H  Q  G  V  V  F  Q  H  P  G  A         p.500

          .         .         .         .    | 05    .         .    g.37190
 GACGCAGGCGTTCCTCTCCAGCAACTAATGCCGACCGCACAAG | GAGGAATGCCCCCCACG    c.1560
 D  A  G  V  P  L  Q  Q  L  M  P  T  A  Q  G |   G  M  P  P  T      p.520

          .         .         .         .         .         .       g.37250
 CCGCAGGCCGCGCAGCTCGCTGGACAGAGGCAGAGTCAGCAGCAGTATGACCCCTCCACG       c.1620
 P  Q  A  A  Q  L  A  G  Q  R  Q  S  Q  Q  Q  Y  D  P  S  T         p.540

          .         .         .         .         .         .       g.37310
 GGGCCTCCCGTGCAGAACGCTGCCAGCTTGCACACCCCACTGCCGCAGCTGCCCGGGAGG       c.1680
 G  P  P  V  Q  N  A  A  S  L  H  T  P  L  P  Q  L  P  G  R         p.560

          .         .         .         .         .         .       g.37370
 CTGCCCCCAGCCGGTGTTCCCACTGCAGCCCTCTCCTCTGCGCTGCAGTTTGCACAGCAG       c.1740
 L  P  P  A  G  V  P  T  A  A  L  S  S  A  L  Q  F  A  Q  Q         p.580

          .         .         .         .         .         .       g.37430
 CCGCAAGTGGTAGAGGCCCAGACACAGCTCCAAATCCCGGTGAAGACTCAGCAGCCCAAT       c.1800
 P  Q  V  V  E  A  Q  T  Q  L  Q  I  P  V  K  T  Q  Q  P  N         p.600

          .         .         .         .         .         .       g.37490
 GTTCCCATCCCTGCACCGCCCAGCAGCCAACTCCCCATCCCTCCCTCGCAGCCTGCACAG       c.1860
 V  P  I  P  A  P  P  S  S  Q  L  P  I  P  P  S  Q  P  A  Q         p.620

          .         .         .         .         .         .       g.37550
 CTGGCCCTCCACGTTCCCACACCTGGAAAGGTGCAGGTGCAGGCCTCTCAGCTTTCCTCC       c.1920
 L  A  L  H  V  P  T  P  G  K  V  Q  V  Q  A  S  Q  L  S  S         p.640

           | 06        .         .         .         .         .    g.41645
 CTGCCACAG | ATGGTAGCATCGACAAGGCTCCCTGTGGACCCTGCCCCGCCCTGCCCACGG    c.1980
 L  P  Q   | M  V  A  S  T  R  L  P  V  D  P  A  P  P  C  P  R      p.660

          .         .         .         .         .         .       g.41705
 CCTCTGCCCACCTCTTCTACCTCGTCCCTCGCGCCTGTGAGTGGCTCCGGCCCAGGACCC       c.2040
 P  L  P  T  S  S  T  S  S  L  A  P  V  S  G  S  G  P  G  P         p.680

          .         .         .         .         .         .       g.41765
 TCCCCTGCTCGATCCTCTCCAGTAAATAGACCTTCCTCAGCCACCAATAAGGCACTATCT       c.2100
 S  P  A  R  S  S  P  V  N  R  P  S  S  A  T  N  K  A  L  S         p.700

          .         .         .         .         .         .       g.41825
 CCAGTCACTTCCCGGACCCCAGGGGTGGTGGCATCTGCCCCCACCAAACCACAGAGTCCT       c.2160
 P  V  T  S  R  T  P  G  V  V  A  S  A  P  T  K  P  Q  S  P         p.720

          .         .         .         .         .         .       g.41885
 GCTCAGAATGCCACCTCGTCCCAAGACAGTTCTCAGGATACGCTGACAGAACAAATAACT       c.2220
 A  Q  N  A  T  S  S  Q  D  S  S  Q  D  T  L  T  E  Q  I  T         p.740

     | 07    .         .         .         .         .         .    g.42842
 CTG | GAGAACCAGGTGCATCAGCGCATTGCGGAGCTGAGGAAAGCAGGTCTGTGGTCCCAG    c.2280
 L   | E  N  Q  V  H  Q  R  I  A  E  L  R  K  A  G  L  W  S  Q      p.760

          .         .         .         .         .         .       g.42902
 AGGCGTCTGCCAAAGCTGCAGGAGGCCCCACGCCCCAAGTCCCACTGGGACTATCTGCTG       c.2340
 R  R  L  P  K  L  Q  E  A  P  R  P  K  S  H  W  D  Y  L  L         p.780

          .         .         .         .         .         .       g.42962
 GAGGAGATGCAGTGGATGGCCACAGACTTTGCCCAGGAGAGGAGGTGGAAGGTGGCTGCT       c.2400
 E  E  M  Q  W  M  A  T  D  F  A  Q  E  R  R  W  K  V  A  A         p.800

           | 08        .         .         .         .         .    g.45095
 GCGAAGAAG | CTCGTTAGAACTGTGGTGCGCCATCACGAGGAGAAGCAGCTCCGTGAAGAA    c.2460
 A  K  K   | L  V  R  T  V  V  R  H  H  E  E  K  Q  L  R  E  E      p.820

          .         .         .         .         .         .       g.45155
 AGGGGGAAGAAGGAAGAGCAGAGCAGACTGAGGCGGATAGCCGCCTCCACGGCCCGGGAG       c.2520
 R  G  K  K  E  E  Q  S  R  L  R  R  I  A  A  S  T  A  R  E         p.840

          .         .         . | 09       .         .         .    g.45746
 ATAGAGTGCTTTTGGTCGAATATTGAACAG | GTTGTGGAAATAAAACTACGAGTAGAATTA    c.2580
 I  E  C  F  W  S  N  I  E  Q   | V  V  E  I  K  L  R  V  E  L      p.860

          .         .         .         .          | 10        .    g.46498
 GAAGAAAAAAGGAAGAAGGCCTTAAATTTACAGAAAGTTTCCAGGAGAG | GGAAAGAATTG    c.2640
 E  E  K  R  K  K  A  L  N  L  Q  K  V  S  R  R  G |   K  E  L      p.880

          .         .         .          | 11        .         .    g.47274
 AGACCTAAAGGATTTGACGCATTACAGGAAAGTTCTCTG | GATTCAGGAATGTCTGGAAGA    c.2700
 R  P  K  G  F  D  A  L  Q  E  S  S  L   | D  S  G  M  S  G  R      p.900

          .         .         .        | 12.         .         .    g.49970
 AAAAGAAAAGCTAGCATATCTTTGACTGATGACGAAG | TGGACGATGAAGAGGAAACAATT    c.2760
 K  R  K  A  S  I  S  L  T  D  D  E  V |   D  D  E  E  E  T  I      p.920

          .         .         .         .         .         .       g.50030
 GAAGAGGAGGAAGCAAATGAAGGCGTTGTGGACCACCAAACAGAACTTTCTAATTTAGCC       c.2820
 E  E  E  E  A  N  E  G  V  V  D  H  Q  T  E  L  S  N  L  A         p.940

         | 13.         .         .         .         .         .    g.60210
 AAGGAAG | CTGAGCTGCCCCTCCTGGACCTGATGAAGCTGTACGAAGGCGCCTTCCTGCCG    c.2880
 K  E  A |   E  L  P  L  L  D  L  M  K  L  Y  E  G  A  F  L  P      p.960

          .         .         .         .         .      | 14  .    g.61197
 AGTTCTCAGTGGCCCCGGCCGAAGCCTGATGGGGAGGACACAAGCGGAGAGGAAG | ATGCA    c.2940
 S  S  Q  W  P  R  P  K  P  D  G  E  D  T  S  G  E  E  D |   A      p.980

          .         .         .         .         .         .       g.61257
 GATGACTGTCCAGGCGACAGGGAGAGTCGCAAGGACTTGGTTCTCATCGACTCGCTTTTC       c.3000
 D  D  C  P  G  D  R  E  S  R  K  D  L  V  L  I  D  S  L  F         p.1000

          .         .         .         .         .         .       g.61317
 ATCATGGATCAGTTCAAAGCTGCCGAGAGGATGAATATCGGGAAGCCAAACGCCAAGGAC       c.3060
 I  M  D  Q  F  K  A  A  E  R  M  N  I  G  K  P  N  A  K  D         p.1020

          .         .         .         .         .         .       g.61377
 ATTGCGGACGTCACTGCGGTGGCTGAAGCCATCCTGCCGAAGGGCAGTGCTCGGGTCACA       c.3120
 I  A  D  V  T  A  V  A  E  A  I  L  P  K  G  S  A  R  V  T         p.1040

        | 15 .         .         .         .         .         .    g.61834
 ACCTCG | GTCAAGTTTAATGCTCCATCTTTGTTGTATGGGGCTCTCAGAGATTATCAGAAG    c.3180
 T  S   | V  K  F  N  A  P  S  L  L  Y  G  A  L  R  D  Y  Q  K      p.1060

          .         .         .         .         .         .       g.61894
 ATTGGCCTGGACTGGCTGGCCAAACTTTACAGGAAGAATCTCAATGGCATATTGGCAGAT       c.3240
 I  G  L  D  W  L  A  K  L  Y  R  K  N  L  N  G  I  L  A  D         p.1080

          .         .         .         .         .         .       g.61954
 GAAGCTGGGCTGGGTAAAACAGTGCAGATCATTGCTTTTTTTGCCCACCTAGCTTGTAAC       c.3300
 E  A  G  L  G  K  T  V  Q  I  I  A  F  F  A  H  L  A  C  N         p.1100

      | 16   .         .         .         .         .         .    g.66634
 GAAG | GTAATTGGGGCCCCCATCTTGTTGTTGTGAGAAGTTGTAACATACTCAAGTGGGAG    c.3360
 E  G |   N  W  G  P  H  L  V  V  V  R  S  C  N  I  L  K  W  E      p.1120

          .         .         .         .         .         .       g.66694
 CTTGAATTGAAACGTTGGTGTCCCGGACTCAAAATCCTCTCATATATTGGCAGCCACAGA       c.3420
 L  E  L  K  R  W  C  P  G  L  K  I  L  S  Y  I  G  S  H  R         p.1140

          .         .  | 17      .         .         .         .    g.68128
 GAACTCAAAGCAAAGAGACAG | GAGTGGGCCGAACCCAACAGCTTCCACGTCTGCATCACG    c.3480
 E  L  K  A  K  R  Q   | E  W  A  E  P  N  S  F  H  V  C  I  T      p.1160

          .         .         .         .         .         .       g.68188
 TCCTACACTCAGTTCTTCCGGGGCCTCACCGCCTTCACACGAGTGCGCTGGAAGTGCCTG       c.3540
 S  Y  T  Q  F  F  R  G  L  T  A  F  T  R  V  R  W  K  C  L         p.1180

          .         .         .         .         .         .       g.68248
 GTCATTGATGAGATGCAGCGCGTGAAGGGCATGACCGAGAGGCACTGGGAAGCGGTTTTC       c.3600
 V  I  D  E  M  Q  R  V  K  G  M  T  E  R  H  W  E  A  V  F         p.1200

          .  | 18      .         .         .         .         .    g.68619
 ACCCTGCAGAG | CCAACAACGTCTGCTTCTGATCGACTCGCCGCTGCACAATACCTTCCTG    c.3660
 T  L  Q  S  |  Q  Q  R  L  L  L  I  D  S  P  L  H  N  T  F  L      p.1220

          .         .         .         .         .         .       g.68679
 GAGCTCTGGACCATGGTGCACTTCCTGGTCCCAGGGATCTCCAGGCCCTACCTGAGCTCC       c.3720
 E  L  W  T  M  V  H  F  L  V  P  G  I  S  R  P  Y  L  S  S         p.1240

          .         .         .         .         .         .       g.68739
 CCTCTGAGGGCCCCCAGTGAAGAGAGCCAGGATTACTACCATAAAGTGGTCATAAGGTTA       c.3780
 P  L  R  A  P  S  E  E  S  Q  D  Y  Y  H  K  V  V  I  R  L         p.1260

        | 19 .         .         .         .         .         .    g.68911
 CACAGG | GTGACACAGCCATTTATTTTGAGGAGAACTAAGAGAGATGTGGAAAAGCAACTA    c.3840
 H  R   | V  T  Q  P  F  I  L  R  R  T  K  R  D  V  E  K  Q  L      p.1280

          .         .         .         .         .         .       g.68971
 ACAAAGAAATATGAGCATGTTTTGAAGTGTCGCCTTTCTAACCGACAAAAAGCCTTATAC       c.3900
 T  K  K  Y  E  H  V  L  K  C  R  L  S  N  R  Q  K  A  L  Y         p.1300

          .         .    | 20    .         .         .         .    g.72652
 GAGGACGTTATCCTGCAACCTGG | CACTCAGGAGGCCTTGAAGAGCGGGCACTTTGTCAAC    c.3960
 E  D  V  I  L  Q  P  G  |  T  Q  E  A  L  K  S  G  H  F  V  N      p.1320

          .         .         .         .         .         .       g.72712
 GTCCTGAGCATCCTTGTGCGGCTGCAGCGCATCTGCAACCACCCTGGGCTCGTCGAGCCC       c.4020
 V  L  S  I  L  V  R  L  Q  R  I  C  N  H  P  G  L  V  E  P         p.1340

          .         .         .         .         .         .       g.72772
 CGGCACCCAGGCTCTTCCTACGTGGCGGGGCCACTGGAGTATCCGTCCGCATCTCTAATC       c.4080
 R  H  P  G  S  S  Y  V  A  G  P  L  E  Y  P  S  A  S  L  I         p.1360

          .         .         . | 21       .         .         .    g.73320
 CTGAAGGCACTGGAGAGAGATTTCTGGAAG | GAAGCAGATCTTTCTATGTTTGATCTCATC    c.4140
 L  K  A  L  E  R  D  F  W  K   | E  A  D  L  S  M  F  D  L  I      p.1380

          .         .         .         .         .         .       g.73380
 GGCTTAGAAAATAAAATCACTCGTCACGAGGCAGAGTTGCTGTCTAAGAAAAAGATACCG       c.4200
 G  L  E  N  K  I  T  R  H  E  A  E  L  L  S  K  K  K  I  P         p.1400

          .         .         .         .         .         .       g.73440
 CGGAAACTCATGGAGGAAATCTCCACTTCAGCAGCCCCAGCAGCCCGACCAGCAGCAGCA       c.4260
 R  K  L  M  E  E  I  S  T  S  A  A  P  A  A  R  P  A  A  A         p.1420

          .        | 22.         .         .         .         .    g.75172
 AAGCTGAAGGCCAGCAG | GTTGTTTCAGCCTGTGCAGTATGGCCAGAAGCCCGAGGGTCGC    c.4320
 K  L  K  A  S  R  |  L  F  Q  P  V  Q  Y  G  Q  K  P  E  G  R      p.1440

          .         .         .         .         .         .       g.75232
 ACCGTGGCTTTCCCCAGCACTCACCCGCCCCGGACGGCAGCCCCCACCACGGCCTCTGCT       c.4380
 T  V  A  F  P  S  T  H  P  P  R  T  A  A  P  T  T  A  S  A         p.1460

          .         .         .         .         .         .       g.75292
 GCTCCACAGGGCCCGCTTCGAGGACGGCCGCCCATCGCCACGTTCTCTGCCAATCCGGAG       c.4440
 A  P  Q  G  P  L  R  G  R  P  P  I  A  T  F  S  A  N  P  E         p.1480

         | 23.         .         .         .         .         .    g.76212
 GCAAAAG | CAGCAGCAGCCCCGTTTCAGACCTCTCAGGCTTCCGCCAGTGCTCCACGACAC    c.4500
 A  K  A |   A  A  A  P  F  Q  T  S  Q  A  S  A  S  A  P  R  H      p.1500

          .         .         .         .         .         .       g.76272
 CAGCCCGCCTCGGCCTCCAGCACAGCCGCTAGCCCGGCCCATCCTGCGAAACTGCGGGCC       c.4560
 Q  P  A  S  A  S  S  T  A  A  S  P  A  H  P  A  K  L  R  A         p.1520

          .         .         .         .         .         .       g.76332
 CAGACCACAGCACAGGCCTCCACCCCAGGCCAGCCCCCGCCCCAGCCCCAGGCCCCCTCG       c.4620
 Q  T  T  A  Q  A  S  T  P  G  Q  P  P  P  Q  P  Q  A  P  S         p.1540

          .         .         .         .         .         .       g.76392
 CACGCGGCCGGGCAGAGCGCGCTGCCTCAGAGGCTGGTGCTCCCCTCGCAGGCCCAGGCC       c.4680
 H  A  A  G  Q  S  A  L  P  Q  R  L  V  L  P  S  Q  A  Q  A         p.1560

          . | 24       .         .         .         .         .    g.78907
 CGCTTGCCCA | GTGGAGAGGTAGTGAAAATAGCTCAGCTGGCATCCATCACAGGACCACAG    c.4740
 R  L  P  S |   G  E  V  V  K  I  A  Q  L  A  S  I  T  G  P  Q      p.1580

          .         .         .         .         .         .       g.78967
 AGCCGCGTGGCTCAGCCAGAGACGCCGGTGACACTGCAGTTCCAGGGCAGCAAGTTCACC       c.4800
 S  R  V  A  Q  P  E  T  P  V  T  L  Q  F  Q  G  S  K  F  T         p.1600

          .         .         .         .         .      | 25  .    g.80731
 CTGTCACACAGCCAGCTCCGGCAGCTCACAGCGGGCCAGCCGCTGCAGCTGCAAG | GCAGC    c.4860
 L  S  H  S  Q  L  R  Q  L  T  A  G  Q  P  L  Q  L  Q  G |   S      p.1620

          .         .         .         .         .         .       g.80791
 GTCCTCCAGATCGTGTCCGCCCCCGGGCAGCCCTACCTTCGAGCCCCTGGCCCTGTGGTG       c.4920
 V  L  Q  I  V  S  A  P  G  Q  P  Y  L  R  A  P  G  P  V  V         p.1640

          .         .         .         .         .         .       g.80851
 ATGCAGACCGTGTCTCAGGCGGGCGCTGTGCACGGCGCCCTGGGAAGCAAGCCCCCGGCC       c.4980
 M  Q  T  V  S  Q  A  G  A  V  H  G  A  L  G  S  K  P  P  A         p.1660

          .         .         .     | 26   .         .         .    g.82543
 GGCGGTCCCAGCCCTGCACCCTTGACCCCACAAG | TTGGCGTTCCGGGCCGCGTGGCGGTG    c.5040
 G  G  P  S  P  A  P  L  T  P  Q  V |   G  V  P  G  R  V  A  V      p.1680

          .         .         .         .         .         .       g.82603
 AATGCCTTGGCTGTAGGAGAACCCGGAACGGCCTCCAAACCAGCTTCTCCCATTGGAGGG       c.5100
 N  A  L  A  V  G  E  P  G  T  A  S  K  P  A  S  P  I  G  G         p.1700

           | 27        .         .         .         .         .    g.83148
 CCGACCCAG | GAGGAAAAGACCAGACTCTTGAAAGAGCGCCTGGATCAGATTTATTTAGTC    c.5160
 P  T  Q   | E  E  K  T  R  L  L  K  E  R  L  D  Q  I  Y  L  V      p.1720

          .         .         .         .         .         .       g.83208
 AACGAGCGGCGCTGTTCTCAAGCTCCAGTCTATGGCAGAGACTTGCTAAGGATTTGTGCC       c.5220
 N  E  R  R  C  S  Q  A  P  V  Y  G  R  D  L  L  R  I  C  A         p.1740

          .         .         .         .         .         .       g.83268
 CTGCCTAGCCATGGAAGGGTACAGTGGCGTGGGTCCCTGGATGGCCGTCGTGGGAAGGAG       c.5280
 L  P  S  H  G  R  V  Q  W  R  G  S  L  D  G  R  R  G  K  E         p.1760

          .         .         .         .         .         .       g.83328
 GCCGGGCCAGCGCACAGTTACACTTCATCCTCAGAAAGTCCAAGTGAGCTGATGTTGACG       c.5340
 A  G  P  A  H  S  Y  T  S  S  S  E  S  P  S  E  L  M  L  T         p.1780

          .         .         .         .  | 28      .         .    g.84800
 CTTTGTCGGTGTGGAGAGTCTCTGCAGGATGTTATTGACAG | GGTGGCCTTTGTGATTCCT    c.5400
 L  C  R  C  G  E  S  L  Q  D  V  I  D  R  |  V  A  F  V  I  P      p.1800

          .         .         .         .         .         .       g.84860
 CCGGTGGTGGCAGCACCCCCGTCCCTACGGGTGCCGCGGCCGCCACCCCTGTACAGCCAC       c.5460
 P  V  V  A  A  P  P  S  L  R  V  P  R  P  P  P  L  Y  S  H         p.1820

          .         .         .         .         .         .       g.84920
 AGAATGAGGATCTTGAGGCAGGGCCTGAGAGAGCACGCTGCGCCGTACTTCCAGCAGCTG       c.5520
 R  M  R  I  L  R  Q  G  L  R  E  H  A  A  P  Y  F  Q  Q  L         p.1840

          .         .         .         .         .         .       g.84980
 CGGCAGACCACGGCTCCACGCCTGCTGCAGTTCCCTGAGCTGAGGCTGGTGCAGTTCGAC       c.5580
 R  Q  T  T  A  P  R  L  L  Q  F  P  E  L  R  L  V  Q  F  D         p.1860

      | 29   .         .         .         .         .         .    g.85125
 TCAG | GGAAGTTGGAAGCTTTAGCTATCTTGCTTCAGAAATTGAAATCTGAAGGACGTCGG    c.5640
 S  G |   K  L  E  A  L  A  I  L  L  Q  K  L  K  S  E  G  R  R      p.1880

          .         .         .         .         .         .       g.85185
 GTGCTGATTTTATCACAGATGATTCTTATGTTGGACATTTTAGAGATGTTCTTGAACTTC       c.5700
 V  L  I  L  S  Q  M  I  L  M  L  D  I  L  E  M  F  L  N  F         p.1900

          .         .         .         .         .     | 30   .    g.87039
 CATTACCTCACCTATGTAAGAATCGATGAAAATGCCAGCAGTGAGCAACGGCAG | GAACTG    c.5760
 H  Y  L  T  Y  V  R  I  D  E  N  A  S  S  E  Q  R  Q   | E  L      p.1920

          .         .         .         .         .         .       g.87099
 ATGAGGAGTTTCAACAGAGACAGGCGGATTTTTTGTGCCATTCTCTCCACTCACAGCCGT       c.5820
 M  R  S  F  N  R  D  R  R  I  F  C  A  I  L  S  T  H  S  R         p.1940

          .         .         .         .         .         .       g.87159
 ACCACAGGTATAAACCTTGTAGAGGCGGACACCGTCGTGTTTTATGACAATGACCTGAAT       c.5880
 T  T  G  I  N  L  V  E  A  D  T  V  V  F  Y  D  N  D  L  N         p.1960

          .         .         .         .         .         .       g.87219
 CCAGTGATGGATGCCAAAGCTCAGGAGTGGTGCGATAGGATCGGGAGATGCAAAGACATC       c.5940
 P  V  M  D  A  K  A  Q  E  W  C  D  R  I  G  R  C  K  D  I         p.1980

          .  | 31      .         .         .         .         .    g.92811
 CACATATACAG | GCTTGTGAGTGGCAATTCCATTGAAGAGAAATTGTTGAAAAATGGAACT    c.6000
 H  I  Y  R  |  L  V  S  G  N  S  I  E  E  K  L  L  K  N  G  T      p.2000

          .         .         .         .         .         .       g.92871
 AAAGATCTGATCCGAGAAGTGGCTGCTCAGGGAAATGACTACTCCATGGCTTTCTTAACT       c.6060
 K  D  L  I  R  E  V  A  A  Q  G  N  D  Y  S  M  A  F  L  T         p.2020

     | 32    .         .         .         .         .         .    g.93090
 CAG | CGAACCATCCAGGAGCTGTTTGAAGTTTATTCTCCCATGGATGATGCTGGCTTCCCG    c.6120
 Q   | R  T  I  Q  E  L  F  E  V  Y  S  P  M  D  D  A  G  F  P      p.2040

          .         .         .         .         .         .       g.93150
 GTCAAAGCTGAGGAGTTTGTGGTGCTTTCTCAGGAACCTTCTGTCACGGAAACCATTGCA       c.6180
 V  K  A  E  E  F  V  V  L  S  Q  E  P  S  V  T  E  T  I  A         p.2060

          .         .        | 33.         .         .         .    g.98417
 CCCAAAATTGCAAGACCTTTCATAGAG | GCCCTCAAGAGTATTGAGTATCTGGAGGAGGAT    c.6240
 P  K  I  A  R  P  F  I  E   | A  L  K  S  I  E  Y  L  E  E  D      p.2080

          .         .         .         .         .         .       g.98477
 GCCCAGAAGTCCGCACAGGAGGGGGTGCTGGGACCACACACTGATGCTCTGTCATCAGAC       c.6300
 A  Q  K  S  A  Q  E  G  V  L  G  P  H  T  D  A  L  S  S  D         p.2100

          .         .         .         .         .         .       g.98537
 TCTGAGAACATGCCGTGTGATGAAGAACCATCCCAATTAGAGGAGCTAGCTGACTTCATG       c.6360
 S  E  N  M  P  C  D  E  E  P  S  Q  L  E  E  L  A  D  F  M         p.2120

        | 34 .         .         .         .         .         .    g.98779
 GAGCAG | CTTACACCAATTGAAAAATATGCTTTAAATTACCTGGAATTATTCCATACTTCT    c.6420
 E  Q   | L  T  P  I  E  K  Y  A  L  N  Y  L  E  L  F  H  T  S      p.2140

          .         .         . | 35       .         .         .    g.99287
 ATTGAGCAAGAAAAGGAGAGAAACAGTGAG | GACGCAGTGATGACTGCAGTGAGGGCATGG    c.6480
 I  E  Q  E  K  E  R  N  S  E   | D  A  V  M  T  A  V  R  A  W      p.2160

          .         .         .         .         .         .       g.99347
 GAGTTCTGGAACCTGAAGACCCTGCAGGAGAGGGAGGCCCGGCTGCGGCTGGAGCAGGAG       c.6540
 E  F  W  N  L  K  T  L  Q  E  R  E  A  R  L  R  L  E  Q  E         p.2180

          .         .         .         .      | 36  .         .    g.99766
 GAGGCGGAGCTCCTGACCTACACGCGAGAGGATGCCTACAGCATG | GAGTATGTCTACGAA    c.6600
 E  A  E  L  L  T  Y  T  R  E  D  A  Y  S  M   | E  Y  V  Y  E      p.2200

          .         .         . | 37       .         .         .    g.99910
 GATGTCGATGGGCAGACAGAAGTCATGCCG | CTCTGGACCCCACCCACCCCGCCGCAGGAC    c.6660
 D  V  D  G  Q  T  E  V  M  P   | L  W  T  P  P  T  P  P  Q  D      p.2220

          .         .         .         .         .         .       g.99970
 GACAGCGACATCTACCTCGACTCGGTCATGTGTCTCATGTATGAAGCCACTCCCATCCCA       c.6720
 D  S  D  I  Y  L  D  S  V  M  C  L  M  Y  E  A  T  P  I  P         p.2240

          .         .         .         .         .         .       g.100030
 GAGGCTAAGCTGCCCCCTGTGTACGTGAGGAAGGAGCGGAAGCGACACAAAACAGACCCC       c.6780
 E  A  K  L  P  P  V  Y  V  R  K  E  R  K  R  H  K  T  D  P         p.2260

      | 38   .         .         .         .         .         .    g.100455
 TCAG | CTGCAGGCAGGAAGAAGAAGCAGCGTCACGGGGAGGCGGTCGTCCCTCCTCGGTCC    c.6840
 S  A |   A  G  R  K  K  K  Q  R  H  G  E  A  V  V  P  P  R  S      p.2280

          .         .         .         .         .         .       g.100515
 CTGTTTGACCGCGCAACACCAGGACTTCTGAAAATTCGCAGAGAGGGCAAGGAGCAGAAG       c.6900
 L  F  D  R  A  T  P  G  L  L  K  I  R  R  E  G  K  E  Q  K         p.2300

          .         .         .         .         .         .       g.100575
 AAGAATATTCTGCTGAAGCAGCAGGTGCCATTCGCCAAGCCCCTGCCAACTTTTGCCAAA       c.6960
 K  N  I  L  L  K  Q  Q  V  P  F  A  K  P  L  P  T  F  A  K         p.2320

          .         .         .         .         .         .       g.100635
 CCCACAGCTGAGCCTGGTCAAGACAACCCCGAGTGGCTCATCAGTGAGGACTGGGCGCTG       c.7020
 P  T  A  E  P  G  Q  D  N  P  E  W  L  I  S  E  D  W  A  L         p.2340

        | 39 .         .         .         .         .         .    g.100861
 CTGCAG | GCTGTAAAGCAGTTACTGGAGCTGCCTTTGAACCTCACAATCGTGTCACCTGCT    c.7080
 L  Q   | A  V  K  Q  L  L  E  L  P  L  N  L  T  I  V  S  P  A      p.2360

          .         .         .         .         .         .       g.100921
 CACACACCTAATTGGGATCTTGTCAGTGACGTTGTTAACTCCTGTAGCCGAATCTACCGC       c.7140
 H  T  P  N  W  D  L  V  S  D  V  V  N  S  C  S  R  I  Y  R         p.2380

          .         .         .         .         .         .       g.100981
 TCTTCCAAACAGTGCCGGAATCGCTACGAGAATGTCATCATTCCACGAGAGGAGGGGAAG       c.7200
 S  S  K  Q  C  R  N  R  Y  E  N  V  I  I  P  R  E  E  G  K         p.2400

  | 40       .         .         .         .         .         .    g.105463
  | AGTAAAAACAACCGTCCTCTCCGTACGAGCCAGATCTATGCCCAGGATGAGAATGCCACA    c.7260
  | S  K  N  N  R  P  L  R  T  S  Q  I  Y  A  Q  D  E  N  A  T      p.2420

          .         .         .         .         .         .       g.105523
 CACACCCAGCTGTACACGAGCCACTTTGACTTAATGAAAATGACTGCTGGCAAGAGGAGT       c.7320
 H  T  Q  L  Y  T  S  H  F  D  L  M  K  M  T  A  G  K  R  S         p.2440

          .        | 41.         .         .         .         .    g.105722
 CCCCCAATCAAACCTCT | GCTTGGCATGAATCCCTTTCAGAAGAACCCCAAGCACGCGTCT    c.7380
 P  P  I  K  P  L  |  L  G  M  N  P  F  Q  K  N  P  K  H  A  S      p.2460

          .     | 42   .         .         .         .         .    g.108272
 GTGTTGGCAGAAAG | TGGAATCAACTATGACAAGCCGCTGCCTCCCATCCAGGTGGCATCT    c.7440
 V  L  A  E  S  |  G  I  N  Y  D  K  P  L  P  P  I  Q  V  A  S      p.2480

          .         .         .    | 43    .         .         .    g.108450
 CTCCGTGCAGAGCGAATCGCAAAAGAGAAAAAG | GCTCTGGCTGATCAGCAGAAGGCACAG    c.7500
 L  R  A  E  R  I  A  K  E  K  K   | A  L  A  D  Q  Q  K  A  Q      p.2500

          .         .         .         .         .         .       g.108510
 CAGCCGGCCGTGGCCCAGCCACCCCCGCCCCAGCCGCAGCCCCCACCACCCCCGCAGCAG       c.7560
 Q  P  A  V  A  Q  P  P  P  P  Q  P  Q  P  P  P  P  P  Q  Q         p.2520

          .         .         .         .         .         .       g.108570
 CCACCGCCACCGCTGCCACAACCACAGGCAGCGGGCAGCCAGCCGCCAGCAGGGCCACCA       c.7620
 P  P  P  P  L  P  Q  P  Q  A  A  G  S  Q  P  P  A  G  P  P         p.2540

          .         .         .         .         .         .       g.108630
 GCTGTCCAGCCCCAACCCCAGCCACAGCCCCAGACCCAGCCACAGCCTGTGCAGGCCCCA       c.7680
 A  V  Q  P  Q  P  Q  P  Q  P  Q  T  Q  P  Q  P  V  Q  A  P         p.2560

          .         .         .         .         | 44         .    g.110066
 GCGAAGGCGCAGCCCGCAATCACGACGGGGGGCAGTGCAGCCGTACTG | GCAGGAACCATT    c.7740
 A  K  A  Q  P  A  I  T  T  G  G  S  A  A  V  L   | A  G  T  I      p.2580

          .         .         .     | 45   .         .         .    g.110205
 AAAACATCAGTTACTGGGACGAGCATGCCCACTG | GTGCCGTGAGTGGAAATGTGATCGTG    c.7800
 K  T  S  V  T  G  T  S  M  P  T  G |   A  V  S  G  N  V  I  V      p.2600

          .         .         .         .         .         .       g.110265
 AACACCATCGCAGGGGTCCCAGCTGCCACCTTCCAGTCCATCAACAAGCGCCTGGCGTCG       c.7860
 N  T  I  A  G  V  P  A  A  T  F  Q  S  I  N  K  R  L  A  S         p.2620

          .         .     | 46   .         .         .         .    g.117226
 CCAGTGGCTCCTGGGGCCTTGACT | ACGCCGGGAGGCTCTGCTCCCGCCCAGGTGGTGCAC    c.7920
 P  V  A  P  G  A  L  T   | T  P  G  G  S  A  P  A  Q  V  V  H      p.2640

          .         .         .         .         .         .       g.117286
 ACCCAGCCCCCGCCACGGGCAGTCGGCTCCCCAGCCACGGCGACCCCTGACCTGGTGTCC       c.7980
 T  Q  P  P  P  R  A  V  G  S  P  A  T  A  T  P  D  L  V  S         p.2660

          .         .         .         .         .         .       g.117346
 ATGGCAACGACTCAGGGTGTTCGAGCGGTCACTTCTGTGACAGCCTCGGCCGTGGTCACT       c.8040
 M  A  T  T  Q  G  V  R  A  V  T  S  V  T  A  S  A  V  V  T         p.2680

          .         .         .         .         .         | 47    g.117548
 ACCAACCTGACCCCAGTGCAGACCCCGGCACGGTCTTTGGTGCCCCAAGTGTCCCAAG | CC    c.8100
 T  N  L  T  P  V  Q  T  P  A  R  S  L  V  P  Q  V  S  Q  A |       p.2700

          .         .         .         .         .         .       g.117608
 ACAGGAGTTCAGCTCCCTGGAAAAACCATCACACCTGCACATTTCCAGCTTCTCAGGCAG       c.8160
 T  G  V  Q  L  P  G  K  T  I  T  P  A  H  F  Q  L  L  R  Q         p.2720

          .         .         .         .         .         .       g.117668
 CAGCAGCAGCAGCAGCAACAACAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAG       c.8220
 Q  Q  Q  Q  Q  Q  Q  Q  Q  Q  Q  Q  Q  Q  Q  Q  Q  Q  Q  Q         p.2740

          .         .         .         .         .         .       g.117728
 CAGCAACAGCAGCAGCAGCAACAGACGACGACGACCTCTCAGGTGCAAGTTCCACAGATC       c.8280
 Q  Q  Q  Q  Q  Q  Q  Q  T  T  T  T  S  Q  V  Q  V  P  Q  I         p.2760

          .         .         .         .         .     | 48   .    g.119754
 CAGGGCCAGGCCCAGTCCCCAGCACAGATCAAAGCTGTGGGCAAGCTGACGCCG | GAACAC    c.8340
 Q  G  Q  A  Q  S  P  A  Q  I  K  A  V  G  K  L  T  P   | E  H      p.2780

          .         .         .         .         .         .       g.119814
 CTCATCAAAATGCAGAAGCAGAAACTGCAGATGCCCCCGCAGCCCCCACCGCCACAGGCC       c.8400
 L  I  K  M  Q  K  Q  K  L  Q  M  P  P  Q  P  P  P  P  Q  A         p.2800

          .         .         .         .         .         .       g.119874
 CAGTCTGCGCCCCCGCAGCCAACAGCCCAAGTGCAAGTGCAGACCTCGCAGCCGCCGCAG       c.8460
 Q  S  A  P  P  Q  P  T  A  Q  V  Q  V  Q  T  S  Q  P  P  Q         p.2820

          .         .         .         .         .         .       g.119934
 CAGCAGAGCCCCCAGCTCACGACGGTCACGGCCCCAAGGCCTGGTGCCCTGCTGACGGGC       c.8520
 Q  Q  S  P  Q  L  T  T  V  T  A  P  R  P  G  A  L  L  T  G         p.2840

          .         .         .    | 49    .         .         .    g.121881
 ACCACCGTGGCCAACCTCCAGGTGGCCCGGCTC | ACCCGGGTTCCCACTTCTCAGCTGCAG    c.8580
 T  T  V  A  N  L  Q  V  A  R  L   | T  R  V  P  T  S  Q  L  Q      p.2860

          .         .         .         .         .         .       g.121941
 GCGCAAGGGCAGATGCAGACCCAGGCACCCCAGCCAGCCCAGGTGGCCTTGGCGAAGCCT       c.8640
 A  Q  G  Q  M  Q  T  Q  A  P  Q  P  A  Q  V  A  L  A  K  P         p.2880

          .         .         .         .         .         .       g.122001
 CCGGTGGTGTCCGTCCCGGCAGCTGTGGTCTCCTCACCGGGAGTCACCACCCTGCCCATG       c.8700
 P  V  V  S  V  P  A  A  V  V  S  S  P  G  V  T  T  L  P  M         p.2900

          .         .         .         .          | 50        .    g.122453
 AACGTCGCGGGGATCAGCGTGGCGATCGGTCAGCCACAGAAGGCAGCAG | GACAGACCGTG    c.8760
 N  V  A  G  I  S  V  A  I  G  Q  P  Q  K  A  A  G |   Q  T  V      p.2920

          .         .         .         .         .         .       g.122513
 GTGGCCCAGCCCGTGCACATGCAGCAGCTGCTGAAGCTGAAGCAGCAGGCCGTCCAGCAG       c.8820
 V  A  Q  P  V  H  M  Q  Q  L  L  K  L  K  Q  Q  A  V  Q  Q         p.2940

          .         .         .         .         .     | 51   .    g.124581
 CAGAAGGCCATCCAGCCCCAGGCTGCACAGGGCCCGGCAGCCGTCCAGCAGAAG | ATCACC    c.8880
 Q  K  A  I  Q  P  Q  A  A  Q  G  P  A  A  V  Q  Q  K   | I  T      p.2960

          .         .         .         .         .         .       g.124641
 GCACAGCAGATCACCACCCCTGGCGCGCAGCAGAAGGTTGCCTACGCCGCGCAGCCGGCC       c.8940
 A  Q  Q  I  T  T  P  G  A  Q  Q  K  V  A  Y  A  A  Q  P  A         p.2980

          .         .         .         .         .         .       g.124701
 CTTAAGACCCAGTTTCTTACCACACCCATCTCCCAGGCCCAGAAACTGGCCGGGGCCCAG       c.9000
 L  K  T  Q  F  L  T  T  P  I  S  Q  A  Q  K  L  A  G  A  Q         p.3000

          .         .  | 52      .         .         .         .    g.131635
 CAAGTGCAGACCCAGATCCAG | GTTGCAAAACTTCCTCAAGTTGTTCAACAGCAAACACCC    c.9060
 Q  V  Q  T  Q  I  Q   | V  A  K  L  P  Q  V  V  Q  Q  Q  T  P      p.3020

          .         .         .          | 53        .         .    g.132502
 GTGGCCAGCATCCAGCAAGTTGCCTCTGCTTCCCAGCAG | GCTTCTCCACAGACTGTGGCG    c.9120
 V  A  S  I  Q  Q  V  A  S  A  S  Q  Q   | A  S  P  Q  T  V  A      p.3040

          .         .         .         .         .         .       g.132562
 CTCACGCAGGCGACGGCGGCCGGGCAGCAGGTGCAGATGATCCCTGCAGTGACCGCGACT       c.9180
 L  T  Q  A  T  A  A  G  Q  Q  V  Q  M  I  P  A  V  T  A  T         p.3060

          .         .         .         .         .         .       g.132622
 GCCCAGGTGGTTCAGCAGAAACTCATTCAGCAGCAGGTGGTGACCACGGCGTCGGCCCCG       c.9240
 A  Q  V  V  Q  Q  K  L  I  Q  Q  Q  V  V  T  T  A  S  A  P         p.3080

          .         .         .         .         .         .       g.132682
 CTCCAGACTCCAGGCGCTCCCAACCCAGCCCAGGTGCCCGCCAGCTCCGACAGCCCAAGC       c.9300
 L  Q  T  P  G  A  P  N  P  A  Q  V  P  A  S  S  D  S  P  S         p.3100

          .         .         .         .         .         .       g.132742
 CAGCAGCCCAAGTTACAGATGAGGGTCCCTGCTGTCAGGCTAAAGACACCTACTAAGCCT       c.9360
 Q  Q  P  K  L  Q  M  R  V  P  A  V  R  L  K  T  P  T  K  P         p.3120

          .                                                         g.132754
 CCGTGCCAGTAG                                                       c.9372
 P  C  Q  X                                                         p.3123

          .         .         .         .         .         .       g.132814
 tcagggcagcagggctgcctctcatctaaagcaaaactaccttcctcacagaaaacgctt       c.*60

          .         .         .         .         .         .       g.132874
 tattagtgaaccttgggaccatgtcacgcaagagattcagcactgggaaagatataattg       c.*120

          .         .         .         .         .         .       g.132934
 aaacaaaatagtgtaatcattttattaaaatgcatcccacactgcaggacaaatggtcct       c.*180

          .         .         .         .         .         .       g.132994
 tatggagtgccgcgttctctgtactacgtggctcatggaaaaagtgacaacatggcttcc       c.*240

          .         .         .         .         .         .       g.133054
 tctaaatcatttcacctttcagtccccacccgcacccgtcccctagagccatagtactgt       c.*300

          .         .         .         .         .         .       g.133114
 gttctgaaagccatttagaatttctttgtgagcatgtagtgctttgcacgccacagaagc       c.*360

          .         .         .         .         .         .       g.133174
 cgtctgccgtgtgtgaggagcatacaatggactttctaaagataaggcgtgggcttccac       c.*420

          .         .         .         .         .         .       g.133234
 agtgtctgccagagtttagttctttataccttactgaaaaatgcctcgtggtcttcgcag       c.*480

          .         .         .         .         .         .       g.133294
 aggggaaggcctgtctaaagtcaatcatccgagatgggttttccattccaaagaaaggca       c.*540

          .         .         .         .         .         .       g.133354
 atatggttccttccttccctcctaaaatatgacttaacttttaagagaaatgttctgaca       c.*600

          .         .         .         .         .         .       g.133414
 cccacctaaacacacaaggcacgttcctggcctgtgttcaagggaaatgatcagtcattg       c.*660

          .         .         .         .         .         .       g.133474
 cattgttattccaaagagcagccaacagtggcctcccccaggccctaccctgcaatggga       c.*720

          .         .         .         .         .         .       g.133534
 ttcgctttcatttaatggaaacttctgggactgatgcccaactcagtgcactcaagacgc       c.*780

          .         .         .         .         .         .       g.133594
 atctccagttttcgggggaagctggtatttgacatagtgtgttaaacagctcctgagaac       c.*840

          .         .         .         .         .         .       g.133654
 ctttgggacactctgccatggctggcgtgaggcccagaggaccacgcagaggcaatggta       c.*900

          .         .         .         .         .         .       g.133714
 gtacagatgtcacagctgagggtacgatgaggcctgggctcagtgagccaggacgaatgt       c.*960

          .         .         .         .         .         .       g.133774
 gacagacaccccttgctgccacagtcagccctttgacgaaggtgggctggtgattctgga       c.*1020

          .         .         .         .         .         .       g.133834
 agtattggctatagcggtgggcccagtcaactcttccttgtggacttacgacagcagatt       c.*1080

          .         .         .         .         .         .       g.133894
 ttctctaggataagcttgtgtggttctgccagtgaagcagagaaccacctgtgctgttgt       c.*1140

          .         .         .         .         .         .       g.133954
 ggaaggcgtgccgttgagggggaaaacgaagcccagtatttgctactgtttttccttttt       c.*1200

          .         .         .         .         .         .       g.134014
 ttactatgacaggaaaataaatgcaattttagtggaattgattgacagtgtctccttact       c.*1260

          .         .         .         .         .         .       g.134074
 ttgaagttttcaccaaagcaaaaaggtccatatccaatagtatcctttgtgctgtggctt       c.*1320

          .         .         .         .         .         .       g.134134
 gattttggcctattttacattatttggtccaggaaattaggttatattaggttttttgta       c.*1380

          .         .         .         .         .         .       g.134194
 tactaaaaatcagttatggcacaataaagattttctgtttttaaattgtatttcatctgc       c.*1440

          .         .         .         .         .         .       g.134254
 ttcctccccattctctcactttaagtgacattgaggaaggtattctgtcccacaggtttc       c.*1500

          .         .         .         .         .         .       g.134314
 tgtggacagcgatacagcaggagtcagtgaaatcaactggggagctcacttgagctcttg       c.*1560

          .         .         .         .         .         .       g.134374
 ataagaaatgtggagaaaagtaaaaaccaagctttgaagaaacagaagaaattaatcttt       c.*1620

          .         .         .         .         .         .       g.134434
 tagttagttgaacataccaaagcagaggactggaatctgtttgttctaaccaacccgttc       c.*1680

          .         .         .         .         .         .       g.134494
 tccctggcttggcacgtgccgtgagagcgcagcttgccggagggagggccgctgtgtgcg       c.*1740

          .         .         .         .         .         .       g.134554
 cctcacatctggctcccagtggaaacttttactcctcctcatccgcagatgtgatagaac       c.*1800

          .         .         .         .         .         .       g.134614
 tgaagtatctaggaattctgcctttgtcatttgttttaatttgtgtgccctgttcatttt       c.*1860

          .         .         .         .         .         .       g.134674
 ttttgtctttcccaaatcttggtagtctccttatagttgaagataaaatgttgagtgcac       c.*1920

          .         .         .         .         .         .       g.134734
 ttattttagaatatcctagacataactgtctaagtaaaagcgctctattaatctaaaaca       c.*1980

          .         .         .         .         .         .       g.134794
 ctacaagagaatttaacaccatctctcaaatgcttttttggagagcttaatgggattctg       c.*2040

          .         .         .         .         .         .       g.134854
 aatatttgcaatgtggagtttccgccccgatctcacgtcagtgagggtctcctgtctctc       c.*2100

          .         .         .         .         .         .       g.134914
 aagtgtgtttcctttggctgttccctaatacaaaacacggacatatttttactcgtagca       c.*2160

          .         .         .         .         .         .       g.134974
 ctcaatttagtaacttctagatgctaccgttgacctgagttaaattcatttagtcgtgta       c.*2220

          .         .         .         .         .         .       g.135034
 cgtaaaaactctccttttagtgtgttattttcttggccttcccttttaaaggttaaagtt       c.*2280

          .         .         .         .         .         .       g.135094
 tctaacctaagaattaagtacgcgttcaggaagctgttgtctaggccttccccttgtgaa       c.*2340

          .         .         .         .         .         .       g.135154
 tctgggttcattccaatacggcaagtaagagttggaaactttgagaacacagactataaa       c.*2400

          .         .         .         .         .         .       g.135214
 ggcagcagcccgaacactgtcagactctaattggcgaccctgggaaacagttgccctgct       c.*2460

          .         .         .         .         .         .       g.135274
 attctttaaagaaagacgtttattctgatgataaaaacagttagccagactgtttttaaa       c.*2520

          .         .         .         .         .         .       g.135334
 gcacctggcgggaagcagaaggttggatccaagcccttgttcagatttggtgcctgataa       c.*2580

          .         .         .         .         .         .       g.135394
 gacaggggtttctctttttgtgacctttattattattattttgttaactgttgtaaccag       c.*2640

          .         .         .         .         .         .       g.135454
 ttagctgttgtgttttaagatagaaaggaacaagactaaaattgtaaatactttgtaaac       c.*2700

          .         .         .         .         .         .       g.135514
 atcagcatttgtacttgaatagtaggattttaaagggcattgatagcataccaaacaaaa       c.*2760

          .         .         .                                     g.135547
 ggcaaaataaagtgacctttttatatatttttt                                  c.*2793

 (downstream sequence)

Legend:
Nucleotide numbering (following the HGVS rules for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation-initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The amino acid sequence of E1A binding protein p400 is shown below the coding DNA sequence, with numbering indicated at the right, starting with 1 for the translation-initiating Methionine. Every 10th amino acid is shown in bold. The position of each intron is indicated by a vertical line separating the two flanking exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frameshift variants, all stop codons in the +1 frame are shown in bold, while all stop codons in the +2 frame are underlined.
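
The numbering rules in the legend can be illustrated with a minimal Python sketch. The function names are hypothetical and are not part of LOVD. The sketch assumes that the "+1" and "+2" frames mean reading the coding sequence shifted by one or two nucleotides, and that the 3' UTR lies entirely within the last exon, so that c.*N maps linearly onto the g. numbering shown in this file (the stop codon ends at g.132754).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

# g. position of the last base of the stop codon (c.9372), as listed in this file.
STOP_END_G = 132754


def codon_number(c_pos: int) -> int:
    """Return the 1-based codon (amino acid) number that contains c.<c_pos>."""
    if c_pos < 1:
        raise ValueError("only coding positions (c.1 onward) map to a codon")
    return (c_pos - 1) // 3 + 1


def utr3_to_genomic(n: int, stop_end_g: int = STOP_END_G) -> int:
    """Map c.*n to a g. position, assuming no intron interrupts the 3' UTR."""
    if n < 1:
        raise ValueError("c.* positions start at *1")
    return stop_end_g + n


def stops_in_frame(cds: str, shift: int) -> list[int]:
    """Return 1-based c. positions of the first base of each stop codon
    found when reading `cds` in the frame shifted by `shift` nucleotides."""
    cds = cds.upper()
    return [
        i + 1
        for i in range(shift, len(cds) - 2, 3)
        if cds[i:i + 3] in STOP_CODONS
    ]


if __name__ == "__main__":
    print(codon_number(7740))      # row ending at c.7740 is labelled p.2580
    print(utr3_to_genomic(60))     # row ending at c.*60 is labelled g.132814
    print(stops_in_frame("CCGTGCCAGTAG", 0))  # last coding row, c.9361 to c.9372
```

Under these assumptions, the stop codon TAG at c.9370 to c.9372 is codon 3124, which follows the 3123 amino acids numbered in the listing. For the last coding row, add 9360 to the positions returned by `stops_in_frame` to obtain c. numbers.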

Powered by LOVD v.3.0 Build 30b
©2004-2025 Leiden University Medical Center