host cell factor C1 (VP16-accessory protein) (HCFC1) - coding DNA reference sequence

(used for variant description)

(last modified April 22, 2020)


This file was created to facilitate the description of sequence variants on transcript NM_005334.2 in the HCFC1 gene based on a coding DNA reference sequence following the HGVS recommendations.

The sequence was taken from NG_012513.1, covering HCFC1 transcript NM_005334.2.


Please note that introns are available by clicking on the exon numbers above the sequence.
 (upstream sequence)
                     .         .         .         .                g.5048
             aagatggcggctcccacaattgtggtctgagcgccggcggggctgcga       c.-481

 .         .         .         .         .         .                g.5108
 cgaccgcggcgcttgtggctcctttccacccgccccggaagccggccagtgctggggact       c.-421

 .         .         .         .         .         .                g.5168
 tgggggtgtgggaaccgggccggggctccgccatttccggcgggggagggctacgactga       c.-361

 .         .         .         .         .         .                g.5228
 ggaagggaggagagagaggcggctcaagatggcggctcccagggcctcccgcccgagctt       c.-301

 .         .         .         .         .         .                g.5288
 gtaagcgggagcgcccggacaagtagtcggggcgacgggactcagcggcctccagcttct       c.-241

 .         .         .         .         .         .                g.5348
 tgagcctaggcgctcgacagtttcgggcggctcttgcggagacggggtgagcgagaagaa       c.-181

 .         .         .         .         .         .                g.5408
 agggaagagccaaagggaaggagggcagttaagatggcggcctccatggagtcgtctacc       c.-121

 .         .         .         .         .         .                g.5468
 gctgtgtgagaaaccgcttctccgtgagagctgccttagacgaaagggggtgtgtgaaag       c.-61

 .         .         .         .         .         .                g.5528
 gaattgaggggctcccttcccgcttgttgacttctccccaccgcaccctttcccggaact       c.-1

          .         .         .         .         .         .       g.5588
 ATGGCTTCGGCCGTGTCGCCCGCCAACTTGCCAGCGGTGCTTCTGCAGCCCCGCTGGAAG       c.60
 M  A  S  A  V  S  P  A  N  L  P  A  V  L  L  Q  P  R  W  K         p.20

          .         .         .         .         .         .       g.5648
 CGAGTGGTGGGCTGGTCGGGTCCGGTGCCACGGCCCCGCCACGGCCACCGCGCCGTGGCC       c.120
 R  V  V  G  W  S  G  P  V  P  R  P  R  H  G  H  R  A  V  A         p.40

          .         .         .         .         .         .       g.5708
 ATCAAGGAGCTCATCGTGGTGTTTGGCGGCGGCAACGAGGGAATAGTGGACGAACTGCAC       c.180
 I  K  E  L  I  V  V  F  G  G  G  N  E  G  I  V  D  E  L  H         p.60

          .    | 02    .         .         .         .         .    g.11689
 GTGTACAACACGG | CAACCAACCAGTGGTTCATCCCAGCCGTGAGGGGGGACATTCCCCCT    c.240
 V  Y  N  T  A |   T  N  Q  W  F  I  P  A  V  R  G  D  I  P  P      p.80

          .         .         .         .         .         .       g.11749
 GGGTGTGCAGCCTATGGCTTCGTGTGTGACGGGACTCGCCTCCTGGTGTTTGGTGGGATG       c.300
 G  C  A  A  Y  G  F  V  C  D  G  T  R  L  L  V  F  G  G  M         p.100

          .         .         .         .   | 03     .         .    g.12102
 GTGGAGTATGGGAAATACAGCAATGACCTCTACGAACTCCAG | GCGAGCCGGTGGGAGTGG    c.360
 V  E  Y  G  K  Y  S  N  D  L  Y  E  L  Q   | A  S  R  W  E  W      p.120

          .         .         .         .         .         .       g.12162
 AAGAGACTCAAAGCAAAGACGCCCAAAAACGGGCCCCCTCCGTGTCCTCGACTCGGGCAC       c.420
 K  R  L  K  A  K  T  P  K  N  G  P  P  P  C  P  R  L  G  H         p.140

          .         .         .         .         .         .       g.12222
 AGCTTCTCCCTTGTGGGCAACAAATGCTACCTGTTTGGGGGTCTGGCCAATGATAGCGAG       c.480
 S  F  S  L  V  G  N  K  C  Y  L  F  G  G  L  A  N  D  S  E         p.160

          .         .    | 04    .         .         .         .    g.12972
 GACCCAAAGAACAACATTCCAAG | GTACCTGAATGACTTATATATCCTGGAATTACGGCCA    c.540
 D  P  K  N  N  I  P  R  |  Y  L  N  D  L  Y  I  L  E  L  R  P      p.180

          .         .         .         .         .         .       g.13032
 GGCTCTGGAGTGGTAGCCTGGGACATTCCCATCACTTACGGGGTCCTACCACCACCCCGG       c.600
 G  S  G  V  V  A  W  D  I  P  I  T  Y  G  V  L  P  P  P  R         p.200

          .         .         .         .         .         .       g.13092
 GAGTCACATACTGCCGTGGTCTACACCGAAAAAGACAATAAGAAGTCCAAGCTGGTGATC       c.660
 E  S  H  T  A  V  V  Y  T  E  K  D  N  K  K  S  K  L  V  I         p.220

          .         .         .         .         .   | 05     .    g.14070
 TACGGCGGGATGAGTGGCTGCAGGCTGGGGGACCTGTGGACCCTAGATATTG | ACACCCTG    c.720
 Y  G  G  M  S  G  C  R  L  G  D  L  W  T  L  D  I  D |   T  L      p.240

          .         .         .         .         .         .       g.14130
 ACGTGGAATAAGCCCAGTCTCAGCGGGGTGGCGCCTCTTCCTCGCAGTCTCCACTCGGCA       c.780
 T  W  N  K  P  S  L  S  G  V  A  P  L  P  R  S  L  H  S  A         p.260

          .        | 06.         .         .         .         .    g.14763
 ACCACCATCGGAAATAA | AATGTACGTGTTTGGTGGCTGGGTGCCTCTCGTCATGGATGAC    c.840
 T  T  I  G  N  K  |  M  Y  V  F  G  G  W  V  P  L  V  M  D  D      p.280

          .         .         .         .         .         .       g.14823
 GTCAAAGTGGCCACACACGAGAAGGAGTGGAAGTGTACCAACACGCTGGCTTGTCTCAAC       c.900
 V  K  V  A  T  H  E  K  E  W  K  C  T  N  T  L  A  C  L  N         p.300

      | 07   .         .         .         .         .         .    g.16010
 CTGG | ATACCATGGCCTGGGAGACCATCCTGATGGATACACTGGAGGACAACATCCCCCGT    c.960
 L  D |   T  M  A  W  E  T  I  L  M  D  T  L  E  D  N  I  P  R      p.320

          .         .         .         .         .         .       g.16070
 GCTCGGGCTGGCCACTGCGCAGTCGCCATCAACACCCGCCTGTACATTTGGAGTGGGCGT       c.1020
 A  R  A  G  H  C  A  V  A  I  N  T  R  L  Y  I  W  S  G  R         p.340

          .         .         .         .         .         .       g.16130
 GACGGCTACCGCAAGGCCTGGAACAACCAGGTCTGCTGCAAGGACCTCTGGTACCTAGAG       c.1080
 D  G  Y  R  K  A  W  N  N  Q  V  C  C  K  D  L  W  Y  L  E         p.360

      | 08   .         .         .         .         .         .    g.16263
 ACAG | AAAAGCCACCACCCCCAGCCCGAGTACAACTGGTACGCGCCAACACCAACTCCCTG    c.1140
 T  E |   K  P  P  P  P  A  R  V  Q  L  V  R  A  N  T  N  S  L      p.380

          .         .         .         .         .         .       g.16323
 GAGGTGAGCTGGGGGGCAGTGGCAACAGCCGACAGCTACCTTCTCCAGCTCCAGAAATAT       c.1200
 E  V  S  W  G  A  V  A  T  A  D  S  Y  L  L  Q  L  Q  K  Y         p.400

          .         .         .         .         .         .       g.16383
 GACATTCCTGCCACGGCTGCTACTGCCACCTCCCCTACACCCAATCCGGTCCCATCTGTG       c.1260
 D  I  P  A  T  A  A  T  A  T  S  P  T  P  N  P  V  P  S  V         p.420

          .         .         .         .         .         .       g.16443
 CCTGCCAACCCTCCCAAGAGCCCTGCCCCAGCAGCAGCCGCACCTGCTGTGCAGCCGCTG       c.1320
 P  A  N  P  P  K  S  P  A  P  A  A  A  A  P  A  V  Q  P  L         p.440

          .         .         .         .         .         .       g.16503
 ACCCAAGTAGGCATCACGCTCCTGCCCCAGGCTGCCCCCGCACCCCCGACCACCACCACC       c.1380
 T  Q  V  G  I  T  L  L  P  Q  A  A  P  A  P  P  T  T  T  T         p.460

          .         .         .         .         .         .       g.16563
 ATCCAGGTCTTGCCAACGGTGCCTGGCAGCTCCATTTCTGTGCCCACCGCAGCCAGGACT       c.1440
 I  Q  V  L  P  T  V  P  G  S  S  I  S  V  P  T  A  A  R  T         p.480

      | 09   .         .         .         .         .         .    g.16933
 CAAG | GTGTCCCTGCTGTTCTCAAAGTGACCGGTCCTCAGGCTACAACAGGAACTCCATTG    c.1500
 Q  G |   V  P  A  V  L  K  V  T  G  P  Q  A  T  T  G  T  P  L      p.500

          .         .         .         .         .         .       g.16993
 GTCACCATGCGACCTGCCAGCCAGGCTGGGAAAGCCCCTGTCACCGTGACCTCCCTTCCC       c.1560
 V  T  M  R  P  A  S  Q  A  G  K  A  P  V  T  V  T  S  L  P         p.520

          .         .         .         .      | 10  .         .    g.17617
 GCCGGAGTGCGGATGGTTGTGCCAACACAGAGTGCCCAGGGAACG | GTGATTGGCAGTAGC    c.1620
 A  G  V  R  M  V  V  P  T  Q  S  A  Q  G  T   | V  I  G  S  S      p.540

          .         .         .         .         .         .       g.17677
 CCACAGATGAGTGGGATGGCCGCACTGGCCGCTGCGGCCGCTGCCACCCAGAAGATCCCC       c.1680
 P  Q  M  S  G  M  A  A  L  A  A  A  A  A  A  T  Q  K  I  P         p.560

          .         .         .         .         .         .       g.17737
 CCTTCCTCGGCACCCACGGTGCTGAGTGTCCCAGCGGGTACCACCATCGTGAAGACCATG       c.1740
 P  S  S  A  P  T  V  L  S  V  P  A  G  T  T  I  V  K  T  M         p.580

          .         .         .         .         .         .       g.17797
 GCTGTGACACCTGGCACTACCACCCTCCCAGCCACTGTGAAGGTGGCCTCCTCGCCAGTC       c.1800
 A  V  T  P  G  T  T  T  L  P  A  T  V  K  V  A  S  S  P  V         p.600

     | 11    .         .         .         .         .         .    g.18176
 ATG | GTGAGCAACCCTGCCACTCGCATGCTGAAGACTGCAGCCGCCCAGGTGGGGACATCG    c.1860
 M   | V  S  N  P  A  T  R  M  L  K  T  A  A  A  Q  V  G  T  S      p.620

          .         .         .         .         .         .       g.18236
 GTTTCCTCCGCCACCAACACGTCTACCCGCCCTATCATCACAGTGCACAAGTCAGGCACT       c.1920
 V  S  S  A  T  N  T  S  T  R  P  I  I  T  V  H  K  S  G  T         p.640

          .         .         .         .         .         .       g.18296
 GTGACAGTGGCCCAGCAAGCCCAGGTGGTGACCACAGTTGTGGGCGGGGTCACCAAGACC       c.1980
 V  T  V  A  Q  Q  A  Q  V  V  T  T  V  V  G  G  V  T  K  T         p.660

          .         .         .         .         | 12         .    g.18494
 ATCACCCTGGTGAAGAGCCCCATCTCTGTCCCAGGAGGCAGTGCTCTG | ATTTCCAATCTG    c.2040
 I  T  L  V  K  S  P  I  S  V  P  G  G  S  A  L   | I  S  N  L      p.680

          .         .         .         .         .         .       g.18554
 GGCAAAGTGATGTCGGTGGTCCAGACCAAACCAGTTCAGACTTCAGCAGTCACAGGCCAG       c.2100
 G  K  V  M  S  V  V  Q  T  K  P  V  Q  T  S  A  V  T  G  Q         p.700

          .         .         .    | 13    .         .         .    g.18862
 GCGTCCACGGGTCCTGTGACTCAGATCATCCAG | ACCAAAGGGCCCCTGCCAGCGGGAACA    c.2160
 A  S  T  G  P  V  T  Q  I  I  Q   | T  K  G  P  L  P  A  G  T      p.720

          .         .         .         .         .         .       g.18922
 ATCCTGAAGCTGGTGACCTCAGCAGATGGCAAGCCCACCACCATCATCACTACCACGCAG       c.2220
 I  L  K  L  V  T  S  A  D  G  K  P  T  T  I  I  T  T  T  Q         p.740

          .         .         .         .         .         .       g.18982
 GCCAGTGGGGCGGGGACCAAGCCCACCATCCTGGGCATCAGCAGCGTCTCCCCCAGTACC       c.2280
 A  S  G  A  G  T  K  P  T  I  L  G  I  S  S  V  S  P  S  T         p.760

          .         .         .         .         .         .       g.19042
 ACCAAGCCCGGCACGACCACCATCATCAAAACCATCCCCATGTCGGCCATCATCACCCAG       c.2340
 T  K  P  G  T  T  T  I  I  K  T  I  P  M  S  A  I  I  T  Q         p.780

          .    | 14    .         .         .         .         .    g.19355
 GCGGGCGCCACGG | GTGTGACCAGCAGTCCTGGCATCAAGTCCCCCATCACCATCATCACC    c.2400
 A  G  A  T  G |   V  T  S  S  P  G  I  K  S  P  I  T  I  I  T      p.800

          .         .         .         .         .         .       g.19415
 ACCAAGGTGATGACTTCAGGAACTGGAGCACCTGCGAAAATCATCACTGCTGTCCCCAAA       c.2460
 T  K  V  M  T  S  G  T  G  A  P  A  K  I  I  T  A  V  P  K         p.820

          .         .         .       | 15 .         .         .    g.19629
 ATTGCCACTGGCCACGGGCAGCAGGGAGTGACCCAG | GTGGTGCTTAAGGGGGCCCCGGGA    c.2520
 I  A  T  G  H  G  Q  Q  G  V  T  Q   | V  V  L  K  G  A  P  G      p.840

          .         .         .         .         .         .       g.19689
 CAGCCAGGCACCATCCTCCGCACTGTGCCCATGGGGGGTGTTCGCCTGGTCACACCCGTC       c.2580
 Q  P  G  T  I  L  R  T  V  P  M  G  G  V  R  L  V  T  P  V         p.860

          .         .         .         .         .      | 16  .    g.19962
 ACCGTCTCCGCCGTCAAGCCAGCCGTCACCACGTTGGTTGTGAAAGGCACCACAG | GTGTC    c.2640
 T  V  S  A  V  K  P  A  V  T  T  L  V  V  K  G  T  T  G |   V      p.880

          .         .         .         .         .         .       g.20022
 ACGACCCTAGGCACAGTGACAGGCACCGTCTCCACCAGCCTTGCCGGGGCGGGGGGCCAC       c.2700
 T  T  L  G  T  V  T  G  T  V  S  T  S  L  A  G  A  G  G  H         p.900

          .         .         .         .         .         .       g.20082
 AGCACTAGTGCTTCCCTGGCCACGCCCATCACCACCTTGGGCACCATTGCCACCCTCTCA       c.2760
 S  T  S  A  S  L  A  T  P  I  T  T  L  G  T  I  A  T  L  S         p.920

          .         .         .         .         .         .       g.20142
 AGCCAGGTGATCAACCCCACTGCCATCACTGTGTCGGCCGCACAGACCACGCTGACAGCG       c.2820
 S  Q  V  I  N  P  T  A  I  T  V  S  A  A  Q  T  T  L  T  A         p.940

          .         .         .       | 17 .         .         .    g.20850
 GCAGGCGGGCTCACAACCCCAACCATCACCATGCAG | CCCGTGTCCCAGCCCACCCAGGTA    c.2880
 A  G  G  L  T  T  P  T  I  T  M  Q   | P  V  S  Q  P  T  Q  V      p.960

          .         .         .         .         .         .       g.20910
 ACTCTGATCACGGCACCTAGTGGGGTGGAGGCCCAGCCTGTGCATGACCTCCCTGTGTCC       c.2940
 T  L  I  T  A  P  S  G  V  E  A  Q  P  V  H  D  L  P  V  S         p.980

          .         .         .         .         .         .       g.20970
 ATTCTGGCCTCCCCGACTACAGAACAGCCCACCGCCACAGTTACCATCGCCGACTCAGGC       c.3000
 I  L  A  S  P  T  T  E  Q  P  T  A  T  V  T  I  A  D  S  G         p.1000

          .         .         .         .         .         .       g.21030
 CAGGGTGATGTGCAGCCTGGCACTGTCACCTTGGTGTGCTCCAACCCACCCTGTGAGACC       c.3060
 Q  G  D  V  Q  P  G  T  V  T  L  V  C  S  N  P  P  C  E  T         p.1020

          .         .         .         .         .         .       g.21090
 CACGAGACTGGCACCACCAACACGGCCACCACTACTGTTGTGGCTAACCTTGGGGGACAC       c.3120
 H  E  T  G  T  T  N  T  A  T  T  T  V  V  A  N  L  G  G  H         p.1040

          .         .         .         .         .         .       g.21150
 CCCCAGCCCACCCAAGTGCAGTTCGTCTGTGACAGACAGGAGGCAGCTGCTTCTCTTGTG       c.3180
 P  Q  P  T  Q  V  Q  F  V  C  D  R  Q  E  A  A  A  S  L  V         p.1060

          .         .         .         .         .         .       g.21210
 ACCTCGACTGTGGGCCAGCAGAATGGTAGCGTGGTCCGAGTCTGTTCGAACCCGCCCTGC       c.3240
 T  S  T  V  G  Q  Q  N  G  S  V  V  R  V  C  S  N  P  P  C         p.1080

          .         .         .         .         .         .       g.21270
 GAGACCCACGAGACGGGCACCACCAACACCGCCACCACCGCCACCTCCAACATGGCCGGG       c.3300
 E  T  H  E  T  G  T  T  N  T  A  T  T  A  T  S  N  M  A  G         p.1100

          .         .         .         .         .         .       g.21330
 CAGCATGGCTGCTCAAACCCACCCTGCGAGACCCACGAGACGGGCACCACCAACACTGCC       c.3360
 Q  H  G  C  S  N  P  P  C  E  T  H  E  T  G  T  T  N  T  A         p.1120

          .         .         .         .         .         .       g.21390
 ACTACAGCCATGTCGAGCGTCGGCGCCAACCACCAGCGAGATGCCCGTCGGGCCTGTGCA       c.3420
 T  T  A  M  S  S  V  G  A  N  H  Q  R  D  A  R  R  A  C  A         p.1140

          .         .         .         .         .         .       g.21450
 GCTGGCACCCCTGCCGTGATCCGGATCAGTGTGGCCACTGGGGCGCTGGAGGCAGCCCAG       c.3480
 A  G  T  P  A  V  I  R  I  S  V  A  T  G  A  L  E  A  A  Q         p.1160

          .         .         .         .         .         .       g.21510
 GGCTCTAAGTCCCAGTGCCAAACCCGCCAGACCAGCGCGACCAGCACCACCATGACTGTG       c.3540
 G  S  K  S  Q  C  Q  T  R  Q  T  S  A  T  S  T  T  M  T  V         p.1180

          .         .         .         .         .         .       g.21570
 ATGGCCACCGGGGCCCCGTGCTCGGCCGGCCCACTCCTTGGGCCGAGCATGGCACGGGAG       c.3600
 M  A  T  G  A  P  C  S  A  G  P  L  L  G  P  S  M  A  R  E         p.1200

          .         .         .         .         .         .       g.21630
 CCCGGGGGCCGCAGCCCTGCTTTTGTGCAGTTGGCCCCTCTGAGCAGCAAAGTCAGGCTG       c.3660
 P  G  G  R  S  P  A  F  V  Q  L  A  P  L  S  S  K  V  R  L         p.1220

          .         .         .         .         .         .       g.21690
 AGCAGCCCAAGCATTAAGGACCTTCCTGCGGGGCGCCACAGCCATGCGGTCAGCACCGCT       c.3720
 S  S  P  S  I  K  D  L  P  A  G  R  H  S  H  A  V  S  T  A         p.1240

          .         .         .         .         .         .       g.21750
 GCCATGACCCGTTCCAGCGTGGGTGCTGGGGAGCCCCGCATGGCACCTGTGTGCGAGAGC       c.3780
 A  M  T  R  S  S  V  G  A  G  E  P  R  M  A  P  V  C  E  S         p.1260

          .         .         .         .         .         .       g.21810
 CTCCAGGGTGGCTCGCCCAGCACCACAGTGACTGTGACAGCCCTGGAGGCACTGCTGTGC       c.3840
 L  Q  G  G  S  P  S  T  T  V  T  V  T  A  L  E  A  L  L  C         p.1280

          .         .         .         .         .         .       g.21870
 CCCTCGGCCACCGTGACCCAAGTCTGCTCCAACCCACCATGTGAGACCCACGAGACAGGC       c.3900
 P  S  A  T  V  T  Q  V  C  S  N  P  P  C  E  T  H  E  T  G         p.1300

          .         .         .         .         .         .       g.21930
 ACCACCAACACCGCCACTACCTCGAATGCAGGCAGCGCCCAGAGGGTGTGCTCCAACCCG       c.3960
 T  T  N  T  A  T  T  S  N  A  G  S  A  Q  R  V  C  S  N  P         p.1320

          .         .         .         .         .         .       g.21990
 CCATGCGAGACCCACGAGACGGGCACCACCCACACGGCCACCACCGCTACTTCAAACGGG       c.4020
 P  C  E  T  H  E  T  G  T  T  H  T  A  T  T  A  T  S  N  G         p.1340

          .         .         .         .         .         .       g.22050
 GGCACGGGCCAGCCCGAGGGTGGGCAGCAGCCCCCTGCTGGTCGCCCCTGTGAGACACAC       c.4080
 G  T  G  Q  P  E  G  G  Q  Q  P  P  A  G  R  P  C  E  T  H         p.1360

          .         .         .         .         .         .       g.22110
 CAGACCACTTCCACTGGCACCACCATGTCGGTCAGCGTGGGTGCCCTGCTTCCCGACGCC       c.4140
 Q  T  T  S  T  G  T  T  M  S  V  S  V  G  A  L  L  P  D  A         p.1380

          .         .         .         .         .         .       g.22170
 ACTTCTTCCCACAGGACCGTGGAGTCTGGCCTAGAGGTGGCGGCGGCACCCAGCGTCACC       c.4200
 T  S  S  H  R  T  V  E  S  G  L  E  V  A  A  A  P  S  V  T         p.1400

          .         .         .         .         .         .       g.22230
 CCCCAGGCTGGCACCGCGCTGCTGGCTCCTTTCCCAACACAGAGGGTGTGCTCCAACCCC       c.4260
 P  Q  A  G  T  A  L  L  A  P  F  P  T  Q  R  V  C  S  N  P         p.1420

          .         .         .         .         .         .       g.22290
 CCCTGTGAGACCCACGAGACGGGCACCACTCACACGGCCACCACTGTCACTTCCAACATG       c.4320
 P  C  E  T  H  E  T  G  T  T  H  T  A  T  T  V  T  S  N  M         p.1440

          .    | 18    .         .         .         .         .    g.22645
 AGTTCAAACCAAG | ACCCCCCACCTGCTGCCAGCGATCAGGGAGAGGTGGAGAGCACCCAG    c.4380
 S  S  N  Q  D |   P  P  P  A  A  S  D  Q  G  E  V  E  S  T  Q      p.1460

          .         .         .         .         .         .       g.22705
 GGCGACAGCGTGAACATCACCAGCTCCAGTGCCATCACGACAACCGTGTCCTCCACACTG       c.4440
 G  D  S  V  N  I  T  S  S  S  A  I  T  T  T  V  S  S  T  L         p.1480

          .         .         .         .         .        | 19.    g.23413
 ACGCGGGCTGTGACCACCGTGACGCAGTCCACACCGGTCCCGGGCCCCTCTGTGCCG | CCC    c.4500
 T  R  A  V  T  T  V  T  Q  S  T  P  V  P  G  P  S  V  P   | P      p.1500

          .         .         .         .         .         .       g.23473
 CCAGAGGAACTCCAGGTGTCGCCAGGTCCTCGCCAGCAGCTGCCGCCACGGCAGCTTCTG       c.4560
 P  E  E  L  Q  V  S  P  G  P  R  Q  Q  L  P  P  R  Q  L  L         p.1520

          .         .         .         .         .         .       g.23533
 CAGTCGGCTTCCACAGCCCTGATGGGGGAGTCCGCCGAGGTCCTGTCAGCCTCCCAGACC       c.4620
 Q  S  A  S  T  A  L  M  G  E  S  A  E  V  L  S  A  S  Q  T         p.1540

          .         .         .         .         .         .       g.23593
 CCTGAGCTCCCGGCCGCCGTGGATCTGAGCAGCACAGGGGAGCCATCTTCGGGCCAGGAG       c.4680
 P  E  L  P  A  A  V  D  L  S  S  T  G  E  P  S  S  G  Q  E         p.1560

          .         .         .         .         .         .       g.23653
 TCTGCCGGCTCTGCGGTGGTGGCCACTGTGGTGGTCCAGCCACCCCCACCCACACAGTCC       c.4740
 S  A  G  S  A  V  V  A  T  V  V  V  Q  P  P  P  P  T  Q  S         p.1580

          .         .         .         .         .         .       g.23713
 GAAGTAGACCAGTTATCACTTCCCCAAGAGCTAATGGCCGAGGCCCAAGCTGGCACCACC       c.4800
 E  V  D  Q  L  S  L  P  Q  E  L  M  A  E  A  Q  A  G  T  T         p.1600

          .         .         .         .         .         .       g.23773
 ACCCTCATGGTAACGGGGCTCACCCCCGAGGAGCTGGCAGTGACGGCTGCTGCAGAAGCA       c.4860
 T  L  M  V  T  G  L  T  P  E  E  L  A  V  T  A  A  A  E  A         p.1620

          .         .         .         .         .         .       g.23833
 GCTGCCCAGGCCGCAGCCACGGAGGAAGCCCAGGCCCTGGCCATCCAGGCGGTGCTCCAG       c.4920
 A  A  Q  A  A  A  T  E  E  A  Q  A  L  A  I  Q  A  V  L  Q         p.1640

          .         .   | 20     .         .         .         .    g.24248
 GCCGCGCAGCAGGCCGTCATGG | GCACCGGCGAGCCCATGGACACCTCCGAGGCAGCAGCA    c.4980
 A  A  Q  Q  A  V  M  G |   T  G  E  P  M  D  T  S  E  A  A  A      p.1660

          .         .         .         .         .         .       g.24308
 ACCGTGACTCAGGCGGAGCTGGGGCACCTGTCGGCCGAGGGTCAGGAGGGCCAGGCCACC       c.5040
 T  V  T  Q  A  E  L  G  H  L  S  A  E  G  Q  E  G  Q  A  T         p.1680

          .         .         .         .         .         .       g.24368
 ACCATACCCATTGTGCTGACACAGCAGGAGCTGGCTGCCCTGGTGCAGCAGCAGCAGCTG       c.5100
 T  I  P  I  V  L  T  Q  Q  E  L  A  A  L  V  Q  Q  Q  Q  L         p.1700

          .         .         .         .         .         .       g.24428
 CAGGAGGCCCAGGCCCAGCAGCAGCATCACCACCTCCCCACTGAGGCCCTGGCCCCTGCC       c.5160
 Q  E  A  Q  A  Q  Q  Q  H  H  H  L  P  T  E  A  L  A  P  A         p.1720

          .         .         .         .         .         .       g.24488
 GACAGTCTCAACGACCCAGCCATTGAGAGCAATTGCCTCAATGAGCTGGCCGGCACGGTC       c.5220
 D  S  L  N  D  P  A  I  E  S  N  C  L  N  E  L  A  G  T  V         p.1740

          .         .         .         . | 21       .         .    g.24681
 CCCAGCACTGTGGCGCTGCTGCCCTCAACGGCCACTGAGA | GCCTGGCTCCATCCAACACA    c.5280
 P  S  T  V  A  L  L  P  S  T  A  T  E  S |   L  A  P  S  N  T      p.1760

          .         .         .         .         .         .       g.24741
 TTTGTGGCCCCCCAGCCGGTTGTGGTGGCCAGCCCAGCCAAGCTGCAGGCTGCAGCTACC       c.5340
 F  V  A  P  Q  P  V  V  V  A  S  P  A  K  L  Q  A  A  A  T         p.1780

          .         .         .          | 22        .         .    g.24902
 CTGACCGAAGTGGCCAATGGCATCGAGTCCCTGGGTGTG | AAGCCAGACCTGCCGCCCCCA    c.5400
 L  T  E  V  A  N  G  I  E  S  L  G  V   | K  P  D  L  P  P  P      p.1800

          .         .         .         .         .         .       g.24962
 CCCAGCAAAGCCCCCATGAAGAAGGAAAACCAGTGGTTTGATGTGGGAGTCATTAAGGGC       c.5460
 P  S  K  A  P  M  K  K  E  N  Q  W  F  D  V  G  V  I  K  G         p.1820

          .         .         .         .         .        | 23.    g.25373
 ACCAATGTAATGGTGACACACTATTTCCTGCCACCAGATGATGCTGTCCCATCAGAC | GAT    c.5520
 T  N  V  M  V  T  H  Y  F  L  P  P  D  D  A  V  P  S  D   | D      p.1840

          .         .         .         .         .         .       g.25433
 GATTTGGGCACCGTCCCTGACTATAACCAGCTGAAGAAGCAGGAGCTGCAGCCAGGCACA       c.5580
 D  L  G  T  V  P  D  Y  N  Q  L  K  K  Q  E  L  Q  P  G  T         p.1860

          .         .         .         .         .         .       g.25493
 GCCTATAAGTTTCGTGTTGCCGGAATCAATGCCTGTGGCCGGGGGCCCTTCAGCGAAATC       c.5640
 A  Y  K  F  R  V  A  G  I  N  A  C  G  R  G  P  F  S  E  I         p.1880

          .         .         .         .         .         .       g.25553
 TCAGCCTTTAAGACGTGCCTGCCTGGTTTCCCAGGGGCCCCTTGTGCCATTAAAATCAGC       c.5700
 S  A  F  K  T  C  L  P  G  F  P  G  A  P  C  A  I  K  I  S         p.1900

     | 24    .         .         .         .         .         .    g.25882
 AAA | AGTCCGGATGGTGCTCACCTCACCTGGGAGCCACCCTCTGTGACCTCCGGCAAGATT    c.5760
 K   | S  P  D  G  A  H  L  T  W  E  P  P  S  V  T  S  G  K  I      p.1920

          .         .         .         .         .         .       g.25942
 ATCGAGTACTCCGTGTACCTGGCCATCCAGAGCTCACAGGCTGGGGGCGAGCTCAAGAGC       c.5820
 I  E  Y  S  V  Y  L  A  I  Q  S  S  Q  A  G  G  E  L  K  S         p.1940

          .         .         .         .         .         .       g.26002
 TCCACCCCGGCCCAGCTGGCCTTCATGCGGGTGTACTGCGGGCCCAGCCCCTCCTGCCTG       c.5880
 S  T  P  A  Q  L  A  F  M  R  V  Y  C  G  P  S  P  S  C  L         p.1960

          .         .         .         .         .         .       g.26062
 GTGCAGTCCTCCAGCCTTTCCAACGCCCACATCGACTACACCACCAAGCCCGCCATCATC       c.5940
 V  Q  S  S  S  L  S  N  A  H  I  D  Y  T  T  K  P  A  I  I         p.1980

          .         .         .         .         .         .       g.26122
 TTCCGCATCGCCGCCCGCAATGAGAAGGGCTATGGCCCGGCCACACAAGTGAGGTGGCTG       c.6000
 F  R  I  A  A  R  N  E  K  G  Y  G  P  A  T  Q  V  R  W  L         p.2000

      | 25   .         .         .         .         .         .    g.26808
 CAGG | AAACCAGTAAAGACAGCTCTGGCACCAAGCCAGCCAACAAGCGGCCCATGTCCTCT    c.6060
 Q  E |   T  S  K  D  S  S  G  T  K  P  A  N  K  R  P  M  S  S      p.2020

          | 26         .         .         .                        g.27034
 CCAGAAAT | GAAATCTGCTCCAAAGAAATCTAAGGCCGATGGTCAGTGA                c.6108
 P  E  M  |  K  S  A  P  K  K  S  K  A  D  G  Q  X                  p.2035

          .         .         .         .         .         .       g.27094
 gaggaagctgactagcccctggattcttctccagacccccctgcttcaggaacacccgcc       c.*60

          .         .         .         .         .         .       g.27154
 agggcccacccctcccaccccgtcccagcattcgcacttcaccctcgcgagccgctgttc       c.*120

          .         .         .         .         .         .       g.27214
 actcctctcccctttctctttctctctgtttttaaaataatctaaagaaagcacatttta       c.*180

          .         .         .         .         .         .       g.27274
 ccattgctgttgggaggaagcagaggcagatgggaaagcagagagaggagcgcgcttcct       c.*240

          .         .         .         .         .         .       g.27334
 ttcctccccgctgccgcccaccctggggagagacttttgcggggagggaaggcggagctg       c.*300

          .         .         .         .         .         .       g.27394
 aggacagccagctccgccctcccaaggctgtgcgttcctgagggccaggtcgggggcagg       c.*360

          .         .         .         .         .         .       g.27454
 catggaggggaggaaaggcgtccctcttggccctccccagagtggctttcctggcaccct       c.*420

          .         .         .         .         .         .       g.27514
 ggcctgggtgtctggttctgttttcttttcttccccttgtgtttccagtcacctaacttc       c.*480

          .         .         .         .         .         .       g.27574
 ccttcctcaggctcccccggcccaccctgctcagtgaccccacaggaagcttacacattt       c.*540

          .         .         .         .         .         .       g.27634
 tctcagaggcctttgtgctcccacctcttctaccctccccctcttctttcccattttaaa       c.*600

          .         .         .         .         .         .       g.27694
 aaagaaaagaaggaaaaagaaaaaaggggcaaggagccccgcggcggcctgggcagcgcc       c.*660

          .         .         .         .         .         .       g.27754
 tgtgcagacctccctgcaggccgcactgccaactgctgcatttgttgtgttttttaggtt       c.*720

          .         .         .         .         .         .       g.27814
 gcaattggtgaagttcacactttcattgtaattttagcgtgtggggttttgtcccttttt       c.*780

          .         .         .         .         .         .       g.27874
 tgttgttgttagctgtgtacagaatgtgtaatcttttttcttttctcttttttttgtttt       c.*840

          .         .         .         .         .         .       g.27934
 gttttgttttgtttttgtttttttacttttttcttcttggctaattcttggcagggatct       c.*900

          .         .         .         .         .         .       g.27994
 ttctggaggaaaagctggggccagccagggcaggagaggtgtgaaatctgccacgagggg       c.*960

          .         .         .         .         .         .       g.28054
 cctgctgtttgccacccagcccaacttcctgttgctggcccctgccctctgcccttttgc       c.*1020

          .         .         .         .         .         .       g.28114
 ctgtcctcaggccgctggaacaaaggaaggacagctcattcctcatgggcgatcactccg       c.*1080

          .         .         .         .         .         .       g.28174
 catctatagggtcgagcctaggggagcttgagggagggctggggcctccttgtcctggat       c.*1140

          .         .         .         .         .         .       g.28234
 ttccagctctccccatgccccctccctgagcaccaccggcaccgcctcccaaacagggct       c.*1200

          .         .         .         .         .         .       g.28294
 gctggtttccgcagccactgctccacctcccccaaatcgtcatggaaagggtggagaatg       c.*1260

          .         .         .         .         .         .       g.28354
 gaggggaaccaggcgtccttggaggcagcttgggagggtgactgtgtagtgtcacccaca       c.*1320

          .         .         .         .         .         .       g.28414
 agggaggctagggcaatggagcaggccaccagcagcagctgtgcagcatggaactcaggc       c.*1380

          .         .         .         .         .         .       g.28474
 caggctccgaggctgggggatctgcttggagttttctgccccccaccccaaacttctgtc       c.*1440

          .         .         .         .         .         .       g.28534
 gaggagcaaggcttgccagcaagtcagaaggatttgaaccgagcagccaatctttccagc       c.*1500

          .         .         .         .         .         .       g.28594
 cctcccctaccgacctctgcctggagacgcagcagcctgtgtcctccagggcctctggtt       c.*1560

          .         .         .         .         .         .       g.28654
 tgttgtattatagtatatttcgctgtggaaaatgtcacgtttagtcaccttggagcccac       c.*1620

          .         .         .         .         .         .       g.28714
 tcacctggtcctgttgttttaccccatcccttctctcgcgcgcctattgatttgtttctg       c.*1680

          .         .         .         .         .         .       g.28774
 aggagagtacaccgttcactattgtagagtaacccctgtgactcaatattaccatagtgc       c.*1740

          .         .         .         .         .                 g.28824
 gatgtcgttttgtgctattttgaacaattaaaagactttttttgaaataa                 c.*1790

 (downstream sequence)
Legend:
Nucleotide numbering (following the rules of the HGVS for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The Host cell factor C1 (VP16-accessory protein) protein sequence is shown below the coding DNA sequence, with numbering indicated at the right starting with 1 for the translation initiating Methionine. Every 10th amino acid is shown in bold. The position of introns is indicated by a vertical line, splitting the two exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frame shift variants, all stop codons in the +1 frame are shown in bold while all stop codons in the +2 frame are underlined.

Powered by LOVD v.3.0 Build 23
©2004-2020 Leiden University Medical Center