transcription factor 20 (AR1) (TCF20) - coding DNA reference sequence

(used for variant description)

(last modified March 22, 2018)


This file was created to facilitate the description of sequence variants on transcript NM_005650.2 in the TCF20 gene based on a coding DNA reference sequence following the HGVS recommendations.

The sequence was taken from NG_028982.1, covering TCF20 transcript NM_005650.2.


Please note that introns are available by clicking on the exon numbers above the sequence.
 (upstream sequence)
                               .         .         .                g.5134
                         gagggctgttggcctgctgctgtgctgctgaacagt       c.-1

          .         .         .         .         .         .       g.5194
 ATGCAGTCCTTTCGGGAGCAAAGCAGTTACCACGGAAACCAGCAAAGCTACCCACAGGAG       c.60
 M  Q  S  F  R  E  Q  S  S  Y  H  G  N  Q  Q  S  Y  P  Q  E         p.20

          .         .         .         .         .         .       g.5254
 GTACACGGCTCATCCCGGCTAGAAGAGTTCAGCCCTCGTCAGGCCCAGATGTTCCAGAAT       c.120
 V  H  G  S  S  R  L  E  E  F  S  P  R  Q  A  Q  M  F  Q  N         p.40

          .         .         .         .         .         .       g.5314
 TTTGGAGGTACAGGTGGCAGTAGTGGCAGCAGTGGCAGTGGCAGTGGTGGTGGACGACGA       c.180
 F  G  G  T  G  G  S  S  G  S  S  G  S  G  S  G  G  G  R  R         p.60

          .         .         .         .         .         .       g.5374
 GGAGCAGCAGCTGCTGCGGCAGCGATGGCTAGCGAGACCTCTGGCCATCAAGGTTACCAG       c.240
 G  A  A  A  A  A  A  A  M  A  S  E  T  S  G  H  Q  G  Y  Q         p.80

          .         .         .         .         .         .       g.5434
 GGTTTCAGGAAAGAGGCTGGAGATTTTTACTACATGGCAGGCAACAAAGACCCCGTGACT       c.300
 G  F  R  K  E  A  G  D  F  Y  Y  M  A  G  N  K  D  P  V  T         p.100

          .         .         .         .         .         .       g.5494
 ACAGGAACCCCACAGCCTCCTCAGCGAAGGCCTTCTGGGCCTGTGCAGAGCTATGGACCC       c.360
 T  G  T  P  Q  P  P  Q  R  R  P  S  G  P  V  Q  S  Y  G  P         p.120

          .         .         .         .         .         .       g.5554
 CCCCAGGGGAGCAGCTTTGGCAATCAGTATGGGAGTGAGGGTCATGTGGGCCAGTTTCAA       c.420
 P  Q  G  S  S  F  G  N  Q  Y  G  S  E  G  H  V  G  Q  F  Q         p.140

          .         .         .         .         .         .       g.5614
 GCACAGCACTCTGGCCTTGGCGGTGTGTCACATTATCAGCAGGATTACACTGGGCCTTTC       c.480
 A  Q  H  S  G  L  G  G  V  S  H  Y  Q  Q  D  Y  T  G  P  F         p.160

          .         .         .         .         .         .       g.5674
 TCTCCAGGGAGTGCTCAGTACCAACAGCAGGCTTCCAGCCAGCAGCAGCAGCAGCAAGTC       c.540
 S  P  G  S  A  Q  Y  Q  Q  Q  A  S  S  Q  Q  Q  Q  Q  Q  V         p.180

          .         .         .         .         .         .       g.5734
 CAGCAGTTGAGACAACAGCTTTACCAGTCCCATCAGCCCCTGCCACAGGCCACTGGCCAA       c.600
 Q  Q  L  R  Q  Q  L  Y  Q  S  H  Q  P  L  P  Q  A  T  G  Q         p.200

          .         .         .         .         .         .       g.5794
 CCAGCATCCAGCTCATCCCATCTACAGCCAATGCAGCGGCCCTCAACTCTGCCATCCTCT       c.660
 P  A  S  S  S  S  H  L  Q  P  M  Q  R  P  S  T  L  P  S  S         p.220

          .         .         .         .         .         .       g.5854
 GCTGCTGGTTACCAGTTAAGAGTGGGTCAGTTTGGCCAACACTATCAGTCTTCTGCTTCC       c.720
 A  A  G  Y  Q  L  R  V  G  Q  F  G  Q  H  Y  Q  S  S  A  S         p.240

          .         .         .         .         .         .       g.5914
 TCCTCCTCCTCCTCCTCCTTCCCTTCACCACAGCGTTTTAGCCAGTCTGGACAGAGCTAT       c.780
 S  S  S  S  S  S  F  P  S  P  Q  R  F  S  Q  S  G  Q  S  Y         p.260

          .         .         .         .         .         .       g.5974
 GATGGCAGTTACAATGTGAATGCTGGATCTCAGTATGAAGGACACAATGTGGGTTCTAAT       c.840
 D  G  S  Y  N  V  N  A  G  S  Q  Y  E  G  H  N  V  G  S  N         p.280

          .         .         .         .         .         .       g.6034
 GCACAGGCTTATGGAACACAATCCAATTACAGCTATCAGCCTCAATCTATGAAGAATTTT       c.900
 A  Q  A  Y  G  T  Q  S  N  Y  S  Y  Q  P  Q  S  M  K  N  F         p.300

          .         .         .         .         .         .       g.6094
 GAACAGGCAAAGATTCCACAAGGGACCCAACAGGGGCAGCAGCAGCAGCAACCGCAGCAA       c.960
 E  Q  A  K  I  P  Q  G  T  Q  Q  G  Q  Q  Q  Q  Q  P  Q  Q         p.320

          .         .         .         .         .         .       g.6154
 CAACAACACCCTTCTCAGCATGTGATGCAGTATACTAACGCTGCCACCAAGCTGCCCCTG       c.1020
 Q  Q  H  P  S  Q  H  V  M  Q  Y  T  N  A  A  T  K  L  P  L         p.340

          .         .         .         .         .         .       g.6214
 CAAAGCCAAGTGGGGCAGTACAACCAGCCTGAGGTTCCTGTGAGGTCCCCCATGCAGTTT       c.1080
 Q  S  Q  V  G  Q  Y  N  Q  P  E  V  P  V  R  S  P  M  Q  F         p.360

          .         .         .         .         .         .       g.6274
 CACCAGAACTTCAGCCCCATTTCTAACCCTTCTCCAGCTGCCTCTGTGGTTCAGTCTCCA       c.1140
 H  Q  N  F  S  P  I  S  N  P  S  P  A  A  S  V  V  Q  S  P         p.380

          .         .         .         .         .         .       g.6334
 AGCTGTAGTTCTACCCCATCTCCTCTCATGCAGACTGGGGAGAATCTCCAGTGTGGGCAA       c.1200
 S  C  S  S  T  P  S  P  L  M  Q  T  G  E  N  L  Q  C  G  Q         p.400

          .         .         .         .         .         .       g.6394
 GGCAGTGTGCCTATGGGTTCCAGAAACAGAATTTTACAGTTAATGCCTCAACTCAGTCCA       c.1260
 G  S  V  P  M  G  S  R  N  R  I  L  Q  L  M  P  Q  L  S  P         p.420

          .         .         .         .         .         .       g.6454
 ACCCCATCAATGATGCCCAGTCCTAATTCTCATGCTGCAGGCTTCAAAGGGTTTGGACTA       c.1320
 T  P  S  M  M  P  S  P  N  S  H  A  A  G  F  K  G  F  G  L         p.440

          .         .         .         .         .         .       g.6514
 GAAGGGGTACCAGAAAAGCGACTGACAGATCCTGGGTTGAGTAGTTTGAGTGCTCTGAGT       c.1380
 E  G  V  P  E  K  R  L  T  D  P  G  L  S  S  L  S  A  L  S         p.460

          .         .         .         .         .         .       g.6574
 ACTCAAGTGGCCAATCTTCCTAACACTGTCCAGCACATGTTACTTTCTGATGCCCTGACT       c.1440
 T  Q  V  A  N  L  P  N  T  V  Q  H  M  L  L  S  D  A  L  T         p.480

          .         .         .         .         .         .       g.6634
 CCTCAGAAGAAGACCTCCAAGAGGCCCTCATCTTCCAAGAAAGCAGATAGCTGCACAAAT       c.1500
 P  Q  K  K  T  S  K  R  P  S  S  S  K  K  A  D  S  C  T  N         p.500

          .         .         .         .         .         .       g.6694
 TCTGAAGGCTCCTCACAACCTGAAGAACAGCTGAAGTCCCCTATGGCAGAGTCATTAGAT       c.1560
 S  E  G  S  S  Q  P  E  E  Q  L  K  S  P  M  A  E  S  L  D         p.520

          .         .         .         .         .         .       g.6754
 GGAGGCTGCTCCAGCAGTTCAGAGGATCAAGGCGAGAGAGTGCGGCAACTAAGTGGCCAG       c.1620
 G  G  C  S  S  S  S  E  D  Q  G  E  R  V  R  Q  L  S  G  Q         p.540

          .         .         .         .         .         .       g.6814
 AGCACCAGCTCTGACACCACCTACAAGGGTGGAGCCTCTGAGAAAGCTGGCTCCTCACCG       c.1680
 S  T  S  S  D  T  T  Y  K  G  G  A  S  E  K  A  G  S  S  P         p.560

          .         .         .         .         .         .       g.6874
 GCACAAGGTGCTCAGAATGAACCCCCCAGACTCAATGCTAGTCCTGCCGCAAGAGAAGAG       c.1740
 A  Q  G  A  Q  N  E  P  P  R  L  N  A  S  P  A  A  R  E  E         p.580

          .         .         .         .         .         .       g.6934
 GCCACCTCACCAGGCGCTAAGGACATGCCATTGTCATCCGACGGGAACCCAAAGGTTAAT       c.1800
 A  T  S  P  G  A  K  D  M  P  L  S  S  D  G  N  P  K  V  N         p.600

          .         .         .         .         .         .       g.6994
 GAGAAGACTGTTGGGGTGATTGTCTCCCGGGAAGCCATGACAGGTCGGGTAGAAAAGCCT       c.1860
 E  K  T  V  G  V  I  V  S  R  E  A  M  T  G  R  V  E  K  P         p.620

          .         .         .         .         .         .       g.7054
 GGTGGACAAGATAAAGGCTCCCAAGAGGATGATCCTGCAGCCACTCAAAGGCCACCTAGC       c.1920
 G  G  Q  D  K  G  S  Q  E  D  D  P  A  A  T  Q  R  P  P  S         p.640

          .         .         .         .         .         .       g.7114
 AATGGTGGGGCAAAGGAAACCAGTCATGCATCACTTCCCCAGCCAGAGCCTCCAGGAGGA       c.1980
 N  G  G  A  K  E  T  S  H  A  S  L  P  Q  P  E  P  P  G  G         p.660

          .         .         .         .         .         .       g.7174
 GGAGGGAGCAAAGGAAACAAGAATGGCGATAACAACTCCAACCATAATGGAGAAGGAAAT       c.2040
 G  G  S  K  G  N  K  N  G  D  N  N  S  N  H  N  G  E  G  N         p.680

          .         .         .         .         .         .       g.7234
 GGCCAGAGTGGCCACTCTGCAGCGGGCCCTGGTTTTACGAGCAGAACTGAGCCTAGCAAA       c.2100
 G  Q  S  G  H  S  A  A  G  P  G  F  T  S  R  T  E  P  S  K         p.700

          .         .         .         .         .         .       g.7294
 TCTCCTGGAAGTCTGCGCTATAGTTACAAAGATAGTTTCGGGTCAGCCGTGCCACGAAAT       c.2160
 S  P  G  S  L  R  Y  S  Y  K  D  S  F  G  S  A  V  P  R  N         p.720

          .         .         .         .         .         .       g.7354
 GTCAGTGGCTTTCCTCAGTATCCTACAGGGCAAGAAAAGGGAGATTTCACTGGCCATGGG       c.2220
 V  S  G  F  P  Q  Y  P  T  G  Q  E  K  G  D  F  T  G  H  G         p.740

          .         .         .         .         .         .       g.7414
 GAACGAAAGGGTAGAAATGAAAAATTCCCAAGCCTCCTGCAGGAAGTGCTTCAGGGTTAC       c.2280
 E  R  K  G  R  N  E  K  F  P  S  L  L  Q  E  V  L  Q  G  Y         p.760

          .         .         .         .         .         .       g.7474
 CACCACCACCCTGACAGGAGATATTCTAGGAGTACTCAAGAGCATCAGGGGATGGCTGGT       c.2340
 H  H  H  P  D  R  R  Y  S  R  S  T  Q  E  H  Q  G  M  A  G         p.780

          .         .         .         .         .         .       g.7534
 AGCCTAGAAGGAACCACAAGGCCCAATGTCTTGGTTAGTCAAACCAATGAATTAGCTAGC       c.2400
 S  L  E  G  T  T  R  P  N  V  L  V  S  Q  T  N  E  L  A  S         p.800

          .         .         .         .         .         .       g.7594
 AGGGGCCTTCTGAACAAAAGCATTGGGTCTCTATTAGAAAATCCCCACTGGGGCCCCTGG       c.2460
 R  G  L  L  N  K  S  I  G  S  L  L  E  N  P  H  W  G  P  W         p.820

          .         .         .         .         .         .       g.7654
 GAAAGGAAATCAAGCAGCACAGCTCCTGAAATGAAACAGATCAATTTGACTGACTATCCA       c.2520
 E  R  K  S  S  S  T  A  P  E  M  K  Q  I  N  L  T  D  Y  P         p.840

          .         .         .         .         .         .       g.7714
 ATTCCCAGAAAGTTTGAAATAGAGCCTCAGTCATCAGCACATGAGCCTGGGGGTTCCCTC       c.2580
 I  P  R  K  F  E  I  E  P  Q  S  S  A  H  E  P  G  G  S  L         p.860

          .         .         .         .         .         .       g.7774
 TCTGAAAGAAGATCAGTGATCTGTGATATTTCTCCACTAAGACAGATTGTCAGGGACCCA       c.2640
 S  E  R  R  S  V  I  C  D  I  S  P  L  R  Q  I  V  R  D  P         p.880

          .         .         .         .         .         .       g.7834
 GGGGCTCACTCACTGGGACACATGAGTGCCGACACCAGAATTGGGAGGAATGACCGTCTC       c.2700
 G  A  H  S  L  G  H  M  S  A  D  T  R  I  G  R  N  D  R  L         p.900

          .         .         .         .         .         .       g.7894
 AATCCAACTTTAAGTCAGTCGGTCATTCTTCCTGGTGGTTTGGTGTCCATGGAAACCAAG       c.2760
 N  P  T  L  S  Q  S  V  I  L  P  G  G  L  V  S  M  E  T  K         p.920

          .         .         .         .         .         .       g.7954
 CTGAAATCCCAGAGCGGGCAGATAAAAGAGGAAGACTTTGAACAGTCTAAATCTCAAGCT       c.2820
 L  K  S  Q  S  G  Q  I  K  E  E  D  F  E  Q  S  K  S  Q  A         p.940

          .         .         .         .         .         .       g.8014
 AGTTTCAACAACAAGAAATCTGGAGACCACTGCCATCCTCCTAGCATCAAGCATGAGTCT       c.2880
 S  F  N  N  K  K  S  G  D  H  C  H  P  P  S  I  K  H  E  S         p.960

          .         .         .         .         .         .       g.8074
 TACCGCGGCAATGCCAGCCCTGGAGCAGCAACCCATGATTCCCTTTCAGACTATGGCCCG       c.2940
 Y  R  G  N  A  S  P  G  A  A  T  H  D  S  L  S  D  Y  G  P         p.980

          .         .         .         .         .         .       g.8134
 CAAGACAGCAGACCCACGCCAATGCGGCGGGTCCCTGGCAGAGTTGGTGGTCGGGAGGGC       c.3000
 Q  D  S  R  P  T  P  M  R  R  V  P  G  R  V  G  G  R  E  G         p.1000

          .         .         .         .         .         .       g.8194
 ATGAGGGGTCGGTCCCCTTCTCAATATCATGACTTTGCAGAAAAATTGAAAATGTCTCCT       c.3060
 M  R  G  R  S  P  S  Q  Y  H  D  F  A  E  K  L  K  M  S  P         p.1020

          .         .         .         .         .         .       g.8254
 GGGCGGAGCAGAGGCCCAGGGGGAGACCCTCATCACATGAATCCACACATGACCTTTTCA       c.3120
 G  R  S  R  G  P  G  G  D  P  H  H  M  N  P  H  M  T  F  S         p.1040

          .         .         .         .         .         .       g.8314
 GAGAGGGCTAACCGGAGTTCTTTACACACTCCCTTTTCTCCCAACTCAGAAACCCTGGCC       c.3180
 E  R  A  N  R  S  S  L  H  T  P  F  S  P  N  S  E  T  L  A         p.1060

          .         .         .         .         .         .       g.8374
 TCTGCTTATCATGCAAATACTCGGGCTCATGCTTATGGGGACCCTAACGCAGGTTTGAAT       c.3240
 S  A  Y  H  A  N  T  R  A  H  A  Y  G  D  P  N  A  G  L  N         p.1080

          .         .         .         .         .         .       g.8434
 TCTCAGCTGCATTATAAGAGACAGATGTACCAACAGCAACCAGAGGAGTATAAAGACTGG       c.3300
 S  Q  L  H  Y  K  R  Q  M  Y  Q  Q  Q  P  E  E  Y  K  D  W         p.1100

          .         .         .         .         .         .       g.8494
 AGCAGCGGTTCTGCTCAGGGAGTAATTGCTGCAGCACAGCACAGGCAGGAGGGGCCACGG       c.3360
 S  S  G  S  A  Q  G  V  I  A  A  A  Q  H  R  Q  E  G  P  R         p.1120

          .         .         .         .         .         .       g.8554
 AAGAGTCCAAGGCAGCAGCAGTTTCTTGACAGAGTACGGAGCCCTCTGAAAAATGACAAA       c.3420
 K  S  P  R  Q  Q  Q  F  L  D  R  V  R  S  P  L  K  N  D  K         p.1140

          .         .         .         .         .         .       g.8614
 GATGGTATGATGTATGGCCCACCAGTGGGGACTTACCATGACCCCAGTGCCCAGGAGGCT       c.3480
 D  G  M  M  Y  G  P  P  V  G  T  Y  H  D  P  S  A  Q  E  A         p.1160

          .         .         .         .         .         .       g.8674
 GGGCGCTGCCTAATGTCTAGTGATGGTCTGCCTAACAAGGGCATGGAATTAAAGCATGGC       c.3540
 G  R  C  L  M  S  S  D  G  L  P  N  K  G  M  E  L  K  H  G         p.1180

          .         .         .         .         .         .       g.8734
 TCCCAGAAGTTACAAGAATCCTGTTGGGATCTTTCTCGGCAAACTTCTCCAGCCAAAAGC       c.3600
 S  Q  K  L  Q  E  S  C  W  D  L  S  R  Q  T  S  P  A  K  S         p.1200

          .         .         .         .         .         .       g.8794
 AGCGGTCCTCCAGGAATGTCCAGTCAAAAAAGGTATGGGCCGCCCCATGAGACTGATGGA       c.3660
 S  G  P  P  G  M  S  S  Q  K  R  Y  G  P  P  H  E  T  D  G         p.1220

          .         .         .         .         .         .       g.8854
 CATGGACTAGCTGAGGCTACACAGTCATCCAAACCTGGTAGTGTTATGCTGAGACTTCCA       c.3720
 H  G  L  A  E  A  T  Q  S  S  K  P  G  S  V  M  L  R  L  P         p.1240

          .         .         .         .         .         .       g.8914
 GGCCAGGAGGATCATTCTTCTCAAAACCCCTTAATCATGAGGAGGCGTGTTCGTTCTTTT       c.3780
 G  Q  E  D  H  S  S  Q  N  P  L  I  M  R  R  R  V  R  S  F         p.1260

          .         .         .         .         .         .       g.8974
 ATCTCTCCCATTCCCAGTAAGAGACAGTCACAAGATGTAAAGAACAGTAGCACTGAAGAT       c.3840
 I  S  P  I  P  S  K  R  Q  S  Q  D  V  K  N  S  S  T  E  D         p.1280

          .         .         .         .         .         .       g.9034
 AAAGGTCGCCTCCTTCACTCATCAAAAGAAGGCGCTGATAAAGCATTCAATTCCTATGCC       c.3900
 K  G  R  L  L  H  S  S  K  E  G  A  D  K  A  F  N  S  Y  A         p.1300

          .         .         .         .         .         .       g.9094
 CATCTTTCTCACAGTCAGGATATCAAGTCTATCCCTAAGAGAGATTCCTCCAAGGACCTT       c.3960
 H  L  S  H  S  Q  D  I  K  S  I  P  K  R  D  S  S  K  D  L         p.1320

          .         .         .         .         .         .       g.9154
 CCAAGTCCAGATAGTAGAAACTGCCCTGCTGTTACCCTCACAAGCCCTGCTAAGACCAAA       c.4020
 P  S  P  D  S  R  N  C  P  A  V  T  L  T  S  P  A  K  T  K         p.1340

          .         .         .         .         .         .       g.9214
 ATACTGCCCCCACGGAAAGGACGGGGATTGAAATTGGAAGCTATAGTTCAGAAGATTACA       c.4080
 I  L  P  P  R  K  G  R  G  L  K  L  E  A  I  V  Q  K  I  T         p.1360

          .         .         .         .         .         .       g.9274
 TCCCCAAATATTAGGAGGAGCGCATCTTCGAACAGTGCGGAGGCTGGGGGAGACACGGTT       c.4140
 S  P  N  I  R  R  S  A  S  S  N  S  A  E  A  G  G  D  T  V         p.1380

          .         .         .         .         .         .       g.9334
 ACGCTTGATGATATACTGTCTTTGAAGAGTGGTCCTCCTGAAGGTGGGAGTGTTGCTGTT       c.4200
 T  L  D  D  I  L  S  L  K  S  G  P  P  E  G  G  S  V  A  V         p.1400

          .         .         .         .         .         .       g.9394
 CAGGATGCTGACATAGAGAAGAGAAAAGGTGAGGTGGCTTCGGACCTAGTCAGTCCAGCA       c.4260
 Q  D  A  D  I  E  K  R  K  G  E  V  A  S  D  L  V  S  P  A         p.1420

          .         .         .         .         .         .       g.9454
 AACCAGGAGTTGCACGTAGAGAAACCTCTTCCAAGGTCTTCAGAAGAGTGGCGTGGCAGC       c.4320
 N  Q  E  L  H  V  E  K  P  L  P  R  S  S  E  E  W  R  G  S         p.1440

          .         .         .         .         .         .       g.9514
 GTGGATGACAAAGTGAAGACAGAGACACATGCAGAAACAGTTACTGCCGGAAAGGAACCC       c.4380
 V  D  D  K  V  K  T  E  T  H  A  E  T  V  T  A  G  K  E  P         p.1460

          .         .         .         .         .         .       g.9574
 CCTGGTGCCATGACATCCACAACCTCACAGAAGCCTGGTAGTAACCAAGGGAGACCAGAT       c.4440
 P  G  A  M  T  S  T  T  S  Q  K  P  G  S  N  Q  G  R  P  D         p.1480

          .         .         .         .         .         .       g.9634
 GGTTCCCTGGGTGGAACAGCACCTTTAATCTTTCCAGACTCAAAGAATGTACCTCCAGTG       c.4500
 G  S  L  G  G  T  A  P  L  I  F  P  D  S  K  N  V  P  P  V         p.1500

          .         .         .         .         .         .       g.9694
 GGCATATTGGCCCCTGAGGCAAACCCCAAGGCTGAAGAGAAGGAGAACGATACAGTGACG       c.4560
 G  I  L  A  P  E  A  N  P  K  A  E  E  K  E  N  D  T  V  T         p.1520

          .         .         .         .         .         .       g.9754
 ATTTCACCGAAGCAAGAGGGTTTCCCTCCAAAGGGATATTTCCCATCAGGAAAGAAGAAG       c.4620
 I  S  P  K  Q  E  G  F  P  P  K  G  Y  F  P  S  G  K  K  K         p.1540

          .         .         .         .         .         .       g.9814
 GGGAGACCCATTGGTAGTGTGAATAAGCAAAAGAAACAGCAGCAGCCACCGCCTCCACCC       c.4680
 G  R  P  I  G  S  V  N  K  Q  K  K  Q  Q  Q  P  P  P  P  P         p.1560

          .         .         .         .         .         .       g.9874
 CCTCAGCCCCCACAGATACCAGAAGGTTCTGCAGATGGAGAGCCAAAGCCAAAAAAACAG       c.4740
 P  Q  P  P  Q  I  P  E  G  S  A  D  G  E  P  K  P  K  K  Q         p.1580

          .         .         .         .         .         .       g.9934
 AGGCAAAGGAGGGAGAGAAGGAAGCCTGGGGCCCAGCCGAGGAAGCGAAAAACCAAACAA       c.4800
 R  Q  R  R  E  R  R  K  P  G  A  Q  P  R  K  R  K  T  K  Q         p.1600

          .         .         .         .         .         .       g.9994
 GCAGTTCCCATTGTGGAACCCCAAGAACCTGAGATCAAACTAAAATATGCCACCCAGCCA       c.4860
 A  V  P  I  V  E  P  Q  E  P  E  I  K  L  K  Y  A  T  Q  P         p.1620

          .         .         .         .         .         .       g.10054
 CTGGATAAAACTGATGCCAAGAACAAGTCTTTTTACCCTTACATCCATGTAGTAAATAAG       c.4920
 L  D  K  T  D  A  K  N  K  S  F  Y  P  Y  I  H  V  V  N  K         p.1640

          .         .         .         .         .         .       g.10114
 TGTGAACTTGGAGCCGTTTGTACAATCATCAATGCTGAGGAAGAAGAACAGACCAAATTA       c.4980
 C  E  L  G  A  V  C  T  I  I  N  A  E  E  E  E  Q  T  K  L         p.1660

          .         .         .         .         .         .       g.10174
 GTGAGGGGCAGGAAGGGTCAGAGGTCACTGACCCCTCCACCTAGCAGCACTGAAAGCAAG       c.5040
 V  R  G  R  K  G  Q  R  S  L  T  P  P  P  S  S  T  E  S  K         p.1680

          .         .         .         .         .         .       g.10234
 GCGCTCCCGGCCTCGTCCTTTATGCTGCAGGGACCTGTTGTGACAGAGTCTTCGGTTATG       c.5100
 A  L  P  A  S  S  F  M  L  Q  G  P  V  V  T  E  S  S  V  M         p.1700

          .         .         .         .         .         .       g.10294
 GGGCACCTGGTTTGCTGTCTGTGTGGCAAGTGGGCCAGTTACCGGAACATGGGTGACCTC       c.5160
 G  H  L  V  C  C  L  C  G  K  W  A  S  Y  R  N  M  G  D  L         p.1720

          .         .         .         .         .         .       g.10354
 TTTGGACCTTTTTATCCCCAAGATTATGCAGCCACTCTCCCGAAGAATCCACCTCCTAAG       c.5220
 F  G  P  F  Y  P  Q  D  Y  A  A  T  L  P  K  N  P  P  P  K         p.1740

          .         .         .         .         .         .       g.10414
 AGGGCCACAGAAATGCAGAGCAAAGTTAAGGTACGGCACAAAAGTGCTTCTAATGGCTCC       c.5280
 R  A  T  E  M  Q  S  K  V  K  V  R  H  K  S  A  S  N  G  S         p.1760

          .         .         .         .         .         .       g.10474
 AAGACGGACACTGAGGAGGAGGAAGAGCAGCAGCAGCAGCAGAAGGAGCAGAGAAGCCTG       c.5340
 K  T  D  T  E  E  E  E  E  Q  Q  Q  Q  Q  K  E  Q  R  S  L         p.1780

          .         .         .         .         .         .       g.10534
 GCCGCACACCCCAGGTTTAAGCGGCGCCACCGCTCGGAAGACTGTGGTGGAGGCCCTCGG       c.5400
 A  A  H  P  R  F  K  R  R  H  R  S  E  D  C  G  G  G  P  R         p.1800

          .         .         .         .         .         .       g.10594
 TCCCTGTCCAGGGGGCTCCCTTGTAAAAAAGCAGCCACTGAGGGCAGCAGTGAAAAGACT       c.5460
 S  L  S  R  G  L  P  C  K  K  A  A  T  E  G  S  S  E  K  T         p.1820

          .         .         .         .         .         .       g.10654
 GTTTTGGACTCGAAGCCCTCCGTGCCCACCACTTCAGAAGGTGGCCCTGAGCTGGAGTTA       c.5520
 V  L  D  S  K  P  S  V  P  T  T  S  E  G  G  P  E  L  E  L         p.1840

          .         .         .         .         .         .       g.10714
 CAAATCCCTGAACTACCTCTTGACAGCAATGAATTTTGGGTCCATGAGGGTTGTATTCTC       c.5580
 Q  I  P  E  L  P  L  D  S  N  E  F  W  V  H  E  G  C  I  L         p.1860

          .         .         .         .         .         .       g.10774
 TGGGCCAATGGAATCTACCTGGTTTGTGGCAGGCTCTATGGCCTGCAGGAAGCGCTGGAA       c.5640
 W  A  N  G  I  Y  L  V  C  G  R  L  Y  G  L  Q  E  A  L  E         p.1880

          .      | 02  .         .         .         .         .    g.40782
 ATAGCCAGAGAGATG | AAATGTTCCCACTGCCAGGAGGCAGGCGCCACCTTGGGCTGCTAC    c.5700
 I  A  R  E  M   | K  C  S  H  C  Q  E  A  G  A  T  L  G  C  Y      p.1900

          .         .         .         .          | 03        .    g.50554
 AACAAAGGCTGCTCCTTCCGATACCATTACCCGTGTGCCATTGATGCAG | ATTGTTTGCTA    c.5760
 N  K  G  C  S  F  R  Y  H  Y  P  C  A  I  D  A  D |   C  L  L      p.1920

          .         .         .          | 04        .         .    g.51724
 CATGAGGAGAACTTCTCGGTGAGGTGCCCTAAGCACAAG | CCTCCCCTTCCGTGCCCTCTC    c.5820
 H  E  E  N  F  S  V  R  C  P  K  H  K   | P  P  L  P  C  P  L      p.1940

          .         .         .         .         .         .       g.51784
 CCCCCCTTGCAGAACAAGACCGCGAAAGGCAGCCTCAGCACAGAGCAGTCGGAGCGGGGG       c.5880
 P  P  L  Q  N  K  T  A  K  G  S  L  S  T  E  Q  S  E  R  G         p.1960

                                                                g.51787
 TGA |                                                             c.5884
 X                                                               p.1960

          .         .         .         .     | 05   .         .    g.59097
 ggggggcagtgtgctcgtgggaatggaaaggacagcaagcacag | gtgagactgtggagat    c.*60

          .         .         .         .         .         .       g.59157
 gagaaggtggtggacactcgtgatggaatggaaatcgtcctaccgtgcagccacaccctg       c.*120

          .         .         .         .         .         .       g.59217
 ccctgccccgccccgccccgcccgcgtgcctgcccatgccagcacttccttaagttctca       c.*180

          .         .         .         .         .         .       g.59277
 catcacactcaaaccagtgacaccacaggaaagaaagacccaagacgttggaatggctgt       c.*240

          .         .         .         .         .         .       g.59337
 ttccatggacacaatctccatagtgacaatgtggggggaggggggaggggtgggatgatg       c.*300

          .         .         .         .         .         .       g.59397
 gggaaagggtggggggaattaaaagggagggataaatatatatatataaatctattttta       c.*360

          .         .         .         .         .         .       g.59457
 gtctggaaagactttgtttaaatgaaaggtgcgctatcccttttgattctgttttaaaat       c.*420

          .         .         .         .         .         .       g.59517
 tatctcgttaaagatctccaaatttgttccgatgacaagtgaaatttaaatgtgagattg       c.*480

          .         .         .         .         .         .       g.59577
 aactgaacaaaccctcatctcatgaaggacggggtgtgtgtgtggcgttgatctttagcc       c.*540

          .         .         .         .         .         .       g.59637
 tgtctcacaccagttcagaaaacactagacccaggattgaaaaagcaaaccacagcagaa       c.*600

          .         .         .         .         .         .       g.59697
 ccatccttttgtcattaatttgtctcaaagtgggaaggttttgggggagggggaaataca       c.*660

          .         .         .         .         .         .       g.59757
 gggatggtccatgttttcaagagtaggggaatgatgtttaaacacaaaaataaatttttt       c.*720

          .         .         .         .         .         .       g.59817
 ttcatttccagaaacactatttatttatggttttttttttttaattttttctttttgggg       c.*780

          .         .         .         .         .         .       g.59877
 gtgaaattggcagatgcctgaggtcatagctgtgtcctgggtcactgtggctggtgagga       c.*840

          .         .         .         .         .         .       g.59937
 cctcaaggaccccatcaagtgtacacagcagcagcaaaatcaagggatgaccctcctctg       c.*900

          .         .         .         .         .         .       g.59997
 gggccccctgtcctcagcacattccaggcagctgtgccctgacccacagggacccgtggg       c.*960

          .         .         .         .         .         .       g.60057
 gatgggaggaggtccaggcctgtgttgccagagctggcagtgtgagctgtaggcagggac       c.*1020

          .         .         .         .         .         .       g.60117
 ggggagggactgtcgctgtgatcagagtgggttaagctgaccaggaacacccatttaacc       c.*1080

          .         .         .         .         .         .       g.60177
 cctttttctttttgctttcatttttataaaggaaaagaggacctgtcagataggcagccc       c.*1140

          .         .         .         .         .         .       g.60237
 catgctacgtgattctttatgttgtgttgttttgttttgtaaattgtataatttttaaat       c.*1200

          .         .         .         .         .         .       g.60297
 atctgagttttaaaaaaagaaaaaagtacaaaaaaatcttgttatggccttaagaagggg       c.*1260

          .         .         .         .         .         .       g.60357
 ttagtgcatctttcaggggtcactctgccatggggataaaatagctgtttcacaaacagt       c.*1320

          .         .         .         .         .         .       g.60417
 tttatttaaaaaaacaaaaaacaaaaaaaatcaaaaaatcaaaaaaataataaacttcat       c.*1380

          .                                                         g.60427
 tttaaccttg                                                         c.*1390

 (downstream sequence)
Legend:
Nucleotide numbering (following the rules of the HGVS for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The Transcription factor 20 (AR1) protein sequence is shown below the coding DNA sequence, with numbering indicated at the right starting with 1 for the translation initiating Methionine. Every 10th amino acid is shown in bold. The position of introns is indicated by a vertical line, splitting the two exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frame shift variants, all stop codons in the +1 frame are shown in bold while all stop codons in the +2 frame are underlined.

Powered by LOVD v.3.0 Build 21
©2004-2018 Leiden University Medical Center