AT rich interactive domain 2 (ARID, RFX-like) (ARID2) - coding DNA reference sequence

(used for variant description)

(last modified April 15, 2021)


This file was created to facilitate the description of sequence variants on transcript NM_152641.2 in the ARID2 gene based on a coding DNA reference sequence following the HGVS recommendations.

The sequence was taken from NC_000012.11, covering ARID2 transcript NM_152641.2.


Please note that introns are available by clicking on the exon numbers above the sequence.
 (upstream sequence)
          .         .         .         .         .         .       g.60
 ATGGCAAACTCGACGGGGAAGGCGCCTCCGGACGAGCGGAGAAAGGGACTCGCTTTCCTG       c.60
 M  A  N  S  T  G  K  A  P  P  D  E  R  R  K  G  L  A  F  L         p.20

          .         .         .   | 02     .         .         .    g.235
 GACGAGCTGCGGCAGTTCCACCACAGCAGAGG | GTCGCCTTTTAAAAAAATCCCTGCGGTG    c.120
 D  E  L  R  Q  F  H  H  S  R  G  |  S  P  F  K  K  I  P  A  V      p.40

          .         .         .         .         .         .       g.295
 GGTGGGAAGGAGCTGGATCTTCACGGTCTCTACACCAGAGTCACTACTTTAGGCGGATTC       c.180
 G  G  K  E  L  D  L  H  G  L  Y  T  R  V  T  T  L  G  G  F         p.60

        | 03 .         .         .         .         .         .    g.1434
 GCGAAG | GTTTCTGAGAAGAATCAGTGGGGAGAAATTGTTGAAGAGTTCAACTTTCCCAGA    c.240
 A  K   | V  S  E  K  N  Q  W  G  E  I  V  E  E  F  N  F  P  R      p.80

          .         .         .         .     | 04   .         .    g.81597
 AGTTGTTCTAACGCTGCCTTTGCTTTAAAACAGTATTACTTGCG | TTACCTAGAAAAGTAC    c.300
 S  C  S  N  A  A  F  A  L  K  Q  Y  Y  L  R  |  Y  L  E  K  Y      p.100

          .         .         .         .         .         .       g.81657
 GAGAAAGTTCATCATTTTGGGGAGGATGATGATGAGGTACCACCAGGCAATCCAAAGCCA       c.360
 E  K  V  H  H  F  G  E  D  D  D  E  V  P  P  G  N  P  K  P         p.120

          .         .         .         .         .         | 05    g.87835
 CAGCTTCCTATTGGTGCAATTCCATCTTCCTACAATTACCAGCAACACAGTGTGTCGG | AT    c.420
 Q  L  P  I  G  A  I  P  S  S  Y  N  Y  Q  Q  H  S  V  S  D |       p.140

          .         .         .         .         .         .       g.87895
 TATCTGCGTCAAAGTTATGGGCTGTCCATGGACTTTAATTCGCCAAATGATTATAATAAA       c.480
 Y  L  R  Q  S  Y  G  L  S  M  D  F  N  S  P  N  D  Y  N  K         p.160

          .         .         .         .         .         .       g.87955
 TTGGTGCTTTCACTGTTATCTGGACTCCCAAATGAAGTGGACTTTGCTATTAACGTATGC       c.540
 L  V  L  S  L  L  S  G  L  P  N  E  V  D  F  A  I  N  V  C         p.180

          .         .         .         .         .         .       g.88015
 ACTCTCCTATCAAATGAAAGCAAGCACGTCATGCAACTTGAAAAAGATCCTAAAATCATC       c.600
 T  L  L  S  N  E  S  K  H  V  M  Q  L  E  K  D  P  K  I  I         p.200

          .         .         .        | 06.         .         .    g.91606
 ACTTTACTACTTGCTAATGCCGGGGTGTTTGACGACA | CTTTAGGATCCTTTTCCACTGTA    c.660
 T  L  L  L  A  N  A  G  V  F  D  D  T |   L  G  S  F  S  T  V      p.220

          .         .         .         .      | 07  .         .    g.106767
 TTTGGAGAAGAATGGAAAGAGAAGACTGATAGAGACTTCGTTAAG | TTTTGGAAAGACATC    c.720
 F  G  E  E  W  K  E  K  T  D  R  D  F  V  K   | F  W  K  D  I      p.240

          .         .         .         .         .   | 08     .    g.106912
 GTTGATGATAATGAAGTTCGTGACCTCATTTCTGACAGAAACAAGTCTCATG | AAGGTACA    c.780
 V  D  D  N  E  V  R  D  L  I  S  D  R  N  K  S  H  E |   G  T      p.260

          .         .         .         .         .         .       g.106972
 TCAGGAGAATGGATTTGGGAGTCTTTATTTCATCCACCTCGAAAGCTGGGCATTAACGAT       c.840
 S  G  E  W  I  W  E  S  L  F  H  P  P  R  K  L  G  I  N  D         p.280

          .         .         .         .         .         .       g.107032
 ATTGAAGGACAGCGGGTACTTCAGATTGCAGTGATTTTGAGAAATCTTTCCTTTGAGGAG       c.900
 I  E  G  Q  R  V  L  Q  I  A  V  I  L  R  N  L  S  F  E  E         p.300

          .         .         .         .         .         .       g.107092
 GGCAATGTTAAGCTCTTGGCAGCTAATCGTACCTGTCTTCGTTTCCTATTACTTTCTGCA       c.960
 G  N  V  K  L  L  A  A  N  R  T  C  L  R  F  L  L  L  S  A         p.320

          .         .         .         .         .         .       g.107152
 CATAGTCATTTTATTTCTTTAAGGCAATTAGGCCTTGACACATTAGGAAATATTGCAGCT       c.1020
 H  S  H  F  I  S  L  R  Q  L  G  L  D  T  L  G  N  I  A  A         p.340

     | 09    .         .         .         .         .         .    g.107541
 GAG | CTTTTACTGGACCCTGTTGATTTCAAAACTACTCATCTGATGTTTCATACTGTTACA    c.1080
 E   | L  L  L  D  P  V  D  F  K  T  T  H  L  M  F  H  T  V  T      p.360

          .         .         .         . | 10       .         .    g.107681
 AAATGTCTAATGTCAAGGGATAGATTTTTAAAGATGAGAG | GCATGGAAATTTTGGGAAAT    c.1140
 K  C  L  M  S  R  D  R  F  L  K  M  R  G |   M  E  I  L  G  N      p.380

          .         .         .         .         .         .       g.107741
 CTTTGCAAAGCAGAAGATAATGGTGTTTTAATTTGTGAATATGTGGATCAGGATTCCTAC       c.1200
 L  C  K  A  E  D  N  G  V  L  I  C  E  Y  V  D  Q  D  S  Y         p.400

          .         .         .         .         .         .       g.107801
 AGAGAGATCATTTGTCATCTCACTTTACCTGATGTGCTGCTTGTAATCTCAACACTCGAG       c.1260
 R  E  I  I  C  H  L  T  L  P  D  V  L  L  V  I  S  T  L  E         p.420

          .         .         .         .         .         .       g.107861
 GTGCTATACATGCTCACGGAAATGGGAGATGTTGCTTGCACAAAAATTGCAAAAGTAGAA       c.1320
 V  L  Y  M  L  T  E  M  G  D  V  A  C  T  K  I  A  K  V  E         p.440

          . | 11       .         .         .         .         .    g.109542
 AAGAGCATAG | ACATGTTAGTGTGTCTGGTTTCTATGGATATTCAGATGTTTGGCCCTGAT    c.1380
 K  S  I  D |   M  L  V  C  L  V  S  M  D  I  Q  M  F  G  P  D      p.460

          .         .         .         .         .         .       g.109602
 GCACTAGCTGCGGTAAAACTCATTGAACACCCAAGTTCCAGTCATCAAATGTTATCTGAA       c.1440
 A  L  A  A  V  K  L  I  E  H  P  S  S  S  H  Q  M  L  S  E         p.480

          .         .         .         .         .         | 12    g.117021
 ATTAGGCCACAAGCTATAGAGCAAGTCCAAACCCAGACTCATGTAGCATCTGCCCCAG | CT    c.1500
 I  R  P  Q  A  I  E  Q  V  Q  T  Q  T  H  V  A  S  A  P  A |       p.500

          .         .         .         .         .         .       g.117081
 TCCAGAGCAGTTGTAGCGCAGCATGTTGCTCCACCTCCAGGAATAGTGGAAATAGATAGT       c.1560
 S  R  A  V  V  A  Q  H  V  A  P  P  P  G  I  V  E  I  D  S         p.520

          .         . | 13       .         .         .         .    g.119039
 GAGAAGTTTGCTTGTCAGTG | GCTAAATGCTCATTTTGAAGTAAATCCAGATTGTTCTGTT    c.1620
 E  K  F  A  C  Q  W  |  L  N  A  H  F  E  V  N  P  D  C  S  V      p.540

          .         .         .         .         .         .       g.119099
 TCTCGAGCAGAAATGTATTCTGAATACCTCTCGACTTGCAGTAAATTAGCTCGTGGTGGA       c.1680
 S  R  A  E  M  Y  S  E  Y  L  S  T  C  S  K  L  A  R  G  G         p.560

          .         .         .      | 14  .         .         .    g.119768
 ATCCTAACATCAACTGGATTTTATAAATGTCTTAG | AACGGTCTTTCCAAATCATACAGTG    c.1740
 I  L  T  S  T  G  F  Y  K  C  L  R  |  T  V  F  P  N  H  T  V      p.580

          .         .         .         .         .         .       g.119828
 AAGAGAGTGGAGGATTCCAGTAGCAATGGGCAGGCACATATTCATGTGGTAGGAGTAAAA       c.1800
 K  R  V  E  D  S  S  S  N  G  Q  A  H  I  H  V  V  G  V  K         p.600

          .         .         .         .         .         .       g.119888
 CGGAGGGCTATACCACTTCCCATTCAGATGTACTATCAGCAGCAACCAGTTTCTACTTCT       c.1860
 R  R  A  I  P  L  P  I  Q  M  Y  Y  Q  Q  Q  P  V  S  T  S         p.620

          .         .         .         .         .   | 15     .    g.120207
 GTTGTTCGTGTTGATTCTGTTCCTGATGTATCTCCTGCTCCTTCACCTGCAG | GAATCCCT    c.1920
 V  V  R  V  D  S  V  P  D  V  S  P  A  P  S  P  A  G |   I  P      p.640

          .         .         .         .         .         .       g.120267
 CATGGATCACAAACCATAGGAAACCATTTTCAGAGGACTCCTGTTGCCAACCAATCTTCA       c.1980
 H  G  S  Q  T  I  G  N  H  F  Q  R  T  P  V  A  N  Q  S  S         p.660

          .         .         .         .         .         .       g.120327
 AATCTGACTGCAACACAAATGTCTTTTCCTGTACAAGGTGTTCATACTGTGGCACAAACT       c.2040
 N  L  T  A  T  Q  M  S  F  P  V  Q  G  V  H  T  V  A  Q  T         p.680

          .         .         .         .         .         .       g.120387
 GTTTCAAGAATTCCACAAAATCCTTCACCTCATACCCACCAGCAACAAAATGCTCCAGTG       c.2100
 V  S  R  I  P  Q  N  P  S  P  H  T  H  Q  Q  Q  N  A  P  V         p.700

          .         .         .         .         .         .       g.120447
 ACTGTCATTCAAAGTAAAGCTCCAATTCCTTGTGAAGTTGTTAAGGCTACAGTTATCCAG       c.2160
 T  V  I  Q  S  K  A  P  I  P  C  E  V  V  K  A  T  V  I  Q         p.720

          .         .         .         .         .         .       g.120507
 AATTCCATACCCCAGACAGGAGTTCCTGTTAGTATTGCTGTTGGAGGAGGACCTCCACAG       c.2220
 N  S  I  P  Q  T  G  V  P  V  S  I  A  V  G  G  G  P  P  Q         p.740

          .         .         .         .         .         .       g.120567
 AGTTCTGTTGTTCAGAATCATAGTACAGGGCCACAACCTGTTACAGTTGTGAATTCTCAG       c.2280
 S  S  V  V  Q  N  H  S  T  G  P  Q  P  V  T  V  V  N  S  Q         p.760

          .         .         .         .         .         .       g.120627
 ACATTGCTTCACCATCCATCTGTAATTCCACAGCAGTCTCCATTACACACAGTGGTACCA       c.2340
 T  L  L  H  H  P  S  V  I  P  Q  Q  S  P  L  H  T  V  V  P         p.780

          .         .         .         .         .         .       g.120687
 GGACAGATCCCTTCAGGCACTCCTGTTACAGTAATTCAACAAGCTGTCCCACAGAGTCAT       c.2400
 G  Q  I  P  S  G  T  P  V  T  V  I  Q  Q  A  V  P  Q  S  H         p.800

          .         .         .         .         .         .       g.120747
 ATGTTTGGCAGAGTACAGAACATACCAGCATGTACTTCTACAGTTTCACAGGGTCAACAG       c.2460
 M  F  G  R  V  Q  N  I  P  A  C  T  S  T  V  S  Q  G  Q  Q         p.820

          .         .         .         .         .         .       g.120807
 TTAATCACCACATCACCCCAACCTGTGCAAACTTCATCTCAACAGACATCAGCTGGTAGC       c.2520
 L  I  T  T  S  P  Q  P  V  Q  T  S  S  Q  Q  T  S  A  G  S         p.840

          .         .         .         .         .         .       g.120867
 CAGTCACAAGATACTGTTATCATAGCACCCCCACAGTATGTAACAACTTCTGCATCCAAT       c.2580
 Q  S  Q  D  T  V  I  I  A  P  P  Q  Y  V  T  T  S  A  S  N         p.860

          .         .         .         .         .         .       g.120927
 ATTGTCTCAGCAACTTCAGTACAGAATTTTCAGGTAGCTACAGGACAAATGGTTACTATT       c.2640
 I  V  S  A  T  S  V  Q  N  F  Q  V  A  T  G  Q  M  V  T  I         p.880

          .         .         .         .         .         .       g.120987
 GCTGGTGTCCCAAGTCCACAAGCCTCAAGGGTAGGGTTTCAGAACATTGCACCAAAACCT       c.2700
 A  G  V  P  S  P  Q  A  S  R  V  G  F  Q  N  I  A  P  K  P         p.900

          .         .         .         .         .         .       g.121047
 CTCCCTTCTCAGCAAGTTTCATCTACAGTGGTACAGCAGCCTATTCAACAACCACAGCAG       c.2760
 L  P  S  Q  Q  V  S  S  T  V  V  Q  Q  P  I  Q  Q  P  Q  Q         p.920

          .         .         .         .         .         .       g.121107
 CCAACCCAACAAAGCGTAGTGATTGTAAGCCAGCCAGCTCAACAAGGTCAAACTTATGCA       c.2820
 P  T  Q  Q  S  V  V  I  V  S  Q  P  A  Q  Q  G  Q  T  Y  A         p.940

          .         .         .         .         .         .       g.121167
 CCAGCCATTCACCAAATTGTTCTTGCTAATCCAGCAGCTCTTCCAGCTGGTCAGACAGTT       c.2880
 P  A  I  H  Q  I  V  L  A  N  P  A  A  L  P  A  G  Q  T  V         p.960

          .         .         .         .         .         .       g.121227
 CAGCTAACTGGACAACCTAACATAACTCCATCTTCTTCACCATCACCTGTCCCAGCTACT       c.2940
 Q  L  T  G  Q  P  N  I  T  P  S  S  S  P  S  P  V  P  A  T         p.980

          .         .         .         .         .         .       g.121287
 AATAACCAAGTCCCTACTGCCATGTCGTCGTCCTCTACCCCTCAATCACAGGGACCACCT       c.3000
 N  N  Q  V  P  T  A  M  S  S  S  S  T  P  Q  S  Q  G  P  P         p.1000

          .         .         .         .         .         .       g.121347
 CCTACTGTCAGTCAAATGTTATCTGTGAAAAGGCAGCAACAGCAGCAACATTCACCAGCA       c.3060
 P  T  V  S  Q  M  L  S  V  K  R  Q  Q  Q  Q  Q  H  S  P  A         p.1020

          .         .         .         .         .         .       g.121407
 CCCCCACCACAGCAGGTACAAGTACAAGTTCAGCAGCCCCAACAAGTACAGATGCAAGTT       c.3120
 P  P  P  Q  Q  V  Q  V  Q  V  Q  Q  P  Q  Q  V  Q  M  Q  V         p.1040

          .         .         .         .         .         .       g.121467
 CAACCTCAACAGTCGAATGCAGGAGTTGGTCAGCCTGCCTCTGGTGAGTCGAGTCTGATT       c.3180
 Q  P  Q  Q  S  N  A  G  V  G  Q  P  A  S  G  E  S  S  L  I         p.1060

          .         .         .         .         .         .       g.121527
 AAACAGCTTCTGCTTCCGAAACGTGGTCCTTCAACACCAGGTGGTAAGCTTATTCTCCCA       c.3240
 K  Q  L  L  L  P  K  R  G  P  S  T  P  G  G  K  L  I  L  P         p.1080

          .         .         .         .         .         .       g.121587
 GCTCCACAGATTCCTCCCCCTAATAATGCAAGAGCTCCTAGCCCTCAGGTGGTCTATCAG       c.3300
 A  P  Q  I  P  P  P  N  N  A  R  A  P  S  P  Q  V  V  Y  Q         p.1100

          .         .         .         .         .         .       g.121647
 GTGGCCAGTAACCAAGCCGCAGGTTTTGGAGTGCAGGGGCAAACTCCAGCTCAGCAGCTA       c.3360
 V  A  S  N  Q  A  A  G  F  G  V  Q  G  Q  T  P  A  Q  Q  L         p.1120

          .         .         .         .         .         .       g.121707
 TTGGTTGGGCAGCAAAATGTTCAGTTGGTCCCAAGTGCAATGCCACCCTCAGGGGGAGTA       c.3420
 L  V  G  Q  Q  N  V  Q  L  V  P  S  A  M  P  P  S  G  G  V         p.1140

          .         .         .         .         .         .       g.121767
 CAAACTGTGCCCATTTCGAACTTACAAATATTGCCAGGTCCACTGATCTCAAATAGCCCA       c.3480
 Q  T  V  P  I  S  N  L  Q  I  L  P  G  P  L  I  S  N  S  P         p.1160

          .         .         .         .         .         .       g.121827
 GCAACCATTTTCCAAGGGACTTCTGGCAACCAGGTAACCATAACAGTTGTGCCAAATACG       c.3540
 A  T  I  F  Q  G  T  S  G  N  Q  V  T  I  T  V  V  P  N  T         p.1180

          .         .         .         .         .         .       g.121887
 AGTTTTGCACCTGCAACTGTGAGTCAGGGAAATGCAACTCAGCTCATTGCTCCAGCAGGA       c.3600
 S  F  A  P  A  T  V  S  Q  G  N  A  T  Q  L  I  A  P  A  G         p.1200

          .         .         .         .         .         .       g.121947
 ATTACCATGAGCGGAACGCAGACAGGAGTTGGACTTCCAGTACAAACGCTTCCAGCCACT       c.3660
 I  T  M  S  G  T  Q  T  G  V  G  L  P  V  Q  T  L  P  A  T         p.1220

          .         .         .         .         .         .       g.122007
 CAAGCATCTCCTGCTGGACAATCATCATGTACTACTGCTACTCCCCCATTCAAAGGTGAT       c.3720
 Q  A  S  P  A  G  Q  S  S  C  T  T  A  T  P  P  F  K  G  D         p.1240

          .         .         .         .         .         .       g.122067
 AAAATAATTTGCCAAAAGGAGGAGGAAGCAAAGGAAGCAACAGGTTTACATGTTCATGAA       c.3780
 K  I  I  C  Q  K  E  E  E  A  K  E  A  T  G  L  H  V  H  E         p.1260

          .         .         .         .         .         .       g.122127
 CGTAAAATTGAAGTCATGGAGAACCCGTCCTGCCGACGAGGAGCCACAAACACCAGCAAT       c.3840
 R  K  I  E  V  M  E  N  P  S  C  R  R  G  A  T  N  T  S  N         p.1280

          .         .         .         .         .         .       g.122187
 GGGGATACAAAGGAAAATGAAATGCATGTGGGAAGTCTTTTAAATGGGAGAAAGTACAGT       c.3900
 G  D  T  K  E  N  E  M  H  V  G  S  L  L  N  G  R  K  Y  S         p.1300

          .         .         .         .         .         .       g.122247
 GACTCAAGTCTACCTCCTTCAAACTCAGGGAAAATTCAAAGTGAGACTAATCAGTGCTCA       c.3960
 D  S  S  L  P  P  S  N  S  G  K  I  Q  S  E  T  N  Q  C  S         p.1320

          .         .         .         .         .         .       g.122307
 CTAATCAGTAATGGGCCATCATTGGAATTAGGTGAGAATGGAGCATCTGGGAAACAGAAC       c.4020
 L  I  S  N  G  P  S  L  E  L  G  E  N  G  A  S  G  K  Q  N         p.1340

          .         .         .         .         .         .       g.122367
 TCAGAACAAATAGACATGCAAGATATCAAAAGTGATTTGAGAAAACCGCTAGTTAATGGA       c.4080
 S  E  Q  I  D  M  Q  D  I  K  S  D  L  R  K  P  L  V  N  G         p.1360

          .         .         .         .         .         .       g.122427
 ATCTGTGATTTTGATAAAGGAGATGGTTCTCATTTAAGCAAAAACATTCCAAATCATAAA       c.4140
 I  C  D  F  D  K  G  D  G  S  H  L  S  K  N  I  P  N  H  K         p.1380

          .         .         .         .         .         .       g.122487
 ACTTCCAATCATGTAGGAAATGGTGAGATATCTCCAATGGAACCACAAGGGACTTTAGAT       c.4200
 T  S  N  H  V  G  N  G  E  I  S  P  M  E  P  Q  G  T  L  D         p.1400

          .         .         .         .         .         .       g.122547
 ATCACTCAGCAAGATACTGCCAAAGGTGATCAACTAGAAAGAATTTCTAATGGACCTGTA       c.4260
 I  T  Q  Q  D  T  A  K  G  D  Q  L  E  R  I  S  N  G  P  V         p.1420

          .         .         .         .         .         .       g.122607
 TTAACTTTGGGTGGTTCATCTGTGAGCAGTATACAGGAGGCTTCAAATGCGGCAACACAG       c.4320
 L  T  L  G  G  S  S  V  S  S  I  Q  E  A  S  N  A  A  T  Q         p.1440

          .         .         .         .         .         .       g.122667
 CAATTTAGTGGTACTGATTTGCTTAATGGACCTCTAGCTTCAAGTTTGAATTCAGATGTG       c.4380
 Q  F  S  G  T  D  L  L  N  G  P  L  A  S  S  L  N  S  D  V         p.1460

          .         .         .         .         .         .       g.122727
 CCTCAGCAACGCCCAAGTGTAGTTGTCTCACCACATTCTACAACCTCTGTTATACAGGGA       c.4440
 P  Q  Q  R  P  S  V  V  V  S  P  H  S  T  T  S  V  I  Q  G         p.1480

          .         .         .         .         .         .       g.122787
 CATCAAATCATAGCAGTTCCCGACTCAGGATCAAAAGTATCCCATTCTCCTGCCCTATCA       c.4500
 H  Q  I  I  A  V  P  D  S  G  S  K  V  S  H  S  P  A  L  S         p.1500

          .         .         .         .         .         .       g.122847
 TCTGACGTTCGGTCTACAAATGGCACAGCAGAATGCAAAACTGTAAAGAGGCCAGCAGAG       c.4560
 S  D  V  R  S  T  N  G  T  A  E  C  K  T  V  K  R  P  A  E         p.1520

          .         .         .         .         .         .       g.122907
 GATACTGATAGGGAAACAGTCGCAGGAATTCCAAATAAAGTAGGAGTTAGAATTGTTACA       c.4620
 D  T  D  R  E  T  V  A  G  I  P  N  K  V  G  V  R  I  V  T         p.1540

          .         .         .         .         .         .       g.122967
 ATCAGTGACCCCAACAATGCTGGCTGCAGCGCAACAATGGTTGCTGTGCCAGCAGGAGCA       c.4680
 I  S  D  P  N  N  A  G  C  S  A  T  M  V  A  V  P  A  G  A         p.1560

          .         .         .         .         .         .       g.123027
 GATCCAAGCACTGTAGCTAAAGTAGCAATAGAAAGTGCTGTTCAGCAAAAGCAACAGCAT       c.4740
 D  P  S  T  V  A  K  V  A  I  E  S  A  V  Q  Q  K  Q  Q  H         p.1580

          .         .         .    | 16    .         .         .    g.130991
 CCACCAACATATGTACAGAATGTGGTCCCGCAG | AACACTCCTATGCCACCTTCACCAGCT    c.4800
 P  P  T  Y  V  Q  N  V  V  P  Q   | N  T  P  M  P  P  S  P  A      p.1600

          .         .         .         .         .         .       g.131051
 GTACAAGTGCAGGGCCAGCCTAACAGTTCTCAGCCTTCTCCATTCAGTGGATCCAGTCAG       c.4860
 V  Q  V  Q  G  Q  P  N  S  S  Q  P  S  P  F  S  G  S  S  Q         p.1620

          .         .         .         .         .         .       g.131111
 CCTGGAGATCCAATGAGAAAACCTGGACAGAACTTCATGTGTCTGTGGCAGTCTTGTAAA       c.4920
 P  G  D  P  M  R  K  P  G  Q  N  F  M  C  L  W  Q  S  C  K         p.1640

    | 17     .         .         .         .         .         .    g.162001
 AA | GTGGTTTCAGACACCCTCACAGGTTTTCTACCATGCAGCAACTGAACATGGAGGAAAA    c.4980
 K  |  W  F  Q  T  P  S  Q  V  F  Y  H  A  A  T  E  H  G  G  K      p.1660

          .         .         .         .         .         .       g.162061
 GATGTATATCCAGGGCAGTGTCTTTGGGAAGGTTGTGAGCCTTTTCAGCGACAGCGGTTT       c.5040
 D  V  Y  P  G  Q  C  L  W  E  G  C  E  P  F  Q  R  Q  R  F         p.1680

          .         .  | 18      .         .         .         .    g.162213
 TCTTTTATTACCCACTTGCAG | GATAAGCACTGTTCAAAGGATGCCCTACTTGCAGGATTA    c.5100
 S  F  I  T  H  L  Q   | D  K  H  C  S  K  D  A  L  L  A  G  L      p.1700

          .         .         .         .        | 19.         .    g.163596
 AAACAAGATGAACCAGGACAAGCAGGAAGTCAGAAGTCTTCTACCAA | GCAGCCAACTGTA    c.5160
 K  Q  D  E  P  G  Q  A  G  S  Q  K  S  S  T  K  |  Q  P  T  V      p.1720

          .         .         .         .         .         .       g.163656
 GGGGGCACAAGCTCAACTCCTAGAGCACAAAAGGCCATTGTGAATCATCCCAGTGCTGCA       c.5220
 G  G  T  S  S  T  P  R  A  Q  K  A  I  V  N  H  P  S  A  A         p.1740

          .         .         .         .         .  | 20      .    g.163802
 CTTATGGCTCTGAGGAGAGGATCAAGAAACCTTGTCTTTCGAGATTTTACA | GATGAAAAA    c.5280
 L  M  A  L  R  R  G  S  R  N  L  V  F  R  D  F  T   | D  E  K      p.1760

          .         .         .         .         .         .       g.163862
 GAGGGACCAATAACTAAACACATCCGACTAACAGCTGCCTTAATATTAAAAAATATTGGT       c.5340
 E  G  P  I  T  K  H  I  R  L  T  A  A  L  I  L  K  N  I  G         p.1780

          .         .    | 21    .         .         .         .    g.175134
 AAATATTCAGAATGTGGTCGCAG | ATTGTTAAAGAGACATGAAAATAACTTATCAGTGCTA    c.5400
 K  Y  S  E  C  G  R  R  |  L  L  K  R  H  E  N  N  L  S  V  L      p.1800

          .         .         .         .         .         .       g.175194
 GCCATTAGTAACATGGAAGCTTCCTCCACCCTTGCCAAATGCCTTTATGAACTTAATTTT       c.5460
 A  I  S  N  M  E  A  S  S  T  L  A  K  C  L  Y  E  L  N  F         p.1820

          .         .         .         .                           g.175242
 ACAGTTCAGAGTAAGGAACAAGAAAAAGACTCAGAAATGCTGCAGTGA                   c.5508
 T  V  Q  S  K  E  Q  E  K  D  S  E  M  L  Q  X                     p.1835

          .         .         .         .         .         .       g.175302
 aaaataattccacttacacagtgggggactcaaagtcagccacatttcacatactgttac       c.*60

          .         .         .         .         .         .       g.175362
 tgaagaaagcaccaagtcttaatggaacaaagaccatagaatgaattattttatctcctc       c.*120

          .         .         .         .         .         .       g.175422
 ccatgatgctgagaggaagcttcgtattctgatctctgagtgaatccctttgttctctgt       c.*180

          .         .         .         .         .         .       g.175482
 ttaaaaaaatctaaaaagaaaaaggaaaaaaaaaaaagaactgctgtgggattgtcaacc       c.*240

          .         .         .         .         .         .       g.175542
 agcttatctgcaggatgtttcagatctgataaatcctgatggaaactggtatgatcagaa       c.*300

          .         .         .         .         .         .       g.175602
 ttcagtaccatccacattggaatatacatggaatattgtaaaacctacatgagcagatga       c.*360

          .         .         .         .         .         .       g.175662
 aatagaagcattaaatatttttatctatatccaaaaaggagcacatttttatatttacaa       c.*420

          .         .         .         .         .         .       g.175722
 aaccgtttaagctggtttgaataatttaaaaaagtttcagcacacctatacccccgatct       c.*480

          .         .         .         .         .         .       g.175782
 cagagggggccaccaatatctagctatggatcgtgtgttttgtttagaaatcagtagctt       c.*540

          .         .         .         .         .         .       g.175842
 ggttttcttacttgagccaatatattttcacttatttattatcataaaaatttaccagtc       c.*600

          .         .         .         .         .         .       g.175902
 tgaatagatcttgtaaatatttgtgaatagaatgaatacctttcatgccactgcagccac       c.*660

          .         .         .         .         .         .       g.175962
 tggaaatacattctgtggtgtcctagaagcattattggtaggttctaaagttttctagac       c.*720

          .         .         .         .         .         .       g.176022
 tttcctgtcaattgtaagtaattgtgatatattctatgcagtggatgaatgttctttaaa       c.*780

          .         .         .         .         .         .       g.176082
 tttgtgtaaatacttctgcaaaggtactgatgctgtaaagtcaaaacagttttgtggaac       c.*840

          .         .         .         .         .         .       g.176142
 tgtgatttttttttcttttttctttttttttttctttttttttttgtattatacaccttg       c.*900

          .         .         .         .         .         .       g.176202
 tagaactcattttgctggctgaaagagtatggaataatatatctcatgtcattttttaga       c.*960

          .         .         .         .         .         .       g.176262
 agaaaaactatttgaaggtattttttggttttccttaacatgtatccactgtaaacgttt       c.*1020

          .         .         .         .         .         .       g.176322
 gtcgtgtacaagctcagagcttggacagaattttttgtatttgtaaattggtttaaatac       c.*1080

          .         .         .         .         .         .       g.176382
 atggaattttatacaggttttctcctgtgttatatatgcattatgtgcaggtatgatatt       c.*1140

          .         .         .         .         .         .       g.176442
 ttcttcactactttttctatcttaatatagtgtggaattttattgtattattcttccatt       c.*1200

          .         .         .         .         .         .       g.176502
 cttaatactgtaccacattcctgctcagaaactgctcacttccttaaattgtcttttttc       c.*1260

          .         .         .         .         .         .       g.176562
 ccccagcgtgaaatgtatccatttataactgcctattgcctgttctattagcatccaaaa       c.*1320

          .         .         .         .         .         .       g.176622
 atgtggaaggcctcccaaccaccatttctgctgtgtccttaggatgtgcagtaaaaaata       c.*1380

          .         .         .         .         .         .       g.176682
 tagacctaacagtttatgttatagaatggctttatttactttggtgactgtttatagttt       c.*1440

          .         .         .         .         .         .       g.176742
 ttaaataaaagactgaacattttcttgagtccttcatttctgagtatgcttaagacatct       c.*1500

          .         .         .         .         .         .       g.176802
 taaaaatatagagagaattctaaattcagctgaaggcaaggtataacggtcacctaccta       c.*1560

          .         .         .         .         .         .       g.176862
 tttgattatatgttgattgataacatattaaatagagaacaaataagagaggtcctttac       c.*1620

          .         .         .         .         .         .       g.176922
 atgacaaatttgcatgaaataagcagattaaccaagtatttatttttcatcttgttataa       c.*1680

          .         .         .         .         .         .       g.176982
 tgcagagcaaatgtagagaacagcaaatgattgatgcagttaaagctcaatatgcctttt       c.*1740

          .         .         .         .         .         .       g.177042
 tttactggatactgtacatttggctaaaagcttttattgtttgatgttgtgtttcttgac       c.*1800

          .         .         .         .         .         .       g.177102
 tgtttattcagaatcacagtgtatccaaatcttcagcttgaatttggaggcagattctta       c.*1860

          .         .         .         .         .         .       g.177162
 gagtgaaaaagcctcagtttccatattaaaaatgttttaaatattttgattgaattagta       c.*1920

          .         .         .         .         .         .       g.177222
 ccaatgtaaaatctagtttcttcctgaaggaggatccctggcgctgtcctgccatgtctc       c.*1980

          .         .         .         .         .         .       g.177282
 aaaggaatgtttgagaaacttcatctaatattagttataaggttgtggaatttatgcttg       c.*2040

          .         .         .         .         .         .       g.177342
 gcccaccttccaagactggcactgcccaacagacaccgctgaaatcatgtgggtatccct       c.*2100

          .         .         .         .         .         .       g.177402
 aggatggccttcagagccctcaaacttacaagcacctggtagttgacatcatatggggaa       c.*2160

          .         .         .         .         .         .       g.177462
 ttttctattcaccgtacttatccaaaaatctcttttaaaaagtaaatttgtgcaacaacg       c.*2220

          .         .         .         .         .         .       g.177522
 tttatttgaaagataatgtcttctcaaaatcagaaactgcagtggtaattaaattaatag       c.*2280

          .         .         .         .         .         .       g.177582
 aaaagagaacaaactgcaggtttagaaaaatggttttcatattcaccattcttccacctc       c.*2340

          .         .         .         .         .         .       g.177642
 attgaattgcatgctgtagttctagcttttctgctataatatgtaaatatgactgtagcc       c.*2400

          .         .         .         .         .         .       g.177702
 ttttaagcttcagtctcagcagagaatttcctaaatgcgtttgacctaatgaaactgatc       c.*2460

          .         .         .         .         .         .       g.177762
 atggcttcccacttaggtttttcttcttatagctttatagaactatataataatatggac       c.*2520

          .         .         .         .         .         .       g.177822
 ttgctgtgtaatggaattaaagtgcttttgcacaataagttctgcaaaaccctctcattc       c.*2580

          .         .         .         .         .         .       g.177882
 atgaaaaggtgctccttgctagacagaaacttgctgatttacagtattgttatttttgtc       c.*2640

          .         .         .         .         .         .       g.177942
 taaagttctgtaaatacatgctttaatgttatctttgagaaatctatgtaaataatatag       c.*2700

          .         .         .         .         .         .       g.178002
 tctacaacatagagactgtataattctgtgttatatatgtgcctagtgctctgttggcac       c.*2760

          .         .         .         .         .         .       g.178062
 tcaataaattttaagtaacaaaattgataatcatatagcgaaggcatatttttcttccaa       c.*2820

          .         .         .         .         .         .       g.178122
 gctcaagtcaggattgtgactatatattaatgagactcagtaatccaacccacacctgag       c.*2880

          .         .         .         .         .         .       g.178182
 aactcgtctcattactttatagtcatgtcatgtatgtttttttaaccatgaaatgacaat       c.*2940

          .                                                         g.178200
 aaaatgatttttaaaatg                                                 c.*2958

 (downstream sequence)
Legend:
Nucleotide numbering (following the rules of the HGVS for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The AT rich interactive domain 2 (ARID, RFX-like) protein sequence is shown below the coding DNA sequence, with numbering indicated at the right starting with 1 for the translation initiating Methionine. Every 10th amino acid is shown in bold. The position of introns is indicated by a vertical line, splitting the two exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frame shift variants, all stop codons in the +1 frame are shown in bold while all stop codons in the +2 frame are underlined.

Powered by LOVD v.3.0 Build 26
©2004-2021 Leiden University Medical Center