AT rich interactive domain 1A (SWI-like) (ARID1A) - coding DNA reference sequence

(used for variant description)

(last modified September 3, 2015)


This file was created to facilitate the description of sequence variants on transcript NM_006015.4 in the ARID1A gene based on a coding DNA reference sequence following the HGVS recommendations.

The sequence was taken from NG_029965.1, covering ARID1A transcript NM_006015.4.


Please note that introns are available by clicking on the exon numbers above the sequence.
 (upstream sequence)
                                                   .                g.5013
                                                cagaaagcggaga       c.-361

 .         .         .         .         .         .                g.5073
 gtcacagcggggccaggccctggggagcggagcctccaccgcccccctcattcccaggca       c.-301

 .         .         .         .         .         .                g.5133
 agggcttggggggaatgagccgggagagccgggtcccgagcctacagagccgggagcagc       c.-241

 .         .         .         .         .         .                g.5193
 tgagccgccggcgcctcggccgccgccgccgcctcctcctcctccgccgccgccagcccg       c.-181

 .         .         .         .         .         .                g.5253
 gagcctgagccggcggggcgggggggagaggagcgagcgcagcgcagcagcggagccccg       c.-121

 .         .         .         .         .         .                g.5313
 cgaggcccgcccgggcgggtggggagggcagcccgggggactgggccccggggcggggtg       c.-61

 .         .         .         .         .         .                g.5373
 ggagggggggagaagacgaagacagggccgggtctctccgcggacgagacagcggggatc       c.-1

          .         .         .         .         .         .       g.5433
 ATGGCCGCGCAGGTCGCCCCCGCCGCCGCCAGCAGCCTGGGCAACCCGCCGCCGCCGCCG       c.60
 M  A  A  Q  V  A  P  A  A  A  S  S  L  G  N  P  P  P  P  P         p.20

          .         .         .         .         .         .       g.5493
 CCCTCGGAGCTGAAGAAAGCCGAGCAGCAGCAGCGGGAGGAGGCGGGGGGCGAGGCGGCG       c.120
 P  S  E  L  K  K  A  E  Q  Q  Q  R  E  E  A  G  G  E  A  A         p.40

          .         .         .         .         .         .       g.5553
 GCGGCGGCAGCGGCCGAGCGCGGGGAAATGAAGGCAGCCGCCGGGCAGGAAAGCGAGGGC       c.180
 A  A  A  A  A  E  R  G  E  M  K  A  A  A  G  Q  E  S  E  G         p.60

          .         .         .         .         .         .       g.5613
 CCCGCCGTGGGGCCGCCGCAGCCGCTGGGAAAGGAGCTGCAGGACGGGGCCGAGAGCAAT       c.240
 P  A  V  G  P  P  Q  P  L  G  K  E  L  Q  D  G  A  E  S  N         p.80

          .         .         .         .         .         .       g.5673
 GGGGGTGGCGGCGGCGGCGGAGCCGGCAGCGGCGGCGGGCCCGGCGCGGAGCCGGACCTG       c.300
 G  G  G  G  G  G  G  A  G  S  G  G  G  P  G  A  E  P  D  L         p.100

          .         .         .         .         .         .       g.5733
 AAGAACTCGAACGGGAACGCGGGCCCTAGGCCCGCCCTGAACAATAACCTCACGGAGCCG       c.360
 K  N  S  N  G  N  A  G  P  R  P  A  L  N  N  N  L  T  E  P         p.120

          .         .         .         .         .         .       g.5793
 CCCGGCGGCGGCGGTGGCGGCAGCAGCGATGGGGTGGGGGCGCCTCCTCACTCAGCCGCG       c.420
 P  G  G  G  G  G  G  S  S  D  G  V  G  A  P  P  H  S  A  A         p.140

          .         .         .         .         .         .       g.5853
 GCCGCCTTGCCGCCCCCAGCCTACGGCTTCGGGCAACCCTACGGCCGGAGCCCGTCTGCC       c.480
 A  A  L  P  P  P  A  Y  G  F  G  Q  P  Y  G  R  S  P  S  A         p.160

          .         .         .         .         .         .       g.5913
 GTCGCCGCCGCCGCGGCCGCCGTCTTCCACCAACAACATGGCGGACAACAAAGCCCTGGC       c.540
 V  A  A  A  A  A  A  V  F  H  Q  Q  H  G  G  Q  Q  S  P  G         p.180

          .         .         .         .         .         .       g.5973
 CTGGCAGCGCTGCAGAGCGGCGGCGGCGGGGGCCTGGAGCCCTACGCGGGGCCCCAGCAG       c.600
 L  A  A  L  Q  S  G  G  G  G  G  L  E  P  Y  A  G  P  Q  Q         p.200

          .         .         .         .         .         .       g.6033
 AACTCTCACGACCACGGCTTCCCCAACCACCAGTACAACTCCTACTACCCCAACCGCAGC       c.660
 N  S  H  D  H  G  F  P  N  H  Q  Y  N  S  Y  Y  P  N  R  S         p.220

          .         .         .         .         .         .       g.6093
 GCCTACCCCCCGCCCGCCCCGGCCTACGCGCTGAGCTCCCCGAGAGGTGGCACTCCGGGC       c.720
 A  Y  P  P  P  A  P  A  Y  A  L  S  S  P  R  G  G  T  P  G         p.240

          .         .         .         .         .         .       g.6153
 TCCGGCGCGGCGGCGGCTGCCGGCTCCAAGCCGCCTCCCTCCTCCAGCGCCTCCGCCTCC       c.780
 S  G  A  A  A  A  A  G  S  K  P  P  P  S  S  S  A  S  A  S         p.260

          .         .         .         .         .         .       g.6213
 TCGTCGTCTTCGTCCTTCGCTCAGCAGCGCTTCGGGGCCATGGGGGGAGGCGGCCCCTCC       c.840
 S  S  S  S  S  F  A  Q  Q  R  F  G  A  M  G  G  G  G  P  S         p.280

          .         .         .         .         .         .       g.6273
 GCGGCCGGCGGGGGAACTCCCCAGCCCACCGCCACCCCCACCCTCAACCAACTGCTCACG       c.900
 A  A  G  G  G  T  P  Q  P  T  A  T  P  T  L  N  Q  L  L  T         p.300

          .         .         .         .         .         .       g.6333
 TCGCCCAGCTCGGCCCGGGGCTACCAGGGCTACCCCGGGGGCGACTACAGTGGCGGGCCC       c.960
 S  P  S  S  A  R  G  Y  Q  G  Y  P  G  G  D  Y  S  G  G  P         p.320

          .         .         .         .         .         .       g.6393
 CAGGACGGGGGCGCCGGCAAGGGCCCGGCGGACATGGCCTCGCAGTGTTGGGGGGCTGCG       c.1020
 Q  D  G  G  A  G  K  G  P  A  D  M  A  S  Q  C  W  G  A  A         p.340

          .         .         .         .         .         .       g.6453
 GCGGCGGCAGCTGCGGCGGCGGCCGCCTCGGGAGGGGCCCAACAAAGGAGCCACCACGCG       c.1080
 A  A  A  A  A  A  A  A  A  S  G  G  A  Q  Q  R  S  H  H  A         p.360

          .         .         .         .         .        | 02.    g.38623
 CCCATGAGCCCCGGGAGCAGCGGCGGCGGGGGGCAGCCGCTCGCCCGGACCCCTCAG | CCA    c.1140
 P  M  S  P  G  S  S  G  G  G  G  Q  P  L  A  R  T  P  Q   | P      p.380

          .         .         .         .         .         .       g.38683
 TCCAGTCCAATGGATCAGATGGGCAAGATGAGACCTCAGCCATATGGCGGGACTAACCCA       c.1200
 S  S  P  M  D  Q  M  G  K  M  R  P  Q  P  Y  G  G  T  N  P         p.400

          .         .         .         .         .         .       g.38743
 TACTCGCAGCAACAGGGACCTCCGTCAGGACCGCAGCAAGGACATGGGTACCCAGGGCAG       c.1260
 Y  S  Q  Q  Q  G  P  P  S  G  P  Q  Q  G  H  G  Y  P  G  Q         p.420

          .         .         .         .         .         .       g.38803
 CCATACGGGTCCCAGACCCCGCAGCGGTACCCGATGACCATGCAGGGCCGGGCGCAGAGT       c.1320
 P  Y  G  S  Q  T  P  Q  R  Y  P  M  T  M  Q  G  R  A  Q  S         p.440

          .         .         . | 03       .         .         .    g.40151
 GCCATGGGCGGCCTCTCTTATACACAGCAG | ATTCCTCCTTATGGACAACAAGGCCCCAGC    c.1380
 A  M  G  G  L  S  Y  T  Q  Q   | I  P  P  Y  G  Q  Q  G  P  S      p.460

          .         .         .         .         .         .       g.40211
 GGGTATGGTCAACAGGGCCAGACTCCATATTACAACCAGCAAAGTCCTCACCCTCAGCAG       c.1440
 G  Y  G  Q  Q  G  Q  T  P  Y  Y  N  Q  Q  S  P  H  P  Q  Q         p.480

          .         .         .         .         .         .       g.40271
 CAGCAGCCACCCTACTCCCAGCAACCACCGTCCCAGACCCCTCATGCCCAACCTTCGTAT       c.1500
 Q  Q  P  P  Y  S  Q  Q  P  P  S  Q  T  P  H  A  Q  P  S  Y         p.500

          .         .         .         .         .         .       g.40331
 CAGCAGCAGCCACAGTCTCAACCACCACAGCTCCAGTCCTCTCAGCCTCCATACTCCCAG       c.1560
 Q  Q  Q  P  Q  S  Q  P  P  Q  L  Q  S  S  Q  P  P  Y  S  Q         p.520

          .         .         .         .         .         .       g.40391
 CAGCCATCCCAGCCTCCACATCAGCAGTCCCCGGCTCCATACCCCTCCCAGCAGTCGACG       c.1620
 Q  P  S  Q  P  P  H  Q  Q  S  P  A  P  Y  P  S  Q  Q  S  T         p.540

          .         .         .         .         .         .       g.40451
 ACACAGCAGCACCCCCAGAGCCAGCCCCCCTACTCACAGCCACAGGCTCAGTCTCCTTAC       c.1680
 T  Q  Q  H  P  Q  S  Q  P  P  Y  S  Q  P  Q  A  Q  S  P  Y         p.560

          .         .         .         .         .         .       g.40511
 CAGCAGCAGCAACCTCAGCAGCCAGCACCCTCGACGCTCTCCCAGCAGGCTGCGTATCCT       c.1740
 Q  Q  Q  Q  P  Q  Q  P  A  P  S  T  L  S  Q  Q  A  A  Y  P         p.580

          .         .         .         .         .         .       g.40571
 CAGCCCCAGTCTCAGCAGTCCCAGCAAACTGCCTATTCCCAGCAGCGCTTCCCTCCACCG       c.1800
 Q  P  Q  S  Q  Q  S  Q  Q  T  A  Y  S  Q  Q  R  F  P  P  P         p.600

     | 04    .         .         .         .         .         .    g.41702
 CAG | GAGCTATCTCAAGATTCATTTGGGTCTCAGGCATCCTCAGCCCCCTCAATGACCTCC    c.1860
 Q   | E  L  S  Q  D  S  F  G  S  Q  A  S  S  A  P  S  M  T  S      p.620

          .         .         .         .         .         .       g.41762
 AGTAAGGGAGGGCAAGAAGATATGAACCTGAGCCTTCAGTCAAGACCCTCCAGCTTGCCT       c.1920
 S  K  G  G  Q  E  D  M  N  L  S  L  Q  S  R  P  S  S  L  P         p.640

  | 05       .         .         .         .         .         .    g.69885
  | GATCTATCTGGTTCAATAGATGACCTCCCCATGGGGACAGAAGGAGCTCTGAGTCCTGGA    c.1980
  | D  L  S  G  S  I  D  D  L  P  M  G  T  E  G  A  L  S  P  G      p.660

          .         .         .         .         .         .       g.69945
 GTGAGCACATCAGGGATTTCCAGCAGCCAAGGAGAGCAGAGTAATCCAGCTCAGTCTCCT       c.2040
 V  S  T  S  G  I  S  S  S  Q  G  E  Q  S  N  P  A  Q  S  P         p.680

          .         .         .         .         .         .       g.70005
 TTCTCTCCTCATACCTCCCCTCACCTGCCTGGCATCCGAGGCCCTTCCCCGTCCCCTGTT       c.2100
 F  S  P  H  T  S  P  H  L  P  G  I  R  G  P  S  P  S  P  V         p.700

          .         .         .         .         .         .       g.70065
 GGCTCTCCCGCCAGTGTTGCTCAGTCTCGCTCAGGACCACTCTCGCCTGCTGCAGTGCCA       c.2160
 G  S  P  A  S  V  A  Q  S  R  S  G  P  L  S  P  A  A  V  P         p.720

   | 06      .         .         .         .         .         .    g.70412
 G | GCAACCAGATGCCACCTCGGCCACCCAGTGGCCAGTCGGACAGCATCATGCATCCTTCC    c.2220
 G |   N  Q  M  P  P  R  P  P  S  G  Q  S  D  S  I  M  H  P  S      p.740

          .         .         .  | 07      .         .         .    g.71150
 ATGAACCAATCAAGCATTGCCCAAGATCGAG | GTTATATGCAGAGGAACCCCCAGATGCCC    c.2280
 M  N  Q  S  S  I  A  Q  D  R  G |   Y  M  Q  R  N  P  Q  M  P      p.760

          .         .         .         .         .         .       g.71210
 CAGTACAGTTCCCCCCAGCCCGGCTCAGCCTTATCTCCGCGTCAGCCTTCCGGAGGACAG       c.2340
 Q  Y  S  S  P  Q  P  G  S  A  L  S  P  R  Q  P  S  G  G  Q         p.780

          .         .         .         .         .         .       g.71270
 ATACACACAGGCATGGGCTCCTACCAGCAGAACTCCATGGGGAGCTATGGTCCCCAGGGG       c.2400
 I  H  T  G  M  G  S  Y  Q  Q  N  S  M  G  S  Y  G  P  Q  G         p.800

          .          | 08        .         .         .         .    g.71983
 GGTCAGTATGGCCCACAAG | GTGGCTACCCCAGGCAGCCAAACTATAATGCCTTGCCCAAT    c.2460
 G  Q  Y  G  P  Q  G |   G  Y  P  R  Q  P  N  Y  N  A  L  P  N      p.820

          .         .         .         .         .         .       g.72043
 GCCAACTACCCCAGTGCAGGCATGGCTGGAGGCATAAACCCCATGGGTGCCGGAGGTCAA       c.2520
 A  N  Y  P  S  A  G  M  A  G  G  I  N  P  M  G  A  G  G  Q         p.840

          .         .         .         .         .         .       g.72103
 ATGCATGGACAGCCTGGCATCCCACCTTATGGCACACTCCCTCCAGGGAGGATGAGTCAC       c.2580
 M  H  G  Q  P  G  I  P  P  Y  G  T  L  P  P  G  R  M  S  H         p.860

          .         .         .         .         .         .       g.72163
 GCCTCCATGGGCAACCGGCCTTATGGCCCTAACATGGCCAATATGCCACCTCAGGTTGGG       c.2640
 A  S  M  G  N  R  P  Y  G  P  N  M  A  N  M  P  P  Q  V  G         p.880

          .         .         .         .         .         .       g.72223
 TCAGGGATGTGTCCCCCACCAGGGGGCATGAACCGGAAAACCCAAGAAACTGCTGTCGCC       c.2700
 S  G  M  C  P  P  P  G  G  M  N  R  K  T  Q  E  T  A  V  A         p.900

          .         .         .   | 09     .         .         .    g.75218
 ATGCATGTTGCTGCCAACTCTATCCAAAACAG | GCCGCCAGGCTACCCCAATATGAATCAA    c.2760
 M  H  V  A  A  N  S  I  Q  N  R  |  P  P  G  Y  P  N  M  N  Q      p.920

          .         .         .         .         .         .       g.75278
 GGGGGCATGATGGGAACTGGACCTCCTTATGGACAAGGGATTAATAGTATGGCTGGCATG       c.2820
 G  G  M  M  G  T  G  P  P  Y  G  Q  G  I  N  S  M  A  G  M         p.940

          .         .         .         .         .         | 10    g.75428
 ATCAACCCTCAGGGACCCCCATATTCCATGGGTGGAACCATGGCCAACAATTCTGCAG | GG    c.2880
 I  N  P  Q  G  P  P  Y  S  M  G  G  T  M  A  N  N  S  A  G |       p.960

          .         .         .         .         .         .       g.75488
 ATGGCAGCCAGCCCAGAGATGATGGGCCTTGGGGATGTAAAGTTAACTCCAGCCACCAAA       c.2940
 M  A  A  S  P  E  M  M  G  L  G  D  V  K  L  T  P  A  T  K         p.980

          .         .         .         .         | 11         .    g.76771
 ATGAACAACAAGGCAGATGGGACACCCAAGACAGAATCCAAATCCAAG | AAATCCAGTTCT    c.3000
 M  N  N  K  A  D  G  T  P  K  T  E  S  K  S  K   | K  S  S  S      p.1000

          .         .         .         .         .         .       g.76831
 TCTACTACAACCAATGAGAAGATCACCAAGTTGTATGAGCTGGGTGGTGAGCCTGAGAGG       c.3060
 S  T  T  T  N  E  K  I  T  K  L  Y  E  L  G  G  E  P  E  R         p.1020

          .         .         .         .         .         .       g.76891
 AAGATGTGGGTGGACCGTTATCTGGCCTTCACTGAGGAGAAGGCCATGGGCATGACAAAT       c.3120
 K  M  W  V  D  R  Y  L  A  F  T  E  E  K  A  M  G  M  T  N         p.1040

          .         .         .         .         .         .       g.76951
 CTGCCTGCTGTGGGTAGGAAACCTCTGGACCTCTATCGCCTCTATGTGTCTGTGAAGGAG       c.3180
 L  P  A  V  G  R  K  P  L  D  L  Y  R  L  Y  V  S  V  K  E         p.1060

          .         | 12         .         .         .         .    g.80130
 ATTGGTGGATTGACTCAG | GTCAACAAGAACAAAAAATGGCGGGAACTTGCAACCAACCTC    c.3240
 I  G  G  L  T  Q   | V  N  K  N  K  K  W  R  E  L  A  T  N  L      p.1080

          .         .         .         .         .         .       g.80190
 AATGTGGGCACATCAAGCAGTGCTGCCAGCTCCTTGAAAAAGCAGTATATCCAGTGTCTC       c.3300
 N  V  G  T  S  S  S  A  A  S  S  L  K  K  Q  Y  I  Q  C  L         p.1100

          .         .         .         .         .         .       g.80250
 TATGCCTTTGAATGCAAGATTGAACGGGGAGAAGACCCTCCCCCAGACATCTTTGCAGCT       c.3360
 Y  A  F  E  C  K  I  E  R  G  E  D  P  P  P  D  I  F  A  A         p.1120

          .         .         .         .       | 13 .         .    g.81483
 GCTGATTCCAAGAAGTCCCAGCCCAAGATCCAGCCTCCCTCTCCTG | CGGGATCAGGATCT    c.3420
 A  D  S  K  K  S  Q  P  K  I  Q  P  P  S  P  A |   G  S  G  S      p.1140

          .         .         .         .         .         .       g.81543
 ATGCAGGGGCCCCAGACTCCCCAGTCAACCAGCAGTTCCATGGCAGAAGGAGGAGACTTA       c.3480
 M  Q  G  P  Q  T  P  Q  S  T  S  S  S  M  A  E  G  G  D  L         p.1160

          .         .         .         .         .          | 14    g.81782
 AAGCCACCAACTCCAGCATCCACACCACACAGTCAGATCCCCCCATTGCCAGGCATGAG | C    c.3540
 K  P  P  T  P  A  S  T  P  H  S  Q  I  P  P  L  P  G  M  S  |      p.1180

          .         .         .         .         .         .       g.81842
 AGGAGCAATTCAGTTGGGATCCAGGATGCCTTTAATGATGGAAGTGACTCCACATTCCAG       c.3600
 R  S  N  S  V  G  I  Q  D  A  F  N  D  G  S  D  S  T  F  Q         p.1200

          .         .         .         .         .         .       g.81902
 AAGCGGAATTCCATGACTCCAAACCCTGGGTATCAGCCCAGTATGAATACCTCTGACATG       c.3660
 K  R  N  S  M  T  P  N  P  G  Y  Q  P  S  M  N  T  S  D  M         p.1220

          .         .         .         .         .      | 15  .    g.82320
 ATGGGGCGCATGTCCTATGAGCCAAATAAGGATCCTTATGGCAGCATGAGGAAAG | CTCCA    c.3720
 M  G  R  M  S  Y  E  P  N  K  D  P  Y  G  S  M  R  K  A |   P      p.1240

          .         .         .         .         .         .       g.82380
 GGGAGTGATCCCTTCATGTCCTCAGGGCAGGGCCCCAACGGCGGGATGGGTGACCCCTAC       c.3780
 G  S  D  P  F  M  S  S  G  Q  G  P  N  G  G  M  G  D  P  Y         p.1260

          .         .         .         .         .         .       g.82440
 AGTCGTGCTGCCGGCCCTGGGCTAGGAAATGTGGCGATGGGACCACGACAGCACTATCCC       c.3840
 S  R  A  A  G  P  G  L  G  N  V  A  M  G  P  R  Q  H  Y  P         p.1280

          .         .       | 16 .         .         .         .    g.82583
 TATGGAGGTCCTTATGACAGAGTGAG | GACGGAGCCTGGAATAGGGCCTGAGGGAAACATG    c.3900
 Y  G  G  P  Y  D  R  V  R  |  T  E  P  G  I  G  P  E  G  N  M      p.1300

          .         .         .         .         .         .       g.82643
 AGCACTGGGGCCCCACAGCCGAATCTCATGCCTTCCAACCCAGACTCGGGGATGTATTCT       c.3960
 S  T  G  A  P  Q  P  N  L  M  P  S  N  P  D  S  G  M  Y  S         p.1320

          .         .         .         .     | 17   .         .    g.82787
 CCTAGCCGCTACCCCCCGCAGCAGCAGCAGCAGCAGCAGCAACG | ACATGATTCCTATGGC    c.4020
 P  S  R  Y  P  P  Q  Q  Q  Q  Q  Q  Q  Q  R  |  H  D  S  Y  G      p.1340

          .         .         .         .         .         .       g.82847
 AATCAGTTCTCCACCCAAGGCACCCCTTCTGGCAGCCCCTTCCCCAGCCAGCAGACTACA       c.4080
 N  Q  F  S  T  Q  G  T  P  S  G  S  P  F  P  S  Q  Q  T  T         p.1360

          .         .  | 18      .         .         .         .    g.83337
 ATGTATCAACAGCAACAGCAG | AATTACAAGCGGCCAATGGATGGCACATATGGCCCTCCT    c.4140
 M  Y  Q  Q  Q  Q  Q   | N  Y  K  R  P  M  D  G  T  Y  G  P  P      p.1380

          .         .         .         .         .         .       g.83397
 GCCAAGCGGCACGAAGGGGAGATGTACAGCGTGCCATACAGCACTGGGCAGGGGCAGCCT       c.4200
 A  K  R  H  E  G  E  M  Y  S  V  P  Y  S  T  G  Q  G  Q  P         p.1400

          .         .         .         .         .         .       g.83457
 CAGCAGCAGCAGTTGCCCCCAGCCCAGCCCCAGCCTGCCAGCCAGCAACAAGCTGCCCAG       c.4260
 Q  Q  Q  Q  L  P  P  A  Q  P  Q  P  A  S  Q  Q  Q  A  A  Q         p.1420

          .         .         .         .         .         .       g.83517
 CCTTCCCCTCAGCAAGATGTATACAACCAGTATGGCAATGCCTATCCTGCCACTGCCACA       c.4320
 P  S  P  Q  Q  D  V  Y  N  Q  Y  G  N  A  Y  P  A  T  A  T         p.1440

          .         .         .         .         .         .       g.83577
 GCTGCTACTGAGCGCCGACCAGCAGGCGGCCCCCAGAACCAATTTCCATTCCAGTTTGGC       c.4380
 A  A  T  E  R  R  P  A  G  G  P  Q  N  Q  F  P  F  Q  F  G         p.1460

          .         .         .         .         .         .       g.83637
 CGAGACCGTGTCTCTGCACCCCCTGGCACCAATGCCCAGCAAAACATGCCACCACAAATG       c.4440
 R  D  R  V  S  A  P  P  G  T  N  A  Q  Q  N  M  P  P  Q  M         p.1480

          .         .         .         .         .         .       g.83697
 ATGGGCGGCCCCATACAGGCATCAGCTGAGGTTGCTCAGCAAGGCACCATGTGGCAGGGG       c.4500
 M  G  G  P  I  Q  A  S  A  E  V  A  Q  Q  G  T  M  W  Q  G         p.1500

          .         .         .         .         .         .       g.83757
 CGTAATGACATGACCTATAATTATGCCAACAGGCAGAGCACGGGCTCTGCCCCCCAGGGC       c.4560
 R  N  D  M  T  Y  N  Y  A  N  R  Q  S  T  G  S  A  P  Q  G         p.1520

          .         .         .         .         .         .       g.83817
 CCCGCCTATCATGGCGTGAACCGAACAGATGAAATGCTGCACACAGATCAGAGGGCCAAC       c.4620
 P  A  Y  H  G  V  N  R  T  D  E  M  L  H  T  D  Q  R  A  N         p.1540

          .         .         .         .         .         .       g.83877
 CACGAAGGCTCGTGGCCTTCCCATGGCACACGCCAGCCCCCATATGGTCCCTCTGCCCCT       c.4680
 H  E  G  S  W  P  S  H  G  T  R  Q  P  P  Y  G  P  S  A  P         p.1560

          .         .         .         .         .         .       g.83937
 GTGCCCCCCATGACAAGGCCCCCTCCATCTAACTACCAGCCCCCACCAAGCATGCAGAAT       c.4740
 V  P  P  M  T  R  P  P  P  S  N  Y  Q  P  P  P  S  M  Q  N         p.1580

          .         .         .         .         .         .       g.83997
 CACATTCCTCAGGTATCCAGCCCTGCTCCCCTGCCCCGGCCAATGGAGAACCGCACCTCT       c.4800
 H  I  P  Q  V  S  S  P  A  P  L  P  R  P  M  E  N  R  T  S         p.1600

          .         .         .         .         .         .       g.84057
 CCTAGCAAGTCTCCATTCCTGCACTCTGGGATGAAAATGCAGAAGGCAGGTCCCCCAGTA       c.4860
 P  S  K  S  P  F  L  H  S  G  M  K  M  Q  K  A  G  P  P  V         p.1620

          .         .         .         .         .         .       g.84117
 CCTGCCTCGCACATAGCACCTGCCCCTGTGCAGCCCCCCATGATTCGGCGGGATATCACC       c.4920
 P  A  S  H  I  A  P  A  P  V  Q  P  P  M  I  R  R  D  I  T         p.1640

          .         .         .         .         .         .       g.84177
 TTCCCACCTGGCTCTGTTGAAGCCACACAGCCTGTGTTGAAGCAGAGGAGGCGGCTCACA       c.4980
 F  P  P  G  S  V  E  A  T  Q  P  V  L  K  Q  R  R  R  L  T         p.1660

          .    | 19    .         .         .         .         .    g.84593
 ATGAAAGACATTG | GAACCCCGGAGGCATGGCGGGTAATGATGTCCCTCAAGTCTGGTCTC    c.5040
 M  K  D  I  G |   T  P  E  A  W  R  V  M  M  S  L  K  S  G  L      p.1680

          .         .         .         .         .         .       g.84653
 CTGGCAGAGAGCACATGGGCATTAGATACCATCAACATCCTGCTGTATGATGACAACAGC       c.5100
 L  A  E  S  T  W  A  L  D  T  I  N  I  L  L  Y  D  D  N  S         p.1700

          .         .     | 20   .         .         .         .    g.88028
 ATCATGACCTTCAACCTCAGTCAG | CTCCCAGGGTTGCTAGAGCTCCTTGTAGAATATTTC    c.5160
 I  M  T  F  N  L  S  Q   | L  P  G  L  L  E  L  L  V  E  Y  F      p.1720

          .         .         .         .         .         .       g.88088
 CGACGATGCCTGATTGAGATCTTTGGCATTTTAAAGGAGTATGAGGTGGGTGACCCAGGA       c.5220
 R  R  C  L  I  E  I  F  G  I  L  K  E  Y  E  V  G  D  P  G         p.1740

          .         .         .         .         .         .       g.88148
 CAGAGAACGCTACTGGATCCTGGGAGGTTCAGCAAGGTGTCTAGTCCAGCTCCCATGGAG       c.5280
 Q  R  T  L  L  D  P  G  R  F  S  K  V  S  S  P  A  P  M  E         p.1760

          .         .         .         .         .         .       g.88208
 GGTGGGGAAGAAGAAGAAGAACTTCTAGGTCCTAAACTAGAAGAGGAAGAAGAAGAGGAA       c.5340
 G  G  E  E  E  E  E  L  L  G  P  K  L  E  E  E  E  E  E  E         p.1780

          .         .         .         .         .         .       g.88268
 GTAGTTGAAAATGATGAGGAGATAGCCTTTTCAGGCAAGGACAAGCCAGCTTCAGAGAAT       c.5400
 V  V  E  N  D  E  E  I  A  F  S  G  K  D  K  P  A  S  E  N         p.1800

          .         .         .         .         .         .       g.88328
 AGTGAGGAGAAGCTGATCAGTAAGTTTGACAAGCTTCCAGTAAAGATCGTACAGAAGAAT       c.5460
 S  E  E  K  L  I  S  K  F  D  K  L  P  V  K  I  V  Q  K  N         p.1820

          .         .         .         .         .         .       g.88388
 GATCCATTTGTGGTGGACTGCTCAGATAAGCTTGGGCGTGTGCAGGAGTTTGACAGTGGC       c.5520
 D  P  F  V  V  D  C  S  D  K  L  G  R  V  Q  E  F  D  S  G         p.1840

          .         .         .         .         .         .       g.88448
 CTGCTGCACTGGCGGATTGGTGGGGGGGACACCACTGAGCATATCCAGACCCACTTCGAG       c.5580
 L  L  H  W  R  I  G  G  G  D  T  T  E  H  I  Q  T  H  F  E         p.1860

          .         .         .         .         .         .       g.88508
 AGCAAGACAGAGCTGCTGCCTTCCCGGCCTCACGCACCCTGCCCACCAGCCCCTCGGAAG       c.5640
 S  K  T  E  L  L  P  S  R  P  H  A  P  C  P  P  A  P  R  K         p.1880

          .         .         .         .         .         .       g.88568
 CATGTGACAACAGCAGAGGGTACACCAGGGACAACAGACCAGGAGGGGCCCCCACCTGAT       c.5700
 H  V  T  T  A  E  G  T  P  G  T  T  D  Q  E  G  P  P  P  D         p.1900

          .         .         .         .         .         .       g.88628
 GGACCTCCAGAAAAACGGATCACAGCCACTATGGATGACATGTTGTCTACTCGGTCTAGC       c.5760
 G  P  P  E  K  R  I  T  A  T  M  D  D  M  L  S  T  R  S  S         p.1920

          .         .         .         .         .         .       g.88688
 ACCTTGACCGAGGATGGAGCTAAGAGTTCAGAGGCCATCAAGGAGAGCAGCAAGTTTCCA       c.5820
 T  L  T  E  D  G  A  K  S  S  E  A  I  K  E  S  S  K  F  P         p.1940

          .         .         .         .         .         .       g.88748
 TTTGGCATTAGCCCAGCACAGAGCCACCGGAACATCAAGATCCTAGAGGACGAACCCCAC       c.5880
 F  G  I  S  P  A  Q  S  H  R  N  I  K  I  L  E  D  E  P  H         p.1960

          .         .         .         .         .         .       g.88808
 AGTAAGGATGAGACCCCACTGTGTACCCTTCTGGACTGGCAGGATTCTCTTGCCAAGCGC       c.5940
 S  K  D  E  T  P  L  C  T  L  L  D  W  Q  D  S  L  A  K  R         p.1980

          .         .         .         .         .         .       g.88868
 TGCGTCTGTGTGTCCAATACCATTCGAAGCCTGTCATTTGTGCCAGGCAATGACTTTGAG       c.6000
 C  V  C  V  S  N  T  I  R  S  L  S  F  V  P  G  N  D  F  E         p.2000

          .         .         .         .         .         .       g.88928
 ATGTCCAAACACCCAGGGCTGCTGCTCATCCTGGGCAAGCTGATCCTGCTGCACCACAAG       c.6060
 M  S  K  H  P  G  L  L  L  I  L  G  K  L  I  L  L  H  H  K         p.2020

          .         .         .         .         .         .       g.88988
 CACCCAGAACGGAAGCAGGCACCACTAACTTATGAAAAGGAGGAGGAACAGGACCAAGGG       c.6120
 H  P  E  R  K  Q  A  P  L  T  Y  E  K  E  E  E  Q  D  Q  G         p.2040

          .         .         .         .         .         .       g.89048
 GTGAGCTGCAACAAAGTGGAGTGGTGGTGGGACTGCTTGGAGATGCTCCGGGAAAACACC       c.6180
 V  S  C  N  K  V  E  W  W  W  D  C  L  E  M  L  R  E  N  T         p.2060

          .         .         .         .         .         .       g.89108
 TTGGTTACACTCGCCAACATCTCGGGGCAGTTGGACCTATCTCCATACCCCGAGAGCATT       c.6240
 L  V  T  L  A  N  I  S  G  Q  L  D  L  S  P  Y  P  E  S  I         p.2080

          .         .         .         .         .         .       g.89168
 TGCCTGCCTGTCCTGGACGGACTCCTACACTGGGCAGTTTGCCCTTCAGCTGAAGCCCAG       c.6300
 C  L  P  V  L  D  G  L  L  H  W  A  V  C  P  S  A  E  A  Q         p.2100

          .         .         .         .         .         .       g.89228
 GACCCCTTTTCCACCCTGGGCCCCAATGCCGTCCTTTCCCCGCAGAGACTGGTCTTGGAA       c.6360
 D  P  F  S  T  L  G  P  N  A  V  L  S  P  Q  R  L  V  L  E         p.2120

          .         .         .         .         .         .       g.89288
 ACCCTCAGCAAACTCAGCATCCAGGACAACAATGTGGACCTGATTCTGGCCACACCCCCC       c.6420
 T  L  S  K  L  S  I  Q  D  N  N  V  D  L  I  L  A  T  P  P         p.2140

          .         .         .         .         .         .       g.89348
 TTCAGCCGCCTGGAGAAGTTGTATAGCACTATGGTGCGCTTCCTCAGTGACCGAAAGAAC       c.6480
 F  S  R  L  E  K  L  Y  S  T  M  V  R  F  L  S  D  R  K  N         p.2160

          .         .         .         .         .         .       g.89408
 CCGGTGTGCCGGGAGATGGCTGTGGTACTGCTGGCCAACCTGGCTCAGGGGGACAGCCTG       c.6540
 P  V  C  R  E  M  A  V  V  L  L  A  N  L  A  Q  G  D  S  L         p.2180

          .         .         .         .         .         .       g.89468
 GCAGCTCGTGCCATTGCAGTGCAGAAGGGCAGTATCGGCAACCTCCTGGGCTTCCTAGAG       c.6600
 A  A  R  A  I  A  V  Q  K  G  S  I  G  N  L  L  G  F  L  E         p.2200

          .         .         .         .         .         .       g.89528
 GACAGCCTTGCCGCCACACAGTTCCAGCAGAGCCAGGCCAGCCTCCTCCACATGCAGAAC       c.6660
 D  S  L  A  A  T  Q  F  Q  Q  S  Q  A  S  L  L  H  M  Q  N         p.2220

          .         .         .         .         .         .       g.89588
 CCACCCTTTGAGCCAACTAGTGTGGACATGATGCGGCGGGCTGCCCGCGCGCTGCTTGCC       c.6720
 P  P  F  E  P  T  S  V  D  M  M  R  R  A  A  R  A  L  L  A         p.2240

          .         .         .         .         .         .       g.89648
 TTGGCCAAGGTGGACGAGAACCACTCAGAGTTTACTCTGTACGAATCACGGCTGTTGGAC       c.6780
 L  A  K  V  D  E  N  H  S  E  F  T  L  Y  E  S  R  L  L  D         p.2260

          .         .         .         .         .         .       g.89708
 ATCTCGGTATCACCGTTGATGAACTCATTGGTTTCACAAGTCATTTGTGATGTACTGTTT       c.6840
 I  S  V  S  P  L  M  N  S  L  V  S  Q  V  I  C  D  V  L  F         p.2280

          .                                                         g.89726
 TTGATTGGCCAGTCATGA                                                 c.6858
 L  I  G  Q  S  X                                                   p.2285

          .         .         .         .         .         .       g.89786
 cagccgtgggacacctcccccccccgtgtgtgtgtgcgtgtgtggagaacttagaaactg       c.*60

          .         .         .         .         .         .       g.89846
 actgttgccctttatttatgcaaaaccacctcagaatccagtttaccctgtgctgtccag       c.*120

          .         .         .         .         .         .       g.89906
 cttctcccttgggaaaaagtctctcctgtttctctctcctccttccacctcccctccctc       c.*180

          .         .         .         .         .         .       g.89966
 catcacctcacgcctttctgttccttgtcctcaccttactcccctcaggaccctacccca       c.*240

          .         .         .         .         .         .       g.90026
 ccctctttgaaaagacaaagctctgcctacatagaagactttttttattttaaccaaagt       c.*300

          .         .         .         .         .         .       g.90086
 tactgttgtttacagtgagtttggggaaaaaaaataaaataaaaatggctttcccagtcc       c.*360

          .         .         .         .         .         .       g.90146
 ttgcatcaacgggatgccacatttcataactgtttttaatggtaaaaaaaaaaaaaaaaa       c.*420

          .         .         .         .         .         .       g.90206
 atacaaaaaaaaattctgaaggacaaaaaaggtgactgctgaactgtgtgtggtttattg       c.*480

          .         .         .         .         .         .       g.90266
 ttgtacattcacaatcttgcaggagccaagaagttcgcagttgtgaacagaccctgttca       c.*540

          .         .         .         .         .         .       g.90326
 ctggagaggcctgtgcagtagagtgtagaccctttcatgtactgtactgtacacctgata       c.*600

          .         .         .         .         .         .       g.90386
 ctgtaaacatactgtaataataatgtctcacatggaaacagaaaacgctgggtcagcagc       c.*660

          .         .         .         .         .         .       g.90446
 aagctgtagtttttaaaaatgtttttagttaaacgttgaggagaaaaaaaaaaaaggctt       c.*720

          .         .         .         .         .         .       g.90506
 ttcccccaaagtatcatgtgtgaacctacaacaccctgacctctttctctcctccttgat       c.*780

          .         .         .         .         .         .       g.90566
 tgtatgaataaccctgagatcacctcttagaactggttttaacctttagctgcagcggct       c.*840

          .         .         .         .         .         .       g.90626
 acgctgccacgtgtgtatatatatgacgttgtacattgcacatacccttggatccccaca       c.*900

          .         .         .         .         .         .       g.90686
 gtttggtcctcctcccagctacccctttatagtatgacgagttaacaagttggtgacctg       c.*960

          .         .         .         .         .         .       g.90746
 cacaaagcgagacacagctatttaatctcttgccagatatcgcccctcttggtgcgatgc       c.*1020

          .         .         .         .         .         .       g.90806
 tgtacaggtctctgtaaaaagtccttgctgtctcagcagccaatcaacttatagtttatt       c.*1080

          .         .         .         .         .         .       g.90866
 tttttctgggtttttgttttgttttgttttctttctaatcgaggtgtgaaaaagttctag       c.*1140

          .         .         .         .         .         .       g.90926
 gttcagttgaagttctgatgaagaaacacaattgagattttttcagtgataaaatctgca       c.*1200

          .         .         .         .         .         .       g.90986
 tatttgtatttcaacaatgtagctaaaacttgatgtaaattcctcctttttttccttttt       c.*1260

          .         .         .         .         .         .       g.91046
 tggcttaatgaatatcatttattcagtatgaaatctttatactatatgttccacgtgtta       c.*1320

          .         .         .                                     g.91080
 agaataaatgtacattaaatcttggtaagacttt                                 c.*1354

 (downstream sequence)
Legend:
Nucleotide numbering (following the rules of the HGVS for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The AT rich interactive domain 1A (SWI-like) protein sequence is shown below the coding DNA sequence, with numbering indicated at the right starting with 1 for the translation initiating Methionine. Every 10th amino acid is shown in bold. The position of introns is indicated by a vertical line, splitting the two exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frame shift variants, all stop codons in the +1 frame are shown in bold while all stop codons in the +2 frame are underlined.

Powered by LOVD v.3.0 Build 13
©2004-2015 Leiden University Medical Center