ash1 (absent, small, or homeotic)-like (Drosophila) (ASH1L) - coding DNA reference sequence

(used for variant description)

(last modified September 30, 2020)


This file was created to facilitate the description of sequence variants on transcript NM_018489.2 in the ASH1L gene based on a coding DNA reference sequence following the HGVS recommendations.

The sequence was taken from NC_000001.10, covering ASH1L transcript NM_018489.2.


Please note that introns are available by clicking on the exon numbers above the sequence.
 (upstream sequence)
 .         .         .         .         .         .                g.5060
 aggagtggaaggttgaggggggcgctaggcgcccttcgctccctccctctggaggagctg       c.-421

 .         .         .         .         .         .                g.5120
 ccgccgccaccgccgccactctgctgctgccgccgccgccgccgccgctcccgccgccat       c.-361

 .         .         .         .         .         .                g.5180
 tttgggttcgctttgcggaggggagacgatcccagtctcggttgcgggacccgcctcccc       c.-301

 .         .         .         .         .         .                g.5240
 tcagtttgccccctttagccttccacctttcccttctcctctctcgcatttccgccagtc       c.-241

 .         .         .         .         .         .                g.5300
 agcttacccgctggccgcctcctgacaagcgggagggatccgccgtggacccagggaagc       c.-181

 .         .         .         .         .         .                g.5360
 ggaggagcctggcggccaccccctcttccccacttccctgcactctcatcgctctcggcc       c.-121

 .         .         . | 02       .         .         .             g.45954
 tcggcctcggcctccgacacg | agaaagatgctggtttcgagttttggagatccttgtttt    c.-61

 .         .         .         .         .         .                g.46014
 ttatggaacacagttctgtaaaattttcataagattccttggcaataacatacgcttgtg       c.-1

          .         .         .         .         .         .       g.46074
 ATGGACCCTAGAAATACTGCTATGTTAGGATTGGGTTCTGATTCCGAAGGTTTTTCAAGA       c.60
 M  D  P  R  N  T  A  M  L  G  L  G  S  D  S  E  G  F  S  R         p.20

          .         .         .         .         .         .       g.46134
 AAGAGTCCTTCTGCCATCAGTACTGGCACATTGGTCAGTAAGAGAGAAGTAGAGCTAGAA       c.120
 K  S  P  S  A  I  S  T  G  T  L  V  S  K  R  E  V  E  L  E         p.40

          .         .         .         .         .         .       g.46194
 AAAAACACAAAGGAGGAAGAGGACCTTCGCAAACGGAATCGAGAAAGAAACATCGAAGCT       c.180
 K  N  T  K  E  E  E  D  L  R  K  R  N  R  E  R  N  I  E  A         p.60

          .         .         .         .         .         .       g.46254
 GGGAAAGATGATGGTTTGACTGATGCACAGCAACAGTTTTCAGTGAAAGAAACAAACTTT       c.240
 G  K  D  D  G  L  T  D  A  Q  Q  Q  F  S  V  K  E  T  N  F         p.80

          .         .         .         .         .         .       g.46314
 TCAGAGGGAAATTTAAAATTGAAAATTGGCCTCCAGGCTAAGAGAACTAAAAAACCTCCA       c.300
 S  E  G  N  L  K  L  K  I  G  L  Q  A  K  R  T  K  K  P  P         p.100

          .         .         .         .         .         .       g.46374
 AAGAACTTGGAGAACTATGTATGTCGACCTGCCATAAAAACAACTATTAAGCACCCAAGG       c.360
 K  N  L  E  N  Y  V  C  R  P  A  I  K  T  T  I  K  H  P  R         p.120

          .         .         .         .         .         .       g.46434
 AAAGCACTTAAAAGTGGAAAGATGACGGATGAAAAGAATGAACACTGTCCTTCAAAACGA       c.420
 K  A  L  K  S  G  K  M  T  D  E  K  N  E  H  C  P  S  K  R         p.140

  | 03       .         .         .         .         .         .    g.85144
  | GACCCTTCAAAGTTGTACAAGAAAGCAGATGATGTTGCAGCCATTGAATGCCAGTCTGAA    c.480
  | D  P  S  K  L  Y  K  K  A  D  D  V  A  A  I  E  C  Q  S  E      p.160

          .         .         .         .         .         .       g.85204
 GAAGTCATCCGTCTTCATTCACAGGGAGAAAACAATCCTTTGTCTAAGAAGCTGTCTCCA       c.540
 E  V  I  R  L  H  S  Q  G  E  N  N  P  L  S  K  K  L  S  P         p.180

          .         .         .         .         .         .       g.85264
 GTACACTCAGAAATGGCAGATTATATTAATGCAACGCCATCTACTCTTCTTGGTAGCCGG       c.600
 V  H  S  E  M  A  D  Y  I  N  A  T  P  S  T  L  L  G  S  R         p.200

          .         .         .         .         .         .       g.85324
 GATCCTGATTTAAAGGACAGAGCATTACTTAATGGAGGAACTAGTGTAACAGAAAAGTTG       c.660
 D  P  D  L  K  D  R  A  L  L  N  G  G  T  S  V  T  E  K  L         p.220

          .         .         .         .         .         .       g.85384
 GCACAGCTGATTGCTACCTGTCCTCCTTCCAAGTCTTCCAAGACAAAACCGAAGAAGTTA       c.720
 A  Q  L  I  A  T  C  P  P  S  K  S  S  K  T  K  P  K  K  L         p.240

          .         .         .         .         .         .       g.85444
 GGAACTGGCACTACAGCAGGATTGGTTAGCAAGGATTTGATCAGGAAAGCAGGTGTTGGC       c.780
 G  T  G  T  T  A  G  L  V  S  K  D  L  I  R  K  A  G  V  G         p.260

          .         .         .         .         .         .       g.85504
 TCTGTAGCTGGAATAATACATAAGGACTTAATAAAAAAGCCAACCATCAGCACAGCAGTT       c.840
 S  V  A  G  I  I  H  K  D  L  I  K  K  P  T  I  S  T  A  V         p.280

          .         .         .         .         .         .       g.85564
 GGATTGGTAACTAAAGATCCTGGGAAAAAGCCAGTGTTTAATGCAGCAGTAGGATTGGTC       c.900
 G  L  V  T  K  D  P  G  K  K  P  V  F  N  A  A  V  G  L  V         p.300

          .         .         .         .         .         .       g.85624
 AATAAGGACTCTGTGAAAAAACTGGGAACTGGCACTACAGCGGTATTCATTAATAAAAAC       c.960
 N  K  D  S  V  K  K  L  G  T  G  T  T  A  V  F  I  N  K  N         p.320

          .         .         .         .         .         .       g.85684
 TTAGGCAAAAAGCCAGGAACTATCACTACAGTAGGACTGCTAAGCAAAGATTCAGGAAAG       c.1020
 L  G  K  K  P  G  T  I  T  T  V  G  L  L  S  K  D  S  G  K         p.340

          .         .         .         .         .         .       g.85744
 AAGCTAGGAATTGGTATTGTTCCAGGTTTAGTGCATAAAGAGTCTGGCAAGAAGTTAGGA       c.1080
 K  L  G  I  G  I  V  P  G  L  V  H  K  E  S  G  K  K  L  G         p.360

          .         .         .         .         .         .       g.85804
 CTTGGCACTGTGGTTGGACTGGTTAATAAAGATTTGGGAAAGAAATTGGGTTCTACTGTT       c.1140
 L  G  T  V  V  G  L  V  N  K  D  L  G  K  K  L  G  S  T  V         p.380

          .         .         .         .         .         .       g.85864
 GGCCTAGTGGCCAAGGACTGTGCAAAGAAGATTGTAGCAAGTTCAGCAATGGGATTGGTT       c.1200
 G  L  V  A  K  D  C  A  K  K  I  V  A  S  S  A  M  G  L  V         p.400

          .         .         .         .         .         .       g.85924
 AATAAGGACATTGGAAAGAAACTAATGAGTTGTCCTTTGGCAGGTCTGATCAGTAAAGAT       c.1260
 N  K  D  I  G  K  K  L  M  S  C  P  L  A  G  L  I  S  K  D         p.420

          .         .         .         .         .         .       g.85984
 GCCATAAACCTTAAAGCCGAAGCACTGCTCCCCACTCAGGAACCGCTTAAGGCTTCTTGT       c.1320
 A  I  N  L  K  A  E  A  L  L  P  T  Q  E  P  L  K  A  S  C         p.440

          .         .         .         .         .         .       g.86044
 AGTACAAACATCAATAATCAGGAAAGTCAGGAACTTTCTGAATCCCTGAAAGATAGTGCC       c.1380
 S  T  N  I  N  N  Q  E  S  Q  E  L  S  E  S  L  K  D  S  A         p.460

          .         .         .         .         .         .       g.86104
 ACCAGCAAAACTTTTGAAAAGAATGTTGTACGGCAGAATAAAGAAAGCATATTGGAAAAG       c.1440
 T  S  K  T  F  E  K  N  V  V  R  Q  N  K  E  S  I  L  E  K         p.480

          .         .         .         .         .         .       g.86164
 TTCTCAGTACGAAAAGAAATCATTAATTTGGAGAAAGAAATGTTTAATGAAGGAACATGC       c.1500
 F  S  V  R  K  E  I  I  N  L  E  K  E  M  F  N  E  G  T  C         p.500

          .         .         .         .         .         .       g.86224
 ATTCAGCAAGACAGTTTCTCATCCAGTGAAAAGGGATCTTATGAAACCTCAAAGCATGAA       c.1560
 I  Q  Q  D  S  F  S  S  S  E  K  G  S  Y  E  T  S  K  H  E         p.520

          .         .         .         .         .         .       g.86284
 AAGCAGCCTCCTGTATATTGCACTTCTCCGGACTTTAAAATGGGAGGTGCTTCTGATGTA       c.1620
 K  Q  P  P  V  Y  C  T  S  P  D  F  K  M  G  G  A  S  D  V         p.540

          .         .         .         .         .         .       g.86344
 TCTACCGCTAAATCCCCATTCAGTGCAGTAGGAGAAAGCAATCTCCCTTCCCCATCACCT       c.1680
 S  T  A  K  S  P  F  S  A  V  G  E  S  N  L  P  S  P  S  P         p.560

          .         .         .         .         .         .       g.86404
 ACTGTATCTGTTAATCCTTTAACCAGAAGTCCCCCTGAAACTTCTTCACAGTTGGCTCCT       c.1740
 T  V  S  V  N  P  L  T  R  S  P  P  E  T  S  S  Q  L  A  P         p.580

          .         .         .         .         .         .       g.86464
 AATCCATTACTTTTAAGTTCTACTACAGAACTAATCGAAGAAATTTCTGAATCTGTTGGA       c.1800
 N  P  L  L  L  S  S  T  T  E  L  I  E  E  I  S  E  S  V  G         p.600

          .         .         .         .         .         .       g.86524
 AAGAACCAGTTTACTTCTGAAAGTACCCACTTGAACGTTGGTCATAGGTCAGTTGGTCAT       c.1860
 K  N  Q  F  T  S  E  S  T  H  L  N  V  G  H  R  S  V  G  H         p.620

          .         .         .         .         .         .       g.86584
 AGTATAAGTATTGAATGTAAAGGGATTGATAAAGAGGTAAATGATTCAAAAACTACCCAT       c.1920
 S  I  S  I  E  C  K  G  I  D  K  E  V  N  D  S  K  T  T  H         p.640

          .         .         .         .         .         .       g.86644
 ATAGATATTCCAAGAATAAGCTCTTCCCTTGGAAAAAAGCCAAGTTTGACTTCTGAATCC       c.1980
 I  D  I  P  R  I  S  S  S  L  G  K  K  P  S  L  T  S  E  S         p.660

          .         .         .         .         .         .       g.86704
 AGCATTCATACTATTACTCCTTCAGTTGTTAACTTCACTAGTTTATTTAGTAATAAGCCT       c.2040
 S  I  H  T  I  T  P  S  V  V  N  F  T  S  L  F  S  N  K  P         p.680

          .         .         .         .         .         .       g.86764
 TTTTTAAAACTGGGTGCAGTATCTGCATCAGACAAACACTGCCAAGTTGCTGAAAGCCTA       c.2100
 F  L  K  L  G  A  V  S  A  S  D  K  H  C  Q  V  A  E  S  L         p.700

          .         .         .         .         .         .       g.86824
 AGTACTAGTTTGCAGTCCAAACCATTAAAAAAAAGAAAAGGAAGAAAACCTCGGTGGACT       c.2160
 S  T  S  L  Q  S  K  P  L  K  K  R  K  G  R  K  P  R  W  T         p.720

          .         .         .         .         .         .       g.86884
 AAAGTGGTGGCAAGAAGCACATGCCGGTCTCCAAAAGGGCTAGAATTAGAAAGATCAGAG       c.2220
 K  V  V  A  R  S  T  C  R  S  P  K  G  L  E  L  E  R  S  E         p.740

          .         .         .         .         .         .       g.86944
 CTTTTTAAAAACGTTTCATGTAGCTCACTATCAAATAGTAATTCTGAGCCAGCCAAGTTT       c.2280
 L  F  K  N  V  S  C  S  S  L  S  N  S  N  S  E  P  A  K  F         p.760

          .         .         .         .         .         .       g.87004
 ATGAAAAACATTGGACCCCCTTCATTTGTAGATCATGACTTCCTTAAACGCCGATTGCCA       c.2340
 M  K  N  I  G  P  P  S  F  V  D  H  D  F  L  K  R  R  L  P         p.780

          .         .         .         .         .         .       g.87064
 AAGTTGAGCAAATCCACAGCTCCATCTCTTGCTCTCTTAGCTGATAGTGAAAAACCATCT       c.2400
 K  L  S  K  S  T  A  P  S  L  A  L  L  A  D  S  E  K  P  S         p.800

          .         .         .         .         .         .       g.87124
 CATAAGTCTTTTGCTACTCACAAACTATCCTCCAGTATGTGTGTCTCTAGTGACCTTTTG       c.2460
 H  K  S  F  A  T  H  K  L  S  S  S  M  C  V  S  S  D  L  L         p.820

          .         .         .         .         .         .       g.87184
 TCTGATATTTATAAGCCCAAAAGAGGAAGGCCTAAATCTAAGGAGATGCCTCAACTGGAA       c.2520
 S  D  I  Y  K  P  K  R  G  R  P  K  S  K  E  M  P  Q  L  E         p.840

          .         .         .         .         .         .       g.87244
 GGGCCACCTAAAAGGACTTTAAAAATCCCTGCTTCTAAAGTGTTTTCTTTACAGTCTAAG       c.2580
 G  P  P  K  R  T  L  K  I  P  A  S  K  V  F  S  L  Q  S  K         p.860

          .         .         .         .         .         .       g.87304
 GAAGAACAAGAACCCCCAATTTTACAGCCAGAAATTGAAATCCCTTCCTTCAAACAAGGT       c.2640
 E  E  Q  E  P  P  I  L  Q  P  E  I  E  I  P  S  F  K  Q  G         p.880

          .         .         .         .         .         .       g.87364
 CTGTCTGTGTCTCCTTTTCCAAAAAAGAGAGGCAGGCCTAAGAGGCAAATGAGGTCACCA       c.2700
 L  S  V  S  P  F  P  K  K  R  G  R  P  K  R  Q  M  R  S  P         p.900

          .         .         .         .         .         .       g.87424
 GTCAAGATGAAGCCACCTGTACTGTCAGTGGCTCCATTTGTTGCCACTGAAAGTCCAAGC       c.2760
 V  K  M  K  P  P  V  L  S  V  A  P  F  V  A  T  E  S  P  S         p.920

          .         .         .         .         .         .       g.87484
 AAGCTAGAATCTGAAAGTGACAACCATAGAAGTAGCAGTGATTTCTTTGAGAGCGAGGAT       c.2820
 K  L  E  S  E  S  D  N  H  R  S  S  S  D  F  F  E  S  E  D         p.940

          .         .         .         .         .         .       g.87544
 CAACTTCAGGATCCAGATGACCTAGATGACAGTCATAGGCCAAGTGTCTGTAGTATGAGT       c.2880
 Q  L  Q  D  P  D  D  L  D  D  S  H  R  P  S  V  C  S  M  S         p.960

          .         .         .         .         .         .       g.87604
 GACCTTGAGATGGAACCAGATAAAAAAATTACCAAGAGAAACAATGGACAATTAATGAAA       c.2940
 D  L  E  M  E  P  D  K  K  I  T  K  R  N  N  G  Q  L  M  K         p.980

          .         .         .         .         .         .       g.87664
 ACAATTATCCGCAAAATAAATAAAATGAAGACTTTAAAGAGAAAGAAACTGTTGAATCAG       c.3000
 T  I  I  R  K  I  N  K  M  K  T  L  K  R  K  K  L  L  N  Q         p.1000

          .         .         .         .         .         .       g.87724
 ATTCTTTCAAGTTCTGTAGAATCAAGTAATAAAGGGAAAGTGCAATCCAAACTCCATAAT       c.3060
 I  L  S  S  S  V  E  S  S  N  K  G  K  V  Q  S  K  L  H  N         p.1020

          .         .         .         .         .         .       g.87784
 ACGGTATCAAGTCTTGCTGCCACATTTGGCTCTAAATTGGGCCAACAGATAAATGTCAGC       c.3120
 T  V  S  S  L  A  A  T  F  G  S  K  L  G  Q  Q  I  N  V  S         p.1040

          .         .         .         .         .         .       g.87844
 AAGAAAGGAACCATTTATATAGGAAAGAGAAGAGGTCGCAAACCAAAAACTGTCTTAAAT       c.3180
 K  K  G  T  I  Y  I  G  K  R  R  G  R  K  P  K  T  V  L  N         p.1060

          .         .         .         .         .         .       g.87904
 GGTATTCTTTCTGGTAGTCCTACTAGCCTTGCTGTTCTTGAGCAAACAGCTCAACAGGCA       c.3240
 G  I  L  S  G  S  P  T  S  L  A  V  L  E  Q  T  A  Q  Q  A         p.1080

          .         .         .         .         .         .       g.87964
 GCTGGGTCAGCATTAGGACAGATTCTTCCCCCATTACTGCCTTCATCTGCTAGTAGTTCT       c.3300
 A  G  S  A  L  G  Q  I  L  P  P  L  L  P  S  S  A  S  S  S         p.1100

          .         .         .         .         .         .       g.88024
 GAGATTCTTCCATCACCTATTTGCTCTCAGTCTTCTGGGACTAGTGGAGGTCAGAGCCCT       c.3360
 E  I  L  P  S  P  I  C  S  Q  S  S  G  T  S  G  G  Q  S  P         p.1120

          .         .         .         .         .         .       g.88084
 GTAAGTAGTGATGCAGGTTTTGTTGAACCCAGTTCAGTGCCATATTTGCATTTACACTCC       c.3420
 V  S  S  D  A  G  F  V  E  P  S  S  V  P  Y  L  H  L  H  S         p.1140

          .         .         .         .         .         .       g.88144
 AGACAGGGCAGTATGATTCAGACTCTTGCAATGAAGAAGGCCTCAAAGGGGAGGAGGCGG       c.3480
 R  Q  G  S  M  I  Q  T  L  A  M  K  K  A  S  K  G  R  R  R         p.1160

          .         .         .         .         .         .       g.88204
 TTATCTCCTCCTACTTTGTTGCCAAATTCTCCTTCGCACTTGAGTGAACTCACATCTCTA       c.3540
 L  S  P  P  T  L  L  P  N  S  P  S  H  L  S  E  L  T  S  L         p.1180

          .         .         .         .         .         .       g.88264
 AAAGAAGCTACTCCTTCCCCAATCAGTGAGTCTCATAGTGATGAGACCATTCCCAGTGAT       c.3600
 K  E  A  T  P  S  P  I  S  E  S  H  S  D  E  T  I  P  S  D         p.1200

          .         .         .         .         .         .       g.88324
 AGTGGAATTGGAACAGATAATAACAGCACATCAGACAGGGCAGAGAAATTTTGTGGGCAA       c.3660
 S  G  I  G  T  D  N  N  S  T  S  D  R  A  E  K  F  C  G  Q         p.1220

          .         .         .         .         .         .       g.88384
 AAAAAGAGGAGGCATTCTTTTGAGCATGTTTCTCTGATTCCCCCTGAAACCTCTACAGTG       c.3720
 K  K  R  R  H  S  F  E  H  V  S  L  I  P  P  E  T  S  T  V         p.1240

          .         .         .         .         .         .       g.88444
 CTAAGCAGTCTTAAAGAAAAACATAAACACAAATGTAAGCGCAGGAATCATGATTACCTC       c.3780
 L  S  S  L  K  E  K  H  K  H  K  C  K  R  R  N  H  D  Y  L         p.1260

          .         .         .         .         .         .       g.88504
 AGCTATGACAAGATGAAAAGGCAGAAACGAAAACGGAAAAAGAAATATCCCCAGCTTCGA       c.3840
 S  Y  D  K  M  K  R  Q  K  R  K  R  K  K  K  Y  P  Q  L  R         p.1280

          .         .         .         .         .         .       g.88564
 AATAGACAGGATCCAGACTTTATTGCAGAGCTGGAGGAACTAATAAGTCGCCTAAGTGAA       c.3900
 N  R  Q  D  P  D  F  I  A  E  L  E  E  L  I  S  R  L  S  E         p.1300

          .         .         .         .         .         .       g.88624
 ATTCGGATCACTCATCGAAGTCATCATTTTATCCCCCGAGATCTTCTGCCAACTATCTTT       c.3960
 I  R  I  T  H  R  S  H  H  F  I  P  R  D  L  L  P  T  I  F         p.1320

          .         .         .         .         .         .       g.88684
 CGAATCAACTTTAATAGTTTCTATACACATCCTTCTTTCCCCTTAGACCCTTTGCACTAC       c.4020
 R  I  N  F  N  S  F  Y  T  H  P  S  F  P  L  D  P  L  H  Y         p.1340

          .         .         .         .         .         .       g.88744
 ATTCGAAAACCTGACTTAAAAAAGAAAAGAGGGAGACCCCCTAAGATGAGGGAGGCAATG       c.4080
 I  R  K  P  D  L  K  K  K  R  G  R  P  P  K  M  R  E  A  M         p.1360

          .         .         .         .         .         .       g.88804
 GCTGAAATGCCTTTTATGCACAGCCTTAGTTTTCCTCTTTCTAGTACTGGATTCTATCCA       c.4140
 A  E  M  P  F  M  H  S  L  S  F  P  L  S  S  T  G  F  Y  P         p.1380

          .         .         .         .         .         .       g.88864
 TCTTATGGTATGCCTTACTCTCCTTCACCCCTTACAGCTGCTCCCATAGGATTAGGTTAC       c.4200
 S  Y  G  M  P  Y  S  P  S  P  L  T  A  A  P  I  G  L  G  Y         p.1400

          .         .         .         .         .         .       g.88924
 TATGGAAGGTATCCTCCCACTCTTTATCCACCTCCTCCATCTCCTTCTTTCACCACGCCA       c.4260
 Y  G  R  Y  P  P  T  L  Y  P  P  P  P  S  P  S  F  T  T  P         p.1420

          .         .         .         .         .         .       g.88984
 CTTCCACCTCCTTCCTATATGCATGCTGGTCATTTACTTCTCAATCCTGCCAAATACCAT       c.4320
 L  P  P  P  S  Y  M  H  A  G  H  L  L  L  N  P  A  K  Y  H         p.1440

          .         .         .         .         .         .       g.89044
 AAGAAAAAGCATAAGCTACTTCGACAGGAGGCCTTTCTTACAACCAGCAGGACTCCCCTC       c.4380
 K  K  K  H  K  L  L  R  Q  E  A  F  L  T  T  S  R  T  P  L         p.1460

          .         .         .         .         .         .       g.89104
 CTTTCCATGAGTACCTACCCCAGTGTTCCTCCTGAGATGGCCTATGGTTGGATGGTTGAG       c.4440
 L  S  M  S  T  Y  P  S  V  P  P  E  M  A  Y  G  W  M  V  E         p.1480

          .         .         .         .         .         .       g.89164
 CACAAACACAGGCACCGTCACAAACACAGAGAACACCGTTCTTCTGAACAACCCCAGGTT       c.4500
 H  K  H  R  H  R  H  K  H  R  E  H  R  S  S  E  Q  P  Q  V         p.1500

          .         .         .         .         .         .       g.89224
 TCTATGGACACTGGCTCTTCCCGATCTGTCCTGGAATCTTTGAAGCGCTATAGATTTGGA       c.4560
 S  M  D  T  G  S  S  R  S  V  L  E  S  L  K  R  Y  R  F  G         p.1520

          .         .         .         .         .         .       g.89284
 AAGGATGCTGTTGGAGAGCGATATAAGCATAAGGAAAAGCACCGTTGTCACATGTCCTGC       c.4620
 K  D  A  V  G  E  R  Y  K  H  K  E  K  H  R  C  H  M  S  C         p.1540

          .         .         .         .         .         .       g.89344
 CCTCATCTCTCTCCTTCAAAAAGCTTAATAAACAGAGAGGAACAGTGGGTCCACCGAGAG       c.4680
 P  H  L  S  P  S  K  S  L  I  N  R  E  E  Q  W  V  H  R  E         p.1560

          .         .         .         .         .         .       g.89404
 CCTTCAGAATCTAGTCCATTGGCCTTGGGATTGCAGACACCTTTACAGATTGACTGTTCA       c.4740
 P  S  E  S  S  P  L  A  L  G  L  Q  T  P  L  Q  I  D  C  S         p.1580

          .         .         .         .         .         .       g.89464
 GAAAGTTCTCCAAGCTTATCCCTTGGAGGATTCACTCCCAACTCTGAGCCAGCCAGCAGT       c.4800
 E  S  S  P  S  L  S  L  G  G  F  T  P  N  S  E  P  A  S  S         p.1600

          .         .         .         .         .         .       g.89524
 GATGAACATACAAACCTTTTCACAAGTGCAATAGGCAGCTGCAGAGTTTCAAACCCTAAC       c.4860
 D  E  H  T  N  L  F  T  S  A  I  G  S  C  R  V  S  N  P  N         p.1620

          .         .         .         .         .         .       g.89584
 TCCAGTGGCCGGAAGAAATTAACTGACAGCCCTGGACTCTTTTCTGCACAGGACACTTCA       c.4920
 S  S  G  R  K  K  L  T  D  S  P  G  L  F  S  A  Q  D  T  S         p.1640

          .         .         .         .         .         .       g.89644
 CTAAATCGGCTTCACAGAAAGGAGTCACTGCCTTCTAACGAAAGGGCAGTACAGACTTTG       c.4980
 L  N  R  L  H  R  K  E  S  L  P  S  N  E  R  A  V  Q  T  L         p.1660

      | 04   .         .         .         .         .         .    g.107691
 GCAG | GCTCCCAGCCAACCTCTGATAAACCCTCCCAGCGGCCATCAGAGAGCACAAATTGT    c.5040
 A  G |   S  Q  P  T  S  D  K  P  S  Q  R  P  S  E  S  T  N  C      p.1680

          .         .         .         .       | 05 .         .    g.128479
 AGCCCTACCCGGAAAAGGTCTTCATCTGAGAGTACTTCTTCAACAG | TAAACGGAGTTCCC    c.5100
 S  P  T  R  K  R  S  S  S  E  S  T  S  S  T  V |   N  G  V  P      p.1700

          .         .         .         .         .         .       g.128539
 TCTCGAAGTCCAAGATTAGTTGCTTCTGGGGATGACTCTGTGGATAGTCTGCTGCAGCGG       c.5160
 S  R  S  P  R  L  V  A  S  G  D  D  S  V  D  S  L  L  Q  R         p.1720

          .         .         .         .         .         .       g.128599
 ATGGTACAAAATGAGGACCAAGAGCCCATGGAGAAAAGTATTGATGCTGTGATTGCAACT       c.5220
 M  V  Q  N  E  D  Q  E  P  M  E  K  S  I  D  A  V  I  A  T         p.1740

          .         .         .         .         .         .       g.128659
 GCCTCTGCACCACCTTCTTCCAGTCCAGGCCGTAGCCACAGCAAGGACCGAACCCTGGGA       c.5280
 A  S  A  P  P  S  S  S  P  G  R  S  H  S  K  D  R  T  L  G         p.1760

          .         .         .         .         .         .       g.128719
 AAACCAGACAGCCTTTTAGTGCCTGCAGTCACAAGTGACTCTTGCAATAATAGCATCTCA       c.5340
 K  P  D  S  L  L  V  P  A  V  T  S  D  S  C  N  N  S  I  S         p.1780

          .         .         .         .         .         .       g.128779
 CTCCTATCTGAAAAGTTGACAAGCAGCTGTTCCCCCCATCATATCAAGAGAAGTGTAGTG       c.5400
 L  L  S  E  K  L  T  S  S  C  S  P  H  H  I  K  R  S  V  V         p.1800

          .         .         .         .         .         .       g.128839
 GAAGCTATGCAACGCCAAGCTCGGAAAATGTGCAATTACGACAAAATCTTGGCCACAAAG       c.5460
 E  A  M  Q  R  Q  A  R  K  M  C  N  Y  D  K  I  L  A  T  K         p.1820

          .         .         .         .         .         .       g.128899
 AAAAACCTAGACCATGTCAATAAAATCTTAAAAGCCAAAAAACTTCAAAGGCAGGCCAGG       c.5520
 K  N  L  D  H  V  N  K  I  L  K  A  K  K  L  Q  R  Q  A  R         p.1840

          .         .         .         .         .         .       g.128959
 ACAGGGAATAACTTTGTGAAACGTAGGCCAGGTCGACCTCGGAAATGTCCCCTTCAGGCT       c.5580
 T  G  N  N  F  V  K  R  R  P  G  R  P  R  K  C  P  L  Q  A         p.1860

          .         .         .         .         .         .       g.129019
 GTCGTATCAATGCAAGCATTCCAGGCTGCTCAGTTTGTCAACCCAGAATTGAACAGAGAC       c.5640
 V  V  S  M  Q  A  F  Q  A  A  Q  F  V  N  P  E  L  N  R  D         p.1880

          .         .         .         .         .         .       g.129079
 GAGGAAGGAGCAGCACTGCACCTCAGTCCTGACACAGTTACAGATGTAATTGAGGCTGTT       c.5700
 E  E  G  A  A  L  H  L  S  P  D  T  V  T  D  V  I  E  A  V         p.1900

          .         .         .         .         .         .       g.129139
 GTTCAGAGTGTAAATCTGAACCCAGAACATAAAAAGGGGTTGAAGAGAAAAGGTTGGCTA       c.5760
 V  Q  S  V  N  L  N  P  E  H  K  K  G  L  K  R  K  G  W  L         p.1920

          .         .         .         .         .         .       g.129199
 TTGGAAGAACAGACCAGAAAAAAGCAGAAGCCATTACCAGAGGAAGAAGAGCAAGAGAAT       c.5820
 L  E  E  Q  T  R  K  K  Q  K  P  L  P  E  E  E  E  Q  E  N         p.1940

          | 06         .         .         .         .         .    g.151662
 AATAAAAG | CTTTAATGAAGCACCAGTTGAGATTCCCAGTCCTTCTGAAACCCCAGCTAAA    c.5880
 N  K  S  |  F  N  E  A  P  V  E  I  P  S  P  S  E  T  P  A  K      p.1960

          .         .         .         .         .         .       g.151722
 CCTTCTGAACCTGAAAGTACCTTGCAGCCTGTGCTTTCTCTCATCCCAAGGGAAAAGAAG       c.5940
 P  S  E  P  E  S  T  L  Q  P  V  L  S  L  I  P  R  E  K  K         p.1980

          .         .         .         .         .         .       g.151782
 CCCCCACGTCCCCCAAAGAAGAAGTATCAGAAAGCAGGGCTGTATTCTGACGTTTACAAA       c.6000
 P  P  R  P  P  K  K  K  Y  Q  K  A  G  L  Y  S  D  V  Y  K         p.2000

          | 07         .         .         .         .         .    g.172032
 ACTACAGA | CCCAAAGAGTCGATTGATCCAATTAAAGAAAGAGAAGCTGGAGTATACTCCA    c.6060
 T  T  D  |  P  K  S  R  L  I  Q  L  K  K  E  K  L  E  Y  T  P      p.2020

          .         .         .         .    | 08    .         .    g.187434
 GGAGAGCATGAATATGGATTATTTCCAGCGCCCATTCATGTTG | GAAAGTATCTAAGACAA    c.6120
 G  E  H  E  Y  G  L  F  P  A  P  I  H  V  G |   K  Y  L  R  Q      p.2040

          .         .         .         .         .        | 09.    g.188988
 AAGAGAATTGACTTCCAGCTTCCTTATGATATCCTTTGGCAGTGGAAACACAATCAG | CTA    c.6180
 K  R  I  D  F  Q  L  P  Y  D  I  L  W  Q  W  K  H  N  Q   | L      p.2060

          .         .         .         .    | 10    .         .    g.189161
 TACAAAAAGCCAGATGTCCCACTATATAAGAAAATTCGTTCAA | ATGTCTACGTTGATGTC    c.6240
 Y  K  K  P  D  V  P  L  Y  K  K  I  R  S  N |   V  Y  V  D  V      p.2080

          .         .         .         .         .         .       g.189221
 AAACCCCTTTCTGGTTACGAAGCTACCACCTGTAACTGTAAGAAGCCAGATGATGACACC       c.6300
 K  P  L  S  G  Y  E  A  T  T  C  N  C  K  K  P  D  D  D  T         p.2100

          .         .         .   | 11     .         .         .    g.196578
 AGGAAGGGCTGTGTTGATGACTGCCTCAATAG | AATGATCTTTGCTGAGTGTTCCCCCAAC    c.6360
 R  K  G  C  V  D  D  C  L  N  R  |  M  I  F  A  E  C  S  P  N      p.2120

          .         .         .         .         .         .       g.196638
 ACTTGCCCATGTGGCGAGCAATGCTGTAACCAGAGGATACAGAGGCATGAATGGGTGCAA       c.6420
 T  C  P  C  G  E  Q  C  C  N  Q  R  I  Q  R  H  E  W  V  Q         p.2140

          .         .         .         .         .         .       g.196698
 TGTCTAGAACGATTTCGAGCTGAGGAAAAAGGTTGGGGAATCAGAACCAAAGAGCCCCTA       c.6480
 C  L  E  R  F  R  A  E  E  K  G  W  G  I  R  T  K  E  P  L         p.2160

          .         .         .         .         .          | 12    g.196884
 AAAGCTGGGCAGTTCATCATTGAATACCTAGGGGAGGTCGTCAGTGAACAGGAGTTCAG | G    c.6540
 K  A  G  Q  F  I  I  E  Y  L  G  E  V  V  S  E  Q  E  F  R  |      p.2180

          .         .         .         .         .         .       g.196944
 AACAGGATGATTGAGCAGTATCATAATCACAGTGACCACTACTGCCTGAACCTGGATAGT       c.6600
 N  R  M  I  E  Q  Y  H  N  H  S  D  H  Y  C  L  N  L  D  S         p.2200

          .         .         .         .         .         .       g.197004
 GGGATGGTGATTGACAGTTACCGCATGGGAAATGAGGCCCGATTCATCAACCATAGCTGT       c.6660
 G  M  V  I  D  S  Y  R  M  G  N  E  A  R  F  I  N  H  S  C         p.2220

          .         .       | 13 .         .         .         .    g.207158
 GACCCAAATTGTGAAATGCAGAAATG | GTCTGTTAATGGAGTATACCGGATTGGACTCTAT    c.6720
 D  P  N  C  E  M  Q  K  W  |  S  V  N  G  V  Y  R  I  G  L  Y      p.2240

          .         .         .         .         .         .       g.207218
 GCTCTTAAAGACATGCCAGCTGGGACTGAACTCACTTATGATTATAACTTTCATTCCTTC       c.6780
 A  L  K  D  M  P  A  G  T  E  L  T  Y  D  Y  N  F  H  S  F         p.2260

          .      | 14  .         .         .         .         .    g.209829
 AATGTGGAAAAACAG | CAACTTTGTAAGTGTGGCTTTGAGAAATGTCGAGGAATCATCGGA    c.6840
 N  V  E  K  Q   | Q  L  C  K  C  G  F  E  K  C  R  G  I  I  G      p.2280

          .         .         .         .         .         .       g.209889
 GGCAAGAGTCAGCGTGTGAATGGACTCACCAGCAGCAAAAACAGCCAGCCCATGGCCACA       c.6900
 G  K  S  Q  R  V  N  G  L  T  S  S  K  N  S  Q  P  M  A  T         p.2300

          .         .         .         .         .         .       g.209949
 CACAAAAAATCTGGACGGTCAAAAGAGAAGAGAAAGTCTAAGCACAAGCTGAAGAAAAGG       c.6960
 H  K  K  S  G  R  S  K  E  K  R  K  S  K  H  K  L  K  K  R         p.2320

  | 15       .         .         .         .         .         .    g.210183
  | AGAGGCCATCTCTCTGAGGAACCCAGTGAAAATATCAACACCCCAACTAGATTGACCCCC    c.7020
  | R  G  H  L  S  E  E  P  S  E  N  I  N  T  P  T  R  L  T  P      p.2340

          .         .         .      | 16  .         .         .    g.212928
 CAATTACAGATGAAGCCAATGTCCAATCGTGAAAG | GAACTTTGTGTTAAAGCATCATGTA    c.7080
 Q  L  Q  M  K  P  M  S  N  R  E  R  |  N  F  V  L  K  H  H  V      p.2360

          .         .         .         .         .         .       g.212988
 TTCTTGGTCCGAAACTGGGAGAAGATTCGTCAAAAACAGGAGGAAGTAAAGCACACCAGT       c.7140
 F  L  V  R  N  W  E  K  I  R  Q  K  Q  E  E  V  K  H  T  S         p.2380

          .         .         .         .         .         .       g.213048
 GATAATATTCACTCAGCATCATTATATACCCGTTGGAATGGGATCTGCCGAGATGATGGG       c.7200
 D  N  I  H  S  A  S  L  Y  T  R  W  N  G  I  C  R  D  D  G         p.2400

          .    | 17    .         .         .         .         .    g.214722
 AATATCAAGTCTG | ATGTCTTCATGACCCAGTTCTCTGCCCTGCAGACAGCTCGATCTGTT    c.7260
 N  I  K  S  D |   V  F  M  T  Q  F  S  A  L  Q  T  A  R  S  V      p.2420

          .         .         .         .         .         .       g.214782
 CGAACAAGACGGTTGGCAGCTGCAGAGGAAAATATTGAAGTGGCTCGGGCAGCCCGCCTA       c.7320
 R  T  R  R  L  A  A  A  E  E  N  I  E  V  A  R  A  A  R  L         p.2440

          .         .         .         .       | 18 .         .    g.217951
 GCCCAGATCTTCAAAGAAATTTGTGATGGTATCATCTCTTATAAAG | ATTCTTCCCGGCAA    c.7380
 A  Q  I  F  K  E  I  C  D  G  I  I  S  Y  K  D |   S  S  R  Q      p.2460

          .         .         .         .  | 19      .         .    g.218093
 GCACTGGCAGCTCCACTTTTGAACCTTCCCCCAAAGAAAAA | GAATGCTGATTATTATGAG    c.7440
 A  L  A  A  P  L  L  N  L  P  P  K  K  K  |  N  A  D  Y  Y  E      p.2480

          .         .         .         .         .         .       g.218153
 AAGATCTCTGATCCCCTAGATCTTATCACCATAGAGAAGCAGATCCTCACTGGTTACTAT       c.7500
 K  I  S  D  P  L  D  L  I  T  I  E  K  Q  I  L  T  G  Y  Y         p.2500

          .         .         .         .         .     | 20   .    g.219635
 AAGACAGTGGAAGCTTTTGATGCTGACATGCTCAAAGTCTTTCGGAATGCTGAG | AAGTAC    c.7560
 K  T  V  E  A  F  D  A  D  M  L  K  V  F  R  N  A  E   | K  Y      p.2520

          .         .         .         .         .         .       g.219695
 TATGGGCGTAAATCCCCAGTTGGGAGAGATGTTTGTCGTCTACGAAAGGCCTATTACAAT       c.7620
 Y  G  R  K  S  P  V  G  R  D  V  C  R  L  R  K  A  Y  Y  N         p.2540

          .         .         .         .         .         .       g.219755
 GCCCGGCATGAGGCATCAGCCCAGATTGATGAGATTGTGGGAGAGACAGCAAGTGAGGCA       c.7680
 A  R  H  E  A  S  A  Q  I  D  E  I  V  G  E  T  A  S  E  A         p.2560

          .         .         .         .         .         .       g.219815
 GACAGCAGTGAGACCTCAGTCTCTGAAAAGGAGAATGGGCATGAGAAGGACGACGATGTT       c.7740
 D  S  S  E  T  S  V  S  E  K  E  N  G  H  E  K  D  D  D  V         p.2580

          .         .         .         .         .         .       g.219875
 ATTCGCTGTATCTGTGGCCTCTACAAGGATGAAGGTCTCATGATCCAGTGTGACAAGTGC       c.7800
 I  R  C  I  C  G  L  Y  K  D  E  G  L  M  I  Q  C  D  K  C         p.2600

     | 21    .         .         .         .         .         .    g.221121
 ATG | GTATGGCAGCACTGTGATTGTATGGGAGTGAACTCAGATGTGGAGCACTACCTTTGT    c.7860
 M   | V  W  Q  H  C  D  C  M  G  V  N  S  D  V  E  H  Y  L  C      p.2620

          .         .         . | 22       .         .         .    g.223290
 GAGCAGTGTGACCCAAGGCCTGTGGACAGG | GAGGTTCCCATGATCCCTCGGCCCCACTAT    c.7920
 E  Q  C  D  P  R  P  V  D  R   | E  V  P  M  I  P  R  P  H  Y      p.2640

          .         .         .         .         .         .       g.223350
 GCCCAACCTGGCTGTGTCTACTTCATCTGTTTGCTCCGAGATGACTTGCTGCTTCGTCAG       c.7980
 A  Q  P  G  C  V  Y  F  I  C  L  L  R  D  D  L  L  L  R  Q         p.2660

   | 23      .         .         .         .         .         .    g.223850
 G | GTGACTGTGTGTATCTGATGAGGGATAGTCGGCGCACCCCTGATGGCCACCCGGTCCGT    c.8040
 G |   D  C  V  Y  L  M  R  D  S  R  R  T  P  D  G  H  P  V  R      p.2680

          .         .         .         .         .         .       g.223910
 CAGTCCTATCGACTGTTATCTCACATTAACCGAGATAAACTTGACATCTTTCGCATTGAG       c.8100
 Q  S  Y  R  L  L  S  H  I  N  R  D  K  L  D  I  F  R  I  E         p.2700

          .         . | 24       .         .         .         .    g.224087
 AAGCTTTGGAAGAATGAAAA | AGAGGAACGGTTTGCCTTTGGTCACCATTATTTCCGTCCC    c.8160
 K  L  W  K  N  E  K  |  E  E  R  F  A  F  G  H  H  Y  F  R  P      p.2720

          .         .         .         .         .         .       g.224147
 CACGAAACACACCACTCTCCATCCCGTCGGTTCTATCATAATGAACTATTTCGGGTGCCA       c.8220
 H  E  T  H  H  S  P  S  R  R  F  Y  H  N  E  L  F  R  V  P         p.2740

          .         .         .         .         .         .       g.224207
 CTCTATGAGATCATTCCCTTGGAGGCTGTAGTGGGGACCTGCTGTGTGTTGGACCTTTAT       c.8280
 L  Y  E  I  I  P  L  E  A  V  V  G  T  C  C  V  L  D  L  Y         p.2760

          .    | 25    .         .         .         .         .    g.225478
 ACGTATTGTAAAG | GGAGACCCAAAGGAGTAAAGGAGCAAGATGTGTACATCTGTGATTAT    c.8340
 T  Y  C  K  G |   R  P  K  G  V  K  E  Q  D  V  Y  I  C  D  Y      p.2780

          .         .         .         .         .         .       g.225538
 CGGCTTGACAAGTCAGCACACCTGTTTTACAAGATCCACCGGAACCGCTATCCTGTCTGC       c.8400
 R  L  D  K  S  A  H  L  F  Y  K  I  H  R  N  R  Y  P  V  C         p.2800

          .         .         .         .         .         .       g.225598
 ACCAAACCCTATGCTTTTGATCACTTCCCCAAGAAGCTCACTCCCAAAAAAGATTTCTCG       c.8460
 T  K  P  Y  A  F  D  H  F  P  K  K  L  T  P  K  K  D  F  S         p.2820

  | 26       .         .         .         .  | 27      .         . g.229162
  | CCTCATTACGTCCCAGACAACTACAAGAGGAATGGAGGACG | ATCATCCTGGAAGTCTGAG c.8520
  | P  H  Y  V  P  D  N  Y  K  R  N  G  G  R  |  S  S  W  K  S  E   p.2840

          .         .         .         .         .         .       g.229222
 CGCTCAAAGCCACCCCTAAAAGACTTGGGCCAGGAGGATGATGCTCTACCCTTGATTGAA       c.8580
 R  S  K  P  P  L  K  D  L  G  Q  E  D  D  A  L  P  L  I  E         p.2860

          .         .         .         .         .         .       g.229282
 GAGGTTCTAGCCAGTCAAGAGCAAGCAGCCAATGAGATACCCAGCCTGGAGGAGCCAGAA       c.8640
 E  V  L  A  S  Q  E  Q  A  A  N  E  I  P  S  L  E  E  P  E         p.2880

          .         .         .         .         .         .       g.229342
 CGGGAAGGGGCCACTGCTAACGTCAGTGAGGGTGAAAAAAAAACAGAGGAAAGTAGTCAA       c.8700
 R  E  G  A  T  A  N  V  S  E  G  E  K  K  T  E  E  S  S  Q         p.2900

          .         .         .         .         .         .       g.229402
 GAACCCCAGTCAACCTGTACCCCTGAGGAACGACGGCATAACCAACGGGAACGACTCAAC       c.8760
 E  P  Q  S  T  C  T  P  E  E  R  R  H  N  Q  R  E  R  L  N         p.2920

          .         .         .         .    | 28    .         .    g.229799
 CAGATCTTGCTCAATCTCCTTGAAAAAATCCCTGGAAAAAATG | CCATTGATGTGACCTAC    c.8820
 Q  I  L  L  N  L  L  E  K  I  P  G  K  N  A |   I  D  V  T  Y      p.2940

          .         .         .         .         .         .       g.229859
 TTGCTGGAGGAAGGATCAGGCAGGAAACTGCGAAGGCGTACTTTGTTTATCCCAGAAAAC       c.8880
 L  L  E  E  G  S  G  R  K  L  R  R  R  T  L  F  I  P  E  N         p.2960

          .                                                         g.229874
 AGCTTTCGAAAGTGA                                                    c.8895
 S  F  R  K  X                                                      p.2964

          .         .         .         .         .         .       g.229934
 ccctcaaagaatgagaacctcaagcatctgggatccagtggagctaatcagtcctgcctc       c.*60

          .         .         .         .         .         .       g.229994
 ctgctctctgggtatagacaggggtgggaagggtccatctgggcaaggggaatggggcca       c.*120

          .         .         .         .         .         .       g.230054
 tgttgttgacattaggtacttaataagccttggagctagtggagagggagaggaaagggt       c.*180

          .         .         .         .         .         .       g.230114
 tctgtccaagacagttcaggttaattaattttcttctccattgcttcaccttaagggtta       c.*240

          .         .         .         .         .         .       g.230174
 ataatgtagagaggagggaggaccacattgatgaccagaacctactggtactttatagca       c.*300

          .         .         .         .         .         .       g.230234
 tttgccccaccccacagcttaggtttttctgtcatcctcagatcccacaggcattgcgaa       c.*360

          .         .         .         .         .         .       g.230294
 gaagctgcttcctatacccaggtataactcaaaatccaaagggatagggccaggatccct       c.*420

          .         .         .         .         .         .       g.230354
 attcctaccccatctattctctgttggctccaagagctaccccagagaccttaaacagaa       c.*480

          .         .         .         .         .         .       g.230414
 acagtagctgaggcttcttcctagatacctgactagggaagtttgtctctcctttcttgc       c.*540

          .         .         .         .         .         .       g.230474
 ccaaccaggtcaaagtaaaatgtgagttgacagctcaaagcacttgtaactgctgccccc       c.*600

          .         .         .         .         .         .       g.230534
 tccctacctctactccccaaaatggaatcatgggatagggaaggcccccatggggtcaga       c.*660

          .         .         .         .         .         .       g.230594
 agggcacggtagttcttgcaattatttttgttttacccttcataacctgtcaaacatatt       c.*720

          .         .         .         .         .         .       g.230654
 tttttctaatgagaaagccaggcccccgccagcacacatgctgtttttaatgcgctgtag       c.*780

          .         .         .         .         .         .       g.230714
 ttcttgtgtgtctgctgtgctgtgcaaatggagattcagttcaaaataaaatcatttaaa       c.*840

          .         .         .         .         .         .       g.230774
 aacctacataaaaagaactctaaacccacccctgcaacaaaagtcactacataaactgtt       c.*900

          .         .         .         .         .         .       g.230834
 cagcagtattcacctatcagagtatttgttgtgagtatagattatcaattgaaaacacta       c.*960

          .         .         .         .         .         .       g.230894
 ctcttgttttcttaattgtacagttttcaatgtccctttcttaaagagacagtatatttc       c.*1020

          .         .         .         .         .         .       g.230954
 tcttcacccctagcccatcttccctcaccctcctgaatgacatcaggaggtatatccagg       c.*1080

          .         .         .         .         .         .       g.231014
 gtgtctccttccttcctactctcttgaccagaagttaacagactatactgtctctttaaa       c.*1140

          .         .         .         .         .         .       g.231074
 aataaaatttaaaaagctttgttgtcttttcagacatacatatgcatatatgttttagat       c.*1200

          .         .         .         .         .         .       g.231134
 gttcttataagagaaaagatggtttttaaatgtgccaagttgtgtgtgtgtgtgtatata       c.*1260

          .         .         .         .         .         .       g.231194
 tatgtgtgtatgtgtgtgtatatatatatgtgtgtgtgtatatatatacacacacacaca       c.*1320

          .         .         .         .         .         .       g.231254
 cacacctgctgtgtgattggtaagcaatacaatagtaaacatgtccccattacttttttc       c.*1380

          .         .         .         .         .         .       g.231314
 taatattggaccaatgctgtcctaattgtacatttccccttatggtgacgatgctctgac       c.*1440

          .         .         .         .         .         .       g.231374
 tcgtttaggtagacacattgaccaccttccattccattaaatattttttcctttttcccc       c.*1500

          .         .         .         .         .         .       g.231434
 tttctgtgtcattcttgaggaaaaaacaaaagagagaggggatgccaatgatccccttga       c.*1560

          .         .         .         .         .         .       g.231494
 gcagagaaaaagcaaaataaatattttattaaagaaaaaagagaattaagaaaatagttt       c.*1620

          .         .         .         .         .         .       g.231554
 ggagtattttcttactgtagagaagcactgtacattactaagagacctgggtataagata       c.*1680

          .         .         .         .         .         .       g.231614
 ctcacatgtggagctggaaaaatcgcatgtccaagcccgtttgagtggtttcttttgttt       c.*1740

          .         .         .         .         .         .       g.231674
 ttcattgcagggagtgggtgggagggaggtgggactaggggcactttgggggtctccttt       c.*1800

          .         .         .         .         .         .       g.231734
 tagtcaaaagcgagaaaatgacaagaaagagattaaaattcaatgtttcctttatagtgt       c.*1860

          .         .         .         .         .         .       g.231794
 taaacactaaaattttaaaaaagatgaaaaagaaaaaaaaactttgtaaaatgcgagaac       c.*1920

          .         .         .         .         .         .       g.231854
 agaagcaaaagacactacgctctgtcattttatctttcttttgttgaaagactaaaaaaa       c.*1980

          .         .         .         .         .         .       g.231914
 aactgaaatgttttttagacaatcaaatgttaggtaagtgcaaaaacttgttttttctta       c.*2040

          .         .         .         .         .         .       g.231974
 ctggtgtagaaattaatgcctttttttatttttcagttattttataataacgaaataaaa       c.*2100

          .         .         .         .         .         .       g.232034
 agaaccccccagctgccaggcgggttttggtgtttgaaatgcggggcaaagcactacatc       c.*2160

          .         .         .         .         .         .       g.232094
 actgcaaatagatacagagttagtctgcatgtctgtaggctgtgtgattgcggaaaatat       c.*2220

          .         .         .         .         .         .       g.232154
 aaatgctgctaatatatttcctttttacaaaagcatatctaaatagatgattgttttgat       c.*2280

          .         .         .         .         .         .       g.232214
 gttaatctttgtaaattatgtattaccaattttaacattggatgtaattgcatacaaagc       c.*2340

          .         .         .         .         .                 g.232273
 ttgcatctcaatccttgaaagtctagtattaaatggaaaaaacttttcctaactgtgga        c.*2399

 (downstream sequence)
Legend:
Nucleotide numbering (following the rules of the HGVS for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The Ash1 (absent, small, or homeotic)-like (Drosophila) protein sequence is shown below the coding DNA sequence, with numbering indicated at the right starting with 1 for the translation initiating Methionine. Every 10th amino acid is shown in bold. The position of introns is indicated by a vertical line, splitting the two exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frame shift variants, all stop codons in the +1 frame are shown in bold while all stop codons in the +2 frame are underlined.

Powered by LOVD v.3.0 Build 24b
©2004-2020 Leiden University Medical Center