chromosome 5 open reading frame 42 (C5orf42) - coding DNA reference sequence

(used for variant description)

(last modified January 17, 2015)


This file was created to facilitate the description of sequence variants on transcript NM_023073.3 in the C5orf42 gene based on a coding DNA reference sequence following the HGVS recommendations.

The sequence was taken from NG_032772.1, covering C5orf42 transcript NM_023073.3.


Please note that introns are available by clicking on the exon numbers above the sequence.
 (upstream sequence)
           .         .         .         .         .                g.5051
          ggttctgagcaggcgccgtggcagcgcccgcgccccctgcccgccctccgc       c.-181

 .         .         .         .         .         .                g.5111
 cgtcgccaggacgcaggctcgcgtcagggctggcggtagggcccgcagctgtcggcctgg       c.-121

 .         .         .         .         .         .                g.5171
 cgtcttcaccccgcgcagtcggcctcgcctagccttctcgcgcctgactggccggaccct       c.-61

 .         .   | 02     .         .         .         .             g.6730
 gcccctggcggag | aatcttgggactgttttacttaattggtcattgatagcttaacaaac    c.-1

          .         .         .         .         .         .       g.6790
 ATGGAGATAAGATTAGAAATCTTGACATCAACAGGTATTAAGCAGAAAAAACCATGGCCA       c.60
 M  E  I  R  L  E  I  L  T  S  T  G  I  K  Q  K  K  P  W  P         p.20

          .         .  | 03      .         .         .         .    g.8622
 CGTGTCTCCTGGTTGGGAAAG | GAAAAAGAAGCCGTTTTTCTTTTGGATGATAAATTCATA    c.120
 R  V  S  W  L  G  K   | E  K  E  A  V  F  L  L  D  D  K  F  I      p.40

          .         .         .         .         .         .       g.8682
 AATGAAATTAATTTGCTATCAGGAAAGATAAAGAAGAAAATTCCTAGTCTGCAGCCTTTC       c.180
 N  E  I  N  L  L  S  G  K  I  K  K  K  I  P  S  L  Q  P  F         p.60

          .         .         .        | 04.         .         .    g.8853
 TTGAAGGATGTTATTGTCCTAACAACATCCAGTAATG | ATGCCTGGCTGGCTGGGGTACTA    c.240
 L  K  D  V  I  V  L  T  T  S  S  N  D |   A  W  L  A  G  V  L      p.80

          .         .         .         .         .         .       g.8913
 ACTACAGGAGAGCTTTTCCTTTGGAACAAAGATCAAGATTGTTTGAAAACTATACCAATA       c.300
 T  T  G  E  L  F  L  W  N  K  D  Q  D  C  L  K  T  I  P  I         p.100

          .         .         .        | 05.         .         .    g.9844
 ACTGAAAAGCCTAAGGAAATGATCAAAGCTACAGTCG | CAAGCTCTTTGAGACTGTACTTG    c.360
 T  E  K  P  K  E  M  I  K  A  T  V  A |   S  S  L  R  L  Y  L      p.120

          .         .         .         .         .         .       g.9904
 TATGTATCTGGAAATGGGAAAAGAATTGTGCTCATAACACCTTCTGGATGCATATTTCTT       c.420
 Y  V  S  G  N  G  K  R  I  V  L  I  T  P  S  G  C  I  F  L         p.140

          .         .         .         .         .         .       g.9964
 TGGGAATATTTGGAATTAAAGAATATCTTATCTTCTAAAAGCCTTTCATTGGCGGGTCGG       c.480
 W  E  Y  L  E  L  K  N  I  L  S  S  K  S  L  S  L  A  G  R         p.160

          .         .         .         .         .         .       g.10024
 TGGTCCCAGGTCATACCTGAAGAAGCAGTTCTCTTGCCTTCCACCGAAGATAAAGAAGCT       c.540
 W  S  Q  V  I  P  E  E  A  V  L  L  P  S  T  E  D  K  E  A         p.180

          .         .         . | 06       .         .         .    g.11339
 GTAGTGAATGCTGTTTTTATAAAAAATGAG | TTATTTGGAGACTGCTGCCTGTGTTCATTT    c.600
 V  V  N  A  V  F  I  K  N  E   | L  F  G  D  C  C  L  C  S  F      p.200

          .         .         .         .         .         .       g.11399
 ACTTTTTATTCTGGGGAATGCCTGAAGTTAACATTTCTAGCAATTCGGTGGCATGAGAAT       c.660
 T  F  Y  S  G  E  C  L  K  L  T  F  L  A  I  R  W  H  E  N         p.220

          .        | 07.         .         .         .         .    g.14602
 GTATTTACATCTGTAAG | ATCATTGCCATACCATGTTCATTGGGCTCAACAAGACTGTCAT    c.720
 V  F  T  S  V  R  |  S  L  P  Y  H  V  H  W  A  Q  Q  D  C  H      p.240

          .         .         .         .         .         .       g.14662
 CTCTGTAGTTTAATTCCTAAATGTGAATCAGTAAAGTCAAGAGGAGCTCTAATTTCTGCC       c.780
 L  C  S  L  I  P  K  C  E  S  V  K  S  R  G  A  L  I  S  A         p.260

          .         .         .         .         .     | 08   .    g.15474
 TTTTCAAGAGATGGCCTTACCCTGGCAGTAACTCTTAATCAGAAAGACCCCAAG | GCAACT    c.840
 F  S  R  D  G  L  T  L  A  V  T  L  N  Q  K  D  P  K   | A  T      p.280

          .         .         .         .         .         .       g.15534
 CAGGTATTATTTATAAACACACTGAATTTTGTTACTCTCTGTGGTAGCCTTAAAGGATGT       c.900
 Q  V  L  F  I  N  T  L  N  F  V  T  L  C  G  S  L  K  G  C         p.300

          .         .         .         | 09         .         .    g.23401
 AGTAACAAGAGTCCCGTGGTTCCAGCTACACTTATTAG | GTCCTACTGGGTAGGTGATATC    c.960
 S  N  K  S  P  V  V  P  A  T  L  I  R  |  S  Y  W  V  G  D  I      p.320

          .         .         .         .         .         .       g.23461
 AGCTGGACGCATGATAGTCTTTTTCTGGCTTGTATGTTAAAACGTGGCTCTCTGGTTTTA       c.1020
 S  W  T  H  D  S  L  F  L  A  C  M  L  K  R  G  S  L  V  L         p.340

          .         .         .         .         .         .       g.23521
 TTGACCTGCCAAGGTGAATTGCTAACATTAATTACATTTGGTTGCTCTATAGAATTTGGC       c.1080
 L  T  C  Q  G  E  L  L  T  L  I  T  F  G  C  S  I  E  F  G         p.360

          .         .         .         .  | 10      .         .    g.26630
 CCAGCAGAATTTATTCCTCTTCATCCACTAATAACGTATAG | ACCACAGCAGTTTACGTTT    c.1140
 P  A  E  F  I  P  L  H  P  L  I  T  Y  R  |  P  Q  Q  F  T  F      p.380

          .         .         .         .         .         .       g.26690
 CAAGATTCAAATAATTCTGTTGATTCATCAGCTTCTGATAGTGACCCTATGAGACAGAGA       c.1200
 Q  D  S  N  N  S  V  D  S  S  A  S  D  S  D  P  M  R  Q  R         p.400

          .         .         .         .         .         .       g.26750
 TTTTCTATAAAAGCACACTCACGGTTACCCTACCTCGTTATATCTGATGGATATATGGTC       c.1260
 F  S  I  K  A  H  S  R  L  P  Y  L  V  I  S  D  G  Y  M  V         p.420

          .         .         .         .         .         .       g.26810
 ACAACCCTTCGATTTCTTGATAGCCTATCTCCATCAGTACACATGAGATCACTTCTACTT       c.1320
 T  T  L  R  F  L  D  S  L  S  P  S  V  H  M  R  S  L  L  L         p.440

          .         .         .         .         .  | 11      .    g.27045
 GATTCAACCCAGAGGCTTGAGAAAATATATCAAAGTGTGATATTGTCTAAG | CCAAAAGGC    c.1380
 D  S  T  Q  R  L  E  K  I  Y  Q  S  V  I  L  S  K   | P  K  G      p.460

          .         .         .         .         .         .       g.27105
 AAAGGACTGAACTTGCGATCACTGAATTCCCTAAGGTCTAGCCTGTTAGAACACCAAGGA       c.1440
 K  G  L  N  L  R  S  L  N  S  L  R  S  S  L  L  E  H  Q  G         p.480

          .         .         .         .         .         .       g.27165
 AATGAAAGTTCAGCCGATTTCACTGTCCCCAAATTCTTGCAGGCAGAAGAAACAATAAAT       c.1500
 N  E  S  S  A  D  F  T  V  P  K  F  L  Q  A  E  E  T  I  N         p.500

          .         .  | 12      .         .         .         .    g.27394
 GAAAATGCAGCAGATTTTCAG | GATTTTGAAGCAGAAGAAACTAACGAAGGCAGACACTTT    c.1560
 E  N  A  A  D  F  Q   | D  F  E  A  E  E  T  N  E  G  R  H  F      p.520

          .         .         .         .         .         .       g.27454
 CCAGACAACTTATGTCCTTTTTGGAACAAAAGAGATGATGTGCTGTGTAGTAGTATGAAG       c.1620
 P  D  N  L  C  P  F  W  N  K  R  D  D  V  L  C  S  S  M  K         p.540

          .         .         .         .         .         .       g.27514
 GAAGGAAGATTGGAATTTGCATCTATGTTTGATACGATACATGCAAAGGATGATAGTGAG       c.1680
 E  G  R  L  E  F  A  S  M  F  D  T  I  H  A  K  D  D  S  E         p.560

          .         .         .         .         .         .       g.27574
 GAGACAGATAGAACCATTACAGAACTGCATTCTATCCAGAAAAGTCTACTTGCAGCGTGG       c.1740
 E  T  D  R  T  I  T  E  L  H  S  I  Q  K  S  L  L  A  A  W         p.580

          .         .         .         .         .         .       g.27634
 ACTATAGGAATTTCAAAAACTGTGACAGAAAAAAATTTAATGTTAAATTACATAGTAGTT       c.1800
 T  I  G  I  S  K  T  V  T  E  K  N  L  M  L  N  Y  I  V  V         p.600

          .         .         .         .         .         .       g.27694
 TGTATCACTCATTTTTTTTACATTCTTCAATTTATAAAATGTCCTTTTCCTAAACTTGAT       c.1860
 C  I  T  H  F  F  Y  I  L  Q  F  I  K  C  P  F  P  K  L  D         p.620

          .         .         .         .         .         .       g.27754
 CTTGTTTTAAGCAAAAGCTCAAGACATAATGCATGGATACTTTGTATCTTTCAACTTTTT       c.1920
 L  V  L  S  K  S  S  R  H  N  A  W  I  L  C  I  F  Q  L  F         p.640

          .         .         .         .         .         .       g.27814
 CATCAGTGTTTATCAATCCATTATTGGGATATAAGATACAAACAAGATGTGGGGCATTTG       c.1980
 H  Q  C  L  S  I  H  Y  W  D  I  R  Y  K  Q  D  V  G  H  L         p.660

          .         .         .         .         .         .       g.27874
 ATAAAGCTGACCTCAAATACTGTAAAACTTTTGCTGACTCAGCAACAAAAGGGTCAGTTA       c.2040
 I  K  L  T  S  N  T  V  K  L  L  L  T  Q  Q  Q  K  G  Q  L         p.680

          .         .         .         .         .         .       g.27934
 TTCTCAGAGAAACTTTTAGCTTGTTTTTATTTACTCAAAATGGTAGCTGACAATTTAAAT       c.2100
 F  S  E  K  L  L  A  C  F  Y  L  L  K  M  V  A  D  N  L  N         p.700

          .         .         .         .         .         .       g.27994
 GGTGTATACATTCTTCAACCTGAAGTTATTTCAGCATCAGCTGATGGAAGTAAAATAACA       c.2160
 G  V  Y  I  L  Q  P  E  V  I  S  A  S  A  D  G  S  K  I  T         p.720

          .         .         .         .         .         .       g.28054
 GCTCAAGACTCATTGGTGGTACCTATTTTTCAGATGTTTCAAGATAGTGGTTTTCAGAAA       c.2220
 A  Q  D  S  L  V  V  P  I  F  Q  M  F  Q  D  S  G  F  Q  K         p.740

          .         .         .         .         .         .       g.28114
 AACTGGTCTTGGAACTCATTTTTCAAGATTCATCCTCAAGTAGTAAATCCTGTGCAACAG       c.2280
 N  W  S  W  N  S  F  F  K  I  H  P  Q  V  V  N  P  V  Q  Q         p.760

          .  | 13      .         .         .         .         .    g.29737
 CCAGGACACAG | ATTGCTTATTCTCTGGAGAATACTGTACAAAAAAACTTTATGGTATCAA    c.2340
 P  G  H  R  |  L  L  I  L  W  R  I  L  Y  K  K  T  L  W  Y  Q      p.780

          .         .         .         .         .         .       g.29797
 GCACAATTAAATCGAAGAGTTCCTGAAGCTGATAGTCAGTTAACTGAAAAGATGACACAT       c.2400
 A  Q  L  N  R  R  V  P  E  A  D  S  Q  L  T  E  K  M  T  H         p.800

          .         .         .         .         .         .       g.29857
 GAAGCATCTACTGTCAAGTCCCTGTTATGTCATTTGCAGGCTAACCTACAGAGTACTGGA       c.2460
 E  A  S  T  V  K  S  L  L  C  H  L  Q  A  N  L  Q  S  T  G         p.820

          .         .         .         . | 14       .         .    g.30115
 GATTGCTTGAATCAAACCTTAGAACTTAAATCTATCAATG | GGGAAGAATGTTTTTTATTA    c.2520
 D  C  L  N  Q  T  L  E  L  K  S  I  N  G |   E  E  C  F  L  L      p.840

          .         .         .         .         .         .       g.30175
 GGATCATATGAAAAGTCTGTTCAGCTGTGGAAAAAAGCTCTACAAGAAATCGAAGAGAAA       c.2580
 G  S  Y  E  K  S  V  Q  L  W  K  K  A  L  Q  E  I  E  E  K         p.860

   | 15      .         .         .         .         .         .    g.32999
 G | GAGGAAGAAGGACGTATTTTCTTCAGATACGCTATTATCTTTCTCTCTTATACTGCCAC    c.2640
 G |   G  R  R  T  Y  F  L  Q  I  R  Y  Y  L  S  L  L  Y  C  H      p.880

          .         .         .         .         .         .       g.33059
 CTCTATAGCTATAATTTAAATGATGCTCAAGGATTGTGTGATCAGCTAGCAAGAGAAATC       c.2700
 L  Y  S  Y  N  L  N  D  A  Q  G  L  C  D  Q  L  A  R  E  I         p.900

          .         .         .         .       | 16 .         .    g.40710
 CTGAGATGGTCCCAACTACCTGTAAAAGAAAATAAAGATTTTTCAG | GTGCTGCAAAGTCT    c.2760
 L  R  W  S  Q  L  P  V  K  E  N  K  D  F  S  G |   A  A  K  S      p.920

          .         .         .         .         .         .       g.40770
 CATTTTGAGTGTGGAATGGTGGGCGGTGTTCATCCTGAGGCAGCAGTGAGAGTCGTCCAG       c.2820
 H  F  E  C  G  M  V  G  G  V  H  P  E  A  A  V  R  V  V  Q         p.940

          .         .         .         .         .         .       g.40830
 TCCATGGCTCGTTTCATGGCTGCCTATTTCACCAATCAGCAGCTTTGCATTTTGCCCCCT       c.2880
 S  M  A  R  F  M  A  A  Y  F  T  N  Q  Q  L  C  I  L  P  P         p.960

          .         .         .         . | 17       .         .    g.48023
 CATCATGTGAATGTTCTTCCCCCACTTCATATTAAAACAG | AGCAGTCCTTTCGACTTATT    c.2940
 H  H  V  N  V  L  P  P  L  H  I  K  T  E |   Q  S  F  R  L  I      p.980

          .         .         .         .         .         .       g.48083
 CCTCTGCAACACTCTAAGGTGGCCAGTGTTGTTAGAGATCAGAATCTCTCTAATGTGTGG       c.3000
 P  L  Q  H  S  K  V  A  S  V  V  R  D  Q  N  L  S  N  V  W         p.1000

          .         .         .         .         .         .       g.48143
 ACAGTTGAATATGCACTTGAATTACTATTTATTGGTGGCCTGGTTCCAGAGGCTGTGTGG       c.3060
 T  V  E  Y  A  L  E  L  L  F  I  G  G  L  V  P  E  A  V  W         p.1020

          .         .         .         .         .         .       g.48203
 TTGGCATATAAACTTGGAGACTGGAAGACGTCTGTTTCAATTGGTGTGGCTTTCCAGCTG       c.3120
 L  A  Y  K  L  G  D  W  K  T  S  V  S  I  G  V  A  F  Q  L         p.1040

          .         .          | 18        .         .         .    g.49005
 TTCTGTAAACGTGATAGCAATTTCATGAG | GTCCAAGAAAAAGAGTCTGAATCTACCACTC    c.3180
 F  C  K  R  D  S  N  F  M  R  |  S  K  K  K  S  L  N  L  P  L      p.1060

          .         .         .         .         .         .       g.49065
 CGTATGACTCCAGCACAGATTTTTCAGGAAAAACTGCAGTGTGTTTTAGGTCAACCAGCC       c.3240
 R  M  T  P  A  Q  I  F  Q  E  K  L  Q  C  V  L  G  Q  P  A         p.1080

          .         .         .         .          | 19        .    g.52631
 TCTTTGGAAGCAAAAAATGAAATGGGCTCAAAATATAAACAGTTTACAG | ATCCCATTGAA    c.3300
 S  L  E  A  K  N  E  M  G  S  K  Y  K  Q  F  T  D |   P  I  E      p.1100

          .         .         .         .         .         .       g.52691
 GAGGAAGATGCAAATCTGCTATTTGGTTCAGTACAAGAAGTACTGAAAGCATCAGTTATG       c.3360
 E  E  D  A  N  L  L  F  G  S  V  Q  E  V  L  K  A  S  V  M         p.1120

          .         .         .         .         .         .       g.52751
 GCCGATGCAGATATTCTTTCGGAGACATTTCAACTTCTGATAGACTCTGCCAAGGACTTC       c.3420
 A  D  A  D  I  L  S  E  T  F  Q  L  L  I  D  S  A  K  D  F         p.1140

          .         .         .         .         .         .       g.52811
 AGTAAAAGACTGTGGGGCTTAGTGCCATTCGGCTTGTATCTTCCAGCTCCTCCATTGTAC       c.3480
 S  K  R  L  W  G  L  V  P  F  G  L  Y  L  P  A  P  P  L  Y         p.1160

          .         .        | 20.         .         .         .    g.55595
 TGTCCCCAGCCAGCTATTCTTAGTGAA | GAAGATGGTGATGATCTTCTTTTAAAAGCTGAA    c.3540
 C  P  Q  P  A  I  L  S  E   | E  D  G  D  D  L  L  L  K  A  E      p.1180

          .         .         .         .         .         .       g.55655
 AAAAATAATCGCCAGAAGGTATCTGGAATCCTTCAGCGTGTTCTCCTGCTTTTCCGGGCG       c.3600
 K  N  N  R  Q  K  V  S  G  I  L  Q  R  V  L  L  L  F  R  A         p.1200

          .         .         .         .         .         .       g.55715
 GCTCAGTGTTCTTTTCCTGTAGCACAGTGGTATATATTGCAGTTGAGGTGGGCAAGAAAA       c.3660
 A  Q  C  S  F  P  V  A  Q  W  Y  I  L  Q  L  R  W  A  R  K         p.1220

          .   | 21     .         .         .         .         .    g.58480
 GTCATGCAGAAG | ATTCGAATGAAAGGATCCCTTCCTTCACTGAGTCCTTTTCCTCAGTCA    c.3720
 V  M  Q  K   | I  R  M  K  G  S  L  P  S  L  S  P  F  P  Q  S      p.1240

          .         .         .         .         .         .       g.58540
 TTACTTAATTACTGTAAAGGAGGTATCGCATTTTTTAGACCTGGAGCAGCTGGAGACCAC       c.3780
 L  L  N  Y  C  K  G  G  I  A  F  F  R  P  G  A  A  G  D  H         p.1260

          .         .         .  | 22      .         .         .    g.66615
 AAGCTTGATGAAGTTTCCATTAGAGCAATAG | GTTGCTTCAGAGAACTTTGTGCTCTGTGT    c.3840
 K  L  D  E  V  S  I  R  A  I  G |   C  F  R  E  L  C  A  L  C      p.1280

          .         .         .         .         .         .       g.66675
 TGGATGCTGCATGTCCGTGATAAGTTATCCTATAGTTGCAGGCAATATCAGAAAGCAAGA       c.3900
 W  M  L  H  V  R  D  K  L  S  Y  S  C  R  Q  Y  Q  K  A  R         p.1300

          .         .  | 23      .         .         .         .    g.66895
 GAAAATGTAAAAGGAGAAAAG | GACCTTGAAGTGGAGTTTGATTCTTGTATGATTGAGCAC    c.3960
 E  N  V  K  G  E  K   | D  L  E  V  E  F  D  S  C  M  I  E  H      p.1320

          .         .         .         .         .         .       g.66955
 TGTCTTAGTGCAGTGGAATGGGCTTATAGAATGCTGCCTTTCTCTCGGTTTTTTAATATG       c.4020
 C  L  S  A  V  E  W  A  Y  R  M  L  P  F  S  R  F  F  N  M         p.1340

          .         .         .         .         .         .       g.67015
 GAAGAACTTATTCAGGATATAATTTTGAGCCTTATTGGAGAACTGCCACCAATCAGAAAG       c.4080
 E  E  L  I  Q  D  I  I  L  S  L  I  G  E  L  P  P  I  R  K         p.1360

  | 24       .         .         .         .         .         .    g.68094
  | GTAGCAGAAATTTTCGTGAAAGCATTTCCCTATCCTGAGGACGTGAGGGTTCCTTTAAGA    c.4140
  | V  A  E  I  F  V  K  A  F  P  Y  P  E  D  V  R  V  P  L  R      p.1380

          .         .         .         .          | 25        .    g.69360
 GACAAATATCACTCTCTTCACCAGAGACTCAGACACTGTGTTGTGAAAG | GACCCCAGACT    c.4200
 D  K  Y  H  S  L  H  Q  R  L  R  H  C  V  V  K  G |   P  Q  T      p.1400

          .         .         .         .         .         .       g.69420
 GAGGAAATGATGTCTGTTGTCATGCATTCTATCCAGAAAGTGAGGGTGAAAGCTCTAAAA       c.4260
 E  E  M  M  S  V  V  M  H  S  I  Q  K  V  R  V  K  A  L  K         p.1420

          .         .         .         .         .         .       g.69480
 CGTGTGCAGAGAAATATAGGCTCTTTTGAAGTGAATATATGGGAACCAATTGAAGAAGAG       c.4320
 R  V  Q  R  N  I  G  S  F  E  V  N  I  W  E  P  I  E  E  E         p.1440

          .         .         .         .         .         .       g.69540
 AAACCAGATGAGGCTCCAGGTGTTGACAGATATTCCCTGGGGACTAGTTTGAGCAGAAGT       c.4380
 K  P  D  E  A  P  G  V  D  R  Y  S  L  G  T  S  L  S  R  S         p.1460

          .         .         .         .         .         .       g.69600
 ACACTCACAGAACTAGGAGATTCTGTGGTTCACAGTGATGCAGATACGTTCTCTGAAGCT       c.4440
 T  L  T  E  L  G  D  S  V  V  H  S  D  A  D  T  F  S  E  A         p.1480

          .         .         .         .  | 26      .         .    g.70748
 TTGTCGGTTGAAGAAAAAAGTAGGATAAATATCTATCAAAG | AAATGCCCCAAATCACATG    c.4500
 L  S  V  E  E  K  S  R  I  N  I  Y  Q  R  |  N  A  P  N  H  M      p.1500

          .         .         .         .         .         .       g.70808
 GAATTAACATCAATTCATAAGCCAACTGATAAAAGGAAAATGTGTAATCAGAAAGAAAAT       c.4560
 E  L  T  S  I  H  K  P  T  D  K  R  K  M  C  N  Q  K  E  N         p.1520

          .         .         .         .         .         .       g.70868
 CCTACAAAGAAAGAAGATCATGAAAAGTTATCACAAAATACACTTCCTGTAATAGGTGTT       c.4620
 P  T  K  K  E  D  H  E  K  L  S  Q  N  T  L  P  V  I  G  V         p.1540

          .         .         .         .         .         .       g.70928
 TGGGAATTTGAACGTGATGATGATGAATATATTAAATTCCTTGATCTGTTTTTGAGTTAC       c.4680
 W  E  F  E  R  D  D  D  E  Y  I  K  F  L  D  L  F  L  S  Y         p.1560

          .         .         .         .         .         .       g.70988
 ATTCTTGAAAGAGACCTACCTTATTCCAGGGATGCTGACATTCCATTTCTAACTAGTTTT       c.4740
 I  L  E  R  D  L  P  Y  S  R  D  A  D  I  P  F  L  T  S  F         p.1580

          .         .         .         .         .         .       g.71048
 TCTGGAAAGCTTAGAGAACATGAACTTAATTCTTTACTTTTTGATGTACATACAACATTA       c.4800
 S  G  K  L  R  E  H  E  L  N  S  L  L  F  D  V  H  T  T  L         p.1600

          .         .         .         .         .         .       g.71108
 AAACGACATCAGAGCAAAACTAAAAGCCAGAATGTGTTTAGAGCTGGTTCTTGCTTTGTT       c.4860
 K  R  H  Q  S  K  T  K  S  Q  N  V  F  R  A  G  S  C  F  V         p.1620

          .         .         .         .         .         .       g.71168
 GTTGCTCCTGAGTCCTATGAATCAGAAAAATCATCCTCTTTAAATGATGAATATGGCATG       c.4920
 V  A  P  E  S  Y  E  S  E  K  S  S  S  L  N  D  E  Y  G  M         p.1640

          .         .         .         .         .         .       g.71228
 CATTTAGAAAACCAGAAACTTTCATCATCAGTACTGGTTAATCAAGGGATCAAACCTTTT       c.4980
 H  L  E  N  Q  K  L  S  S  S  V  L  V  N  Q  G  I  K  P  F         p.1660

          .         .         .         .         .         .       g.71288
 TTACAATATCCTTCGAATGAAGTCAATAAGAATGAAGGAATGAGTGGATTATTTGGTTTA       c.5040
 L  Q  Y  P  S  N  E  V  N  K  N  E  G  M  S  G  L  F  G  L         p.1680

          .         .         .         .         .         .       g.71348
 AAACAAAGGTCAATTTACAAAATACAAGATGACACTAGAGAGAAATGTCTAATCCAGAGA       c.5100
 K  Q  R  S  I  Y  K  I  Q  D  D  T  R  E  K  C  L  I  Q  R         p.1700

          .         .         .         .         .         .       g.71408
 TCATCAAACCACATTTTTTGGACTCCCAAGTCCATTAAAACTAGAAGATGTATTTTCAAA       c.5160
 S  S  N  H  I  F  W  T  P  K  S  I  K  T  R  R  C  I  F  K         p.1720

          .         .         .         .         .         .       g.71468
 GCTATTCAGTGCAATGATATTAACCCTCAAGAAGATCTTCCTTTAGCACTAAACACTTTT       c.5220
 A  I  Q  C  N  D  I  N  P  Q  E  D  L  P  L  A  L  N  T  F         p.1740

          .         .         .         .         .         .       g.71528
 GGCAGTATAGGAAGACTGCTGGAATGGATGATAAGGTGGTCTAATAGAAGGCTACTCTGT       c.5280
 G  S  I  G  R  L  L  E  W  M  I  R  W  S  N  R  R  L  L  C         p.1760

          .         .         .         .         .         .       g.71588
 GATTCTGGTATAACTGAGTCATCCTCTGAGTACAGTCCAGTAATTCGTGTAAAGACCTCT       c.5340
 D  S  G  I  T  E  S  S  S  E  Y  S  P  V  I  R  V  K  T  S         p.1780

          .         .         .         .         .         .       g.71648
 ACAGCTGCCATTCTTACATCATTATGGCTTTTGGAACAACCCTATTTTGCTACATATAAG       c.5400
 T  A  A  I  L  T  S  L  W  L  L  E  Q  P  Y  F  A  T  Y  K         p.1800

          .         .  | 27      .         .         .         .    g.73462
 GCAAAAAATGCCATTATTAAG | ATGGTAGAGAATCGTGACACTGGGTGTCAGATTGGACCC    c.5460
 A  K  N  A  I  I  K   | M  V  E  N  R  D  T  G  C  Q  I  G  P      p.1820

          .         .         .         .         .         .       g.73522
 AATATTGAGAGGGAGAGCAAATCAGATGCTGGCGGTTCAGTTGCAGTAGCAACTCCAGGT       c.5520
 N  I  E  R  E  S  K  S  D  A  G  G  S  V  A  V  A  T  P  G         p.1840

          .         .         .         .         . | 28       .    g.74255
 GGAACTGAGGAAAGAAATGGTCAGAATAAATCTTGTCAAAATATCTTGAA | TAGAATGCCA    c.5580
 G  T  E  E  R  N  G  Q  N  K  S  C  Q  N  I  L  N  |  R  M  P      p.1860

          .         .         .         .         .         .       g.74315
 ACTGAAGCAAAAAATCCTGATATAAAAGAAATCAATGATGATATTATTTCCATCACTCAT       c.5640
 T  E  A  K  N  P  D  I  K  E  I  N  D  D  I  I  S  I  T  H         p.1880

          .         .         .         .         .         .       g.74375
 AATACTAAAAAAGAATTTATAGATATTGATGAGAATCTTTTAGAAGTAGAAGCATTTACA       c.5700
 N  T  K  K  E  F  I  D  I  D  E  N  L  L  E  V  E  A  F  T         p.1900

          .         .         .        | 29.         .         .    g.75008
 GAAGAGGAAATGGATATGCACATATCAGACTATGAAG | AAGACATTGAAGAATCTGTTGGA    c.5760
 E  E  E  M  D  M  H  I  S  D  Y  E  E |   D  I  E  E  S  V  G      p.1920

          .         .         .         .         .         .       g.75068
 GGTTTCAGAAGTCCCAGTCTTGCCATTTGCATGATGACTTTACCACAGCAGTTAGAAGAA       c.5820
 G  F  R  S  P  S  L  A  I  C  M  M  T  L  P  Q  Q  L  E  E         p.1940

  | 30       .         .         .         .         .         .    g.76788
  | GAGTTCACAGAAGAGGTTCAGTGTCAAAGGGAAGAACCACTGGAGACAATTATGGAGGAA    c.5880
  | E  F  T  E  E  V  Q  C  Q  R  E  E  P  L  E  T  I  M  E  E      p.1960

          .         . | 31       .         .         .         .    g.78482
 AAATCGACTGAACAAAAAGG | TATGATCGAAGCCTTTTCACATCCTGGGCATACCACTCCT    c.5940
 K  S  T  E  Q  K  G  |  M  I  E  A  F  S  H  P  G  H  T  T  P      p.1980

          .         .         .         | 32         .         .    g.80503
 CAATCAATGCAAGTAGATACGAGTTCAGAAATTTCTAG | TGCACAGATTTCTACATATAAA    c.6000
 Q  S  M  Q  V  D  T  S  S  E  I  S  S  |  A  Q  I  S  T  Y  K      p.2000

          .         .         .         .         .         .       g.80563
 GAAAAATCTTCCTCAGTTCCACTTCTGATATCAAATGGAGTCAATGTTGCTTCACAACCA       c.6060
 E  K  S  S  S  V  P  L  L  I  S  N  G  V  N  V  A  S  Q  P         p.2020

          .         .         .         .         .         .       g.80623
 CCTGCTCCAACACCTCAGAAGACCCAGAGAAATGAATTCACGGCTCAGTTACCAGATTGT       c.6120
 P  A  P  T  P  Q  K  T  Q  R  N  E  F  T  A  Q  L  P  D  C         p.2040

          .         .         .         .         .  | 33      .    g.84106
 TCGGAGTCCGTTAGGCAGATGCTGCAAGATGAAATGTTTAAATTAGTTCAG | CTGCAACAG    c.6180
 S  E  S  V  R  Q  M  L  Q  D  E  M  F  K  L  V  Q   | L  Q  Q      p.2060

          .         .         .         .         .         .       g.84166
 ATCAACTTCATGAGCCTAATGCAAATAGTAGGATCATCCTTTGCTAATCTCCCAGATACA       c.6240
 I  N  F  M  S  L  M  Q  I  V  G  S  S  F  A  N  L  P  D  T         p.2080

          .         .         .         .         .         .       g.84226
 CAACAACTTGTACAGCAGTCTCAGTCTGTGCATTTAGGGGAAAGCCAAGAATCAAACCTA       c.6300
 Q  Q  L  V  Q  Q  S  Q  S  V  H  L  G  E  S  Q  E  S  N  L         p.2100

          .         .         .         .         .         .       g.84286
 AGAGGATGTGGTGATGTTGAAGACAGCAACAAAAATCTTAAGGAGAGATTTTTTATTAAA       c.6360
 R  G  C  G  D  V  E  D  S  N  K  N  L  K  E  R  F  F  I  K         p.2120

          .         .         .         .         .         .       g.84346
 CCACAGTCAATGGGAGAGAACGCCAGAGAGCCTCGCAAGAACAGCCCACACTGCCATGAA       c.6420
 P  Q  S  M  G  E  N  A  R  E  P  R  K  N  S  P  H  C  H  E         p.2140

          .         .         .         .   | 34     .         .    g.84885
 GGAACTATCCCATCTGGTCAAAATAGTACTGGAAACGTACAG | AATGTTCCACATGGGAGT    c.6480
 G  T  I  P  S  G  Q  N  S  T  G  N  V  Q   | N  V  P  H  G  S      p.2160

          .         .         .         .         .         .       g.84945
 ATTCCTTTATGTCAATTAAATGGCCAGCCCCGGAAAAAAGGACCAATTCCATCATCTCAA       c.6540
 I  P  L  C  Q  L  N  G  Q  P  R  K  K  G  P  I  P  S  S  Q         p.2180

          .         .         .         .         .         .       g.85005
 AACTTACCATCCACTTCGTTTTATCCAGCTCCTGCTGGAAATACTCACCTCTACCTTTTG       c.6600
 N  L  P  S  T  S  F  Y  P  A  P  A  G  N  T  H  L  Y  L  L         p.2200

          .         .         .         .         .         .       g.85065
 TCCACACCTTCTGTTGTTCAGAAGGCACCTAGACTTATCCCACATGCAAAAACATTTAGT       c.6660
 S  T  P  S  V  V  Q  K  A  P  R  L  I  P  H  A  K  T  F  S         p.2220

          .         .         .         .         .         .       g.85125
 CCTGGTGATGGCTTTCCTTTGCTTCAATTTAAGTCTAAACAAGAATTCCAGCCCCTTTTC       c.6720
 P  G  D  G  F  P  L  L  Q  F  K  S  K  Q  E  F  Q  P  L  F         p.2240

          .         .         .         .         .         .       g.85185
 TTACATACAGGAAGTATTCCACAAGTTCCCTTCAGGCCTTTGCCACAACCAAGAGAGGCT       c.6780
 L  H  T  G  S  I  P  Q  V  P  F  R  P  L  P  Q  P  R  E  A         p.2260

          .         .         .         .         .         .       g.85245
 TGGGGATTATCTGACTCCTTCCAACCTGCTCTGCCACAGAGAGCAGCACAAACTACTCCA       c.6840
 W  G  L  S  D  S  F  Q  P  A  L  P  Q  R  A  A  Q  T  T  P         p.2280

          .         .         .         .         .         .       g.85305
 GCATCCCATTTGAATGTAAGCCAGTATAACACTGAAGCCAGAAAAAAAGAAGTTGAGCAG       c.6900
 A  S  H  L  N  V  S  Q  Y  N  T  E  A  R  K  K  E  V  E  Q         p.2300

          .         .         .         .         .         .       g.85365
 AAGACGTGGGCAGAAACTGTAATTACAGAAATTCCTAATCATGTGAACTTGGATCAATAT       c.6960
 K  T  W  A  E  T  V  I  T  E  I  P  N  H  V  N  L  D  Q  Y         p.2320

          .         .         .         .         .         .       g.85425
 GTTGGACAAGAAAATTTGACACCTCAACAGGACTCTTCAGTGTTTATAAAACCAGAAAAA       c.7020
 V  G  Q  E  N  L  T  P  Q  Q  D  S  S  V  F  I  K  P  E  K         p.2340

          .         .         .         .         .         .       g.85485
 CTATTTGATGTTAAGCCAGGGACCCTTGAGATATCTCCTCACCATTCCTTTGGACTTCCG       c.7080
 L  F  D  V  K  P  G  T  L  E  I  S  P  H  H  S  F  G  L  P         p.2360

          .         .         .         .         .         .       g.85545
 TTACTATACCTGCCACTTAAACCTCCTAATATGTTTCCATCAACCTCAAGAGCATCTATT       c.7140
 L  L  Y  L  P  L  K  P  P  N  M  F  P  S  T  S  R  A  S  I         p.2380

          .         .         .         .         .         .       g.85605
 ACAGTTCCCTCAACACCTATCCAACCTATAGCAGAAGAAAGAAAATACCCAAGATTGTCA       c.7200
 T  V  P  S  T  P  I  Q  P  I  A  E  E  R  K  Y  P  R  L  S         p.2400

          .         .         .    | 35    .         .         .    g.87242
 TTACTTCATTCACATTTGTCCCCAGAAAATAGG | TGCAAAAAAACACAACTTATCCCACTT    c.7260
 L  L  H  S  H  L  S  P  E  N  R   | C  K  K  T  Q  L  I  P  L      p.2420

          .         .         .         .         .         .       g.87302
 GAAAACCTCATTGCGTTTAAACAAAGCCAACAGAAACTAACACATAATTTATTTGAACAA       c.7320
 E  N  L  I  A  F  K  Q  S  Q  Q  K  L  T  H  N  L  F  E  Q         p.2440

          .         .         .         .         .         .       g.87362
 GGTGATGCTGGACACCTTCAACTTCTAAAGGTCAAAATAGAACCACCTGAAGTAAGACAA       c.7380
 G  D  A  G  H  L  Q  L  L  K  V  K  I  E  P  P  E  V  R  Q         p.2460

          .         . | 36       .         .         .         .    g.88797
 GGAAAGGACAGTAAAAAAAG | GCAAAGAAGAAGAGCTGAGAAAGAGCTGCAAGAAAAAAGA    c.7440
 G  K  D  S  K  K  R  |  Q  R  R  R  A  E  K  E  L  Q  E  K  R      p.2480

          .         .         .         .         .         .       g.88857
 TGTGAGAAACTGAGGAGAAAACCAAATGTGACTTTTCGACCAGAGAATTCCATAATTAAT       c.7500
 C  E  K  L  R  R  K  P  N  V  T  F  R  P  E  N  S  I  I  N         p.2500

          .         .         .    | 37    .         .         .    g.90128
 AATGATGATTCAGAAATCATTAAGAAACCCAAG | GAACAACAAGAACATTGTGGTTCCCAT    c.7560
 N  D  D  S  E  I  I  K  K  P  K   | E  Q  Q  E  H  C  G  S  H      p.2520

          .         .         | 38         .         .         .    g.91894
 CCTTTGGATGACTTCGACGTTCCTTTTG | AAATGCTACAAGATGATAATACTTCAGCTGGA    c.7620
 P  L  D  D  F  D  V  P  F  E |   M  L  Q  D  D  N  T  S  A  G      p.2540

          .         .         .         .         .         .       g.91954
 TTGCATTTCATGGCCTCTGTAAAAAAGAAAGCTATAGGAAGTCAAGATGCAAGTACAAAT       c.7680
 L  H  F  M  A  S  V  K  K  K  A  I  G  S  Q  D  A  S  T  N         p.2560

          . | 39       .         .         .         .         .    g.96133
 ACAGACCCAG | AACATGAGCCTTTGACTGCTCCTCAGCTCTTGGTCCCAGATGTCTATCTA    c.7740
 T  D  P  E |   H  E  P  L  T  A  P  Q  L  L  V  P  D  V  Y  L      p.2580

          .         .         .         .         .         .       g.96193
 AATCTGAAGCTTTCCAGTGAAATGTCAGAGAAACCTTGGTCACCCTCAATACCTCATACA       c.7800
 N  L  K  L  S  S  E  M  S  E  K  P  W  S  P  S  I  P  H  T         p.2600

          .   | 40     .         .         .         .         .    g.96662
 GTAACAAACTTG | GAATTACCTGTGAGAGAAGAGCCTTCAAATGATAATGTTATCAAACAG    c.7860
 V  T  N  L   | E  L  P  V  R  E  E  P  S  N  D  N  V  I  K  Q      p.2620

          .         .         .         .         .         .       g.96722
 CAAAGCGATCATCTAGCAGTTCCATCGTCTGCAGAGTTACATTATATGGCAGCTTCAGTT       c.7920
 Q  S  D  H  L  A  V  P  S  S  A  E  L  H  Y  M  A  A  S  V         p.2640

          .         .         .        | 41.         .         .    g.100458
 ACTAATGCTGTTCCCCCACATAATTTTAAGAGTCAAG | GTCTGCCAAAACCAGAGTTCCGA    c.7980
 T  N  A  V  P  P  H  N  F  K  S  Q  G |   L  P  K  P  E  F  R      p.2660

          .         .         .         .         .         .       g.100518
 TTCAAAGGACAGAGCACAAAGTCAGACTCTGCAGAAGATTATCTATTGTGGAAACGGCTG       c.8040
 F  K  G  Q  S  T  K  S  D  S  A  E  D  Y  L  L  W  K  R  L         p.2680

          .         .         .         .         .         .       g.100578
 CAAGGTGTCTCTGCAGCTTGCCCTGCACCAAGCTCTGCAGCTCACCAACTAGAGCATCTC       c.8100
 Q  G  V  S  A  A  C  P  A  P  S  S  A  A  H  Q  L  E  H  L         p.2700

          .         .         .         .         .         .       g.100638
 AGTGCTAAGCTTCAGAAAATTGACGAGCAGTTGCTAGCAATACAGAACATTGCTGAAAAC       c.8160
 S  A  K  L  Q  K  I  D  E  Q  L  L  A  I  Q  N  I  A  E  N         p.2720

          .         .         .         .         .  | 42      .    g.106169
 ATAGAACAGGATTTCCCCAAGCCTGAAATGCTAGATCTACATTGTGATAAG | ATTGGACCA    c.8220
 I  E  Q  D  F  P  K  P  E  M  L  D  L  H  C  D  K   | I  G  P      p.2740

          .         .         .         .         .         .       g.106229
 GTGGATCACATTGAATTCTCTTCTGGCCCTGAATTCAAAAAAACATTAGCTTCAAAAACC       c.8280
 V  D  H  I  E  F  S  S  G  P  E  F  K  K  T  L  A  S  K  T         p.2760

          .          | 43        .         .         .         .    g.111989
 ATTAGCATTTCTGAAGAAG | TGCGTTTTTTGACCCATATGGATGAAGAAGATCAAAGTGAC    c.8340
 I  S  I  S  E  E  V |   R  F  L  T  H  M  D  E  E  D  Q  S  D      p.2780

          .         .         .         .         .         .       g.112049
 AAAAAGGAGACTTCAGAACCTGAATTTTCAATAACAGAAAATTATTCTGGTCAGAAAACC       c.8400
 K  K  E  T  S  E  P  E  F  S  I  T  E  N  Y  S  G  Q  K  T         p.2800

          .         .         .         .         .         .       g.112109
 TGTGTGTTTCCTACTGCCGATTCAGCTGTCAGCCTTTCCAGTTCCAGTGATCAGAATACT       c.8460
 C  V  F  P  T  A  D  S  A  V  S  L  S  S  S  S  D  Q  N  T         p.2820

          . | 44       .         .         .  | 45      .         . g.115599
 ACTTCTCCTG | GTATGAATAGCAGTGATGAATTGTGTGAGAG | TGTTTCAGTACATCCGCTC c.8520
 T  S  P  G |   M  N  S  S  D  E  L  C  E  S  |  V  S  V  H  P  L   p.2840

          .         .         .         .         .         .       g.115659
 CAGATGACTGGATTGACTGATATTGCAGACATTATTGATGACCTTATAATTAAAGACGGA       c.8580
 Q  M  T  G  L  T  D  I  A  D  I  I  D  D  L  I  I  K  D  G         p.2860

          .         .         .         .         . | 46       .    g.129029
 GTTTCCAGTGAAGAACTTGGCTTAACAGAACAAGCTATGGGCACCTCCAG | AATTCAGCAT    c.8640
 V  S  S  E  E  L  G  L  T  E  Q  A  M  G  T  S  R  |  I  Q  H      p.2880

          .         .         .         .         .         .       g.129089
 TATTCTGGCAGACATTCACAAAGAACTGACAAGGAAAGAAGAGAGATTCAAGCCTGGATG       c.8700
 Y  S  G  R  H  S  Q  R  T  D  K  E  R  R  E  I  Q  A  W  M         p.2900

          .         .         .         .         .         .       g.129149
 AAAAGAAAACGAAAAGAAAGAATGGCAAAGTACTTAAATGAGCTGGCAGAAAAGAGAGGG       c.8760
 K  R  K  R  K  E  R  M  A  K  Y  L  N  E  L  A  E  K  R  G         p.2920

          .         .         .       | 47 .         .         .    g.131964
 CAAGAACATGATCCTTTCTGTCCCAGAAGCAATCCA | CTTTACATGACTTCAAGGGAAATA    c.8820
 Q  E  H  D  P  F  C  P  R  S  N  P   | L  Y  M  T  S  R  E  I      p.2940

          .         .         .      | 48  .         .         .    g.132669
 AGGCTGAGACAAAAGATGAAGCATGAAAAAGACAG | ATTGCTGCTCTCTGAACACTATAGT    c.8880
 R  L  R  Q  K  M  K  H  E  K  D  R  |  L  L  L  S  E  H  Y  S      p.2960

          .         .         .         .         .         .       g.132729
 CGTCGAATCTCACAAGCGTACGGTCTGATGAATGAACTGTTATCTGAGTCAGTACAGCTA       c.8940
 R  R  I  S  Q  A  Y  G  L  M  N  E  L  L  S  E  S  V  Q  L         p.2980

          .         .         .         .         .         .       g.132789
 CCAACTCTACCACAGAAACCATTGCCTAACAAACCCAGCCCTACTCAGTCTTCCAGTTGT       c.9000
 P  T  L  P  Q  K  P  L  P  N  K  P  S  P  T  Q  S  S  S  C         p.3000

          .         .    | 49    .         .         .         .    g.134125
 CAACACTGCCCTTCTCCAAGAGG | AGAGAATCAACATGGTCACAGTTTTCTAATAAATCGA    c.9060
 Q  H  C  P  S  P  R  G  |  E  N  Q  H  G  H  S  F  L  I  N  R      p.3020

          .         .         .         .         .         .       g.134185
 CCTGGAAAAGTCAAATATATGTCCAAACCGAGTTATATCCATAAGAGGAAGTCTTTTGGG       c.9120
 P  G  K  V  K  Y  M  S  K  P  S  Y  I  H  K  R  K  S  F  G         p.3040

          .         .         | 50         .         .         .    g.139411
 CAACCTCAAGGCTCACCTTGGCCACATG | GAACTGCCACTTTCACCATACAGAAAAAAGCT    c.9180
 Q  P  Q  G  S  P  W  P  H  G |   T  A  T  F  T  I  Q  K  K  A      p.3060

          .         .         .         .         .         | 51    g.145959
 GGTGGAGCCAAAGCAGCAGTAAGAAAGGCTACGCAGTCTCCAGTTACCTTCCAAAAAG | GC    c.9240
 G  G  A  K  A  A  V  R  K  A  T  Q  S  P  V  T  F  Q  K  G |       p.3080

          .         .         .         .         .         .       g.146019
 TCTAATGCTCCGTGTCATAGTCTGCAGCATACAAAAAAACATGGAAGTGCTGGGCTTGCA       c.9300
 S  N  A  P  C  H  S  L  Q  H  T  K  K  H  G  S  A  G  L  A         p.3100

          .         .         .         .         .         .       g.146079
 CCTCAAACCAAGCAGGTGTGTGTAGAGTATGAAAGAGAGGAGACTGTGGTGAGTCCCTGG       c.9360
 P  Q  T  K  Q  V  C  V  E  Y  E  R  E  E  T  V  V  S  P  W         p.3120

          .         .         .         .         .        | 52.    g.146653
 ACGATACCTTCAGAAATCCATAAGATTCTTCATGAGAGTCACAATTCCCTTCTACAA | GAC    c.9420
 T  I  P  S  E  I  H  K  I  L  H  E  S  H  N  S  L  L  Q   | D      p.3140

          .         .         .         .         .         .       g.146713
 TTGTCTCCAACTGAAGAGGAAGAGCCAGAGCATCCTTTTGGGGTGGGCGGTGTGGACAGC       c.9480
 L  S  P  T  E  E  E  E  P  E  H  P  F  G  V  G  G  V  D  S         p.3160

          .         .         .         .         .         .       g.146773
 GTGTCTGAGAGCACTGGCAGCATCCTCAGCAAGCTGGACTGGAATGCCATCGAAGACATG       c.9540
 V  S  E  S  T  G  S  I  L  S  K  L  D  W  N  A  I  E  D  M         p.3180

          .         .         .         .         .                 g.146827
 GTGGCCAGCGTGGAGGACCAGGGCCTGTCTGTCCACTGGGCCCTGGACCTGTAA             c.9594
 V  A  S  V  E  D  Q  G  L  S  V  H  W  A  L  D  L  X               p.3197

          .         .         .         .         .         .       g.146887
 gacctggatatcattgggtttccatgcacaggccagcacctcagtaatgtggttctgaaa       c.*60

          .         .         .         .         .         .       g.146947
 gattaacaggtttaagggacagaagcaatgaaagaagcaatgtgaattttccatttgctt       c.*120

          .         .         .         .         .         .       g.147007
 tcatattattacctggattagccattaccagaggaaaaataaacatttctcagtaacttt       c.*180

          .         .         .         .         .         .       g.147067
 gcctttatggggaaagggttgactattgatgtattatatgtttttgtatttgatgcatca       c.*240

          .         .         .         .         .         .       g.147127
 ttaggcataatttttaaaatgataagtacctttcaagccaagtttgcataacctactttc       c.*300

          .         .         .         .         .         .       g.147187
 aataaaaaccctctatcttgcctcctcctttattaccctctgagttttgagaaacaacca       c.*360

          .         .         .         .         .         .       g.147247
 tatacagatgaatctaataggaaaaaaaaaaatcttttcattgagaagaaaatcagtctc       c.*420

          .         .         .         .         .         .       g.147307
 acctgagaactcaattatgaaccctattttaaaacacctatgcagggtttagcctaggag       c.*480

          .         .         .         .         .         .       g.147367
 tgaaaagaaaaaccaactaccttttaccaaccctgaatctctaaataagcaaagtttcat       c.*540

          .         .         .         .         .         .       g.147427
 ggaggccaggagatcttctgtcttctgccctgtagcctgaagccttggaggaagaaacag       c.*600

          .         .         .         .         .         .       g.147487
 gaatggatgctttgggcaggaaagtaagggaatatgactccggcctctagaaggctcatc       c.*660

          .         .         .         .         .         .       g.147547
 ttaaatttgtaagaaccatggtacagagacctgattagtttttggtattgtgctccaata       c.*720

          .         .         .         .         .         .       g.147607
 atgtcatagttttaagagataatttttatgagaattgactaagaaccagtatccttcaac       c.*780

          .         .         .         .         .         .       g.147667
 tacttcatcaatgtttggtataatataaaagcacactatcatctgaaaaagctattaaat       c.*840

          .         .         .         .         .         .       g.147727
 acccctctttttccaaatatctacctgtgtgaagccaggttttacaacatgtattgcagc       c.*900

          .         .         .         .         .         .       g.147787
 aagttgaatgcagaagcaggtatggtaattcagctgccttctatcaagctaaacattaaa       c.*960

          .         .         .         .         .         .       g.147847
 gagatttgtagaactataaaacaatgctactctccttaccaaattgttttagaaaatagc       c.*1020

          .         .         .         .         .         .       g.147907
 tttataggctaacattattgttaattgtcatttaattgttttgtcatttaaaatatttta       c.*1080

          .         .         .         .         .         .       g.147967
 aattgttttctgttagtttcttttttgtatattctatgggtattttattgatacatgata       c.*1140

          .         .         .         .         .         .       g.148027
 gttgtacatttttatggggtgcatgtgatattttgatatgtgcatacaatgtgtagcaat       c.*1200

          .         .         .         .         .         .       g.148087
 caaatcagggtaattgggatattcatcacctcaaacatttatcatttatttgtgttggaa       c.*1260

          .         .         .         .         .         .       g.148147
 acattcaaaccttttcttctagctatttatccattgttggatacttatatcaattctata       c.*1320

          .         .         .         .         .                 g.148201
 tcttagctgttgtgaatagagctgcaataaatgtaggagtgcagatatctcttt             c.*1374

 (downstream sequence)
Legend:
Nucleotide numbering (following the rules of the HGVS for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The Chromosome 5 open reading frame 42 protein sequence is shown below the coding DNA sequence, with numbering indicated at the right starting with 1 for the translation initiating Methionine. Every 10th amino acid is shown in bold. The position of introns is indicated by a vertical line, splitting the two exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frame shift variants, all stop codons in the +1 frame are shown in bold while all stop codons in the +2 frame are underlined.

Powered by LOVD v.3.0 Build 12
©2004-2015 Leiden University Medical Center