G protein-coupled receptor 98 (GPR98) - coding DNA reference sequence

(used for variant description)

(last modified March 12, 2014)


This file was created to facilitate the description of sequence variants in transcript NM_032119.3 of the GPR98 gene, based on a coding DNA reference sequence and following the HGVS recommendations.
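
As an illustration of how the c. and p. labels at the end of each row relate, the following is a minimal sketch, assuming the standard genetic code and a reading frame that starts at c.1. The helper names (codon_number, translate) are hypothetical and are not part of this file or of any HGVS tool. The sketch maps a coding position to its codon number and translates the first coding row (c.1 to c.60) with the exon marker removed.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codon_number(c_pos):
    """Return the codon (p.) number containing coding position c_pos (c.1 or later)."""
    if c_pos < 1:
        raise ValueError("only positions from c.1 onwards lie in a codon")
    return (c_pos + 2) // 3


def translate(cds):
    """Translate a coding sequence that starts in frame; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First coding row of this file (c.1 to c.60), exon 1/2 marker removed.
FIRST_ROW = "ATGTCGGTGTTCCTGGGGCCAG" + "GGATGCCCTCTGCATCTTTATTAGTAAATCTTCTTTCA"

print(codon_number(60))      # codon number for the last base of the row
print(translate(FIRST_ROW))  # one-letter protein string for the row
```

Each row holds 60 nucleotides, so the p. label at the end of a coding row is the c. label divided by three.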

The sequence was taken from NC_000005.9, covering GPR98 transcript NM_032119.3.
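
The g. labels at the row ends count positions in the genomic reference, so within one exon the g. and c. numbers differ by a constant offset. The sketch below is a hedged example that uses only offsets inferred from the row-end labels of the upstream block and the first two exons of this file. The names SEGMENTS and c_to_g are hypothetical, and c.1 = g.5097 is an inference from the contiguous exon 1 rows, not a value printed in the file.

```python
# Each entry: (first c. position, last c. position, offset), with g = c + offset.
# Offsets are inferred from row-end labels in this file, for illustration only.
SEGMENTS = [
    (-96, -1, 5097),   # upstream part of exon 1 (c.-61 = g.5036, c.-1 = g.5096)
    (1, 22, 5096),     # coding part of exon 1 (there is no c.0, so c.1 = g.5097)
    (23, 207, 61013),  # exon 2 (c.60 = g.61073, c.120 = g.61133)
]


def c_to_g(c_pos):
    """Map a c. position to its g. position for the segments listed above."""
    if c_pos == 0:
        raise ValueError("HGVS coding numbering has no position c.0")
    for first, last, offset in SEGMENTS:
        if first <= c_pos <= last:
            return c_pos + offset
    raise ValueError("position outside the segments covered by this sketch")


for c_pos in (-61, -1, 60, 120, 180):
    print(c_pos, c_to_g(c_pos))
```

Later exons would need their own entries, each derived in the same way from a row-end label inside that exon.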


 (upstream sequence)
                               .         .         .                g.5036
                         agtaagaatcagcagcgcgggcaaggagtacggacg       c.-61

 .         .         .         .         .         .                g.5096
 ggagtcagaggcagagcgagggtgtgtggagggccggcggggaccgccgggagcgcgcgg       c.-1

          .         .   | 02     .         .         .         .    g.61073
 ATGTCGGTGTTCCTGGGGCCAG | GGATGCCCTCTGCATCTTTATTAGTAAATCTTCTTTCA    c.60
 M  S  V  F  L  G  P  G |   M  P  S  A  S  L  L  V  N  L  L  S      p.20

          .         .         .         .         .         .       g.61133
 GCTTTACTCATCCTATTTGTGTTTGGAGAAACAGAAATAAGATTTACTGGACAAACTGAA       c.120
 A  L  L  I  L  F  V  F  G  E  T  E  I  R  F  T  G  Q  T  E         p.40

          .         .         .         .         .         .       g.61193
 TTTGTTGTTAATGAAACAAGTACAACAGTTATTCGTCTTATCATTGAAAGGATAGGAGAG       c.180
 F  V  V  N  E  T  S  T  T  V  I  R  L  I  I  E  R  I  G  E         p.60

          .         .        | 03.         .         .         .    g.64037
 CCAGCAAATGTTACTGCAATTGTATCG | CTGTATGGAGAGGACGCTGGTGACTTTTTTGAC    c.240
 P  A  N  V  T  A  I  V  S   | L  Y  G  E  D  A  G  D  F  F  D      p.80

          .         .         .         .         .         .       g.64097
 ACATATGCTGCAGCTTTTATACCTGCCGGAGAAACAAACAGAACAGTGTACATAGCAGTA       c.300
 T  Y  A  A  A  F  I  P  A  G  E  T  N  R  T  V  Y  I  A  V         p.100

          .         .         .         .         .        | 04.    g.65289
 TGTGATGATGACTTACCAGAGCCTGACGAAACTTTTATTTTTCACTTAACATTACAG | AAA    c.360
 C  D  D  D  L  P  E  P  D  E  T  F  I  F  H  L  T  L  Q   | K      p.120

          .         .         .         .         .         .       g.65349
 CCTTCAGCAAATGTGAAGCTTGGATGGCCAAGGACTGTTACTGTGACAATATTATCAAAT       c.420
 P  S  A  N  V  K  L  G  W  P  R  T  V  T  V  T  I  L  S  N         p.140

          .         .         .    | 05    .         .         .    g.68824
 GACAATGCATTTGGAATTATTTCATTTAATATG | CTTCCCTCAATCGCAGTGAGTGAGCCC    c.480
 D  N  A  F  G  I  I  S  F  N  M   | L  P  S  I  A  V  S  E  P      p.160

          .         .         .         .         .         .       g.68884
 AAGGGCAGAAATGAGTCTATGCCTCTTACTCTCATCAGGGAAAAGGGAACCTATGGAATG       c.540
 K  G  R  N  E  S  M  P  L  T  L  I  R  E  K  G  T  Y  G  M         p.180

          .         | 06         .         .         .         .    g.71372
 GTCATGGTGACTTTTGAG | GTAGAGGGTGGCCCAAATCCCCCTGATGAAGATTTGAGTCCA    c.600
 V  M  V  T  F  E   | V  E  G  G  P  N  P  P  D  E  D  L  S  P      p.200

          .         .         .         .         .         .       g.71432
 GTTAAAGGAAATATCACCTTTCCCCCTGGCAGAGCAACAGTAATTTATAACTTGACAGTA       c.660
 V  K  G  N  I  T  F  P  P  G  R  A  T  V  I  Y  N  L  T  V         p.220

          .   | 07     .         .         .         .         .    g.73459
 CTCGATGACGAG | GTACCAGAAAATGATGAAATATTTTTAATTCAACTGAAAAGTGTAGAA    c.720
 L  D  D  E   | V  P  E  N  D  E  I  F  L  I  Q  L  K  S  V  E      p.240

          .         .         .         .         .         .       g.73519
 GGAGGAGCTGAGATTAACACCTCTAGGAATTCCATTGAGATCATCATTAAGAAAAATGAT       c.780
 G  G  A  E  I  N  T  S  R  N  S  I  E  I  I  I  K  K  N  D         p.260

          .         .         .         .         .         .       g.73579
 AGTCCCGTGAGATTCCTTCAGAGTATTTATTTGGTTCCTGAGGAAGACCACATACTCATA       c.840
 S  P  V  R  F  L  Q  S  I  Y  L  V  P  E  E  D  H  I  L  I         p.280

          .         .         .         .         .         .       g.73639
 ATTCCAGTAGTTCGTGGAAAGGACAACAATGGAAATCTGATTGGATCTGATGAATATGAG       c.900
 I  P  V  V  R  G  K  D  N  N  G  N  L  I  G  S  D  E  Y  E         p.300

          .         .         .         .         .         .       g.73699
 GTTTCAATCAGTTATGCTGTCACAACTGGGAATTCCACAGCACATGCCCAGCAAAATCTG       c.960
 V  S  I  S  Y  A  V  T  T  G  N  S  T  A  H  A  Q  Q  N  L         p.320

          .         .         .         .         .         .       g.73759
 GACTTCATTGATCTTCAGCCAAACACAACTGTTGTTTTTCCACCTTTTATTCATGAATCT       c.1020
 D  F  I  D  L  Q  P  N  T  T  V  V  F  P  P  F  I  H  E  S         p.340

          .         .         .         .         .         .       g.73819
 CACTTGAAATTTCAAATAGTTGATGACACCATACCGGAGATTGCTGAATCGTTTCACATT       c.1080
 H  L  K  F  Q  I  V  D  D  T  I  P  E  I  A  E  S  F  H  I         p.360

          .         .         .         .         .         .       g.73879
 ATGTTACTAAAAGATACCTTACAGGGAGATGCTGTGCTAATAAGCCCTTCTGTTGTACAA       c.1140
 M  L  L  K  D  T  L  Q  G  D  A  V  L  I  S  P  S  V  V  Q         p.380

          .         .         .         .         .         .       g.73939
 GTCACCATTAAGCCAAATGATAAACCTTATGGAGTCCTTTCATTCAACAGTGTTTTGTTT       c.1200
 V  T  I  K  P  N  D  K  P  Y  G  V  L  S  F  N  S  V  L  F         p.400

          .         .         .         | 08         .         .    g.74784
 GAAAGGACAGTTATAATTGATGAAGATAGAATATCAAG | ATATGAAGAAATCACAGTGGTT    c.1260
 E  R  T  V  I  I  D  E  D  R  I  S  R  |  Y  E  E  I  T  V  V      p.420

          .         .         .         .         .         .       g.74844
 AGAAATGGAGGAACCCATGGGAATGTCTCTGCGAATTGGGTGTTGACACGGAACAGCACT       c.1320
 R  N  G  G  T  H  G  N  V  S  A  N  W  V  L  T  R  N  S  T         p.440

          .         .         .         .         .         .       g.74904
 GATCCCTCACCAGTAACAGCAGATATCAGACCGAGCTCTGGAGTTCTCCATTTTGCACAA       c.1380
 D  P  S  P  V  T  A  D  I  R  P  S  S  G  V  L  H  F  A  Q         p.460

          .         .         .         .         .         .       g.74964
 GGGCAGATGTTGGCAACAATTCCTCTTACTGTGGTTGATGATGATCTTCCAGAAGAGGCA       c.1440
 G  Q  M  L  A  T  I  P  L  T  V  V  D  D  D  L  P  E  E  A         p.480

          .         .         .         .         .         .       g.75024
 GAAGCTTATCTACTTCAAATTCTGCCTCATACAATACGAGGAGGTGCAGAAGTGAGCGAG       c.1500
 E  A  Y  L  L  Q  I  L  P  H  T  I  R  G  G  A  E  V  S  E         p.500

           | 09        .         .         .         .         .    g.75461
 CCAGCGGAG | CTTTTGTTCTACATTCAGGATAGTGATGATGTCTATGGCCTAATAACATTT    c.1560
 P  A  E   | L  L  F  Y  I  Q  D  S  D  D  V  Y  G  L  I  T  F      p.520

          .         .         .         .         .         .       g.75521
 TTTCCTATGGAAAACCAGAAGATTGAAAGCAGCCCAGGTGAACGATACTTATCCTTGAGT       c.1620
 F  P  M  E  N  Q  K  I  E  S  S  P  G  E  R  Y  L  S  L  S         p.540

          .         .         .         .         .         .       g.75581
 TTTACAAGACTAGGAGGGACTAAAGGAGATGTGAGGTTGCTTTATTCTGTACTTTACATT       c.1680
 F  T  R  L  G  G  T  K  G  D  V  R  L  L  Y  S  V  L  Y  I         p.560

          .         .         .         .         .         .       g.75641
 CCTGCTGGAGCTGTGGACCCCTTGCAAGCAAAAGAAGGCATCTTAAATATATCAAGGAGA       c.1740
 P  A  G  A  V  D  P  L  Q  A  K  E  G  I  L  N  I  S  R  R         p.580

          .         .         .         .         .         .       g.75701
 AATGACCTCATTTTTCCAGAGCAAAAAACTCAAGTCACTACAAAATTACCAATAAGAAAT       c.1800
 N  D  L  I  F  P  E  Q  K  T  Q  V  T  T  K  L  P  I  R  N         p.600

          .         .         .          | 10        .         .    g.81335
 GATGCATTCCTTCAAAATGGAGCTCACTTTCTAGTACAG | TTGGAAACTGTGGAGTTGTTA    c.1860
 D  A  F  L  Q  N  G  A  H  F  L  V  Q   | L  E  T  V  E  L  L      p.620

          .         .         .         .         .         .       g.81395
 AACATAATTCCTCTAATCCCACCCATAAGCCCTAGATTTGGGGAAATCTGCAATATTTCT       c.1920
 N  I  I  P  L  I  P  P  I  S  P  R  F  G  E  I  C  N  I  S         p.640

          .         .         .         .         .         .       g.81455
 TTACTGGTTACTCCAGCCATTGCAAATGGAGAAATTGGCTTTCTCAGCAATCTTCCAATT       c.1980
 L  L  V  T  P  A  I  A  N  G  E  I  G  F  L  S  N  L  P  I         p.660

          .         .         .       | 11 .         .         .    g.83949
 ATTTTGCATGAACCAGAAGATTTTGCTGCTGAAGTG | GTATACATTCCCTTACATCGGGAT    c.2040
 I  L  H  E  P  E  D  F  A  A  E  V   | V  Y  I  P  L  H  R  D      p.680

          .         .         .         .         .         .       g.84009
 GGAACTGATGGCCAGGCTACTGTCTACTGGAGTTTGAAGCCCTCTGGCTTTAATTCAAAA       c.2100
 G  T  D  G  Q  A  T  V  Y  W  S  L  K  P  S  G  F  N  S  K         p.700

          .         .         .         .         .         .       g.84069
 GCAGTGACCCCGGATGATATAGGCCCCTTTAATGGCTCTGTTTTGTTTTTATCTGGGCAA       c.2160
 A  V  T  P  D  D  I  G  P  F  N  G  S  V  L  F  L  S  G  Q         p.720

          .         .         .         .         .         .       g.84129
 AGTGACACAACAATCAACATTACTATCAAAGGTGATGACATACCGGAAATGAATGAAACT       c.2220
 S  D  T  T  I  N  I  T  I  K  G  D  D  I  P  E  M  N  E  T         p.740

          .         . | 12       .         .         .         .    g.88876
 GTAACACTTTCTCTAGACAG | GGTTAACGTGGAAAACCAAGTGCTGAAATCTGGATATACT    c.2280
 V  T  L  S  L  D  R  |  V  N  V  E  N  Q  V  L  K  S  G  Y  T      p.760

          .         .         .         .         .         .       g.88936
 AGCCGTGACCTAATTATTTTGGAAAATGATGACCCTGGGGGAGTTTTTGAATTTTCTCCT       c.2340
 S  R  D  L  I  I  L  E  N  D  D  P  G  G  V  F  E  F  S  P         p.780

          .         .        | 13.         .         .         .    g.89089
 GCTTCCAGAGGACCCTATGTTATAAAA | GAAGGAGAATCTGTAGAGCTCCACATCATCCGA    c.2400
 A  S  R  G  P  Y  V  I  K   | E  G  E  S  V  E  L  H  I  I  R      p.800

          .         .         .         .         .         .       g.89149
 TCAAGGGGGTCCCTTGTTAAGCAGTTTCTACACTACCGAGTAGAGCCAAGAGATAGCAAT       c.2460
 S  R  G  S  L  V  K  Q  F  L  H  Y  R  V  E  P  R  D  S  N         p.820

          .         .         .         .         .         .       g.89209
 GAATTCTATGGAAACACGGGAGTACTAGAATTTAAACCTGGAGAAAGGGAGATAGTGATC       c.2520
 E  F  Y  G  N  T  G  V  L  E  F  K  P  G  E  R  E  I  V  I         p.840

          .         .         .    | 14    .         .         .    g.90030
 ACCTTGCTAGCAAGATTGGATGGGATACCAGAG | TTGGATGAACACTACTGGGTGGTCCTC    c.2580
 T  L  L  A  R  L  D  G  I  P  E   | L  D  E  H  Y  W  V  V  L      p.860

          .         .         .         .         .         .       g.90090
 AGCAGCCACGGAGAACGGGAAAGCAAGTTGGGAAGTGCCACCATTGTCAATATAACGATT       c.2640
 S  S  H  G  E  R  E  S  K  L  G  S  A  T  I  V  N  I  T  I         p.880

          .         .         .         .         .         .       g.90150
 CTGAAAAATGATGATCCTCATGGCATTATAGAATTTGTTTCTGATGGTCTAATTGTGATG       c.2700
 L  K  N  D  D  P  H  G  I  I  E  F  V  S  D  G  L  I  V  M         p.900

          .         .         .     | 15   .         .         .    g.90932
 ATAAATGAAAGCAAAGGAGATGCTATCTATAGTG | CTGTTTATGATGTAGTAAGAAATCGA    c.2760
 I  N  E  S  K  G  D  A  I  Y  S  A |   V  Y  D  V  V  R  N  R      p.920

          .         .         .         .         .         .       g.90992
 GGCAACTTTGGTGATGTTAGTGTATCATGGGTGGTTAGTCCAGACTTTACACAAGATGTA       c.2820
 G  N  F  G  D  V  S  V  S  W  V  V  S  P  D  F  T  Q  D  V         p.940

          .         .         .         .         .         .       g.91052
 TTTCCTGTACAAGGGACTGTTGTCTTTGGAGATCAGGAATTTTCAAAAAATATCACCATT       c.2880
 F  P  V  Q  G  T  V  V  F  G  D  Q  E  F  S  K  N  I  T  I         p.960

          .         | 16         .         .         .         .    g.92210
 TACTCCCTTCCAGATGAG | ATTCCAGAAGAAATGGAAGAATTTACCGTTATCCTACTGAAT    c.2940
 Y  S  L  P  D  E   | I  P  E  E  M  E  E  F  T  V  I  L  L  N      p.980

          .         .         .         .         .         .       g.92270
 GGCACTGGAGGAGCTAAAGTGGGAAATAGAACAACTGCAACTCTGAGGATTAGAAGAAAT       c.3000
 G  T  G  G  A  K  V  G  N  R  T  T  A  T  L  R  I  R  R  N         p.1000

          .         .   | 17     .         .         .         .    g.93736
 GATGACCCCATTTATTTTGCAG | AACCTCGTGTAGTGAGGGTTCAGGAAGGTGAGACTGCC    c.3060
 D  D  P  I  Y  F  A  E |   P  R  V  V  R  V  Q  E  G  E  T  A      p.1020

          .         .         .         .         .         .       g.93796
 AACTTTACAGTTCTCAGAAATGGATCTGTTGATGTGACTTGCATGGTCCAGTATGCTACC       c.3120
 N  F  T  V  L  R  N  G  S  V  D  V  T  C  M  V  Q  Y  A  T         p.1040

          .         .         .         .         .         .       g.93856
 AAGGATGGGAAGGCTACTGCAAGAGAGAGAGATTTCATTCCTGTTGAAAAAGGAGAAACG       c.3180
 K  D  G  K  A  T  A  R  E  R  D  F  I  P  V  E  K  G  E  T         p.1060

          .         .         .         .         .         .       g.93916
 CTCATTTTTGAGGTTGGAAGTAGACAGCAGAGCATATCCATATTTGTTAATGAAGATGGT       c.3240
 L  I  F  E  V  G  S  R  Q  Q  S  I  S  I  F  V  N  E  D  G         p.1080

          .         .         .         .          | 18        .    g.97815
 ATCCCGGAAACAGATGAGCCCTTTTATATAATCCTCTTGAATTCAACAG | GTGATACAGTA    c.3300
 I  P  E  T  D  E  P  F  Y  I  I  L  L  N  S  T  G |   D  T  V      p.1100

          .         .         .         .         .         .       g.97875
 GTATATCAATATGGAGTAGCTACAGTAATAATTGAAGCTAATGATGACCCAAATGGCATT       c.3360
 V  Y  Q  Y  G  V  A  T  V  I  I  E  A  N  D  D  P  N  G  I         p.1120

          .         .         .         .         .       | 19 .    g.98550
 TTTTCTCTGGAGCCCATAGACAAAGCAGTGGAAGAAGGAAAGACTAATGCATTTTG | GATT    c.3420
 F  S  L  E  P  I  D  K  A  V  E  E  G  K  T  N  A  F  W  |  I      p.1140

          .         .         .         .         .         .       g.98610
 TTGAGGCACCGAGGATACTTTGGTAGTGTTTCTGTATCTTGGCAGCTCTTTCAGAATGAT       c.3480
 L  R  H  R  G  Y  F  G  S  V  S  V  S  W  Q  L  F  Q  N  D         p.1160

          .         .         .         .         .         .       g.98670
 TCTGCTTTGCAGCCTGGGCAGGAGTTCTATGAAACTTCAGGAACTGTTAACTTCATGGAT       c.3540
 S  A  L  Q  P  G  Q  E  F  Y  E  T  S  G  T  V  N  F  M  D         p.1180

          .         .         .         .         .         .       g.98730
 GGAGAAGAAGCAAAACCAATCATTCTCCATGCTTTTCCAGATAAAATTCCTGAATTCAAT       c.3600
 G  E  E  A  K  P  I  I  L  H  A  F  P  D  K  I  P  E  F  N         p.1200

          .         .         .     | 20   .         .         .    g.99435
 GAATTTTATTTCCTAAAACTTGTAAACATTTCAG | GTGGATCCCCAGGTCCTGGGGGCCAG    c.3660
 E  F  Y  F  L  K  L  V  N  I  S  G |   G  S  P  G  P  G  G  Q      p.1220

          .         .         .         .         .         .       g.99495
 CTAGCAGAAACCAACCTCCAGGTGACAGTAATGGTTCCATTCAATGATGATCCCTTTGGA       c.3720
 L  A  E  T  N  L  Q  V  T  V  M  V  P  F  N  D  D  P  F  G         p.1240

          .         .         .         .         .         .       g.99555
 GTTTTTATCTTGGATCCAGAGTGTTTAGAGAGAGAAGTGGCAGAAGATGTCCTGTCTGAA       c.3780
 V  F  I  L  D  P  E  C  L  E  R  E  V  A  E  D  V  L  S  E         p.1260

          .         .         .         .         .         .       g.99615
 GATGATATGTCTTATATTACCAACTTCACCATTTTGAGGCAGCAGGGTGTGTTTGGTGAT       c.3840
 D  D  M  S  Y  I  T  N  F  T  I  L  R  Q  Q  G  V  F  G  D         p.1280

          .         .         .         .         .         .       g.99675
 GTACAACTGGGCTGGGAAATACTGTCCAGTGAGTTCCCTGCTGGTTTGCCACCAATGATA       c.3900
 V  Q  L  G  W  E  I  L  S  S  E  F  P  A  G  L  P  P  M  I         p.1300

          .         .         .         .         .         .       g.99735
 GATTTTTTACTGGTTGGAATTTTCCCCACCACCGTGCATTTACAACAGCACATGCGGCGT       c.3960
 D  F  L  L  V  G  I  F  P  T  T  V  H  L  Q  Q  H  M  R  R         p.1320

          .         .         .         .         .         .       g.99795
 CACCACAGTGGAACGGATGCTTTGTACTTTACCGGACTAGAGGGTGCATTTGGGACTGTT       c.4020
 H  H  S  G  T  D  A  L  Y  F  T  G  L  E  G  A  F  G  T  V         p.1340

          .         .         .         .         .         .       g.99855
 AATCCAAAATACCATCCCTCCAGGAATAATACAATTGCCAACTTTACATTCTCAGCTTGG       c.4080
 N  P  K  Y  H  P  S  R  N  N  T  I  A  N  F  T  F  S  A  W         p.1360

          .         .         .         .         .         .       g.99915
 GTAATGCCCAATGCCAATACGAATGGATTCATTATAGCGAAGGATGACGGTAATGGAAGC       c.4140
 V  M  P  N  A  N  T  N  G  F  I  I  A  K  D  D  G  N  G  S         p.1380

          .         .         .         .         .         .       g.99975
 ATCTACTACGGGGTAAAAATACAAACAAACGAATCCCATGTGACACTTTCCCTTCATTAT       c.4200
 I  Y  Y  G  V  K  I  Q  T  N  E  S  H  V  T  L  S  L  H  Y         p.1400

          .         .         .         .         .         .       g.100035
 AAAACCTTGGGTTCCAATGCTACATACATTGCCAAGACAACAGTCATGAAATATTTAGAA       c.4260
 K  T  L  G  S  N  A  T  Y  I  A  K  T  T  V  M  K  Y  L  E         p.1420

          .         .         .         .         .         .       g.100095
 GAAAGTGTTTGGCTTCATCTACTAATTATCCTGGAGGATGGTATAATCGAATTCTACCTG       c.4320
 E  S  V  W  L  H  L  L  I  I  L  E  D  G  I  I  E  F  Y  L         p.1440

          .         .         .         .         .         | 21    g.104107
 GATGGAAATGCAATGCCCAGGGGAATCAAGAGTCTGAAAGGAGAAGCCATTACTGACG | GT    c.4380
 D  G  N  A  M  P  R  G  I  K  S  L  K  G  E  A  I  T  D  G |       p.1460

          .         .         .         .         .         .       g.104167
 CCTGGGATACTGAGAATTGGAGCAGGGATAAATGGCAATGACAGATTTACAGGTCTGATG       c.4440
 P  G  I  L  R  I  G  A  G  I  N  G  N  D  R  F  T  G  L  M         p.1480

          .         .         .         .         .         .       g.104227
 CAGGATGTGAGGTCCTATGAGCGGAAACTGACGCTTGAAGAAATTTATGAACTTCATGCC       c.4500
 Q  D  V  R  S  Y  E  R  K  L  T  L  E  E  I  Y  E  L  H  A         p.1500

          .         .         .         .         .         .       g.104287
 ATGCCCGCAAAAAGTGATTTACACCCAATTTCTGGATATCTGGAGTTCAGACAGGGAGAA       c.4560
 M  P  A  K  S  D  L  H  P  I  S  G  Y  L  E  F  R  Q  G  E         p.1520

          .         .         .         .         .         .       g.104347
 ACTAACAAATCATTCATTATTTCTGCAAGAGATGACAATGACGAGGAAGGAGAAGAATTA       c.4620
 T  N  K  S  F  I  I  S  A  R  D  D  N  D  E  E  G  E  E  L         p.1540

          .         .         .         .         .         .       g.104407
 TTCATTCTTAAACTAGTTTCTGTATATGGAGGAGCTCGTATTTCGGAAGAAAATACTACT       c.4680
 F  I  L  K  L  V  S  V  Y  G  G  A  R  I  S  E  E  N  T  T         p.1560

          .         .         .         .         .         .       g.104467
 GCAAGATTAACAATACAAAAAAGTGACAATGCAAATGGCTTGTTTGGTTTCACAGGAGCT       c.4740
 A  R  L  T  I  Q  K  S  D  N  A  N  G  L  F  G  F  T  G  A         p.1580

          .   | 22     .         .         .         .         .    g.118794
 TGTATACCAGAG | ATTGCAGAGGAGGGATCAACCATTTCTTGTGTGGTTGAGAGAACCAGA    c.4800
 C  I  P  E   | I  A  E  E  G  S  T  I  S  C  V  V  E  R  T  R      p.1600

          .         .         .         .         .         .       g.118854
 GGAGCTCTGGATTATGTGCATGTTTTTTACACCATTTCACAGATTGAAACTGATGGCATT       c.4860
 G  A  L  D  Y  V  H  V  F  Y  T  I  S  Q  I  E  T  D  G  I         p.1620

          .         .         .         .         .         .       g.118914
 AATTACCTTGTTGATGACTTTGCTAATGCCAGTGGAACTATTACATTCCTTCCTTGGCAG       c.4920
 N  Y  L  V  D  D  F  A  N  A  S  G  T  I  T  F  L  P  W  Q         p.1640

           | 23        .         .         .         .         .    g.120305
 AGATCAGAG | GTTCTGAATATATATGTTCTTGATGATGATATTCCTGAACTTAATGAGTAT    c.4980
 R  S  E   | V  L  N  I  Y  V  L  D  D  D  I  P  E  L  N  E  Y      p.1660

          .         .         .         .         .         .       g.120365
 TTCCGTGTGACATTGGTTTCTGCAATTCCTGGAGATGGGAAGCTAGGCTCAACTCCTACC       c.5040
 F  R  V  T  L  V  S  A  I  P  G  D  G  K  L  G  S  T  P  T         p.1680

          .         .         .         .         .         .       g.120425
 AGTGGTGCAAGCATAGATCCTGAAAAGGAAACGACTGATATCACCATCAAAGCTAGTGAT       c.5100
 S  G  A  S  I  D  P  E  K  E  T  T  D  I  T  I  K  A  S  D         p.1700

          . | 24       .         .         .         .         .    g.121493
 CATCCATATG | GCTTGCTGCAGTTCTCCACAGGGCTGCCTCCTCAGCCTAAGGACGCAATG    c.5160
 H  P  Y  G |   L  L  Q  F  S  T  G  L  P  P  Q  P  K  D  A  M      p.1720

          .         .         .         .         .         .       g.121553
 ACCCTGCCTGCAAGCAGCGTTCCACATATCACTGTGGAGGAGGAAGATGGAGAAATCAGG       c.5220
 T  L  P  A  S  S  V  P  H  I  T  V  E  E  E  D  G  E  I  R         p.1740

          .         .         .         .         .         .       g.121613
 TTATTGGTCATCCGTGCACAGGGACTTCTGGGAAGGGTGACTGCGGAATTTAGAACAGTG       c.5280
 L  L  V  I  R  A  Q  G  L  L  G  R  V  T  A  E  F  R  T  V         p.1760

          .         .         .    | 25    .         .         .    g.122307
 TCCTTGACAGCATTCAGTCCTGAGGATTACCAG | AATGTTGCTGGCACATTAGAATTTCAA    c.5340
 S  L  T  A  F  S  P  E  D  Y  Q   | N  V  A  G  T  L  E  F  Q      p.1780

          .         .         .         .         .         .       g.122367
 CCAGGAGAAAGATATAAATACATTTTCATAAACATCACTGATAATTCTATTCCTGAACTG       c.5400
 P  G  E  R  Y  K  Y  I  F  I  N  I  T  D  N  S  I  P  E  L         p.1800

          .         .         .         .    | 26    .         .    g.125766
 GAAAAATCTTTTAAAGTTGAGTTGTTAAACTTGGAAGGAGGAG | TAGCTGAACTCTTTAGG    c.5460
 E  K  S  F  K  V  E  L  L  N  L  E  G  G  V |   A  E  L  F  R      p.1820

          .         .         .         .         .         .       g.125826
 GTTGATGGAAGTGGTAGTGGTGATGGGGACATGGAATTCTTCCTTCCAACTATTCACAAA       c.5520
 V  D  G  S  G  S  G  D  G  D  M  E  F  F  L  P  T  I  H  K         p.1840

      | 27   .         .         .         .         .         .    g.127571
 CGTG | CCAGTCTAGGAGTGGCTTCCCAAATTCTAGTGACAATTGCAGCCTCTGACCACGCT    c.5580
 R  A |   S  L  G  V  A  S  Q  I  L  V  T  I  A  A  S  D  H  A      p.1860

          .         .         .         .         .         .       g.127631
 CATGGCGTATTTGAATTTAGCCCTGAGTCACTCTTTGTCAGTGGAACTGAACCAGAAGAT       c.5640
 H  G  V  F  E  F  S  P  E  S  L  F  V  S  G  T  E  P  E  D         p.1880

          .         .     | 28   .         .         .         .    g.129822
 GGGTATAGCACTGTTACATTAAAT | GTTATAAGACATCATGGAACTCTGTCTCCAGTGACT    c.5700
 G  Y  S  T  V  T  L  N   | V  I  R  H  H  G  T  L  S  P  V  T      p.1900

          .         .         .         .         .         .       g.129882
 TTGCATTGGAACATAGACTCTGATCCTGATGGTGATCTCGCCTTCACCTCTGGCAACATC       c.5760
 L  H  W  N  I  D  S  D  P  D  G  D  L  A  F  T  S  G  N  I         p.1920

          .         .         .         .         .         .       g.129942
 ACATTTGAGATTGGGCAGACGAGCGCCAATATCACTGTGGAGATATTGCCTGACGAAGAC       c.5820
 T  F  E  I  G  Q  T  S  A  N  I  T  V  E  I  L  P  D  E  D         p.1940

          .         .         .         .         .         .       g.130002
 CCAGAACTGGATAAGGCATTCTCTGTGTCAGTCCTCAGTGTTTCCAGTGGTTCTTTGGGA       c.5880
 P  E  L  D  K  A  F  S  V  S  V  L  S  V  S  S  G  S  L  G         p.1960

          .         .         .         .         .         .       g.130062
 GCTCATATTAATGCCACGTTAACAGTTTTGGCTAGTGATGATCCATATGGGATATTCATT       c.5940
 A  H  I  N  A  T  L  T  V  L  A  S  D  D  P  Y  G  I  F  I         p.1980

          .         .         .         .         .         .       g.130122
 TTTTCTGAGAAAAACAGACCTGTTAAAGTTGAGGAAGCAACCCAGAACATCACACTATCA       c.6000
 F  S  E  K  N  R  P  V  K  V  E  E  A  T  Q  N  I  T  L  S         p.2000

          .         .         .         .         .         .       g.130182
 ATAATAAGGTTGAAAGGCCTCATGGGAAAAGTCCTTGTCTCATATGCAACACTAGATGAT       c.6060
 I  I  R  L  K  G  L  M  G  K  V  L  V  S  Y  A  T  L  D  D         p.2020

          .         .         .         .         .         .       g.130242
 ATGGAAAAACCACCTTATTTTCCACCTAATTTAGCGAGAGCAACTCAAGGAAGAGACTAT       c.6120
 M  E  K  P  P  Y  F  P  P  N  L  A  R  A  T  Q  G  R  D  Y         p.2040

          .         .         .         .         .         .       g.130302
 ATACCAGCTTCTGGATTTGCTCTTTTTGGAGCTAATCAGAGTGAGGCAACAATAGCTATT       c.6180
 I  P  A  S  G  F  A  L  F  G  A  N  Q  S  E  A  T  I  A  I         p.2060

          .         .         .         .         .         .       g.130362
 TCAATTTTGGATGATGATGAGCCAGAAAGGTCCGAATCTGTCTTTATCGAACTACTCAAC       c.6240
 S  I  L  D  D  D  E  P  E  R  S  E  S  V  F  I  E  L  L  N         p.2080

          .         .         .     | 29   .         .         .    g.132006
 TCTACTTTAGTAGCGAAAGTACAGAGTCGTTCAA | TTCCAAATTCTCCACGTCTTGGGCCT    c.6300
 S  T  L  V  A  K  V  Q  S  R  S  I |   P  N  S  P  R  L  G  P      p.2100

          .         .         .         .         .         .       g.132066
 AAGGTAGAAACTATTGCGCAACTAATTATCATTGCCAATGATGATGCATTTGGAACTCTT       c.6360
 K  V  E  T  I  A  Q  L  I  I  I  A  N  D  D  A  F  G  T  L         p.2120

          .         .         .         .         .         .       g.132126
 CAGCTCTCAGCACCAATTGTCCGAGTGGCAGAAAATCATGTTGGACCCATTATCAATGTG       c.6420
 Q  L  S  A  P  I  V  R  V  A  E  N  H  V  G  P  I  I  N  V         p.2140

          .         .         .         .         .         .       g.132186
 ACTAGAACAGGAGGAGCATTTGCAGATGTCTCTGTGAAGTTTAAAGCTGTGCCAATAACT       c.6480
 T  R  T  G  G  A  F  A  D  V  S  V  K  F  K  A  V  P  I  T         p.2160

          . | 30       .         .         .         .         .    g.136111
 GCAATAGCTG | GTGAAGATTATAGTATAGCTTCATCAGATGTGGTCTTGCTAGAAGGGGAA    c.6540
 A  I  A  G |   E  D  Y  S  I  A  S  S  D  V  V  L  L  E  G  E      p.2180

          .         .         .         .         .         .       g.136171
 ACCAGTAAAGCCGTGCCAATATATGTCATTAATGATATCTATCCTGAACTGGAAGAATCT       c.6600
 T  S  K  A  V  P  I  Y  V  I  N  D  I  Y  P  E  L  E  E  S         p.2200

          .         .         .         .         .         .       g.136231
 TTTCTTGTGCAACTGATGAATGAAACAACAGGAGGAGCCAGACTAGGGGCTTTAACAGAG       c.6660
 F  L  V  Q  L  M  N  E  T  T  G  G  A  R  L  G  A  L  T  E         p.2220

          .         .         .         .       | 31 .         .    g.137011
 GCAGTCATTATTATTGAGGCCTCTGATGACCCCTATGGATTATTTG | GTTTTCAGATTACT    c.6720
 A  V  I  I  I  E  A  S  D  D  P  Y  G  L  F  G |   F  Q  I  T      p.2240

          .         .         .         .         .         .       g.137071
 AAACTTATTGTAGAGGAACCTGAGTTTAACTCAGTGAAGGTAAACCTGCCAATAATTCGA       c.6780
 K  L  I  V  E  E  P  E  F  N  S  V  K  V  N  L  P  I  I  R         p.2260

          .         .         .         .         .         .       g.137131
 AATTCTGGGACACTCGGCAATGTTACTGTTCAGTGGGTTGCCACCATTAATGGACAGCTT       c.6840
 N  S  G  T  L  G  N  V  T  V  Q  W  V  A  T  I  N  G  Q  L         p.2280

          .         .         .         .         .         .       g.137191
 GCTACTGGCGACCTGCGAGTTGTCTCAGGTAATGTGACCTTTGCCCCTGGGGAAACCATT       c.6900
 A  T  G  D  L  R  V  V  S  G  N  V  T  F  A  P  G  E  T  I         p.2300

          .         .         .         .         .  | 32      .    g.138814
 CAAACCTTGTTGTTAGAGGTCCTGGCTGACGACGTTCCGGAGATTGAAGAG | GTTATCCAA    c.6960
 Q  T  L  L  L  E  V  L  A  D  D  V  P  E  I  E  E   | V  I  Q      p.2320

          .         .         .         .         .         .       g.138874
 GTGCAACTAACTGATGCCTCTGGTGGAGGTACTATTGGGTTAGATCGAATTGCAAATATT       c.7020
 V  Q  L  T  D  A  S  G  G  G  T  I  G  L  D  R  I  A  N  I         p.2340

          .         .         .         .         .         .       g.138934
 ATTATTCCTGCCAATGATGATCCTTATGGTACAGTAGCCTTTGCTCAGATGGTTTATCGT       c.7080
 I  I  P  A  N  D  D  P  Y  G  T  V  A  F  A  Q  M  V  Y  R         p.2360

          .         .         .         .         .    | 33    .    g.140097
 GTTCAAGAGCCTCTGGAAAGAAGTTCCTGTGCTAATATAACTGTCAGGCGAAG | CGGAGGG    c.7140
 V  Q  E  P  L  E  R  S  S  C  A  N  I  T  V  R  R  S  |  G  G      p.2380

          .         .         .         .         .         .       g.140157
 CACTTTGGTCGGCTGTTGTTGTTCTACAGTACTTCCGACATTGATGTAGTGGCTCTGGCA       c.7200
 H  F  G  R  L  L  L  F  Y  S  T  S  D  I  D  V  V  A  L  A         p.2400

          .         .         .         .         .         .       g.140217
 ATGGAGGAAGGTCAAGATTTACTGTCCTACTATGAATCTCCAATTCAAGGGGTGCCTGAC       c.7260
 M  E  E  G  Q  D  L  L  S  Y  Y  E  S  P  I  Q  G  V  P  D         p.2420

          .         .         .         .         .         .       g.140277
 CCACTTTGGAGAACTTGGATGAATGTCTCTGCCGTGGGGGAGCCCCTGTATACCTGTGCC       c.7320
 P  L  W  R  T  W  M  N  V  S  A  V  G  E  P  L  Y  T  C  A         p.2440

          .         .         .         .         .         .       g.140337
 ACTTTGTGCCTTAAGGAACAAGCTTGCTCAGCGTTTTCATTTTTCAGTGCTTCTGAGGGT       c.7380
 T  L  C  L  K  E  Q  A  C  S  A  F  S  F  F  S  A  S  E  G         p.2460

          .         .         .         .         .         .       g.140397
 CCCCAGTGTTTCTGGATGACATCATGGATCAGCCCAGCTGTCAACAATTCAGACTTCTGG       c.7440
 P  Q  C  F  W  M  T  S  W  I  S  P  A  V  N  N  S  D  F  W         p.2480

          .         .         .         .         .         .       g.140457
 ACCTACAGGAAAAACATGACCAGGGTAGCATCTCTTTTTAGTGGTCAGGCTGTGGCTGGG       c.7500
 T  Y  R  K  N  M  T  R  V  A  S  L  F  S  G  Q  A  V  A  G         p.2500

          .         .         .         .         .         .       g.140517
 AGTGACTATGAGCCTGTGACAAGGCAATGGGCCATAATGCAGGAAGGTGATGAATTCGCA       c.7560
 S  D  Y  E  P  V  T  R  Q  W  A  I  M  Q  E  G  D  E  F  A         p.2520

          .         .         .         .         .         .       g.140577
 AATCTCACAGTGTCTATTCTTCCTGATGATTTCCCAGAGATGGATGAGAGTTTTCTAATT       c.7620
 N  L  T  V  S  I  L  P  D  D  F  P  E  M  D  E  S  F  L  I         p.2540

          .         .         .         .         .         .       g.140637
 TCTCTCCTTGAAGTTCACCTCATGAACATTTCAGCCAGTTTGAAAAATCAGCCAACCATA       c.7680
 S  L  L  E  V  H  L  M  N  I  S  A  S  L  K  N  Q  P  T  I         p.2560

          .         .         .         .         .         .       g.140697
 GGACAGCCAAATATTTCTACAGTTGTCATAGCACTAAATGGTGATGCCTTTGGAGTGTTT       c.7740
 G  Q  P  N  I  S  T  V  V  I  A  L  N  G  D  A  F  G  V  F         p.2580

          .         .         .         .         .         .       g.140757
 GTGATCTACAATATTAGTCCCAATACTTCCGAAGATGGCTTATTTGTTGAAGTTCAGGAG       c.7800
 V  I  Y  N  I  S  P  N  T  S  E  D  G  L  F  V  E  V  Q  E         p.2600

          .         .         .         .         .         .       g.140817
 CAGCCCCAAACCTTGGTGGAGCTGATGATACACAGGACAGGGGGCAGCTTAGGTCAAGTG       c.7860
 Q  P  Q  T  L  V  E  L  M  I  H  R  T  G  G  S  L  G  Q  V         p.2620

          .         .         .         .         .         .       g.140877
 GCAGTCGAATGGCGTGTTGTTGGTGGAACAGCTACTGAAGGTTTAGATTTTATAGGTGCT       c.7920
 A  V  E  W  R  V  V  G  G  T  A  T  E  G  L  D  F  I  G  A         p.2640

          .         .      | 34  .         .         .         .    g.143172
 GGAGAGATTCTGACCTTTGCTGAAG | GTGAAACCAAAAAGACAGTCATTTTAACCATCTTG    c.7980
 G  E  I  L  T  F  A  E  G |   E  T  K  K  T  V  I  L  T  I  L      p.2660

          .         .         .         .         .         .       g.143232
 GATGACTCTGAACCAGAGGATGACGAAAGTATCATAGTTAGTTTGGTGTACACTGAAGGT       c.8040
 D  D  S  E  P  E  D  D  E  S  I  I  V  S  L  V  Y  T  E  G         p.2680

          .         .         .         .         .         .       g.143292
 GGAAGTAGAATTTTGCCAAGCTCCGACACTGTTAGAGTGAACATTTTGGCCAATGACAAT       c.8100
 G  S  R  I  L  P  S  S  D  T  V  R  V  N  I  L  A  N  D  N         p.2700

          .         .         .         .         .      | 35  .    g.149870
 GTGGCAGGAATTGTTAGCTTTCAGACAGCTTCCAGATCTGTCATAGGTCATGAAG | GAGAA    c.8160
 V  A  G  I  V  S  F  Q  T  A  S  R  S  V  I  G  H  E  G |   E      p.2720

          .         .         .         .         .         .       g.149930
 ATTTTACAATTCCATGTGATAAGAACTTTCCCTGGTCGAGGAAATGTTACTGTTAACTGG       c.8220
 I  L  Q  F  H  V  I  R  T  F  P  G  R  G  N  V  T  V  N  W         p.2740

          .         .         .         .         .         .       g.149990
 AAAATTATTGGGCAAAATCTAGAACTCAATTTTGCTAACTTTAGCGGACAACTTTTCTTT       c.8280
 K  I  I  G  Q  N  L  E  L  N  F  A  N  F  S  G  Q  L  F  F         p.2760

        | 36 .         .         .         .         .         .    g.150643
 CCTGAG | GGGTCGTTGAATACAACATTGTTTGTGCATTTGTTGGATGACAACATTCCTGAG    c.8340
 P  E   | G  S  L  N  T  T  L  F  V  H  L  L  D  D  N  I  P  E      p.2780

          .         .         .         .       | 37 .         .    g.151614
 GAGAAAGAAGTATACCAAGTCATTCTGTATGATGTCAGGACACAAG | GAGTTCCACCAGCC    c.8400
 E  K  E  V  Y  Q  V  I  L  Y  D  V  R  T  Q  G |   V  P  P  A      p.2800

          .         .         .         .         .         .       g.151674
 GGAATCGCCCTGCTTGATGCTCAAGGATATGCAGCTGTCCTCACAGTAGAAGCCAGTGAT       c.8460
 G  I  A  L  L  D  A  Q  G  Y  A  A  V  L  T  V  E  A  S  D         p.2820

          .         .         .         .         .         .       g.151734
 GAACCACATGGAGTTTTAAATTTTGCTCTTTCATCAAGATTTGTGTTACTACAAGAGGCT       c.8520
 E  P  H  G  V  L  N  F  A  L  S  S  R  F  V  L  L  Q  E  A         p.2840

          .         .         .         .       | 38 .         .    g.152445
 AACATAACAATTCAGCTTTTCATCAACAGAGAATTTGGATCTCTAG | GAGCTATCAATGTC    c.8580
 N  I  T  I  Q  L  F  I  N  R  E  F  G  S  L  G |   A  I  N  V      p.2860

          .         .         .         .         .         .       g.152505
 ACATATACCACGGTTCCTGGAATGCTGAGTCTGAAGAACCAAACAGTAGGAAACCTAGCA       c.8640
 T  Y  T  T  V  P  G  M  L  S  L  K  N  Q  T  V  G  N  L  A         p.2880

          .         .         .         .         .         .       g.152565
 GAGCCAGAAGTTGATTTTGTCCCTATCATTGGCTTTCTGATTTTAGAAGAAGGGGAAACA       c.8700
 E  P  E  V  D  F  V  P  I  I  G  F  L  I  L  E  E  G  E  T         p.2900

          .         .         . | 39       .         .         .    g.155046
 GCAGCAGCCATCAACATTACCATTCTTGAG | GATGATGTACCAGAGCTAGAAGAATATTTC    c.8760
 A  A  A  I  N  I  T  I  L  E   | D  D  V  P  E  L  E  E  Y  F      p.2920

          .         .         .         .         .         .       g.155106
 CTGGTGAATTTAACTTACGTTGGACTTACCATGGCTGCTTCAACTTCATTTCCTCCCAGA       c.8820
 L  V  N  L  T  Y  V  G  L  T  M  A  A  S  T  S  F  P  P  R         p.2940

      | 40   .         .         .         .         .         .    g.157237
 CTAG | ATTCAGAAGGTTTGACTGCACAAGTTATTATTGATGCCAATGATGGGGCCCGAGGT    c.8880
 L  D |   S  E  G  L  T  A  Q  V  I  I  D  A  N  D  G  A  R  G      p.2960

          .         .    | 41    .         .         .         .    g.157421
 GTAATTGAATGGCAACAAAGCAG | GTTTGAAGTAAATGAAACCCATGGAAGTTTAACATTG    c.8940
 V  I  E  W  Q  Q  S  R  |  F  E  V  N  E  T  H  G  S  L  T  L      p.2980

          .         .         .         .         .         .       g.157481
 GTAGCCCAGAGGAGCAGAGAACCTCTTGGCCATGTTTCCTTATTTGTGTATGCTCAGAAT       c.9000
 V  A  Q  R  S  R  E  P  L  G  H  V  S  L  F  V  Y  A  Q  N         p.3000

          .         .         .         .   | 42     .         .    g.158505
 TTGGAAGCACAAGTGGGGCTGGATTATATCTTCACCCCAATG | ATTCTTCATTTTGCTGAT    c.9060
 L  E  A  Q  V  G  L  D  Y  I  F  T  P  M   | I  L  H  F  A  D      p.3020

          .         .         .         .         .         .       g.158565
 GGAGAAAGGTATAAAAATGTCAATATCATGATTCTTGATGATGACATTCCAGAAGGAGAT       c.9120
 G  E  R  Y  K  N  V  N  I  M  I  L  D  D  D  I  P  E  G  D         p.3040

          .         .         .         .         .         .       g.158625
 GAAAAATTTCAGCTGATTTTAACAAATCCTTCTCCTGGACTAGAGCTAGGGAAAAATACA       c.9180
 E  K  F  Q  L  I  L  T  N  P  S  P  G  L  E  L  G  K  N  T         p.3060

      | 43   .         .         .         .         .         .    g.162723
 ATAG | CCTTAATTATTGTCCTTGCTAATGATGACGGCCCTGGAGTTCTATCATTTAACAAC    c.9240
 I  A |   L  I  I  V  L  A  N  D  D  G  P  G  V  L  S  F  N  N      p.3080

          .         .         .         .         .         .       g.162783
 AGTGAGCACTTTTTCCTAAGAGAGCCAACAGCTCTCTACGTCCAGGAGAGTGTTGCAGTA       c.9300
 S  E  H  F  F  L  R  E  P  T  A  L  Y  V  Q  E  S  V  A  V         p.3100

          .         .         .         .         .         .       g.162843
 TTGTACATTGTTCGGGAACCTGCACAAGGATTGTTTGGAACAGTGACAGTTCAGTTCATT       c.9360
 L  Y  I  V  R  E  P  A  Q  G  L  F  G  T  V  T  V  Q  F  I         p.3120

          .         .         .         .         .         .       g.162903
 GTGACAGAAGTGAATTCCTCAAATGAATCTAAAGATCTGACTCCTTCCAAAGGCTATATT       c.9420
 V  T  E  V  N  S  S  N  E  S  K  D  L  T  P  S  K  G  Y  I         p.3140

          .         .        | 44.         .         .         .    g.166281
 GTTTTAGAAGAAGGTGTTCGATTCAAG | GCCCTACAAATATCTGCCATATTAGACACGGAA    c.9480
 V  L  E  E  G  V  R  F  K   | A  L  Q  I  S  A  I  L  D  T  E      p.3160

          .         .         .         .         .         .       g.166341
 CCAGAAATGGATGAGTATTTTGTTTGCACCTTGTTTAATCCAACTGGAGGTGCTAGACTA       c.9540
 P  E  M  D  E  Y  F  V  C  T  L  F  N  P  T  G  G  A  R  L         p.3180

          .         .         .         .         .         .       g.166401
 GGGGTGCATGTTCAAACCCTGATAACAGTTTTGCAAAACCAGGCCCCTTTGGGGCTATTC       c.9600
 G  V  H  V  Q  T  L  I  T  V  L  Q  N  Q  A  P  L  G  L  F         p.3200

          .         .    | 45    .         .         .         .    g.167172
 AGTATCTCTGCAGTTGAAAATAG | AGCCACCTCCATAGACATCGAAGAAGCCAATAGGACC    c.9660
 S  I  S  A  V  E  N  R  |  A  T  S  I  D  I  E  E  A  N  R  T      p.3220

          .         .         .         .         .         .       g.167232
 GTGTATTTAAATGTATCTCGAACTAATGGCATTGATTTGGCTGTGAGTGTGCAGTGGGAG       c.9720
 V  Y  L  N  V  S  R  T  N  G  I  D  L  A  V  S  V  Q  W  E         p.3240

          .         .         | 46         .         .         .    g.171064
 ACAGTATCTGAAACAGCCTTTGGCATGA | GGGGAATGGATGTTGTGTTTTCCGTATTTCAA    c.9780
 T  V  S  E  T  A  F  G  M  R |   G  M  D  V  V  F  S  V  F  Q      p.3260

          .         .         .         .         .         .       g.171124
 AGTTTTTTGGATGAATCAGCTTCTGGCTGGTGTTTCTTTACTTTGGAAAATTTAATATAT       c.9840
 S  F  L  D  E  S  A  S  G  W  C  F  F  T  L  E  N  L  I  Y         p.3280

          .         .         .         .         .         .       g.171184
 GGTATAATGTTAAGAAAATCATCTGTTACTGTTTACCGATGGCAGGGGATTTTTATTCCA       c.9900
 G  I  M  L  R  K  S  S  V  T  V  Y  R  W  Q  G  I  F  I  P         p.3300

        | 47 .         .         .         .         .         .    g.171340
 GTTGAG | GATTTAAATATAGAAAATCCTAAAACTTGTGAGGCCTTTAATATTGGTTTTTCT    c.9960
 V  E   | D  L  N  I  E  N  P  K  T  C  E  A  F  N  I  G  F  S      p.3320

          .         .         .         .         .         .       g.171400
 CCCTACTTTGTGATTACTCATGAAGAAAGAAATGAAGAAAAGCCTTCTCTTAACAGTGTG       c.10020
 P  Y  F  V  I  T  H  E  E  R  N  E  E  K  P  S  L  N  S  V         p.3340

          .         .         .    | 48    .         .         .    g.171776
 TTTACATTCACATCTGGATTTAAATTATTCCTG | GTACAAACAATCATTATTCTGGAAAGT    c.10080
 F  T  F  T  S  G  F  K  L  F  L   | V  Q  T  I  I  I  L  E  S      p.3360

          .         .         .         .         .         .       g.171836
 TCTCAAGTAAGATATTTTACTTCAGACAGCCAAGATTATTTAATCATTGCAAGTCAAAGA       c.10140
 S  Q  V  R  Y  F  T  S  D  S  Q  D  Y  L  I  I  A  S  Q  R         p.3380

          .         .  | 49      .         .         .         .    g.174908
 GATGATTCCGAATTAACTCAG | GTCTTCAGGTGGAATGGAGGAAGCTTCGTGTTGCATCAA    c.10200
 D  D  S  E  L  T  Q   | V  F  R  W  N  G  G  S  F  V  L  H  Q      p.3400

          .         .         .         .         .         .       g.174968
 AAACTCCCTGTCCGAGGTGTGCTGACCGTGGCCTTGTTCAACAAGGGAGGCTCTGTGTTC       c.10260
 K  L  P  V  R  G  V  L  T  V  A  L  F  N  K  G  G  S  V  F         p.3420

          .         .         .         .         .         .       g.175028
 TTAGCCATTTCCCAGGCTAATGCCAGGCTAAACTCCCTTTTATTCAGATGGTCTGGCAGT       c.10320
 L  A  I  S  Q  A  N  A  R  L  N  S  L  L  F  R  W  S  G  S         p.3440

          .         .         .         .         .         .       g.175088
 GGGTTTATTAACTTTCAAGAGGTGCCTGTCAGTGGGACAACAGAAGTTGAGGCTTTGTCT       c.10380
 G  F  I  N  F  Q  E  V  P  V  S  G  T  T  E  V  E  A  L  S         p.3460

          .         .         .         .       | 50 .         .    g.175856
 TCAGCCAATGATATTTACCTAATATTTGCCGAAAATGTCTTTCTAG | GAGATCAGAATTCA    c.10440
 S  A  N  D  I  Y  L  I  F  A  E  N  V  F  L  G |   D  Q  N  S      p.3480

          .         .         .         .         .         .       g.175916
 ATTGATATTTTCATCTGGGAGATGGGACAGTCTTCCTTCAGGTATTTTCAGTCTGTAGAT       c.10500
 I  D  I  F  I  W  E  M  G  Q  S  S  F  R  Y  F  Q  S  V  D         p.3500

          .         .         .         .          | 51        .    g.191257
 TTTGCTGCTGTTAACAGAATCCACTCCTTCACACCAGCCTCAGGAATAG | CCCACATACTT    c.10560
 F  A  A  V  N  R  I  H  S  F  T  P  A  S  G  I  A |   H  I  L      p.3520

          .         .         .         .         .         .       g.191317
 CTTATTGGCCAAGATATGTCTGCTCTTTACTGCTGGAATTCGGAGCGTAATCAATTCTCT       c.10620
 L  I  G  Q  D  M  S  A  L  Y  C  W  N  S  E  R  N  Q  F  S         p.3540

          .         .         .         .         .         .       g.191377
 TTTGTTCTGGAAGTACCTTCTGCTTATGATGTGGCTTCTGTTACAGTAAAGTCCCTTAAT       c.10680
 F  V  L  E  V  P  S  A  Y  D  V  A  S  V  T  V  K  S  L  N         p.3560

          .         .         .         .         .         .       g.191437
 TCAAGCAAGAATTTAATAGCTCTAGTGGGAGCTCATTCACATATATATGAGCTAGCCTAC       c.10740
 S  S  K  N  L  I  A  L  V  G  A  H  S  H  I  Y  E  L  A  Y         p.3580

          .         .          | 52        .         .         .    g.191822
 ATTTCCAGCCATTCTGACTTTATTCCTAG | TTCAGGTGAACTGATATTTGAACCTGGTGAG    c.10800
 I  S  S  H  S  D  F  I  P  S  |  S  G  E  L  I  F  E  P  G  E      p.3600

          .         .         .         .         .         .       g.191882
 AGAGAAGCTACAATAGCAGTAAATATCCTTGATGATACAGTTCCAGAAAAAGAAGAATCC       c.10860
 R  E  A  T  I  A  V  N  I  L  D  D  T  V  P  E  K  E  E  S         p.3620

          .         .         .         .         .         .       g.191942
 TTCAAAGTTCAACTTAAAAATCCCAAAGGAGGAGCAGAGATTGGCATTAATGATTCTGTA       c.10920
 F  K  V  Q  L  K  N  P  K  G  G  A  E  I  G  I  N  D  S  V         p.3640

          .         .         .         .         .     | 53   .    g.196757
 ACAATAACCATTCTGTCTAATGATGATGCCTATGGAATTGTTGCATTTGCTCAG | AATTCA    c.10980
 T  I  T  I  L  S  N  D  D  A  Y  G  I  V  A  F  A  Q   | N  S      p.3660

          .         .         .         .         .         .       g.196817
 TTATATAAGCAAGTGGAAGAAATGGAGCAAGATAGCCTAGTAACCTTGAACGTTGAACGC       c.11040
 L  Y  K  Q  V  E  E  M  E  Q  D  S  L  V  T  L  N  V  E  R         p.3680

          .         .         .         .         .         .       g.196877
 TTAAAAGGAACATATGGCCGTATAACCATAGCATGGGAAGCTGATGGAAGTATTAGTGAT       c.11100
 L  K  G  T  Y  G  R  I  T  I  A  W  E  A  D  G  S  I  S  D         p.3700

          .         .  | 54      .         .         .         .    g.199813
 ATATTTCCTACCTCAGGAGTG | ATTTTATTTACTGAAGGCCAGGTACTGTCAACAATCACT    c.11160
 I  F  P  T  S  G  V   | I  L  F  T  E  G  Q  V  L  S  T  I  T      p.3720

          .         .         .         .         .         .       g.199873
 CTAACTATTCTTGCTGATAATATACCAGAGTTATCAGAGGTTGTGATTGTAACCCTCACC       c.11220
 L  T  I  L  A  D  N  I  P  E  L  S  E  V  V  I  V  T  L  T         p.3740

          .         .         .         .         .         .       g.199933
 CGTATCACCACAGAAGGGGTTGAGGACTCATACAAAGGTGCTACTATTGATCAGGACAGA       c.11280
 R  I  T  T  E  G  V  E  D  S  Y  K  G  A  T  I  D  Q  D  R         p.3760

          .         .         .         .         .         .       g.199993
 AGCAAGTCTGTTATAACAACTTTGCCCAATGACTCACCTTTTGGCTTGGTGGGCTGGCGT       c.11340
 S  K  S  V  I  T  T  L  P  N  D  S  P  F  G  L  V  G  W  R         p.3780

          .         .         .        | 55.         .         .    g.201206
 GCTGCGTCTGTCTTCATTAGAGTAGCAGAGCCTAAAG | AAAACACCACCACTCTTCAGTTA    c.11400
 A  A  S  V  F  I  R  V  A  E  P  K  E |   N  T  T  T  L  Q  L      p.3800

          .         .         .         .         .         .       g.201266
 CAAATAGCTCGAGATAAAGGACTACTTGGGGATATTGCCATTCACTTGAGAGCTCAACCC       c.11460
 Q  I  A  R  D  K  G  L  L  G  D  I  A  I  H  L  R  A  Q  P         p.3820

          .         .         .         .         .         .       g.201326
 AATTTCTTACTGCATGTCGATAATCAAGCTACTGAGAATGAAGATTATGTATTGCAAGAA       c.11520
 N  F  L  L  H  V  D  N  Q  A  T  E  N  E  D  Y  V  L  Q  E         p.3840

          .         .         .         .         .         .       g.201386
 ACAATAATAATAATGAAAGAAAACATAAAAGAAGCTCATGCCGAAGTTTCCATTTTGCCG       c.11580
 T  I  I  I  M  K  E  N  I  K  E  A  H  A  E  V  S  I  L  P         p.3860

  | 56       .         .         .         .         .         .    g.202714
  | GATGACCTTCCTGAATTGGAGGAAGGATTTATTGTCACTATCACTGAGGTGAACCTGGTG    c.11640
  | D  D  L  P  E  L  E  E  G  F  I  V  T  I  T  E  V  N  L  V      p.3880

          .         .         .         .         .         .       g.202774
 AACTCTGACTTCTCTACAGGACAGCCAAGTGTGCGGAGGCCCGGAATGGAAATAGCTGAG       c.11700
 N  S  D  F  S  T  G  Q  P  S  V  R  R  P  G  M  E  I  A  E         p.3900

          .         .         .         .         .        | 57.    g.203182
 ATAATGATAGAAGAAAATGACGATCCCAGAGGAATTTTTATGTTTCATGTTACTAGA | GGC    c.11760
 I  M  I  E  E  N  D  D  P  R  G  I  F  M  F  H  V  T  R   | G      p.3920

          .         .         .         .         .         .       g.203242
 GCTGGGGAAGTTATTACTGCCTATGAGGTGCCTCCACCCTTGAACGTTCTTCAAGTTCCT       c.11820
 A  G  E  V  I  T  A  Y  E  V  P  P  P  L  N  V  L  Q  V  P         p.3940

          .         .         .         .         .         .       g.203302
 GTAGTCCGGCTGGCTGGAAGCTTTGGGGCAGTAAATGTTTATTGGAAAGCATCACCAGAC       c.11880
 V  V  R  L  A  G  S  F  G  A  V  N  V  Y  W  K  A  S  P  D         p.3960

          .         .         .         .         .         .       g.203362
 AGTGCTGGCCTGGAAGACTTTAAACCATCTCATGGGATTCTTGAATTTGCAGATAAACAG       c.11940
 S  A  G  L  E  D  F  K  P  S  H  G  I  L  E  F  A  D  K  Q         p.3980

  | 58       .         .         .         .         .         .    g.205669
  | GTTACTGCAATGATAGAAATCACCATAATTGATGATGCTGAATTTGAATTGACAGAGACG    c.12000
  | V  T  A  M  I  E  I  T  I  I  D  D  A  E  F  E  L  T  E  T      p.4000

          .         .         .         .         .         .       g.205729
 TTCAATATTTCCTTGATCAGTGTTGCTGGAGGTGGCAGACTTGGTGATGATGTTGTGGTA       c.12060
 F  N  I  S  L  I  S  V  A  G  G  G  R  L  G  D  D  V  V  V         p.4020

          .         .         .         .         .         .       g.205789
 ACTGTTGTTATTCCACAAAATGATTCTCCATTTGGAGTATTTGGATTTGAAGAAAAGACT       c.12120
 T  V  V  I  P  Q  N  D  S  P  F  G  V  F  G  F  E  E  K  T         p.4040

  | 59       .         .         .         .         .         .    g.209565
  | GTAATGATTGATGAATCCCTTTCATCCGATGACCCTGATTCATATGTGACATTGACGGTT    c.12180
  | V  M  I  D  E  S  L  S  S  D  D  P  D  S  Y  V  T  L  T  V      p.4060

          .         .         .         .         .         .       g.209625
 GTCCGGTCCCCAGGAGGAAAAGGAACCGTCCGACTTGAGTGGACCATAGATGAGAAGGCT       c.12240
 V  R  S  P  G  G  K  G  T  V  R  L  E  W  T  I  D  E  K  A         p.4080

          .         .         .         .      | 60  .         .    g.220401
 AAACATAACCTTAGTCCTTTGAATGGGACCCTTCATTTTGATGAG | ACTGAGTCCCAGAAG    c.12300
 K  H  N  L  S  P  L  N  G  T  L  H  F  D  E   | T  E  S  Q  K      p.4100

          .         .         .         .         .         .       g.220461
 ACCATTGTGTTGCACACACTTCAAGACACAGTGTTGGAGGAGGACAGGCGTTTCACCATT       c.12360
 T  I  V  L  H  T  L  Q  D  T  V  L  E  E  D  R  R  F  T  I         p.4120

          .         .         .         .    | 61    .         .    g.222670
 CAGCTGATATCAATTGATGAGGTAGAAATATCTCCAGTAAAAG | GTAGTGCATCAATAATT    c.12420
 Q  L  I  S  I  D  E  V  E  I  S  P  V  K  G |   S  A  S  I  I      p.4140

          .         .         .         .         .         .       g.222730
 ATTCGGGGTGATAAGCGAGCATCAGGAGAAGTTGGGATAGCTCCGTCATCTAGGCACATC       c.12480
 I  R  G  D  K  R  A  S  G  E  V  G  I  A  P  S  S  R  H  I         p.4160

          .         .         .         .        | 62.         .    g.224118
 CTCATTGGGGAACCCTCAGCAAAATATAATGGTACCGCTATTATCAG | CCTTGTTCGAGGC    c.12540
 L  I  G  E  P  S  A  K  Y  N  G  T  A  I  I  S  |  L  V  R  G      p.4180

          .         .         .         .         .         .       g.224178
 CCAGGGATTTTGGGGGAGGTCACAGTGTTCTGGAGGATATTCCCTCCTTCCGTGGGGGAA       c.12600
 P  G  I  L  G  E  V  T  V  F  W  R  I  F  P  P  S  V  G  E         p.4200

          .         .         .         .         .         .       g.224238
 TTTGCTGAAACATCAGGAAAACTGACAATGCGAGACGAACAGTCTGCAGTCATTGTAGTA       c.12660
 F  A  E  T  S  G  K  L  T  M  R  D  E  Q  S  A  V  I  V  V         p.4220

        | 63 .         .         .         .         .         .    g.224681
 ATACAG | GCTTTGAACGATGACATTCCCGAGGAAAAAAGCTTCTATGAGTTTCAGCTCACT    c.12720
 I  Q   | A  L  N  D  D  I  P  E  E  K  S  F  Y  E  F  Q  L  T      p.4240

          .         .         .         .         .         .       g.224741
 GCAGTCAGTGAGGGAGGAGTTCTGAGTGAATCCAGCAGCACTGCCAACATCACGGTGGTG       c.12780
 A  V  S  E  G  G  V  L  S  E  S  S  S  T  A  N  I  T  V  V         p.4260

          .         .         .         .         .         .       g.224801
 GCCAGCGACTCTCCCTATGGCCGATTTGCCTTTTCACATGAGCAACTTCGAGTGTCAGAA       c.12840
 A  S  D  S  P  Y  G  R  F  A  F  S  H  E  Q  L  R  V  S  E         p.4280

           | 64        .         .         .         .         .    g.225116
 GCACAGAGG | GTTAACATCACAATCATCCGTTCCAGTGGAGATTTTGGCCATGTGCGACTC    c.12900
 A  Q  R   | V  N  I  T  I  I  R  S  S  G  D  F  G  H  V  R  L      p.4300

          .         .         .         .         .         .       g.225176
 TGGTACAAGACGATGAGCGGGACAGCGGAAGCAGGCTTGGATTTTGTTCCTGCAGCAGGG       c.12960
 W  Y  K  T  M  S  G  T  A  E  A  G  L  D  F  V  P  A  A  G         p.4320

          .         .         .         .         .         .       g.225236
 GAGCTCCTCTTTGAAGCAGGGGAGATGAGGAAAAGTCTGCATGTTGAAATCCTTGATGAT       c.13020
 E  L  L  F  E  A  G  E  M  R  K  S  L  H  V  E  I  L  D  D         p.4340

          .         .         .         .         .         .       g.225296
 GACTATCCTGAAGGCCCAGAGGAATTTTCTCTAACAATTACAAAGGTGGAACTCCAGGGA       c.13080
 D  Y  P  E  G  P  E  E  F  S  L  T  I  T  K  V  E  L  Q  G         p.4360

    | 65     .         .         .         .         .         .    g.227688
 AG | AGGGTATGATTTTACCATTCAAGAAAATGGACTTCAGATAGATCAACCTCCTGAAATA    c.13140
 R  |  G  Y  D  F  T  I  Q  E  N  G  L  Q  I  D  Q  P  P  E  I      p.4380

          .         .         .         .         .         .       g.227748
 GGAAACATCTCCATTGTTCGCATCATAATAATGAAAAATGATAACGCAGAAGGCATCATT       c.13200
 G  N  I  S  I  V  R  I  I  I  M  K  N  D  N  A  E  G  I  I         p.4400

          .         .         .  | 66      .         .         .    g.229353
 GAATTTGACCCAAAGTATACTGCCTTCGAAG | TGGAGGAAGATGTTGGGCTGATCATGATC    c.13260
 E  F  D  P  K  Y  T  A  F  E  V |   E  E  D  V  G  L  I  M  I      p.4420

          .         .         .         .         .         .       g.229413
 CCAGTGGTGAGGCTACATGGAACTTATGGCTATGTGACAGCTGATTTCATCTCTCAGAGC       c.13320
 P  V  V  R  L  H  G  T  Y  G  Y  V  T  A  D  F  I  S  Q  S         p.4440

          .         .         .         .         .         .       g.229473
 TCCTCTGCCAGTCCCGGAGGTGTTGATTACATTTTGCATGGCAGTACAGTCACCTTTCAG       c.13380
 S  S  A  S  P  G  G  V  D  Y  I  L  H  G  S  T  V  T  F  Q         p.4460

          .         .         .         .         .    | 67    .    g.230045
 CATGGGCAAAACTTAAGTTTTATAAATATCTCCATCATTGATGACAATGAAAG | TGAATTT    c.13440
 H  G  Q  N  L  S  F  I  N  I  S  I  I  D  D  N  E  S  |  E  F      p.4480

          .         .         .         .         .         .       g.230105
 GAGGAGCCCATTGAAATTCTACTCACTGGAGCTACTGGAGGAGCGGTCCTTGGGCGCCAC       c.13500
 E  E  P  I  E  I  L  L  T  G  A  T  G  G  A  V  L  G  R  H         p.4500

          .         .         .         .         .         .       g.230165
 CTAGTGAGCAGAATCATAATAGCTAAGAGTGACTCTCCCTTTGGAGTTATAAGGTTTCTC       c.13560
 L  V  S  R  I  I  I  A  K  S  D  S  P  F  G  V  I  R  F  L         p.4520

          .         .         .         .         .         .       g.230225
 AATCAAAGCAAAATTTCTATTGCTAATCCCAATTCCACAATGATTTTATCACTGGTGCTG       c.13620
 N  Q  S  K  I  S  I  A  N  P  N  S  T  M  I  L  S  L  V  L         p.4540

          .         .         .    | 68    .         .         .    g.234298
 GAGCGGACTGGAGGACTCTTGGGAGAGATTCAG | GTGAACTGGGAGACAGTAGGACCCAAC    c.13680
 E  R  T  G  G  L  L  G  E  I  Q   | V  N  W  E  T  V  G  P  N      p.4560

          .         .         .         .         .         .       g.234358
 TCTCAAGAAGCCTTACTGCCACAGAATAGAGACATTGCAGACCCAGTGAGCGGGTTGTTC       c.13740
 S  Q  E  A  L  L  P  Q  N  R  D  I  A  D  P  V  S  G  L  F         p.4580

          .         .         .         .         .         .       g.234418
 TATTTTGGAGAAGGAGAAGGAGGAGTGAGAACCATAATTCTGACAATCTATCCTCATGAA       c.13800
 Y  F  G  E  G  E  G  G  V  R  T  I  I  L  T  I  Y  P  H  E         p.4600

          .         .         .         .         .         .       g.234478
 GAAATTGAAGTTGAAGAGACATTCATTATTAAACTTCATCTTGTGAAAGGAGAAGCTAAA       c.13860
 E  I  E  V  E  E  T  F  I  I  K  L  H  L  V  K  G  E  A  K         p.4620

          .         .         .    | 69    .         .         .    g.235929
 TTAGACTCCAGAGCTAAAGATGTTACATTAACC | ATACAAGAGTTTGGTGACCCAAATGGA    c.13920
 L  D  S  R  A  K  D  V  T  L  T   | I  Q  E  F  G  D  P  N  G      p.4640

          .         .         .         .         .         .       g.235989
 GTTGTTCAGTTTGCTCCTGAAACTTTGTCTAAGAAGACTTATTCAGAGCCTCTGGCTCTG       c.13980
 V  V  Q  F  A  P  E  T  L  S  K  K  T  Y  S  E  P  L  A  L         p.4660

          .         .         .         .         .         .       g.236049
 GAAGGGCCCCTGCTCATTACCTTCTTTGTCAGAAGAGTCAAGGGCACCTTTGGAGAGATT       c.14040
 E  G  P  L  L  I  T  F  F  V  R  R  V  K  G  T  F  G  E  I         p.4680

     | 70    .         .         .         .         .         .    g.237130
 ATG | GTTTACTGGGAATTAAGTAGTGAGTTTGACATTACTGAAGACTTTCTTTCCACCAGT    c.14100
 M   | V  Y  W  E  L  S  S  E  F  D  I  T  E  D  F  L  S  T  S      p.4700

          .         .         .         .         .         .       g.237190
 GGATTTTTCACCATTGCTGATGGAGAGAGTGAAGCTAGCTTTGATGTTCATTTGCTACCA       c.14160
 G  F  F  T  I  A  D  G  E  S  E  A  S  F  D  V  H  L  L  P         p.4720

          .         .         .         .         .         .       g.237250
 GATGAGGTACCTGAGATAGAGGAAGATTATGTGATCCAGCTTGTTTCTGTAGAGGGAGGA       c.14220
 D  E  V  P  E  I  E  E  D  Y  V  I  Q  L  V  S  V  E  G  G         p.4740

          .         .         .         .         .         .       g.237310
 GCCGAACTGGATCTGGAGAAGAGTATCACATGGTTCTCTGTTTATGCAAATGATGACCCA       c.14280
 A  E  L  D  L  E  K  S  I  T  W  F  S  V  Y  A  N  D  D  P         p.4760

          .         .         .         .         .         .       g.237370
 CATGGAGTATTTGCCCTGTATTCGGATCGCCAGTCAATACTTATTGGGCAGAACCTTATT       c.14340
 H  G  V  F  A  L  Y  S  D  R  Q  S  I  L  I  G  Q  N  L  I         p.4780

          .         .         .         .         .         .       g.237430
 AGATCCATCCAAATTAACATAACCCGGCTTGCTGGAACATTTGGAGATGTGGCTGTTGGG       c.14400
 R  S  I  Q  I  N  I  T  R  L  A  G  T  F  G  D  V  A  V  G         p.4800

          .         .         .         .         .         .       g.237490
 CTTCGAATATCATCGGATCATAAAGAACAGCCGATTGTTACCGAAAATGCAGAGAGGCAG       c.14460
 L  R  I  S  S  D  H  K  E  Q  P  I  V  T  E  N  A  E  R  Q         p.4820

          .         .         .         .         .        | 71.    g.248942
 CTGGTGGTCAAAGATGGTGCCACATATAAAGTGGACGTGGTGCCAATAAAGAATCAG | GTC    c.14520
 L  V  V  K  D  G  A  T  Y  K  V  D  V  V  P  I  K  N  Q   | V      p.4840

          .         .         .         .         .         .       g.249002
 TTCCTATCACTGGGCTCTAATTTCACTTTGCAACTGGTGACTGTGATGCTTGTCGGTGGA       c.14580
 F  L  S  L  G  S  N  F  T  L  Q  L  V  T  V  M  L  V  G  G         p.4860

          .         .         .         .         .         .       g.249062
 CGTTTCTATGGAATGCCAACAATTCTTCAGGAAGCAAAATCTGCTGTCCTTCCAGTCTCT       c.14640
 R  F  Y  G  M  P  T  I  L  Q  E  A  K  S  A  V  L  P  V  S         p.4880

          .         .  | 72      .         .         .         .    g.251523
 GAGAAAGCTGCCAATTCTCAG | GTCGGATTTGAATCCACTGCTTTTCAACTCATGAACATC    c.14700
 E  K  A  A  N  S  Q   | V  G  F  E  S  T  A  F  Q  L  M  N  I      p.4900

          .         .         .         .         .         .       g.251583
 ACTGCTGGCACAAGCCACGTTATGATTTCTAGGAGAGGCACATATGGAGCTCTCTCGGTT       c.14760
 T  A  G  T  S  H  V  M  I  S  R  R  G  T  Y  G  A  L  S  V         p.4920

          .         .         .         .         .         .       g.251643
 GCCTGGACCACTGGATATGCTCCTGGGTTAGAAATTCCTGAATTCATTGTTGTTGGCAAC       c.14820
 A  W  T  T  G  Y  A  P  G  L  E  I  P  E  F  I  V  V  G  N         p.4940

          .       | 73 .         .         .         .         .    g.253846
 ATGACCCCAACACTGG | GGAGCCTTTCATTTTCCCACGGTGAACAAAGGAAAGGAGTTTTC    c.14880
 M  T  P  T  L  G |   S  L  S  F  S  H  G  E  Q  R  K  G  V  F      p.4960

          .         .         .         .         .         .       g.253906
 CTGTGGACGTTTCCTAGCCCTGGTTGGCCAGAGGCCTTTGTTCTTCACCTATCAGGAGTG       c.14940
 L  W  T  F  P  S  P  G  W  P  E  A  F  V  L  H  L  S  G  V         p.4980

          .         .         .   | 74     .         .         .    g.256461
 CAGAGCAGTGCTCCTGGCGGAGCTCAACTCCG | ATCAGGTTTCATTGTTGCTGAAATTGAA    c.15000
 Q  S  S  A  P  G  G  A  Q  L  R  |  S  G  F  I  V  A  E  I  E      p.5000

          .         .         .         .         .         .       g.256521
 CCAATGGGCGTCTTCCAATTTTCCACTAGCTCAAGAAATATCATAGTGTCAGAAGATACA       c.15060
 P  M  G  V  F  Q  F  S  T  S  S  R  N  I  I  V  S  E  D  T         p.5020

          .         .         .         .         .         .       g.256581
 CAGATGATCAGATTACATGTACAAAGACTATTTGGGTTCCACAGCGATCTTATTAAAGTT       c.15120
 Q  M  I  R  L  H  V  Q  R  L  F  G  F  H  S  D  L  I  K  V         p.5040

          .         .         .         .         .         .       g.256641
 TCTTATCAGACCACTGCAGGAAGCGCCAAGCCACTGGAAGATTTTGAGCCTGTTCAGAAT       c.15180
 S  Y  Q  T  T  A  G  S  A  K  P  L  E  D  F  E  P  V  Q  N         p.5060

          .         .         .         .         .         .       g.256701
 GGGGAACTGTTTTTTCAAAAATTCCAAACTGAGGTTGATTTTGAAATAACCATTATTAAT       c.15240
 G  E  L  F  F  Q  K  F  Q  T  E  V  D  F  E  I  T  I  I  N         p.5080

          .         .         .         .         .         .       g.256761
 GATCAGCTTTCTGAGATAGAAGAATTTTTTTACATTAACCTTACTTCAGTAGAAATTAGG       c.15300
 D  Q  L  S  E  I  E  E  F  F  Y  I  N  L  T  S  V  E  I  R         p.5100

          .         .         .         .         .         .       g.256821
 GGATTACAAAAGTTTGATGTTAATTGGAGCCCACGCCTGAATCTAGATTTCAGTGTTGCA       c.15360
 G  L  Q  K  F  D  V  N  W  S  P  R  L  N  L  D  F  S  V  A         p.5120

          .         .         .         .         .         .       g.256881
 GTGATTACAATATTGGATAATGATGACCTGGCAGGAATGGATATTTCCTTCCCCGAGACA       c.15420
 V  I  T  I  L  D  N  D  D  L  A  G  M  D  I  S  F  P  E  T         p.5140

          .         .         .         .         .         .       g.256941
 ACTGTGGCTGTAGCAGTTGACACAACTCTCATTCCTGTAGAAACTGAATCCACCACATAC       c.15480
 T  V  A  V  A  V  D  T  T  L  I  P  V  E  T  E  S  T  T  Y         p.5160

          .         .         .         .         .         .       g.257001
 CTCAGCACAAGCAAGACGACTACCATTCTGCAGCCAACCAACGTGGTTGCCATTGTTACT       c.15540
 L  S  T  S  K  T  T  T  I  L  Q  P  T  N  V  V  A  I  V  T         p.5180

          .         .         .         .         .         .       g.257061
 GAGGCAACTGGTGTATCTGCCATCCCTGAGAAACTTGTCACCCTTCATGGCACACCTGCT       c.15600
 E  A  T  G  V  S  A  I  P  E  K  L  V  T  L  H  G  T  P  A         p.5200

          .         .         .         .         .         .       g.257121
 GTGTCTGAAAAGCCTGATGTGGCCACTGTAACTGCCAATGTTTCCATTCATGGAACATTC       c.15660
 V  S  E  K  P  D  V  A  T  V  T  A  N  V  S  I  H  G  T  F         p.5220

          .         .         .         .         .         .       g.257181
 AGCCTTGGGCCATCCATTGTTTATATTGAAGAGGAGATGAAGAATGGCACATTCAACACT       c.15720
 S  L  G  P  S  I  V  Y  I  E  E  E  M  K  N  G  T  F  N  T         p.5240

          .         .         .         .         .         .       g.257241
 GCAGAAGTTCTTATCCGAAGAACTGGTGGGTTTACTGGCAATGTCAGCATAACAGTTAAA       c.15780
 A  E  V  L  I  R  R  T  G  G  F  T  G  N  V  S  I  T  V  K         p.5260

          .         .         .         .         .         .       g.257301
 ACTTTCGGTGAAAGATGTGCTCAGATGGAACCAAATGCATTGCCCTTTCGTGGTATCTAT       c.15840
 T  F  G  E  R  C  A  Q  M  E  P  N  A  L  P  F  R  G  I  Y         p.5280

          .         .         .         .         .         .       g.257361
 GGGATTTCCAACCTAACATGGGCAGTTGAAGAAGAAGACTTTGAAGAACAAACTCTTACC       c.15900
 G  I  S  N  L  T  W  A  V  E  E  E  D  F  E  E  Q  T  L  T         p.5300

          .         .         .         .         .         .       g.257421
 CTTATATTCCTAGATGGAGAAAGAGAACGTAAAGTATCAGTTCAAATTTTGGATGATGAT       c.15960
 L  I  F  L  D  G  E  R  E  R  K  V  S  V  Q  I  L  D  D  D         p.5320

          .         .         .         .         .         .       g.257481
 GAGCCTGAGGGGCAGGAATTCTTCTACGTGTTTCTCACAAACCCTCAAGGGGGAGCACAG       c.16020
 E  P  E  G  Q  E  F  F  Y  V  F  L  T  N  P  Q  G  G  A  Q         p.5340

          .         .         .         .         .         | 75    g.261821
 ATTGTGGAGGAGAAGGATGATACTGGATTTGCAGCTTTTGCCATGGTTATTATTACAG | GG    c.16080
 I  V  E  E  K  D  D  T  G  F  A  A  F  A  M  V  I  I  T  G |       p.5360

          .         .         .         .         .         .       g.261881
 AGTGACCTTCACAATGGCATCATAGGATTCAGTGAGGAGTCCCAGAGTGGACTAGAACTC       c.16140
 S  D  L  H  N  G  I  I  G  F  S  E  E  S  Q  S  G  L  E  L         p.5380

          .         .         .         .         .       | 76 .    g.269629
 AGGGAAGGAGCTGTTATGAGAAGATTGCACCTTATTGTCACAAGACAGCCAAACAG | GGCC    c.16200
 R  E  G  A  V  M  R  R  L  H  L  I  V  T  R  Q  P  N  R  |  A      p.5400

          .         .         .         .         .         .       g.269689
 TTTGAAGATGTCAAGGTCTTTTGGCGAGTCACACTTAACAAAACAGTCGTCGTGCTCCAG       c.16260
 F  E  D  V  K  V  F  W  R  V  T  L  N  K  T  V  V  V  L  Q         p.5420

          .         .         .         .         .         .       g.269749
 AAGGATGGGGTAAACCTGGTGGAGGAACTTCAGTCTGTGTCAGGGACCACAACCTGTACA       c.16320
 K  D  G  V  N  L  V  E  E  L  Q  S  V  S  G  T  T  T  C  T         p.5440

          .         .         .         .         | 77         .    g.275156
 ATGGGTCAAACAAAATGCTTTATCAGCATTGAACTCAAACCAGAAAAG | GTACCACAGGTT    c.16380
 M  G  Q  T  K  C  F  I  S  I  E  L  K  P  E  K   | V  P  Q  V      p.5460

          .         .         .         .         .         .       g.275216
 GAAGTGTATTTTTTTGTGGAACTATATGAAGCTACTGCTGGAGCAGCAATAAACAACAGT       c.16440
 E  V  Y  F  F  V  E  L  Y  E  A  T  A  G  A  A  I  N  N  S         p.5480

          .         .         .         .         .         .       g.275276
 GCCAGATTCGCACAGATTAAAATCTTAGAAAGTGATGAATCTCAAAGCCTTGTGTATTTT       c.16500
 A  R  F  A  Q  I  K  I  L  E  S  D  E  S  Q  S  L  V  Y  F         p.5500

          .         .         .         .         .         .       g.275336
 TCTGTGGGTTCTCGGCTGGCAGTGGCTCACAAGAAGGCCACTTTAATCAGTCTGCAGGTG       c.16560
 S  V  G  S  R  L  A  V  A  H  K  K  A  T  L  I  S  L  Q  V         p.5520

          .         .         .         .         .  | 78      .    g.286787
 GCCAGAGATTCTGGGACAGGACTAATGATGTCTGTTAACTTTAGTACCCAG | GAGTTGAGG    c.16620
 A  R  D  S  G  T  G  L  M  M  S  V  N  F  S  T  Q   | E  L  R      p.5540

          .         .         .         .         .         .       g.286847
 AGTGCTGAAACAATTGGTCGTACCATCATATCTCCAGCTATTTCTGGAAAGGATTTTGTG       c.16680
 S  A  E  T  I  G  R  T  I  I  S  P  A  I  S  G  K  D  F  V         p.5560

          .         .         .         .         .         .       g.286907
 ATAACTGAAGGCACATTGGTCTTTGAACCTGGCCAGAGAAGCACTGTATTGGATGTCATC       c.16740
 I  T  E  G  T  L  V  F  E  P  G  Q  R  S  T  V  L  D  V  I         p.5580

          .         .         .         .         .         .       g.286967
 CTAACGCCAGAGACAGGATCTTTAAATTCATTTCCTAAACGCTTCCAGATTGTCCTTTTT       c.16800
 L  T  P  E  T  G  S  L  N  S  F  P  K  R  F  Q  I  V  L  F         p.5600

          .         .         .         .         .         .       g.287027
 GACCCAAAAGGTGGTGCCAGAATTGATAAAGTGTATGGGACTGCCAACATCACTCTTGTC       c.16860
 D  P  K  G  G  A  R  I  D  K  V  Y  G  T  A  N  I  T  L  V         p.5620

          .         .         .         .         .         .       g.287087
 TCAGATGCAGATTCGCAGGCCATTTGGGGGCTTGCAGATCAGCTACATCAGCCTGTGAAT       c.16920
 S  D  A  D  S  Q  A  I  W  G  L  A  D  Q  L  H  Q  P  V  N         p.5640

          .         .         .         .         .         .       g.287147
 GATGATATTCTCAACAGAGTGCTCCATACCATCAGCATGAAAGTGGCCACAGAAAACACA       c.16980
 D  D  I  L  N  R  V  L  H  T  I  S  M  K  V  A  T  E  N  T         p.5660

          .         .         .          | 79        .         .    g.294858
 GATGAACAACTCAGTGCCATGATGCATTTAATAGAAAAG | ATAACTACTGAAGGAAAAATT    c.17040
 D  E  Q  L  S  A  M  M  H  L  I  E  K   | I  T  T  E  G  K  I      p.5680

          .         .         .         .         .         .       g.294918
 CAAGCTTTCAGTGTTGCCAGCCGAACTCTTTTCTATGAGATTCTTTGTTCTCTTATTAAC       c.17100
 Q  A  F  S  V  A  S  R  T  L  F  Y  E  I  L  C  S  L  I  N         p.5700

          .         .         .         .         .         .       g.294978
 CCAAAGCGCAAGGACACTAGGGGATTCAGTCACTTTGCTGAAGTGACTGAGAATTTTGCC       c.17160
 P  K  R  K  D  T  R  G  F  S  H  F  A  E  V  T  E  N  F  A         p.5720

          .         .         .         .     | 80   .         .    g.299500
 TTTTCTCTGCTGACTAATGTTACTTGCGGCTCTCCTGGTGAAAA | AAGCAAAACCATCCTT    c.17220
 F  S  L  L  T  N  V  T  C  G  S  P  G  E  K  |  S  K  T  I  L      p.5740

          .         .         .         .         .         .       g.299560
 GATAGTTGCCCATATTTGTCAATATTGGCTCTTCACTGGTATCCTCAGCAAATCAATGGA       c.17280
 D  S  C  P  Y  L  S  I  L  A  L  H  W  Y  P  Q  Q  I  N  G         p.5760

          .         .         .         .         .         .       g.299620
 CACAAGTTTGAAGGAAAGGAAGGAGATTACATTCGAATTCCAGAGAGGCTACTGGATGTC       c.17340
 H  K  F  E  G  K  E  G  D  Y  I  R  I  P  E  R  L  L  D  V         p.5780

          .         .         .         .         .         .       g.299680
 CAGGATGCAGAAATAATGGCTGGGAAAAGTACATGTAAATTAGTCCAGTTTACAGAGTAT       c.17400
 Q  D  A  E  I  M  A  G  K  S  T  C  K  L  V  Q  F  T  E  Y         p.5800

          .         .         .         .         .     | 81   .    g.300268
 AGCAGCCAACAGTGGTTTATAAGTGGAAACAATCTTCCTACCCTAAAAAATAAG | GTATTA    c.17460
 S  S  Q  Q  W  F  I  S  G  N  N  L  P  T  L  K  N  K   | V  L      p.5820

          .         .         .         .         .         .       g.300328
 TCTTTGAGTGTGAAAGGTCAGAGTTCACAACTCCTGACTAATGACAATGAGGTTCTCTAC       c.17520
 S  L  S  V  K  G  Q  S  S  Q  L  L  T  N  D  N  E  V  L  Y         p.5840

          .         .         .         .         .         .       g.300388
 AGGATTTATGCTGCTGAGCCTAGAATTATTCCTCAGACATCTCTGTGTCTCCTTTGGAAT       c.17580
 R  I  Y  A  A  E  P  R  I  I  P  Q  T  S  L  C  L  L  W  N         p.5860

          .     | 82   .         .         .         .         .    g.301987
 CAGGCTGCTGCAAG | CTGGTTGTCTGACAGTCAGTTTTGCAAAGTGGTTGAGGAAACTGCA    c.17640
 Q  A  A  A  S  |  W  L  S  D  S  Q  F  C  K  V  V  E  E  T  A      p.5880

          .         .         .         .         .         .       g.302047
 GACTATGTGGAATGTGCCTGTTCACACATGTCTGTGTATGCTGTCTATGCTCGGACTGAC       c.17700
 D  Y  V  E  C  A  C  S  H  M  S  V  Y  A  V  Y  A  R  T  D         p.5900

          .         .         .         .         .      | 83  .    g.309962
 AACTTGTCTTCATACAATGAAGCCTTCTTCACTTCTGGATTTATATGTATCTCAG | GTCTT    c.17760
 N  L  S  S  Y  N  E  A  F  F  T  S  G  F  I  C  I  S  G |   L      p.5920

          .         .         .         .         .         .       g.310022
 TGCTTGGCTGTTCTTTCCCATATCTTCTGTGCCAGGTACTCCATGTTTGCAGCTAAACTT       c.17820
 C  L  A  V  L  S  H  I  F  C  A  R  Y  S  M  F  A  A  K  L         p.5940

          .         .         .       | 84 .         .         .    g.411639
 CTGACTCACATGATGGCAGCCAGCTTAGGTACACAG | ATTCTGTTTCTGGCGTCTGCATAC    c.17880
 L  T  H  M  M  A  A  S  L  G  T  Q   | I  L  F  L  A  S  A  Y      p.5960

          .         .         .         .         .         .       g.411699
 GCAAGTCCCCAACTCGCTGAGGAGAGCTGTTCAGCTATGGCTGCTGTCACACATTACCTG       c.17940
 A  S  P  Q  L  A  E  E  S  C  S  A  M  A  A  V  T  H  Y  L         p.5980

          .         .         .    | 85    .         .         .    g.431571
 TATCTTTGCCAGTTTAGCTGGATGCTCATTCAG | TCTGTGAATTTCTGGTACGTGCTGGTG    c.18000
 Y  L  C  Q  F  S  W  M  L  I  Q   | S  V  N  F  W  Y  V  L  V      p.6000

          .         .         .         .         .         .       g.431631
 ATGAATGATGAGCACACAGAGAGGCGATATCTGCTGTTTTTCCTTCTGAGTTGGGGACTA       c.18060
 M  N  D  E  H  T  E  R  R  Y  L  L  F  F  L  L  S  W  G  L         p.6020

          .         .         .         .         .         .       g.431691
 CCAGCTTTTGTGGTGATTCTCCTCATAGTTATTTTGAAAGGAATCTATCATCAGAGCATG       c.18120
 P  A  F  V  V  I  L  L  I  V  I  L  K  G  I  Y  H  Q  S  M         p.6040

          .         .         .   | 86     .         .         .    g.518675
 TCACAGATCTATGGACTCATTCATGGTGACCT | GTGTTTTATTCCAAACGTCTATGCTGCT    c.18180
 S  Q  I  Y  G  L  I  H  G  D  L  |  C  F  I  P  N  V  Y  A  A      p.6060

          .         .         .         .         .         .       g.518735
 TTGTTCACTGCAGCTCTTGTTCCTTTGACGTGCCTCGTGGTGGTGTTCGTGGTGTTCATC       c.18240
 L  F  T  A  A  L  V  P  L  T  C  L  V  V  V  F  V  V  F  I         p.6080

          .         .         .         .         .         .       g.518795
 CATGCCTACCAGGTGAAGCCACAGTGGAAAGCATATGATGATGTCTTCAGAGGAAGGACA       c.18300
 H  A  Y  Q  V  K  P  Q  W  K  A  Y  D  D  V  F  R  G  R  T         p.6100

          . | 87       .         .         .         .         .    g.548469
 AATGCTGCAG | AAATTCCACTGATTTTATATCTCTTTGCTCTGATTTCCGTGACATGGCTT    c.18360
 N  A  A  E |   I  P  L  I  L  Y  L  F  A  L  I  S  V  T  W  L      p.6120

          .         .         .         .         .         .       g.548529
 TGGGGAGGACTACACATGGCCTACAGACACTTCTGGATGTTGGTTCTCTTTGTCATTTTC       c.18420
 W  G  G  L  H  M  A  Y  R  H  F  W  M  L  V  L  F  V  I  F         p.6140

          .   | 88     .         .         .         .         .    g.596278
 AACAGTCTGCAG | GGACTTTATGTTTTCATGGTTTATTTCATTTTACACAACCAAATGTGT    c.18480
 N  S  L  Q   | G  L  Y  V  F  M  V  Y  F  I  L  H  N  Q  M  C      p.6160

          .         .         .         .         .         .       g.596338
 TGCCCTATGAAGGCCAGTTACACTGTGGAAATGAATGGGCATCCTGGACCCAGCACAGCC       c.18540
 C  P  M  K  A  S  Y  T  V  E  M  N  G  H  P  G  P  S  T  A         p.6180

          .         .         .         .         .         .       g.596398
 TTTTTCACGCCCGGGAGTGGAATGCCTCCTGCTGGAGGGGAAATCAGCAAGTCCACCCAG       c.18600
 F  F  T  P  G  S  G  M  P  P  A  G  G  E  I  S  K  S  T  Q         p.6200

          .         .     | 89   .         .         .         .    g.599457
 AATCTCATCGGTGCTATGGAGGAG | GTGCCACCTGACTGGGAGAGAGCATCCTTCCAACAG    c.18660
 N  L  I  G  A  M  E  E   | V  P  P  D  W  E  R  A  S  F  Q  Q      p.6220

          .         .         .         .         .         .       g.599517
 GGCAGTCAGGCCAGCCCTGATTTAAAGCCAAGTCCACAAAATGGAGCCACGTTCCCGTCC       c.18720
 G  S  Q  A  S  P  D  L  K  P  S  P  Q  N  G  A  T  F  P  S         p.6240

          .         .         .         .         .         .       g.599577
 TCTGGAGGATATGGCCAGGGGTCACTGATAGCCGATGAGGAGTCCCAGGAGTTTGATGAT       c.18780
 S  G  G  Y  G  Q  G  S  L  I  A  D  E  E  S  Q  E  F  D  D         p.6260

          .         .   | 90     .         .         .         .    g.610020
 TTAATATTTGCATTAAAAACTG | GTGCTGGTCTCAGTGTCAGTGATAATGAATCTGGTCAA    c.18840
 L  I  F  A  L  K  T  G |   A  G  L  S  V  S  D  N  E  S  G  Q      p.6280

          .         .         .         .         .         .       g.610080
 GGCAGCCAGGAGGGGGGCACCTTGACTGACTCCCAGATCGTGGAGCTCAGGAGGATACCC       c.18900
 G  S  Q  E  G  G  T  L  T  D  S  Q  I  V  E  L  R  R  I  P         p.6300

          .         .                                               g.610101
 ATCGCCGACACTCACCTGTAG                                              c.18921
 I  A  D  T  H  L  X                                                p.6306

          .         .         .         .         .         .       g.610161
 cacctcactaaccattcgactgagcacactttcatatttgtatcagcttttgtgctaaaa       c.*60

          .         .         .         .         .         .       g.610221
 ctctctaagtacatccacctgtgtaataggaacctgtgaattgtactggatgattaatac       c.*120

          .         .         .         .         .         .       g.610281
 aaacgtgattgttgtatttggagtataaattactgattgtatgtgacctgaaaattcact       c.*180

          .         .         .         .         .         .       g.610341
 gctataagaaaggtggagtcagtttgtatcagttaataggatgttcatattccaaggata       c.*240

          .         .         .         .         .         .       g.610401
 ttagttgtttttttaatcatcctatatggctaacattgtttaatgaaagtaataatcaat       c.*300

          .                                                         g.610417
 aaagcaatagaatcta                                                   c.*316

 (downstream sequence)

Legend:
Nucleotide numbering, following the HGVS rules for a coding DNA reference sequence, is shown at the right of the sequence and counts the A of the ATG translation-initiating methionine as c.1. Every 10th nucleotide is marked by a "." above the sequence.

The G protein-coupled receptor 98 protein sequence is shown below the coding DNA sequence; its numbering, also at the right, starts at 1 for the translation-initiating methionine. Every 10th amino acid is shown in bold.

Intron positions are indicated by a vertical line separating the two flanking exons, and the exon number is given above the first nucleotide(s) of each exon. The start of the first exon (transcription initiation site) is indicated by a '\', and the end of the last exon (poly-A addition site) by a '/'.

To aid the description of frameshift variants, all stop codons in the +1 frame are shown in bold and all stop codons in the +2 frame are underlined.
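
The numbering rules in the legend reduce to simple arithmetic, sketched below in Python. This is an illustration only, not part of LOVD: the function names are hypothetical, and it assumes that the "+1 frame" is read starting at c.2 and the "+2 frame" at c.3. The example sequence is the first 60 coding nucleotides (c.1 to c.60) of this file.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codon_number(c_pos):
    """Return the protein position whose codon contains coding position c_pos.

    c.1 is the A of the initiating ATG, so c.1-c.3 map to p.1, c.4-c.6 to p.2, ...
    """
    if c_pos < 1:
        raise ValueError("only positions c.1 and higher lie in a codon")
    return (c_pos - 1) // 3 + 1


def shifted_frame_stops(cds, shift):
    """Return c. positions of the first base of each stop codon in a shifted frame.

    shift=1 reads codons starting at c.2, shift=2 starting at c.3
    (an assumption about what the legend calls the +1 and +2 frames).
    """
    cds = cds.upper()
    return [i + 1 for i in range(shift, len(cds) - 2, 3)
            if cds[i:i + 3] in STOP_CODONS]


# First coding line of this file (c.1 to c.60).
first_60 = "ATGTCGGTGTTCCTGGGGCCAGGGATGCCCTCTGCATCTTTATTAGTAAATCTTCTTTCA"

print(codon_number(60))      # 20, matching the p.20 label on that line
print(codon_number(18918))   # 6306, the last amino acid (L) before the stop
print(codon_number(18921))   # 6307, the position of the stop codon itself
print(shifted_frame_stops(first_60, 1))
print(shifted_frame_stops(first_60, 2))
```

The same arithmetic explains why the final coding line is labelled c.18921 but p.6306: the stop codon occupies the 6307th codon and is not counted as an amino acid.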

©2004-2014 Leiden University Medical Center