Heparan sulfate proteoglycan 2 (HSPG2) - coding DNA reference sequence

(used for variant description)

(last modified June 21, 2015)


This file was created to facilitate the description of sequence variants in the HSPG2 gene on transcript NM_005529.5, using a coding DNA reference sequence that follows the HGVS recommendations.

The sequence was taken from NG_016740.1, covering HSPG2 transcript NM_005529.5.
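
As a minimal sketch of how the c. (coding DNA) and p. (protein) positions printed at the right of each row relate, the Python snippet below maps a coding position to its codon number and translates the first 60 coding nucleotides with the standard genetic code. The helper names codon_number and translate are hypothetical and chosen only for illustration; they are not part of this file or of any particular tool.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def codon_number(c_pos: int) -> int:
    """Return the p. position of the codon containing coding position c_pos.

    Assumes c_pos >= 1, i.e. a position at or after the A of the ATG start codon.
    """
    if c_pos < 1:
        raise ValueError("only positions inside the coding sequence (c.1 onward) map to a codon")
    return (c_pos - 1) // 3 + 1


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, ignoring a trailing partial codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First row of the coding sequence in this file (c.1 to c.60).
first_row = "ATGGGGTGGCGGGCGGCGGGCGCGCTGCTGCTGGCGCTGCTGCTGCACGGGCGGCTGCTG"

print(translate(first_row))   # MGWRAAGALLLALLLHGRLL
print(codon_number(60))       # 20
```

The translation agrees with the amino acid row printed under the first coding row, and c.60 falls in codon 20, matching the p.20 label on that row.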


 (upstream sequence)
                     .         .         .         .                g.5040
                     gcgcggagcgagcgagcgagagagcggcgcgggccgggcc       c.-1

          .         .         .         .         .         .       g.5100
 ATGGGGTGGCGGGCGGCGGGCGCGCTGCTGCTGGCGCTGCTGCTGCACGGGCGGCTGCTG       c.60
 M  G  W  R  A  A  G  A  L  L  L  A  L  L  L  H  G  R  L  L         p.20

     | 02    .         .         .         .         .         .    g.46004
 GCG | GTGACCCATGGGCTGAGGGCATACGATGGCTTGTCTCTGCCTGAGGACATAGAGACC    c.120
 A   | V  T  H  G  L  R  A  Y  D  G  L  S  L  P  E  D  I  E  T      p.40

          .         .         .         .         .         .       g.46064
 GTCACAGCAAGCCAAATGCGCTGGACACATTCGTACCTTTCTGATGATGAGGACATGCTG       c.180
 V  T  A  S  Q  M  R  W  T  H  S  Y  L  S  D  D  E  D  M  L         p.60

          .          | 03        .         .         .         .    g.46332
 GCTGACAGCATCTCAGGAG | ACGACCTGGGCAGTGGGGACCTGGGCAGCGGGGACTTCCAG    c.240
 A  D  S  I  S  G  D |   D  L  G  S  G  D  L  G  S  G  D  F  Q      p.80

      | 04   .         .         .         .         .         .    g.51619
 ATGG | TTTATTTCCGAGCCCTGGTGAATTTCACTCGCTCCATCGAGTACAGCCCTCAGCTG    c.300
 M  V |   Y  F  R  A  L  V  N  F  T  R  S  I  E  Y  S  P  Q  L      p.100

          .         .         .         .         .     | 05   .    g.51778
 GAGGATGCAGGCTCCAGAGAGTTCCGAGAGGTGTCCGAGGCTGTGGTAGACACG | CTGGAG    c.360
 E  D  A  G  S  R  E  F  R  E  V  S  E  A  V  V  D  T   | L  E      p.120

          .         .         .         .         .    | 06    .    g.52123
 TCGGAGTACTTGAAAATTCCCGGAGACCAGGTTGTCAGTGTGGTGTTCATCAA | GGAGCTG    c.420
 S  E  Y  L  K  I  P  G  D  Q  V  V  S  V  V  F  I  K  |  E  L      p.140

          .         .         .         .         .         .       g.52183
 GATGGCTGGGTTTTTGTGGAGCTGGATGTGGGCTCGGAAGGGAATGCGGATGGGGCTCAG       c.480
 D  G  W  V  F  V  E  L  D  V  G  S  E  G  N  A  D  G  A  Q         p.160

          .         .         .         .         .         .       g.52243
 ATTCAGGAGATGCTGCTCAGGGTCATCTCCAGCGGCTCTGTGGCCTCCTACGTCACCTCT       c.540
 I  Q  E  M  L  L  R  V  I  S  S  G  S  V  A  S  Y  V  T  S         p.180

          .         .         .     | 07   .         .         .    g.54217
 CCCCAGGGATTCCAGTTCCGACGCCTGGGCACAG | TGCCCCAGTTCCCAAGAGCCTGCACG    c.600
 P  Q  G  F  Q  F  R  R  L  G  T  V |   P  Q  F  P  R  A  C  T      p.200

          .         .         .         .         .         .       g.54277
 GAGGCCGAGTTTGCCTGCCACAGCTACAATGAGTGTGTGGCCCTGGAGTATCGCTGTGAC       c.660
 E  A  E  F  A  C  H  S  Y  N  E  C  V  A  L  E  Y  R  C  D         p.220

          .         .         .         .    | 08    .         .    g.54600
 CGGCGGCCCGACTGCAGGGACATGTCTGATGAGCTCAATTGTG | AGGAGCCAGTCCTGGGT    c.720
 R  R  P  D  C  R  D  M  S  D  E  L  N  C  E |   E  P  V  L  G      p.240

          .         .         .         .         .         .       g.54660
 ATCAGCCCCACATTCTCTCTCCTTGTGGAGACGACATCTTTACCGCCCCGGCCAGAGACA       c.780
 I  S  P  T  F  S  L  L  V  E  T  T  S  L  P  P  R  P  E  T         p.260

          .         .         .         .         .         .       g.54720
 ACCATCATGCGACAGCCACCAGTCACCCACGCTCCTCAGCCCCTGCTTCCCGGTTCCGTC       c.840
 T  I  M  R  Q  P  P  V  T  H  A  P  Q  P  L  L  P  G  S  V         p.280

          .         .         .         .         .         .       g.54780
 AGGCCCCTGCCCTGTGGGCCCCAGGAGGCCGCATGCCGCAATGGGCACTGCATCCCCAGA       c.900
 R  P  L  P  C  G  P  Q  E  A  A  C  R  N  G  H  C  I  P  R         p.300

          .         .         .         .         .         | 09    g.54925
 GACTACCTCTGCGACGGACAGGAGGACTGCGAGGACGGCAGCGATGAGCTAGACTGTG | GC    c.960
 D  Y  L  C  D  G  Q  E  D  C  E  D  G  S  D  E  L  D  C  G |       p.320

          .         .         .         .         .         .       g.54985
 CCCCCGCCACCCTGTGAGCCCAACGAGTTCCCCTGCGGGAATGGACATTGTGCCCTCAAG       c.1020
 P  P  P  P  C  E  P  N  E  F  P  C  G  N  G  H  C  A  L  K         p.340

          .         .         .         .         .         | 10    g.56808
 CTGTGGCGCTGCGATGGTGACTTTGACTGTGAGGACCGAACTGATGAAGCCAACTGCC | CC    c.1080
 L  W  R  C  D  G  D  F  D  C  E  D  R  T  D  E  A  N  C  P |       p.360

          .         .         .         .         .         .       g.56868
 ACCAAGCGTCCTGAGGAAGTGTGCGGGCCCACACAGTTCCGATGCGTCTCTACCAACATG       c.1140
 T  K  R  P  E  E  V  C  G  P  T  Q  F  R  C  V  S  T  N  M         p.380

          .         .         .         .         .         .       g.56928
 TGCATCCCAGCCAGCTTCCACTGTGACGAGGAGAGCGACTGTCCTGACCGGAGCGACGAG       c.1200
 C  I  P  A  S  F  H  C  D  E  E  S  D  C  P  D  R  S  D  E         p.400

          . | 11       .         .         .         .         .    g.57150
 TTTGGCTGCA | TGCCCCCCCAGGTGGTGACACCTCCCCGGGAGTCCATCCAGGCTTCCCGG    c.1260
 F  G  C  M |   P  P  Q  V  V  T  P  P  R  E  S  I  Q  A  S  R      p.420

          .         .         .         .         .         .       g.57210
 GGCCAGACAGTGACCTTCACCTGCGTGGCCATTGGCGTCCCCACCCCCATCATCAATTGG       c.1320
 G  Q  T  V  T  F  T  C  V  A  I  G  V  P  T  P  I  I  N  W         p.440

          .         .         .      | 12  .         .         .    g.57364
 AGGCTCAACTGGGGCCACATCCCCTCTCATCCCAG | GGTGACAGTGACCAGCGAGGGTGGC    c.1380
 R  L  N  W  G  H  I  P  S  H  P  R  |  V  T  V  T  S  E  G  G      p.460

          .         .         .         .         .         .       g.57424
 CGTGGCACACTGATCATCCGTGATGTGAAGGAGTCAGACCAGGGTGCCTACACCTGTGAG       c.1440
 R  G  T  L  I  I  R  D  V  K  E  S  D  Q  G  A  Y  T  C  E         p.480

          .         .         .         .         .         .       g.57484
 GCCATGAACGCCCGGGGCATGGTGTTTGGCATTCCTGACGGTGTCCTTGAGCTCGTCCCA       c.1500
 A  M  N  A  R  G  M  V  F  G  I  P  D  G  V  L  E  L  V  P         p.500

         | 13.         .         .         .         .         .    g.57636
 CAACGAG | GCCCCTGCCCTGACGGCCACTTCTACCTGGAGCACAGCGCCGCCTGCCTGCCC    c.1560
 Q  R  G |   P  C  P  D  G  H  F  Y  L  E  H  S  A  A  C  L  P      p.520

          .         .         .         .         .         .       g.57696
 TGCTTCTGCTTTGGCATCACCAGCGTGTGCCAGAGCACCCGCCGCTTCCGGGACCAGATC       c.1620
 C  F  C  F  G  I  T  S  V  C  Q  S  T  R  R  F  R  D  Q  I         p.540

          .         .         .     | 14   .         .         .    g.60781
 AGGCTGCGCTTTGACCAACCCGATGACTTCAAGG | GTGTGAATGTGACAATGCCTGCGCAG    c.1680
 R  L  R  F  D  Q  P  D  D  F  K  G |   V  N  V  T  M  P  A  Q      p.560

          .         .         .         .         .         .       g.60841
 CCCGGCACGCCACCCCTCTCCTCCACGCAGCTGCAGATCGACCCATCCCTGCACGAGTTC       c.1740
 P  G  T  P  P  L  S  S  T  Q  L  Q  I  D  P  S  L  H  E  F         p.580

          .         .         .         .         .         .       g.60901
 CAGCTAGTCGACCTGTCCCGCCGCTTCCTCGTCCACGACTCCTTCTGGGCTCTGCCTGAA       c.1800
 Q  L  V  D  L  S  R  R  F  L  V  H  D  S  F  W  A  L  P  E         p.600

          .         | 15         .         .         .         .    g.61464
 CAGTTCCTGGGCAACAAG | GTGGACTCCTATGGCGGCTCCCTGCGTTACAACGTGCGCTAC    c.1860
 Q  F  L  G  N  K   | V  D  S  Y  G  G  S  L  R  Y  N  V  R  Y      p.620

          .         .         .         .         .         .       g.61524
 GAGTTGGCCCGTGGCATGCTGGAGCCAGTGCAGCGGCCGGACGTGGTCCTCATGGGTGCC       c.1920
 E  L  A  R  G  M  L  E  P  V  Q  R  P  D  V  V  L  M  G  A         p.640

          .         .         .         .         .         .       g.61584
 GGGTACCGCCTCCTCTCCCGAGGCCACACACCCACCCAACCTGGTGCTCTGAACCAGCGC       c.1980
 G  Y  R  L  L  S  R  G  H  T  P  T  Q  P  G  A  L  N  Q  R         p.660

          .         | 16         .         .         .         .    g.61740
 CAGGTCCAGTTCTCTGAG | GAGCACTGGGTCCATGAGTCTGGCCGGCCGGTGCAGCGCGCG    c.2040
 Q  V  Q  F  S  E   | E  H  W  V  H  E  S  G  R  P  V  Q  R  A      p.680

          .         .         .         .         .         .       g.61800
 GAGCTGCTGCAGGTGCTGCAGAGCCTGGAGGCCGTGCTCATCCAGACCGTGTACAACACC       c.2100
 E  L  L  Q  V  L  Q  S  L  E  A  V  L  I  Q  T  V  Y  N  T         p.700

          .         .         .         .         .         .       g.61860
 AAGATGGCCAGCGTGGGACTTAGCGACATCGCCATGGATACCACCGTCACCCATGCCACC       c.2160
 K  M  A  S  V  G  L  S  D  I  A  M  D  T  T  V  T  H  A  T         p.720

          .         .         .      | 17  .         .         .    g.62028
 AGCCATGGCCGTGCCCACAGTGTGGAGGAGTGCAG | ATGCCCCATTGGCTATTCTGGCTTG    c.2220
 S  H  G  R  A  H  S  V  E  E  C  R  |  C  P  I  G  Y  S  G  L      p.740

          .         .         .         .         .         .       g.62088
 TCCTGCGAGAGCTGTGATGCCCACTTCACTCGGGTGCCTGGTGGGCCCTACCTGGGCACC       c.2280
 S  C  E  S  C  D  A  H  F  T  R  V  P  G  G  P  Y  L  G  T         p.760

          .         .         .         .         .         .       g.62148
 TGCTCTGGTTGCAATTGCAATGGCCATGCCAGCTCCTGTGACCCTGTGTATGGCCACTGC       c.2340
 C  S  G  C  N  C  N  G  H  A  S  S  C  D  P  V  Y  G  H  C         p.780

     | 18    .         .         .         .         .         .    g.63193
 CTG | AATTGCCAGCACAACACGGAGGGGCCACAGTGCAACAAGTGCAAGGCTGGCTTCTTT    c.2400
 L   | N  C  Q  H  N  T  E  G  P  Q  C  N  K  C  K  A  G  F  F      p.800

          .         .         .         .         .         .       g.63253
 GGGGACGCCATGAAGGCCACGGCCACTTCCTGCCGGCCCTGCCCTTGCCCATACATCGAT       c.2460
 G  D  A  M  K  A  T  A  T  S  C  R  P  C  P  C  P  Y  I  D         p.820

          .  | 19      .         .         .         .         .    g.63643
 GCCTCCCGCAG | ATTCTCAGACACTTGCTTCCTGGACACGGATGGCCAAGCCACATGTGAC    c.2520
 A  S  R  R  |  F  S  D  T  C  F  L  D  T  D  G  Q  A  T  C  D      p.840

          .         .         .         | 20         .         .    g.63788
 GCCTGTGCCCCAGGCTACACTGGCCGCCGCTGTGAGAG | CTGTGCCCCCGGATACGAGGGC    c.2580
 A  C  A  P  G  Y  T  G  R  R  C  E  S  |  C  A  P  G  Y  E  G      p.860

          .         .         .        | 21.         .         .    g.64027
 AACCCCATCCAGCCCGGCGGGAAGTGCAGGCCCGTCA | ACCAGGAGATTGTGCGCTGTGAC    c.2640
 N  P  I  Q  P  G  G  K  C  R  P  V  N |   Q  E  I  V  R  C  D      p.880

          .         .         .         .      | 22  .         .    g.65620
 GAGCGTGGCAGCATGGGGACCTCCGGGGAGGCCTGCCGCTGTAAG | AACAATGTGGTGGGG    c.2700
 E  R  G  S  M  G  T  S  G  E  A  C  R  C  K   | N  N  V  V  G      p.900

          .         .         .         .         .         .       g.65680
 CGCTTGTGCAATGAATGTGCTGACGGCTCTTTCCACCTGAGTACCCGAAACCCCGATGGC       c.2760
 R  L  C  N  E  C  A  D  G  S  F  H  L  S  T  R  N  P  D  G         p.920

          .         .         .         .         .         .       g.65740
 TGCCTCAAGTGCTTCTGCATGGGTGTCAGTCGCCACTGCACCAGCTCTTCATGGAGCCGT       c.2820
 C  L  K  C  F  C  M  G  V  S  R  H  C  T  S  S  S  W  S  R         p.940

        | 23 .         .         .         .         .         .    g.65906
 GCCCAG | TTGCATGGGGCCTCTGAGGAGCCTGGTCACTTCAGCCTGACCAACGCCGCAAGC    c.2880
 A  Q   | L  H  G  A  S  E  E  P  G  H  F  S  L  T  N  A  A  S      p.960

          .         .         .         .         .         .       g.65966
 ACCCACACCACCAACGAGGGCATCTTCTCCCCCACGCCCGGGGAACTGGGATTCTCCTCC       c.2940
 T  H  T  T  N  E  G  I  F  S  P  T  P  G  E  L  G  F  S  S         p.980

          .         .         .         .         .         .       g.66026
 TTCCACAGACTCTTATCTGGACCCTACTTCTGGAGCCTCCCTTCACGCTTCCTGGGGGAC       c.3000
 F  H  R  L  L  S  G  P  Y  F  W  S  L  P  S  R  F  L  G  D         p.1000

     | 24    .         .         .         .         .         .    g.66272
 AAG | GTGACCTCCTATGGAGGAGAGCTGCGCTTCACAGTGACCCAGAGGTCCCAGCCGGGC    c.3060
 K   | V  T  S  Y  G  G  E  L  R  F  T  V  T  Q  R  S  Q  P  G      p.1020

          .         .         .         .         .         .       g.66332
 TCCACACCCCTGCACGGGCAGCCGTTGGTGGTGCTGCAAGGTAACAACATCATCCTAGAG       c.3120
 S  T  P  L  H  G  Q  P  L  V  V  L  Q  G  N  N  I  I  L  E         p.1040

          .         .         .         .         .         .       g.66392
 CACCATGTGGCCCAGGAGCCCAGCCCCGGCCAGCCCAGCACCTTCATTGTGCCTTTCCGG       c.3180
 H  H  V  A  Q  E  P  S  P  G  Q  P  S  T  F  I  V  P  F  R         p.1060

     | 25    .         .         .         .         .         .    g.66567
 GAG | CAAGCATGGCAGCGGCCCGATGGGCAGCCAGCCACACGGGAGCACCTGCTGATGGCA    c.3240
 E   | Q  A  W  Q  R  P  D  G  Q  P  A  T  R  E  H  L  L  M  A      p.1080

          .         .         .         .         .         .       g.66627
 CTGGCAGGCATCGACACCCTCCTGATCCGAGCATCCTACGCCCAGCAGCCCGCTGAGAGC       c.3300
 L  A  G  I  D  T  L  L  I  R  A  S  Y  A  Q  Q  P  A  E  S         p.1100

    | 26     .         .         .         .         .         .    g.67313
 AG | GGTCTCTGGCATCAGCATGGACGTGGCTGTGCCCGAGGAAACCGGCCAGGACCCCGCG    c.3360
 R  |  V  S  G  I  S  M  D  V  A  V  P  E  E  T  G  Q  D  P  A      p.1120

          .         .         .         .         .     | 27   .    g.67534
 CTGGAAGTGGAACAGTGCTCCTGCCCACCCGGGTACCGTGGGCCGTCCTGCCAG | GACTGT    c.3420
 L  E  V  E  Q  C  S  C  P  P  G  Y  R  G  P  S  C  Q   | D  C      p.1140

          .         .         .         .         .         .       g.67594
 GACACAGGCTACACACGCACGCCCAGTGGCCTCTACCTGGGTACCTGTGAACGCTGCAGC       c.3480
 D  T  G  Y  T  R  T  P  S  G  L  Y  L  G  T  C  E  R  C  S         p.1160

          .         .         .         .         | 28         .    g.67736
 TGCCATGGCCACTCAGAGGCCTGCGAGCCAGAAACAGGTGCCTGCCAG | GGCTGCCAGCAT    c.3540
 C  H  G  H  S  E  A  C  E  P  E  T  G  A  C  Q   | G  C  Q  H      p.1180

          .         .         .         .         .         .       g.67796
 CACACGGAGGGCCCTCGGTGTGAGCAGTGCCAGCCAGGATACTACGGGGACGCCCAGCGG       c.3600
 H  T  E  G  P  R  C  E  Q  C  Q  P  G  Y  Y  G  D  A  Q  R         p.1200

          .         .         .         .         .       | 29 .    g.68250
 GGGACACCACAGGACTGCCAGCTGTGCCCCTGCTACGGAGACCCTGCTGCCGGCCA | GGCT    c.3660
 G  T  P  Q  D  C  Q  L  C  P  C  Y  G  D  P  A  A  G  Q  |  A      p.1220

          .         .         .         .         .         .       g.68310
 GCCCACACTTGTTTTCTGGACACAGACGGCCACCCCACCTGTGATGCGTGCTCCCCAGGC       c.3720
 A  H  T  C  F  L  D  T  D  G  H  P  T  C  D  A  C  S  P  G         p.1240

          .         .    | 30    .         .         .         .    g.68870
 CACAGTGGGCGTCACTGTGAGAG | GTGCGCCCCTGGCTACTATGGCAACCCCAGCCAGGGC    c.3780
 H  S  G  R  H  C  E  R  |  C  A  P  G  Y  Y  G  N  P  S  Q  G      p.1260

          .    | 31    .         .         .         .         .    g.69213
 CAGCCATGCCAGA | GAGACAGCCAGGTGCCAGGGCCCATAGGCTGCAACTGTGACCCCCAA    c.3840
 Q  P  C  Q  R |   D  S  Q  V  P  G  P  I  G  C  N  C  D  P  Q      p.1280

          .         .         .         .         | 32         .    g.69509
 GGCAGCGTCAGCAGCCAGTGTGATGCTGCTGGTCAGTGCCAGTGCAAG | GCCCAGGTGGAA    c.3900
 G  S  V  S  S  Q  C  D  A  A  G  Q  C  Q  C  K   | A  Q  V  E      p.1300

          .         .         .         .         .         .       g.69569
 GGCCTCACTTGCAGCCACTGCCGGCCCCACCACTTCCACCTGAGTGCCAGCAACCCAGAC       c.3960
 G  L  T  C  S  H  C  R  P  H  H  F  H  L  S  A  S  N  P  D         p.1320

          .         .         .         .         .         .       g.69629
 GGCTGCCTGCCCTGCTTCTGTATGGGCATCACCCAGCAGTGCGCCAGCTCTGCCTACACA       c.4020
 G  C  L  P  C  F  C  M  G  I  T  Q  Q  C  A  S  S  A  Y  T         p.1340

           | 33        .         .         .         .         .    g.69931
 CGCCACCTG | ATCTCCACCCACTTTGCCCCTGGGGACTTCCAAGGCTTTGCCCTGGTGAAC    c.4080
 R  H  L   | I  S  T  H  F  A  P  G  D  F  Q  G  F  A  L  V  N      p.1360

          .         .         .         .         .         .       g.69991
 CCACAGCGAAACAGCCGCCTGACAGGAGAATTCACTGTGGAACCCGTGCCCGAGGGTGCC       c.4140
 P  Q  R  N  S  R  L  T  G  E  F  T  V  E  P  V  P  E  G  A         p.1380

          .         .         .         .         .         .       g.70051
 CAGCTCTCTTTTGGCAACTTTGCCCAACTCGGCCATGAGTCCTTCTACTGGCAGCTGCCG       c.4200
 Q  L  S  F  G  N  F  A  Q  L  G  H  E  S  F  Y  W  Q  L  P         p.1400

          .         .  | 34      .         .         .         .    g.76487
 GAGACATACCAGGGAGACAAG | GTGGCGGCCTACGGTGGGAAGTTGCGATACACCCTCTCC    c.4260
 E  T  Y  Q  G  D  K   | V  A  A  Y  G  G  K  L  R  Y  T  L  S      p.1420

          .         .         .         .         .     | 35   .    g.76898
 TACACAGCAGGCCCACAGGGCAGCCCACTCTCTGACCCCGATGTGCAGATCACG | GGCAAC    c.4320
 Y  T  A  G  P  Q  G  S  P  L  S  D  P  D  V  Q  I  T   | G  N      p.1440

          .         .         .         .         .         .       g.76958
 AACATCATGCTAGTGGCCTCCCAGCCAGCGCTGCAGGGCCCTGAGAGGAGGAGCTACGAG       c.4380
 N  I  M  L  V  A  S  Q  P  A  L  Q  G  P  E  R  R  S  Y  E         p.1460

          .      | 36  .         .         .         .         .    g.77229
 ATCATGTTCCGAGAG | GAATTCTGGCGCCGGCCCGATGGGCAGCCGGCCACACGCGAGCAC    c.4440
 I  M  F  R  E   | E  F  W  R  R  P  D  G  Q  P  A  T  R  E  H      p.1480

          .         .         .         .         .         .       g.77289
 CTCCTGATGGCACTGGCCGACCTGGATGAGCTCCTGATCCGGGCCACGTTCTCCTCCGTG       c.4500
 L  L  M  A  L  A  D  L  D  E  L  L  I  R  A  T  F  S  S  V         p.1500

          .         .         .         .         .         .       g.77349
 CCGCTGGCGGCCAGCATCAGCGCAGTCAGCCTGGAGGTCGCCCAGCCGGGGCCCTCAAAC       c.4560
 P  L  A  A  S  I  S  A  V  S  L  E  V  A  Q  P  G  P  S  N         p.1520

          .         .         .         .         .         .       g.77409
 AGACCCCGCGCCCTCGAGGTGGAGGAGTGCCGCTGCCCGCCAGGCTACATCGGTCTGTCC       c.4620
 R  P  R  A  L  E  V  E  E  C  R  C  P  P  G  Y  I  G  L  S         p.1540

        | 37 .         .         .         .         .         .    g.78098
 TGCCAG | GACTGTGCCCCCGGCTACACGCGCACCGGGAGTGGGCTCTACCTCGGCCACTGC    c.4680
 C  Q   | D  C  A  P  G  Y  T  R  T  G  S  G  L  Y  L  G  H  C      p.1560

          .         .         .         .         .         .       g.78158
 GAGCTATGTGAATGCAATGGCCACTCAGACCTGTGCCACCCAGAGACTGGGGCCTGCTCG       c.4740
 E  L  C  E  C  N  G  H  S  D  L  C  H  P  E  T  G  A  C  S         p.1580

  | 38       .         .         .         .         .         .    g.80202
  | CAATGCCAGCACAACGCCGCAGGGGAGTTCTGCGAGCTTTGTGCCCCTGGCTACTACGGA    c.4800
  | Q  C  Q  H  N  A  A  G  E  F  C  E  L  C  A  P  G  Y  Y  G      p.1600

          .         .         .         .         .         .       g.80262
 GATGCCACAGCCGGGACGCCTGAGGACTGCCAGCCCTGTGCCTGCCCACTGACCAACCCA       c.4860
 D  A  T  A  G  T  P  E  D  C  Q  P  C  A  C  P  L  T  N  P         p.1620

          | 39         .         .         .         .         .    g.80466
 GAGAACAT | GTTTTCCCGCACCTGTGAGAGCCTGGGAGCCGGCGGGTACCGCTGCACGGCC    c.4920
 E  N  M  |  F  S  R  T  C  E  S  L  G  A  G  G  Y  R  C  T  A      p.1640

          .         .         .      | 40  .         .         .    g.82047
 TGCGAACCCGGCTACACTGGCCAGTACTGTGAGCA | GTGTGGCCCAGGTTACGTGGGTAAC    c.4980
 C  E  P  G  Y  T  G  Q  Y  C  E  Q  |  C  G  P  G  Y  V  G  N      p.1660

          .         .         .     | 41   .         .         .    g.82281
 CCCAGTGTGCAAGGGGGCCAGTGCCTGCCAGAGA | CAAACCAAGCCCCACTGGTGGTCGAG    c.5040
 P  S  V  Q  G  G  Q  C  L  P  E  T |   N  Q  A  P  L  V  V  E      p.1680

          .         .         .         .         .         .       g.82341
 GTCCATCCTGCTCGAAGCATAGTGCCCCAAGGTGGCTCCCACTCCCTGCGGTGTCAGGTC       c.5100
 V  H  P  A  R  S  I  V  P  Q  G  G  S  H  S  L  R  C  Q  V         p.1700

          .         .         .         .         .         .       g.82401
 AGTGGGAGCCCACCCCACTACTTCTATTGGTCCCGTGAGGATGGGCGGCCTGTGCCCAGC       c.5160
 S  G  S  P  P  H  Y  F  Y  W  S  R  E  D  G  R  P  V  P  S         p.1720

          .         .   | 42     .         .         .         .    g.82619
 GGCACCCAGCAGCGACATCAAG | GCTCCGAGCTCCACTTCCCCAGCGTCCAGCCCTCGGAT    c.5220
 G  T  Q  Q  R  H  Q  G |   S  E  L  H  F  P  S  V  Q  P  S  D      p.1740

          .         .         .         .         .         .       g.82679
 GCTGGGGTCTACATTTGCACCTGCCGTAATCTCCACCAATCCAATACCAGCCGGGCAGAG       c.5280
 A  G  V  Y  I  C  T  C  R  N  L  H  Q  S  N  T  S  R  A  E         p.1760

          .    | 43    .         .         .         .         .    g.84919
 CTGCTGGTCACTG | AGGCTCCAAGCAAGCCCATCACAGTGACTGTGGAGGAGCAGCGGAGC    c.5340
 L  L  V  T  E |   A  P  S  K  P  I  T  V  T  V  E  E  Q  R  S      p.1780

          .         .         .         .         .     | 44   .    g.85068
 CAGAGCGTGCGCCCCGGAGCTGACGTCACCTTCATCTGCACAGCCAAAAGCAAG | TCCCCA    c.5400
 Q  S  V  R  P  G  A  D  V  T  F  I  C  T  A  K  S  K   | S  P      p.1800

          .         .         .         .         .         .       g.85128
 GCCTATACCCTGGTGTGGACCCGCCTGCACAACGGGAAACTGCCCACCCGAGCCATGGAT       c.5460
 A  Y  T  L  V  W  T  R  L  H  N  G  K  L  P  T  R  A  M  D         p.1820

          .         .         .         .         .         .       g.85188
 TTCAATGGCATCCTGACCATTCGCAACGTCCAGCTGAGTGATGCAGGCACCTACGTGTGC       c.5520
 F  N  G  I  L  T  I  R  N  V  Q  L  S  D  A  G  T  Y  V  C         p.1840

          .         .         .         .         .      | 45  .    g.86350
 ACCGGCTCCAACATGTTTGCCATGGACCAGGGCACAGCCACTCTACATGTGCAGG | CCTCG    c.5580
 T  G  S  N  M  F  A  M  D  Q  G  T  A  T  L  H  V  Q  A |   S      p.1860

          .         .         .         .         .         .       g.86410
 GGCACCTTGTCCGCCCCCGTGGTCTCCATCCATCCGCCACAGCTCACAGTGCAGCCCGGG       c.5640
 G  T  L  S  A  P  V  V  S  I  H  P  P  Q  L  T  V  Q  P  G         p.1880

          .         .         .         .         .         .       g.86470
 CAACTGGCGGAGTTCCGCTGCAGCGCCACAGGGAGCCCCACGCCCACCCTCGAGTGGACA       c.5700
 Q  L  A  E  F  R  C  S  A  T  G  S  P  T  P  T  L  E  W  T         p.1900

   | 46      .         .         .         .         .         .    g.86641
 G | GGGGCCCCGGCGGCCAGCTCCCTGCGAAGGCACAAATCCACGGCGGCATCCTGCGCCTG    c.5760
 G |   G  P  G  G  Q  L  P  A  K  A  Q  I  H  G  G  I  L  R  L      p.1920

          .         .         .         .         .         .       g.86701
 CCAGCTGTCGAGCCCACGGATCAGGCCCAGTACTTGTGCCGAGCCCACAGCAGCGCTGGG       c.5820
 P  A  V  E  P  T  D  Q  A  Q  Y  L  C  R  A  H  S  S  A  G         p.1940

          .         .         .     | 47   .         .         .    g.86837
 CAGCAGGTGGCCAGGGCTGTGCTCCACGTGCATG | GGGGCGGTGGGCCCAGAGTCCAAGTG    c.5880
 Q  Q  V  A  R  A  V  L  H  V  H  G |   G  G  G  P  R  V  Q  V      p.1960

          .         .         .         .         .         .       g.86897
 AGCCCAGAGAGGACCCAGGTCCACGCAGGCCGCACCGTCAGGCTGTACTGCAGGGCTGCA       c.5940
 S  P  E  R  T  Q  V  H  A  G  R  T  V  R  L  Y  C  R  A  A         p.1980

          .         .         .         .         .        | 48.    g.87277
 GGCGTGCCTAGCGCCACCATCACCTGGAGGAAGGAAGGGGGCAGCCTCCCACCACAG | GCC    c.6000
 G  V  P  S  A  T  I  T  W  R  K  E  G  G  S  L  P  P  Q   | A      p.2000

          .         .         .         .         .         .       g.87337
 CGGTCAGAGCGCACAGACATCGCGACACTGCTCATCCCAGCCATCACGACTGCTGACGCC       c.6060
 R  S  E  R  T  D  I  A  T  L  L  I  P  A  I  T  T  A  D  A         p.2020

          .         .         .         .         .         .       g.87397
 GGCTTCTACCTCTGCGTGGCCACCAGCCCTGCAGGCACTGCCCAGGCCCGGATCCAAGTG       c.6120
 G  F  Y  L  C  V  A  T  S  P  A  G  T  A  Q  A  R  I  Q  V         p.2040

          .    | 49    .         .         .         .         .    g.87539
 GTTGTCCTTTCAG | CCTCAGATGCCAGCCCACCGCCGGTCAAGATTGAGTCCTCATCGCCT    c.6180
 V  V  L  S  A |   S  D  A  S  P  P  P  V  K  I  E  S  S  S  P      p.2060

          .         .         .         .         .         .       g.87599
 TCTGTGACAGAAGGGCAAACACTCGACCTCAACTGTGTGGTGGCAGGGTCAGCCCATGCC       c.6240
 S  V  T  E  G  Q  T  L  D  L  N  C  V  V  A  G  S  A  H  A         p.2080

          .         .         .         .         | 50         .    g.87926
 CAGGTCACCTGGTACAGGCGAGGGGGTAGCCTGCCTCCCCACACCCAG | GTGCACGGCTCC    c.6300
 Q  V  T  W  Y  R  R  G  G  S  L  P  P  H  T  Q   | V  H  G  S      p.2100

          .         .         .         .         .         .       g.87986
 CGTCTGCGGCTCCCCCAGGTCTCACCAGCTGATTCTGGAGAATATGTGTGCCGTGTGGAG       c.6360
 R  L  R  L  P  Q  V  S  P  A  D  S  G  E  Y  V  C  R  V  E         p.2120

          .         .         .         .         .         .       g.88046
 AATGGATCGGGCCCCAAGGAGGCCTCCATTACTGTGTCTGTGCTCCACGGCACCCATTCT       c.6420
 N  G  S  G  P  K  E  A  S  I  T  V  S  V  L  H  G  T  H  S         p.2140

          .          | 51        .         .         .         .    g.89228
 GGCCCCAGCTACACCCCAG | TGCCCGGCAGCACCCGGCCCATCCGCATCGAGCCCTCCTCC    c.6480
 G  P  S  Y  T  P  V |   P  G  S  T  R  P  I  R  I  E  P  S  S      p.2160

          .         .         .         .         .         .       g.89288
 TCACACGTGGCGGAAGGGCAGACCCTGGATCTGAACTGCGTGGTGCCCGGGCAGGCCCAC       c.6540
 S  H  V  A  E  G  Q  T  L  D  L  N  C  V  V  P  G  Q  A  H         p.2180

          .         .         .         .         .  | 52      .    g.89434
 GCCCAGGTCACGTGGCACAAGCGTGGGGGCAGCCTCCCTGCCCGGCACCAG | ACCCACGGC    c.6600
 A  Q  V  T  W  H  K  R  G  G  S  L  P  A  R  H  Q   | T  H  G      p.2200

          .         .         .         .         .         .       g.89494
 TCGCTGCTGCGGCTGCACCAGGTGACCCCGGCCGACTCAGGCGAGTATGTGTGCCATGTG       c.6660
 S  L  L  R  L  H  Q  V  T  P  A  D  S  G  E  Y  V  C  H  V         p.2220

          .         .         .         .         .         .       g.89554
 GTGGGCACCTCCGGCCCCCTAGAGGCCTCAGTCCTGGTCACCATCGAAGCCTCTGTCATC       c.6720
 V  G  T  S  G  P  L  E  A  S  V  L  V  T  I  E  A  S  V  I         p.2240

      | 53   .         .         .         .         .         .    g.90080
 CCTG | GACCCATCCCACCTGTCAGGATCGAGTCTTCATCCTCCACAGTGGCCGAGGGCCAG    c.6780
 P  G |   P  I  P  P  V  R  I  E  S  S  S  S  T  V  A  E  G  Q      p.2260

          .         .         .         .         .         .       g.90140
 ACCCTGGATCTGAGCTGCGTGGTGGCAGGGCAGGCCCACGCCCAGGTCACATGGTACAAG       c.6840
 T  L  D  L  S  C  V  V  A  G  Q  A  H  A  Q  V  T  W  Y  K         p.2280

          .         .         . | 54       .         .         .    g.90361
 CGTGGGGGCAGCCTCCCTGCCCGGCACCAG | GTTCGTGGCTCCCGCCTGTACATCTTCCAG    c.6900
 R  G  G  S  L  P  A  R  H  Q   | V  R  G  S  R  L  Y  I  F  Q      p.2300

          .         .         .         .         .         .       g.90421
 GCCTCACCTGCCGATGCGGGACAGTACGTCTGCCGGGCCAGCAACGGCATGGAGGCCTCC       c.6960
 A  S  P  A  D  A  G  Q  Y  V  C  R  A  S  N  G  M  E  A  S         p.2320

          .         .         .         .       | 55 .         .    g.90574
 ATCACGGTCACAGTAACTGGGACCCAGGGGGCCAACTTAGCCTACC | CTGCCGGCAGCACC    c.7020
 I  T  V  T  V  T  G  T  Q  G  A  N  L  A  Y  P |   A  G  S  T      p.2340

          .         .         .         .         .         .       g.90634
 CAGCCCATCCGCATCGAGCCCTCCTCCTCGCAAGTGGCGGAAGGGCAGACCCTGGATCTG       c.7080
 Q  P  I  R  I  E  P  S  S  S  Q  V  A  E  G  Q  T  L  D  L         p.2360

          .         .         .         .         .         .       g.90694
 AACTGCGTGGTGCCCGGGCAGTCCCATGCCCAGGTCACGTGGCACAAGCGTGGGGGCAGC       c.7140
 N  C  V  V  P  G  Q  S  H  A  Q  V  T  W  H  K  R  G  G  S         p.2380

          .         | 56         .         .         .         .    g.91801
 CTCCCTGTCCGGCACCAG | ACCCACGGCTCCCTGCTGAGACTCTACCAAGCGTCCCCCGCC    c.7200
 L  P  V  R  H  Q   | T  H  G  S  L  L  R  L  Y  Q  A  S  P  A      p.2400

          .         .         .         .         .         .       g.91861
 GACTCGGGCGAGTACGTGTGCCGAGTGTTGGGCAGCTCCGTGCCTCTAGAGGCCTCTGTC       c.7260
 D  S  G  E  Y  V  C  R  V  L  G  S  S  V  P  L  E  A  S  V         p.2420

          .         .         .     | 57   .         .         .    g.92091
 CTGGTCACCATTGAGCCTGCGGGCTCAGTGCCTG | CACTTGGGGTCACCCCCACGGTCCGG    c.7320
 L  V  T  I  E  P  A  G  S  V  P  A |   L  G  V  T  P  T  V  R      p.2440

          .         .         .         .         .         .       g.92151
 ATCGAGTCATCGTCTTCGCAAGTGGCCGAGGGGCAGACCCTGGACCTGAACTGCCTCGTT       c.7380
 I  E  S  S  S  S  Q  V  A  E  G  Q  T  L  D  L  N  C  L  V         p.2460

          .         .         .         .         .         .       g.92211
 GCTGGTCAGGCCCATGCCCAGGTCACGTGGCACAAGCGCGGGGGCAGCCTCCCGGCCCGG       c.7440
 A  G  Q  A  H  A  Q  V  T  W  H  K  R  G  G  S  L  P  A  R         p.2480

        | 58 .         .         .         .         .         .    g.93280
 CACCAG | GTGCATGGCTCGAGGCTACGCCTGCTCCAGGTGACCCCAGCTGATTCAGGGGAG    c.7500
 H  Q   | V  H  G  S  R  L  R  L  L  Q  V  T  P  A  D  S  G  E      p.2500

          .         .         .         .         .         .       g.93340
 TACGTGTGCCGTGTGGTCGGCAGCTCAGGTACCCAGGAAGCCTCAGTCCTTGTCACCATC       c.7560
 Y  V  C  R  V  V  G  S  S  G  T  Q  E  A  S  V  L  V  T  I         p.2520

          .         .      | 59  .         .         .         .    g.93498
 CAGCAGCGCCTTAGTGGCTCCCACT | CCCAGGGTGTGGCGTACCCCGTCCGCATCGAGTCC    c.7620
 Q  Q  R  L  S  G  S  H  S |   Q  G  V  A  Y  P  V  R  I  E  S      p.2540

          .         .         .         .         .         .       g.93558
 TCCTCAGCCTCCCTGGCCAATGGACACACCCTGGACCTCAACTGCCTGGTTGCCAGCCAG       c.7680
 S  S  A  S  L  A  N  G  H  T  L  D  L  N  C  L  V  A  S  Q         p.2560

          .         .         .         .         .        | 60.    g.94167
 GCTCCCCACACCATCACCTGGTATAAGCGTGGAGGCAGCTTACCCAGCCGGCACCAG | ATC    c.7740
 A  P  H  T  I  T  W  Y  K  R  G  G  S  L  P  S  R  H  Q   | I      p.2580

          .         .         .         .         .         .       g.94227
 GTGGGCTCCCGGCTGCGGATCCCTCAGGTGACTCCGGCAGACTCGGGCGAGTACGTGTGT       c.7800
 V  G  S  R  L  R  I  P  Q  V  T  P  A  D  S  G  E  Y  V  C         p.2600

          .         .         .         .         .         .       g.94287
 CACGTCAGTAACGGTGCAGGCTCCCGGGAGACCTCGCTCATCGTCACCATCCAGGGCAGC       c.7860
 H  V  S  N  G  A  G  S  R  E  T  S  L  I  V  T  I  Q  G  S         p.2620

          .    | 61    .         .         .         .         .    g.94464
 GGTTCCTCCCACG | TGCCCAGCGTCTCCCCACCGATCAGGATCGAGTCGTCTTCCCCCACG    c.7920
 G  S  S  H  V |   P  S  V  S  P  P  I  R  I  E  S  S  S  P  T      p.2640

          .         .         .         .         .         .       g.94524
 GTGGTGGAAGGGCAGACCTTGGATCTGAACTGCGTGGTCGCCAGGCAGCCCCAGGCTATC       c.7980
 V  V  E  G  Q  T  L  D  L  N  C  V  V  A  R  Q  P  Q  A  I         p.2660

          .         .         .         .      | 62  .         .    g.94780
 ATCACATGGTACAAGCGTGGGGGCAGCCTTCCCTCCCGACACCAG | ACCCATGGCTCCCAC    c.8040
 I  T  W  Y  K  R  G  G  S  L  P  S  R  H  Q   | T  H  G  S  H      p.2680

          .         .         .         .         .         .       g.94840
 CTGCGGTTGCACCAAATGTCTGTGGCTGACTCGGGCGAGTATGTGTGCCGGGCCAACAAC       c.8100
 L  R  L  H  Q  M  S  V  A  D  S  G  E  Y  V  C  R  A  N  N         p.2700

          .         .         .         .         .         .       g.94900
 AACATCGATGCCCTGGAGGCCTCCATCGTCATCTCCGTCTCCCCTAGCGCCGGCAGCCCC       c.8160
 N  I  D  A  L  E  A  S  I  V  I  S  V  S  P  S  A  G  S  P         p.2720

      | 63   .         .         .         .         .         .    g.95714
 TCCG | CCCCTGGCAGCTCCATGCCCATCAGAATTGAGTCATCCTCCTCACACGTGGCCGAA    c.8220
 S  A |   P  G  S  S  M  P  I  R  I  E  S  S  S  S  H  V  A  E      p.2740

          .         .         .         .         .         .       g.95774
 GGGGAGACCCTGGATCTGAACTGCGTGGTCCCCGGGCAGGCCCATGCCCAGGTCACTTGG       c.8280
 G  E  T  L  D  L  N  C  V  V  P  G  Q  A  H  A  Q  V  T  W         p.2760

          .         .         .       | 64 .         .         .    g.96026
 CACAAGCGTGGGGGCAGCCTCCCCAGTCACCATCAG | ACCCGCGGCTCACGGCTGCGGCTG    c.8340
 H  K  R  G  G  S  L  P  S  H  H  Q   | T  R  G  S  R  L  R  L      p.2780

          .         .         .         .         .         .       g.96086
 CACCATGTGTCCCCGGCCGACTCGGGTGAATACGTGTGCCGGGTGATGGGCAGCTCTGGC       c.8400
 H  H  V  S  P  A  D  S  G  E  Y  V  C  R  V  M  G  S  S  G         p.2800

          .         .         .         .         .         .       g.96146
 CCCCTGGAGGCCTCAGTCCTGGTCACCATCGAAGCCTCTGGCTCAAGTGCTGTCCACGTC       c.8460
 P  L  E  A  S  V  L  V  T  I  E  A  S  G  S  S  A  V  H  V         p.2820

      | 65   .         .         .         .         .         .    g.98014
 CCCG | CCCCAGGTGGAGCCCCACCCATCCGCATCGAGCCCTCCTCCTCCCGAGTGGCAGAA    c.8520
 P  A |   P  G  G  A  P  P  I  R  I  E  P  S  S  S  R  V  A  E      p.2840

          .         .         .         .         .         .       g.98074
 GGGCAGACCCTGGATCTGAAGTGCGTGGTGCCCGGGCAGGCCCACGCCCAGGTCACGTGG       c.8580
 G  Q  T  L  D  L  K  C  V  V  P  G  Q  A  H  A  Q  V  T  W         p.2860

          .         .         .       | 66 .         .         .    g.98843
 CACAAGCGTGGAGGAAACCTCCCTGCCCGGCACCAG | GTCCACGGCCCACTGCTGAGGCTG    c.8640
 H  K  R  G  G  N  L  P  A  R  H  Q   | V  H  G  P  L  L  R  L      p.2880

          .         .         .         .         .         .       g.98903
 AACCAGGTGTCCCCGGCTGACTCTGGCGAGTACTCGTGCCAAGTGACCGGAAGCTCAGGC       c.8700
 N  Q  V  S  P  A  D  S  G  E  Y  S  C  Q  V  T  G  S  S  G         p.2900

          .         .         .         .         .         | 67    g.99338
 ACCCTGGAGGCATCTGTCCTGGTCACAATTGAGCCCTCCAGCCCAGGACCCATTCCTG | CT    c.8760
 T  L  E  A  S  V  L  V  T  I  E  P  S  S  P  G  P  I  P  A |       p.2920

          .         .         .         .         .         .       g.99398
 CCAGGACTGGCCCAGCCCATCTACATCGAGGCCTCCTCTTCACACGTGACTGAAGGGCAG       c.8820
 P  G  L  A  Q  P  I  Y  I  E  A  S  S  S  H  V  T  E  G  Q         p.2940

          .         .         .         .         .         .       g.99458
 ACTCTGGATCTGAACTGTGTGGTGCCCGGGCAGGCCCATGCCCAGGTCACGTGGTACAAG       c.8880
 T  L  D  L  N  C  V  V  P  G  Q  A  H  A  Q  V  T  W  Y  K         p.2960

          .         .         . | 68       .         .         .    g.99907
 CGCGGGGGCAGCCTCCCCGCCCGGCACCAG | ACCCATGGCTCCCAGCTGCGGCTCCACCTC    c.8940
 R  G  G  S  L  P  A  R  H  Q   | T  H  G  S  Q  L  R  L  H  L      p.2980

          .         .         .         .         .         .       g.99967
 GTCTCCCCTGCCGACTCAGGCGAGTATGTGTGTCGTGCAGCCAGCGGCCCAGGCCCTGAG       c.9000
 V  S  P  A  D  S  G  E  Y  V  C  R  A  A  S  G  P  G  P  E         p.3000

          .         .         .         .         .   | 69     .    g.100123
 CAAGAAGCCTCCTTCACAGTCACCGTCCCGCCCAGTGAGGGGTCTTCCTACC | GCCTTAGG    c.9060
 Q  E  A  S  F  T  V  T  V  P  P  S  E  G  S  S  Y  R |   L  R      p.3020

          .         .         .         .         .         .       g.100183
 AGCCCGGTCATCTCCATCGACCCGCCCAGCAGCACCGTGCAGCAGGGCCAGGATGCCAGC       c.9120
 S  P  V  I  S  I  D  P  P  S  S  T  V  Q  Q  G  Q  D  A  S         p.3040

          .         .         .         .         .         .       g.100243
 TTCAAGTGCCTCATCCATGACGGGGCAGCCCCCATCAGCCTCGAGTGGAAGACCCGGAAC       c.9180
 F  K  C  L  I  H  D  G  A  A  P  I  S  L  E  W  K  T  R  N         p.3060

          .    | 70    .         .         .         .         .    g.100631
 CAGGAGCTGGAGG | ACAACGTCCACATCAGTCCCAATGGCTCCATCATCACCATCGTGGGC    c.9240
 Q  E  L  E  D |   N  V  H  I  S  P  N  G  S  I  I  T  I  V  G      p.3080

          .         .         .         .         .         .       g.100691
 ACCCGGCCCAGCAACCACGGTACCTACCGCTGCGTGGCCTCCAATGCCTACGGTGTGGCC       c.9300
 T  R  P  S  N  H  G  T  Y  R  C  V  A  S  N  A  Y  G  V  A         p.3100

          .         .         | 71         .         .         .    g.101004
 CAGAGTGTGGTGAACCTCAGTGTGCACG | GGCCCCCTACAGTGTCCGTGCTCCCCGAGGGC    c.9360
 Q  S  V  V  N  L  S  V  H  G |   P  P  T  V  S  V  L  P  E  G      p.3120

          .         .         .         .         .         .       g.101064
 CCCGTGTGGGTGAAAGTGGGAAAGGCTGTCACCCTGGAGTGTGTCAGTGCCGGGGAGCCC       c.9420
 P  V  W  V  K  V  G  K  A  V  T  L  E  C  V  S  A  G  E  P         p.3140

          .         .         .         .         .         .       g.101124
 CGCTCCTCTGCTCGTTGGACCCGGATCAGCAGCACCCCTGCCAAGTTGGAGCAGCGGACA       c.9480
 R  S  S  A  R  W  T  R  I  S  S  T  P  A  K  L  E  Q  R  T         p.3160

          .         .         .    | 72    .         .         .    g.102267
 TATGGGCTCATGGACAGCCACGCGGTGCTGCAG | ATTTCATCAGCTAAACCATCAGATGCG    c.9540
 Y  G  L  M  D  S  H  A  V  L  Q   | I  S  S  A  K  P  S  D  A      p.3180

          .         .         .         .         .         .       g.102327
 GGCACTTATGTGTGCCTTGCTCAGAATGCACTAGGCACAGCACAGAAGCAGGTGGAGGTG       c.9600
 G  T  Y  V  C  L  A  Q  N  A  L  G  T  A  Q  K  Q  V  E  V         p.3200

          .         .         .         .         .         .       g.102387
 ATCGTGGACACGGGCGCCATGGCCCCAGGGGCCCCTCAGGTCCAAGCTGAAGAAGCTGAG       c.9660
 I  V  D  T  G  A  M  A  P  G  A  P  Q  V  Q  A  E  E  A  E         p.3220

          .         .         .         .          | 73        .    g.102718
 CTGACTGTGGAGGCTGGACACACGGCCACCTTGCGCTGCTCAGCCACAG | GCAGCCCCGCG    c.9720
 L  T  V  E  A  G  H  T  A  T  L  R  C  S  A  T  G |   S  P  A      p.3240

          .         .         .         .         .         .       g.102778
 CCCACCATCCACTGGTCCAAGCTGCGTTCCCCACTGCCCTGGCAGCACCGGCTGGAAGGT       c.9780
 P  T  I  H  W  S  K  L  R  S  P  L  P  W  Q  H  R  L  E  G         p.3260

          .         .         .         .         .         .       g.102838
 GACACACTCATCATACCCCGGGTAGCCCAGCAGGACTCGGGCCAGTACATCTGCAATGCC       c.9840
 D  T  L  I  I  P  R  V  A  Q  Q  D  S  G  Q  Y  I  C  N  A         p.3280

          .         .         .         .          | 74        .    g.103183
 ACTAGCCCTGCTGGGCACGCTGAGGCCACCATCATCCTGCACGTGGAGA | GCCCACCATAT    c.9900
 T  S  P  A  G  H  A  E  A  T  I  I  L  H  V  E  S |   P  P  Y      p.3300

          .         .         .         .         .         .       g.103243
 GCCACCACGGTCCCAGAGCACGCTTCGGTGCAGGCAGGGGAGACGGTGCAGCTCCAGTGC       c.9960
 A  T  T  V  P  E  H  A  S  V  Q  A  G  E  T  V  Q  L  Q  C         p.3320

          .         .         .         .         .         .       g.103303
 CTGGCTCACGGGACACCCCCACTCACCTTCCAGTGGAGCCGCGTGGGCAGCAGCCTTCCT       c.10020
 L  A  H  G  T  P  P  L  T  F  Q  W  S  R  V  G  S  S  L  P         p.3340

          .         .         .         .         .         .       g.103363
 GGGAGGGCGACCGCCAGGAACGAGCTGCTGCACTTTGAGCGTGCAGCCCCTGAGGACTCA       c.10080
 G  R  A  T  A  R  N  E  L  L  H  F  E  R  A  A  P  E  D  S         p.3360

          .         .         .         .         .         .       g.103423
 GGCCGCTACCGCTGCCGGGTCACCAACAAGGTGGGCTCAGCCGAGGCCTTTGCCCAGCTG       c.10140
 G  R  Y  R  C  R  V  T  N  K  V  G  S  A  E  A  F  A  Q  L         p.3380

          . | 75       .         .         .         .         .    g.105301
 CTCGTCCAAG | GCCCTCCCGGCTCTCTCCCTGCCACCTCCATCCCAGCAGGGTCCACGCCC    c.10200
 L  V  Q  G |   P  P  G  S  L  P  A  T  S  I  P  A  G  S  T  P      p.3400

          .         .         .         .         .         .       g.105361
 ACCGTGCAGGTCACGCCTCAGCTAGAGACCAAGAGCATTGGGGCCAGCGTTGAGTTCCAC       c.10260
 T  V  Q  V  T  P  Q  L  E  T  K  S  I  G  A  S  V  E  F  H         p.3420

          .         .         .         .         .         .       g.105421
 TGTGCTGTGCCCAGCGACCGGGGTACCCAGCTCCGTTGGTTCAAGGAAGGGGGTCAGCTG       c.10320
 C  A  V  P  S  D  R  G  T  Q  L  R  W  F  K  E  G  G  Q  L         p.3440

          .         .         .      | 76  .         .         .    g.106645
 CCTCCGGGTCACAGCGTGCAGGATGGGGTGCTCCG | AATCCAGAACTTGGACCAGAGCTGC    c.10380
 P  P  G  H  S  V  Q  D  G  V  L  R  |  I  Q  N  L  D  Q  S  C      p.3460

          .         .         .         .         .         .       g.106705
 CAAGGGACGTATATATGCCAGGCCCATGGACCTTGGGGGAAGGCCCAGGCCAGTGCCCAG       c.10440
 Q  G  T  Y  I  C  Q  A  H  G  P  W  G  K  A  Q  A  S  A  Q         p.3480

          .    | 77    .         .         .         .         .    g.107359
 CTGGTTATCCAAG | CCCTGCCCTCGGTGCTCATCAACATCCGGACCTCTGTGCAGACCGTG    c.10500
 L  V  I  Q  A |   L  P  S  V  L  I  N  I  R  T  S  V  Q  T  V      p.3500

          .         .         .         .         .         .       g.107419
 GTGGTTGGCCACGCCGTGGAGTTCGAATGCCTGGCACTGGGTGACCCCAAGCCTCAGGTG       c.10560
 V  V  G  H  A  V  E  F  E  C  L  A  L  G  D  P  K  P  Q  V         p.3520

          .         .         .         .         .         .       g.107479
 ACATGGAGCAAAGTTGGAGGGCACCTGCGGCCAGGCATTGTGCAGAGCGGAGGTGTCGTC       c.10620
 T  W  S  K  V  G  G  H  L  R  P  G  I  V  Q  S  G  G  V  V         p.3540

          .         .         .         .         .         .       g.107539
 AGGATCGCCCACGTAGAGCTGGCTGATGCGGGACAGTATCGCTGCACTGCCACCAACGCA       c.10680
 R  I  A  H  V  E  L  A  D  A  G  Q  Y  R  C  T  A  T  N  A         p.3560

          .         .         .         . | 78       .         .    g.108352
 GCTGGCACCACACAATCCCACGTCCTGCTGCTTGTGCAAG | CCTTGCCCCAGATCTCAATG    c.10740
 A  G  T  T  Q  S  H  V  L  L  L  V  Q  A |   L  P  Q  I  S  M      p.3580

          .         .         .         .         .         .       g.108412
 CCCCAAGAAGTCCGTGTGCCTGCTGGTTCTGCAGCTGTCTTCCCCTGCATAGCCTCAGGC       c.10800
 P  Q  E  V  R  V  P  A  G  S  A  A  V  F  P  C  I  A  S  G         p.3600

          .         .         . | 79       .         .         .    g.108673
 TACCCCACTCCTGACATCAGCTGGAGCAAG | CTGGATGGCAGCCTGCCACCTGACAGCCGC    c.10860
 Y  P  T  P  D  I  S  W  S  K   | L  D  G  S  L  P  P  D  S  R      p.3620

          .         .         .         .         .         .       g.108733
 CTGGAGAACAACATGCTGATGCTGCCCTCAGTCCGACCCCAGGACGCAGGTACCTACGTC       c.10920
 L  E  N  N  M  L  M  L  P  S  V  R  P  Q  D  A  G  T  Y  V         p.3640

          .         .         .         .         .         | 80    g.108875
 TGCACCGCCACTAACCGCCAGGGCAAGGTCAAAGCCTTTGCCCACCTGCAGGTGCCAG | AG    c.10980
 C  T  A  T  N  R  Q  G  K  V  K  A  F  A  H  L  Q  V  P  E |       p.3660

          .         .         .         .         .         .       g.108935
 CGGGTGGTGCCCTACTTCACGCAGACCCCCTACTCCTTCCTACCGCTGCCCACCATCAAG       c.11040
 R  V  V  P  Y  F  T  Q  T  P  Y  S  F  L  P  L  P  T  I  K         p.3680

          .         .         .         .         .      | 81  .    g.109656
 GATGCCTACAGGAAGTTCGAGATCAAGATCACCTTCCGGCCCGACTCAGCCGATG | GGATG    c.11100
 D  A  Y  R  K  F  E  I  K  I  T  F  R  P  D  S  A  D  G |   M      p.3700

          .         .         .         .         .         .       g.109716
 CTGCTGTACAATGGGCAGAAGCGAGTCCCAGGGAGCCCCACCAACCTGGCCAACCGGCAG       c.11160
 L  L  Y  N  G  Q  K  R  V  P  G  S  P  T  N  L  A  N  R  Q         p.3720

          .         .         .         .        | 82.         .    g.110474
 CCCGACTTCATCTCCTTCGGCCTCGTGGGGGGAAGGCCCGAGTTCCG | GTTCGATGCAGGC    c.11220
 P  D  F  I  S  F  G  L  V  G  G  R  P  E  F  R  |  F  D  A  G      p.3740

          .         .         .         .         .         .       g.110534
 TCAGGCATGGCCACCATCCGCCATCCCACACCACTGGCCCTGGGCCATTTCCACACCGTG       c.11280
 S  G  M  A  T  I  R  H  P  T  P  L  A  L  G  H  F  H  T  V         p.3760

          .         .         .         .         .         .       g.110594
 ACCCTGCTGCGCAGCCTCACCCAGGGCTCCCTGATTGTGGGTGACCTGGCCCCGGTCAAT       c.11340
 T  L  L  R  S  L  T  Q  G  S  L  I  V  G  D  L  A  P  V  N         p.3780

          .   | 83     .         .         .         .         .    g.110743
 GGGACCTCCCAG | GGCAAGTTCCAGGGCCTGGATCTGAACGAGGAACTCTACCTGGGTGGC    c.11400
 G  T  S  Q   | G  K  F  Q  G  L  D  L  N  E  E  L  Y  L  G  G      p.3800

          .         .         .         .         .   | 84     .    g.110941
 TATCCTGACTATGGTGCCATCCCCAAGGCGGGGCTGAGCAGCGGCTTCATAG | GCTGTGTC    c.11460
 Y  P  D  Y  G  A  I  P  K  A  G  L  S  S  G  F  I  G |   C  V      p.3820

          .         .         .         .         .         .       g.111001
 CGGGAGCTGCGCATCCAGGGCGAGGAGATCGTCTTCCATGACCTCAACCTCACGGCGCAC       c.11520
 R  E  L  R  I  Q  G  E  E  I  V  F  H  D  L  N  L  T  A  H         p.3840

          .         .         .         .   | 85     .         .    g.111185
 GGCATCTCCCACTGCCCCACCTGTCGGGACCGGCCCTGCCAG | AATGGCGGTCAGTGCCAT    c.11580
 G  I  S  H  C  P  T  C  R  D  R  P  C  Q   | N  G  G  Q  C  H      p.3860

          .         .         .         .         .         .       g.111245
 GACTCTGAGAGCAGCAGCTACGTGTGCGTCTGCCCAGCTGGCTTCACCGGGAGCCGCTGT       c.11640
 D  S  E  S  S  S  Y  V  C  V  C  P  A  G  F  T  G  S  R  C         p.3880

          .         .         .  | 86      .         .         .    g.112195
 GAGCACTCGCAGGCCCTGCACTGCCATCCAG | AGGCCTGTGGGCCCGACGCCACCTGTGTG    c.11700
 E  H  S  Q  A  L  H  C  H  P  E |   A  C  G  P  D  A  T  C  V      p.3900

          .         .         .         .         .         .       g.112255
 AACCGGCCTGACGGTCGAGGCTACACCTGCCGCTGCCACCTGGGCCGCTCGGGGTTGCGG       c.11760
 N  R  P  D  G  R  G  Y  T  C  R  C  H  L  G  R  S  G  L  R         p.3920

          . | 87       .         .         .         .         .    g.112703
 TGTGAGGAAG | GTGTGACAGTGACCACCCCCTCGCTGTCGGGTGCTGGCTCCTACCTGGCA    c.11820
 C  E  E  G |   V  T  V  T  T  P  S  L  S  G  A  G  S  Y  L  A      p.3940

          .         .         .         .         .         .       g.112763
 CTGCCCGCCCTCACCAACACACACCACGAGCTACGCCTGGACGTGGAGTTCAAGCCACTC       c.11880
 L  P  A  L  T  N  T  H  H  E  L  R  L  D  V  E  F  K  P  L         p.3960

          .         .         .         .         .         .       g.112823
 GCCCCTGACGGGGTCCTGCTGTTCAGCGGGGGGAAGAGCGGGCCTGTGGAGGACTTCGTG       c.11940
 A  P  D  G  V  L  L  F  S  G  G  K  S  G  P  V  E  D  F  V         p.3980

          .         .         .         .         .   | 88     .    g.113186
 TCCCTGGCGATGGTGGGCGGCCACCTGGAGTTCCGCTATGAGTTGGGGTCAG | GGCTGGCC    c.12000
 S  L  A  M  V  G  G  H  L  E  F  R  Y  E  L  G  S  G |   L  A      p.4000

          .         .         .         .         .         .       g.113246
 GTTCTGCGGAGCGCCGAGCCGCTGGCCCTGGGCCGCTGGCACCGTGTGTCTGCAGAGCGT       c.12060
 V  L  R  S  A  E  P  L  A  L  G  R  W  H  R  V  S  A  E  R         p.4020

          .         .         .         .         .         .       g.113306
 CTCAACAAGGACGGCAGCCTGCGGGTGAATGGTGGACGCCCTGTGCTGCGCTCCTCGCCC       c.12120
 L  N  K  D  G  S  L  R  V  N  G  G  R  P  V  L  R  S  S  P         p.4040

          .         .         .         .         .         .       g.113366
 GGCAAGAGCCAGGGCCTCAACCTGCACACCCTGCTCTACCTGGGGGGTGTGGAGCCTTCC       c.12180
 G  K  S  Q  G  L  N  L  H  T  L  L  Y  L  G  G  V  E  P  S         p.4060

          .         .         .         .         .        | 89.    g.113834
 GTGCCACTGTCCCCGGCCACCAACATGAGCGCTCACTTCCGCGGCTGTGTGGGCGAG | GTG    c.12240
 V  P  L  S  P  A  T  N  M  S  A  H  F  R  G  C  V  G  E   | V      p.4080

          .         .         .         .         .         .       g.113894
 TCAGTGAATGGCAAACGGCTGGACCTCACCTACAGTTTCCTAGGCAGCCAGGGCATCGGG       c.12300
 S  V  N  G  K  R  L  D  L  T  Y  S  F  L  G  S  Q  G  I  G         p.4100

          .         .         .         .         .         .       g.113954
 CAATGCTATGATAGCTCCCCATGTGAGCGCCAGCCTTGCCAACATGGTGCCACGTGCATG       c.12360
 Q  C  Y  D  S  S  P  C  E  R  Q  P  C  Q  H  G  A  T  C  M         p.4120

          .         .         .         .          | 90        .    g.114116
 CCCGCTGGCGAGTATGAGTTCCAGTGCCTGTGTCGAGATGGATTCAAAG | GAGACCTGTGT    c.12420
 P  A  G  E  Y  E  F  Q  C  L  C  R  D  G  F  K  G |   D  L  C      p.4140

          .         .         .         .         .         .       g.114176
 GAGCACGAGGAGAACCCCTGCCAGCTCCGTGAACCCTGTCTGCATGGGGGCACCTGCCAG       c.12480
 E  H  E  E  N  P  C  Q  L  R  E  P  C  L  H  G  G  T  C  Q         p.4160

          .         .         .         .         .   | 91     .    g.114346
 GGCACCCGCTGCCTCTGCCTCCCTGGCTTCTCTGGCCCACGCTGCCAACAAG | GCTCTGGA    c.12540
 G  T  R  C  L  C  L  P  G  F  S  G  P  R  C  Q  Q  G |   S  G      p.4180

          .         .         .         .          | 92        .    g.117489
 CATGGCATAGCAGAGTCCGACTGGCATCTTGAAGGCAGCGGGGGCAATG | ATGCCCCTGGG    c.12600
 H  G  I  A  E  S  D  W  H  L  E  G  S  G  G  N  D |   A  P  G      p.4200

          .         .         .         .         .         .       g.117549
 CAGTACGGAGCCTATTTCCACGATGATGGCTTCCTCGCCTTCCCTGGCCATGTCTTCTCC       c.12660
 Q  Y  G  A  Y  F  H  D  D  G  F  L  A  F  P  G  H  V  F  S         p.4220

       | 93  .         .         .         .         .         .    g.117697
 AGGAG | CCTGCCCGAGGTGCCCGAGACCATCGAGCTGGAGGTTCGGACCAGCACAGCCAGT    c.12720
 R  S  |  L  P  E  V  P  E  T  I  E  L  E  V  R  T  S  T  A  S      p.4240

          .         .     | 94   .         .         .         .    g.117917
 GGCCTCCTGCTCTGGCAGGGTGTG | GAGGTGGGAGAGGCCGGCCAAGGCAAGGACTTCATC    c.12780
 G  L  L  L  W  Q  G  V   | E  V  G  E  A  G  Q  G  K  D  F  I      p.4260

          .         .         .      | 95  .         .         .    g.118078
 AGCCTCGGGCTTCAAGACGGGCACCTTGTCTTCAG | GTACCAGCTGGGTAGTGGGGAGGCC    c.12840
 S  L  G  L  Q  D  G  H  L  V  F  R  |  Y  Q  L  G  S  G  E  A      p.4280

          .         .         .         .         .          | 96    g.118539
 CGCCTGGTCTCTGAGGACCCCATCAATGACGGCGAGTGGCACCGGGTGACAGCACTGCG | G    c.12900
 R  L  V  S  E  D  P  I  N  D  G  E  W  H  R  V  T  A  L  R  |      p.4300

          .         .         .         .         .         .       g.118599
 GAGGGCCGCAGAGGTTCCATCCAAGTCGACGGTGAGGAGCTGGTCAGCGGCCGGTCCCCA       c.12960
 E  G  R  R  G  S  I  Q  V  D  G  E  E  L  V  S  G  R  S  P         p.4320

          .         .         .         .    | 97    .         .    g.118786
 GGTCCCAACGTGGCAGTCAACGCCAAGGGCAGCGTCTACATCG | GCGGAGCCCCTGACGTG    c.13020
 G  P  N  V  A  V  N  A  K  G  S  V  Y  I  G |   G  A  P  D  V      p.4340

          .         .         .         .         .         .       g.118846
 GCCACGCTGACCGGGGGCAGATTCTCCTCAGGCATCACAGGCTGTGTCAAGAACCTGGTG       c.13080
 A  T  L  T  G  G  R  F  S  S  G  I  T  G  C  V  K  N  L  V         p.4360

          .         .         .         .         .         .       g.118906
 CTGCACTCGGCCCGACCCGGCGCCCCGCCCCCACAGCCCCTGGACCTGCAGCACCGCGCC       c.13140
 L  H  S  A  R  P  G  A  P  P  P  Q  P  L  D  L  Q  H  R  A         p.4380

          .         .         .                                     g.118942
 CAGGCCGGGGCCAACACACGCCCCTGCCCCTCGTAG                               c.13176
 Q  A  G  A  N  T  R  P  C  P  S  X                                 p.4391

          .         .         .         .         .         .       g.119002
 gcacctgcctgccccacacggactcccgggccacgccccagcccgacaatgtcgagtata       c.*60

          .         .         .         .         .         .       g.119062
 ttattattaatattattatgaatttttgtaagaaaccgaggcgatgccacgctttgctgc       c.*120

          .         .         .         .         .         .       g.119122
 taccgccctgggctggactggaggtgggcatgccaccctcacacacacagctgggcaaag       c.*180

          .         .         .         .         .         .       g.119182
 ccacaaggctggccagcaaggcaggttggatgggagtgggcacctcagaaagtcaccagg       c.*240

          .         .         .         .         .         .       g.119242
 acttggggtcaggaacagtggctgggtgggcccagaactgcccccactgtccccctaccc       c.*300

          .         .         .         .         .         .       g.119302
 accgatggagcccccagatagagctgggtggcctgtttctgcagcccttgggcagttctc       c.*360

          .         .         .         .         .         .       g.119362
 actcctaggagagccaacctcggcttgtgggctggtgccccacagctacctgagacgggc       c.*420

          .         .         .         .         .         .       g.119422
 atcgcaggagtctctgccacccactcaggattgggaattgtctttagtgccggctgtgga       c.*480

          .         .         .         .         .         .       g.119482
 gcaaaaggcagctcacccctgggcaggcggtccccatccccaccagctcgtttttcagca       c.*540

          .         .         .         .         .         .       g.119542
 cccccacccacctccacccagcccctggcacctcctctggcagactccccctcctaccac       c.*600

          .         .         .         .         .         .       g.119602
 gtcctcctggcctgcattcccaccccctcctgccagcacacagcctggggtccctccctc       c.*660

          .         .         .         .         .         .       g.119662
 aggggctgtaagggaaggcccaccccaactcttaccaggagctgctacaggcagagccca       c.*720

          .         .         .         .         .         .       g.119722
 gcactgatagggccccgcccaccgggccccgcccaccccaggccacatccccacccatct       c.*780

          .         .         .         .         .         .       g.119782
 ggaagtgaaggcccagggactcctccaacagacaacggacggacggatgccgctggtgct       c.*840

          .         .         .         .         .         .       g.119842
 caggaagagctagtgccttaggtgggggaaggcaggactcacgactgagagagagaggag       c.*900

          .         .         .         .         .         .       g.119902
 ggggatatgaccaccctgccccatctgcaggagcctgaagatccagctcaagtgccatcc       c.*960

          .         .         .         .         .         .       g.119962
 tgccagtggcccccagactgtggggttgggacgcctggcctctgtgtcctagaagggacc       c.*1020

          .         .         .         .         .                 g.120014
 ctcctgtggtctttgtcttgatttttcttaataaacggtgctatccccgcca               c.*1072

 (downstream sequence)
Legend:
Nucleotide numbering (following the HGVS rules for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation-initiating methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The heparan sulfate proteoglycan 2 protein sequence is shown below the coding DNA sequence, with numbering indicated at the right, starting with 1 for the translation-initiating methionine. Every 10th amino acid is shown in bold. The position of each intron is indicated by a vertical line separating the two flanking exons. The start of the first exon (transcription initiation site) is indicated by a '\', and the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frameshift variants, all stop codons in the +1 frame are shown in bold, while all stop codons in the +2 frame are underlined.
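
The numbering rules in the legend can be sketched in a few lines of Python. This is only an illustrative sketch, not part of LOVD: the function names (codon_number, translate, shifted_frame_stops) are hypothetical, the standard genetic code is assumed, and stop codons are written as X to match the protein lines above.

```python
# Illustrative sketch only; names are hypothetical and not part of LOVD.
# Assumes the standard genetic code, with stop codons written as "X"
# to match the protein lines in this file.
BASES = "TCAG"
AMINO = "FFLLSSSSYYXXCCXWLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codon_number(c_pos):
    """Map a coding position (c.1 = A of the ATG) to its codon number.

    Codon 1 is the initiating methionine, so for non-stop codons this is
    also the amino acid (p.) number.
    """
    if c_pos < 1:
        raise ValueError("only positions within the coding region (c.1 and up)")
    return (c_pos - 1) // 3 + 1


def translate(cds):
    """Translate an in-frame coding sequence; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def shifted_frame_stops(cds, shift):
    """Return 1-based positions of the first base of each stop codon
    when the sequence is read in the +1 (shift=1) or +2 (shift=2) frame."""
    cds = cds.upper()
    return [
        i + 1
        for i in range(shift, len(cds) - 2, 3)
        if cds[i:i + 3] in STOP_CODONS
    ]


# The last coding line of this file (ending at c.13176) as a check.
last_line = "CAGGCCGGGGCCAACACACGCCCCTGCCCCTCGTAG"
print(translate(last_line))   # protein letters for that line
print(codon_number(13176))    # codon number of the stop codon
```

For example, codon_number(60) gives 20, which agrees with the c.60 / p.20 labels on the first coding line, and translating the last coding line reproduces the protein letters printed beneath it.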

Powered by LOVD v.3.0 Build 13
©2004-2015 Leiden University Medical Center