versican (VCAN) - coding DNA reference sequence

(used for variant description)

(last modified November 26, 2020)


This file was created to facilitate the description of sequence variants on transcript NM_004385.4 in the VCAN gene based on a coding DNA reference sequence following the HGVS recommendations.

The sequence was taken from NG_012682.1, covering VCAN transcript NM_004385.4.


Please note that introns are available by clicking on the exon numbers above the sequence.
 (upstream sequence)
           .         .         .         .         .                g.5056
     cttcttctcgctgagtctcctcctcggctctgacggtacagtgatataatgatgat       c.-301

 .         .         .         .         .         .                g.5116
 gggtgtcacaacccgcatttgaacttgcaggcgagctgccccgagcctttctggggaaga       c.-241

 .         .         .         .         .         .                g.5176
 actccaggcgtgcggacgcaacagccgagaacattaggtgttgtggacaggagctgggac       c.-181

 .         .         .         .         .         .                g.5236
 caagatcttcggccagccccgcatcctcccgcatcttccagcaccgtcccgcaccctccg       c.-121

 .         .         .         .         .         .                g.5296
 catccttccccgggccaccacgcttcctatgtgacccgcctgggcaacgccgaacccagt       c.-61

 .         .         .         .         .         .    | 02        g.16845
 cgcgcagcgctgcagtgaattttccccccaaactgcaataagccgccttccaag | gccaag    c.-1

          .         .         .         .         .         .       g.16905
 ATGTTCATAAATATAAAGAGCATCTTATGGATGTGTTCAACCTTAATAGTAACCCATGCG       c.60
 M  F  I  N  I  K  S  I  L  W  M  C  S  T  L  I  V  T  H  A         p.20

          . | 03       .         .         .         .         .    g.23474
 CTACATAAAG | TCAAAGTGGGAAAAAGCCCACCGGTGAGGGGCTCCCTCTCTGGAAAAGTC    c.120
 L  H  K  V |   K  V  G  K  S  P  P  V  R  G  S  L  S  G  K  V      p.40

          .         .         .         .         .         .       g.23534
 AGCCTACCTTGTCATTTTTCAACGATGCCTACTTTGCCACCCAGTTACAACACCAGTGAA       c.180
 S  L  P  C  H  F  S  T  M  P  T  L  P  P  S  Y  N  T  S  E         p.60

          .         .         .         .         .         .       g.23594
 TTTCTCCGCATCAAATGGTCTAAGATTGAAGTGGACAAAAATGGAAAAGATTTGAAAGAG       c.240
 F  L  R  I  K  W  S  K  I  E  V  D  K  N  G  K  D  L  K  E         p.80

          .         .         .         .         .         .       g.23654
 ACTACTGTCCTTGTGGCCCAAAATGGAAATATCAAGATTGGTCAGGACTACAAAGGGAGA       c.300
 T  T  V  L  V  A  Q  N  G  N  I  K  I  G  Q  D  Y  K  G  R         p.100

          .         .         .         .         .         .       g.23714
 GTGTCTGTGCCCACACATCCCGAGGCTGTGGGCGATGCCTCCCTCACTGTGGTCAAGCTG       c.360
 V  S  V  P  T  H  P  E  A  V  G  D  A  S  L  T  V  V  K  L         p.120

          .         .         .         .         .         .       g.23774
 CTGGCAAGTGATGCGGGTCTTTACCGCTGTGACGTCATGTACGGGATTGAAGACACACAA       c.420
 L  A  S  D  A  G  L  Y  R  C  D  V  M  Y  G  I  E  D  T  Q         p.140

          .         .      | 04  .         .         .         .    g.26907
 GACACGGTGTCACTGACTGTGGATG | GGGTTGTGTTTCACTACAGGGCGGCAACCAGCAGG    c.480
 D  T  V  S  L  T  V  D  G |   V  V  F  H  Y  R  A  A  T  S  R      p.160

          .         .         .         .         .         .       g.26967
 TACACACTGAATTTTGAGGCTGCTCAGAAGGCTTGTTTGGACGTTGGGGCAGTCATAGCA       c.540
 Y  T  L  N  F  E  A  A  Q  K  A  C  L  D  V  G  A  V  I  A         p.180

          .         .         .         .         .         .       g.27027
 ACTCCAGAGCAGCTCTTTGCTGCCTATGAAGATGGATTTGAGCAGTGTGACGCAGGCTGG       c.600
 T  P  E  Q  L  F  A  A  Y  E  D  G  F  E  Q  C  D  A  G  W         p.200

          .         . | 05       .         .         .         .    g.27170
 CTGGCTGATCAGACTGTCAG | ATATCCCATCCGGGCTCCCAGAGTAGGCTGTTATGGAGAT    c.660
 L  A  D  Q  T  V  R  |  Y  P  I  R  A  P  R  V  G  C  Y  G  D      p.220

          .         .         .         .         .         .       g.27230
 AAGATGGGAAAGGCAGGAGTCAGGACTTATGGATTCCGTTCTCCCCAGGAAACTTACGAT       c.720
 K  M  G  K  A  G  V  R  T  Y  G  F  R  S  P  Q  E  T  Y  D         p.240

          .         .         | 06         .         .         .    g.45461
 GTGTATTGTTATGTGGATCATCTGGATG | GTGATGTGTTCCACCTCACTGTCCCCAGTAAA    c.780
 V  Y  C  Y  V  D  H  L  D  G |   D  V  F  H  L  T  V  P  S  K      p.260

          .         .         .         .         .         .       g.45521
 TTCACCTTCGAGGAGGCTGCAAAAGAGTGTGAAAACCAGGATGCCAGGCTGGCAACAGTG       c.840
 F  T  F  E  E  A  A  K  E  C  E  N  Q  D  A  R  L  A  T  V         p.280

          .         .         .         .         .         .       g.45581
 GGGGAACTCCAGGCGGCATGGAGGAACGGCTTTGACCAGTGCGATTACGGGTGGCTGTCG       c.900
 G  E  L  Q  A  A  W  R  N  G  F  D  Q  C  D  Y  G  W  L  S         p.300

          .         .         .         .         .         .       g.45641
 GATGCCAGCGTGCGCCACCCTGTGACTGTGGCCAGGGCCCAGTGTGGAGGTGGTCTACTT       c.960
 D  A  S  V  R  H  P  V  T  V  A  R  A  Q  C  G  G  G  L  L         p.320

          .         .         .         .         .         .       g.45701
 GGGGTGAGAACCCTGTATCGTTTTGAGAACCAGACAGGCTTCCCTCCCCCTGATAGCAGA       c.1020
 G  V  R  T  L  Y  R  F  E  N  Q  T  G  F  P  P  P  D  S  R         p.340

          .         .   | 07     .         .         .         .    g.52713
 TTTGATGCCTACTGCTTTAAAC | CTAAAGAGGCTACAACCATCGATTTGAGTATCCTCGCA    c.1080
 F  D  A  Y  C  F  K  P |   K  E  A  T  T  I  D  L  S  I  L  A      p.360

          .         .         .         .         .         .       g.52773
 GAAACTGCATCACCCAGTTTATCCAAAGAACCACAAATGGTTTCTGATAGAACTACACCA       c.1140
 E  T  A  S  P  S  L  S  K  E  P  Q  M  V  S  D  R  T  T  P         p.380

          .         .         .         .         .         .       g.52833
 ATCATCCCTTTAGTTGATGAATTACCTGTCATTCCAACAGAGTTCCCTCCCGTGGGAAAT       c.1200
 I  I  P  L  V  D  E  L  P  V  I  P  T  E  F  P  P  V  G  N         p.400

          .         .         .         .         .         .       g.52893
 ATTGTCAGTTTTGAACAGAAAGCCACAGTCCAACCTCAGGCTATCACAGATAGTTTAGCC       c.1260
 I  V  S  F  E  Q  K  A  T  V  Q  P  Q  A  I  T  D  S  L  A         p.420

          .         .         .         .         .         .       g.52953
 ACCAAATTACCCACACCTACTGGCAGTACCAAGAAGCCCTGGGATATGGATGACTACTCA       c.1320
 T  K  L  P  T  P  T  G  S  T  K  K  P  W  D  M  D  D  Y  S         p.440

          .         .         .         .         .         .       g.53013
 CCTTCTGCTTCAGGACCTCTTGGAAAGCTAGACATATCAGAAATTAAGGAAGAAGTGCTC       c.1380
 P  S  A  S  G  P  L  G  K  L  D  I  S  E  I  K  E  E  V  L         p.460

          .         .         .         .         .         .       g.53073
 CAGAGTACAACTGGCGTCTCTCATTATGCTACGGATTCATGGGATGGTGTCGTGGAAGAT       c.1440
 Q  S  T  T  G  V  S  H  Y  A  T  D  S  W  D  G  V  V  E  D         p.480

          .         .         .         .         .         .       g.53133
 AAACAAACACAAGAATCGGTTACACAGATTGAACAAATAGAAGTGGGTCCTTTGGTAACA       c.1500
 K  Q  T  Q  E  S  V  T  Q  I  E  Q  I  E  V  G  P  L  V  T         p.500

          .         .         .         .         .         .       g.53193
 TCTATGGAAATCTTAAAGCACATTCCTTCCAAGGAATTCCCTGTAACTGAAACACCATTG       c.1560
 S  M  E  I  L  K  H  I  P  S  K  E  F  P  V  T  E  T  P  L         p.520

          .         .         .         .         .         .       g.53253
 GTAACTGCAAGAATGATCCTGGAATCCAAAACTGAAAAGAAAATGGTAAGCACTGTTTCT       c.1620
 V  T  A  R  M  I  L  E  S  K  T  E  K  K  M  V  S  T  V  S         p.540

          .         .         .         .         .         .       g.53313
 GAATTGGTAACCACAGGTCACTATGGATTCACCTTGGGAGAAGAGGATGATGAAGACAGA       c.1680
 E  L  V  T  T  G  H  Y  G  F  T  L  G  E  E  D  D  E  D  R         p.560

          .         .         .         .         .         .       g.53373
 ACACTTACAGTTGGATCTGATGAGAGCACCTTGATCTTTGACCAAATTCCTGAAGTCATT       c.1740
 T  L  T  V  G  S  D  E  S  T  L  I  F  D  Q  I  P  E  V  I         p.580

          .         .         .         .         .         .       g.53433
 ACGGTGTCAAAGACTTCAGAAGACACCATCCACACTCATTTAGAAGACTTGGAGTCAGTC       c.1800
 T  V  S  K  T  S  E  D  T  I  H  T  H  L  E  D  L  E  S  V         p.600

          .         .         .         .         .         .       g.53493
 TCAGCATCCACAACTGTTTCCCCTTTAATTATGCCTGATAATAATGGATCATCCATGGAT       c.1860
 S  A  S  T  T  V  S  P  L  I  M  P  D  N  N  G  S  S  M  D         p.620

          .         .         .         .         .         .       g.53553
 GACTGGGAAGAGAGACAAACTAGTGGTAGGATAACGGAAGAGTTTCTTGGCAAATATCTG       c.1920
 D  W  E  E  R  Q  T  S  G  R  I  T  E  E  F  L  G  K  Y  L         p.640

          .         .         .         .         .         .       g.53613
 TCTACTACACCTTTTCCATCACAGCATCGTACAGAAATAGAATTGTTTCCTTATTCTGGT       c.1980
 S  T  T  P  F  P  S  Q  H  R  T  E  I  E  L  F  P  Y  S  G         p.660

          .         .         .         .         .         .       g.53673
 GATAAAATATTAGTAGAGGGAATTTCCACAGTTATTTATCCTTCTCTACAAACAGAAATG       c.2040
 D  K  I  L  V  E  G  I  S  T  V  I  Y  P  S  L  Q  T  E  M         p.680

          .         .         .         .         .         .       g.53733
 ACACATAGAAGAGAAAGAACAGAAACACTAATACCAGAGATGAGAACAGATACTTATACA       c.2100
 T  H  R  R  E  R  T  E  T  L  I  P  E  M  R  T  D  T  Y  T         p.700

          .         .         .         .         .         .       g.53793
 GATGAAATACAAGAAGAGATCACTAAAAGTCCATTTATGGGAAAAACAGAAGAAGAAGTC       c.2160
 D  E  I  Q  E  E  I  T  K  S  P  F  M  G  K  T  E  E  E  V         p.720

          .         .         .         .         .         .       g.53853
 TTCTCTGGGATGAAACTCTCTACATCTCTCTCAGAGCCAATTCATGTTACAGAGTCTTCT       c.2220
 F  S  G  M  K  L  S  T  S  L  S  E  P  I  H  V  T  E  S  S         p.740

          .         .         .         .         .         .       g.53913
 GTGGAAATGACCAAGTCTTTTGATTTCCCAACATTGATAACAAAGTTAAGTGCAGAGCCA       c.2280
 V  E  M  T  K  S  F  D  F  P  T  L  I  T  K  L  S  A  E  P         p.760

          .         .         .         .         .         .       g.53973
 ACAGAAGTAAGAGATATGGAGGAAGACTTTACAGCAACTCCAGGTACTACAAAATATGAT       c.2340
 T  E  V  R  D  M  E  E  D  F  T  A  T  P  G  T  T  K  Y  D         p.780

          .         .         .         .         .         .       g.54033
 GAAAATATTACAACAGTGCTTTTGGCCCATGGTACTTTAAGTGTTGAAGCAGCCACTGTA       c.2400
 E  N  I  T  T  V  L  L  A  H  G  T  L  S  V  E  A  A  T  V         p.800

          .         .         .         .         .         .       g.54093
 TCAAAATGGTCATGGGATGAAGATAATACAACATCCAAGCCTTTAGAGTCTACAGAACCT       c.2460
 S  K  W  S  W  D  E  D  N  T  T  S  K  P  L  E  S  T  E  P         p.820

          .         .         .         .         .         .       g.54153
 TCAGCCTCTTCAAAATTGCCCCCTGCCTTACTCACAACTGTGGGGATGAATGGAAAGGAT       c.2520
 S  A  S  S  K  L  P  P  A  L  L  T  T  V  G  M  N  G  K  D         p.840

          .         .         .         .         .         .       g.54213
 AAAGACATCCCAAGTTTCACTGAAGATGGAGCAGATGAATTTACTCTTATTCCAGATAGT       c.2580
 K  D  I  P  S  F  T  E  D  G  A  D  E  F  T  L  I  P  D  S         p.860

          .         .         .         .         .         .       g.54273
 ACTCAAAAGCAGTTAGAGGAGGTTACTGATGAAGACATAGCAGCCCATGGAAAATTCACA       c.2640
 T  Q  K  Q  L  E  E  V  T  D  E  D  I  A  A  H  G  K  F  T         p.880

          .         .         .         .         .         .       g.54333
 ATTAGATTTCAGCCAACTACATCAACTGGTATTGCAGAAAAGTCAACTTTGAGAGATTCT       c.2700
 I  R  F  Q  P  T  T  S  T  G  I  A  E  K  S  T  L  R  D  S         p.900

          .         .         .         .         .         .       g.54393
 ACAACTGAAGAAAAAGTTCCACCTATCACAAGCACTGAAGGCCAAGTTTATGCAACCATG       c.2760
 T  T  E  E  K  V  P  P  I  T  S  T  E  G  Q  V  Y  A  T  M         p.920

          .         .         .         .         .         .       g.54453
 GAAGGAAGTGCTTTGGGTGAAGTAGAAGATGTGGACCTCTCTAAGCCAGTATCTACTGTT       c.2820
 E  G  S  A  L  G  E  V  E  D  V  D  L  S  K  P  V  S  T  V         p.940

          .         .         .         .         .         .       g.54513
 CCCCAATTTGCACACACTTCAGAGGTGGAAGGATTAGCATTTGTTAGTTATAGTAGCACC       c.2880
 P  Q  F  A  H  T  S  E  V  E  G  L  A  F  V  S  Y  S  S  T         p.960

          .         .         .         .         .         .       g.54573
 CAAGAGCCTACTACTTATGTAGACTCTTCCCATACCATTCCTCTTTCTGTAATTCCCAAG       c.2940
 Q  E  P  T  T  Y  V  D  S  S  H  T  I  P  L  S  V  I  P  K         p.980

          .         .         .         .         .         .       g.54633
 ACAGACTGGGGAGTGTTAGTACCTTCTGTTCCATCAGAAGATGAAGTTCTAGGTGAACCC       c.3000
 T  D  W  G  V  L  V  P  S  V  P  S  E  D  E  V  L  G  E  P         p.1000

          .         .         .         .         .         .       g.54693
 TCTCAAGACATACTTGTCATTGATCAGACTCGCCTTGAAGCGACTATTTCTCCAGAAACT       c.3060
 S  Q  D  I  L  V  I  D  Q  T  R  L  E  A  T  I  S  P  E  T         p.1020

          .         .         .         .         .         .       g.54753
 ATGAGAACAACAAAAATCACAGAGGGAACAACTCAGGAAGAATTCCCTTGGAAAGAACAG       c.3120
 M  R  T  T  K  I  T  E  G  T  T  Q  E  E  F  P  W  K  E  Q         p.1040

          .         .         .         .         .         .       g.54813
 ACTGCAGAGAAACCAGTTCCTGCTCTCAGTTCTACAGCTTGGACTCCCAAGGAGGCAGTA       c.3180
 T  A  E  K  P  V  P  A  L  S  S  T  A  W  T  P  K  E  A  V         p.1060

          .         .         .         .         .         .       g.54873
 ACACCACTGGATGAACAAGAGGGCGATGGATCAGCATATACAGTCTCTGAAGATGAATTG       c.3240
 T  P  L  D  E  Q  E  G  D  G  S  A  Y  T  V  S  E  D  E  L         p.1080

          .         .         .         .         .         .       g.54933
 TTGACAGGTTCTGAGAGGGTCCCAGTTTTAGAAACAACTCCAGTTGGAAAAATTGATCAC       c.3300
 L  T  G  S  E  R  V  P  V  L  E  T  T  P  V  G  K  I  D  H         p.1100

          .         .         .         .         .         .       g.54993
 AGTGTGTCTTATCCACCAGGTGCTGTAACTGAGCACAAAGTGAAAACAGATGAAGTGGTA       c.3360
 S  V  S  Y  P  P  G  A  V  T  E  H  K  V  K  T  D  E  V  V         p.1120

          .         .         .         .         .         .       g.55053
 ACACTAACACCACGCATTGGGCCAAAAGTATCTTTAAGTCCAGGGCCTGAACAAAAATAT       c.3420
 T  L  T  P  R  I  G  P  K  V  S  L  S  P  G  P  E  Q  K  Y         p.1140

          .         .         .         .         .         .       g.55113
 GAAACAGAAGGTAGTAGTACAACAGGATTTACATCATCTTTGAGTCCTTTTAGTACCCAC       c.3480
 E  T  E  G  S  S  T  T  G  F  T  S  S  L  S  P  F  S  T  H         p.1160

          .         .         .         .         .         .       g.55173
 ATTACCCAGCTTATGGAAGAAACCACTACTGAGAAAACATCCCTAGAGGATATTGATTTA       c.3540
 I  T  Q  L  M  E  E  T  T  T  E  K  T  S  L  E  D  I  D  L         p.1180

          .         .         .         .         .         .       g.55233
 GGCTCAGGATTATTTGAAAAGCCCAAAGCCACAGAACTCATAGAATTTTCAACAATCAAA       c.3600
 G  S  G  L  F  E  K  P  K  A  T  E  L  I  E  F  S  T  I  K         p.1200

          .         .         .         .         .         .       g.55293
 GTCACAGTTCCAAGTGATATTACCACTGCCTTCAGTTCAGTAGACAGACTTCACACAACT       c.3660
 V  T  V  P  S  D  I  T  T  A  F  S  S  V  D  R  L  H  T  T         p.1220

          .         .         .         .         .         .       g.55353
 TCAGCATTCAAGCCATCTTCCGCGATCACTAAGAAACCACCTCTCATCGACAGGGAACCT       c.3720
 S  A  F  K  P  S  S  A  I  T  K  K  P  P  L  I  D  R  E  P         p.1240

          .         .         .         .         .         .       g.55413
 GGTGAAGAAACAACCAGTGACATGGTAATCATTGGAGAATCAACATCTCATGTTCCTCCC       c.3780
 G  E  E  T  T  S  D  M  V  I  I  G  E  S  T  S  H  V  P  P         p.1260

          .         .         .         .         .         .       g.55473
 ACTACCCTTGAAGATATTGTAGCCAAGGAAACAGAAACCGATATTGATAGAGAGTATTTC       c.3840
 T  T  L  E  D  I  V  A  K  E  T  E  T  D  I  D  R  E  Y  F         p.1280

          .         .         .         .         .         .       g.55533
 ACGACTTCAAGTCCTCCTGCTACACAGCCAACAAGACCACCCACTGTGGAAGACAAAGAG       c.3900
 T  T  S  S  P  P  A  T  Q  P  T  R  P  P  T  V  E  D  K  E         p.1300

          .         .         .         .         .         .       g.55593
 GCCTTTGGACCTCAGGCGCTTTCTACGCCACAGCCCCCAGCAAGCACAAAATTTCACCCT       c.3960
 A  F  G  P  Q  A  L  S  T  P  Q  P  P  A  S  T  K  F  H  P         p.1320

          .         .         .         .    | 08    .         .    g.70350
 GACATTAATGTTTATATTATTGAGGTCAGAGAAAATAAGACAG | GTCGAATGAGTGATTTG    c.4020
 D  I  N  V  Y  I  I  E  V  R  E  N  K  T  G |   R  M  S  D  L      p.1340

          .         .         .         .         .         .       g.70410
 AGTGTAATTGGTCATCCAATAGATTCAGAATCTAAAGAAGATGAACCTTGTAGTGAAGAA       c.4080
 S  V  I  G  H  P  I  D  S  E  S  K  E  D  E  P  C  S  E  E         p.1360

          .         .         .         .         .         .       g.70470
 ACAGATCCAGTGCATGATCTAATGGCTGAAATTTTACCTGAATTCCCTGACATAATTGAA       c.4140
 T  D  P  V  H  D  L  M  A  E  I  L  P  E  F  P  D  I  I  E         p.1380

          .         .         .         .         .         .       g.70530
 ATAGACCTATACCACAGTGAAGAAAATGAAGAAGAAGAAGAAGAGTGTGCAAATGCTACT       c.4200
 I  D  L  Y  H  S  E  E  N  E  E  E  E  E  E  C  A  N  A  T         p.1400

          .         .         .         .         .         .       g.70590
 GATGTGACAACCACCCCATCTGTGCAGTACATAAATGGGAAGCATCTCGTTACCACTGTG       c.4260
 D  V  T  T  T  P  S  V  Q  Y  I  N  G  K  H  L  V  T  T  V         p.1420

          .         .         .         .         .         .       g.70650
 CCCAAGGACCCAGAAGCTGCAGAAGCTAGGCGTGGCCAGTTTGAAAGTGTTGCACCTTCT       c.4320
 P  K  D  P  E  A  A  E  A  R  R  G  Q  F  E  S  V  A  P  S         p.1440

          .         .         .         .         .         .       g.70710
 CAGAATTTCTCGGACAGCTCTGAAAGTGATACTCATCCATTTGTAATAGCCAAAACGGAA       c.4380
 Q  N  F  S  D  S  S  E  S  D  T  H  P  F  V  I  A  K  T  E         p.1460

          .         .         .         .         .         .       g.70770
 TTGTCTACTGCTGTGCAACCTAATGAATCTACAGAAACAACTGAGTCTCTTGAAGTTACA       c.4440
 L  S  T  A  V  Q  P  N  E  S  T  E  T  T  E  S  L  E  V  T         p.1480

          .         .         .         .         .         .       g.70830
 TGGAAGCCTGAGACTTACCCTGAAACATCAGAACATTTTTCAGGTGGTGAGCCTGATGTT       c.4500
 W  K  P  E  T  Y  P  E  T  S  E  H  F  S  G  G  E  P  D  V         p.1500

          .         .         .         .         .         .       g.70890
 TTCCCCACAGTCCCATTCCATGAGGAATTTGAAAGTGGAACAGCCAAAAAAGGGGCAGAA       c.4560
 F  P  T  V  P  F  H  E  E  F  E  S  G  T  A  K  K  G  A  E         p.1520

          .         .         .         .         .         .       g.70950
 TCAGTCACAGAGAGAGATACTGAAGTTGGTCATCAGGCACATGAACATACTGAACCTGTA       c.4620
 S  V  T  E  R  D  T  E  V  G  H  Q  A  H  E  H  T  E  P  V         p.1540

          .         .         .         .         .         .       g.71010
 TCTCTGTTTCCTGAAGAGTCTTCAGGAGAGATTGCCATTGACCAAGAATCTCAGAAAATA       c.4680
 S  L  F  P  E  E  S  S  G  E  I  A  I  D  Q  E  S  Q  K  I         p.1560

          .         .         .         .         .         .       g.71070
 GCCTTTGCAAGGGCTACAGAAGTAACATTTGGTGAAGAGGTAGAAAAAAGTACTTCTGTC       c.4740
 A  F  A  R  A  T  E  V  T  F  G  E  E  V  E  K  S  T  S  V         p.1580

          .         .         .         .         .         .       g.71130
 ACATACACTCCCACTATAGTTCCAAGTTCTGCATCAGCATATGTTTCAGAGGAAGAAGCA       c.4800
 T  Y  T  P  T  I  V  P  S  S  A  S  A  Y  V  S  E  E  E  A         p.1600

          .         .         .         .         .         .       g.71190
 GTTACCCTAATAGGAAATCCTTGGCCAGATGACCTGTTGTCTACCAAAGAAAGCTGGGTA       c.4860
 V  T  L  I  G  N  P  W  P  D  D  L  L  S  T  K  E  S  W  V         p.1620

          .         .         .         .         .         .       g.71250
 GAAGCAACTCCTAGACAAGTTGTAGAGCTCTCAGGGAGTTCTTCGATTCCAATTACAGAA       c.4920
 E  A  T  P  R  Q  V  V  E  L  S  G  S  S  S  I  P  I  T  E         p.1640

          .         .         .         .         .         .       g.71310
 GGCTCTGGAGAAGCAGAAGAAGATGAAGATACAATGTTCACCATGGTAACTGATTTATCA       c.4980
 G  S  G  E  A  E  E  D  E  D  T  M  F  T  M  V  T  D  L  S         p.1660

          .         .         .         .         .         .       g.71370
 CAGAGAAATACTACTGATACACTCATTACTTTAGACACTAGCAGGATAATCACAGAAAGC       c.5040
 Q  R  N  T  T  D  T  L  I  T  L  D  T  S  R  I  I  T  E  S         p.1680

          .         .         .         .         .         .       g.71430
 TTTTTTGAGGTTCCTGCAACCACCATTTATCCAGTTTCTGAACAACCTTCTGCAAAAGTG       c.5100
 F  F  E  V  P  A  T  T  I  Y  P  V  S  E  Q  P  S  A  K  V         p.1700

          .         .         .         .         .         .       g.71490
 GTGCCTACCAAGTTTGTAAGTGAAACAGACACTTCTGAGTGGATTTCCAGTACCACTGTT       c.5160
 V  P  T  K  F  V  S  E  T  D  T  S  E  W  I  S  S  T  T  V         p.1720

          .         .         .         .         .         .       g.71550
 GAGGAAAAGAAAAGGAAGGAGGAGGAGGGAACTACAGGTACGGCTTCTACATTTGAGGTA       c.5220
 E  E  K  K  R  K  E  E  E  G  T  T  G  T  A  S  T  F  E  V         p.1740

          .         .         .         .         .         .       g.71610
 TATTCATCTACACAGAGATCGGATCAATTAATTTTACCCTTTGAATTAGAAAGTCCAAAT       c.5280
 Y  S  S  T  Q  R  S  D  Q  L  I  L  P  F  E  L  E  S  P  N         p.1760

          .         .         .         .         .         .       g.71670
 GTAGCTACATCTAGTGATTCAGGTACCAGGAAAAGTTTTATGTCCTTGACAACACCAACA       c.5340
 V  A  T  S  S  D  S  G  T  R  K  S  F  M  S  L  T  T  P  T         p.1780

          .         .         .         .         .         .       g.71730
 CAGTCTGAAAGGGAAATGACAGATTCTACTCCTGTCTTTACAGAAACAAATACATTAGAA       c.5400
 Q  S  E  R  E  M  T  D  S  T  P  V  F  T  E  T  N  T  L  E         p.1800

          .         .         .         .         .         .       g.71790
 AATTTGGGGGCACAGACCACTGAGCACAGCAGTATCCATCAACCTGGGGTTCAGGAAGGG       c.5460
 N  L  G  A  Q  T  T  E  H  S  S  I  H  Q  P  G  V  Q  E  G         p.1820

          .         .         .         .         .         .       g.71850
 CTGACCACTCTCCCACGTAGTCCTGCCTCTGTCTTTATGGAGCAGGGCTCTGGAGAAGCT       c.5520
 L  T  T  L  P  R  S  P  A  S  V  F  M  E  Q  G  S  G  E  A         p.1840

          .         .         .         .         .         .       g.71910
 GCTGCCGACCCAGAAACCACCACTGTTTCTTCATTTTCATTAAACGTAGAGTATGCAATT       c.5580
 A  A  D  P  E  T  T  T  V  S  S  F  S  L  N  V  E  Y  A  I         p.1860

          .         .         .         .         .         .       g.71970
 CAAGCCGAAAAGGAAGTAGCTGGCACTTTGTCTCCGCATGTGGAAACTACATTCTCCACT       c.5640
 Q  A  E  K  E  V  A  G  T  L  S  P  H  V  E  T  T  F  S  T         p.1880

          .         .         .         .         .         .       g.72030
 GAGCCAACAGGACTGGTTTTGAGTACAGTAATGGACAGAGTAGTTGCTGAAAATATAACC       c.5700
 E  P  T  G  L  V  L  S  T  V  M  D  R  V  V  A  E  N  I  T         p.1900

          .         .         .         .         .         .       g.72090
 CAAACATCCAGGGAAATAGTGATTTCAGAGCGATTAGGAGAACCAAATTATGGGGCAGAA       c.5760
 Q  T  S  R  E  I  V  I  S  E  R  L  G  E  P  N  Y  G  A  E         p.1920

          .         .         .         .         .         .       g.72150
 ATAAGGGGCTTTTCCACAGGTTTTCCTTTGGAGGAAGATTTCAGTGGTGACTTTAGAGAA       c.5820
 I  R  G  F  S  T  G  F  P  L  E  E  D  F  S  G  D  F  R  E         p.1940

          .         .         .         .         .         .       g.72210
 TACTCAACAGTGTCTCATCCCATAGCAAAAGAAGAAACGGTAATGATGGAAGGCTCTGGA       c.5880
 Y  S  T  V  S  H  P  I  A  K  E  E  T  V  M  M  E  G  S  G         p.1960

          .         .         .         .         .         .       g.72270
 GATGCAGCATTTAGGGACACCCAGACTTCACCATCTACAGTACCTACTTCAGTTCACATC       c.5940
 D  A  A  F  R  D  T  Q  T  S  P  S  T  V  P  T  S  V  H  I         p.1980

          .         .         .         .         .         .       g.72330
 AGTCACATATCTGACTCAGAAGGACCCAGTAGCACCATGGTCAGCACTTCAGCCTTCCCC       c.6000
 S  H  I  S  D  S  E  G  P  S  S  T  M  V  S  T  S  A  F  P         p.2000

          .         .         .         .         .         .       g.72390
 TGGGAAGAGTTTACATCCTCAGCTGAGGGCTCAGGTGAGCAACTGGTCACAGTCAGCAGC       c.6060
 W  E  E  F  T  S  S  A  E  G  S  G  E  Q  L  V  T  V  S  S         p.2020

          .         .         .         .         .         .       g.72450
 TCTGTTGTTCCAGTGCTTCCCAGTGCTGTGCAAAAGTTTTCTGGTACAGCTTCCTCCATT       c.6120
 S  V  V  P  V  L  P  S  A  V  Q  K  F  S  G  T  A  S  S  I         p.2040

          .         .         .         .         .         .       g.72510
 ATCGACGAAGGATTGGGAGAAGTGGGTACTGTCAATGAAATTGATAGAAGATCCACCATT       c.6180
 I  D  E  G  L  G  E  V  G  T  V  N  E  I  D  R  R  S  T  I         p.2060

          .         .         .         .         .         .       g.72570
 TTACCAACAGCAGAAGTGGAAGGTACGAAAGCTCCAGTAGAGAAGGAGGAAGTAAAGGTC       c.6240
 L  P  T  A  E  V  E  G  T  K  A  P  V  E  K  E  E  V  K  V         p.2080

          .         .         .         .         .         .       g.72630
 AGTGGCACAGTTTCAACAAACTTTCCCCAAACTATAGAGCCAGCCAAATTATGGTCTAGG       c.6300
 S  G  T  V  S  T  N  F  P  Q  T  I  E  P  A  K  L  W  S  R         p.2100

          .         .         .         .         .         .       g.72690
 CAAGAAGTCAACCCTGTAAGACAAGAAATTGAAAGTGAAACAACATCAGAGGAACAAATT       c.6360
 Q  E  V  N  P  V  R  Q  E  I  E  S  E  T  T  S  E  E  Q  I         p.2120

          .         .         .         .         .         .       g.72750
 CAAGAAGAAAAGTCATTTGAATCCCCTCAAAACTCTCCTGCAACAGAACAAACAATCTTT       c.6420
 Q  E  E  K  S  F  E  S  P  Q  N  S  P  A  T  E  Q  T  I  F         p.2140

          .         .         .         .         .         .       g.72810
 GATTCACAGACATTTACTGAAACTGAACTCAAAACCACAGATTATTCTGTACTAACAACA       c.6480
 D  S  Q  T  F  T  E  T  E  L  K  T  T  D  Y  S  V  L  T  T         p.2160

          .         .         .         .         .         .       g.72870
 AAGAAAACTTACAGTGATGATAAAGAAATGAAGGAGGAAGACACTTCTTTAGTTAACATG       c.6540
 K  K  T  Y  S  D  D  K  E  M  K  E  E  D  T  S  L  V  N  M         p.2180

          .         .         .         .         .         .       g.72930
 TCTACTCCAGATCCAGATGCAAATGGCTTGGAATCTTACACAACTCTCCCTGAAGCTACT       c.6600
 S  T  P  D  P  D  A  N  G  L  E  S  Y  T  T  L  P  E  A  T         p.2200

          .         .         .         .         .         .       g.72990
 GAAAAGTCACATTTTTTCTTAGCTACTGCATTAGTAACTGAATCTATACCAGCTGAACAT       c.6660
 E  K  S  H  F  F  L  A  T  A  L  V  T  E  S  I  P  A  E  H         p.2220

          .         .         .         .         .         .       g.73050
 GTAGTCACAGATTCACCAATCAAAAAGGAAGAAAGTACAAAACATTTTCCGAAAGGCATG       c.6720
 V  V  T  D  S  P  I  K  K  E  E  S  T  K  H  F  P  K  G  M         p.2240

          .         .         .         .         .         .       g.73110
 AGACCAACAATTCAAGAGTCAGATACTGAGCTCTTATTCTCTGGACTGGGATCAGGAGAA       c.6780
 R  P  T  I  Q  E  S  D  T  E  L  L  F  S  G  L  G  S  G  E         p.2260

          .         .         .         .         .         .       g.73170
 GAAGTTTTACCTACTCTACCAACAGAGTCAGTGAATTTTACTGAAGTGGAACAAATCAAT       c.6840
 E  V  L  P  T  L  P  T  E  S  V  N  F  T  E  V  E  Q  I  N         p.2280

          .         .         .         .         .         .       g.73230
 AACACATTATATCCCCACACTTCTCAAGTGGAAAGTACCTCAAGTGACAAAATTGAAGAC       c.6900
 N  T  L  Y  P  H  T  S  Q  V  E  S  T  S  S  D  K  I  E  D         p.2300

          .         .         .         .         .         .       g.73290
 TTTAACAGAATGGAAAATGTGGCAAAAGAAGTTGGACCACTCGTATCTCAAACAGACATC       c.6960
 F  N  R  M  E  N  V  A  K  E  V  G  P  L  V  S  Q  T  D  I         p.2320

          .         .         .         .         .         .       g.73350
 TTTGAAGGTAGTGGGTCAGTAACCAGCACAACATTAATAGAAATTTTAAGTGACACTGGA       c.7020
 F  E  G  S  G  S  V  T  S  T  T  L  I  E  I  L  S  D  T  G         p.2340

          .         .         .         .         .         .       g.73410
 GCAGAAGGACCCACGGTGGCACCTCTCCCTTTCTCCACGGACATCGGACATCCTCAAAAT       c.7080
 A  E  G  P  T  V  A  P  L  P  F  S  T  D  I  G  H  P  Q  N         p.2360

          .         .         .         .         .         .       g.73470
 CAGACTGTCAGGTGGGCAGAAGAAATCCAGACTAGTAGACCACAAACCATAACTGAACAA       c.7140
 Q  T  V  R  W  A  E  E  I  Q  T  S  R  P  Q  T  I  T  E  Q         p.2380

          .         .         .         .         .         .       g.73530
 GACTCTAACAAGAATTCTTCAACAGCAGAAATTAACGAAACAACAACCTCATCTACTGAT       c.7200
 D  S  N  K  N  S  S  T  A  E  I  N  E  T  T  T  S  S  T  D         p.2400

          .         .         .         .         .         .       g.73590
 TTTCTGGCTAGAGCTTATGGTTTTGAAATGGCCAAAGAATTTGTTACATCAGCACCAAAA       c.7260
 F  L  A  R  A  Y  G  F  E  M  A  K  E  F  V  T  S  A  P  K         p.2420

          .         .         .         .         .         .       g.73650
 CCATCTGACTTGTATTATGAACCTTCTGGAGAAGGATCTGGAGAAGTGGATATTGTTGAT       c.7320
 P  S  D  L  Y  Y  E  P  S  G  E  G  S  G  E  V  D  I  V  D         p.2440

          .         .         .         .         .         .       g.73710
 TCATTTCACACTTCTGCAACTACTCAGGCAACCAGACAAGAAAGCAGCACCACATTTGTT       c.7380
 S  F  H  T  S  A  T  T  Q  A  T  R  Q  E  S  S  T  T  F  V         p.2460

          .         .         .         .         .         .       g.73770
 TCTGATGGGTCCCTGGAAAAACATCCTGAGGTGCCAAGCGCTAAAGCTGTTACTGCTGAT       c.7440
 S  D  G  S  L  E  K  H  P  E  V  P  S  A  K  A  V  T  A  D         p.2480

          .         .         .         .         .         .       g.73830
 GGATTCCCAACAGTTTCAGTGATGCTGCCTCTTCATTCAGAGCAGAACAAAAGCTCCCCT       c.7500
 G  F  P  T  V  S  V  M  L  P  L  H  S  E  Q  N  K  S  S  P         p.2500

          .         .         .         .         .         .       g.73890
 GATCCAACTAGCACACTGTCAAATACAGTGTCATATGAGAGGTCCACAGACGGTAGTTTC       c.7560
 D  P  T  S  T  L  S  N  T  V  S  Y  E  R  S  T  D  G  S  F         p.2520

          .         .         .         .         .         .       g.73950
 CAAGACCGTTTCAGGGAATTCGAGGATTCCACCTTAAAACCTAACAGAAAAAAACCCACT       c.7620
 Q  D  R  F  R  E  F  E  D  S  T  L  K  P  N  R  K  K  P  T         p.2540

          .         .         .         .         .         .       g.74010
 GAAAATATTATCATAGACCTGGACAAAGAGGACAAGGATTTAATATTGACAATTACAGAG       c.7680
 E  N  I  I  I  D  L  D  K  E  D  K  D  L  I  L  T  I  T  E         p.2560

          .         .         .         .         .         .       g.74070
 AGTACCATCCTTGAAATTCTACCTGAGCTGACATCGGATAAAAATACTATCATAGATATT       c.7740
 S  T  I  L  E  I  L  P  E  L  T  S  D  K  N  T  I  I  D  I         p.2580

          .         .         .         .         .         .       g.74130
 GATCATACTAAACCTGTGTATGAAGACATTCTTGGAATGCAAACAGATATAGATACAGAG       c.7800
 D  H  T  K  P  V  Y  E  D  I  L  G  M  Q  T  D  I  D  T  E         p.2600

          .         .         .         .         .         .       g.74190
 GTACCATCAGAACCACATGACAGTAATGATGAAAGTAATGATGACAGCACTCAAGTTCAA       c.7860
 V  P  S  E  P  H  D  S  N  D  E  S  N  D  D  S  T  Q  V  Q         p.2620

          .         .         .         .         .         .       g.74250
 GAGATCTATGAGGCAGCTGTCAACCTTTCTTTAACTGAGGAAACATTTGAGGGCTCTGCT       c.7920
 E  I  Y  E  A  A  V  N  L  S  L  T  E  E  T  F  E  G  S  A         p.2640

          .         .         .         .         .         .       g.74310
 GATGTTCTGGCTAGCTACACTCAGGCAACACATGATGAATCAATGACTTATGAAGATAGA       c.7980
 D  V  L  A  S  Y  T  Q  A  T  H  D  E  S  M  T  Y  E  D  R         p.2660

          .         .         .         .         .         .       g.74370
 AGCCAACTAGATCACATGGGCTTTCACTTCACAACTGGGATCCCTGCTCCTAGCACAGAA       c.8040
 S  Q  L  D  H  M  G  F  H  F  T  T  G  I  P  A  P  S  T  E         p.2680

          .         .         .         .         .         .       g.74430
 ACAGAATTAGACGTTTTACTTCCCACGGCAACATCCCTGCCAATTCCTCGTAAGTCTGCC       c.8100
 T  E  L  D  V  L  L  P  T  A  T  S  L  P  I  P  R  K  S  A         p.2700

          .         .         .         .         .         .       g.74490
 ACAGTTATTCCAGAGATTGAAGGAATAAAAGCTGAAGCAAAAGCCCTGGATGACATGTTT       c.8160
 T  V  I  P  E  I  E  G  I  K  A  E  A  K  A  L  D  D  M  F         p.2720

          .         .         .         .         .         .       g.74550
 GAATCAAGCACTTTGTCTGATGGTCAAGCTATTGCAGACCAAAGTGAAATAATACCAACA       c.8220
 E  S  S  T  L  S  D  G  Q  A  I  A  D  Q  S  E  I  I  P  T         p.2740

          .         .         .         .         .         .       g.74610
 TTGGGCCAATTTGAAAGGACTCAGGAGGAGTATGAAGACAAAAAACATGCTGGTCCTTCT       c.8280
 L  G  Q  F  E  R  T  Q  E  E  Y  E  D  K  K  H  A  G  P  S         p.2760

          .         .         .         .         .         .       g.74670
 TTTCAGCCAGAATTCTCTTCAGGAGCTGAGGAGGCATTAGTAGACCATACTCCCTATCTA       c.8340
 F  Q  P  E  F  S  S  G  A  E  E  A  L  V  D  H  T  P  Y  L         p.2780

          .         .         .         .         .         .       g.74730
 AGTATTGCTACTACCCACCTTATGGATCAGAGTGTAACAGAGGTGCCTGATGTGATGGAA       c.8400
 S  I  A  T  T  H  L  M  D  Q  S  V  T  E  V  P  D  V  M  E         p.2800

          .         .         .         .         .         .       g.74790
 GGATCCAATCCCCCATATTACACTGATACAACATTAGCAGTTTCAACATTTGCGAAGTTG       c.8460
 G  S  N  P  P  Y  Y  T  D  T  T  L  A  V  S  T  F  A  K  L         p.2820

          .         .         .         .         .         .       g.74850
 TCTTCTCAGACACCATCATCTCCCCTCACTATCTACTCAGGCAGTGAAGCCTCTGGACAC       c.8520
 S  S  Q  T  P  S  S  P  L  T  I  Y  S  G  S  E  A  S  G  H         p.2840

          .         .         .         .         .         .       g.74910
 ACAGAGATCCCCCAGCCCAGTGCTCTGCCAGGAATAGACGTCGGCTCATCTGTAATGTCC       c.8580
 T  E  I  P  Q  P  S  A  L  P  G  I  D  V  G  S  S  V  M  S         p.2860

          .         .         .         .         .         .       g.74970
 CCACAGGATTCTTTTAAGGAAATTCATGTAAATATTGAAGCGACTTTCAAACCATCAAGT       c.8640
 P  Q  D  S  F  K  E  I  H  V  N  I  E  A  T  F  K  P  S  S         p.2880

          .         .         .         .         .         .       g.75030
 GAGGAATACCTTCACATAACTGAGCCTCCCTCTTTATCTCCTGACACAAAATTAGAACCT       c.8700
 E  E  Y  L  H  I  T  E  P  P  S  L  S  P  D  T  K  L  E  P         p.2900

          .         .         .         .         .         .       g.75090
 TCAGAAGATGATGGTAAACCTGAGTTATTAGAAGAAATGGAAGCTTCTCCCACAGAACTT       c.8760
 S  E  D  D  G  K  P  E  L  L  E  E  M  E  A  S  P  T  E  L         p.2920

          .         .         .         .         .         .       g.75150
 ATTGCTGTGGAAGGAACTGAGATTCTCCAAGATTTCCAAAACAAAACCGATGGTCAAGTT       c.8820
 I  A  V  E  G  T  E  I  L  Q  D  F  Q  N  K  T  D  G  Q  V         p.2940

          .         .         .         .         .         .       g.75210
 TCTGGAGAAGCAATCAAGATGTTTCCCACCATTAAAACACCTGAGGCTGGAACTGTTATT       c.8880
 S  G  E  A  I  K  M  F  P  T  I  K  T  P  E  A  G  T  V  I         p.2960

          .         .         .         .         .         .       g.75270
 ACAACTGCCGATGAAATTGAATTAGAAGGTGCTACACAGTGGCCACACTCTACTTCTGCT       c.8940
 T  T  A  D  E  I  E  L  E  G  A  T  Q  W  P  H  S  T  S  A         p.2980

          .         .         .         .         .         .       g.75330
 TCTGCCACCTATGGGGTCGAGGCAGGTGTGGTGCCTTGGCTAAGTCCACAGACTTCTGAG       c.9000
 S  A  T  Y  G  V  E  A  G  V  V  P  W  L  S  P  Q  T  S  E         p.3000

          .         .         .         .         .         .       g.75390
 AGGCCCACGCTTTCTTCTTCTCCAGAAATAAACCCTGAAACTCAAGCAGCTTTAATCAGA       c.9060
 R  P  T  L  S  S  S  P  E  I  N  P  E  T  Q  A  A  L  I  R         p.3020

          .         .         .         .         .         .       g.75450
 GGGCAGGATTCCACGATAGCAGCATCAGAACAGCAAGTGGCAGCGAGAATTCTTGATTCC       c.9120
 G  Q  D  S  T  I  A  A  S  E  Q  Q  V  A  A  R  I  L  D  S         p.3040

          .         .         .         .         .         .       g.75510
 AATGATCAGGCAACAGTAAACCCTGTGGAATTTAATACTGAGGTTGCAACACCACCATTT       c.9180
 N  D  Q  A  T  V  N  P  V  E  F  N  T  E  V  A  T  P  P  F         p.3060

          .         .         .         .         .         .       g.75570
 TCCCTTCTGGAGACTTCTAATGAAACAGATTTCCTGATTGGCATTAATGAAGAGTCAGTG       c.9240
 S  L  L  E  T  S  N  E  T  D  F  L  I  G  I  N  E  E  S  V         p.3080

          .         .      | 09  .         .         .         .    g.78898
 GAAGGCACGGCAATCTATTTACCAG | GACCTGATCGCTGCAAAATGAACCCGTGCCTTAAC    c.9300
 E  G  T  A  I  Y  L  P  G |   P  D  R  C  K  M  N  P  C  L  N      p.3100

          .         .         .         .         .         .       g.78958
 GGAGGCACCTGTTATCCTACTGAAACTTCCTACGTATGCACCTGTGTGCCAGGATACAGC       c.9360
 G  G  T  C  Y  P  T  E  T  S  Y  V  C  T  C  V  P  G  Y  S         p.3120

          .          | 10        .         .         .         .    g.81338
 GGAGACCAGTGTGAACTTG | ATTTTGATGAATGTCACTCTAATCCCTGTCGTAATGGAGCC    c.9420
 G  D  Q  C  E  L  D |   F  D  E  C  H  S  N  P  C  R  N  G  A      p.3140

          .         .         .         .         .         .       g.81398
 ACTTGTGTTGATGGTTTTAACACATTCAGGTGCCTCTGCCTTCCAAGTTATGTTGGTGCA       c.9480
 T  C  V  D  G  F  N  T  F  R  C  L  C  L  P  S  Y  V  G  A         p.3160

          .    | 11    .         .         .         .         .    g.86737
 CTTTGTGAGCAAG | ATACCGAGACATGTGACTATGGCTGGCACAAATTCCAAGGGCAGTGC    c.9540
 L  C  E  Q  D |   T  E  T  C  D  Y  G  W  H  K  F  Q  G  Q  C      p.3180

          .         .         .         .         .         .       g.86797
 TACAAATACTTTGCCCATCGACGCACATGGGATGCAGCTGAACGGGAATGCCGTCTGCAG       c.9600
 Y  K  Y  F  A  H  R  R  T  W  D  A  A  E  R  E  C  R  L  Q         p.3200

          .         .         .         .         .   | 12     .    g.88290
 GGTGCCCATCTCACAAGCATCCTGTCTCACGAAGAACAAATGTTTGTTAATC | GTGTGGGC    c.9660
 G  A  H  L  T  S  I  L  S  H  E  E  Q  M  F  V  N  R |   V  G      p.3220

          .         .         .         .         .         .       g.88350
 CATGATTATCAGTGGATAGGCCTCAATGACAAGATGTTTGAGCATGACTTCCGTTGGACT       c.9720
 H  D  Y  Q  W  I  G  L  N  D  K  M  F  E  H  D  F  R  W  T         p.3240

          .      | 13  .         .         .         .         .    g.105787
 GATGGCAGCACACTG | CAATACGAGAATTGGAGACCCAACCAGCCAGACAGCTTCTTTTCT    c.9780
 D  G  S  T  L   | Q  Y  E  N  W  R  P  N  Q  P  D  S  F  F  S      p.3260

          .         .         .         .         .         .       g.105847
 GCTGGAGAAGACTGTGTTGTAATCATTTGGCATGAGAATGGCCAGTGGAATGATGTTCCC       c.9840
 A  G  E  D  C  V  V  I  I  W  H  E  N  G  Q  W  N  D  V  P         p.3280

          .         .         .         . | 14       .         .    g.113326
 TGCAATTACCATCTCACCTATACGTGCAAGAAAGGAACAG | TCGCTTGCGGCCAGCCCCCT    c.9900
 C  N  Y  H  L  T  Y  T  C  K  K  G  T  V |   A  C  G  Q  P  P      p.3300

          .         .         .         .         .         .       g.113386
 GTTGTAGAAAATGCCAAGACCTTTGGAAAGATGAAACCTCGTTATGAAATCAACTCCCTG       c.9960
 V  V  E  N  A  K  T  F  G  K  M  K  P  R  Y  E  I  N  S  L         p.3320

          .         .         .         .         .         .       g.113446
 ATTAGATACCACTGCAAAGATGGTTTCATTCAACGTCACCTTCCAACTATCCGGTGCTTA       c.10020
 I  R  Y  H  C  K  D  G  F  I  Q  R  H  L  P  T  I  R  C  L         p.3340

          .         .         .         .    | 15    .         .    g.113650
 GGAAATGGAAGATGGGCTATACCTAAAATTACCTGCATGAACC | CATCTGCATACCAAAGG    c.10080
 G  N  G  R  W  A  I  P  K  I  T  C  M  N  P |   S  A  Y  Q  R      p.3360

          .         .         .         .         .         .       g.113710
 ACTTATTCTATGAAATACTTTAAAAATTCCTCATCAGCAAAGGACAATTCAATAAATACA       c.10140
 T  Y  S  M  K  Y  F  K  N  S  S  S  A  K  D  N  S  I  N  T         p.3380

          .         .         .         .         .                 g.113761
 TCCAAACATGATCATCGTTGGAGCCGGAGGTGGCAGGAGTCGAGGCGCTGA                c.10191
 S  K  H  D  H  R  W  S  R  R  W  Q  E  S  R  R  X                  p.3396

          .         .         .         .         .         .       g.113821
 tccctaaaatggcgaacatgtgttttcatcatttcagccaaagtcctaacttcctgtgcc       c.*60

          .         .         .         .         .         .       g.113881
 tttcctatcacctcgagaagtaattatcagttggtttggatttttggaccaccgttcagt       c.*120

          .         .         .         .         .         .       g.113941
 cattttgggttgccgtgctcccaaaacattttaaatgaaagtattggcattcaaaaagac       c.*180

          .         .         .         .         .         .       g.114001
 agcagacaaaatgaaagaaaatgagagcagaaagtaagcatttccagcctatctaatttc       c.*240

          .         .         .         .         .         .       g.114061
 tttagttttctatttgcctccagtgcagtccatttcctaatgtataccagcctactgtac       c.*300

          .         .         .         .         .         .       g.114121
 tatttaaaatgctcaatttcagcaccgatggccatgtaaataagatgatttaatgttgat       c.*360

          .         .         .         .         .         .       g.114181
 tttaatcctgtatataaaataaaaagtcacaatgagtttgggcatatttaatgatgatta       c.*420

          .         .         .         .         .         .       g.114241
 tggagccttagaggtctttaatcattggttcggctgcttttatgtagtttaggctggaaa       c.*480

          .         .         .         .         .         .       g.114301
 tggtttcacttgctctttgactgtcagcaagactgaagatggcttttcctggacagctag       c.*540

          .         .         .         .         .         .       g.114361
 aaaacacaaaatcttgtaggtcattgcacctatctcagccataggtgcagtttgcttcta       c.*600

          .         .         .         .         .         .       g.114421
 catgatgctaaaggctgcgaatgggatcctgatggaactaaggactccaatgtcgaactc       c.*660

          .         .         .         .         .         .       g.114481
 ttctttgctgcattcctttttcttcacttacaagaaaggcctgaatggaggacttttctg       c.*720

          .         .         .         .         .         .       g.114541
 taaccaggaacattttttaggggtcaaagtgctaataattaactcaaccaggtctacttt       c.*780

          .         .         .         .         .         .       g.114601
 ttaatggctttcataacactaactcataaggttaccgatcaatgcatttcatacggatat       c.*840

          .         .         .         .         .         .       g.114661
 agacctagggctctggagggtgggggattgttaaaacacatgcaaaaaaaaaaaaaaaaa       c.*900

          .         .         .         .         .         .       g.114721
 aaaaaaaagaaattttgtatatataaccattttaatcttttataaagttttgaatgttca       c.*960

          .         .         .         .         .         .       g.114781
 tgtatgaatgctgcagctgtgaagcatacataaataaatgaagtaagccatactgattta       c.*1020

          .         .         .         .         .         .       g.114841
 atttattggatgttattttccctaagacctgaaaatgaacatagtatgctagttattttt       c.*1080

          .         .         .         .         .         .       g.114901
 cagtgttagccttttactttcctcacacaatttggaatcatataatataggtactttgtc       c.*1140

          .         .         .         .         .         .       g.114961
 cctgattaaataatgtgacggatagaatgcatcaagtgtttattatgaaaagagtggaaa       c.*1200

          .         .         .         .         .         .       g.115021
 agtatatagcttttagcaaaaggtgtttgcccattctaagaaatgagcgaatatatagaa       c.*1260

          .         .         .         .         .         .       g.115081
 atagtgtgggcatttcttcctgttaggtggagtgtatgtgttgacatttctccccatctc       c.*1320

          .         .         .         .         .         .       g.115141
 ttcccactctgttttctccccattatttgaataaagtgactgctgaagatgactttgaat       c.*1380

          .         .         .         .         .         .       g.115201
 ccttatccacttaatttaatgtttaaagaaaaacctgtaatggaaagtaagactccttcc       c.*1440

          .         .         .         .         .         .       g.115261
 ctaatttcagtttagagcaacttgaagaagagtagacaaaaaataaaatgcacatagaaa       c.*1500

          .         .         .         .         .         .       g.115321
 aagagaaaaagggcacaaagggattggcccaatattgattctttttttataaaacctcct       c.*1560

          .         .         .         .         .         .       g.115381
 ttggcttagaaggaatgactctagctacaataatacacagtatgtttaagcaggttccct       c.*1620

          .         .         .         .         .         .       g.115441
 tggttgttgcattaaatgtaatccacctttaggtattttagagcacagaacaacactgtg       c.*1680

          .         .         .         .         .         .       g.115501
 ttgatctagtaggtttctatttttcctttctctttacaatgcacataatactttcctgta       c.*1740

          .         .         .         .         .         .       g.115561
 tttatatcataacgtgtatagtgtaaaatgtgaatgactttttttgtgaatgaaaatcta       c.*1800

          .         .         .         .         .         .       g.115621
 aaatctttgtaactttttatatctgcttttgtttcaccaaagaaacctaaaatccttctt       c.*1860

                                                                    g.115630
 ttactacac                                                          c.*1869

 (downstream sequence)
Legend:
Nucleotide numbering (following the rules of the HGVS for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The Versican protein sequence is shown below the coding DNA sequence, with numbering indicated at the right starting with 1 for the translation initiating Methionine. Every 10th amino acid is shown in bold. The position of introns is indicated by a vertical line, splitting the two exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frame shift variants, all stop codons in the +1 frame are shown in bold while all stop codons in the +2 frame are underlined.

Powered by LOVD v.3.0 Build 25b
©2004-2020 Leiden University Medical Center