polycystic kidney disease 1 (autosomal dominant) (PKD1) - coding DNA reference sequence

(used for variant description)

(last modified June 17, 2014)


This file was created to facilitate the description of sequence variants on transcript NM_001009944.2 in the PKD1 gene based on a coding DNA reference sequence following the HGVS recommendations.

The sequence was taken from NG_008617.1, covering PKD1 transcript NM_001009944.2.


Please note that introns are available by clicking on the exon numbers above the sequence.
 (upstream sequence)
                                         .         .                g.5029
                                gcactgcagcgccagcgtccgagcgggcg       c.-181

 .         .         .         .         .         .                g.5089
 gccgagctcccggagcggcctggccccgagccccgagcgggcgtcgctcagcagcaggtc       c.-121

 .         .         .         .         .         .                g.5149
 gcggccgcagccccatccagccccgcgcccgccatgccgtccgcgggccccgcctgagct       c.-61

 .         .         .         .         .         .                g.5209
 gcggcctccgcgcgcgggcgggcctggggacggcggggccatgcgcgcgctgccctaacg       c.-1

          .         .         .         .         .         .       g.5269
 ATGCCGCCCGCCGCGCCCGCCCGCCTGGCGCTGGCCCTGGGCCTGGGCCTGTGGCTCGGG       c.60
 M  P  P  A  A  P  A  R  L  A  L  A  L  G  L  G  L  W  L  G         p.20

          .         .         .         .         .         .       g.5329
 GCGCTGGCGGGGGGCCCCGGGCGCGGCTGCGGGCCCTGCGAGCCCCCCTGCCTCTGCGGC       c.120
 A  L  A  G  G  P  G  R  G  C  G  P  C  E  P  P  C  L  C  G         p.40

          .         .         .         .         .         .       g.5389
 CCAGCGCCCGGCGCCGCCTGCCGCGTCAACTGCTCGGGCCGCGGGCTGCGGACGCTCGGT       c.180
 P  A  P  G  A  A  C  R  V  N  C  S  G  R  G  L  R  T  L  G         p.60

          .         .         .      | 02  .         .         .    g.21545
 CCCGCGCTGCGCATCCCCGCGGACGCCACAGCGCT | AGACGTCTCCCACAACCTGCTCCGG    c.240
 P  A  L  R  I  P  A  D  A  T  A  L  |  D  V  S  H  N  L  L  R      p.80

          .         .         .         .        | 03.         .    g.21726
 GCGCTGGACGTTGGGCTCCTGGCGAACCTCTCGGCGCTGGCAGAGCT | GGATATAAGCAAC    c.300
 A  L  D  V  G  L  L  A  N  L  S  A  L  A  E  L  |  D  I  S  N      p.100

          .         .         .         .         .          | 04    g.22054
 AACAAGATTTCTACGTTAGAAGAAGGAATATTTGCTAATTTATTTAATTTAAGTGAAAT | A    c.360
 N  K  I  S  T  L  E  E  G  I  F  A  N  L  F  N  L  S  E  I  |      p.120

          .         .         .         .         .         .       g.22114
 AACCTGAGTGGGAACCCGTTTGAGTGTGACTGTGGCCTGGCGTGGCTGCCGCGATGGGCG       c.420
 N  L  S  G  N  P  F  E  C  D  C  G  L  A  W  L  P  R  W  A         p.140

          .         .         .         .         .         .       g.22174
 GAGGAGCAGCAGGTGCGGGTGGTGCAGCCCGAGGCAGCCACGTGTGCTGGGCCTGGCTCC       c.480
 E  E  Q  Q  V  R  V  V  Q  P  E  A  A  T  C  A  G  P  G  S         p.160

          .         .         .         .          | 05        .    g.22447
 CTGGCTGGCCAGCCTCTGCTTGGCATCCCCTTGCTGGACAGTGGCTGTG | GTGAGGAGTAT    c.540
 L  A  G  Q  P  L  L  G  I  P  L  L  D  S  G  C  G |   E  E  Y      p.180

          .         .         .         .         .         .       g.22507
 GTCGCCTGCCTCCCTGACAACAGCTCAGGCACCGTGGCAGCAGTGTCCTTTTCAGCTGCC       c.600
 V  A  C  L  P  D  N  S  S  G  T  V  A  A  V  S  F  S  A  A         p.200

          .         .         .         .         .         .       g.22567
 CACGAAGGCCTGCTTCAGCCAGAGGCCTGCAGCGCCTTCTGCTTCTCCACCGGCCAGGGC       c.660
 H  E  G  L  L  Q  P  E  A  C  S  A  F  C  F  S  T  G  Q  G         p.220

          .         .         .         .         .         .       g.22627
 CTCGCAGCCCTCTCGGAGCAGGGCTGGTGCCTGTGTGGGGCGGCCCAGCCCTCCAGTGCC       c.720
 L  A  A  L  S  E  Q  G  W  C  L  C  G  A  A  Q  P  S  S  A         p.240

          .         .         .         .         .         .       g.22687
 TCCTTTGCCTGCCTGTCCCTCTGCTCCGGCCCCCCGCCACCTCCTGCCCCCACCTGTAGG       c.780
 S  F  A  C  L  S  L  C  S  G  P  P  P  P  P  A  P  T  C  R         p.260

          .         .         .         .         .         .       g.22747
 GGCCCCACCCTCCTCCAGCACGTCTTCCCTGCCTCCCCAGGGGCCACCCTGGTGGGGCCC       c.840
 G  P  T  L  L  Q  H  V  F  P  A  S  P  G  A  T  L  V  G  P         p.280

          .         .         .         .         .         .       g.22807
 CACGGACCTCTGGCCTCTGGCCAGCTAGCAGCCTTCCACATCGCTGCCCCGCTCCCTGTC       c.900
 H  G  P  L  A  S  G  Q  L  A  A  F  H  I  A  A  P  L  P  V         p.300

          .         .         .         .         .         .       g.22867
 ACTGCCACACGCTGGGACTTCGGAGACGGCTCCGCCGAGGTGGATGCCGCTGGGCCGGCT       c.960
 T  A  T  R  W  D  F  G  D  G  S  A  E  V  D  A  A  G  P  A         p.320

          .         .         .         .         .         .       g.22927
 GCCTCGCATCGCTATGTGCTGCCTGGGCGCTATCACGTGACGGCCGTGCTGGCCCTGGGG       c.1020
 A  S  H  R  Y  V  L  P  G  R  Y  H  V  T  A  V  L  A  L  G         p.340

          .         .         .         .         .         .       g.22987
 GCCGGCTCAGCCCTGCTGGGGACAGACGTGCAGGTGGAAGCGGCACCTGCCGCCCTGGAG       c.1080
 A  G  S  A  L  L  G  T  D  V  Q  V  E  A  A  P  A  A  L  E         p.360

          .         .         .         .         .         .       g.23047
 CTCGTGTGCCCGTCCTCGGTGCAGAGTGACGAGAGCCTCGACCTCAGCATCCAGAACCGC       c.1140
 L  V  C  P  S  S  V  Q  S  D  E  S  L  D  L  S  I  Q  N  R         p.380

          .         .         .         .         .         .       g.23107
 GGTGGTTCAGGCCTGGAGGCCGCCTACAGCATCGTGGCCCTGGGCGAGGAGCCGGCCCGA       c.1200
 G  G  S  G  L  E  A  A  Y  S  I  V  A  L  G  E  E  P  A  R         p.400

   | 06      .         .         .         .         .         .    g.23285
 G | CGGTGCACCCGCTCTGCCCCTCGGACACGGAGATCTTCCCTGGCAACGGGCACTGCTAC    c.1260
 A |   V  H  P  L  C  P  S  D  T  E  I  F  P  G  N  G  H  C  Y      p.420

          .         .         .         .         .         .       g.23345
 CGCCTGGTGGTGGAGAAGGCGGCCTGGCTGCAGGCGCAGGAGCAGTGTCAGGCCTGGGCC       c.1320
 R  L  V  V  E  K  A  A  W  L  Q  A  Q  E  Q  C  Q  A  W  A         p.440

          .         .         .         .         .         .       g.23405
 GGGGCCGCCCTGGCAATGGTGGACAGTCCCGCCGTGCAGCGCTTCCTGGTCTCCCGGGTC       c.1380
 G  A  A  L  A  M  V  D  S  P  A  V  Q  R  F  L  V  S  R  V         p.460

       | 07  .         .         .         .         .         .    g.23900
 ACCAG | GAGCCTAGACGTGTGGATCGGCTTCTCGACTGTGCAGGGGGTGGAGGTGGGCCCA    c.1440
 T  R  |  S  L  D  V  W  I  G  F  S  T  V  Q  G  V  E  V  G  P      p.480

          .         .         .         .         .         .       g.23960
 GCGCCGCAGGGCGAGGCCTTCAGCCTGGAGAGCTGCCAGAACTGGCTGCCCGGGGAGCCA       c.1500
 A  P  Q  G  E  A  F  S  L  E  S  C  Q  N  W  L  P  G  E  P         p.500

          .         .         .         .         .         .       g.24020
 CACCCAGCCACAGCCGAGCACTGCGTCCGGCTCGGGCCCACCGGGTGGTGTAACACCGAC       c.1560
 H  P  A  T  A  E  H  C  V  R  L  G  P  T  G  W  C  N  T  D         p.520

          .         .         .         .       | 08 .         .    g.24268
 CTGTGCTCAGCGCCGCACAGCTACGTCTGCGAGCTGCAGCCCGGAG | GCCCAGTGCAGGAT    c.1620
 L  C  S  A  P  H  S  Y  V  C  E  L  Q  P  G  G |   P  V  Q  D      p.540

          .         .         .         .         .         .       g.24328
 GCCGAGAACCTCCTCGTGGGAGCGCCCAGTGGGGACCTGCAGGGACCCCTGACGCCTCTG       c.1680
 A  E  N  L  L  V  G  A  P  S  G  D  L  Q  G  P  L  T  P  L         p.560

          .         .         .         .   | 09     .         .    g.24798
 GCACAGCAGGACGGCCTCTCAGCCCCGCACGAGCCCGTGGAG | GTCATGGTATTCCCGGGC    c.1740
 A  Q  Q  D  G  L  S  A  P  H  E  P  V  E   | V  M  V  F  P  G      p.580

          .         .         .         .         .         .       g.24858
 CTGCGTCTGAGCCGTGAAGCCTTCCTCACCACGGCCGAATTTGGGACCCAGGAGCTCCGG       c.1800
 L  R  L  S  R  E  A  F  L  T  T  A  E  F  G  T  Q  E  L  R         p.600

          .         .         .         .          | 10        .    g.25284
 CGGCCCGCCCAGCTGCGGCTGCAGGTGTACCGGCTCCTCAGCACAGCAG | GGACCCCGGAG    c.1860
 R  P  A  Q  L  R  L  Q  V  Y  R  L  L  S  T  A  G |   T  P  E      p.620

          .         .         .         .         .         .       g.25344
 AACGGCAGCGAGCCTGAGAGCAGGTCCCCGGACAACAGGACCCAGCTGGCCCCCGCGTGC       c.1920
 N  G  S  E  P  E  S  R  S  P  D  N  R  T  Q  L  A  P  A  C         p.640

          .         .         .         .         .         .       g.25404
 ATGCCAGGGGGACGCTGGTGCCCTGGAGCCAACATCTGCTTGCCGCTGGACGCCTCCTGC       c.1980
 M  P  G  G  R  W  C  P  G  A  N  I  C  L  P  L  D  A  S  C         p.660

          .         .         .         .         .         .       g.25464
 CACCCCCAGGCCTGCGCCAATGGCTGCACGTCAGGGCCAGGGCTACCCGGGGCCCCCTAT       c.2040
 H  P  Q  A  C  A  N  G  C  T  S  G  P  G  L  P  G  A  P  Y         p.680

          .         .         .         .         .        | 11.    g.25976
 GCGCTATGGAGAGAGTTCCTCTTCTCCGTTCCCGCGGGGCCCCCCGCGCAGTACTCG | GTC    c.2100
 A  L  W  R  E  F  L  F  S  V  P  A  G  P  P  A  Q  Y  S   | V      p.700

          .         .         .         .         .         .       g.26036
 ACCCTCCACGGCCAGGATGTCCTCATGCTCCCTGGTGACCTCGTTGGCTTGCAGCACGAC       c.2160
 T  L  H  G  Q  D  V  L  M  L  P  G  D  L  V  G  L  Q  H  D         p.720

          .         .         .         .         .         .       g.26096
 GCTGGCCCTGGCGCCCTCCTGCACTGCTCGCCGGCTCCCGGCCACCCTGGTCCCCGGGCC       c.2220
 A  G  P  G  A  L  L  H  C  S  P  A  P  G  H  P  G  P  R  A         p.740

          .         .         .         .         .         .       g.26156
 CCGTACCTCTCCGCCAACGCCTCGTCATGGCTGCCCCACTTGCCAGCCCAGCTGGAGGGC       c.2280
 P  Y  L  S  A  N  A  S  S  W  L  P  H  L  P  A  Q  L  E  G         p.760

          .         .         .         .         .         .       g.26216
 ACTTGGGCCTGCCCTGCCTGTGCCCTGCGGCTGCTTGCAGCCACGGAACAGCTCACCGTG       c.2340
 T  W  A  C  P  A  C  A  L  R  L  L  A  A  T  E  Q  L  T  V         p.780

          .         .         .         .         .         .       g.26276
 CTGCTGGGCTTGAGGCCCAACCCTGGACTGCGGCTGCCTGGGCGCTATGAGGTCCGGGCA       c.2400
 L  L  G  L  R  P  N  P  G  L  R  L  P  G  R  Y  E  V  R  A         p.800

          .         .         .         .         .         .       g.26336
 GAGGTGGGCAATGGCGTGTCCAGGCACAACCTCTCCTGCAGCTTTGACGTGGTCTCCCCA       c.2460
 E  V  G  N  G  V  S  R  H  N  L  S  C  S  F  D  V  V  S  P         p.820

          .         .         .         .         .         .       g.26396
 GTGGCTGGGCTGCGGGTCATCTACCCTGCCCCCCGCGACGGCCGCCTCTACGTGCCCACC       c.2520
 V  A  G  L  R  V  I  Y  P  A  P  R  D  G  R  L  Y  V  P  T         p.840

          .         .         .         .         .         .       g.26456
 AACGGCTCAGCCTTGGTGCTCCAGGTGGACTCTGGTGCCAACGCCACGGCCACGGCTCGC       c.2580
 N  G  S  A  L  V  L  Q  V  D  S  G  A  N  A  T  A  T  A  R         p.860

          .         .         .         .         .         .       g.26516
 TGGCCTGGGGGCAGTGTCAGCGCCCGCTTTGAGAATGTCTGCCCTGCCCTGGTGGCCACC       c.2640
 W  P  G  G  S  V  S  A  R  F  E  N  V  C  P  A  L  V  A  T         p.880

          .         .         .         .         .         .       g.26576
 TTCGTGCCCGGCTGCCCCTGGGAGACCAACGATACCCTGTTCTCAGTGGTAGCACTGCCG       c.2700
 F  V  P  G  C  P  W  E  T  N  D  T  L  F  S  V  V  A  L  P         p.900

          .         .         .         .         .         .       g.26636
 TGGCTCAGTGAGGGGGAGCACGTGGTGGACGTGGTGGTGGAAAACAGCGCCAGCCGGGCC       c.2760
 W  L  S  E  G  E  H  V  V  D  V  V  V  E  N  S  A  S  R  A         p.920

          .         .         .         .         .         .       g.26696
 AACCTCAGCCTGCGGGTGACGGCGGAGGAGCCCATCTGTGGCCTCCGCGCCACGCCCAGC       c.2820
 N  L  S  L  R  V  T  A  E  E  P  I  C  G  L  R  A  T  P  S         p.940

          .         .         .    | 12    .         .         .    g.27633
 CCCGAGGCCCGTGTACTGCAGGGAGTCCTAGTG | AGGTACAGCCCCGTGGTGGAGGCCGGC    c.2880
 P  E  A  R  V  L  Q  G  V  L  V   | R  Y  S  P  V  V  E  A  G      p.960

          .         .         .         .         .         .       g.27693
 TCGGACATGGTCTTCCGGTGGACCATCAACGACAAGCAGTCCCTGACCTTCCAGAACGTG       c.2940
 S  D  M  V  F  R  W  T  I  N  D  K  Q  S  L  T  F  Q  N  V         p.980

          .         .         .         .      | 13  .         .    g.27950
 GTCTTCAATGTCATTTATCAGAGCGCGGCGGTCTTCAAGCTCTCA | CTGACGGCCTCCAAC    c.3000
 V  F  N  V  I  Y  Q  S  A  A  V  F  K  L  S   | L  T  A  S  N      p.1000

          .         .         .         .         .         .       g.28010
 CACGTGAGCAACGTCACCGTGAACTACAACGTAACCGTGGAGCGGATGAACAGGATGCAG       c.3060
 H  V  S  N  V  T  V  N  Y  N  V  T  V  E  R  M  N  R  M  Q         p.1020

          .         .         .         .         .         .       g.28070
 GGTCTGCAGGTCTCCACAGTGCCGGCCGTGCTGTCCCCCAATGCCACGCTAGCACTGACG       c.3120
 G  L  Q  V  S  T  V  P  A  V  L  S  P  N  A  T  L  A  L  T         p.1040

          .         .         .         .  | 14      .         .    g.28444
 GCGGGCGTGCTGGTGGACTCGGCCGTGGAGGTGGCCTTCCT | GTGGACCTTTGGGGATGGG    c.3180
 A  G  V  L  V  D  S  A  V  E  V  A  F  L  |  W  T  F  G  D  G      p.1060

          .         .         .         .         .         .       g.28504
 GAGCAGGCCCTCCACCAGTTCCAGCCTCCGTACAACGAGTCCTTCCCGGTTCCAGACCCC       c.3240
 E  Q  A  L  H  Q  F  Q  P  P  Y  N  E  S  F  P  V  P  D  P         p.1080

          .         .         .         .         .      | 15  .    g.29032
 TCGGTGGCCCAGGTGCTGGTGGAGCACAATGTCATGCACACCTACGCTGCCCCAG | GTGAG    c.3300
 S  V  A  Q  V  L  V  E  H  N  V  M  H  T  Y  A  A  P  G |   E      p.1100

          .         .         .         .         .         .       g.29092
 TACCTCCTGACCGTGCTGGCATCTAATGCCTTCGAGAACCTGACGCAGCAGGTGCCTGTG       c.3360
 Y  L  L  T  V  L  A  S  N  A  F  E  N  L  T  Q  Q  V  P  V         p.1120

          .         .         .         .         .         .       g.29152
 AGCGTGCGCGCCTCCCTGCCCTCCGTGGCTGTGGGTGTGAGTGACGGCGTCCTGGTGGCC       c.3420
 S  V  R  A  S  L  P  S  V  A  V  G  V  S  D  G  V  L  V  A         p.1140

          .         .         .         .         .         .       g.29212
 GGCCGGCCCGTCACCTTCTACCCGCACCCGCTGCCCTCGCCTGGGGGTGTTCTTTACACG       c.3480
 G  R  P  V  T  F  Y  P  H  P  L  P  S  P  G  G  V  L  Y  T         p.1160

          .         .         .         .         .         .       g.29272
 TGGGACTTCGGGGACGGCTCCCCTGTCCTGACCCAGAGCCAGCCGGCTGCCAACCACACC       c.3540
 W  D  F  G  D  G  S  P  V  L  T  Q  S  Q  P  A  A  N  H  T         p.1180

          .         .         .         .         .         .       g.29332
 TATGCCTCGAGGGGCACCTACCACGTGCGCCTGGAGGTCAACAACACGGTGAGCGGTGCG       c.3600
 Y  A  S  R  G  T  Y  H  V  R  L  E  V  N  N  T  V  S  G  A         p.1200

          .         .         .         .         .         .       g.29392
 GCGGCCCAGGCGGATGTGCGCGTCTTTGAGGAGCTCCGCGGACTCAGCGTGGACATGAGC       c.3660
 A  A  Q  A  D  V  R  V  F  E  E  L  R  G  L  S  V  D  M  S         p.1220

          .         .         .         .         .         .       g.29452
 CTGGCCGTGGAGCAGGGCGCCCCCGTGGTGGTCAGCGCCGCGGTGCAGACGGGCGACAAC       c.3720
 L  A  V  E  Q  G  A  P  V  V  V  S  A  A  V  Q  T  G  D  N         p.1240

          .         .         .         .         .         .       g.29512
 ATCACGTGGACCTTCGACATGGGGGACGGCACCGTGCTGTCGGGCCCGGAGGCAACAGTG       c.3780
 I  T  W  T  F  D  M  G  D  G  T  V  L  S  G  P  E  A  T  V         p.1260

          .         .         .         .         .         .       g.29572
 GAGCATGTGTACCTGCGGGCACAGAACTGCACAGTGACCGTGGGTGCGGCCAGCCCCGCC       c.3840
 E  H  V  Y  L  R  A  Q  N  C  T  V  T  V  G  A  A  S  P  A         p.1280

          .         .         .         .         .         .       g.29632
 GGCCACCTGGCCCGGAGCCTGCACGTGCTGGTCTTCGTCCTGGAGGTGCTGCGCGTTGAA       c.3900
 G  H  L  A  R  S  L  H  V  L  V  F  V  L  E  V  L  R  V  E         p.1300

          .         .         .         .         .         .       g.29692
 CCCGCCGCCTGCATCCCCACGCAGCCTGACGCGCGGCTCACGGCCTACGTCACCGGGAAC       c.3960
 P  A  A  C  I  P  T  Q  P  D  A  R  L  T  A  Y  V  T  G  N         p.1320

          .         .         .         .         .         .       g.29752
 CCGGCCCACTACCTCTTCGACTGGACCTTCGGGGATGGCTCCTCCAACACGACCGTGCGG       c.4020
 P  A  H  Y  L  F  D  W  T  F  G  D  G  S  S  N  T  T  V  R         p.1340

          .         .         .         .         .         .       g.29812
 GGGTGCCCGACGGTGACACACAACTTCACGCGGAGCGGCACGTTCCCCCTGGCGCTGGTG       c.4080
 G  C  P  T  V  T  H  N  F  T  R  S  G  T  F  P  L  A  L  V         p.1360

          .         .         .         .         .         .       g.29872
 CTGTCCAGCCGCGTGAACAGGGCGCATTACTTCACCAGCATCTGCGTGGAGCCAGAGGTG       c.4140
 L  S  S  R  V  N  R  A  H  Y  F  T  S  I  C  V  E  P  E  V         p.1380

          .         .         .         .         .         .       g.29932
 GGCAACGTCACCCTGCAGCCAGAGAGGCAGTTTGTGCAGCTCGGGGACGAGGCCTGGCTG       c.4200
 G  N  V  T  L  Q  P  E  R  Q  F  V  Q  L  G  D  E  A  W  L         p.1400

          .         .         .         .         .         .       g.29992
 GTGGCATGTGCCTGGCCCCCGTTCCCCTACCGCTACACCTGGGACTTTGGCACCGAGGAA       c.4260
 V  A  C  A  W  P  P  F  P  Y  R  Y  T  W  D  F  G  T  E  E         p.1420

          .         .         .         .         .         .       g.30052
 GCCGCCCCCACCCGTGCCAGGGGCCCTGAGGTGACGTTCATCTACCGAGACCCAGGCTCC       c.4320
 A  A  P  T  R  A  R  G  P  E  V  T  F  I  Y  R  D  P  G  S         p.1440

          .         .         .         .         .         .       g.30112
 TATCTTGTGACAGTCACCGCGTCCAACAACATCTCTGCTGCCAATGACTCAGCCCTGGTG       c.4380
 Y  L  V  T  V  T  A  S  N  N  I  S  A  A  N  D  S  A  L  V         p.1460

          .         .         .         .         .         .       g.30172
 GAGGTGCAGGAGCCCGTGCTGGTCACCAGCATCAAGGTCAATGGCTCCCTTGGGCTGGAG       c.4440
 E  V  Q  E  P  V  L  V  T  S  I  K  V  N  G  S  L  G  L  E         p.1480

          .         .         .         .         .         .       g.30232
 CTGCAGCAGCCGTACCTGTTCTCTGCTGTGGGCCGTGGGCGCCCCGCCAGCTACCTGTGG       c.4500
 L  Q  Q  P  Y  L  F  S  A  V  G  R  G  R  P  A  S  Y  L  W         p.1500

          .         .         .         .         .         .       g.30292
 GATCTGGGGGACGGTGGGTGGCTCGAGGGTCCGGAGGTCACCCACGCTTACAACAGCACA       c.4560
 D  L  G  D  G  G  W  L  E  G  P  E  V  T  H  A  Y  N  S  T         p.1520

          .         .         .         .         .         .       g.30352
 GGTGACTTCACCGTTAGGGTGGCCGGCTGGAATGAGGTGAGCCGCAGCGAGGCCTGGCTC       c.4620
 G  D  F  T  V  R  V  A  G  W  N  E  V  S  R  S  E  A  W  L         p.1540

          .         .         .         .         .         .       g.30412
 AATGTGACGGTGAAGCGGCGCGTGCGGGGGCTCGTCGTCAATGCAAGCCGCACGGTGGTG       c.4680
 N  V  T  V  K  R  R  V  R  G  L  V  V  N  A  S  R  T  V  V         p.1560

          .         .         .         .         .         .       g.30472
 CCCCTGAATGGGAGCGTGAGCTTCAGCACGTCGCTGGAGGCCGGCAGTGATGTGCGCTAT       c.4740
 P  L  N  G  S  V  S  F  S  T  S  L  E  A  G  S  D  V  R  Y         p.1580

          .         .         .         .         .         .       g.30532
 TCCTGGGTGCTCTGTGACCGCTGCACGCCCATCCCTGGGGGTCCTACCATCTCTTACACC       c.4800
 S  W  V  L  C  D  R  C  T  P  I  P  G  G  P  T  I  S  Y  T         p.1600

          .         .         .         .         .         .       g.30592
 TTCCGCTCCGTGGGCACCTTCAATATCATCGTCACGGCTGAGAACGAGGTGGGCTCCGCC       c.4860
 F  R  S  V  G  T  F  N  I  I  V  T  A  E  N  E  V  G  S  A         p.1620

          .         .         .         .         .         .       g.30652
 CAGGACAGCATCTTCGTCTATGTCCTGCAGCTCATAGAGGGGCTGCAGGTGGTGGGCGGT       c.4920
 Q  D  S  I  F  V  Y  V  L  Q  L  I  E  G  L  Q  V  V  G  G         p.1640

          .         .         .         .         .         .       g.30712
 GGCCGCTACTTCCCCACCAACCACACGGTACAGCTGCAGGCCGTGGTTAGGGATGGCACC       c.4980
 G  R  Y  F  P  T  N  H  T  V  Q  L  Q  A  V  V  R  D  G  T         p.1660

          .         .         .         .         .         .       g.30772
 AACGTCTCCTACAGCTGGACTGCCTGGAGGGACAGGGGCCCGGCCCTGGCCGGCAGCGGC       c.5040
 N  V  S  Y  S  W  T  A  W  R  D  R  G  P  A  L  A  G  S  G         p.1680

          .         .         .         .         .         .       g.30832
 AAAGGCTTCTCGCTCACCGTGCTCGAGGCCGGCACCTACCATGTGCAGCTGCGGGCCACC       c.5100
 K  G  F  S  L  T  V  L  E  A  G  T  Y  H  V  Q  L  R  A  T         p.1700

          .         .         .         .         .         .       g.30892
 AACATGCTGGGCAGCGCCTGGGCCGACTGCACCATGGACTTCGTGGAGCCTGTGGGGTGG       c.5160
 N  M  L  G  S  A  W  A  D  C  T  M  D  F  V  E  P  V  G  W         p.1720

          .         .         .         .         .         .       g.30952
 CTGATGGTGGCCGCCTCCCCGAACCCAGCTGCCGTCAACACAAGCGTCACCCTCAGTGCC       c.5220
 L  M  V  A  A  S  P  N  P  A  A  V  N  T  S  V  T  L  S  A         p.1740

          .         .         .         .         .         .       g.31012
 GAGCTGGCTGGTGGCAGTGGTGTCGTATACACTTGGTCCTTGGAGGAGGGGCTGAGCTGG       c.5280
 E  L  A  G  G  S  G  V  V  Y  T  W  S  L  E  E  G  L  S  W         p.1760

          .         .         .         .         .         .       g.31072
 GAGACCTCCGAGCCATTTACCACCCATAGCTTCCCCACACCCGGCCTGCACTTGGTCACC       c.5340
 E  T  S  E  P  F  T  T  H  S  F  P  T  P  G  L  H  L  V  T         p.1780

          .         .         .         .         .         .       g.31132
 ATGACGGCAGGGAACCCGCTGGGCTCAGCCAACGCCACCGTGGAAGTGGATGTGCAGGTG       c.5400
 M  T  A  G  N  P  L  G  S  A  N  A  T  V  E  V  D  V  Q  V         p.1800

          .         .         .         .         .         .       g.31192
 CCTGTGAGTGGCCTCAGCATCAGGGCCAGCGAGCCCGGAGGCAGCTTCGTGGCGGCCGGG       c.5460
 P  V  S  G  L  S  I  R  A  S  E  P  G  G  S  F  V  A  A  G         p.1820

          .         .         .         .         .         .       g.31252
 TCCTCTGTGCCCTTTTGGGGGCAGCTGGCCACGGGCACCAATGTGAGCTGGTGCTGGGCT       c.5520
 S  S  V  P  F  W  G  Q  L  A  T  G  T  N  V  S  W  C  W  A         p.1840

          .         .         .         .         .         .       g.31312
 GTGCCCGGCGGCAGCAGCAAGCGTGGCCCTCATGTCACCATGGTCTTCCCGGATGCTGGC       c.5580
 V  P  G  G  S  S  K  R  G  P  H  V  T  M  V  F  P  D  A  G         p.1860

          .         .         .         .         .         .       g.31372
 ACCTTCTCCATCCGGCTCAATGCCTCCAACGCAGTCAGCTGGGTCTCAGCCACGTACAAC       c.5640
 T  F  S  I  R  L  N  A  S  N  A  V  S  W  V  S  A  T  Y  N         p.1880

          .         .         .         .         .         .       g.31432
 CTCACGGCGGAGGAGCCCATCGTGGGCCTGGTGCTGTGGGCCAGCAGCAAGGTGGTGGCG       c.5700
 L  T  A  E  E  P  I  V  G  L  V  L  W  A  S  S  K  V  V  A         p.1900

          .         .         .         .         .         .       g.31492
 CCCGGGCAGCTGGTCCATTTTCAGATCCTGCTGGCTGCCGGCTCAGCTGTCACCTTCCGC       c.5760
 P  G  Q  L  V  H  F  Q  I  L  L  A  A  G  S  A  V  T  F  R         p.1920

          .         .         .         .         .         .       g.31552
 CTGCAGGTCGGCGGGGCCAACCCCGAGGTGCTCCCCGGGCCCCGTTTCTCCCACAGCTTC       c.5820
 L  Q  V  G  G  A  N  P  E  V  L  P  G  P  R  F  S  H  S  F         p.1940

          .         .         .         .         .         .       g.31612
 CCCCGCGTCGGAGACCACGTGGTGAGCGTGCGGGGCAAAAACCACGTGAGCTGGGCCCAG       c.5880
 P  R  V  G  D  H  V  V  S  V  R  G  K  N  H  V  S  W  A  Q         p.1960

          .         .         .         .         .         .       g.31672
 GCGCAGGTGCGCATCGTGGTGCTGGAGGCCGTGAGTGGGCTGCAGGTGCCCAACTGCTGC       c.5940
 A  Q  V  R  I  V  V  L  E  A  V  S  G  L  Q  V  P  N  C  C         p.1980

          .         .         .         .         .         .       g.31732
 GAGCCTGGCATCGCCACGGGCACTGAGAGGAACTTCACAGCCCGCGTGCAGCGCGGCTCT       c.6000
 E  P  G  I  A  T  G  T  E  R  N  F  T  A  R  V  Q  R  G  S         p.2000

          .         .         .         .         .         .       g.31792
 CGGGTCGCCTACGCCTGGTACTTCTCGCTGCAGAAGGTCCAGGGCGACTCGCTGGTCATC       c.6060
 R  V  A  Y  A  W  Y  F  S  L  Q  K  V  Q  G  D  S  L  V  I         p.2020

          .         .         .         .         .         .       g.31852
 CTGTCGGGCCGCGACGTCACCTACACGCCCGTGGCCGCGGGGCTGTTGGAGATCCAGGTG       c.6120
 L  S  G  R  D  V  T  Y  T  P  V  A  A  G  L  L  E  I  Q  V         p.2040

          .         .         .         .         .         .       g.31912
 CGCGCCTTCAACGCCCTGGGCAGTGAGAACCGCACGCTGGTGCTGGAGGTTCAGGACGCC       c.6180
 R  A  F  N  A  L  G  S  E  N  R  T  L  V  L  E  V  Q  D  A         p.2060

          .         .         .         .         .         .       g.31972
 GTCCAGTATGTGGCCCTGCAGAGCGGCCCCTGCTTCACCAACCGCTCGGCGCAGTTTGAG       c.6240
 V  Q  Y  V  A  L  Q  S  G  P  C  F  T  N  R  S  A  Q  F  E         p.2080

          .         .         .         .         .         .       g.32032
 GCCGCCACCAGCCCCAGCCCCCGGCGTGTGGCCTACCACTGGGACTTTGGGGATGGGTCG       c.6300
 A  A  T  S  P  S  P  R  R  V  A  Y  H  W  D  F  G  D  G  S         p.2100

          .         .         .         .         .         .       g.32092
 CCAGGGCAGGACACAGATGAGCCCAGGGCCGAGCACTCCTACCTGAGGCCTGGGGACTAC       c.6360
 P  G  Q  D  T  D  E  P  R  A  E  H  S  Y  L  R  P  G  D  Y         p.2120

          .         .         .         .         .         .       g.32152
 CGCGTGCAGGTGAACGCCTCCAACCTGGTGAGCTTCTTCGTGGCGCAGGCCACGGTGACC       c.6420
 R  V  Q  V  N  A  S  N  L  V  S  F  F  V  A  Q  A  T  V  T         p.2140

          .         .         .         .         .         .       g.32212
 GTCCAGGTGCTGGCCTGCCGGGAGCCGGAGGTGGACGTGGTCCTGCCCCTGCAGGTGCTG       c.6480
 V  Q  V  L  A  C  R  E  P  E  V  D  V  V  L  P  L  Q  V  L         p.2160

          .         .         .         .         .         .       g.32272
 ATGCGGCGATCACAGCGCAACTACTTGGAGGCCCACGTTGACCTGCGCGACTGCGTCACC       c.6540
 M  R  R  S  Q  R  N  Y  L  E  A  H  V  D  L  R  D  C  V  T         p.2180

          .         .         .         .         .         .       g.32332
 TACCAGACTGAGTACCGCTGGGAGGTGTATCGCACCGCCAGCTGCCAGCGGCCGGGGCGC       c.6600
 Y  Q  T  E  Y  R  W  E  V  Y  R  T  A  S  C  Q  R  P  G  R         p.2200

          .         .         .         .         .         .       g.32392
 CCAGCGCGTGTGGCCCTGCCCGGCGTGGACGTGAGCCGGCCTCGGCTGGTGCTGCCGCGG       c.6660
 P  A  R  V  A  L  P  G  V  D  V  S  R  P  R  L  V  L  P  R         p.2220

          .         .         .         .         .         .       g.32452
 CTGGCGCTGCCTGTGGGGCACTACTGCTTTGTGTTTGTCGTGTCATTTGGGGACACGCCA       c.6720
 L  A  L  P  V  G  H  Y  C  F  V  F  V  V  S  F  G  D  T  P         p.2240

          .         .         .         .         .         .       g.32512
 CTGACACAGAGCATCCAGGCCAATGTGACGGTGGCCCCCGAGCGCCTGGTGCCCATCATT       c.6780
 L  T  Q  S  I  Q  A  N  V  T  V  A  P  E  R  L  V  P  I  I         p.2260

          .         .         .         .         .         .       g.32572
 GAGGGTGGCTCATACCGCGTGTGGTCAGACACACGGGACCTGGTGCTGGATGGGAGCGAG       c.6840
 E  G  G  S  Y  R  V  W  S  D  T  R  D  L  V  L  D  G  S  E         p.2280

          .         .         .         .         .         .       g.32632
 TCCTACGACCCCAACCTGGAGGACGGCGACCAGACGCCGCTCAGTTTCCACTGGGCCTGT       c.6900
 S  Y  D  P  N  L  E  D  G  D  Q  T  P  L  S  F  H  W  A  C         p.2300

          .      | 16  .         .         .         .         .    g.32911
 GTGGCTTCGACACAG | AGGGAGGCTGGCGGGTGTGCGCTGAACTTTGGGCCCCGCGGGAGC    c.6960
 V  A  S  T  Q   | R  E  A  G  G  C  A  L  N  F  G  P  R  G  S      p.2320

          .         .         .         .         .         .       g.32971
 AGCACGGTCACCATTCCACGGGAGCGGCTGGCGGCTGGCGTGGAGTACACCTTCAGCCTG       c.7020
 S  T  V  T  I  P  R  E  R  L  A  A  G  V  E  Y  T  F  S  L         p.2340

          .         .         .         .      | 17  .         .    g.33965
 ACCGTGTGGAAGGCCGGCCGCAAGGAGGAGGCCACCAACCAGACG | GTGCTGATCCGGAGT    c.7080
 T  V  W  K  A  G  R  K  E  E  A  T  N  Q  T   | V  L  I  R  S      p.2360

          .         .         .         .         .         .       g.34025
 GGCCGGGTGCCCATTGTGTCCTTGGAGTGTGTGTCCTGCAAGGCACAGGCCGTGTACGAA       c.7140
 G  R  V  P  I  V  S  L  E  C  V  S  C  K  A  Q  A  V  Y  E         p.2380

          .         .         .         .         .         .       g.34085
 GTGAGCCGCAGCTCCTACGTGTACTTGGAGGGCCGCTGCCTCAATTGCAGCAGCGGCTCC       c.7200
 V  S  R  S  S  Y  V  Y  L  E  G  R  C  L  N  C  S  S  G  S         p.2400

           | 18        .         .         .         .         .    g.34272
 AAGCGAGGG | CGGTGGGCTGCACGTACGTTCAGCAACAAGACGCTGGTGCTGGATGAGACC    c.7260
 K  R  G   | R  W  A  A  R  T  F  S  N  K  T  L  V  L  D  E  T      p.2420

          .         .         .         .         .         .       g.34332
 ACCACATCCACGGGCAGTGCAGGCATGCGACTGGTGCTGCGGCGGGGCGTGCTGCGGGAC       c.7320
 T  T  S  T  G  S  A  G  M  R  L  V  L  R  R  G  V  L  R  D         p.2440

          .         .         .         .         .         .       g.34392
 GGCGAGGGATACACCTTCACGCTCACGGTGCTGGGCCGCTCTGGCGAGGAGGAGGGCTGC       c.7380
 G  E  G  Y  T  F  T  L  T  V  L  G  R  S  G  E  E  E  G  C         p.2460

          .         .         .         .         .         .       g.34452
 GCCTCCATCCGCCTGTCCCCCAACCGCCCGCCGCTGGGGGGCTCTTGCCGCCTCTTCCCA       c.7440
 A  S  I  R  L  S  P  N  R  P  P  L  G  G  S  C  R  L  F  P         p.2480

          .         .         .         .          | 19        .    g.34605
 CTGGGCGCTGTGCACGCCCTCACCACCAAGGTGCACTTCGAATGCACGG | GCTGGCATGAC    c.7500
 L  G  A  V  H  A  L  T  T  K  V  H  F  E  C  T  G |   W  H  D      p.2500

          .         .         .         .         .         .       g.34665
 GCGGAGGATGCTGGCGCCCCGCTGGTGTACGCCCTGCTGCTGCGGCGCTGTCGCCAGGGC       c.7560
 A  E  D  A  G  A  P  L  V  Y  A  L  L  L  R  R  C  R  Q  G         p.2520

          .         .         .         .         .         .       g.34725
 CACTGCGAGGAGTTCTGTGTCTACAAGGGCAGCCTCTCCAGCTACGGAGCCGTGCTGCCC       c.7620
 H  C  E  E  F  C  V  Y  K  G  S  L  S  S  Y  G  A  V  L  P         p.2540

          .         .         .         .         .         .       g.34785
 CCGGGTTTCAGGCCACACTTCGAGGTGGGCCTGGCCGTGGTGGTGCAGGACCAGCTGGGA       c.7680
 P  G  F  R  P  H  F  E  V  G  L  A  V  V  V  Q  D  Q  L  G         p.2560

          .         .    | 20    .         .         .         .    g.34911
 GCCGCTGTGGTCGCCCTCAACAG | GTCTTTGGCCATCACCCTCCCAGAGCCCAACGGCAGC    c.7740
 A  A  V  V  A  L  N  R  |  S  L  A  I  T  L  P  E  P  N  G  S      p.2580

          .         .         .         .         .         .       g.34971
 GCAACGGGGCTCACAGTCTGGCTGCACGGGCTCACCGCTAGTGTGCTCCCAGGGCTGCTG       c.7800
 A  T  G  L  T  V  W  L  H  G  L  T  A  S  V  L  P  G  L  L         p.2600

          .         .         .         .         .         .       g.35031
 CGGCAGGCCGATCCCCAGCACGTCATCGAGTACTCGTTGGCCCTGGTCACCGTGCTGAAC       c.7860
 R  Q  A  D  P  Q  H  V  I  E  Y  S  L  A  L  V  T  V  L  N         p.2620

     | 21    .         .         .         .         .         .    g.35481
 GAG | TACGAGCGGGCCCTGGACGTGGCGGCAGAGCCCAAGCACGAGCGGCAGCACCGAGCC    c.7920
 E   | Y  E  R  A  L  D  V  A  A  E  P  K  H  E  R  Q  H  R  A      p.2640

          .         .         .         .         .         .       g.35541
 CAGATACGCAAGAACATCACGGAGACTCTGGTGTCCCTGAGGGTCCACACTGTGGATGAC       c.7980
 Q  I  R  K  N  I  T  E  T  L  V  S  L  R  V  H  T  V  D  D         p.2660

          .         .         .       | 22 .         .         .    g.36280
 ATCCAGCAGATCGCTGCTGCGCTGGCCCAGTGCATG | GGGCCCAGCAGGGAGCTCGTATGC    c.8040
 I  Q  Q  I  A  A  A  L  A  Q  C  M   | G  P  S  R  E  L  V  C      p.2680

          .         .         .         .         .         .       g.36340
 CGCTCGTGCCTGAAGCAGACGCTGCACAAGCTGGAGGCCATGATGCTCATCCTGCAGGCA       c.8100
 R  S  C  L  K  Q  T  L  H  K  L  E  A  M  M  L  I  L  Q  A         p.2700

          .         .         .         .         .         .       g.36400
 GAGACCACCGCGGGCACCGTGACGCCCACCGCCATCGGAGACAGCATCCTCAACATCACA       c.8160
 E  T  T  A  G  T  V  T  P  T  A  I  G  D  S  I  L  N  I  T         p.2720

   | 23      .         .         .         .         .         .    g.37062
 G | GAGACCTCATCCACCTGGCCAGCTCGGACGTGCGGGCACCACAGCCCTCAGAGCTGGGA    c.8220
 G |   D  L  I  H  L  A  S  S  D  V  R  A  P  Q  P  S  E  L  G      p.2740

          .         .         .         .         .         .       g.37122
 GCCGAGTCACCATCTCGGATGGTGGCGTCCCAGGCCTACAACCTGACCTCTGCCCTCATG       c.8280
 A  E  S  P  S  R  M  V  A  S  Q  A  Y  N  L  T  S  A  L  M         p.2760

          .         .         .         .         .         .       g.37182
 CGCATCCTCATGCGCTCCCGCGTGCTCAACGAGGAGCCCCTGACGCTGGCGGGCGAGGAG       c.8340
 R  I  L  M  R  S  R  V  L  N  E  E  P  L  T  L  A  G  E  E         p.2780

          .         .         .         .         .         .       g.37242
 ATCGTGGCCCAGGGCAAGCGCTCGGACCCGCGGAGCCTGCTGTGCTATGGCGGCGCCCCA       c.8400
 I  V  A  Q  G  K  R  S  D  P  R  S  L  L  C  Y  G  G  A  P         p.2800

          .         .         .         .         .         .       g.37302
 GGGCCTGGCTGCCACTTCTCCATCCCCGAGGCTTTCAGCGGGGCCCTGGCCAACCTCAGT       c.8460
 G  P  G  C  H  F  S  I  P  E  A  F  S  G  A  L  A  N  L  S         p.2820

          .         .         .         .         .         .       g.37362
 GACGTGGTGCAGCTCATCTTTCTGGTGGACTCCAATCCCTTTCCCTTTGGCTATATCAGC       c.8520
 D  V  V  Q  L  I  F  L  V  D  S  N  P  F  P  F  G  Y  I  S         p.2840

          .         .         .         .         .         .       g.37422
 AACTACACCGTCTCCACCAAGGTGGCCTCGATGGCATTCCAGACACAGGCCGGCGCCCAG       c.8580
 N  Y  T  V  S  T  K  V  A  S  M  A  F  Q  T  Q  A  G  A  Q         p.2860

          .         .         .         .         .         .       g.37482
 ATCCCCATCGAGCGGCTGGCCTCAGAGCGCGCCATCACCGTGAAGGTGCCCAACAACTCG       c.8640
 I  P  I  E  R  L  A  S  E  R  A  I  T  V  K  V  P  N  N  S         p.2880

          .         .         .         .         .         .       g.37542
 GACTGGGCTGCCCGGGGCCACCGCAGCTCCGCCAACTCCGCCAACTCCGTTGTGGTCCAG       c.8700
 D  W  A  A  R  G  H  R  S  S  A  N  S  A  N  S  V  V  V  Q         p.2900

          .         .         .         .         .         .       g.37602
 CCCCAGGCCTCCGTCGGTGCTGTGGTCACCCTGGACAGCAGCAACCCTGCGGCCGGGCTG       c.8760
 P  Q  A  S  V  G  A  V  V  T  L  D  S  S  N  P  A  A  G  L         p.2920

          .         .         .  | 24      .         .         .    g.37957
 CATCTGCAGCTCAACTATACGCTGCTGGACG | GCCACTACCTGTCTGAGGAACCTGAGCCC    c.8820
 H  L  Q  L  N  Y  T  L  L  D  G |   H  Y  L  S  E  E  P  E  P      p.2940

          .         .         .         .         .         .       g.38017
 TACCTGGCAGTCTACCTACACTCGGAGCCCCGGCCCAATGAGCACAACTGCTCGGCTAGC       c.8880
 Y  L  A  V  Y  L  H  S  E  P  R  P  N  E  H  N  C  S  A  S         p.2960

          .         .         .         .         .         .       g.38077
 AGGAGGATCCGCCCAGAGTCACTCCAGGGTGCTGACCACCGGCCCTACACCTTCTTCATT       c.8940
 R  R  I  R  P  E  S  L  Q  G  A  D  H  R  P  Y  T  F  F  I         p.2980

          | 25         .         .         .         .         .    g.38317
 TCCCCGGG | GAGCAGAGACCCAGCGGGGAGTTACCATCTGAACCTCTCCAGCCACTTCCGC    c.9000
 S  P  G  |  S  R  D  P  A  G  S  Y  H  L  N  L  S  S  H  F  R      p.3000

          .         .         .         .         .         .       g.38377
 TGGTCGGCGCTGCAGGTGTCCGTGGGCCTGTACACGTCCCTGTGCCAGTACTTCAGCGAG       c.9060
 W  S  A  L  Q  V  S  V  G  L  Y  T  S  L  C  Q  Y  F  S  E         p.3020

          .         .         .         .         .         .       g.38437
 GAGGACATGGTGTGGCGGACAGAGGGGCTGCTGCCCCTGGAGGAGACCTCGCCCCGCCAG       c.9120
 E  D  M  V  W  R  T  E  G  L  L  P  L  E  E  T  S  P  R  Q         p.3040

          .         .         .         .         .         .       g.38497
 GCCGTCTGCCTCACCCGCCACCTCACCGCCTTCGGCGCCAGCCTCTTCGTGCCCCCAAGC       c.9180
 A  V  C  L  T  R  H  L  T  A  F  G  A  S  L  F  V  P  P  S         p.3060

          .         .  | 26      .         .         .         .    g.38681
 CATGTCCGCTTTGTGTTTCCT | GAGCCGACAGCGGATGTAAACTACATCGTCATGCTGACA    c.9240
 H  V  R  F  V  F  P   | E  P  T  A  D  V  N  Y  I  V  M  L  T      p.3080

          .         .         .         .         .         .       g.38741
 TGTGCTGTGTGCCTGGTGACCTACATGGTCATGGCCGCCATCCTGCACAAGCTGGACCAG       c.9300
 C  A  V  C  L  V  T  Y  M  V  M  A  A  I  L  H  K  L  D  Q         p.3100

          .         .         .         .         .         .       g.38801
 TTGGATGCCAGCCGGGGCCGCGCCATCCCTTTCTGTGGGCAGCGGGGCCGCTTCAAGTAC       c.9360
 L  D  A  S  R  G  R  A  I  P  F  C  G  Q  R  G  R  F  K  Y         p.3120

          .         .         .        | 27.         .         .    g.40355
 GAGATCCTCGTCAAGACAGGCTGGGGCCGGGGCTCAG | GTACCACGGCCCACGTGGGCATC    c.9420
 E  I  L  V  K  T  G  W  G  R  G  S  G |   T  T  A  H  V  G  I      p.3140

          .         .         .         .         .         .       g.40415
 ATGCTGTATGGGGTGGACAGCCGGAGCGGCCACCGGCACCTGGACGGCGACAGAGCCTTC       c.9480
 M  L  Y  G  V  D  S  R  S  G  H  R  H  L  D  G  D  R  A  F         p.3160

          .         .         .         .         .         .       g.40475
 CACCGCAACAGCCTGGACATCTTCCGGATCGCCACCCCGCACAGCCTGGGTAGCGTGTGG       c.9540
 H  R  N  S  L  D  I  F  R  I  A  T  P  H  S  L  G  S  V  W         p.3180

          .         .         | 28         .         .         .    g.40621
 AAGATCCGAGTGTGGCACGACAACAAAG | GGCTCAGCCCTGCCTGGTTCCTGCAGCACGTC    c.9600
 K  I  R  V  W  H  D  N  K  G |   L  S  P  A  W  F  L  Q  H  V      p.3200

          .         .         .         .         .         .       g.40681
 ATCGTCAGGGACCTGCAGACGGCACGCAGCGCCTTCTTCCTGGTCAATGACTGGCTTTCG       c.9660
 I  V  R  D  L  Q  T  A  R  S  A  F  F  L  V  N  D  W  L  S         p.3220

          .         .         .         .         .   | 29     .    g.40835
 GTGGAGACGGAGGCCAACGGGGGCCTGGTGGAGAAGGAGGTGCTGGCCGCGA | GCGACGCA    c.9720
 V  E  T  E  A  N  G  G  L  V  E  K  E  V  L  A  A  S |   D  A      p.3240

          .         .         .         .         .         .       g.40895
 GCCCTTTTGCGCTTCCGGCGCCTGCTGGTGGCTGAGCTGCAGCGTGGCTTCTTTGACAAG       c.9780
 A  L  L  R  F  R  R  L  L  V  A  E  L  Q  R  G  F  F  D  K         p.3260

          .         .         .         .         .         .       g.40955
 CACATCTGGCTCTCCATATGGGACCGGCCGCCTCGTAGCCGTTTCACTCGCATCCAGAGG       c.9840
 H  I  W  L  S  I  W  D  R  P  P  R  S  R  F  T  R  I  Q  R         p.3280

          .         .         .         .         .         .       g.41015
 GCCACCTGCTGCGTTCTCCTCATCTGCCTCTTCCTGGGCGCCAACGCCGTGTGGTACGGG       c.9900
 A  T  C  C  V  L  L  I  C  L  F  L  G  A  N  A  V  W  Y  G         p.3300

          .         .    | 30    .         .         .         .    g.41165
 GCTGTTGGCGACTCTGCCTACAG | CACGGGGCATGTGTCCAGGCTGAGCCCGCTGAGCGTC    c.9960
 A  V  G  D  S  A  Y  S  |  T  G  H  V  S  R  L  S  P  L  S  V      p.3320

          .         .         .         .         .         .       g.41225
 GACACAGTCGCTGTTGGCCTGGTGTCCAGCGTGGTTGTCTATCCCGTCTACCTGGCCATC       c.10020
 D  T  V  A  V  G  L  V  S  S  V  V  V  Y  P  V  Y  L  A  I         p.3340

          .         .         . | 31       .         .         .    g.42944
 CTTTTTCTCTTCCGGATGTCCCGGAGCAAG | GTGGCTGGGAGCCCGAGCCCCACACCTGCC    c.10080
 L  F  L  F  R  M  S  R  S  K   | V  A  G  S  P  S  P  T  P  A      p.3360

          .         .         .         .         .         .       g.43004
 GGGCAGCAGGTGCTGGACATCGACAGCTGCCTGGACTCGTCCGTGCTGGACAGCTCCTTC       c.10140
 G  Q  Q  V  L  D  I  D  S  C  L  D  S  S  V  L  D  S  S  F         p.3380

          .         .        | 32.         .         .         .    g.43151
 CTCACGTTCTCAGGCCTCCACGCTGAG | CAGGCCTTTGTTGGACAGATGAAGAGTGACTTG    c.10200
 L  T  F  S  G  L  H  A  E   | Q  A  F  V  G  Q  M  K  S  D  L      p.3400

          .         . | 33       .         .         .         .    g.43435
 TTTCTGGATGATTCTAAGAG | TCTGGTGTGCTGGCCCTCCGGCGAGGGAACGCTCAGTTGG    c.10260
 F  L  D  D  S  K  S  |  L  V  C  W  P  S  G  E  G  T  L  S  W      p.3420

          .         .         .         .         .         .       g.43495
 CCGGACCTGCTCAGTGACCCGTCCATTGTGGGTAGCAATCTGCGGCAGCTGGCACGGGGC       c.10320
 P  D  L  L  S  D  P  S  I  V  G  S  N  L  R  Q  L  A  R  G         p.3440

          .         .         .         .         .         .       g.43555
 CAGGCGGGCCATGGGCTGGGCCCAGAGGAGGACGGCTTCTCCCTGGCCAGCCCCTACTCG       c.10380
 Q  A  G  H  G  L  G  P  E  E  D  G  F  S  L  A  S  P  Y  S         p.3460

          .         .      | 34  .         .         .         .    g.43692
 CCTGCCAAATCCTTCTCAGCATCAG | ATGAAGACCTGATCCAGCAGGTCCTTGCCGAGGGG    c.10440
 P  A  K  S  F  S  A  S  D |   E  D  L  I  Q  Q  V  L  A  E  G      p.3480

          .         .         .         .         .          | 35    g.46689
 GTCAGCAGCCCAGCCCCTACCCAAGACACCCACATGGAAACGGACCTGCTCAGCAGCCT | G    c.10500
 V  S  S  P  A  P  T  Q  D  T  H  M  E  T  D  L  L  S  S  L  |      p.3500

          .         .         .         .         .         .       g.46749
 TCCAGCACTCCTGGGGAGAAGACAGAGACGCTGGCGCTGCAGAGGCTGGGGGAGCTGGGG       c.10560
 S  S  T  P  G  E  K  T  E  T  L  A  L  Q  R  L  G  E  L  G         p.3520

          .         .         .         .         .         | 36    g.46887
 CCACCCAGCCCAGGCCTGAACTGGGAACAGCCCCAGGCAGCGAGGCTGTCCAGGACAG | GA    c.10620
 P  P  S  P  G  L  N  W  E  Q  P  Q  A  A  R  L  S  R  T  G |       p.3540

          .         .         .         .         .         .       g.46947
 CTGGTGGAGGGTCTGCGGAAGCGCCTGCTGCCGGCCTGGTGTGCCTCCCTGGCCCACGGG       c.10680
 L  V  E  G  L  R  K  R  L  L  P  A  W  C  A  S  L  A  H  G         p.3560

          .         .         .         .         .         .       g.47007
 CTCAGCCTGCTCCTGGTGGCTGTGGCTGTGGCTGTCTCAGGGTGGGTGGGTGCGAGCTTC       c.10740
 L  S  L  L  L  V  A  V  A  V  A  V  S  G  W  V  G  A  S  F         p.3580

          .         .         .         .         .         .       g.47067
 CCCCCGGGCGTGAGTGTTGCGTGGCTCCTGTCCAGCAGCGCCAGCTTCCTGGCCTCATTC       c.10800
 P  P  G  V  S  V  A  W  L  L  S  S  S  A  S  F  L  A  S  F         p.3600

          .         .  | 37      .         .         .         .    g.47199
 CTCGGCTGGGAGCCACTGAAG | GTCTTGCTGGAAGCCCTGTACTTCTCACTGGTGGCCAAG    c.10860
 L  G  W  E  P  L  K   | V  L  L  E  A  L  Y  F  S  L  V  A  K      p.3620

          .         .         .         .         .         .       g.47259
 CGGCTGCACCCGGATGAAGATGACACCCTGGTAGAGAGCCCGGCTGTGACGCCTGTGAGC       c.10920
 R  L  H  P  D  E  D  D  T  L  V  E  S  P  A  V  T  P  V  S         p.3640

          .         .         .         .         .         .       g.47319
 GCACGTGTGCCCCGCGTACGGCCACCCCACGGCTTTGCACTCTTCCTGGCCAAGGAAGAA       c.10980
 A  R  V  P  R  V  R  P  P  H  G  F  A  L  F  L  A  K  E  E         p.3660

          .         .         .       | 38 .         .         .    g.47829
 GCCCGCAAGGTCAAGAGGCTACATGGCATGCTGCGG | AGCCTCCTGGTGTACATGCTTTTT    c.11040
 A  R  K  V  K  R  L  H  G  M  L  R   | S  L  L  V  Y  M  L  F      p.3680

          .         .         .         .         .         .       g.47889
 CTGCTGGTGACCCTGCTGGCCAGCTATGGGGATGCCTCATGCCATGGGCACGCCTACCGT       c.11100
 L  L  V  T  L  L  A  S  Y  G  D  A  S  C  H  G  H  A  Y  R         p.3700

          .         .         .         .         .       | 39 .    g.48310
 CTGCAAAGCGCCATCAAGCAGGAGCTGCACAGCCGGGCCTTCCTGGCCATCACGCG | GTCT    c.11160
 L  Q  S  A  I  K  Q  E  L  H  S  R  A  F  L  A  I  T  R  |  S      p.3720

          .         .         .         .         .         .       g.48370
 GAGGAGCTCTGGCCATGGATGGCCCACGTGCTGCTGCCCTACGTCCACGGGAACCAGTCC       c.11220
 E  E  L  W  P  W  M  A  H  V  L  L  P  Y  V  H  G  N  Q  S         p.3740

          .         .         .         .          | 40        .    g.48721
 AGCCCAGAGCTGGGGCCCCCACGGCTGCGGCAGGTGCGGCTGCAGGAAG | CACTCTACCCA    c.11280
 S  P  E  L  G  P  P  R  L  R  Q  V  R  L  Q  E  A |   L  Y  P      p.3760

          .         .         .         .         .         .       g.48781
 GACCCTCCCGGCCCCAGGGTCCACACGTGCTCGGCCGCAGGAGGCTTCAGCACCAGCGAT       c.11340
 D  P  P  G  P  R  V  H  T  C  S  A  A  G  G  F  S  T  S  D         p.3780

          .         .         .         .         .         .       g.48841
 TACGACGTTGGCTGGGAGAGTCCTCACAATGGCTCGGGGACGTGGGCCTATTCAGCGCCG       c.11400
 Y  D  V  G  W  E  S  P  H  N  G  S  G  T  W  A  Y  S  A  P         p.3800

          .  | 41      .         .         .         .         .    g.49041
 GATCTGCTGGG | GGCATGGTCCTGGGGCTCCTGTGCCGTGTATGACAGCGGGGGCTACGTG    c.11460
 D  L  L  G  |  A  W  S  W  G  S  C  A  V  Y  D  S  G  G  Y  V      p.3820

          .         .         .         .         .         .       g.49101
 CAGGAGCTGGGCCTGAGCCTGGAGGAGAGCCGCGACCGGCTGCGCTTCCTGCAGCTGCAC       c.11520
 Q  E  L  G  L  S  L  E  E  S  R  D  R  L  R  F  L  Q  L  H         p.3840

          .        | 42.         .         .         .         .    g.49344
 AACTGGCTGGACAACAG | GAGCCGCGCTGTGTTCCTGGAGCTCACGCGCTACAGCCCGGCC    c.11580
 N  W  L  D  N  R  |  S  R  A  V  F  L  E  L  T  R  Y  S  P  A      p.3860

          .         .         .         .         .         .       g.49404
 GTGGGGCTGCACGCCGCCGTCACGCTGCGCCTCGAGTTCCCGGCGGCCGGCCGCGCCCTG       c.11640
 V  G  L  H  A  A  V  T  L  R  L  E  F  P  A  A  G  R  A  L         p.3880

          .         .         .         .         .         .       g.49464
 GCCGCCCTCAGCGTCCGCCCCTTTGCGCTGCGCCGCCTCAGCGCGGGCCTCTCGCTGCCT       c.11700
 A  A  L  S  V  R  P  F  A  L  R  R  L  S  A  G  L  S  L  P         p.3900

          .   | 43     .         .         .         .         .    g.49772
 CTGCTCACCTCG | GTGTGCCTGCTGCTGTTCGCCGTGCACTTCGCCGTGGCCGAGGCCCGT    c.11760
 L  L  T  S   | V  C  L  L  L  F  A  V  H  F  A  V  A  E  A  R      p.3920

          .         .         .         .         .         .       g.49832
 ACTTGGCACAGGGAAGGGCGCTGGCGCGTGCTGCGGCTCGGAGCCTGGGCGCGGTGGCTG       c.11820
 T  W  H  R  E  G  R  W  R  V  L  R  L  G  A  W  A  R  W  L         p.3940

          .         .         .         .         .         .       g.49892
 CTGGTGGCGCTGACGGCGGCCACGGCACTGGTACGCCTCGCCCAGCTGGGTGCCGCTGAC       c.11880
 L  V  A  L  T  A  A  T  A  L  V  R  L  A  Q  L  G  A  A  D         p.3960

          .         .         .         .         .         .       g.49952
 CGCCAGTGGACCCGTTTCGTGCGCGGCCGCCCGCGCCGCTTCACTAGCTTCGACCAGGTG       c.11940
 R  Q  W  T  R  F  V  R  G  R  P  R  R  F  T  S  F  D  Q  V         p.3980

          .         .         .         .         .         .       g.50012
 GCGCAGCTGAGCTCCGCAGCCCGTGGCCTGGCGGCCTCGCTGCTCTTCCTGCTTTTGGTC       c.12000
 A  Q  L  S  S  A  A  R  G  L  A  A  S  L  L  F  L  L  L  V         p.4000

     | 44    .         .         .         .         .         .    g.50147
 AAG | GCTGCCCAGCAGCTACGCTTCGTGCGCCAGTGGTCCGTCTTTGGCAAGACATTATGC    c.12060
 K   | A  A  Q  Q  L  R  F  V  R  Q  W  S  V  F  G  K  T  L  C      p.4020

          .         .         .         .         .         .       g.50207
 CGAGCTCTGCCAGAGCTCCTGGGGGTCACCTTGGGCCTGGTGGTGCTCGGGGTAGCCTAC       c.12120
 R  A  L  P  E  L  L  G  V  T  L  G  L  V  V  L  G  V  A  Y         p.4040

          .         | 45         .         .         .         .    g.50350
 GCCCAGCTGGCCATCCTG | CTCGTGTCTTCCTGTGTGGACTCCCTCTGGAGCGTGGCCCAG    c.12180
 A  Q  L  A  I  L   | L  V  S  S  C  V  D  S  L  W  S  V  A  Q      p.4060

          .         .         .         .         .         .       g.50410
 GCCCTGTTGGTGCTGTGCCCTGGGACTGGGCTCTCTACCCTGTGTCCTGCCGAGTCCTGG       c.12240
 A  L  L  V  L  C  P  G  T  G  L  S  T  L  C  P  A  E  S  W         p.4080

          .         .         .         .         .         .       g.50470
 CACCTGTCACCCCTGCTGTGTGTGGGGCTCTGGGCACTGCGGCTGTGGGGCGCCCTACGG       c.12300
 H  L  S  P  L  L  C  V  G  L  W  A  L  R  L  W  G  A  L  R         p.4100

          .         .         .         .         .         .       g.50530
 CTGGGGGCTGTTATTCTCCGCTGGCGCTACCACGCCTTGCGTGGAGAGCTGTACCGGCCG       c.12360
 L  G  A  V  I  L  R  W  R  Y  H  A  L  R  G  E  L  Y  R  P         p.4120

          .         .         .         .         .         .       g.50590
 GCCTGGGAGCCCCAGGACTACGAGATGGTGGAGTTGTTCCTGCGCAGGCTGCGCCTCTGG       c.12420
 A  W  E  P  Q  D  Y  E  M  V  E  L  F  L  R  R  L  R  L  W         p.4140

          .         .     | 46   .         .         .         .    g.50740
 ATGGGCCTCAGCAAGGTCAAGGAG | TTCCGCCACAAAGTCCGCTTTGAAGGGATGGAGCCG    c.12480
 M  G  L  S  K  V  K  E   | F  R  H  K  V  R  F  E  G  M  E  P      p.4160

          .         .         .         .         .         .       g.50800
 CTGCCCTCTCGCTCCTCCAGGGGCTCCAAGGTATCCCCGGATGTGCCCCCACCCAGCGCT       c.12540
 L  P  S  R  S  S  R  G  S  K  V  S  P  D  V  P  P  P  S  A         p.4180

          .         .         .         .         .         .       g.50860
 GGCTCCGATGCCTCGCACCCCTCCACCTCCTCCAGCCAGCTGGATGGGCTGAGCGTGAGC       c.12600
 G  S  D  A  S  H  P  S  T  S  S  S  Q  L  D  G  L  S  V  S         p.4200

          .         .         .         .         .         .       g.50920
 CTGGGCCGGCTGGGGACAAGGTGTGAGCCTGAGCCCTCCCGCCTCCAAGCCGTGTTCGAG       c.12660
 L  G  R  L  G  T  R  C  E  P  E  P  S  R  L  Q  A  V  F  E         p.4220

          .         .         .         .         .         .       g.50980
 GCCCTGCTCACCCAGTTTGACCGACTCAACCAGGCCACAGAGGACGTCTACCAGCTGGAG       c.12720
 A  L  L  T  Q  F  D  R  L  N  Q  A  T  E  D  V  Y  Q  L  E         p.4240

          .         .         .         .         .         .       g.51040
 CAGCAGCTGCACAGCCTGCAAGGCCGCAGGAGCAGCCGGGCGCCCGCCGGATCTTCCCGT       c.12780
 Q  Q  L  H  S  L  Q  G  R  R  S  S  R  A  P  A  G  S  S  R         p.4260

          .         .         .         .         .         .       g.51100
 GGCCCATCCCCGGGCCTGCGGCCAGCACTGCCCAGCCGCCTTGCCCGGGCCAGTCGGGGT       c.12840
 G  P  S  P  G  L  R  P  A  L  P  S  R  L  A  R  A  S  R  G         p.4280

          .         .         .         .         .         .       g.51160
 GTGGACCTGGCCACTGGCCCCAGCAGGACACCCCTTCGGGCCAAGAACAAGGTCCACCCC       c.12900
 V  D  L  A  T  G  P  S  R  T  P  L  R  A  K  N  K  V  H  P         p.4300

          .                                                         g.51172
 AGCAGCACTTAG                                                       c.12912
 S  S  T  X                                                         p.4303

          .         .         .         .         .         .       g.51232
 tcctccttcctggcgggggtgggccgtggagtcggagtggacaccgctcagtattacttt       c.*60

          .         .         .         .         .         .       g.51292
 ctgccgctgtcaaggccgagggccaggcagaatggctgcacgtaggttccccagagagca       c.*120

          .         .         .         .         .         .       g.51352
 ggcaggggcatctgtctgtctgtgggcttcagcactttaaagaggctgtgtggccaacca       c.*180

          .         .         .         .         .         .       g.51412
 ggacccagggtcccctccccagctcccttgggaaggacacagcagtattggacggtttct       c.*240

          .         .         .         .         .         .       g.51472
 agcctctgagatgctaatttatttccccgagtcctcaggtacagcgggctgtgcccggcc       c.*300

          .         .         .         .         .         .       g.51532
 ccaccccctgggcagatgtcccccactgctaaggctgctggcttcagggagggttagcct       c.*360

          .         .         .         .         .         .       g.51592
 gcaccgccgccaccctgcccctaagttattacctctccagttcctaccgtactccctgca       c.*420

          .         .         .         .         .         .       g.51652
 ccgtctcactgtgtgtctcgtgtcagtaatttatatggtgttaaaatgtgtatatttttg       c.*480

          .         .         .         .         .         .       g.51712
 tatgtcactattttcactagggctgaggggcctgcgcccagagctggcctcccccaacac       c.*540

          .         .         .         .         .         .       g.51772
 ctgctgcgcttggtaggtgtggtggcgttatggcagcccggctgctgcttggatgcgagc       c.*600

          .         .         .         .         .         .       g.51832
 ttggccttgggccggtgctgggggcacagctgtctgccaggcactctcatcaccccagag       c.*660

          .         .         .         .         .         .       g.51892
 gccttgtcatcctcccttgccccaggccaggtagcaagagagcagcgcccaggcctgctg       c.*720

          .         .         .         .         .         .       g.51952
 gcatcaggtctgggcaagtagcaggactaggcatgtcagaggaccccagggtggttagag       c.*780

          .         .         .         .         .         .       g.52012
 gaaaagactcctcctgggggctggctcccagggtggaggaaggtgactgtgtgtgtgtgt       c.*840

          .         .         .         .         .         .       g.52072
 gtgtgcgcgcgcgcacgcgcgagtgtgctgtatggcccaggcagcctcaaggccctcgga       c.*900

          .         .         .         .         .         .       g.52132
 gctggctgtgcctgcttctgtgtaccacttctgtgggcatggccgcttctagagcctcga       c.*960

          .         .         .         .         .                 g.52189
 cacccccccaacccccgcaccaagcagacaaagtcaataaaagagctgtctgactgc          c.*1017

 (downstream sequence)
Legend:
Nucleotide numbering (following the rules of the HGVS for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The Polycystic kidney disease 1 (autosomal dominant) protein sequence is shown below the coding DNA sequence, with numbering indicated at the right starting with 1 for the translation initiating Methionine. Every 10th amino acid is shown in bold. The position of introns is indicated by a vertical line, splitting the two exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frame shift variants, all stop codons in the +1 frame are shown in bold while all stop codons in the +2 frame are underlined.

Powered by LOVD v.3.0 Build 10c
©2004-2014 Leiden University Medical Center