cubilin (intrinsic factor-cobalamin receptor) (CUBN) - coding DNA reference sequence

(used for variant description)

(last modified August 30, 2012)


This file was created to facilitate the description of sequence variants in transcript NM_001081.3 of the CUBN gene, relative to a coding DNA reference sequence and following the HGVS recommendations.
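
In the listing, each row's last coding nucleotide (c.) is paired with its last codon (p.), for example c.60 with p.20. The following is a minimal Python sketch of that relationship. It assumes positions inside the coding region only (c.1 onward, with no intron offsets or upstream positions). The function name `codon_of` is illustrative and is not part of this file or of any HGVS tool.

```python
def codon_of(c_pos):
    """Map a coding-DNA position (c.N, N >= 1) to (codon number, position in codon)."""
    if c_pos < 1:
        # Upstream positions such as c.-1 have no codon and are not handled here.
        raise ValueError("only positions c.1 and higher are handled")
    index = c_pos - 1                      # zero-based offset from the A of the ATG
    return index // 3 + 1, index % 3 + 1   # both results are one-based


# c.60 is the third base of codon 20, matching the c.60 / p.20 row labels.
print(codon_of(60))
```

A substitution at c.60 would therefore be described at the protein level as affecting residue 20, provided the reading frame is as shown in the listing.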

The sequence was taken from NC_000010.10, covering CUBN transcript NM_001081.3.
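
Nucleotide rows in the listing carry exon-boundary markers (`|`) and a trailing `c.` label, so they need light cleaning before reuse. The sketch below shows one possible way to do that in Python and to translate the result with the standard genetic code. The helper names `parse_row` and `translate` are hypothetical, and the row layout it expects is inferred from this listing rather than taken from a formal specification.

```python
import re

# Standard genetic code, codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# A row is some content followed by whitespace and a label such as c.180 or c.-1.
ROW = re.compile(r"^(.*?)\s+([cgp]\.-?\d+)\s*$")


def parse_row(line):
    """Split a nucleotide row into (bare sequence, label), dropping '|' and spaces."""
    match = ROW.match(line)
    if match is None:
        raise ValueError("row has no trailing c./g./p. label")
    body, label = match.groups()
    sequence = re.sub(r"[^ACGTacgt]", "", body)  # keep nucleotides only
    return sequence, label


def translate(sequence):
    """Translate complete codons; a trailing partial codon is ignored."""
    sequence = sequence.upper()
    usable = len(sequence) - len(sequence) % 3
    return "".join(CODON_TABLE[sequence[i:i + 3]] for i in range(0, usable, 3))


row = " CA | GCCTCGAATGGCTACAGAGAGAGGAAATTTGGTGTTTCTTACGGGGTCTGCTCAAAAC    c.180"
sequence, label = parse_row(row)
print(label, len(sequence), translate(sequence))
```

`parse_row` is meant for the nucleotide rows only. Protein rows also consist of letters, so passing them through the same filter would give meaningless output.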


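In the listing below, exon boundaries are marked with "|", and the number of the exon that starts there is shown above the sequence. In the online version of this file, intron sequences are available by clicking on those exon numbers.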
 (upstream sequence)
           .         .         .         .         .                g.5052
         atgctcagttggttggagtggcctcactcttacctgccaacctgggaggttg       c.-1

          .         .         .         .         .         .       g.5112
 ATGATGAACATGTCTTTACCTTTTCTTTGGAGTTTGCTTACCTTATTAATATTTGCTGAA       c.60
 M  M  N  M  S  L  P  F  L  W  S  L  L  T  L  L  I  F  A  E         p.20

          .         .         .         .         .         .       g.5172
 GTAAATGGCGAAGCTGGAGAACTTGAGCTGCAGAGACAAAAAAGAAGCATCAATCTCCAA       c.120
 V  N  G  E  A  G  E  L  E  L  Q  R  Q  K  R  S  I  N  L  Q         p.40

    | 02     .         .         .         .         .         .    g.5625
 CA | GCCTCGAATGGCTACAGAGAGAGGAAATTTGGTGTTTCTTACGGGGTCTGCTCAAAAC    c.180
 Q  |  P  R  M  A  T  E  R  G  N  L  V  F  L  T  G  S  A  Q  N      p.60

          .         .         .         .         .         .       g.5685
 ATTGAGTTTAGAACCGGATCCCTGGGAAAAATTAAATTAAATGATGAAGATCTCAGTGAG       c.240
 I  E  F  R  T  G  S  L  G  K  I  K  L  N  D  E  D  L  S  E         p.80

          .   | 03     .         .         .         .         .    g.6941
 TGTTTACATCAG | ATCCAGAAAAACAAAGAAGATATTATAGAGTTAAAAGGGAGTGCAATT    c.300
 C  L  H  Q   | I  Q  K  N  K  E  D  I  I  E  L  K  G  S  A  I      p.100

          .         .         .         .         | 04         .    g.8030
 GGTCTGCCTCAAAATATATCTAGTCAAATCTATCAGCTTAATTCCAAG | CTGGTGGATCTT    c.360
 G  L  P  Q  N  I  S  S  Q  I  Y  Q  L  N  S  K   | L  V  D  L      p.120

          .         .        | 05.         .         .         .    g.11161
 GAGAGAAAATTCCAAGGCTTGCAGCAG | ACTGTTGACAAAAAGGTTTGCAGCAGCAATCCT    c.420
 E  R  K  F  Q  G  L  Q  Q   | T  V  D  K  K  V  C  S  S  N  P      p.140

          .         .         .         .         .         .       g.11221
 TGCCAGAATGGTGGAACCTGCCTCAATCTGCATGATTCCTTTTTTTGTATCTGTCCCCCA       c.480
 C  Q  N  G  G  T  C  L  N  L  H  D  S  F  F  C  I  C  P  P         p.160

           | 06        .         .         .         .         .    g.11970
 CAGTGGAAG | GGTCCTCTCTGCTCAGCTGATGTTAACGAATGTGAGATTTACTCAGGAACA    c.540
 Q  W  K   | G  P  L  C  S  A  D  V  N  E  C  E  I  Y  S  G  T      p.180

          .         .         .         .         .    | 07    .    g.19227
 CCCTTGAGCTGCCAGAATGGAGGCACATGTGTTAATACAATGGGAAGTTACAG | TTGTCAC    c.600
 P  L  S  C  Q  N  G  G  T  C  V  N  T  M  G  S  Y  S  |  C  H      p.200

          .         .         .         .         .         .       g.19287
 TGCCCACCTGAGACGTACGGACCCCAGTGTGCATCCAAATATGACGACTGTGAAGGGGGT       c.660
 C  P  P  E  T  Y  G  P  Q  C  A  S  K  Y  D  D  C  E  G  G         p.220

          .         .         .         .         .         .       g.19347
 TCTGTGGCACGCTGTGTCCATGGCATCTGTGAGGATTTAATGCGAGAGCAAGCTGGAGAG       c.720
 S  V  A  R  C  V  H  G  I  C  E  D  L  M  R  E  Q  A  G  E         p.240

  | 08       .         .         .         .         .         .    g.20688
  | CCCAAGTACAGCTGCGTCTGTGATGCTGGGTGGATGTTTTCACCCAACAGCCCTGCCTGC    c.780
  | P  K  Y  S  C  V  C  D  A  G  W  M  F  S  P  N  S  P  A  C      p.260

          .         .         .         .         .         .       g.20748
 ACGCTGGACAGAGACGAGTGCAGCTTCCAGCCCGGGCCTTGCTCCACACTTGTGCAGTGT       c.840
 T  L  D  R  D  E  C  S  F  Q  P  G  P  C  S  T  L  V  Q  C         p.280

          .         .         .         .    | 09    .         .    g.23784
 TTCAACACTCAAGGCTCTTTCTACTGTGGGGCCTGTCCAACAG | GCTGGCAAGGCAATGGA    c.900
 F  N  T  Q  G  S  F  Y  C  G  A  C  P  T  G |   W  Q  G  N  G      p.300

          .         .         .         .         .         .       g.23844
 TATATTTGCGAAGATATCAATGAATGTGAGATAAATAACGGCGGCTGTTCTGTGGCTCCA       c.960
 Y  I  C  E  D  I  N  E  C  E  I  N  N  G  G  C  S  V  A  P         p.320

          .         .         .         .         .      | 10  .    g.25087
 CCCGTTGAGTGTGTGAATACACCTGGGTCTTCCCACTGCCAGGCCTGTCCACCAG | GGTAC    c.1020
 P  V  E  C  V  N  T  P  G  S  S  H  C  Q  A  C  P  P  G |   Y      p.340

          .         .         .         .         .         .       g.25147
 CAGGGTGACGGAAGAGTGTGCACACTCACAGACATCTGCTCAGTCAGTAATGGAGGCTGC       c.1080
 Q  G  D  G  R  V  C  T  L  T  D  I  C  S  V  S  N  G  G  C         p.360

          .         .         .  | 11      .         .         .    g.29271
 CACCCAGATGCCTCATGCTCCTCAACTCTAG | GTTCCTTACCTCTCTGCACGTGTCTCCCG    c.1140
 H  P  D  A  S  C  S  S  T  L  G |   S  L  P  L  C  T  C  L  P      p.380

          .         .         .         .         .         .       g.29331
 GGTTATACTGGAAATGGTTATGGGCCAAATGGATGTGTGCAGCTCAGTAATATTTGCCTA       c.1200
 G  Y  T  G  N  G  Y  G  P  N  G  C  V  Q  L  S  N  I  C  L         p.400

          .         .         . | 12       .         .         .    g.30242
 AGTCACCCCTGTCTAAATGGACAATGCATC | GACACTGTCTCTGGTTATTTTTGTAAGTGT    c.1260
 S  H  P  C  L  N  G  Q  C  I   | D  T  V  S  G  Y  F  C  K  C      p.420

          .         .         .         .         .         .       g.30302
 GACTCAGGTTGGACAGGTGTCAACTGTACAGAAAACATCAATGAGTGTTTGAGCAACCCC       c.1320
 D  S  G  W  T  G  V  N  C  T  E  N  I  N  E  C  L  S  N  P         p.440

          .         .         .         .         .         .       g.30362
 TGTTTGAATGGAGGAACTTGTGTTGATGGCGTTGATTCTTTCAGTTGTGAATGCACACGT       c.1380
 C  L  N  G  G  T  C  V  D  G  V  D  S  F  S  C  E  C  T  R         p.460

          .         .         .        | 13.         .         .    g.31603
 CTCTGGACTGGAGCTCTCTGTCAGGTTCCTCAGCAAG | TTTGTGGAGAGTCCCTCTCAGGA    c.1440
 L  W  T  G  A  L  C  Q  V  P  Q  Q  V |   C  G  E  S  L  S  G      p.480

          .         .         .         .         .         .       g.31663
 ATAAATGGAAGCTTCAGCTACAGGAGCCCGGATGTTGGTTATGTTCATGATGTTAACTGC       c.1500
 I  N  G  S  F  S  Y  R  S  P  D  V  G  Y  V  H  D  V  N  C         p.500

          .         .         . | 14       .         .         .    g.34608
 TTCTGGGTTATCAAAACTGAAATGGGAAAG | GTCCTGCGTATCACTTTCACTTTTTTCCGG    c.1560
 F  W  V  I  K  T  E  M  G  K   | V  L  R  I  T  F  T  F  F  R      p.520

          .         .         .         .         .         .       g.34668
 TTAGAATCCATGGACAACTGTCCACACGAGTTTCTTCAGGTTTATGATGGAGATTCCTCT       c.1620
 L  E  S  M  D  N  C  P  H  E  F  L  Q  V  Y  D  G  D  S  S         p.540

          .         .         .         .         .         .       g.34728
 TCTGCTTTTCAACTTGGAAGATTTTGTGGCTCCAGCCTCCCTCATGAACTCCTCAGCAGT       c.1680
 S  A  F  Q  L  G  R  F  C  G  S  S  L  P  H  E  L  L  S  S         p.560

          .         .         .         .         .         .       g.34788
 GACAATGCTCTCTATTTTCATCTCTATTCTGAACATTTAAGAAATGGGAGAGGCTTTACA       c.1740
 D  N  A  L  Y  F  H  L  Y  S  E  H  L  R  N  G  R  G  F  T         p.580

          .         .      | 15  .         .         .         .    g.46507
 GTAAGATGGGAAACACAGCAACCAG | AGTGTGGAGGTATCCTGACTGGTCCTTACGGTTCT    c.1800
 V  R  W  E  T  Q  Q  P  E |   C  G  G  I  L  T  G  P  Y  G  S      p.600

          .         .         .         .         .         .       g.46567
 ATTAAGTCTCCGGGGTATCCTGGAAACTATCCCCCAGGAAGAGATTGTGTCTGGATTGTT       c.1860
 I  K  S  P  G  Y  P  G  N  Y  P  P  G  R  D  C  V  W  I  V         p.620

          .         .         .         .         .         .       g.46627
 GTAACTAGTCCTGACCTCCTGGTAACATTTACTTTTGGGACCTTGAGCCTCGAGCACCAT       c.1920
 V  T  S  P  D  L  L  V  T  F  T  F  G  T  L  S  L  E  H  H         p.640

          .         .        | 16.         .         .         .    g.49091
 GATGACTGCAACAAAGATTACCTTGAG | ATTCGAGATGGTCCTTTGTATCAGGACCCCCTT    c.1980
 D  D  C  N  K  D  Y  L  E   | I  R  D  G  P  L  Y  Q  D  P  L      p.660

          .         .         .         .         .         .       g.49151
 CTTGGGAAGTTCTGCACCACTTTCTCTGTCCCACCGCTCCAGACTACTGGCCCCTTTGCC       c.2040
 L  G  K  F  C  T  T  F  S  V  P  P  L  Q  T  T  G  P  F  A         p.680

          .         .         .         .         .         .       g.49211
 AGAATTCACTTCCATTCAGACTCCCAGATTAGTGACCAAGGCTTCCATATCACCTACTTA       c.2100
 R  I  H  F  H  S  D  S  Q  I  S  D  Q  G  F  H  I  T  Y  L         p.700

          . | 17       .         .         .         .         .    g.50406
 ACATCACCTT | CGGATCTGCGTTGTGGTGGGAACTACACGGACCCAGAGGGTGAACTCTTC    c.2160
 T  S  P  S |   D  L  R  C  G  G  N  Y  T  D  P  E  G  E  L  F      p.720

          .         .         .         .         .         .       g.50466
 TTGCCTGAGTTGTCTGGGCCTTTCACTCACACCAGGCAATGCGTCTATATGATGAAGCAG       c.2220
 L  P  E  L  S  G  P  F  T  H  T  R  Q  C  V  Y  M  M  K  Q         p.740

          .         .         .         .         .         .       g.50526
 CCCCAGGGAGAACAAATACAAATCAACTTCACCCACGTGGAGCTGCAATGCCAGAGTGAC       c.2280
 P  Q  G  E  Q  I  Q  I  N  F  T  H  V  E  L  Q  C  Q  S  D         p.760

          .         .  | 18      .         .         .         .    g.62885
 AGTTCTCAGAATTACATTGAG | GTTCGAGATGGTGAAACCTTACTTGGAAAAGTCTGTGGC    c.2340
 S  S  Q  N  Y  I  E   | V  R  D  G  E  T  L  L  G  K  V  C  G      p.780

          .         .         .         .         .         .       g.62945
 AACGGAACCATCTCTCACATTAAATCCATTACTAATAGTGTCTGGATCAGGTTTAAAATA       c.2400
 N  G  T  I  S  H  I  K  S  I  T  N  S  V  W  I  R  F  K  I         p.800

          .         .         .         .       | 19 .         .    g.63227
 GATGCTTCTGTTGAAAAAGCTAGTTTCAGAGCTGTTTATCAAGTCG | CTTGCGGGGATGAA    c.2460
 D  A  S  V  E  K  A  S  F  R  A  V  Y  Q  V  A |   C  G  D  E      p.820

          .         .         .         .         .         .       g.63287
 TTAACTGGAGAAGGGGTCATTCGCTCGCCTTTTTTTCCTAACGTGTATCCTGGAGAAAGA       c.2520
 L  T  G  E  G  V  I  R  S  P  F  F  P  N  V  Y  P  G  E  R         p.840

          .         .         .         .         .         .       g.63347
 ACCTGTAGGTGGACCATCCACCAGCCCCAAAGCCAAGTCATTCTCCTCAACTTCACTGTC       c.2580
 T  C  R  W  T  I  H  Q  P  Q  S  Q  V  I  L  L  N  F  T  V         p.860

          .         .         .         .      | 20  .         .    g.66062
 TTTGAAATTGGAAGTTCTGCCCACTGTGAAACAGATTATGTTGAG | ATTGGTAGCAGTTCC    c.2640
 F  E  I  G  S  S  A  H  C  E  T  D  Y  V  E   | I  G  S  S  S      p.880

          .         .         .         .         .         .       g.66122
 ATTTTGGGTTCTCCTGAAAATAAAAAGTATTGCGGTACAGACATACCTTCATTTATAACA       c.2700
 I  L  G  S  P  E  N  K  K  Y  C  G  T  D  I  P  S  F  I  T         p.900

          .         .         .         .         .         .       g.66182
 TCTGTGTACAATTTTCTTTATGTCACATTCGTGAAAAGTTCTTCTACTGAAAACCATGGT       c.2760
 S  V  Y  N  F  L  Y  V  T  F  V  K  S  S  S  T  E  N  H  G         p.920

          .         .         .  | 21      .         .         .    g.66566
 TTCATGGCTAAGTTCAGTGCTGAGGATTTGG | CATGTGGAGAAATTCTTACAGAATCAACA    c.2820
 F  M  A  K  F  S  A  E  D  L  A |   C  G  E  I  L  T  E  S  T      p.940

          .         .         .         .         .         .       g.66626
 GGGACCATTCAAAGTCCTGGCCATCCAAATGTCTACCCCCACGGTATCAACTGTACTTGG       c.2880
 G  T  I  Q  S  P  G  H  P  N  V  Y  P  H  G  I  N  C  T  W         p.960

          .         .         .         .         .         .       g.66686
 CATATATTAGTCCAACCTAATCACCTGATTCATTTAATGTTCGAAACATTTCATCTGGAG       c.2940
 H  I  L  V  Q  P  N  H  L  I  H  L  M  F  E  T  F  H  L  E         p.980

          .         .         .         .         .         .       g.66746
 TTTCATTACAATTGCACAAACGACTACTTGGAAGTTTATGACACCGACTCTGAGACATCC       c.3000
 F  H  Y  N  C  T  N  D  Y  L  E  V  Y  D  T  D  S  E  T  S         p.1000

          | 22         .         .         .         .         .    g.69231
 CTTGGAAG | ATACTGTGGAAAGTCGATCCCGCCATCTCTCACAAGCAGTGGTAACTCATTG    c.3060
 L  G  R  |  Y  C  G  K  S  I  P  P  S  L  T  S  S  G  N  S  L      p.1020

          .         .         .         .         .         .       g.69291
 ATGCTGGTGTTTGTGACTGACTCCGACCTCGCTTATGAAGGCTTCTTAATAAACTATGAA       c.3120
 M  L  V  F  V  T  D  S  D  L  A  Y  E  G  F  L  I  N  Y  E         p.1040

          .          | 23        .         .         .         .    g.87255
 GCAATCAGTGCAGCAACAG | CATGTTTGCAAGACTACACAGATGATTTGGGGACATTCACT    c.3180
 A  I  S  A  A  T  A |   C  L  Q  D  Y  T  D  D  L  G  T  F  T      p.1060

          .         .         .         .         .         .       g.87315
 TCTCCAAACTTCCCCAATAATTATCCCAACAACTGGGAATGCATTTATCGGATCACAGTG       c.3240
 S  P  N  F  P  N  N  Y  P  N  N  W  E  C  I  Y  R  I  T  V         p.1080

          .         .         .         .         .         .       g.87375
 AGAACTGGCCAACTGATTGCAGTGCACTTCACAAACTTCTCCTTGGAGGAAGCCATTGGA       c.3300
 R  T  G  Q  L  I  A  V  H  F  T  N  F  S  L  E  E  A  I  G         p.1100

          .         .          | 24        .         .         .    g.88754
 AACTATTATACAGATTTTCTGGAAATCAG | AGATGGAGGCTATGAAAAATCACCATTGCTG    c.3360
 N  Y  Y  T  D  F  L  E  I  R  |  D  G  G  Y  E  K  S  P  L  L      p.1120

          .         .         .         .         .         .       g.88814
 GGAATATTCTATGGCTCAAATCTACCCCCAACAATCATCTCTCATAGTAACAAACTATGG       c.3420
 G  I  F  Y  G  S  N  L  P  P  T  I  I  S  H  S  N  K  L  W         p.1140

          .         .         .         .         .         .       g.88874
 TTAAAATTTAAGAGTGACCAAATAGACACAAGGTCTGGATTCTCAGCTTACTGGGATGGG       c.3480
 L  K  F  K  S  D  Q  I  D  T  R  S  G  F  S  A  Y  W  D  G         p.1160

          . | 25       .         .         .         .         .    g.89679
 TCATCAACAG | GTTGCGGGGGTAATCTCACCACTTCAAGCGGCACGTTCATATCTCCCAAC    c.3540
 S  S  T  G |   C  G  G  N  L  T  T  S  S  G  T  F  I  S  P  N      p.1180

          .         .         .         .         .         .       g.89739
 TACCCGATGCCCTATTACCACAGCTCTGAATGCTACTGGTGGTTGAAATCTAGCCACGGC       c.3600
 Y  P  M  P  Y  Y  H  S  S  E  C  Y  W  W  L  K  S  S  H  G         p.1200

          .         .         .         .         .         .       g.89799
 AGCGCATTTGAACTGGAATTCAAAGACTTTCACTTGGAGCATCATCCAAACTGCACTTTA       c.3660
 S  A  F  E  L  E  F  K  D  F  H  L  E  H  H  P  N  C  T  L         p.1220

          .   | 26     .         .         .         .         .    g.90882
 GATTACCTGGCT | GTATATGATGGCCCAAGTAGCAACTCTCATCTGCTAACTCAGCTTTGT    c.3720
 D  Y  L  A   | V  Y  D  G  P  S  S  N  S  H  L  L  T  Q  L  C      p.1240

          .         .         .         .         .         .       g.90942
 GGGGATGAGAAACCCCCTCTTATTCGTTCTAGTGGAGACAGCATGTTTATAAAACTGAGG       c.3780
 G  D  E  K  P  P  L  I  R  S  S  G  D  S  M  F  I  K  L  R         p.1260

          .         .         .         .          | 27        .    g.93608
 ACAGATGAAGGTCAGCAAGGACGTGGCTTCAAGGCTGAATACCGGCAGA | CATGTGAGAAT    c.3840
 T  D  E  G  Q  Q  G  R  G  F  K  A  E  Y  R  Q  T |   C  E  N      p.1280

          .         .         .         .         .         .       g.93668
 GTGGTAATAGTCAATCAAACCTATGGCATCTTAGAGAGTATAGGGTATCCGAATCCTTAT       c.3900
 V  V  I  V  N  Q  T  Y  G  I  L  E  S  I  G  Y  P  N  P  Y         p.1300

          .         .         .         .         .         .       g.93728
 TCTGAAAATCAGCATTGCAACTGGACCATCCGGGCAACAACAGGCAACACTGTGAACTAC       c.3960
 S  E  N  Q  H  C  N  W  T  I  R  A  T  T  G  N  T  V  N  Y         p.1320

          .         .         .         .         .        | 28.    g.114837
 ACATTTTTAGCATTTGACTTGGAACATCACATAAACTGCTCCACAGATTATTTAGAG | CTC    c.4020
 T  F  L  A  F  D  L  E  H  H  I  N  C  S  T  D  Y  L  E   | L      p.1340

          .         .         .         .         .         .       g.114897
 TATGATGGACCACGGCAGATGGGACGCTACTGTGGAGTAGACCTGCCCCCTCCAGGGAGT       c.4080
 Y  D  G  P  R  Q  M  G  R  Y  C  G  V  D  L  P  P  P  G  S         p.1360

          .         .         .         .         .         .       g.114957
 ACTACAAGCTCCAAGCTTCAAGTGCTGCTCCTTACAGATGGGGTTGGCCGCCGTGAGAAA       c.4140
 T  T  S  S  K  L  Q  V  L  L  L  T  D  G  V  G  R  R  E  K         p.1380

          .         .         | 29         .         .         .    g.144334
 GGATTTCAGATGCAGTGGTTTGTTTACG | GTTGTGGTGGAGAGCTGTCTGGGGCCACAGGC    c.4200
 G  F  Q  M  Q  W  F  V  Y  G |   C  G  G  E  L  S  G  A  T  G      p.1400

          .         .         .         .         .         .       g.144394
 TCCTTCAGCAGCCCCGGGTTCCCCAACAGGTATCCACCAAACAAGGAGTGTATCTGGTAC       c.4260
 S  F  S  S  P  G  F  P  N  R  Y  P  P  N  K  E  C  I  W  Y         p.1420

          .         .         .         .         .         .       g.144454
 ATTAGGACGGACCCCGGGAGTAGCATTCAGCTCACCATCCATGACTTCGATGTGGAGTAT       c.4320
 I  R  T  D  P  G  S  S  I  Q  L  T  I  H  D  F  D  V  E  Y         p.1440

          .         .         . | 30       .         .         .    g.150568
 CATTCAAGGTGCAACTTTGATGTCTTGGAG | ATCTATGGAGGCCCCGATTTCCACTCTCCC    c.4380
 H  S  R  C  N  F  D  V  L  E   | I  Y  G  G  P  D  F  H  S  P      p.1460

          .         .         .         .         .         .       g.150628
 AGAATAGCCCAACTGTGTACCCAGAGATCACCTGAGAACCCCATGCAGGTCTCCAGCACT       c.4440
 R  I  A  Q  L  C  T  Q  R  S  P  E  N  P  M  Q  V  S  S  T         p.1480

          .         .         .         .         .         .       g.150688
 GGAAATGAGCTAGCAATTCGATTCAAGACCGACTTGTCCATAAATGGGAGAGGCTTCAAT       c.4500
 G  N  E  L  A  I  R  F  K  T  D  L  S  I  N  G  R  G  F  N         p.1500

          .         .      | 31  .         .         .         .    g.152199
 GCGTCATGGCAAGCAGTCACTGGAG | GTTGTGGTGGGATTTTCCAGGCTCCCAGTGGAGAG    c.4560
 A  S  W  Q  A  V  T  G  G |   C  G  G  I  F  Q  A  P  S  G  E      p.1520

          .         .         .         .         .         .       g.152259
 ATTCATTCTCCAAATTACCCCAGTCCTTATAGGAGCAACACAGACTGTTCTTGGGTCATT       c.4620
 I  H  S  P  N  Y  P  S  P  Y  R  S  N  T  D  C  S  W  V  I         p.1540

          .         .         .         .         .         .       g.152319
 CGGGTTGACAGAAATCATCGTGTTCTCTTGAACTTCACTGACTTTGATCTTGAACCACAA       c.4680
 R  V  D  R  N  H  R  V  L  L  N  F  T  D  F  D  L  E  P  Q         p.1560

          .      | 32  .         .         .         .         .    g.180314
 GACTCTTGTATTATG | GCATACGATGGCTTAAGCTCCACAATGTCCCGCCTTGCCAGGACG    c.4740
 D  S  C  I  M   | A  Y  D  G  L  S  S  T  M  S  R  L  A  R  T      p.1580

          .         .         .         .         .         .       g.180374
 TGTGGAAGGGAGCAGCTGGCTAACCCCATCGTCTCCTCAGGAAACAGCCTCTTCTTGAGA       c.4800
 C  G  R  E  Q  L  A  N  P  I  V  S  S  G  N  S  L  F  L  R         p.1600

          .         .         .         .         .      | 33  .    g.182433
 TTTCAGTCTGGCCCTTCCAGACAGAACAGAGGCTTCCGAGCTCAATTCAGGCAAG | CCTGC    c.4860
 F  Q  S  G  P  S  R  Q  N  R  G  F  R  A  Q  F  R  Q  A |   C      p.1620

          .         .         .         .         .         .       g.182493
 GGAGGCCACATCCTCACCAGCTCATTTGATACTGTTTCCTCTCCACGGTTCCCTGCCAAT       c.4920
 G  G  H  I  L  T  S  S  F  D  T  V  S  S  P  R  F  P  A  N         p.1640

          .         .         .         .          | 34        .    g.184717
 TATCCAAACAATCAGAACTGCAGCTGGATCATTCAAGCGCAACCTCCAT | TAAATCATATC    c.4980
 Y  P  N  N  Q  N  C  S  W  I  I  Q  A  Q  P  P  L |   N  H  I      p.1660

          .         .         .         .         .         .       g.184777
 ACCCTCTCTTTTACCCACTTTGAACTTGAAAGAAGCACAACGTGTGCACGTGACTTTGTA       c.5040
 T  L  S  F  T  H  F  E  L  E  R  S  T  T  C  A  R  D  F  V         p.1680

          .         .         .         . | 35       .         .    g.186231
 GAAATTTTGGATGGCGGCCACGAAGACGCGCCCCTCCGAG | GCCGTTACTGTGGCACCGAC    c.5100
 E  I  L  D  G  G  H  E  D  A  P  L  R  G |   R  Y  C  G  T  D      p.1700

          .         .         .         .         .         .       g.186291
 ATGCCCCATCCTATCACATCCTTCAGCAGCGCCCTGACGCTGAGATTCGTCTCTGATTCT       c.5160
 M  P  H  P  I  T  S  F  S  S  A  L  T  L  R  F  V  S  D  S         p.1720

          .         .         .         .          | 36        .    g.187461
 AGCATCAGTGCTGGGGGTTTCCACACCACGGTCACCGCATCAGTGTCGG | CTTGTGGTGGA    c.5220
 S  I  S  A  G  G  F  H  T  T  V  T  A  S  V  S  A |   C  G  G      p.1740

          .         .         .         .         .         .       g.187521
 ACGTTCTACATGGCTGAAGGCATCTTCAACAGCCCTGGCTACCCAGACATTTATCCCCCT       c.5280
 T  F  Y  M  A  E  G  I  F  N  S  P  G  Y  P  D  I  Y  P  P         p.1760

          .         .         .         .         .         .       g.187581
 AATGTGGAATGTGTCTGGAACATCGTCAGTTCCCCTGGCAACCGGCTCCAGCTGTCTTTT       c.5340
 N  V  E  C  V  W  N  I  V  S  S  P  G  N  R  L  Q  L  S  F         p.1780

    | 37     .         .         .         .         .         .    g.194638
 AT | ATCTTTCCAGTTGGAAGACTCTCAGGACTGCAGCAGAGATTTTGTGGAGATCCGTGAA    c.5400
 I  |  S  F  Q  L  E  D  S  Q  D  C  S  R  D  F  V  E  I  R  E      p.1800

          .         .         .         .         .         .       g.194698
 GGAAATGCCACGGGTCACTTGGTGGGACGATACTGTGGAAACTCCTTCCCTCTCAATTAT       c.5460
 G  N  A  T  G  H  L  V  G  R  Y  C  G  N  S  F  P  L  N  Y         p.1820

          .         .         .         .         .         .       g.194758
 TCTTCCATCGTTGGACATACCCTGTGGGTCAGATTTATCTCAGATGGTTCTGGCAGCGGC       c.5520
 S  S  I  V  G  H  T  L  W  V  R  F  I  S  D  G  S  G  S  G         p.1840

          .         .         | 38         .         .         .    g.195702
 ACGGGCTTCCAGGCCACATTTATGAAGA | TATTTGGCAATGATAATATTGTGGGAACTCAT    c.5580
 T  G  F  Q  A  T  F  M  K  I |   F  G  N  D  N  I  V  G  T  H      p.1860

          .         .         .         .         .         .       g.195762
 GGGAAAGTCGCCTCTCCTTTCTGGCCTGAAAACTACCCACATAACTCCAATTACCAATGG       c.5640
 G  K  V  A  S  P  F  W  P  E  N  Y  P  H  N  S  N  Y  Q  W         p.1880

          .         .         .         .         .         .       g.195822
 ACAGTAAATGTGAATGCATCTCACGTTGTCCATGGTAGAATCTTGGAGATGGACATAGAA       c.5700
 T  V  N  V  N  A  S  H  V  V  H  G  R  I  L  E  M  D  I  E         p.1900

          .         .         .    | 39    .         .         .    g.197060
 GAAATACAAAACTGCTATTATGACAAATTAAGG | ATCTATGATGGGCCTAGCATTCACGCC    c.5760
 E  I  Q  N  C  Y  Y  D  K  L  R   | I  Y  D  G  P  S  I  H  A      p.1920

          .         .         .         .         .         .       g.197120
 CGCCTAATTGGAGCTTACTGTGGTACCCAGACTGAATCTTTCAGCTCCACTGGAAATTCT       c.5820
 R  L  I  G  A  Y  C  G  T  Q  T  E  S  F  S  S  T  G  N  S         p.1940

          .         .         .         .         .         .       g.197180
 TTGACATTTCATTTTTACTCCGACTCTTCAATCTCAGGGAAGGGATTCCTTCTGGAGTGG       c.5880
 L  T  F  H  F  Y  S  D  S  S  I  S  G  K  G  F  L  L  E  W         p.1960

          .         .         .         .       | 40 .         .    g.201547
 TTTGCAGTGGATGCACCTGATGGTGTTTTACCTACCATTGCTCCAG | GTGCTTGTGGTGGC    c.5940
 F  A  V  D  A  P  D  G  V  L  P  T  I  A  P  G |   A  C  G  G      p.1980

          .         .         .         .         .         .       g.201607
 TTCCTGAGGACGGGAGATGCACCCGTGTTTCTCTTCTCCCCGGGCTGGCCTGACAGTTAC       c.6000
 F  L  R  T  G  D  A  P  V  F  L  F  S  P  G  W  P  D  S  Y         p.2000

          .         .         .         .         .         .       g.201667
 AGTAATAGAGTGGACTGTACGTGGCTCATCCAGGCTCCCGACTCTACCGTGGAACTCAAC       c.6060
 S  N  R  V  D  C  T  W  L  I  Q  A  P  D  S  T  V  E  L  N         p.2020

          .         .         .         .         .         .       g.201727
 ATTCTTTCCCTGGACATTGAATCTCACCGAACGTGTGCCTATGATAGCCTTGTGATACGA       c.6120
 I  L  S  L  D  I  E  S  H  R  T  C  A  Y  D  S  L  V  I  R         p.2040

      | 41   .         .         .         .         .         .    g.206570
 GATG | GAGATAATAACTTGGCCCAGCAGCTAGCAGTTCTCTGTGGCAGAGAGATCCCTGGG    c.6180
 D  G |   D  N  N  L  A  Q  Q  L  A  V  L  C  G  R  E  I  P  G      p.2060

          .         .         .         .         .         .       g.206630
 CCCATCCGGTCTACTGGAGAGTACATGTTCATCCGCTTCACCTCGGACTCCAGTGTAACC       c.6240
 P  I  R  S  T  G  E  Y  M  F  I  R  F  T  S  D  S  S  V  T         p.2080

          .         .         .  | 42      .         .         .    g.209072
 AGGGCAGGCTTCAATGCATCCTTTCACAAGA | GCTGCGGTGGATATTTGCATGCAGACAGA    c.6300
 R  A  G  F  N  A  S  F  H  K  S |   C  G  G  Y  L  H  A  D  R      p.2100

          .         .         .         .         .         .       g.209132
 GGGATCATCACGTCCCCCAAGTATCCAGAGACTTACCCATCCAACCTCAACTGTTCTTGG       c.6360
 G  I  I  T  S  P  K  Y  P  E  T  Y  P  S  N  L  N  C  S  W         p.2120

          .         .         .         .         .         .       g.209192
 CACGTCCTGGTCCAAAGTGGCCTGACCATTGCTGTCCATTTTGAACAGCCTTTCCAGATT       c.6420
 H  V  L  V  Q  S  G  L  T  I  A  V  H  F  E  Q  P  F  Q  I         p.2140

          .         .         .         .   | 43     .         .    g.209411
 CCAAATGGAGATTCTTCTTGCAACCAGGGGGATTACTTGGTG | CTAAGAAATGGTCCTGAT    c.6480
 P  N  G  D  S  S  C  N  Q  G  D  Y  L  V   | L  R  N  G  P  D      p.2160

          .         .         .         .         .         .       g.209471
 ATCTGTTCTCCACCCTTGGGACCCCCTGGAGGAAATGGTCATTTTTGTGGCAGTCATGCT       c.6540
 I  C  S  P  P  L  G  P  P  G  G  N  G  H  F  C  G  S  H  A         p.2180

          .         .         .         .         .         .       g.209531
 TCATCAACTCTGTTCACCTCGGATAATCAAATGTTTGTTCAGTTTATTTCTGATCACAGT       c.6600
 S  S  T  L  F  T  S  D  N  Q  M  F  V  Q  F  I  S  D  H  S         p.2200

          .         .         .         .       | 44 .         .    g.214694
 AATGAAGGGCAAGGATTTAAAATCAAATATGAGGCAAAGAGTTTAG | CCTGTGGGGGCAAC    c.6660
 N  E  G  Q  G  F  K  I  K  Y  E  A  K  S  L  A |   C  G  G  N      p.2220

          .         .         .         .         .         .       g.214754
 GTCTACATCCATGATGCTGATTCTGCTGGGTATGTGACCTCCCCCAACCACCCTCATAAT       c.6720
 V  Y  I  H  D  A  D  S  A  G  Y  V  T  S  P  N  H  P  H  N         p.2240

          .         .         .         .         .         .       g.214814
 TATCCCCCGCACGCTGATTGCATTTGGATCTTAGCGGCTCCACCGGAAACACGCATACAG       c.6780
 Y  P  P  H  A  D  C  I  W  I  L  A  A  P  P  E  T  R  I  Q         p.2260

          .         .         .         .  | 45      .         .    g.216036
 CTGCAATTTGAAGATCGATTCGATATTGAAGTAACACCCAA | CTGTACTTCCAACTACCTT    c.6840
 L  Q  F  E  D  R  F  D  I  E  V  T  P  N  |  C  T  S  N  Y  L      p.2280

          .         .         .         .         .         .       g.216096
 GAGTTGCGGGATGGAGTGGATTCGGATGCACCAATACTTTCCAAATTTTGTGGGACATCT       c.6900
 E  L  R  D  G  V  D  S  D  A  P  I  L  S  K  F  C  G  T  S         p.2300

          .         .         .         .         .         .       g.216156
 TTGCCCAGCAGTCAGTGGTCCTCAGGAGAGGTTATGTATTTGAGATTTCGATCTGACAAC       c.6960
 L  P  S  S  Q  W  S  S  G  E  V  M  Y  L  R  F  R  S  D  N         p.2320

          .         .         .         . | 46       .         .    g.218807
 AGCCCCACACATGTGGGATTCAAGGCCAAGTATTCTATAG | CTCAGTGTGGGGGAAGAGTA    c.7020
 S  P  T  H  V  G  F  K  A  K  Y  S  I  A |   Q  C  G  G  R  V      p.2340

          .         .         .         .         .         .       g.218867
 CCAGGGCAAAGTGGTGTTGTTGAAAGCATTGGACATCCAACACTTCCATACAGAGACAAC       c.7080
 P  G  Q  S  G  V  V  E  S  I  G  H  P  T  L  P  Y  R  D  N         p.2360

          .         .         .         .         .         .       g.218927
 TTATTCTGTGAGTGGCATCTCCAGGGGCTCTCTGGACACTATCTCACCATCTCTTTTGAA       c.7140
 L  F  C  E  W  H  L  Q  G  L  S  G  H  Y  L  T  I  S  F  E         p.2380

          .         .         .         .         .         .       g.218987
 GACTTTAACCTTCAGAATTCTTCTGGCTGTGAAAAAGACTTCGTGGAGATCTGGGACAAT       c.7200
 D  F  N  L  Q  N  S  S  G  C  E  K  D  F  V  E  I  W  D  N         p.2400

          . | 47       .         .         .         .         .    g.219695
 CATACCTCTG | GAAACATCTTGGGCAGATACTGTGGAAACACCATTCCTGACAGCATAGAC    c.7260
 H  T  S  G |   N  I  L  G  R  Y  C  G  N  T  I  P  D  S  I  D      p.2420

          .         .         .         .         .         .       g.219755
 ACTTCTAGCAATACTGCTGTGGTCAGGTTTGTCACAGACGGCTCTGTGACTGCCTCAGGA       c.7320
 T  S  S  N  T  A  V  V  R  F  V  T  D  G  S  V  T  A  S  G         p.2440

          .         .         .  | 48      .         .         .    g.220854
 TTCAGACTGCGATTTGAATCCAGTATGGAAG | AGTGTGGTGGGGATCTTCAGGGCTCTATT    c.7380
 F  R  L  R  F  E  S  S  M  E  E |   C  G  G  D  L  Q  G  S  I      p.2460

          .         .         .         .         .         .       g.220914
 GGAACATTTACTTCTCCCAACTACCCGAACCCAAATCCTCATGGCCGGATCTGCGAGTGG       c.7440
 G  T  F  T  S  P  N  Y  P  N  P  N  P  H  G  R  I  C  E  W         p.2480

          .         .         .         .         .         .       g.220974
 AGAATCACTGCCCCGGAGGGAAGGCGGATCACCCTAATGTTTAACAACCTGAGGCTGGCC       c.7500
 R  I  T  A  P  E  G  R  R  I  T  L  M  F  N  N  L  R  L  A         p.2500

          .         .         .    | 49    .         .         .    g.227165
 ACGCATCCGTCCTGCAACAATGAGCATGTGATA | GTATTCAATGGCATTAGAAGTAACTCA    c.7560
 T  H  P  S  C  N  N  E  H  V  I   | V  F  N  G  I  R  S  N  S      p.2520

          .         .         .         .         .         .       g.227225
 CCCCAGCTAGAGAAACTGTGTAGTAGTGTGAATGTAAGCAATGAGATTAAATCTTCAGGA       c.7620
 P  Q  L  E  K  L  C  S  S  V  N  V  S  N  E  I  K  S  S  G         p.2540

          .         .         .         .         .         .       g.227285
 AACACAATGAAAGTCATTTTTTTCACGGATGGATCCAGGCCATATGGCGGCTTCACTGCT       c.7680
 N  T  M  K  V  I  F  F  T  D  G  S  R  P  Y  G  G  F  T  A         p.2560

          .         .      | 50  .         .         .         .    g.228443
 TCCTATACCTCCAGTGAAGATGCAG | TGTGTGGTGGGTCTCTTCCAAATACTCCTGAAGGA    c.7740
 S  Y  T  S  S  E  D  A  V |   C  G  G  S  L  P  N  T  P  E  G      p.2580

          .         .         .         .         .         .       g.228503
 AACTTTACTTCTCCTGGCTATGACGGAGTCAGGAATTACTCAAGAAACCTGAACTGCGAA       c.7800
 N  F  T  S  P  G  Y  D  G  V  R  N  Y  S  R  N  L  N  C  E         p.2600

          .         .         .         .         .         .       g.228563
 TGGACTCTCAGCAATCCAAATCAGGGAAATTCATCCATTTCCATTCACTTTGAAGATTTT       c.7860
 W  T  L  S  N  P  N  Q  G  N  S  S  I  S  I  H  F  E  D  F         p.2620

          .         .         .         .         .   | 51     .    g.230710
 TACCTAGAAAGTCACCAAGACTGTCAATTTGATGTCCTCGAGTTTCGAGTGG | GTGATGCT    c.7920
 Y  L  E  S  H  Q  D  C  Q  F  D  V  L  E  F  R  V  G |   D  A      p.2640

          .         .         .         .         .         .       g.230770
 GATGGGCCCCTGATGTGGAGACTTTGTGGTCCTTCAAAGCCTACATTGCCATTGGTTATA       c.7980
 D  G  P  L  M  W  R  L  C  G  P  S  K  P  T  L  P  L  V  I         p.2660

          .         .         .         .         .         .       g.230830
 CCTTATTCTCAGGTATGGATTCACTTTGTCACCAACGAACGTGTAGAACACATTGGATTC       c.8040
 P  Y  S  Q  V  W  I  H  F  V  T  N  E  R  V  E  H  I  G  F         p.2680

          .         .   | 52     .         .         .         .    g.233396
 CATGCAAAGTATTCCTTTACAG | ATTGTGGCGGAATACAGATAGGTGACAGTGGAGTGATC    c.8100
 H  A  K  Y  S  F  T  D |   C  G  G  I  Q  I  G  D  S  G  V  I      p.2700

          .         .         .         .         .         .       g.233456
 ACAAGCCCCAACTATCCAAATGCTTATGACAGCCTGACCCACTGCTCTTCGCTGTTGGAG       c.8160
 T  S  P  N  Y  P  N  A  Y  D  S  L  T  H  C  S  S  L  L  E         p.2720

          .         .     | 53   .         .         .         .    g.234003
 GCCCCACAAGGGCACACCATCACT | CTCACATTTAGTGACTTTGATATTGAACCCCATACA    c.8220
 A  P  Q  G  H  T  I  T   | L  T  F  S  D  F  D  I  E  P  H  T      p.2740

          .         .         .         .         .         .       g.234063
 ACTTGTGCTTGGGACTCTGTCACTGTCAGGAATGGTGGGTCCCCTGAATCACCCATCATA       c.8280
 T  C  A  W  D  S  V  T  V  R  N  G  G  S  P  E  S  P  I  I         p.2760

          .         .         .         .         .         .       g.234123
 GGACAATACTGTGGAAATTCAAACCCCAGGACAATACAGTCAGGTTCCAATCAGCTGGTC       c.8340
 G  Q  Y  C  G  N  S  N  P  R  T  I  Q  S  G  S  N  Q  L  V         p.2780

          .         .         .         .         .         .       g.234183
 GTGACTTTTAACTCAGACCATTCATTGCAAGGTGGTGGATTTTATGCTACGTGGAACACA       c.8400
 V  T  F  N  S  D  H  S  L  Q  G  G  G  F  Y  A  T  W  N  T         p.2800

          . | 54       .         .         .         .         .    g.235684
 CAAACTTTAG | GTTGTGGTGGAATATTTCATTCTGATAATGGTACAATCAGATCCCCTCAC    c.8460
 Q  T  L  G |   C  G  G  I  F  H  S  D  N  G  T  I  R  S  P  H      p.2820

          .         .         .         .         .         .       g.235744
 TGGCCTCAGAATTTTCCCGAAAACAGCAGATGTTCCTGGACGGCCATTACTCACAAAAGT       c.8520
 W  P  Q  N  F  P  E  N  S  R  C  S  W  T  A  I  T  H  K  S         p.2840

          .         .         .         .         .         .       g.235804
 AAACACTTGGAGATCAGCTTTGACAACAACTTCCTAATCCCCAGCGGTGATGGACAATGT       c.8580
 K  H  L  E  I  S  F  D  N  N  F  L  I  P  S  G  D  G  Q  C         p.2860

          .         | 55         .         .         .         .    g.244332
 CAGAATAGCTTCGTGAAG | GTGTGGGCAGGAACTGAGGAGGTGGACAAAGCCCTGCTAGCC    c.8640
 Q  N  S  F  V  K   | V  W  A  G  T  E  E  V  D  K  A  L  L  A      p.2880

          .         .         .         .         .         .       g.244392
 ACTGGCTGTGGGAACGTGGCTCCGGGTCCCGTTATCACACCAAGTAACACATTCACTGCC       c.8700
 T  G  C  G  N  V  A  P  G  P  V  I  T  P  S  N  T  F  T  A         p.2900

          .         .         .         .         .      | 56  .    g.246256
 GTCTTCCAGTCTCAGGAGGCACCAGCTCAGGGCTTCTCCGCGTCCTTTGTTAGCC | GATGT    c.8760
 V  F  Q  S  Q  E  A  P  A  Q  G  F  S  A  S  F  V  S  R |   C      p.2920

          .         .         .         .         .         .       g.246316
 GGAAGTAATTTCACTGGCCCTTCAGGTTACATCATTTCTCCAAATTACCCAAAACAATAT       c.8820
 G  S  N  F  T  G  P  S  G  Y  I  I  S  P  N  Y  P  K  Q  Y         p.2940

          .         .         .         .         .         .       g.246376
 GACAACAACATGAATTGCACCTATGTCATAGAGGCTAATCCTCTGTCAGTGGTCCTCTTG       c.8880
 D  N  N  M  N  C  T  Y  V  I  E  A  N  P  L  S  V  V  L  L         p.2960

          .         .      | 57  .         .         .         .    g.257755
 ACTTTTGTGTCCTTCCACTTAGAAG | CTCGTTCCGCTGTGACGGGAAGCTGTGTCAACGAT    c.8940
 T  F  V  S  F  H  L  E  A |   R  S  A  V  T  G  S  C  V  N  D      p.2980

          .         .         .         .         .         .       g.257815
 GGCGTGCACATTATCAGAGGTTACAGCGTCATGTCCACCCCATTTGCTACTGTGTGTGGG       c.9000
 G  V  H  I  I  R  G  Y  S  V  M  S  T  P  F  A  T  V  C  G         p.3000

          .         .         .         .         .         .       g.257875
 GATGAGATGCCAGCTCCCCTCACCATCGCTGGGCCGGTTCTGCTTAACTTCTACTCCAAC       c.9060
 D  E  M  P  A  P  L  T  I  A  G  P  V  L  L  N  F  Y  S  N         p.3020

          .         .         .         .       | 58 .         .    g.260328
 GAGCAAATCACAGACTTCGGATTCAAGTTTTCCTATAGGATAATCT | CCTGTGGTGGTGTG    c.9120
 E  Q  I  T  D  F  G  F  K  F  S  Y  R  I  I  S |   C  G  G  V      p.3040

          .         .         .         .         .         .       g.260388
 TTCAATTTCTCTTCTGGAATCATCACAAGTCCTGCCTATTCATACGCAGACTACCCAAAT       c.9180
 F  N  F  S  S  G  I  I  T  S  P  A  Y  S  Y  A  D  Y  P  N         p.3060

          .         .         .         .         .       | 59 .    g.264968
 GATATGCACTGTCTGTATACCATCACCGTTAGTGACGACAAGGTGATCGAGCTCAA | GTTC    c.9240
 D  M  H  C  L  Y  T  I  T  V  S  D  D  K  V  I  E  L  K  |  F      p.3080

          .         .         .         .         .         .       g.265028
 AGTGATTTTGATGTGGTTCCCTCCACCTCCTGCTCCCATGACTACCTGGCAATTTACGAT       c.9300
 S  D  F  D  V  V  P  S  T  S  C  S  H  D  Y  L  A  I  Y  D         p.3100

          .         .         .         .         .         .       g.265088
 GGTGCCAATACCAGCGATCCCCTTCTTGGCAAATTCTGCGGTTCCAAGCGCCCACCAAAT       c.9360
 G  A  N  T  S  D  P  L  L  G  K  F  C  G  S  K  R  P  P  N         p.3120

          .         .         .         .         .         .       g.265148
 GTGAAGAGCAGCAATAATAGTATGCTCCTGGTGTTCAAGACAGATTCATTTCAGACAGCA       c.9420
 V  K  S  S  N  N  S  M  L  L  V  F  K  T  D  S  F  Q  T  A         p.3140

          .         .         .     | 60   .         .         .    g.283400
 AAAGGCTGGAAGATGTCTTTCCGGCAGACATTGG | GGCCTCAGCAAGGATGTGGTGGTTAT    c.9480
 K  G  W  K  M  S  F  R  Q  T  L  G |   P  Q  Q  G  C  G  G  Y      p.3160

          .         .         .         .         .         .       g.283460
 CTGACAGGCTCGAATAATACCTTTGCCTCTCCTGATTCTGATTCGAATGGAATGTATGAC       c.9540
 L  T  G  S  N  N  T  F  A  S  P  D  S  D  S  N  G  M  Y  D         p.3180

          .         .         .         .         .         .       g.283520
 AAGAATTTAAACTGTGTATGGATCATAATTGCACCTGTAAACAAAGTAATTCACCTCACC       c.9600
 K  N  L  N  C  V  W  I  I  I  A  P  V  N  K  V  I  H  L  T         p.3200

          .         .         .         .         .         .       g.283580
 TTCAATACATTTGCTCTGGAGGCAGCAAGTACTAGGCAAAGATGCCTTTATGATTATGTA       c.9660
 F  N  T  F  A  L  E  A  A  S  T  R  Q  R  C  L  Y  D  Y  V         p.3220

     | 61    .         .         .         .         .         .    g.293827
 AAG | TTATATGATGGGGATAGTGAAAATGCGAACTTGGCTGGAACGTTTTGTGGTTCCACA    c.9720
 K   | L  Y  D  G  D  S  E  N  A  N  L  A  G  T  F  C  G  S  T      p.3240

          .         .         .         .         .         .       g.293887
 GTACCTGCTCCTTTTATCTCTTCTGGTAACTTCCTTACGGTTCAATTCATCAGTGACTTA       c.9780
 V  P  A  P  F  I  S  S  G  N  F  L  T  V  Q  F  I  S  D  L         p.3260

          .         .         .         .       | 62 .         .    g.294296
 ACATTAGAGAGGGAAGGATTTAATGCTACATACACCATCATGGACA | TGCCTTGTGGTGGA    c.9840
 T  L  E  R  E  G  F  N  A  T  Y  T  I  M  D  M |   P  C  G  G      p.3280

          .         .         .         .         .         .       g.294356
 ACATACAATGCAACTTGGACCCCACAAAATATTTCATCACCCAATTCATCAGACCCAGAT       c.9900
 T  Y  N  A  T  W  T  P  Q  N  I  S  S  P  N  S  S  D  P  D         p.3300

          .         .         .         .         .         .       g.294416
 GTCCCATTTTCCATCTGTACTTGGGTCATTGATTCCCCTCCGCATCAGCAGGTCAAGATA       c.9960
 V  P  F  S  I  C  T  W  V  I  D  S  P  P  H  Q  Q  V  K  I         p.3320

          .         .         .         .         .         .       g.294476
 ACTGTGTGGGCATTACAGCTGACCTCGCAAGACTGCACGCAGAATTACTTACAGCTTCAG       c.10020
 T  V  W  A  L  Q  L  T  S  Q  D  C  T  Q  N  Y  L  Q  L  Q         p.3340

          .   | 63     .         .         .         .         .    g.298483
 GACTCACCGCAG | GGTCACGGAAATTCAAGATTTCAGTTCTGTGGCAGAAATGCTTCGGCT    c.10080
 D  S  P  Q   | G  H  G  N  S  R  F  Q  F  C  G  R  N  A  S  A      p.3360

          .         .         .         .         .         .       g.298543
 GTGCCAGTGTTTTATTCTTCTATGAGTACTGCAATGGTCATTTTCAAATCTGGAGTTGTA       c.10140
 V  P  V  F  Y  S  S  M  S  T  A  M  V  I  F  K  S  G  V  V         p.3380

          .         .         .         . | 64       .         .    g.299642
 AACAGAAACTCTAGAATGAGTTTCACCTATCAGATTGCAG | ATTGCAACAGAGACTATCAC    c.10200
 N  R  N  S  R  M  S  F  T  Y  Q  I  A  D |   C  N  R  D  Y  H      p.3400

          .         .         .         .         .         .       g.299702
 AAGGCATTTGGCAACCTGAGAAGCCCTGGATGGCCAGATAACTACGACAATGACAAGGAT       c.10260
 K  A  F  G  N  L  R  S  P  G  W  P  D  N  Y  D  N  D  K  D         p.3420

          .         .         .         .         .         .       g.299762
 TGCACCGTTACTCTCACAGCCCCCCAGAACCACACCATTTCCCTCTTTTTTCATTCACTT       c.10320
 C  T  V  T  L  T  A  P  Q  N  H  T  I  S  L  F  F  H  S  L         p.3440

          .         .         .         .   | 65     .         .    g.303418
 GGCATCGAGAACTCAGTTGAATGCAGAAACGATTTCTTGGAG | GTGAGAAATGGAAGTAAC    c.10380
 G  I  E  N  S  V  E  C  R  N  D  F  L  E   | V  R  N  G  S  N      p.3460

          .         .         .         .         .         .       g.303478
 AGCAATTCACCATTACTGGGCAAGTACTGTGGAACTCTGCTGCCAAACCCTGTCTTCTCT       c.10440
 S  N  S  P  L  L  G  K  Y  C  G  T  L  L  P  N  P  V  F  S         p.3480

          .         .         .         .         .         .       g.303538
 CAAAATAATGAACTATACCTACGATTTAAGAGTGATAGTGTAACTTCTGATCGTGGATAT       c.10500
 Q  N  N  E  L  Y  L  R  F  K  S  D  S  V  T  S  D  R  G  Y         p.3500

          .         .         | 66         .         .         .    g.305809
 GAAATCATCTGGACTTCATCACCCTCTG | GATGTGGTGGAACTCTTTATGGAGACAGAGGC    c.10560
 E  I  I  W  T  S  S  P  S  G |   C  G  G  T  L  Y  G  D  R  G      p.3520

          .         .         .         .         .         .       g.305869
 TCATTCACCAGCCCCGGCTATCCAGGCACATACCCAAACAACACGTACTGCGAGTGGGTC       c.10620
 S  F  T  S  P  G  Y  P  G  T  Y  P  N  N  T  Y  C  E  W  V         p.3540

          .         .         .         .         .         .       g.305929
 CTTGTTGCTCCTGCTGGAAGGCTTGTCACCATCAACTTCTACTTCATCAGCATTGACGAT       c.10680
 L  V  A  P  A  G  R  L  V  T  I  N  F  Y  F  I  S  I  D  D         p.3560

          .         .         .         .         .         .       g.305989
 CCAGGAGACTGTGTCCAGAACTATCTCACACTCTATGATGGGCCCAACGCCAGCTCTCCA       c.10740
 P  G  D  C  V  Q  N  Y  L  T  L  Y  D  G  P  N  A  S  S  P         p.3580

          .         .     | 67   .         .         .         .    g.309771
 TCCTCTGGACCATACTGCGGAGGC | GACACCAGCATAGCTCCCTTCGTGGCTTCCTCAAAT    c.10800
 S  S  G  P  Y  C  G  G   | D  T  S  I  A  P  F  V  A  S  S  N      p.3600

          .         .         .         .         .         .       g.309831
 CAGGTCTTCATAAAATTTCATGCTGATTATGCACGGCGTCCATCCGCATTCCGATTAACT       c.10860
 Q  V  F  I  K  F  H  A  D  Y  A  R  R  P  S  A  F  R  L  T         p.3620

          .                                                         g.309843
 TGGGACAGCTAA                                                       c.10872
 W  D  S  X                                                         p.3623

          .         .         .         .         .         .       g.309903
 gtgggtaacaactcgtgttcactcagcactttccctctgcagcacgctggacagcactct       c.*60

          .         .         .         .         .         .       g.309963
 gccatcctgatacatgacccctgctgatgccacagagaataagctgaacttgtatggttt       c.*120

          .         .         .         .         .         .       g.310023
 ttcaccaaaccatggatagaatcaatatttgtaggccaggcgtggtggctcacccccctg       c.*180

          .         .         .         .         .         .       g.310083
 tattctcagcactttgggaggccgaggcaggttgatcacctgaggtcaggagtttgagac       c.*240

          .         .         .         .         .         .       g.310143
 tagcctggccagcatggtgaaacctcatctctctaacaatataataattagccaggcgtg       c.*300

          .         .         .         .         .         .       g.310203
 gtggtgggtgcctgtaattccagccactcgagaggctgaggcaggagaattgcttgaacc       c.*360

          .         .         .         .         .         .       g.310263
 caggaggcagaggttgcagtgagctaagatcacaccactacactccagcctgggcgagac       c.*420

          .         .         .         .         .         .       g.310323
 ggcaagactccatctcaaaaaaaaaagaaacaaaaaaaaccagaatcaatatttgtacat       c.*480

          .         .         .         .         .         .       g.310383
 tttctcgaacatagaatatagcttctttagtcttgagtgtgcatttcattctaatatttt       c.*540

          .         .         .         .         .         .       g.310443
 gagctgaaatttaaaaaaactttgaaagagttggaaatgattatggcatatgtgacatac       c.*600

          .         .         .         .         .         .       g.310503
 atttttaaaagttaataataatagccaggggcagtggctcatacccataatcccagcacg       c.*660

          .         .         .         .         .         .       g.310563
 ctgggaggccatgatgggaggattgcttgaacctaggagtttgagaccagcctgggcaac       c.*720

          .         .         .         .         .         .       g.310623
 aaagtgagacctgatttttacaaaaaatcaaaaaattagccaggcatggtggcatgcacc       c.*780

          .         .         .         .         .         .       g.310683
 cgtggttccagctacacaggaggttgaagcaggaggatcacttgagcccagtaggttaag       c.*840

          .         .         .         .         .         .       g.310743
 gctgcagtgaaaccctgtgaattaaccactgtactccagcctgggtgacagactgagacc       c.*900

          .         .         .         .         .         .       g.310803
 ctatctcaaaaatgacaacaagaacaacaaaagttaatgataatatagaagcataaattt       c.*960

          .         .         .         .                           g.310852
 cctgtgaatgttcaattacacataataaacattattgaattgtacacaa                  c.*1009

 (downstream sequence)
Legend:
Nucleotide numbering, which follows the HGVS rules for a 'Coding DNA Reference Sequence', is indicated at the right of the sequence and counts the A of the ATG translation-initiating methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The cubilin (intrinsic factor-cobalamin receptor) protein sequence is shown below the coding DNA sequence, with numbering indicated at the right, starting with 1 for the translation-initiating methionine. Every 10th amino acid is shown in bold. The position of each intron is indicated by a vertical line separating the two flanking exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frameshift variants, all stop codons in the +1 frame are shown in bold, while all stop codons in the +2 frame are underlined.
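
The numbering described in the legend can be illustrated with a short Python sketch. It is not part of LOVD; the function names (codon_number, stop_codons_in_frame) and the first_pos parameter are hypothetical and chosen only for this illustration. The sketch assumes that the "+1" and "+2" frames mean reading triplets that start one or two nucleotides downstream of the normal codon boundary, and that a stop codon is TAA, TAG or TGA.

```python
# Minimal sketch of coding DNA (c.) and protein (p.) numbering.
# All names here are illustrative assumptions, not an LOVD API.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def codon_number(c_pos: int) -> int:
    """Return the 1-based codon number that contains coding position c_pos.

    c.1 is the A of the ATG start codon, so c.1 to c.3 map to codon 1.
    """
    if c_pos < 1:
        raise ValueError("only positions inside the coding sequence (c.1 onward)")
    return (c_pos - 1) // 3 + 1


def stop_codons_in_frame(seq: str, shift: int, first_pos: int = 1) -> list[int]:
    """Return the c. position of the first base of every stop codon in a frame.

    seq       -- coding DNA whose first base lies on a codon boundary
    shift     -- 0 for the normal frame, 1 or 2 for the shifted frames
                 (assumed meaning of the legend's "+1" and "+2" frames)
    first_pos -- c. number of the first base of seq
    """
    if shift not in (0, 1, 2):
        raise ValueError("shift must be 0, 1 or 2")
    seq = seq.upper()
    return [
        first_pos + i
        for i in range(shift, len(seq) - 2, 3)
        if seq[i:i + 3] in STOP_CODONS
    ]


# The last coding line of this file, c.10861 to c.10872.
last_line = "TGGGACAGCTAA"
print(codon_number(60))                                      # 20
print(codon_number(10869))                                   # 3623
print(stop_codons_in_frame(last_line, 0, first_pos=10861))   # [10870]
```

In this file, c.60 is labelled p.20 and the last amino acid (S, c.10867 to c.10869) is labelled p.3623; the stop codon TAA occupies c.10870 to c.10872 and is therefore the 3624th codon.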

Powered by LOVD v.3.0 Build beta-07
©2004-2012 Leiden University Medical Center