apolipoprotein B (including Ag(x) antigen) (APOB) - coding DNA reference sequence

(used for variant description)

(last modified March 5, 2015)


This file was created to facilitate the description of sequence variants on transcript NM_000384.2 in the APOB gene based on a coding DNA reference sequence following the HGVS recommendations.

The sequence was taken from NG_011793.1, covering APOB transcript NM_000384.2.


Please note that introns are available by clicking on the exon numbers above the sequence.
 (upstream sequence)
                                                                    g.5008
                                                     attcccac       c.-121

 .         .         .         .         .         .                g.5068
 cgggacctgcggggctgagtgcccttctcggttgctgccgctgaggagcccgcccagcca       c.-61

 .         .         .         .         .         .                g.5128
 gccagggccgcgaggccgaggccaggccgcagcccaggagccgccccaccgcagctggcg       c.-1

          .         .         .         .         .         .       g.5188
 ATGGACCCGCCGAGGCCCGCGCTGCTGGCGCTGCTGGCGCTGCCTGCGCTGCTGCTGCTG       c.60
 M  D  P  P  R  P  A  L  L  A  L  L  A  L  P  A  L  L  L  L         p.20

          .         .   | 02     .         .         .         .    g.5560
 CTGCTGGCGGGCGCCAGGGCCG | AAGAGGAAATGCTGGAAAATGTCAGCCTGGTCTGTCCA    c.120
 L  L  A  G  A  R  A  E |   E  E  M  L  E  N  V  S  L  V  C  P      p.40

   | 03      .         .         .         .         .         .    g.6656
 A | AAGATGCGACCCGATTCAAGCACCTCCGGAAGTACACATACAACTATGAGGCTGAGAGT    c.180
 K |   D  A  T  R  F  K  H  L  R  K  Y  T  Y  N  Y  E  A  E  S      p.60

          .         .         .         .         .        | 04.    g.7993
 TCCAGTGGAGTCCCTGGGACTGCTGATTCAAGAAGTGCCACCAGGATCAACTGCAAG | GTT    c.240
 S  S  G  V  P  G  T  A  D  S  R  S  A  T  R  I  N  C  K   | V      p.80

          .         .         .         .         .         .       g.8053
 GAGCTGGAGGTTCCCCAGCTCTGCAGCTTCATCCTGAAGACCAGCCAGTGCACCCTGAAA       c.300
 E  L  E  V  P  Q  L  C  S  F  I  L  K  T  S  Q  C  T  L  K         p.100

          .         .         .         .         .         .       g.8113
 GAGGTGTATGGCTTCAACCCTGAGGGCAAAGCCTTGCTGAAGAAAACCAAGAACTCTGAG       c.360
 E  V  Y  G  F  N  P  E  G  K  A  L  L  K  K  T  K  N  S  E         p.120

          .         .    | 05    .         .         .         .    g.10999
 GAGTTTGCTGCAGCCATGTCCAG | GTATGAGCTCAAGCTGGCCATTCCAGAAGGGAAGCAG    c.420
 E  F  A  A  A  M  S  R  |  Y  E  L  K  L  A  I  P  E  G  K  Q      p.140

          .         .         .         .         .         .       g.11059
 GTTTTCCTTTACCCGGAGAAAGATGAACCTACTTACATCCTGAACATCAAGAGGGGCATC       c.480
 V  F  L  Y  P  E  K  D  E  P  T  Y  I  L  N  I  K  R  G  I         p.160

          .         .         .         .         .        | 06.    g.11821
 ATTTCTGCCCTCCTGGTTCCCCCAGAGACAGAAGAAGCCAAGCAAGTGTTGTTTCTG | GAT    c.540
 I  S  A  L  L  V  P  P  E  T  E  E  A  K  Q  V  L  F  L   | D      p.180

          .         .         .         .         .         .       g.11881
 ACCGTGTATGGAAACTGCTCCACTCACTTTACCGTCAAGACGAGGAAGGGCAATGTGGCA       c.600
 T  V  Y  G  N  C  S  T  H  F  T  V  K  T  R  K  G  N  V  A         p.200

          .         .         .         .         .         .       g.11941
 ACAGAAATATCCACTGAAAGAGACCTGGGGCAGTGTGATCGCTTCAAGCCCATCCGCACA       c.660
 T  E  I  S  T  E  R  D  L  G  Q  C  D  R  F  K  P  I  R  T         p.220

          .         .         .    | 07    .         .         .    g.13392
 GGCATCAGCCCACTTGCTCTCATCAAAGGCATG | ACCCGCCCCTTGTCAACTCTGATCAGC    c.720
 G  I  S  P  L  A  L  I  K  G  M   | T  R  P  L  S  T  L  I  S      p.240

          .         .         .         .         .         .       g.13452
 AGCAGCCAGTCCTGTCAGTACACACTGGACGCTAAGAGGAAGCATGTGGCAGAAGCCATC       c.780
 S  S  Q  S  C  Q  Y  T  L  D  A  K  R  K  H  V  A  E  A  I         p.260

          .         .         .         | 08         .         .    g.14194
 TGCAAGGAGCAACACCTCTTCCTGCCTTTCTCCTACAA | GAATAAGTATGGGATGGTAGCA    c.840
 C  K  E  Q  H  L  F  L  P  F  S  Y  K  |  N  K  Y  G  M  V  A      p.280

          .         .         .         .         .         .       g.14254
 CAAGTGACACAGACTTTGAAACTTGAAGACACACCAAAGATCAACAGCCGCTTCTTTGGT       c.900
 Q  V  T  Q  T  L  K  L  E  D  T  P  K  I  N  S  R  F  F  G         p.300

      | 09   .         .         .         .         .         .    g.15611
 GAAG | GTACTAAGAAGATGGGCCTCGCATTTGAGAGCACCAAATCCACATCACCTCCAAAG    c.960
 E  G |   T  K  K  M  G  L  A  F  E  S  T  K  S  T  S  P  P  K      p.320

          .         .         .         .         .         .       g.15671
 CAGGCCGAAGCTGTTTTGAAGACTCTCCAGGAACTGAAAAAACTAACCATCTCTGAGCAA       c.1020
 Q  A  E  A  V  L  K  T  L  Q  E  L  K  K  L  T  I  S  E  Q         p.340

          .         .         .         .         .         .       g.15731
 AATATCCAGAGAGCTAATCTCTTCAATAAGCTGGTTACTGAGCTGAGAGGCCTCAGTGAT       c.1080
 N  I  Q  R  A  N  L  F  N  K  L  V  T  E  L  R  G  L  S  D         p.360

          .         .         .         .     | 10   .         .    g.16508
 GAAGCAGTCACATCTCTCTTGCCACAGCTGATTGAGGTGTCCAG | CCCCATCACTTTACAA    c.1140
 E  A  V  T  S  L  L  P  Q  L  I  E  V  S  S  |  P  I  T  L  Q      p.380

          .         .         .         .         .         .       g.16568
 GCCTTGGTTCAGTGTGGACAGCCTCAGTGCTCCACTCACATCCTCCAGTGGCTGAAACGT       c.1200
 A  L  V  Q  C  G  Q  P  Q  C  S  T  H  I  L  Q  W  L  K  R         p.400

          .         .         .         .         .         .       g.16628
 GTGCATGCCAACCCCCTTCTGATAGATGTGGTCACCTACCTGGTGGCCCTGATCCCCGAG       c.1260
 V  H  A  N  P  L  L  I  D  V  V  T  Y  L  V  A  L  I  P  E         p.420

          .         .         .         .         .         .       g.16688
 CCCTCAGCACAGCAGCTGCGAGAGATCTTCAACATGGCGAGGGATCAGCGCAGCCGAGCC       c.1320
 P  S  A  Q  Q  L  R  E  I  F  N  M  A  R  D  Q  R  S  R  A         p.440

          .         .         .   | 11     .         .         .    g.19086
 ACCTTGTATGCGCTGAGCCACGCGGTCAACAA | CTATCATAAGACAAACCCTACAGGGACC    c.1380
 T  L  Y  A  L  S  H  A  V  N  N  |  Y  H  K  T  N  P  T  G  T      p.460

          .         .         .         .         .         .       g.19146
 CAGGAGCTGCTGGACATTGCTAATTACCTGATGGAACAGATTCAAGATGACTGCACTGGG       c.1440
 Q  E  L  L  D  I  A  N  Y  L  M  E  Q  I  Q  D  D  C  T  G         p.480

          .         .         . | 12       .         .         .    g.19318
 GATGAAGATTACACCTATTTGATTCTGCGG | GTCATTGGAAATATGGGCCAAACCATGGAG    c.1500
 D  E  D  Y  T  Y  L  I  L  R   | V  I  G  N  M  G  Q  T  M  E      p.500

          .         .         .         .         .         .       g.19378
 CAGTTAACTCCAGAACTCAAGTCTTCAATCCTGAAATGTGTCCAAAGTACAAAGCCATCA       c.1560
 Q  L  T  P  E  L  K  S  S  I  L  K  C  V  Q  S  T  K  P  S         p.520

          .         .         .         .         .        | 13.    g.20538
 CTGATGATCCAGAAAGCTGCCATCCAGGCTCTGCGGAAAATGGAGCCTAAAGACAAG | GAC    c.1620
 L  M  I  Q  K  A  A  I  Q  A  L  R  K  M  E  P  K  D  K   | D      p.540

          .         .         .         .         .         .       g.20598
 CAGGAGGTTCTTCTTCAGACTTTCCTTGATGATGCTTCTCCGGGAGATAAGCGACTGGCT       c.1680
 Q  E  V  L  L  Q  T  F  L  D  D  A  S  P  G  D  K  R  L  A         p.560

          .         .         .         .         .         .       g.20658
 GCCTATCTTATGTTGATGAGGAGTCCTTCACAGGCAGATATTAACAAAATTGTCCAAATT       c.1740
 A  Y  L  M  L  M  R  S  P  S  Q  A  D  I  N  K  I  V  Q  I         p.580

          .         .         .         .         .         .       g.20718
 CTACCATGGGAACAGAATGAGCAAGTGAAGAACTTTGTGGCTTCCCATATTGCCAATATC       c.1800
 L  P  W  E  Q  N  E  Q  V  K  N  F  V  A  S  H  I  A  N  I         p.600

          .         .          | 14        .         .         .    g.21039
 TTGAACTCAGAAGAATTGGATATCCAAGA | TCTGAAAAAGTTAGTGAAAGAAGCTCTGAAA    c.1860
 L  N  S  E  E  L  D  I  Q  D  |  L  K  K  L  V  K  E  A  L  K      p.620

          .         .         .         .         .         .       g.21099
 GAATCTCAACTTCCAACTGTCATGGACTTCAGAAAATTCTCTCGGAACTATCAACTCTAC       c.1920
 E  S  Q  L  P  T  V  M  D  F  R  K  F  S  R  N  Y  Q  L  Y         p.640

          .         .         .         .         .         .       g.21159
 AAATCTGTTTCTCTTCCATCACTTGACCCAGCCTCAGCCAAAATAGAAGGGAATCTTATA       c.1980
 K  S  V  S  L  P  S  L  D  P  A  S  A  K  I  E  G  N  L  I         p.660

          .         .         .         .         .         .       g.21219
 TTTGATCCAAATAACTACCTTCCTAAAGAAAGCATGCTGAAAACTACCCTCACTGCCTTT       c.2040
 F  D  P  N  N  Y  L  P  K  E  S  M  L  K  T  T  L  T  A  F         p.680

          .         .        | 15.         .         .         .    g.22142
 GGATTTGCTTCAGCTGACCTCATCGAG | ATTGGCTTGGAAGGAAAAGGCTTTGAGCCAACA    c.2100
 G  F  A  S  A  D  L  I  E   | I  G  L  E  G  K  G  F  E  P  T      p.700

          .         .         .         .         .         .       g.22202
 TTGGAAGCTCTTTTTGGGAAGCAAGGATTTTTCCCAGACAGTGTCAACAAAGCTTTGTAC       c.2160
 L  E  A  L  F  G  K  Q  G  F  F  P  D  S  V  N  K  A  L  Y         p.720

          .         .         .         .         .         .       g.22262
 TGGGTTAATGGTCAAGTTCCTGATGGTGTCTCTAAGGTCTTAGTGGACCACTTTGGCTAT       c.2220
 W  V  N  G  Q  V  P  D  G  V  S  K  V  L  V  D  H  F  G  Y         p.740

          .         .     | 16   .         .         .         .    g.23985
 ACCAAAGATGATAAACATGAGCAG | GATATGGTAAATGGAATAATGCTCAGTGTTGAGAAG    c.2280
 T  K  D  D  K  H  E  Q   | D  M  V  N  G  I  M  L  S  V  E  K      p.760

          .         .         .         .         .         .       g.24045
 CTGATTAAAGATTTGAAATCCAAAGAAGTCCCGGAAGCCAGAGCCTACCTCCGCATCTTG       c.2340
 L  I  K  D  L  K  S  K  E  V  P  E  A  R  A  Y  L  R  I  L         p.780

          .         .         .         .         .         .       g.24105
 GGAGAGGAGCTTGGTTTTGCCAGTCTCCATGACCTCCAGCTCCTGGGAAAGCTGCTTCTG       c.2400
 G  E  E  L  G  F  A  S  L  H  D  L  Q  L  L  G  K  L  L  L         p.800

          .         .         .       | 17 .         .         .    g.25405
 ATGGGTGCCCGCACTCTGCAGGGGATCCCCCAGATG | ATTGGAGAGGTCATCAGGAAGGGC    c.2460
 M  G  A  R  T  L  Q  G  I  P  Q  M   | I  G  E  V  I  R  K  G      p.820

          .         .         .         .         .         .       g.25465
 TCAAAGAATGACTTTTTTCTTCACTACATCTTCATGGAGAATGCCTTTGAACTCCCCACT       c.2520
 S  K  N  D  F  F  L  H  Y  I  F  M  E  N  A  F  E  L  P  T         p.840

          .         .         .         .         .         .       g.25525
 GGAGCTGGATTACAGTTGCAAATATCTTCATCTGGAGTCATTGCTCCCGGAGCCAAGGCT       c.2580
 G  A  G  L  Q  L  Q  I  S  S  S  G  V  I  A  P  G  A  K  A         p.860

          .         .     | 18   .         .         .         .    g.26067
 GGAGTAAAACTGGAAGTAGCCAAC | ATGCAGGCTGAACTGGTGGCAAAACCCTCCGTGTCT    c.2640
 G  V  K  L  E  V  A  N   | M  Q  A  E  L  V  A  K  P  S  V  S      p.880

          .         .         .         .         .         .       g.26127
 GTGGAGTTTGTGACAAATATGGGCATCATCATTCCGGACTTCGCTAGGAGTGGGGTCCAG       c.2700
 V  E  F  V  T  N  M  G  I  I  I  P  D  F  A  R  S  G  V  Q         p.900

          .         .         .         .         .         .       g.26187
 ATGAACACCAACTTCTTCCACGAGTCGGGTCTGGAGGCTCATGTTGCCCTAAAAGCTGGG       c.2760
 M  N  T  N  F  F  H  E  S  G  L  E  A  H  V  A  L  K  A  G         p.920

          .         .         .         .         .       | 19 .    g.29172
 AAGCTGAAGTTTATCATTCCTTCCCCAAAGAGACCAGTCAAGCTGCTCAGTGGAGG | CAAC    c.2820
 K  L  K  F  I  I  P  S  P  K  R  P  V  K  L  L  S  G  G  |  N      p.940

          .         .         .         .         .         .       g.29232
 ACATTACATTTGGTCTCTACCACCAAAACGGAGGTGATCCCACCTCTCATTGAGAACAGG       c.2880
 T  L  H  L  V  S  T  T  K  T  E  V  I  P  P  L  I  E  N  R         p.960

          .         .         .         .         .         .       g.29292
 CAGTCCTGGTCAGTTTGCAAGCAAGTCTTTCCTGGCCTGAATTACTGCACCTCAGGCGCT       c.2940
 Q  S  W  S  V  C  K  Q  V  F  P  G  L  N  Y  C  T  S  G  A         p.980

          .         .         .         .         .          | 20    g.29961
 TACTCCAACGCCAGCTCCACAGACTCCGCCTCCTACTATCCGCTGACCGGGGACACCAG | A    c.3000
 Y  S  N  A  S  S  T  D  S  A  S  Y  Y  P  L  T  G  D  T  R  |      p.1000

          .         .         .         .         .         .       g.30021
 TTAGAGCTGGAACTGAGGCCTACAGGAGAGATTGAGCAGTATTCTGTCAGCGCAACCTAT       c.3060
 L  E  L  E  L  R  P  T  G  E  I  E  Q  Y  S  V  S  A  T  Y         p.1020

          .         .         .         .         .         .       g.30081
 GAGCTCCAGAGAGAGGACAGAGCCTTGGTGGATACCCTGAAGTTTGTAACTCAAGCAGAA       c.3120
 E  L  Q  R  E  D  R  A  L  V  D  T  L  K  F  V  T  Q  A  E         p.1040

   | 21      .         .         .         .         .         .    g.32483
 G | GTGCGAAGCAGACTGAGGCTACCATGACATTCAAATATAATCGGCAGAGTATGACCTTG    c.3180
 G |   A  K  Q  T  E  A  T  M  T  F  K  Y  N  R  Q  S  M  T  L      p.1060

          .         .         .         .         .         .       g.32543
 TCCAGTGAAGTCCAAATTCCGGATTTTGATGTTGACCTCGGAACAATCCTCAGAGTTAAT       c.3240
 S  S  E  V  Q  I  P  D  F  D  V  D  L  G  T  I  L  R  V  N         p.1080

          .         .         .         .         .         .       g.32603
 GATGAATCTACTGAGGGCAAAACGTCTTACAGACTCACCCTGGACATTCAGAACAAGAAA       c.3300
 D  E  S  T  E  G  K  T  S  Y  R  L  T  L  D  I  Q  N  K  K         p.1100

          .         .         .   | 22     .         .         .    g.33556
 ATTACTGAGGTCGCCCTCATGGGCCACCTAAG | TTGTGACACAAAGGAAGAAAGAAAAATC    c.3360
 I  T  E  V  A  L  M  G  H  L  S  |  C  D  T  K  E  E  R  K  I      p.1120

          .         .         .         .         .         .       g.33616
 AAGGGTGTTATTTCCATACCCCGTTTGCAAGCAGAAGCCAGAAGTGAGATCCTCGCCCAC       c.3420
 K  G  V  I  S  I  P  R  L  Q  A  E  A  R  S  E  I  L  A  H         p.1140

          .         .         .         .         .         .       g.33676
 TGGTCGCCTGCCAAACTGCTTCTCCAAATGGACTCATCTGCTACAGCTTATGGCTCCACA       c.3480
 W  S  P  A  K  L  L  L  Q  M  D  S  S  A  T  A  Y  G  S  T         p.1160

          .         .         | 23         .         .         .    g.33845
 GTTTCCAAGAGGGTGGCATGGCATTATG | ATGAAGAGAAGATTGAATTTGAATGGAACACA    c.3540
 V  S  K  R  V  A  W  H  Y  D |   E  E  K  I  E  F  E  W  N  T      p.1180

          .         .         .         .         .         .       g.33905
 GGCACCAATGTAGATACCAAAAAAATGACTTCCAATTTCCCTGTGGATCTCTCCGATTAT       c.3600
 G  T  N  V  D  T  K  K  M  T  S  N  F  P  V  D  L  S  D  Y         p.1200

          .         .         .         .         .         .       g.33965
 CCTAAGAGCTTGCATATGTATGCTAATAGACTCCTGGATCACAGAGTCCCTCAAACAGAC       c.3660
 P  K  S  L  H  M  Y  A  N  R  L  L  D  H  R  V  P  Q  T  D         p.1220

          .         .         .       | 24 .         .         .    g.34504
 ATGACTTTCCGGCACGTGGGTTCCAAATTAATAGTT | GCAATGAGCTCATGGCTTCAGAAG    c.3720
 M  T  F  R  H  V  G  S  K  L  I  V   | A  M  S  S  W  L  Q  K      p.1240

          .         .         .         .         .         .       g.34564
 GCATCTGGGAGTCTTCCTTATACCCAGACTTTGCAAGACCACCTCAATAGCCTGAAGGAG       c.3780
 A  S  G  S  L  P  Y  T  Q  T  L  Q  D  H  L  N  S  L  K  E         p.1260

          .         .         .         .         .         .       g.34624
 TTCAACCTCCAGAACATGGGATTGCCAGACTTCCACATCCCAGAAAACCTCTTCTTAAAA       c.3840
 F  N  L  Q  N  M  G  L  P  D  F  H  I  P  E  N  L  F  L  K         p.1280

    | 25     .         .         .         .         .         .    g.35598
 AG | CGATGGCCGGGTCAAATATACCTTGAACAAGAACAGTTTGAAAATTGAGATTCCTTTG    c.3900
 S  |  D  G  R  V  K  Y  T  L  N  K  N  S  L  K  I  E  I  P  L      p.1300

          .         .         .         .         .         .       g.35658
 CCTTTTGGTGGCAAATCCTCCAGAGATCTAAAGATGTTAGAGACTGTTAGGACACCAGCC       c.3960
 P  F  G  G  K  S  S  R  D  L  K  M  L  E  T  V  R  T  P  A         p.1320

          .         .         .         .         .         .       g.35718
 CTCCACTTCAAGTCTGTGGGATTCCATCTGCCATCTCGAGAGTTCCAAGTCCCTACTTTT       c.4020
 L  H  F  K  S  V  G  F  H  L  P  S  R  E  F  Q  V  P  T  F         p.1340

          .         .         .         .         .         .       g.35778
 ACCATTCCCAAGTTGTATCAACTGCAAGTGCCTCTCCTGGGTGTTCTAGACCTCTCCACG       c.4080
 T  I  P  K  L  Y  Q  L  Q  V  P  L  L  G  V  L  D  L  S  T         p.1360

          .         .         .         .         .         .       g.35838
 AATGTCTACAGCAACTTGTACAACTGGTCCGCCTCCTACAGTGGTGGCAACACCAGCACA       c.4140
 N  V  Y  S  N  L  Y  N  W  S  A  S  Y  S  G  G  N  T  S  T         p.1380

          .         .         .         .         .         .       g.35898
 GACCATTTCAGCCTTCGGGCTCGTTACCACATGAAGGCTGACTCTGTGGTTGACCTGCTT       c.4200
 D  H  F  S  L  R  A  R  Y  H  M  K  A  D  S  V  V  D  L  L         p.1400

          .       | 26 .         .         .         .         .    g.36466
 TCCTACAATGTGCAAG | GATCTGGAGAAACAACATATGACCACAAGAATACGTTCACACTA    c.4260
 S  Y  N  V  Q  G |   S  G  E  T  T  Y  D  H  K  N  T  F  T  L      p.1420

          .         .         .         .         .         .       g.36526
 TCATGTGATGGGTCTCTACGCCACAAATTTCTAGATTCGAATATCAAATTCAGTCATGTA       c.4320
 S  C  D  G  S  L  R  H  K  F  L  D  S  N  I  K  F  S  H  V         p.1440

          .         .         .         .         .         .       g.36586
 GAAAAACTTGGAAACAACCCAGTCTCAAAAGGTTTACTAATATTCGATGCATCTAGTTCC       c.4380
 E  K  L  G  N  N  P  V  S  K  G  L  L  I  F  D  A  S  S  S         p.1460

          .         .         .         .         .         .       g.36646
 TGGGGACCACAGATGTCTGCTTCAGTTCATTTGGACTCCAAAAAGAAACAGCATTTGTTT       c.4440
 W  G  P  Q  M  S  A  S  V  H  L  D  S  K  K  K  Q  H  L  F         p.1480

          .         .         .         .         .         .       g.36706
 GTCAAAGAAGTCAAGATTGATGGGCAGTTCAGAGTCTCTTCGTTCTATGCTAAAGGCACA       c.4500
 V  K  E  V  K  I  D  G  Q  F  R  V  S  S  F  Y  A  K  G  T         p.1500

          .         .         .         .         .         .       g.36766
 TATGGCCTGTCTTGTCAGAGGGATCCTAACACTGGCCGGCTCAATGGAGAGTCCAACCTG       c.4560
 Y  G  L  S  C  Q  R  D  P  N  T  G  R  L  N  G  E  S  N  L         p.1520

          .         .         .         .         .         .       g.36826
 AGGTTTAACTCCTCCTACCTCCAAGGCACCAACCAGATAACAGGAAGATATGAAGATGGA       c.4620
 R  F  N  S  S  Y  L  Q  G  T  N  Q  I  T  G  R  Y  E  D  G         p.1540

          .         .         .         .         .         .       g.36886
 ACCCTCTCCCTCACCTCCACCTCTGATCTGCAAAGTGGCATCATTAAAAATACTGCTTCC       c.4680
 T  L  S  L  T  S  T  S  D  L  Q  S  G  I  I  K  N  T  A  S         p.1560

          .         .         .         .         .         .       g.36946
 CTAAAGTATGAGAACTACGAGCTGACTTTAAAATCTGACACCAATGGGAAGTATAAGAAC       c.4740
 L  K  Y  E  N  Y  E  L  T  L  K  S  D  T  N  G  K  Y  K  N         p.1580

          .         .         .         .         .         .       g.37006
 TTTGCCACTTCTAACAAGATGGATATGACCTTCTCTAAGCAAAATGCACTGCTGCGTTCT       c.4800
 F  A  T  S  N  K  M  D  M  T  F  S  K  Q  N  A  L  L  R  S         p.1600

          .         .         .         .         .         .       g.37066
 GAATATCAGGCTGATTACGAGTCATTGAGGTTCTTCAGCCTGCTTTCTGGATCACTAAAT       c.4860
 E  Y  Q  A  D  Y  E  S  L  R  F  F  S  L  L  S  G  S  L  N         p.1620

          .         .         .         .         .         .       g.37126
 TCCCATGGTCTTGAGTTAAATGCTGACATCTTAGGCACTGACAAAATTAATAGTGGTGCT       c.4920
 S  H  G  L  E  L  N  A  D  I  L  G  T  D  K  I  N  S  G  A         p.1640

          .         .         .         .         .         .       g.37186
 CACAAGGCGACACTAAGGATTGGCCAAGATGGAATATCTACCAGTGCAACGACCAACTTG       c.4980
 H  K  A  T  L  R  I  G  Q  D  G  I  S  T  S  A  T  T  N  L         p.1660

          .         .         .         .         .         .       g.37246
 AAGTGTAGTCTCCTGGTGCTGGAGAATGAGCTGAATGCAGAGCTTGGCCTCTCTGGGGCA       c.5040
 K  C  S  L  L  V  L  E  N  E  L  N  A  E  L  G  L  S  G  A         p.1680

          .         .         .         .         .         .       g.37306
 TCTATGAAATTAACAACAAATGGCCGCTTCAGGGAACACAATGCAAAATTCAGTCTGGAT       c.5100
 S  M  K  L  T  T  N  G  R  F  R  E  H  N  A  K  F  S  L  D         p.1700

          .         .         .         .         .         .       g.37366
 GGGAAAGCCGCCCTCACAGAGCTATCACTGGGAAGTGCTTATCAGGCCATGATTCTGGGT       c.5160
 G  K  A  A  L  T  E  L  S  L  G  S  A  Y  Q  A  M  I  L  G         p.1720

          .         .         .         .         .         .       g.37426
 GTCGACAGCAAAAACATTTTCAACTTCAAGGTCAGTCAAGAAGGACTTAAGCTCTCAAAT       c.5220
 V  D  S  K  N  I  F  N  F  K  V  S  Q  E  G  L  K  L  S  N         p.1740

          .         .         .         .         .         .       g.37486
 GACATGATGGGCTCATATGCTGAAATGAAATTTGACCACACAAACAGTCTGAACATTGCA       c.5280
 D  M  M  G  S  Y  A  E  M  K  F  D  H  T  N  S  L  N  I  A         p.1760

          .         .         .         .         .         .       g.37546
 GGCTTATCACTGGACTTCTCTTCAAAACTTGACAACATTTACAGCTCTGACAAGTTTTAT       c.5340
 G  L  S  L  D  F  S  S  K  L  D  N  I  Y  S  S  D  K  F  Y         p.1780

          .         .         .         .         .         .       g.37606
 AAGCAAACTGTTAATTTACAGCTACAGCCCTATTCTCTGGTAACTACTTTAAACAGTGAC       c.5400
 K  Q  T  V  N  L  Q  L  Q  P  Y  S  L  V  T  T  L  N  S  D         p.1800

          .         .         .         .         .         .       g.37666
 CTGAAATACAATGCTCTGGATCTCACCAACAATGGGAAACTACGGCTAGAACCCCTGAAG       c.5460
 L  K  Y  N  A  L  D  L  T  N  N  G  K  L  R  L  E  P  L  K         p.1820

          .         .         .         .         .         .       g.37726
 CTGCATGTGGCTGGTAACCTAAAAGGAGCCTACCAAAATAATGAAATAAAACACATCTAT       c.5520
 L  H  V  A  G  N  L  K  G  A  Y  Q  N  N  E  I  K  H  I  Y         p.1840

          .         .         .         .         .         .       g.37786
 GCCATCTCTTCTGCTGCCTTATCAGCAAGCTATAAAGCAGACACTGTTGCTAAGGTTCAG       c.5580
 A  I  S  S  A  A  L  S  A  S  Y  K  A  D  T  V  A  K  V  Q         p.1860

          .         .         .         .         .         .       g.37846
 GGTGTGGAGTTTAGCCATCGGCTCAACACAGACATCGCTGGGCTGGCTTCAGCCATTGAC       c.5640
 G  V  E  F  S  H  R  L  N  T  D  I  A  G  L  A  S  A  I  D         p.1880

          .         .         .         .         .         .       g.37906
 ATGAGCACAAACTATAATTCAGACTCACTGCATTTCAGCAATGTCTTCCGTTCTGTAATG       c.5700
 M  S  T  N  Y  N  S  D  S  L  H  F  S  N  V  F  R  S  V  M         p.1900

          .         .         .         .         .         .       g.37966
 GCCCCGTTTACCATGACCATCGATGCACATACAAATGGCAATGGGAAACTCGCTCTCTGG       c.5760
 A  P  F  T  M  T  I  D  A  H  T  N  G  N  G  K  L  A  L  W         p.1920

          .         .         .         .         .         .       g.38026
 GGAGAACATACTGGGCAGCTGTATAGCAAATTCCTGTTGAAAGCAGAACCTCTGGCATTT       c.5820
 G  E  H  T  G  Q  L  Y  S  K  F  L  L  K  A  E  P  L  A  F         p.1940

          .         .         .         .         .         .       g.38086
 ACTTTCTCTCATGATTACAAAGGCTCCACAAGTCATCATCTCGTGTCTAGGAAAAGCATC       c.5880
 T  F  S  H  D  Y  K  G  S  T  S  H  H  L  V  S  R  K  S  I         p.1960

          .         .         .         .         .         .       g.38146
 AGTGCAGCTCTTGAACACAAAGTCAGTGCCCTGCTTACTCCAGCTGAGCAGACAGGCACC       c.5940
 S  A  A  L  E  H  K  V  S  A  L  L  T  P  A  E  Q  T  G  T         p.1980

          .         .         .         .         .         .       g.38206
 TGGAAACTCAAGACCCAATTTAACAACAATGAATACAGCCAGGACTTGGATGCTTACAAC       c.6000
 W  K  L  K  T  Q  F  N  N  N  E  Y  S  Q  D  L  D  A  Y  N         p.2000

          .         .         .         .         .         .       g.38266
 ACTAAAGATAAAATTGGCGTGGAGCTTACTGGACGAACTCTGGCTGACCTAACTCTACTA       c.6060
 T  K  D  K  I  G  V  E  L  T  G  R  T  L  A  D  L  T  L  L         p.2020

          .         .         .         .         .         .       g.38326
 GACTCCCCAATTAAAGTGCCACTTTTACTCAGTGAGCCCATCAATATCATTGATGCTTTA       c.6120
 D  S  P  I  K  V  P  L  L  L  S  E  P  I  N  I  I  D  A  L         p.2040

          .         .         .         .         .         .       g.38386
 GAGATGAGAGATGCCGTTGAGAAGCCCCAAGAATTTACAATTGTTGCTTTTGTAAAGTAT       c.6180
 E  M  R  D  A  V  E  K  P  Q  E  F  T  I  V  A  F  V  K  Y         p.2060

          .         .         .         .         .         .       g.38446
 GATAAAAACCAAGATGTTCACTCCATTAACCTCCCATTTTTTGAGACCTTGCAAGAATAT       c.6240
 D  K  N  Q  D  V  H  S  I  N  L  P  F  F  E  T  L  Q  E  Y         p.2080

          .         .         .         .         .         .       g.38506
 TTTGAGAGGAATCGACAAACCATTATAGTTGTACTGGAAAACGTACAGAGAAACCTGAAG       c.6300
 F  E  R  N  R  Q  T  I  I  V  V  L  E  N  V  Q  R  N  L  K         p.2100

          .         .         .         .         .         .       g.38566
 CACATCAATATTGATCAATTTGTAAGAAAATACAGAGCAGCCCTGGGAAAACTCCCACAG       c.6360
 H  I  N  I  D  Q  F  V  R  K  Y  R  A  A  L  G  K  L  P  Q         p.2120

          .         .         .         .         .         .       g.38626
 CAAGCTAATGATTATCTGAATTCATTCAATTGGGAGAGACAAGTTTCACATGCCAAGGAG       c.6420
 Q  A  N  D  Y  L  N  S  F  N  W  E  R  Q  V  S  H  A  K  E         p.2140

          .         .         .         .         .         .       g.38686
 AAACTGACTGCTCTCACAAAAAAGTATAGAATTACAGAAAATGATATACAAATTGCATTA       c.6480
 K  L  T  A  L  T  K  K  Y  R  I  T  E  N  D  I  Q  I  A  L         p.2160

          .         .         .         .         .         .       g.38746
 GATGATGCCAAAATCAACTTTAATGAAAAACTATCTCAACTGCAGACATATATGATACAA       c.6540
 D  D  A  K  I  N  F  N  E  K  L  S  Q  L  Q  T  Y  M  I  Q         p.2180

          .         .         .         .         .         .       g.38806
 TTTGATCAGTATATTAAAGATAGTTATGATTTACATGATTTGAAAATAGCTATTGCTAAT       c.6600
 F  D  Q  Y  I  K  D  S  Y  D  L  H  D  L  K  I  A  I  A  N         p.2200

          .         .         .         .         .         .       g.38866
 ATTATTGATGAAATCATTGAAAAATTAAAAAGTCTTGATGAGCACTATCATATCCGTGTA       c.6660
 I  I  D  E  I  I  E  K  L  K  S  L  D  E  H  Y  H  I  R  V         p.2220

          .         .         .         .         .         .       g.38926
 AATTTAGTAAAAACAATCCATGATCTACATTTGTTTATTGAAAATATTGATTTTAACAAA       c.6720
 N  L  V  K  T  I  H  D  L  H  L  F  I  E  N  I  D  F  N  K         p.2240

          .         .         .         .         .         .       g.38986
 AGTGGAAGTAGTACTGCATCCTGGATTCAAAATGTGGATACTAAGTACCAAATCAGAATC       c.6780
 S  G  S  S  T  A  S  W  I  Q  N  V  D  T  K  Y  Q  I  R  I         p.2260

          .         .         .         .         .         .       g.39046
 CAGATACAAGAAAAACTGCAGCAGCTTAAGAGACACATACAGAATATAGACATCCAGCAC       c.6840
 Q  I  Q  E  K  L  Q  Q  L  K  R  H  I  Q  N  I  D  I  Q  H         p.2280

          .         .         .         .         .         .       g.39106
 CTAGCTGGAAAGTTAAAACAACACATTGAGGCTATTGATGTTAGAGTGCTTTTAGATCAA       c.6900
 L  A  G  K  L  K  Q  H  I  E  A  I  D  V  R  V  L  L  D  Q         p.2300

          .         .         .         .         .         .       g.39166
 TTGGGAACTACAATTTCATTTGAAAGAATAAATGACGTTCTTGAGCATGTCAAACACTTT       c.6960
 L  G  T  T  I  S  F  E  R  I  N  D  V  L  E  H  V  K  H  F         p.2320

          .         .         .         .         .         .       g.39226
 GTTATAAATCTTATTGGGGATTTTGAAGTAGCTGAGAAAATCAATGCCTTCAGAGCCAAA       c.7020
 V  I  N  L  I  G  D  F  E  V  A  E  K  I  N  A  F  R  A  K         p.2340

          .         .         .         .         .         .       g.39286
 GTCCATGAGTTAATCGAGAGGTATGAAGTAGACCAACAAATCCAGGTTTTAATGGATAAA       c.7080
 V  H  E  L  I  E  R  Y  E  V  D  Q  Q  I  Q  V  L  M  D  K         p.2360

          .         .         .         .         .         .       g.39346
 TTAGTAGAGTTGGCCCACCAATACAAGTTGAAGGAGACTATTCAGAAGCTAAGCAATGTC       c.7140
 L  V  E  L  A  H  Q  Y  K  L  K  E  T  I  Q  K  L  S  N  V         p.2380

          .         .         .         .         .         .       g.39406
 CTACAACAAGTTAAGATAAAAGATTACTTTGAGAAATTGGTTGGATTTATTGATGATGCT       c.7200
 L  Q  Q  V  K  I  K  D  Y  F  E  K  L  V  G  F  I  D  D  A         p.2400

          .         .         .         .         .         .       g.39466
 GTCAAGAAGCTTAATGAATTATCTTTTAAAACATTCATTGAAGATGTTAACAAATTCCTT       c.7260
 V  K  K  L  N  E  L  S  F  K  T  F  I  E  D  V  N  K  F  L         p.2420

          .         .         .         .         .         .       g.39526
 GACATGTTGATAAAGAAATTAAAGTCATTTGATTACCACCAGTTTGTAGATGAAACCAAT       c.7320
 D  M  L  I  K  K  L  K  S  F  D  Y  H  Q  F  V  D  E  T  N         p.2440

          .         .         .         .         .         .       g.39586
 GACAAAATCCGTGAGGTGACTCAGAGACTCAATGGTGAAATTCAGGCTCTGGAACTACCA       c.7380
 D  K  I  R  E  V  T  Q  R  L  N  G  E  I  Q  A  L  E  L  P         p.2460

          .         .         .         .         .         .       g.39646
 CAAAAAGCTGAAGCATTAAAACTGTTTTTAGAGGAAACCAAGGCCACAGTTGCAGTGTAT       c.7440
 Q  K  A  E  A  L  K  L  F  L  E  E  T  K  A  T  V  A  V  Y         p.2480

          .         .         .         .         .         .       g.39706
 CTGGAAAGCCTACAGGACACCAAAATAACCTTAATCATCAATTGGTTACAGGAGGCTTTA       c.7500
 L  E  S  L  Q  D  T  K  I  T  L  I  I  N  W  L  Q  E  A  L         p.2500

          .         .         .         .         .         .       g.39766
 AGTTCAGCATCTTTGGCTCACATGAAGGCCAAATTCCGAGAGACCCTAGAAGATACACGA       c.7560
 S  S  A  S  L  A  H  M  K  A  K  F  R  E  T  L  E  D  T  R         p.2520

          .         .         .         .         .         .       g.39826
 GACCGAATGTATCAAATGGACATTCAGCAGGAACTTCAACGATACCTGTCTCTGGTAGGC       c.7620
 D  R  M  Y  Q  M  D  I  Q  Q  E  L  Q  R  Y  L  S  L  V  G         p.2540

          .         .         .         .         .         .       g.39886
 CAGGTTTATAGCACACTTGTCACCTACATTTCTGATTGGTGGACTCTTGCTGCTAAGAAC       c.7680
 Q  V  Y  S  T  L  V  T  Y  I  S  D  W  W  T  L  A  A  K  N         p.2560

          .         .         .         .         .         .       g.39946
 CTTACTGACTTTGCAGAGCAATATTCTATCCAAGATTGGGCTAAACGTATGAAAGCATTG       c.7740
 L  T  D  F  A  E  Q  Y  S  I  Q  D  W  A  K  R  M  K  A  L         p.2580

          .         .         .         .         .         .       g.40006
 GTAGAGCAAGGGTTCACTGTTCCTGAAATCAAGACCATCCTTGGGACCATGCCTGCCTTT       c.7800
 V  E  Q  G  F  T  V  P  E  I  K  T  I  L  G  T  M  P  A  F         p.2600

          .         .         .         .         .         .       g.40066
 GAAGTCAGTCTTCAGGCTCTTCAGAAAGCTACCTTCCAGACACCTGATTTTATAGTCCCC       c.7860
 E  V  S  L  Q  A  L  Q  K  A  T  F  Q  T  P  D  F  I  V  P         p.2620

          .         .         .         .         .         .       g.40126
 CTAACAGATTTGAGGATTCCATCAGTTCAGATAAACTTCAAAGACTTAAAAAATATAAAA       c.7920
 L  T  D  L  R  I  P  S  V  Q  I  N  F  K  D  L  K  N  I  K         p.2640

          .         .         .         .         .         .       g.40186
 ATCCCATCCAGGTTTTCCACACCAGAATTTACCATCCTTAACACCTTCCACATTCCTTCC       c.7980
 I  P  S  R  F  S  T  P  E  F  T  I  L  N  T  F  H  I  P  S         p.2660

          .         .         .         .         .         .       g.40246
 TTTACAATTGACTTTGTAGAAATGAAAGTAAAGATCATCAGAACCATTGACCAGATGCTG       c.8040
 F  T  I  D  F  V  E  M  K  V  K  I  I  R  T  I  D  Q  M  L         p.2680

          .         .         .         .         .         .       g.40306
 AACAGTGAGCTGCAGTGGCCCGTTCCAGATATATATCTCAGGGATCTGAAGGTGGAGGAC       c.8100
 N  S  E  L  Q  W  P  V  P  D  I  Y  L  R  D  L  K  V  E  D         p.2700

          .         .         .         .         .         .       g.40366
 ATTCCTCTAGCGAGAATCACCCTGCCAGACTTCCGTTTACCAGAAATCGCAATTCCAGAA       c.8160
 I  P  L  A  R  I  T  L  P  D  F  R  L  P  E  I  A  I  P  E         p.2720

          .         .         .         .         .         .       g.40426
 TTCATAATCCCAACTCTCAACCTTAATGATTTTCAAGTTCCTGACCTTCACATACCAGAA       c.8220
 F  I  I  P  T  L  N  L  N  D  F  Q  V  P  D  L  H  I  P  E         p.2740

          .         .         .         .         .         .       g.40486
 TTCCAGCTTCCCCACATCTCACACACAATTGAAGTACCTACTTTTGGCAAGCTATACAGT       c.8280
 F  Q  L  P  H  I  S  H  T  I  E  V  P  T  F  G  K  L  Y  S         p.2760

          .         .         .         .         .         .       g.40546
 ATTCTGAAAATCCAATCTCCTCTTTTCACATTAGATGCAAATGCTGACATAGGGAATGGA       c.8340
 I  L  K  I  Q  S  P  L  F  T  L  D  A  N  A  D  I  G  N  G         p.2780

          .         .         .         .         .         .       g.40606
 ACCACCTCAGCAAACGAAGCAGGTATCGCAGCTTCCATCACTGCCAAAGGAGAGTCCAAA       c.8400
 T  T  S  A  N  E  A  G  I  A  A  S  I  T  A  K  G  E  S  K         p.2800

          .         .         .         .         .         .       g.40666
 TTAGAAGTTCTCAATTTTGATTTTCAAGCAAATGCACAACTCTCAAACCCTAAGATTAAT       c.8460
 L  E  V  L  N  F  D  F  Q  A  N  A  Q  L  S  N  P  K  I  N         p.2820

          .         .         .         .         .         .       g.40726
 CCGCTGGCTCTGAAGGAGTCAGTGAAGTTCTCCAGCAAGTACCTGAGAACGGAGCATGGG       c.8520
 P  L  A  L  K  E  S  V  K  F  S  S  K  Y  L  R  T  E  H  G         p.2840

          .         .         .         .         .         .       g.40786
 AGTGAAATGCTGTTTTTTGGAAATGCTATTGAGGGAAAATCAAACACAGTGGCAAGTTTA       c.8580
 S  E  M  L  F  F  G  N  A  I  E  G  K  S  N  T  V  A  S  L         p.2860

          .         .         .         .         .         .       g.40846
 CACACAGAAAAAAATACACTGGAGCTTAGTAATGGAGTGATTGTCAAGATAAACAATCAG       c.8640
 H  T  E  K  N  T  L  E  L  S  N  G  V  I  V  K  I  N  N  Q         p.2880

          .         .         .         .         .         .       g.40906
 CTTACCCTGGATAGCAACACTAAATACTTCCACAAATTGAACATCCCCAAACTGGACTTC       c.8700
 L  T  L  D  S  N  T  K  Y  F  H  K  L  N  I  P  K  L  D  F         p.2900

          .         .         .         .         .         .       g.40966
 TCTAGTCAGGCTGACCTGCGCAACGAGATCAAGACACTGTTGAAAGCTGGCCACATAGCA       c.8760
 S  S  Q  A  D  L  R  N  E  I  K  T  L  L  K  A  G  H  I  A         p.2920

          .         .         .         .         .         .       g.41026
 TGGACTTCTTCTGGAAAAGGGTCATGGAAATGGGCCTGCCCCAGATTCTCAGATGAGGGA       c.8820
 W  T  S  S  G  K  G  S  W  K  W  A  C  P  R  F  S  D  E  G         p.2940

          .         .         .         .         .         .       g.41086
 ACACATGAATCACAAATTAGTTTCACCATAGAAGGACCCCTCACTTCCTTTGGACTGTCC       c.8880
 T  H  E  S  Q  I  S  F  T  I  E  G  P  L  T  S  F  G  L  S         p.2960

          .         .         .         .         .         .       g.41146
 AATAAGATCAATAGCAAACACCTAAGAGTAAACCAAAACTTGGTTTATGAATCTGGCTCC       c.8940
 N  K  I  N  S  K  H  L  R  V  N  Q  N  L  V  Y  E  S  G  S         p.2980

          .         .         .         .         .         .       g.41206
 CTCAACTTTTCTAAACTTGAAATTCAATCACAAGTCGATTCCCAGCATGTGGGCCACAGT       c.9000
 L  N  F  S  K  L  E  I  Q  S  Q  V  D  S  Q  H  V  G  H  S         p.3000

          .         .         .         .         .         .       g.41266
 GTTCTAACTGCTAAAGGCATGGCACTGTTTGGAGAAGGGAAGGCAGAGTTTACTGGGAGG       c.9060
 V  L  T  A  K  G  M  A  L  F  G  E  G  K  A  E  F  T  G  R         p.3020

          .         .         .         .         .         .       g.41326
 CATGATGCTCATTTAAATGGAAAGGTTATTGGAACTTTGAAAAATTCTCTTTTCTTTTCA       c.9120
 H  D  A  H  L  N  G  K  V  I  G  T  L  K  N  S  L  F  F  S         p.3040

          .         .         .         .         .         .       g.41386
 GCCCAGCCATTTGAGATCACGGCATCCACAAACAATGAAGGGAATTTGAAAGTTCGTTTT       c.9180
 A  Q  P  F  E  I  T  A  S  T  N  N  E  G  N  L  K  V  R  F         p.3060

          .         .         .         .         .         .       g.41446
 CCATTAAGGTTAACAGGGAAGATAGACTTCCTGAATAACTATGCACTGTTTCTGAGTCCC       c.9240
 P  L  R  L  T  G  K  I  D  F  L  N  N  Y  A  L  F  L  S  P         p.3080

          .         .         .         .         .         .       g.41506
 AGTGCCCAGCAAGCAAGTTGGCAAGTAAGTGCTAGGTTCAATCAGTATAAGTACAACCAA       c.9300
 S  A  Q  Q  A  S  W  Q  V  S  A  R  F  N  Q  Y  K  Y  N  Q         p.3100

          .         .         .         .         .         .       g.41566
 AATTTCTCTGCTGGAAACAACGAGAACATTATGGAGGCCCATGTAGGAATAAATGGAGAA       c.9360
 N  F  S  A  G  N  N  E  N  I  M  E  A  H  V  G  I  N  G  E         p.3120

          .         .         .         .         .         .       g.41626
 GCAAATCTGGATTTCTTAAACATTCCTTTAACAATTCCTGAAATGCGTCTACCTTACACA       c.9420
 A  N  L  D  F  L  N  I  P  L  T  I  P  E  M  R  L  P  Y  T         p.3140

          .         .         .         .         .         .       g.41686
 ATAATCACAACTCCTCCACTGAAAGATTTCTCTCTATGGGAAAAAACAGGCTTGAAGGAA       c.9480
 I  I  T  T  P  P  L  K  D  F  S  L  W  E  K  T  G  L  K  E         p.3160

          .         .         .         .         .         .       g.41746
 TTCTTGAAAACGACAAAGCAATCATTTGATTTAAGTGTAAAAGCTCAGTATAAGAAAAAC       c.9540
 F  L  K  T  T  K  Q  S  F  D  L  S  V  K  A  Q  Y  K  K  N         p.3180

          .         .         .         .         .         .       g.41806
 AAACACAGGCATTCCATCACAAATCCTTTGGCTGTGCTTTGTGAGTTTATCAGTCAGAGC       c.9600
 K  H  R  H  S  I  T  N  P  L  A  V  L  C  E  F  I  S  Q  S         p.3200

          .         .         .         .         .         .       g.41866
 ATCAAATCCTTTGACAGGCATTTTGAAAAAAACAGAAACAATGCATTAGATTTTGTCACC       c.9660
 I  K  S  F  D  R  H  F  E  K  N  R  N  N  A  L  D  F  V  T         p.3220

          .         .         .         .         .         .       g.41926
 AAATCCTATAATGAAACAAAAATTAAGTTTGATAAGTACAAAGCTGAAAAATCTCACGAC       c.9720
 K  S  Y  N  E  T  K  I  K  F  D  K  Y  K  A  E  K  S  H  D         p.3240

          .         .         .         .         .         .       g.41986
 GAGCTCCCCAGGACCTTTCAAATTCCTGGATACACTGTTCCAGTTGTCAATGTTGAAGTG       c.9780
 E  L  P  R  T  F  Q  I  P  G  Y  T  V  P  V  V  N  V  E  V         p.3260

          .         .         .         .         .         .       g.42046
 TCTCCATTCACCATAGAGATGTCGGCATTCGGCTATGTGTTCCCAAAAGCAGTCAGCATG       c.9840
 S  P  F  T  I  E  M  S  A  F  G  Y  V  F  P  K  A  V  S  M         p.3280

          .         .         .         .         .         .       g.42106
 CCTAGTTTCTCCATCCTAGGTTCTGACGTCCGTGTGCCTTCATACACATTAATCCTGCCA       c.9900
 P  S  F  S  I  L  G  S  D  V  R  V  P  S  Y  T  L  I  L  P         p.3300

          .         .         .         .         .         .       g.42166
 TCATTAGAGCTGCCAGTCCTTCATGTCCCTAGAAATCTCAAGCTTTCTCTTCCAGATTTC       c.9960
 S  L  E  L  P  V  L  H  V  P  R  N  L  K  L  S  L  P  D  F         p.3320

          .         .         .         .         .         .       g.42226
 AAGGAATTGTGTACCATAAGCCATATTTTTATTCCTGCCATGGGCAATATTACCTATGAT       c.10020
 K  E  L  C  T  I  S  H  I  F  I  P  A  M  G  N  I  T  Y  D         p.3340

          .         .         .         .         .         .       g.42286
 TTCTCCTTTAAATCAAGTGTCATCACACTGAATACCAATGCTGAACTTTTTAACCAGTCA       c.10080
 F  S  F  K  S  S  V  I  T  L  N  T  N  A  E  L  F  N  Q  S         p.3360

          .         .         .         .         .         .       g.42346
 GATATTGTTGCTCATCTCCTTTCTTCATCTTCATCTGTCATTGATGCACTGCAGTACAAA       c.10140
 D  I  V  A  H  L  L  S  S  S  S  S  V  I  D  A  L  Q  Y  K         p.3380

          .         .         .         .         .         .       g.42406
 TTAGAGGGCACCACAAGATTGACAAGAAAAAGGGGATTGAAGTTAGCCACAGCTCTGTCT       c.10200
 L  E  G  T  T  R  L  T  R  K  R  G  L  K  L  A  T  A  L  S         p.3400

          .         .         .         .         .         .       g.42466
 CTGAGCAACAAATTTGTGGAGGGTAGTCATAACAGTACTGTGAGCTTAACCACGAAAAAT       c.10260
 L  S  N  K  F  V  E  G  S  H  N  S  T  V  S  L  T  T  K  N         p.3420

          .         .         .         .         .         .       g.42526
 ATGGAAGTGTCAGTGGCAACAACCACAAAAGCCCAAATTCCAATTTTGAGAATGAATTTC       c.10320
 M  E  V  S  V  A  T  T  T  K  A  Q  I  P  I  L  R  M  N  F         p.3440

          .         .         .         .         .         .       g.42586
 AAGCAAGAACTTAATGGAAATACCAAGTCAAAACCTACTGTCTCTTCCTCCATGGAATTT       c.10380
 K  Q  E  L  N  G  N  T  K  S  K  P  T  V  S  S  S  M  E  F         p.3460

          .         .         .         .         .         .       g.42646
 AAGTATGATTTCAATTCTTCAATGCTGTACTCTACCGCTAAAGGAGCAGTTGACCACAAG       c.10440
 K  Y  D  F  N  S  S  M  L  Y  S  T  A  K  G  A  V  D  H  K         p.3480

          .         .         .         .         .         .       g.42706
 CTTAGCTTGGAAAGCCTCACCTCTTACTTTTCCATTGAGTCATCTACCAAAGGAGATGTC       c.10500
 L  S  L  E  S  L  T  S  Y  F  S  I  E  S  S  T  K  G  D  V         p.3500

          .         .         .         .         .         .       g.42766
 AAGGGTTCGGTTCTTTCTCGGGAATATTCAGGAACTATTGCTAGTGAGGCCAACACTTAC       c.10560
 K  G  S  V  L  S  R  E  Y  S  G  T  I  A  S  E  A  N  T  Y         p.3520

          .         .         .         .         .         .       g.42826
 TTGAATTCCAAGAGCACACGGTCTTCAGTGAAGCTGCAGGGCACTTCCAAAATTGATGAT       c.10620
 L  N  S  K  S  T  R  S  S  V  K  L  Q  G  T  S  K  I  D  D         p.3540

          .         .         .         .         .         .       g.42886
 ATCTGGAACCTTGAAGTAAAAGAAAATTTTGCTGGAGAAGCCACACTCCAACGCATATAT       c.10680
 I  W  N  L  E  V  K  E  N  F  A  G  E  A  T  L  Q  R  I  Y         p.3560

          .         .         .         .         .         .       g.42946
 TCCCTCTGGGAGCACAGTACGAAAAACCACTTACAGCTAGAGGGCCTCTTTTTCACCAAC       c.10740
 S  L  W  E  H  S  T  K  N  H  L  Q  L  E  G  L  F  F  T  N         p.3580

          .         .         .         .         .         .       g.43006
 GGAGAACATACAAGCAAAGCCACCCTGGAACTCTCTCCATGGCAAATGTCAGCTCTTGTT       c.10800
 G  E  H  T  S  K  A  T  L  E  L  S  P  W  Q  M  S  A  L  V         p.3600

          .         .         .         .         .         .       g.43066
 CAGGTCCATGCAAGTCAGCCCAGTTCCTTCCATGATTTCCCTGACCTTGGCCAGGAAGTG       c.10860
 Q  V  H  A  S  Q  P  S  S  F  H  D  F  P  D  L  G  Q  E  V         p.3620

          .         .         .         .         .         .       g.43126
 GCCCTGAATGCTAACACTAAGAACCAGAAGATCAGATGGAAAAATGAAGTCCGGATTCAT       c.10920
 A  L  N  A  N  T  K  N  Q  K  I  R  W  K  N  E  V  R  I  H         p.3640

          .         .         .         .         .         .       g.43186
 TCTGGGTCTTTCCAGAGCCAGGTCGAGCTTTCCAATGACCAAGAAAAGGCACACCTTGAC       c.10980
 S  G  S  F  Q  S  Q  V  E  L  S  N  D  Q  E  K  A  H  L  D         p.3660

          .         .         .         .         .         .       g.43246
 ATTGCAGGATCCTTAGAAGGACACCTAAGGTTCCTCAAAAATATCATCCTACCAGTCTAT       c.11040
 I  A  G  S  L  E  G  H  L  R  F  L  K  N  I  I  L  P  V  Y         p.3680

          .         .         .         .         .         .       g.43306
 GACAAGAGCTTATGGGATTTCCTAAAGCTGGATGTAACCACCAGCATTGGTAGGAGACAG       c.11100
 D  K  S  L  W  D  F  L  K  L  D  V  T  T  S  I  G  R  R  Q         p.3700

          .         .         .         .         .         .       g.43366
 CATCTTCGTGTTTCAACTGCCTTTGTGTACACCAAAAACCCCAATGGCTATTCATTCTCC       c.11160
 H  L  R  V  S  T  A  F  V  Y  T  K  N  P  N  G  Y  S  F  S         p.3720

          .         .         .         .         .         .       g.43426
 ATCCCTGTAAAAGTTTTGGCTGATAAATTCATTATTCCTGGGCTGAAACTAAATGATCTA       c.11220
 I  P  V  K  V  L  A  D  K  F  I  I  P  G  L  K  L  N  D  L         p.3740

          .         .         .         .         .         .       g.43486
 AATTCAGTTCTTGTCATGCCTACGTTCCATGTCCCATTTACAGATCTTCAGGTTCCATCG       c.11280
 N  S  V  L  V  M  P  T  F  H  V  P  F  T  D  L  Q  V  P  S         p.3760

          .         .         .         .         .         .       g.43546
 TGCAAACTTGACTTCAGAGAAATACAAATCTATAAGAAGCTGAGAACTTCATCATTTGCC       c.11340
 C  K  L  D  F  R  E  I  Q  I  Y  K  K  L  R  T  S  S  F  A         p.3780

          .         .         .         .         .         .       g.43606
 CTCAACCTACCAACACTCCCCGAGGTAAAATTCCCTGAAGTTGATGTGTTAACAAAATAT       c.11400
 L  N  L  P  T  L  P  E  V  K  F  P  E  V  D  V  L  T  K  Y         p.3800

          .         .         .         .         .         .       g.43666
 TCTCAACCAGAAGACTCCTTGATTCCCTTTTTTGAGATAACCGTGCCTGAATCTCAGTTA       c.11460
 S  Q  P  E  D  S  L  I  P  F  F  E  I  T  V  P  E  S  Q  L         p.3820

          .         .         .         .         .         .       g.43726
 ACTGTGTCCCAGTTCACGCTTCCAAAAAGTGTTTCAGATGGCATTGCTGCTTTGGATCTA       c.11520
 T  V  S  Q  F  T  L  P  K  S  V  S  D  G  I  A  A  L  D  L         p.3840

          .         .         .         .         .         .       g.43786
 AATGCAGTAGCCAACAAGATCGCAGACTTTGAGTTGCCCACCATCATCGTGCCTGAGCAG       c.11580
 N  A  V  A  N  K  I  A  D  F  E  L  P  T  I  I  V  P  E  Q         p.3860

          .         .         .         .         .         .       g.43846
 ACCATTGAGATTCCCTCCATTAAGTTCTCTGTACCTGCTGGAATTGTCATTCCTTCCTTT       c.11640
 T  I  E  I  P  S  I  K  F  S  V  P  A  G  I  V  I  P  S  F         p.3880

          .         .         .         .         .         .       g.43906
 CAAGCACTGACTGCACGCTTTGAGGTAGACTCTCCCGTGTATAATGCCACTTGGAGTGCC       c.11700
 Q  A  L  T  A  R  F  E  V  D  S  P  V  Y  N  A  T  W  S  A         p.3900

          .         .         .         .         .         .       g.43966
 AGTTTGAAAAACAAAGCAGATTATGTTGAAACAGTCCTGGATTCCACATGCAGCTCAACC       c.11760
 S  L  K  N  K  A  D  Y  V  E  T  V  L  D  S  T  C  S  S  T         p.3920

          .         .         | 27         .         .         .    g.44430
 GTACAGTTCCTAGAATATGAACTAAATG | TTTTGGGAACACACAAAATCGAAGATGGTACG    c.11820
 V  Q  F  L  E  Y  E  L  N  V |   L  G  T  H  K  I  E  D  G  T      p.3940

          .         .         .         .         .         .       g.44490
 TTAGCCTCTAAGACTAAAGGAACATTTGCACACCGTGACTTCAGTGCAGAATATGAAGAA       c.11880
 L  A  S  K  T  K  G  T  F  A  H  R  D  F  S  A  E  Y  E  E         p.3960

          .         .    | 28    .         .         .         .    g.44658
 GATGGCAAATATGAAGGACTTCA | GGAATGGGAAGGAAAAGCGCACCTCAATATCAAAAGC    c.11940
 D  G  K  Y  E  G  L  Q  |  E  W  E  G  K  A  H  L  N  I  K  S      p.3980

          .         .         .         .         .         .       g.44718
 CCAGCGTTCACCGATCTCCATCTGCGCTACCAGAAAGACAAGAAAGGCATCTCCACCTCA       c.12000
 P  A  F  T  D  L  H  L  R  Y  Q  K  D  K  K  G  I  S  T  S         p.4000

          .         .         .         .         .         .       g.44778
 GCAGCCTCCCCAGCCGTAGGCACCGTGGGCATGGATATGGATGAAGATGACGACTTTTCT       c.12060
 A  A  S  P  A  V  G  T  V  G  M  D  M  D  E  D  D  D  F  S         p.4020

          .         .        | 29.         .         .         .    g.45772
 AAATGGAACTTCTACTACAGCCCTCAG | TCCTCTCCAGATAAAAAACTCACCATATTCAAA    c.12120
 K  W  N  F  Y  Y  S  P  Q   | S  S  P  D  K  K  L  T  I  F  K      p.4040

          .         .         .         .         .         .       g.45832
 ACTGAGTTGAGGGTCCGGGAATCTGATGAGGAAACTCAGATCAAAGTTAATTGGGAAGAA       c.12180
 T  E  L  R  V  R  E  S  D  E  E  T  Q  I  K  V  N  W  E  E         p.4060

          .         .         .         .         .         .       g.45892
 GAGGCAGCTTCTGGCTTGCTAACCTCTCTGAAAGACAACGTGCCCAAGGCCACAGGGGTC       c.12240
 E  A  A  S  G  L  L  T  S  L  K  D  N  V  P  K  A  T  G  V         p.4080

          .         .         .         .         .         .       g.45952
 CTTTATGATTATGTCAACAAGTACCACTGGGAACACACAGGGCTCACCCTGAGAGAAGTG       c.12300
 L  Y  D  Y  V  N  K  Y  H  W  E  H  T  G  L  T  L  R  E  V         p.4100

          .         .         .         .         .         .       g.46012
 TCTTCAAAGCTGAGAAGAAATCTGCAGAACAATGCTGAGTGGGTTTATCAAGGGGCCATT       c.12360
 S  S  K  L  R  R  N  L  Q  N  N  A  E  W  V  Y  Q  G  A  I         p.4120

          .         .         .         .         .         .       g.46072
 AGGCAAATTGATGATATCGACGTGAGGTTCCAGAAAGCAGCCAGTGGCACCACTGGGACC       c.12420
 R  Q  I  D  D  I  D  V  R  F  Q  K  A  A  S  G  T  T  G  T         p.4140

          .         .         .         .         .         .       g.46132
 TACCAAGAGTGGAAGGACAAGGCCCAGAATCTGTACCAGGAACTGTTGACTCAGGAAGGC       c.12480
 Y  Q  E  W  K  D  K  A  Q  N  L  Y  Q  E  L  L  T  Q  E  G         p.4160

          .         .         .         .         .         .       g.46192
 CAAGCCAGTTTCCAGGGACTCAAGGATAACGTGTTTGATGGCTTGGTACGAGTTACTCAA       c.12540
 Q  A  S  F  Q  G  L  K  D  N  V  F  D  G  L  V  R  V  T  Q         p.4180

          .         .         .         .         .         .       g.46252
 GAATTCCATATGAAAGTCAAGCATCTGATTGACTCACTCATTGATTTTCTGAACTTCCCC       c.12600
 E  F  H  M  K  V  K  H  L  I  D  S  L  I  D  F  L  N  F  P         p.4200

          .         .         .         .         .         .       g.46312
 AGATTCCAGTTTCCGGGGAAACCTGGGATATACACTAGGGAGGAACTTTGCACTATGTTC       c.12660
 R  F  Q  F  P  G  K  P  G  I  Y  T  R  E  E  L  C  T  M  F         p.4220

          .         .         .         .         .         .       g.46372
 ATAAGGGAGGTAGGGACGGTACTGTCCCAGGTATATTCGAAAGTCCATAATGGTTCAGAA       c.12720
 I  R  E  V  G  T  V  L  S  Q  V  Y  S  K  V  H  N  G  S  E         p.4240

          .         .         .         .         .         .       g.46432
 ATACTGTTTTCCTATTTCCAAGACCTAGTGATTACACTTCCTTTCGAGTTAAGGAAACAT       c.12780
 I  L  F  S  Y  F  Q  D  L  V  I  T  L  P  F  E  L  R  K  H         p.4260

          .         .         .         .         .         .       g.46492
 AAACTAATAGATGTAATCTCGATGTATAGGGAACTGTTGAAAGATTTATCAAAAGAAGCC       c.12840
 K  L  I  D  V  I  S  M  Y  R  E  L  L  K  D  L  S  K  E  A         p.4280

          .         .         .         .         .         .       g.46552
 CAAGAGGTATTTAAAGCCATTCAGTCTCTCAAGACCACAGAGGTGCTACGTAATCTTCAG       c.12900
 Q  E  V  F  K  A  I  Q  S  L  K  T  T  E  V  L  R  N  L  Q         p.4300

          .         .         .         .         .         .       g.46612
 GACCTTTTACAATTCATTTTCCAACTAATAGAAGATAACATTAAACAGCTGAAAGAGATG       c.12960
 D  L  L  Q  F  I  F  Q  L  I  E  D  N  I  K  Q  L  K  E  M         p.4320

          .         .         .         .         .         .       g.46672
 AAATTTACTTATCTTATTAATTATATCCAAGATGAGATCAACACAATCTTCAGTGATTAT       c.13020
 K  F  T  Y  L  I  N  Y  I  Q  D  E  I  N  T  I  F  S  D  Y         p.4340

          .         .         .         .         .         .       g.46732
 ATCCCATATGTTTTTAAATTGTTGAAAGAAAACCTATGCCTTAATCTTCATAAGTTCAAT       c.13080
 I  P  Y  V  F  K  L  L  K  E  N  L  C  L  N  L  H  K  F  N         p.4360

          .         .         .         .         .         .       g.46792
 GAATTTATTCAAAACGAGCTTCAGGAAGCTTCTCAAGAGTTACAGCAGATCCATCAATAC       c.13140
 E  F  I  Q  N  E  L  Q  E  A  S  Q  E  L  Q  Q  I  H  Q  Y         p.4380

          .         .         .         .         .         .       g.46852
 ATTATGGCCCTTCGTGAAGAATATTTTGATCCAAGTATAGTTGGCTGGACAGTGAAATAT       c.13200
 I  M  A  L  R  E  E  Y  F  D  P  S  I  V  G  W  T  V  K  Y         p.4400

          .         .         .         .         .         .       g.46912
 TATGAACTTGAAGAAAAGATAGTCAGTCTGATCAAGAACCTGTTAGTTGCTCTTAAGGAC       c.13260
 Y  E  L  E  E  K  I  V  S  L  I  K  N  L  L  V  A  L  K  D         p.4420

          .         .         .         .         .         .       g.46972
 TTCCATTCTGAATATATTGTCAGTGCCTCTAACTTTACTTCCCAACTCTCAAGTCAAGTT       c.13320
 F  H  S  E  Y  I  V  S  A  S  N  F  T  S  Q  L  S  S  Q  V         p.4440

          .         .         .         .         .         .       g.47032
 GAGCAATTTCTGCACAGAAATATTCAGGAATATCTTAGCATCCTTACCGATCCAGATGGA       c.13380
 E  Q  F  L  H  R  N  I  Q  E  Y  L  S  I  L  T  D  P  D  G         p.4460

          .         .         .         .         .         .       g.47092
 AAAGGGAAAGAGAAGATTGCAGAGCTTTCTGCCACTGCTCAGGAAATAATTAAAAGCCAG       c.13440
 K  G  K  E  K  I  A  E  L  S  A  T  A  Q  E  I  I  K  S  Q         p.4480

          .         .         .         .         .         .       g.47152
 GCCATTGCGACGAAGAAAATAATTTCTGATTACCACCAGCAGTTTAGATATAAACTGCAA       c.13500
 A  I  A  T  K  K  I  I  S  D  Y  H  Q  Q  F  R  Y  K  L  Q         p.4500

          .         .         .         .         .         .       g.47212
 GATTTTTCAGACCAACTCTCTGATTACTATGAAAAATTTATTGCTGAATCCAAAAGATTG       c.13560
 D  F  S  D  Q  L  S  D  Y  Y  E  K  F  I  A  E  S  K  R  L         p.4520

          .         .         .         .         .         .       g.47272
 ATTGACCTGTCCATTCAAAACTACCACACATTTCTGATATACATCACGGAGTTACTGAAA       c.13620
 I  D  L  S  I  Q  N  Y  H  T  F  L  I  Y  I  T  E  L  L  K         p.4540

          .         .         .         .         .         .       g.47332
 AAGCTGCAATCAACCACAGTCATGAACCCCTACATGAAGCTTGCTCCAGGAGAACTTACT       c.13680
 K  L  Q  S  T  T  V  M  N  P  Y  M  K  L  A  P  G  E  L  T         p.4560

          .                                                         g.47344
 ATCATCCTCTAA                                                       c.13692
 I  I  L  X                                                         p.4563

          .         .         .         .         .         .       g.47404
 ttttttaaaagaaatcttcatttattcttcttttccaattgaactttcacatagcacaga       c.*60

          .         .         .         .         .         .       g.47464
 aaaaattcaaactgcctatattgataaaaccatacagtgagccagccttgcagtaggcag       c.*120

          .         .         .         .         .         .       g.47524
 tagactataagcagaagcacatatgaactggacctgcaccaaagctggcaccagggctcg       c.*180

          .         .         .         .         .         .       g.47584
 gaaggtctctgaactcagaaggatggcattttttgcaagttaaagaaaatcaggatctga       c.*240

          .         .         .         .         .         .       g.47644
 gttattttgctaaacttgggggaggaggaacaaataaatggagtctttattgtgtatcat       c.*300

                                                                    g.47645
 a                                                                  c.*301

 (downstream sequence)
Legend:
Nucleotide numbering (following the rules of the HGVS for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The Apolipoprotein B (including Ag(x) antigen) protein sequence is shown below the coding DNA sequence, with numbering indicated at the right starting with 1 for the translation initiating Methionine. Every 10th amino acid is shown in bold. The position of introns is indicated by a vertical line, splitting the two exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frame shift variants, all stop codons in the +1 frame are shown in bold while all stop codons in the +2 frame are underlined.

Powered by LOVD v.3.0 Build 12
©2004-2015 Leiden University Medical Center