zinc finger homeobox 4 (ZFHX4) - coding DNA reference sequence

(used for variant description)

(last modified May 10, 2021)


This file was created to facilitate the description of sequence variants in transcript NM_024721.4 of the ZFHX4 gene, using a coding DNA reference sequence that follows the HGVS recommendations.

The sequence was taken from NC_000008.10, covering ZFHX4 transcript NM_024721.4.
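
As a rough illustration of how the c. and p. numbering in the listing below relate, here is a minimal Python sketch. It assumes the standard genetic code and HGVS-style coding numbering, in which c.1 is the A of the ATG start codon. The helper names (translate, codon_number) are hypothetical and are not part of any tool used to build this file. The test sequence is the first coding line of the listing (c.1 to c.60).

```python
# Standard genetic code, written in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence that starts at c.1; a trailing partial codon is ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def codon_number(c_pos):
    """Return the p. (codon) number containing coding position c_pos (c_pos >= 1)."""
    if c_pos < 1:
        raise ValueError("only positions inside the coding sequence (c.1 onward) map to a codon")
    return (c_pos + 2) // 3


# First coding line of the listing, c.1 to c.60 (hypothetical variable name).
FIRST_CDS_LINE = "ATGGAAACCTGTGACTCCCCTCCTATCTCAAGGCAGGAAAATGGGCAGAGCACATCAAAG"

protein = translate(FIRST_CDS_LINE)
last_codon = codon_number(len(FIRST_CDS_LINE))
```

With this input, protein reproduces the first amino acid line of the listing and last_codon matches its p.20 label. Positions upstream of the start codon (c.-1, c.-2, ...) have no codon number, which is why codon_number rejects them.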


 (upstream sequence)
                                         .         .                g.5027
                                  atttttttaaacagggctaaaacgata       c.-361

 .         .         .         .         .         .                g.5087
 ataattagcagaataaagacatatcggattttcatttcctttcctccttttcccaacccc       c.-301

 .         .         .         .         .         .                g.5147
 ttcacaaccaaacagcgagaccgcggtcggcacatgctttaactcctcccggacccccga       c.-241

 .         .         .         .         .         .                g.5207
 ggaccgctccatgccccccactttctgctccagcgtttttattttcacccaataaagttc       c.-181

 .         .         .         .         .         .                g.5267
 gaggattattttttattttttttgtttttttaatgaaccctctcgttttacttggatgtg       c.-121

 .         .         .         .         .         .                g.5327
 atcagctgtaagtaaaataaaagcaaaacaaaaaagaggcgaagatcgagtaggaactgc       c.-61

 .         .    | 02    .         .         .         .             g.27809
 aggggaaatggaaa | gtccctgacaggctggatgaaatgagatccccatgtagcaattgcc    c.-1

          .         .         .         .         .         .       g.27869
 ATGGAAACCTGTGACTCCCCTCCTATCTCAAGGCAGGAAAATGGGCAGAGCACATCAAAG       c.60
 M  E  T  C  D  S  P  P  I  S  R  Q  E  N  G  Q  S  T  S  K         p.20

          .         .         .         .         .         .       g.27929
 CTATGTGGAACGACACAACTTGATAATGAGGTGCCAGAGAAAGTTGCAGGGATGGAGCCT       c.120
 L  C  G  T  T  Q  L  D  N  E  V  P  E  K  V  A  G  M  E  P         p.40

          .         .         .         .         .         .       g.27989
 GACAGGGAAAACAGCTCCACAGATGACAACCTGAAAACGGATGAGCGCAAAAGTGAAGCC       c.180
 D  R  E  N  S  S  T  D  D  N  L  K  T  D  E  R  K  S  E  A         p.60

          .         .         .         .         .         .       g.28049
 TTGCTGGGTTTCAGCGTTGAGAATGCAGCTGCCACTCAGGTTACCTCAGCAAAGGAGATA       c.240
 L  L  G  F  S  V  E  N  A  A  A  T  Q  V  T  S  A  K  E  I         p.80

          .         .         .         .         .         .       g.28109
 CCCTGCAACGAATGTGCCACTTCTTTTCCCAGTTTACAGAAATACATGGAACACCACTGC       c.300
 P  C  N  E  C  A  T  S  F  P  S  L  Q  K  Y  M  E  H  H  C         p.100

          .         .         .         .         .         .       g.28169
 CCTAATGCCCGCCTTCCTGTCCTGAAGGATGACAACGAGAGCGAGATCAGCGAGTTAGAG       c.360
 P  N  A  R  L  P  V  L  K  D  D  N  E  S  E  I  S  E  L  E         p.120

          .         .         .         .         .         .       g.28229
 GACAGTGACGTGGAAAATCTAACAGGGGAGATCGTTTACCAGCCTGATGGGTCAGCATAT       c.420
 D  S  D  V  E  N  L  T  G  E  I  V  Y  Q  P  D  G  S  A  Y         p.140

          .         .         .         .         .         .       g.28289
 ATAATTGAGGACTCCAAAGAAAGTGGGCAGAATGCACAGACTGGGGCAAATAGCAAACTC       c.480
 I  I  E  D  S  K  E  S  G  Q  N  A  Q  T  G  A  N  S  K  L         p.160

          .         .         .         .         .         .       g.28349
 TTTTCTACAGCGATGTTCCTGGACTCCCTGGCATCTGCTGGAGAGAAGAGTGATCAGTCT       c.540
 F  S  T  A  M  F  L  D  S  L  A  S  A  G  E  K  S  D  Q  S         p.180

          .         .         .         .         .         .       g.28409
 GCTTCTGCACCTATGTCGTTCTACCCACAGATCATCAACACTTTTCATATCGCTTCATCC       c.600
 A  S  A  P  M  S  F  Y  P  Q  I  I  N  T  F  H  I  A  S  S         p.200

          .         .         .         .         .         .       g.28469
 CTCGGGAAACCATTTACAGCCGATCAGGCTTTCCCAAATACCTCAGCATTAGCAGGAGTT       c.660
 L  G  K  P  F  T  A  D  Q  A  F  P  N  T  S  A  L  A  G  V         p.220

          .         .         .         .         .         .       g.28529
 GGTCCTGTGTTGCACAGTTTCCGTGTCTATGATCTCCGACACAAGAGAGAGAAAGACTAT       c.720
 G  P  V  L  H  S  F  R  V  Y  D  L  R  H  K  R  E  K  D  Y         p.240

          .         .         .         .         .         .       g.28589
 CTAACCAGTGATGGCTCAGCCAAAAACTCCTGTGTGTCCAAAGATGTCCCTAACAATGTG       c.780
 L  T  S  D  G  S  A  K  N  S  C  V  S  K  D  V  P  N  N  V         p.260

          .         .         .         .         .         .       g.28649
 GACTTGTCCAAATTCGATGGTTGTGTTAGCGATGGGAAAAGGAAACCTGTTTTAATGTGT       c.840
 D  L  S  K  F  D  G  C  V  S  D  G  K  R  K  P  V  L  M  C         p.280

          .         .         .         .         .         .       g.28709
 TTCTTGTGCAAGTTGTCTTTTGGTTATATCAGGTCATTTGTAACCCATGCTGTGCATGAT       c.900
 F  L  C  K  L  S  F  G  Y  I  R  S  F  V  T  H  A  V  H  D         p.300

          .         .         .         .         .         .       g.28769
 CATCGGATGACCCTCAATGACGAGGAGCAGAAGCTCCTCAGTAATAAATGCGTCTCCGCC       c.960
 H  R  M  T  L  N  D  E  E  Q  K  L  L  S  N  K  C  V  S  A         p.320

          .         .         .         .         .         .       g.28829
 ATAATACAGGGGATTGGCAAAGACAAAGAACCTCTTATAAGCTTTCTGGAACCAAAAAAA       c.1020
 I  I  Q  G  I  G  K  D  K  E  P  L  I  S  F  L  E  P  K  K         p.340

          .         .         .         .         .         .       g.28889
 TCCACTTCTGTTTATCCCCATTTTTCTACTACAAACCTCATAGGACCCGATCCAACCTTC       c.1080
 S  T  S  V  Y  P  H  F  S  T  T  N  L  I  G  P  D  P  T  F         p.360

          .         .         .         .         .         .       g.28949
 CGCGGTTTATGGAGCGCTTTTCATGTTGAAAATGGTGACTCTTTGCCGGCTGGCTTTGCC       c.1140
 R  G  L  W  S  A  F  H  V  E  N  G  D  S  L  P  A  G  F  A         p.380

          .         .         .         .         .         .       g.29009
 TTCTTAAAAGGAAGCGCGAGCACCTCGAGCTCAGCAGAGCAGCCGCTGGGGATTACCCAA       c.1200
 F  L  K  G  S  A  S  T  S  S  S  A  E  Q  P  L  G  I  T  Q         p.400

          .         .         .         .         .         .       g.29069
 ATGCCAAAGGCTGAAGTGAATCTGGGGGGGCTGTCTAGTTTAGTAGTGAACACCCCAATT       c.1260
 M  P  K  A  E  V  N  L  G  G  L  S  S  L  V  V  N  T  P  I         p.420

          .         .         .         .         .         .       g.29129
 ACCTCTGTCTCCCTCAGCCACTCATCGTCTGAGTCTAGCAAGATGTCAGAGAGCAAAGAC       c.1320
 T  S  V  S  L  S  H  S  S  S  E  S  S  K  M  S  E  S  K  D         p.440

          .         .         .         .         .         .       g.29189
 CAAGAGAACAACTGTGAAAGGCCAAAAGAAAGCAACGTTTTACACCCAAACGGGGAGTGC       c.1380
 Q  E  N  N  C  E  R  P  K  E  S  N  V  L  H  P  N  G  E  C         p.460

          .         .         .         .         .         .       g.29249
 CCTGTCAAAAGTGAACCCACTGAACCGGGAGATGAGGATGAAGAAGATGCGTACTCCAAT       c.1440
 P  V  K  S  E  P  T  E  P  G  D  E  D  E  E  D  A  Y  S  N         p.480

          .         .         .         .         .         .       g.29309
 GAACTTGATGACGAGGAAGTATTAGGTGAACTCACCGATAGTATTGGTAACAAAGATTTC       c.1500
 E  L  D  D  E  E  V  L  G  E  L  T  D  S  I  G  N  K  D  F         p.500

          .         .         .         .         .         .       g.29369
 CCTCTCTTAAACCAAAGCATTTCTCCTTTATCATCCAGTGTGCTAAAATTTATTGAAAAG       c.1560
 P  L  L  N  Q  S  I  S  P  L  S  S  S  V  L  K  F  I  E  K         p.520

          .         .         .         .         .         .       g.29429
 GGTACCTCGTCCTCCTCGGCGACTGTTTCTGATGACACAGAAAAGAAAAAACAGACTGCT       c.1620
 G  T  S  S  S  S  A  T  V  S  D  D  T  E  K  K  K  Q  T  A         p.540

          .         .         .         .         .         .       g.29489
 GCTGTTAGGGCCAGTGGCAGTGTTGCTAGTAACTATGGCATCAGTGGCAAGGACTTTGCA       c.1680
 A  V  R  A  S  G  S  V  A  S  N  Y  G  I  S  G  K  D  F  A         p.560

          .         .         .         .         .         .       g.29549
 GACGCAAGTGCCAGTAAAGACAGTGCCACAGCTGCTCATCCAAGTGAAATAGCCCGGGGA       c.1740
 D  A  S  A  S  K  D  S  A  T  A  A  H  P  S  E  I  A  R  G         p.580

          .         .         .         .         .         .       g.29609
 GACGAAGACAGTTCAGCCACTCCTCACCAGCATGGCTTTACCCCGAGTACTCCTGGCACA       c.1800
 D  E  D  S  S  A  T  P  H  Q  H  G  F  T  P  S  T  P  G  T         p.600

          .         .         .         .         .         .       g.29669
 CCAGGGCCTGGAGGAGACGGCTCACCGGGCAGTGGCATCGAGTGTCCAAAGTGCGACACT       c.1860
 P  G  P  G  G  D  G  S  P  G  S  G  I  E  C  P  K  C  D  T         p.620

          .         .         .         .         .         .       g.29729
 GTGTTGGGGTCTTCGAGGTCTCTTGGTGGTCATATGACTATGATGCACTCGAGGAACTCA       c.1920
 V  L  G  S  S  R  S  L  G  G  H  M  T  M  M  H  S  R  N  S         p.640

          .         .         .         .         .         .       g.29789
 TGCAAAACCCTCAAATGTCCTAAATGTAACTGGCACTACAAATATCAGCAGACCCTGGAG       c.1980
 C  K  T  L  K  C  P  K  C  N  W  H  Y  K  Y  Q  Q  T  L  E         p.660

          .         .         .         .         .         .       g.29849
 GCCCATATGAAGGAGAAACACCCTGAGCCGGGTGGCTCTTGTGTTTATTGTAAGACTGGA       c.2040
 A  H  M  K  E  K  H  P  E  P  G  G  S  C  V  Y  C  K  T  G         p.680

          .         .         .         .         .         .       g.29909
 CAGCCTCACCCCAGGCTTGCCCGGGGTGAGAGTTACACGTGTGGCTATAAACCCTTCCGT       c.2100
 Q  P  H  P  R  L  A  R  G  E  S  Y  T  C  G  Y  K  P  F  R         p.700

          .         .         .         .         .         .       g.29969
 TGTGAGGTTTGTAACTACTCTACCACTACCAAAGGCAACCTCAGTATTCATATGCAGTCG       c.2160
 C  E  V  C  N  Y  S  T  T  T  K  G  N  L  S  I  H  M  Q  S         p.720

          .         .         .         .         .         .       g.30029
 GACAAGCACCTGAACAATGTTCAGAATCTCCAAAATGGCAATGGTGAGCAGGTGTTTGGC       c.2220
 D  K  H  L  N  N  V  Q  N  L  Q  N  G  N  G  E  Q  V  F  G         p.740

          .         .         .         .         .         .       g.30089
 CACTCTGCCCCAGCCCCCAACACCAGCCTCAGTGGCTGCGGAACACCCTCTCCGTCCAAA       c.2280
 H  S  A  P  A  P  N  T  S  L  S  G  C  G  T  P  S  P  S  K         p.760

          .         .         .         .         .         .       g.30149
 CCCAAACAGAAACCCACCTGGCGGTGTGAAGTTTGTGATTATGAAACCAATGTCGCCAGG       c.2340
 P  K  Q  K  P  T  W  R  C  E  V  C  D  Y  E  T  N  V  A  R         p.780

          .         .         .         .         .         .       g.30209
 AACCTCCGAATTCATATGACCAGCGAAAAGCACATGCATAATATGATGCTTTTGCAGCAG       c.2400
 N  L  R  I  H  M  T  S  E  K  H  M  H  N  M  M  L  L  Q  Q         p.800

          .         .         .         .         .         .       g.30269
 AACATGAAGCAGATCCAGCATAATCTGCACTTGGGCCTCGCCCCGGCGGAAGCAGAGCTT       c.2460
 N  M  K  Q  I  Q  H  N  L  H  L  G  L  A  P  A  E  A  E  L         p.820

          .         .         .         .         .         .       g.30329
 TATCAGTACTACCTAGCCCAGAACATAGGCCTGACCGGAATGAAGCTGGAAAACCCTGCC       c.2520
 Y  Q  Y  Y  L  A  Q  N  I  G  L  T  G  M  K  L  E  N  P  A         p.840

          .         .         .         .         .         .       g.30389
 GACCCTCAGCTGATGATCAATCCATTCCAGCTGGATCCAGCGACAGCAGCGGCTTTGGCA       c.2580
 D  P  Q  L  M  I  N  P  F  Q  L  D  P  A  T  A  A  A  L  A         p.860

          . | 03       .         .         .         .         .    g.31316
 CCAGGGCTCG | TAAATAATGAGCTGCCGCCTGAAATCCGGCTTGCCAGTGGTCAGCTAATG    c.2640
 P  G  L  V |   N  N  E  L  P  P  E  I  R  L  A  S  G  Q  L  M      p.880

          .         .         .         .         .         .       g.31376
 GGTGATGACCTGTCCCTCCTTACTGCAGGAGAGCTGTCACCTTATATCAGTGACCCAGCG       c.2700
 G  D  D  L  S  L  L  T  A  G  E  L  S  P  Y  I  S  D  P  A         p.900

          .         .         .         .         .         .       g.31436
 CTGAAGCTATTCCAGTGTGCTGTTTGCAACAAATTCACCTCTGACAGCCTGGAGGCCCTA       c.2760
 L  K  L  F  Q  C  A  V  C  N  K  F  T  S  D  S  L  E  A  L         p.920

          .         .         .         .         .         .       g.31496
 AGTGTGCATGTGAGCAGTGAGCGCTCTCTCCCTGAAGAGGAATGGAGGGCAGTAATTGGA       c.2820
 S  V  H  V  S  S  E  R  S  L  P  E  E  E  W  R  A  V  I  G         p.940

          .         .         .         .         .         .       g.31556
 GATATCTACCAGTGCAAGCTCTGCAACTACAACACTCAGCTCAAAGCCAACTTCCAGCTA       c.2880
 D  I  Y  Q  C  K  L  C  N  Y  N  T  Q  L  K  A  N  F  Q  L         p.960

          .         .         .         .         .         .       g.31616
 CACTGCAAGACTGATAAACATATGCAGAAATATCAACTGGTGGCTCACATTAAAGAAGGG       c.2940
 H  C  K  T  D  K  H  M  Q  K  Y  Q  L  V  A  H  I  K  E  G         p.980

          .         .         .         .         .         .       g.31676
 GGCAAAAGCAATGAGTGGAGGTTGAAGTGTATTGCCATTGGCAACCCTGTTCACCTAAAA       c.3000
 G  K  S  N  E  W  R  L  K  C  I  A  I  G  N  P  V  H  L  K         p.1000

          .         .         .         .         .         .       g.31736
 TGTAACGCCTGTGACTATTACACCAACAGTGTGGATAAATTACGCTTGCATACCACCAAT       c.3060
 C  N  A  C  D  Y  Y  T  N  S  V  D  K  L  R  L  H  T  T  N         p.1020

          .         .         .    | 04    .         .         .    g.101956
 CACAGGCACGAGGCGGCCCTGAAGCTCTACAAG | CACTTGCAGAAGCAAGAGGGTGCAGTG    c.3120
 H  R  H  E  A  A  L  K  L  Y  K   | H  L  Q  K  Q  E  G  A  V      p.1040

          .         .         .         .         .         .       g.102016
 AATCCCGAATCCTGCTATTACTACTGTGCCGTGTGTGATTACACCACCAAGGTCAAGTTG       c.3180
 N  P  E  S  C  Y  Y  Y  C  A  V  C  D  Y  T  T  K  V  K  L         p.1060

          .         .         .         .         .         .       g.102076
 AATCTGGTACAACATGTCCGTTCGGTGAAGCATCAGCAGACTGAGGGCCTACGGAAGCTC       c.3240
 N  L  V  Q  H  V  R  S  V  K  H  Q  Q  T  E  G  L  R  K  L         p.1080

          .         .         .         .         .         .       g.102136
 CAGCTCCACCAGCAAGGCCTGGCACCAGAGGAGGACAACCTCAGTGAGATCTTTTTTGTT       c.3300
 Q  L  H  Q  Q  G  L  A  P  E  E  D  N  L  S  E  I  F  F  V         p.1100

          .         .      | 05  .         .         .         .    g.157094
 AAAGATTGCCCACCAAATGAGCTTG | AAACTGCCTCATTGGGAGCCAGGACTTGTGATGAT    c.3360
 K  D  C  P  P  N  E  L  E |   T  A  S  L  G  A  R  T  C  D  D      p.1120

          .         .         .     | 06   .         .         .    g.166402
 GATCTTACAGAGCAGCAGTTGAGATCGACCTCAG | AAGAACAAAGTGAGGAGGCAGAAGGA    c.3420
 D  L  T  E  Q  Q  L  R  S  T  S  E |   E  Q  S  E  E  A  E  G      p.1140

          .         .         .         .         .         .       g.166462
 GCTATTAAGCCTACAGCAGTGGCCGAGGACGATGAAAAAGACACAAGTGAGAGAGACAAT       c.3480
 A  I  K  P  T  A  V  A  E  D  D  E  K  D  T  S  E  R  D  N         p.1160

          .         .         .  | 07      .         .         .    g.172745
 AGTGAAGGCAAAAACTCTAATAAAGACTCTG | GGATAATCACACCAGAGAAGGAACTAAAA    c.3540
 S  E  G  K  N  S  N  K  D  S  G |   I  I  T  P  E  K  E  L  K      p.1180

          .         .         .         .         .         .       g.172805
 GTTAGTGTGGCAGGGGGTACCCAGCCACTCCTGCTGGCAAAAGAAGAGGATGTTGCAACA       c.3600
 V  S  V  A  G  G  T  Q  P  L  L  L  A  K  E  E  D  V  A  T         p.1200

          .         .         .         .      | 08  .         .    g.173248
 AAAAGGTCAAAACCTACAGAGGACAATAAATTCTGTCATGAACAG | TTCTATCAATGTCCT    c.3660
 K  R  S  K  P  T  E  D  N  K  F  C  H  E  Q   | F  Y  Q  C  P      p.1220

          .         .         .         .         .         .       g.173308
 TATTGTAACTACAATAGTAGGGACCAAAGTCGTATCCAGATGCACGTCCTATCACAGCAC       c.3720
 Y  C  N  Y  N  S  R  D  Q  S  R  I  Q  M  H  V  L  S  Q  H         p.1240

          .         .         .         .         .         .       g.173368
 TCGGTGCAGCCGGTCATCTGCTGTCCTCTCTGTCAGGACGTCCTCAGCAACAAAATGCAT       c.3780
 S  V  Q  P  V  I  C  C  P  L  C  Q  D  V  L  S  N  K  M  H         p.1260

          .         .         .         .         .         .       g.173428
 CTCCAACTGCATCTGACGCATTTGCACAGTGTGTCTCCAGACTGTGTGGAGAAGCTGCTT       c.3840
 L  Q  L  H  L  T  H  L  H  S  V  S  P  D  C  V  E  K  L  L         p.1280

        | 09 .         .         .         .         .         .    g.174020
 ATGACA | GTGCCTGTCCCTGATGTGATGATGCCAAACAGTATGCTACTGCCAGCAGCTGCC    c.3900
 M  T   | V  P  V  P  D  V  M  M  P  N  S  M  L  L  P  A  A  A      p.1300

          .         .         .         .         .         .       g.174080
 TCTGAGAAATCAGAGCGGGACACACCTGCAGCCGTGACAGCTGAGGGGTCTGGGAAATAT       c.3960
 S  E  K  S  E  R  D  T  P  A  A  V  T  A  E  G  S  G  K  Y         p.1320

      | 10   .         .         .         .         .         .    g.174663
 TCAG | GTGAAAGTCCAATGGATGACAAAAGCATGGCAGGTCTCGAGGATTCAAAGGCTAAT    c.4020
 S  G |   E  S  P  M  D  D  K  S  M  A  G  L  E  D  S  K  A  N      p.1340

          .         .         .         .         .         .       g.174723
 GTGGAAGTAAAGAATGAGGAGCAGAAACCGACTAAAGAACCCTTGGAAGTCTCAGAATGG       c.4080
 V  E  V  K  N  E  E  Q  K  P  T  K  E  P  L  E  V  S  E  W         p.1360

          .         .         .         .         .         .       g.174783
 AATAAAAATAGCAGTAAGGATGTGAAAATCCCCGACACACTGCAAGATCAATTAAATGAA       c.4140
 N  K  N  S  S  K  D  V  K  I  P  D  T  L  Q  D  Q  L  N  E         p.1380

          .         .         .         .         .         .       g.174843
 CAGCAAAAAAGGCAACCGCTCTCTGTTTCTGACCGTCATGTCTACAAGTATCGCTGTAAC       c.4200
 Q  Q  K  R  Q  P  L  S  V  S  D  R  H  V  Y  K  Y  R  C  N         p.1400

          .         .         .         .         .         .       g.174903
 CATTGTAGCTTGGCTTTCAAAACTATGCAGAAGCTTCAGATACATTCCCAGTATCATGCA       c.4260
 H  C  S  L  A  F  K  T  M  Q  K  L  Q  I  H  S  Q  Y  H  A         p.1420

          .         .         .         .         .         .       g.174963
 ATTCGGGCTGCGACAATGTGTAACCTCTGCCAGCGCAGTTTCCGTACATTCCAGGCTTTA       c.4320
 I  R  A  A  T  M  C  N  L  C  Q  R  S  F  R  T  F  Q  A  L         p.1440

          .         .         .         .         .         .       g.175023
 AAAAAACACTTGGAAGCAGGCCACCCTGAACTGAGTGAAGCTGAACTTCAACAGCTATAT       c.4380
 K  K  H  L  E  A  G  H  P  E  L  S  E  A  E  L  Q  Q  L  Y         p.1460

          .         .         .         .         .         .       g.175083
 GCCTCCTTGCCCGTGAATGGAGAACTGTGGGCAGAGAGCGAAACTATGTCCCAGGATGAC       c.4440
 A  S  L  P  V  N  G  E  L  W  A  E  S  E  T  M  S  Q  D  D         p.1480

          .         .         .         .         .         .       g.175143
 CATGGCCTAGAGCAGGAAATGGAGAGAGAGTATGAGGTGGACCACGAAGGGAAAGCAAGT       c.4500
 H  G  L  E  Q  E  M  E  R  E  Y  E  V  D  H  E  G  K  A  S         p.1500

          .         .         .         .         .         .       g.175203
 CCTGTAGGAAGTGATAGTAGCTCTATTCCAGATGACATGGGCTCTGAACCAAAGCGGACC       c.4560
 P  V  G  S  D  S  S  S  I  P  D  D  M  G  S  E  P  K  R  T         p.1520

          .         .         .         .         .         .       g.175263
 TTACCTTTTAGAAAAGGGCCCAATTTTACGATGGAAAAATTCCTTGATCCATCTCGTCCA       c.4620
 L  P  F  R  K  G  P  N  F  T  M  E  K  F  L  D  P  S  R  P         p.1540

          .         .         .         .         .         .       g.175323
 TATAAATGTACAGTGTGTAAAGAGTCATTCACCCAAAAGAACATTCTCTTGGTCCACTAT       c.4680
 Y  K  C  T  V  C  K  E  S  F  T  Q  K  N  I  L  L  V  H  Y         p.1560

          .         .         .         .         .         .       g.175383
 AATTCAGTTTCTCACTTGCATAAGCTGAAAAAAGTTTTGCAGGAAGCCTCCAGTCCTGTC       c.4740
 N  S  V  S  H  L  H  K  L  K  K  V  L  Q  E  A  S  S  P  V         p.1580

          .         .         .         .         .         .       g.175443
 CCACAAGAAACCAACAGCAACACAGATAACAAACCCTACAAGTGCAGCATCTGCAATGTT       c.4800
 P  Q  E  T  N  S  N  T  D  N  K  P  Y  K  C  S  I  C  N  V         p.1600

          .         .         .         .         .         .       g.175503
 GCATACAGCCAAAGCTCAACATTGGAAATCCACATGAGGTCTGTGCTCCACCAGACAAAG       c.4860
 A  Y  S  Q  S  S  T  L  E  I  H  M  R  S  V  L  H  Q  T  K         p.1620

          .         .         .         .         .         .       g.175563
 GCTAGGGCTGCAAAGCTGGAGCCCAGTGGTCATGTGGCTGGTGGGCACAGCATTGCAGCA       c.4920
 A  R  A  A  K  L  E  P  S  G  H  V  A  G  G  H  S  I  A  A         p.1640

          .         .         .         .         .         .       g.175623
 AATGTCAACAGCCCTGGCCAGGGGATGTTAGATTCCATGAGTTTAGCAGCTGTAAACAGC       c.4980
 N  V  N  S  P  G  Q  G  M  L  D  S  M  S  L  A  A  V  N  S         p.1660

          .         .         .         .         .         .       g.175683
 AAAGATACCCATTTAGATGCCAAAGAATTAAATAAAAAGCAAACTCCTGATTTAATCTCT       c.5040
 K  D  T  H  L  D  A  K  E  L  N  K  K  Q  T  P  D  L  I  S         p.1680

          .         .         .         .         .         .       g.175743
 GCTCAACCTGCACATCACCCACCACAGTCACCAGCACAAATTCAGATGCAACTACAGCAC       c.5100
 A  Q  P  A  H  H  P  P  Q  S  P  A  Q  I  Q  M  Q  L  Q  H         p.1700

          .         .         .         .         .         .       g.175803
 GAATTACAACAGCAAGCCGCATTCTTTCAGCCTCAGTTTCTAAACCCAGCCTTTTTGCCT       c.5160
 E  L  Q  Q  Q  A  A  F  F  Q  P  Q  F  L  N  P  A  F  L  P         p.1720

          .         .         .         .         .         .       g.175863
 CATTTTCCTATGACCCCAGAAGCACTGCTGCAGTTTCAGCAGCCTCAGTTTCTCTTTCCA       c.5220
 H  F  P  M  T  P  E  A  L  L  Q  F  Q  Q  P  Q  F  L  F  P         p.1740

          .         .         .         .         .         .       g.175923
 TTTTATATACCTGGGACGGAGTTCAGCTTGGGGCCAGATTTGGGCTTGCCAGGCTCTGCC       c.5280
 F  Y  I  P  G  T  E  F  S  L  G  P  D  L  G  L  P  G  S  A         p.1760

          .         .         .         .         .         .       g.175983
 ACATTTGGGATGCCTGGCATGACAGGAATGGCTGGCTCCTTGCTTGAAGACCTAAAGCAG       c.5340
 T  F  G  M  P  G  M  T  G  M  A  G  S  L  L  E  D  L  K  Q         p.1780

          .         .         .         .         .         .       g.176043
 CAGATTCAAACCCAACATCACGTTGGTCAAACTCAACTCCAGATACTACAGCAACAAGCA       c.5400
 Q  I  Q  T  Q  H  H  V  G  Q  T  Q  L  Q  I  L  Q  Q  Q  A         p.1800

          .         .         .         .         .         .       g.176103
 CAACAATACCAAGCCACACAGCCCCAGCTGCAGCCTCAAAAACAACAGCAGCAGCCACCA       c.5460
 Q  Q  Y  Q  A  T  Q  P  Q  L  Q  P  Q  K  Q  Q  Q  Q  P  P         p.1820

          .         .         .         .         .         .       g.176163
 CCTCCACAGCAGCAGCAGCAACAGCAGGCAAGCAAATTATTGAAACAAGAGCAAAGTAAC       c.5520
 P  P  Q  Q  Q  Q  Q  Q  Q  A  S  K  L  L  K  Q  E  Q  S  N         p.1840

          .         .         .         .         .         .       g.176223
 ATAGTGAGTGCAGACTGCCAAATCATGAAGGATGTGCCATCTTATAAGGAGGCAGAAGAT       c.5580
 I  V  S  A  D  C  Q  I  M  K  D  V  P  S  Y  K  E  A  E  D         p.1860

          .         .         .         .         .         .       g.176283
 ATTTCTGAAAAGCCAGAAAAACCAAAGCAGGAATTTATAAGTGAAGGTGAAGGACTCAAA       c.5640
 I  S  E  K  P  E  K  P  K  Q  E  F  I  S  E  G  E  G  L  K         p.1880

          .         .         .         .         .         .       g.176343
 GAAGGCAAAGACACAAAGAAGCAAAAATCCTTGGAACCATCCATCCCACCACCCCGAATA       c.5700
 E  G  K  D  T  K  K  Q  K  S  L  E  P  S  I  P  P  P  R  I         p.1900

          .         .         .         .         .         .       g.176403
 GCTTCAGGGGCCAGAGGAAATGCTGCCAAAGCGTTATTGGAAAACTTTGGTTTTGAACTG       c.5760
 A  S  G  A  R  G  N  A  A  K  A  L  L  E  N  F  G  F  E  L         p.1920

          .         .         .         .         .         .       g.176463
 GTCATTCAGTATAACGAAAACAGGCAGAAGGTACAGAAGAAGGGCAAAAGTGGTGAAGGC       c.5820
 V  I  Q  Y  N  E  N  R  Q  K  V  Q  K  K  G  K  S  G  E  G         p.1940

          .         .         .         .         .         .       g.176523
 GAAAACACTGACAAACTAGAATGTGGAACATGTGGTAAATTGTTTTCCAATGTTCTTATT       c.5880
 E  N  T  D  K  L  E  C  G  T  C  G  K  L  F  S  N  V  L  I         p.1960

          .         .         .         .         .         .       g.176583
 TTAAAGAGTCACCAAGAACATGTACATGGGCAATTTTTTCCATATGCAGCGCTAGAAAAA       c.5940
 L  K  S  H  Q  E  H  V  H  G  Q  F  F  P  Y  A  A  L  E  K         p.1980

          .         .         .         .         .         .       g.176643
 TTTGCTCGTCAATACAGGGAGGCCTATGACAAGCTTTATCCAATTTCTCCATCTTCTCCA       c.6000
 F  A  R  Q  Y  R  E  A  Y  D  K  L  Y  P  I  S  P  S  S  P         p.2000

          .         .         .         .         .         .       g.176703
 GAAACGCCGCCCCCGCCACCTCCTCCTCCTCCCTTGCCTCCGGCTCCTCCACAGCCTTCT       c.6060
 E  T  P  P  P  P  P  P  P  P  P  L  P  P  A  P  P  Q  P  S         p.2020

          .         .         .         .         .         .       g.176763
 TCTATGGGTCCTGTAAAGATCCCCAACACGGTTTCTACTCCTCTGCAAGCTCCACCACCC       c.6120
 S  M  G  P  V  K  I  P  N  T  V  S  T  P  L  Q  A  P  P  P         p.2040

          .         .         .         .         .         .       g.176823
 ACTCCTCCCCCACCACCACCACCTCCTCCTCCTCCTCCTCCTCCCCCCCCACCTCCTCCA       c.6180
 T  P  P  P  P  P  P  P  P  P  P  P  P  P  P  P  P  P  P  P         p.2060

          .         .         .         .         .         .       g.176883
 CCTTCTGCTCCTCCACAGGTCCAACTGCCGGTTTCTCTGGACCTGCCGCTCTTTCCTTCC       c.6240
 P  S  A  P  P  Q  V  Q  L  P  V  S  L  D  L  P  L  F  P  S         p.2080

          .         .         .         .         .         .       g.176943
 ATTATGATGCAACCTGTGCAACACCCTGCGCTTCCTCCCCAGCTTGCCCTGCAGCTGCCA       c.6300
 I  M  M  Q  P  V  Q  H  P  A  L  P  P  Q  L  A  L  Q  L  P         p.2100

          .         .         .         .         .         .       g.177003
 CAGATGGACGCACTCTCTGCAGACCTCACCCAACTTTGCCAGCAGCAGCTCGGATTAGAT       c.6360
 Q  M  D  A  L  S  A  D  L  T  Q  L  C  Q  Q  Q  L  G  L  D         p.2120

          .         .         .         .         .         .       g.177063
 CCCAACTTCTTAAGACATTCTCAGTTCAAACGCCCACGGACAAGAATTACAGATGATCAG       c.6420
 P  N  F  L  R  H  S  Q  F  K  R  P  R  T  R  I  T  D  D  Q         p.2140

          .         .         .         .         .         .       g.177123
 CTAAAAATCCTGAGGGCTTATTTTGACATTAATAATTCTCCAAGTGAAGAACAGATCCAG       c.6480
 L  K  I  L  R  A  Y  F  D  I  N  N  S  P  S  E  E  Q  I  Q         p.2160

          .         .         .         .         .         .       g.177183
 GAAATGGCAGAGAAATCTGGCCTCTCCCAAAAAGTTATCAAACACTGGTTTAGAAATACG       c.6540
 E  M  A  E  K  S  G  L  S  Q  K  V  I  K  H  W  F  R  N  T         p.2180

          .         .         .         .         .         .       g.177243
 CTTTTTAAGGAACGACAGAGAAATAAAGATTCACCATACAACTTCAGTAACCCTCCTATA       c.6600
 L  F  K  E  R  Q  R  N  K  D  S  P  Y  N  F  S  N  P  P  I         p.2200

          .         .         .         .         .         .       g.177303
 ACGGTTTTAGAAGATATCAGAATTGATCCACAGCCCACCTCTTTAGAACATTACAAATCT       c.6660
 T  V  L  E  D  I  R  I  D  P  Q  P  T  S  L  E  H  Y  K  S         p.2220

          .         .         .         .         .         .       g.177363
 GATGCATCATTCAGTAAAAGGTCTTCTAGAACGAGATTTACTGACTACCAGCTTAGGGTT       c.6720
 D  A  S  F  S  K  R  S  S  R  T  R  F  T  D  Y  Q  L  R  V         p.2240

          .         .         .         .         .         .       g.177423
 CTGCAAGACTTTTTTGACACAAACGCTTACCCAAAAGATGATGAAATAGAACAACTCTCC       c.6780
 L  Q  D  F  F  D  T  N  A  Y  P  K  D  D  E  I  E  Q  L  S         p.2260

          .         .         .         .         .         .       g.177483
 ACTGTTCTCAATCTGCCTACCCGGGTTATTGTTGTATGGTTCCAGAATGCTCGTCAGAAA       c.6840
 T  V  L  N  L  P  T  R  V  I  V  V  W  F  Q  N  A  R  Q  K         p.2280

          .         .         .         .         .         .       g.177543
 GCACGAAAGAGTTATGAGAATCAAGCAGAAACAAAAGATAATGAAAAAAGAGAACTCACT       c.6900
 A  R  K  S  Y  E  N  Q  A  E  T  K  D  N  E  K  R  E  L  T         p.2300

          .         .         .         .         .         .       g.177603
 AATGAACGGTACATTCGAACAAGCAACATGCAGTACCAGTGTAAAAAGTGCAATGTGGTT       c.6960
 N  E  R  Y  I  R  T  S  N  M  Q  Y  Q  C  K  K  C  N  V  V         p.2320

          .         .         .         .         .         .       g.177663
 TTCCCCAGGATCTTTGACTTGATTACGCATCAGAAAAAGCAGTGTTACAAGGATGAAGAT       c.7020
 F  P  R  I  F  D  L  I  T  H  Q  K  K  Q  C  Y  K  D  E  D         p.2340

          .         .         .         .         .         .       g.177723
 GATGATGCCCAAGATGAAAGCCAAACAGAAGACTCCATGGATGCCACTGATCAAGTGGTA       c.7080
 D  D  A  Q  D  E  S  Q  T  E  D  S  M  D  A  T  D  Q  V  V         p.2360

          .         .         .         .         .         .       g.177783
 TACAAGCATTGCACAGTGTCTGGCCAAACGGATGCAGCTAAAAACGCTGCTGCCCCTGCA       c.7140
 Y  K  H  C  T  V  S  G  Q  T  D  A  A  K  N  A  A  A  P  A         p.2380

          .         .         .         .         .         .       g.177843
 GCAAGTTCTGGCTCTGGGACCAGCACCCCCCTGATTCCATCACCCAAACCAGAACCTGAG       c.7200
 A  S  S  G  S  G  T  S  T  P  L  I  P  S  P  K  P  E  P  E         p.2400

          .         .         .         .         .         .       g.177903
 AAGACTTCTCCAAAACCTGAATATCCCGCAGAAAAGCCAAAGCAGAGTGACCCCTCTCCC       c.7260
 K  T  S  P  K  P  E  Y  P  A  E  K  P  K  Q  S  D  P  S  P         p.2420

          .         .         .         .         .         .       g.177963
 CCTTCTCAAGGCACCAAACCAGCCCTGCCATTAGCATCGACTTCCTCGGACCCACCACAG       c.7320
 P  S  Q  G  T  K  P  A  L  P  L  A  S  T  S  S  D  P  P  Q         p.2440

          .         .         .         .         .         .       g.178023
 GCATCCACAGCCCAGCCACAGCCACAGCCACAGCCACCAAAACAACCCCAACTTATCGGA       c.7380
 A  S  T  A  Q  P  Q  P  Q  P  Q  P  P  K  Q  P  Q  L  I  G         p.2460

          .         .         .         .         .         .       g.178083
 AGACCTCCCTCGGCCTCTCAAACACCGGTCCCTTCCAGTCCACTGCAAATTTCCATGACG       c.7440
 R  P  P  S  A  S  Q  T  P  V  P  S  S  P  L  Q  I  S  M  T         p.2480

          .         .         .         .         .         .       g.178143
 TCTCTCCAGAACAGTCTACCTCCACAGTTACTACAATACCAATGTGATCAGTGTACAGTT       c.7500
 S  L  Q  N  S  L  P  P  Q  L  L  Q  Y  Q  C  D  Q  C  T  V         p.2500

          .         .         .         .         .         .       g.178203
 GCCTTCCCAACTCTGGAACTCTGGCAGGAACACCAGCACATGCACTTCCTTGCTGCTCAA       c.7560
 A  F  P  T  L  E  L  W  Q  E  H  Q  H  M  H  F  L  A  A  Q         p.2520

          .         .         .         .         .         .       g.178263
 AACCAATTCCTTCACTCTCCGTTCTTGGAAAGGCCCATGGACATGCCCTACATGATATTT       c.7620
 N  Q  F  L  H  S  P  F  L  E  R  P  M  D  M  P  Y  M  I  F         p.2540

          .         .         .         .         .         .       g.178323
 GACCCCAACAATCCGCTGATGACTGGACAACTGCTGGGCAGTTCCCTCACTCAAATGCCC       c.7680
 D  P  N  N  P  L  M  T  G  Q  L  L  G  S  S  L  T  Q  M  P         p.2560

          .         .         .         .         .         .       g.178383
 CCTCAGGCCAGTTCCTCCCACACCACAGCCCCCACAACGGTTGCTGCTTCCCTAAAAAGG       c.7740
 P  Q  A  S  S  S  H  T  T  A  P  T  T  V  A  A  S  L  K  R         p.2580

          .         .         .         .         .         .       g.178443
 AAACTAGACGATAAAGAAGATAATAATTGCAGTGAAAAAGAAGGAGGGAATAGCGGTGAA       c.7800
 K  L  D  D  K  E  D  N  N  C  S  E  K  E  G  G  N  S  G  E         p.2600

          .         .         .         .         .         .       g.178503
 GACCAACACCGAGATAAACGCTTGAGAACCACGATCACCCCGGAACAGCTGGAAATACTC       c.7860
 D  Q  H  R  D  K  R  L  R  T  T  I  T  P  E  Q  L  E  I  L         p.2620

          .         .         .         .         .         .       g.178563
 TATGAAAAATACTTGCTGGATTCCAATCCTACCAGAAAAATGCTTGATCATATTGCCCGC       c.7920
 Y  E  K  Y  L  L  D  S  N  P  T  R  K  M  L  D  H  I  A  R         p.2640

          .         .         .         .         .         .       g.178623
 GAAGTCGGGCTGAAAAAAAGGGTCGTGCAAGTCTGGTTCCAGAATACACGAGCGCGGGAG       c.7980
 E  V  G  L  K  K  R  V  V  Q  V  W  F  Q  N  T  R  A  R  E         p.2660

          .         .         .         .         .         .       g.178683
 AGGAAAGGCCAGTTCCGGGCGGTGGGTCCAGCACAGTCTCATAAACGGTGTCCGTTTTGC       c.8040
 R  K  G  Q  F  R  A  V  G  P  A  Q  S  H  K  R  C  P  F  C         p.2680

          .         .         .         .         .         .       g.178743
 CGAGCCCTGTTTAAAGCAAAGTCGGCCTTAGAAAGCCACATTCGCTCTCGGCACTGGAAT       c.8100
 R  A  L  F  K  A  K  S  A  L  E  S  H  I  R  S  R  H  W  N         p.2700

          .         .         .         .         .         .       g.178803
 GAAGGAAAGCAGGCAGGTTACAGCTTGCCACCAAGCCCTTTAATATCCACCGAAGATGGG       c.8160
 E  G  K  Q  A  G  Y  S  L  P  P  S  P  L  I  S  T  E  D  G         p.2720

          .         .         .         .         .         .       g.178863
 GGAGAAAGCCCACAGAAATACATCTATTTTGATTACCCATCTTTGCCATTAACTAAAATT       c.8220
 G  E  S  P  Q  K  Y  I  Y  F  D  Y  P  S  L  P  L  T  K  I         p.2740

          .         .         .         .         .         .       g.178923
 GATCTATCAAGTGAGAATGAATTGGCTTCTACAGTGTCAACACCTGTTAGTAAAACAGCA       c.8280
 D  L  S  S  E  N  E  L  A  S  T  V  S  T  P  V  S  K  T  A         p.2760

          .         .         .         .         .         .       g.178983
 GAGCTGTCACCGAAGAATCTTTTAAGCCCTTCTTCTTTTAAAGCAGAGTGTTCTGAGGAT       c.8340
 E  L  S  P  K  N  L  L  S  P  S  S  F  K  A  E  C  S  E  D         p.2780

          .         .         .         .         .         .       g.179043
 GTAGAGAATTTAAATGCCCCTCCTGCTGAGGCTGGGTATGATCAAAATAAAACCGATTTT       c.8400
 V  E  N  L  N  A  P  P  A  E  A  G  Y  D  Q  N  K  T  D  F         p.2800

          .         .         .         .         .         .       g.179103
 GATGAGACTTCATCGATTAATACGGCAATCAGTGACGCCACCACCGGAGACGAGGGAAAC       c.8460
 D  E  T  S  S  I  N  T  A  I  S  D  A  T  T  G  D  E  G  N         p.2820

          .         .         .         .         .         .       g.179163
 ACTGAAATGGAAAGCACCACAGGAAGTTCCGGAGATGTGAAACCGGCTTTGTCTCCCAAA       c.8520
 T  E  M  E  S  T  T  G  S  S  G  D  V  K  P  A  L  S  P  K         p.2840

          .         .         .         .         .         .       g.179223
 GAGCCAAAAACTCTGGATACTCTGCCAAAACCTGCAACCACACCTACCACGGAGGTCTGC       c.8580
 E  P  K  T  L  D  T  L  P  K  P  A  T  T  P  T  T  E  V  C         p.2860

          .         .         .         .         .         .       g.179283
 GATGACAAATTTCTCTTTTCTCTCACAAGCCCATCCATCCATTTCAATGACAAAGATGGC       c.8640
 D  D  K  F  L  F  S  L  T  S  P  S  I  H  F  N  D  K  D  G         p.2880

          .         .         .         .         .         .       g.179343
 GACCACGACCAAAGCTTTTACATCACAGATGACCCGGATGACAACGCCGACCGCAGCGAA       c.8700
 D  H  D  Q  S  F  Y  I  T  D  D  P  D  D  N  A  D  R  S  E         p.2900

          .         .         .         .         .         .       g.179403
 ACGTCCAGCATAGCGGACCCGAGCTCCCCAAATCCATTCGGATCCAGCAATCCCTTTAAA       c.8760
 T  S  S  I  A  D  P  S  S  P  N  P  F  G  S  S  N  P  F  K         p.2920

          .         .         .         .         .         .       g.179463
 TCCAAAAGTAATGATCGGCCGGGTCACAAGCGTTTTCGAACGCAAATGAGCAATCTTCAA       c.8820
 S  K  S  N  D  R  P  G  H  K  R  F  R  T  Q  M  S  N  L  Q         p.2940

          .         .         .         .         .         .       g.179523
 CTCAAGGTTCTCAAGGCTTGCTTTAGTGACTACCGAACTCCAACCATGCAAGAATGTGAA       c.8880
 L  K  V  L  K  A  C  F  S  D  Y  R  T  P  T  M  Q  E  C  E         p.2960

          .         .         .         .         .         .       g.179583
 ATGTTAGGGAATGAGATTGGTCTGCCCAAACGCGTAGTCCAGGTGTGGTTCCAAAATGCA       c.8940
 M  L  G  N  E  I  G  L  P  K  R  V  V  Q  V  W  F  Q  N  A         p.2980

          .         .         .         .         .         .       g.179643
 AGGGCAAAGGAAAAGAAATTTAAAATTAACATAGGGAAGCCTTTCATGATCAATCAAGGC       c.9000
 R  A  K  E  K  K  F  K  I  N  I  G  K  P  F  M  I  N  Q  G         p.3000

          .         .         .         .         .         .       g.179703
 GGAACGGAAGGCACCAAACCAGAGTGTACCCTCTGCGGGGTGAAGTACTCTGCCCGCTTG       c.9060
 G  T  E  G  T  K  P  E  C  T  L  C  G  V  K  Y  S  A  R  L         p.3020

          .         .         .         .         .         .       g.179763
 TCCATCAGAGATCACATTTTCTCCAAACAGCACATTTCAAAAGTGAGGGAGACCGTTGGC       c.9120
 S  I  R  D  H  I  F  S  K  Q  H  I  S  K  V  R  E  T  V  G         p.3040

          .         .         .         .         .         .       g.179823
 AGTCAGCTCGATCGGGAGAAAGATTACTTGGCTCCGACCACGGTTCGGCAGCTGATGGCA       c.9180
 S  Q  L  D  R  E  K  D  Y  L  A  P  T  T  V  R  Q  L  M  A         p.3060

          .         .         .         .         .         .       g.179883
 CAGCAAGAACTTGATCGTATAAAGAAAGCTTCAGACGTGCTGGGCTTGACGGTACAGCAG       c.9240
 Q  Q  E  L  D  R  I  K  K  A  S  D  V  L  G  L  T  V  Q  Q         p.3080

          .         .         .         .         .         .       g.179943
 CCAGGCATGATGGACAGCAGTTCTCTCCACGGCATCAGCCTGCCAACAGCCTACCCCGGA       c.9300
 P  G  M  M  D  S  S  S  L  H  G  I  S  L  P  T  A  Y  P  G         p.3100

          .         .         .         .         .         .       g.180003
 CTCCCCGGCCTTCCTCCAGTCCTTCTCCCCGGAATGAACGGTCCATCCTCCTTGCCGGGA       c.9360
 L  P  G  L  P  P  V  L  L  P  G  M  N  G  P  S  S  L  P  G         p.3120

          .          | 11        .         .         .         .    g.186856
 TTTCCACAAAATTCAAACA | CTTTAACACCTCCCGGTGCAGGCATGCTTGGGTTTCCTACT    c.9420
 F  P  Q  N  S  N  T |   L  T  P  P  G  A  G  M  L  G  F  P  T      p.3140

          .         .         .         .         .         .       g.186916
 TCAGCTACTTCGTCTCCTGCCCTGTCTCTCAGCAGTGCCCCCACCAAACCTTTGCTGCAG       c.9480
 S  A  T  S  S  P  A  L  S  L  S  S  A  P  T  K  P  L  L  Q         p.3160

          .         .         .         .         .         .       g.186976
 ACTCCACCACCTCCACCACCTCCTCCTCCTCCTCCTCCTTCATCCTCTCTGTCAGGACAG       c.9540
 T  P  P  P  P  P  P  P  P  P  P  P  P  S  S  S  L  S  G  Q         p.3180

          .         .         .         .         .         .       g.187036
 CAGACCGAGCAACAGAACAAAGAATCTGAGAAAAAGCAAACTAAGCCAAACAAGGTGAAA       c.9600
 Q  T  E  Q  Q  N  K  E  S  E  K  K  Q  T  K  P  N  K  V  K         p.3200

          .         .         .         .         .         .       g.187096
 AAAATCAAAGAGGAGGAATTAGAGGCCACCAAACCCGAAAAACACCCCAAAAAAGAGGAA       c.9660
 K  I  K  E  E  E  L  E  A  T  K  P  E  K  H  P  K  K  E  E         p.3220

          .         .         .         .         .         .       g.187156
 AAAATCTCATCTGCTCTTTCAGTGTTGGGCAAAGTTGTAGGTGAAACACATGTCGATCCT       c.9720
 K  I  S  S  A  L  S  V  L  G  K  V  V  G  E  T  H  V  D  P         p.3240

          .         .         .         .         .         .       g.187216
 ATTCAGTTGCAGGCATTACAGAATGCAATTGCTGGTGACCCAGCTTCCTTTATAGGCGGA       c.9780
 I  Q  L  Q  A  L  Q  N  A  I  A  G  D  P  A  S  F  I  G  G         p.3260

          .         .         .         .         .         .       g.187276
 CAGTTCTTGCCATACTTTATCCCTGGGTTTGCTTCTTATTTTACACCTCAGCTCCCTGGA       c.9840
 Q  F  L  P  Y  F  I  P  G  F  A  S  Y  F  T  P  Q  L  P  G         p.3280

          .         .         .         .         .         .       g.187336
 ACAGTGCAGGGGGGATACTTCCCACCTGTCTGTGGCATGGAGAGCCTCTTTCCTTATGGC       c.9900
 T  V  Q  G  G  Y  F  P  P  V  C  G  M  E  S  L  F  P  Y  G         p.3300

          .         .         .         .         .         .       g.187396
 CCTACAATGCCCCAGACACTGGCAGGTCTGTCCCCAGGTGCACTGTTGCAGCAGTACCAA       c.9960
 P  T  M  P  Q  T  L  A  G  L  S  P  G  A  L  L  Q  Q  Y  Q         p.3320

          .         .         .         .         .         .       g.187456
 CAGTATCAGCAGAACCTGCAGGAGTCCCTGCAAAAGCAGCAAAAGCAACAGCAAGAACAG       c.10020
 Q  Y  Q  Q  N  L  Q  E  S  L  Q  K  Q  Q  K  Q  Q  Q  E  Q         p.3340

          .         .         .         .         .         .       g.187516
 CAGCAGAAACCAGTTCAGGCAAAGACATCCAAAGTAGAAAGTGACCAGCCGCAAAACTCC       c.10080
 Q  Q  K  P  V  Q  A  K  T  S  K  V  E  S  D  Q  P  Q  N  S         p.3360

          .         .         .         .         .         .       g.187576
 AACGATGCTTCAGAAACAAAGGAAGACAAAAGTACTGCTACAGAAAGCACAAAAGAAGAA       c.10140
 N  D  A  S  E  T  K  E  D  K  S  T  A  T  E  S  T  K  E  E         p.3380

          .         .         .         .         .         .       g.187636
 CCCCAGTTAGAATCCAAAAGTGCAGACTTTTCAGACACTTACGTTGTTCCATTCGTCAAG       c.10200
 P  Q  L  E  S  K  S  A  D  F  S  D  T  Y  V  V  P  F  V  K         p.3400

          .         .         .         .         .         .       g.187696
 TATGAGTTTATATGCAGAAAGTGCCAGATGATGTTTACTGATGAAGACGCCGCAGTAAAT       c.10260
 Y  E  F  I  C  R  K  C  Q  M  M  F  T  D  E  D  A  A  V  N         p.3420

          .         .         .         .         .         .       g.187756
 CATCAAAAGTCCTTCTGTTATTTCGGTCAGCCTTTGATTGACCCACAAGAGACAGTGCTT       c.10320
 H  Q  K  S  F  C  Y  F  G  Q  P  L  I  D  P  Q  E  T  V  L         p.3440

          .         .         .         .         .         .       g.187816
 CGTGTCCCAGTCAGCAAATATCAGTGTCTTGCCTGTGATGTGGCTATCAGTGGGAATGAA       c.10380
 R  V  P  V  S  K  Y  Q  C  L  A  C  D  V  A  I  S  G  N  E         p.3460

          .         .         .         .         .         .       g.187876
 GCACTTAGCCAACACCTCCAGTCAAGCTTGCACAAAGAGAAAACAATCAAACAAGCAATG       c.10440
 A  L  S  Q  H  L  Q  S  S  L  H  K  E  K  T  I  K  Q  A  M         p.3480

          .         .         .         .         .         .       g.187936
 AGAAATGCCAAAGAGCATGTTAGATTATTACCTCACTCAGTCTGCTCCCCTAATCCTAAC       c.10500
 R  N  A  K  E  H  V  R  L  L  P  H  S  V  C  S  P  N  P  N         p.3500

          .         .         .         .         .         .       g.187996
 ACCACATCTACCTCGCAGTCTGCAGCTTCTTCTAATAACACCTATCCTCATCTTTCTTGC       c.10560
 T  T  S  T  S  Q  S  A  A  S  S  N  N  T  Y  P  H  L  S  C         p.3520

          .         .         .         .         .         .       g.188056
 TTCTCCATGAAGTCCTGGCCTAATATCCTTTTCCAAGCGTCTGCCAGGAGAGCTGCTTCT       c.10620
 F  S  M  K  S  W  P  N  I  L  F  Q  A  S  A  R  R  A  A  S         p.3540

          .         .         .         .         .         .       g.188116
 CCCCCTTCTTCTCCTCCTTCCCTTTCCTTGCCTTCAACGGTTACCTCAAGTTTGTGCAGC       c.10680
 P  P  S  S  P  P  S  L  S  L  P  S  T  V  T  S  S  L  C  S         p.3560

          .         .         .         .         .         .       g.188176
 ACCTCAGGGGTTCAAACCTCACTACCCACAGAAAGTTGTTCAGATGAGTCTGACAGTGAG       c.10740
 T  S  G  V  Q  T  S  L  P  T  E  S  C  S  D  E  S  D  S  E         p.3580

          .         .         .         .         .         .       g.188236
 CTGAGCCAGAAGCTAGAAGACTTAGATAATTCTTTGGAAGTGAAGGCTAAGCCTGCTTCT       c.10800
 L  S  Q  K  L  E  D  L  D  N  S  L  E  V  K  A  K  P  A  S         p.3600

          .         .         .         .         .                 g.188287
 GGCCTAGATGGTAATTTCAATAGCATCCGAATGGATATGTTCAGTGTGTAG                c.10851
 G  L  D  G  N  F  N  S  I  R  M  D  M  F  S  V  X                  p.3616

          .         .         .         .         .         .       g.188347
 gagtgaagacaggatcccgtgcttaaaaaaataaaaaataaaaaaataaaaaaaaaataa       c.*60

          .         .         .         .         .         .       g.188407
 gactttaactgcagttccaaagcttctctaacccaaaaattacagtaccaaatgattgac       c.*120

          .         .         .         .         .         .       g.188467
 tcaggattgtttttcccatattgatatgctggcaatataggatggtatgtaatggacaga       c.*180

          .         .         .         .         .         .       g.188527
 actgatgcagatggttgaatgcgcttgtactatatgctaaaatatggaaaaggaaaaaaa       c.*240

          .         .         .         .         .         .       g.188587
 aatctcacaagttcttttggaacttgtttcaagccaaaaactctcaagaaagcaaattgc       c.*300

          .         .         .         .         .         .       g.188647
 acctcagctggattgatttccaaatgctagcatgtactgtatgggaggatgatccagatg       c.*360

          .         .         .         .         .         .       g.188707
 tttcaaagagaatttctcttagtttagttaggtgtaattcagtagctttaaattctcagg       c.*420

          .         .         .         .         .         .       g.188767
 tcagaacataacatttctcatttgttaaaagcagcaagaagcctggtaaaactgtgactt       c.*480

          .         .         .         .         .         .       g.188827
 ttccccaaacgtcaatctttattagaaagcattttctaggtgtgtttagtgtacaaagag       c.*540

          .         .         .         .         .         .       g.188887
 actttataacccttactggacaacacacagatccttgagctcacgctgcaggatagtaca       c.*600

          .         .         .         .         .         .       g.188947
 gttttaccgcagagggaatctggaacagtggaatcatgtgtctgccctgtgtattgcagt       c.*660

          .         .         .         .         .         .       g.189007
 ttgtattgccacaagctatatttataccagtgtcacccttttcttgtagaatatactaat       c.*720

          .         .         .         .         .         .       g.189067
 aatctgtgccaactctaccttctcacttttacctctgacgtcattctttttttctgaaag       c.*780

          .         .         .         .         .         .       g.189127
 aggtaataattctagttttgatagactctgaggattatgtgaacaggacatttttcattt       c.*840

          .         .         .         .         .         .       g.189187
 gtgaatttaatgctatactgtcaaggtacttgcttgtgtctgaactctagtgcacttatg       c.*900

          .         .         .         .         .         .       g.189247
 attttgtagaccatgtgaaatttaataagataccttttttttcctttctttgtgtgtagt       c.*960

          .         .         .         .         .         .       g.189307
 gcagcaacagtttggtctgcatttgttagaagtttaactcctaacaacccaaagacctat       c.*1020

          .         .         .         .         .         .       g.189367
 ttaacaattggtgcataaatgaaagtagtactgtatacttgaaactgtttaagtacaagt       c.*1080

          .         .         .         .         .         .       g.189427
 tgaacaaaaattatgaaaaggtatatttgcttctcgggaaagcaaagaagctgctttaaa       c.*1140

          .         .         .         .         .         .       g.189487
 aaataaaaaggggactaaaaatttgttttgtataaagaggttagccctgcgcacgtagga       c.*1200

          .         .         .         .         .         .       g.189547
 ctgaattcagtgatatccctatacactgccatttagtggataggttattgtacttccatt       c.*1260

          .         .         .         .         .         .       g.189607
 catactctgggcacttgtgttgtattgttctgttacatactttttttaacctgttttgtt       c.*1320

          .         .         .         .         .         .       g.189667
 ttatcatatatgcattaaaagtattatctttatcaacatttgctgctactgtgttaacat       c.*1380

          .         .         .         .         .         .       g.189727
 ttttgttttgcttgccatgaatttcaacttccaccacccagtgaattgatttataaattg       c.*1440

          .         .         .         .         .         .       g.189787
 ctatgctttgctgtttttctgttgctgtggaacttaaagaatgtgaaagctgtcaaaggg       c.*1500

          .         .         .         .         .         .       g.189847
 tattttacgaatcacttttgtgtttgatatagtaaaacaatgtgattcattccaaagtaa       c.*1560

          .         .         .         .         .         .       g.189907
 cagaaggttatttgtaagaaagttaaaggcttgtgaacaaagaaagctaagctgttgtac       c.*1620

          .         .         .         .         .         .       g.189967
 atatttgtagttggctgtgcatggtacaaatttattaatatgaagaaatgcaaaatgtat       c.*1680

          .         .         .         .         .         .       g.190027
 tgcttttgatatttctcttccgagatgaacaagtagcatgtaatgcaactgtttgacagt       c.*1740

          .         .         .         .         .         .       g.190087
 ttaactcaagtcatgcttcaaactgttttaatgatcaaatcaagacacatttcattttac       c.*1800

          .         .         .         .         .         .       g.190147
 attttattattgtacagtttttgtttcggatgatgatcacagcaatctttattctataca       c.*1860

          .         .         .         .         .         .       g.190207
 ttttatgtgaacttttttaatgtctttaatttggattttttttttttttagtattttaac       c.*1920

          .         .         .         .         .         .       g.190267
 atttattttaatcctgaagacacttttttgattgtgtttcgtaagagacaacatggcctc       c.*1980

          .         .         .         .         .         .       g.190327
 ctaaggtgcaatcctgccgctatagtgagctaatgtcctgaatccaaaggcttcagaaaa       c.*2040

          .         .         .         .         .         .       g.190387
 ttgcttttgcctttttcatgaatgttaagcagcagcattgtgagatcgatctgtcctggc       c.*2100

          .         .         .         .         .         .       g.190447
 agttaacacgatgtgcaacagtgtgttagcatggaacagaacgcttttcacaaaacaaag       c.*2160

          .         .         .         .         .         .       g.190507
 gactgttttacaaatgattattccgacagtgtgtcgacataaacttttacaactgcacag       c.*2220

          .         .         .         .         .         .       g.190567
 cagccaaaaaaagaaaaaaaaaagaaaaaaaactttaactggatggacgttgttagggtg       c.*2280

          .         .         .         .         .         .       g.190627
 agaaataaaaggacagcctccaaaggttgagaatgagaattgttttttcctggatatcaa       c.*2340

          .         .         .         .         .         .       g.190687
 agggattatcacagcgcaatcattgtctacacaacatgtactctcaacgcctgggttaca       c.*2400

          .         .         .         .         .         .       g.190747
 taggaaatgcaccctgaggttttaataaaagcccctatggctataactttaaataaacta       c.*2460

          .         .         .         .         .         .       g.190807
 aaccaaaaatgttattgatgttttatatatagagagtagtctcattagtttttgttactg       c.*2520

          .         .         .         .         .         .       g.190867
 taatgtttgaagtctcaaatgcaccgtattacggtaaataacatggttttgaaaactttt       c.*2580

          .         .         .         .         .         .       g.190927
 ttttattttgtcacagacctgttgtcatagttgaaatgatgtttattgtagatggtattt       c.*2640

          .         .         .         .         .         .       g.190987
 gaacttattcttctggaaatagttcatcaagtatgtttgttgctcattgtgatacattaa       c.*2700

          .         .                                               g.191007
 aaactgtatctacatattta                                               c.*2720

 (downstream sequence)

Legend:
Nucleotide numbering (following the HGVS rules for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation-initiating methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The zinc finger homeobox 4 protein sequence is shown below the coding DNA sequence, with numbering indicated at the right, starting with 1 for the translation-initiating methionine. Every 10th amino acid is shown in bold. The position of an intron is indicated by a vertical line separating the two flanking exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frameshift variants, all stop codons in the +1 frame are shown in bold, while all stop codons in the +2 frame are underlined.
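
The numbering scheme described in the legend can be sketched in a few lines of Python. This is a minimal illustration, not part of LOVD: the function names are hypothetical, and the 5' UTR length used below is an assumed placeholder. Only the coding sequence length (10851 nucleotides, stop codon included) is taken from the listing above.

```python
def hgvs_c_position(offset, utr5_len, cds_len):
    """Return the HGVS c. label for a 0-based offset into the spliced transcript.

    offset   -- 0-based index of the nucleotide in the transcript
    utr5_len -- number of 5' UTR nucleotides before the A of the ATG
    cds_len  -- length of the coding sequence, stop codon included
    """
    if offset < 0:
        raise ValueError("offset must be non-negative")
    if offset < utr5_len:
        # 5' UTR: negative numbers counting back from the ATG; there is no c.0
        return f"c.{offset - utr5_len}"
    pos = offset - utr5_len + 1  # the A of the ATG is c.1
    if pos <= cds_len:
        return f"c.{pos}"
    # 3' UTR: c.*1 is the first nucleotide after the stop codon
    return f"c.*{pos - cds_len}"


def codon_number(c_pos):
    """Return the 1-based codon (amino acid) number for a coding position c_pos >= 1."""
    if c_pos < 1:
        raise ValueError("codon numbers are only defined for c.1 and later")
    return (c_pos - 1) // 3 + 1


CDS_LEN = 10851   # from the listing: last nucleotide of the stop codon is c.10851
UTR5_LEN = 100    # hypothetical value, for illustration only

print(hgvs_c_position(UTR5_LEN - 1, UTR5_LEN, CDS_LEN))        # last 5' UTR base
print(hgvs_c_position(UTR5_LEN, UTR5_LEN, CDS_LEN))            # A of the ATG
print(hgvs_c_position(UTR5_LEN + CDS_LEN, UTR5_LEN, CDS_LEN))  # first 3' UTR base
print(codon_number(8100))     # row labelled c.8100 / p.2700
print(codon_number(CDS_LEN))  # codon holding the stop (X)
```

The last two calls agree with the labels in the listing: c.8100 falls in codon 2700, and c.10851 falls in codon 3617, which is the stop codon following the 3616 amino acids of the protein.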

Powered by LOVD v.3.0 Build 26
©2004-2021 Leiden University Medical Center