(used for variant description)
(last modified May 9, 2016)
This file was created to facilitate the description of sequence variants on transcript NM_001145901.1 in the SARS2 gene based on a coding DNA reference sequence following the HGVS recommendations.
The sequence was taken from NG_031865.1, covering SARS2 transcript NM_001145901.1.
(upstream sequence)
. . . . g.5040
tgaagaggcgggacacagacgacgattgggcgacgaagga c.-121
. . . . . . g.5100
ctctatcgcggtcaactttcctcaggatctgattggctcgtcacggtcaggttcgctgcc c.-61
. . . . . . g.5160
ttgcaattggtctaaacgcggagtgggcgggacgaagtgccgccttgttcccggtccaag c.-1
. . . . . . g.5220
ATGGCTGCGTCCATGGCGCGGCGCTTGTGGCCTTTGCTGACTCGTCGGGGGTTCCGGCCC c.60
M A A S M A R R L W P L L T R R G F R P p.20
. . . . . . g.5280
CGGGGAGGCTGCATCTCCAACGATAGTCCAAGGAGAAGTTTCACTACAGAGAAACGAAAC c.120
R G G C I S N D S P R R S F T T E K R N p.40
. . . . . . g.5340
CGGAACCTCCTGTACGAGTATGCGCGCGAGGGCTACAGCGCACTCCCTCAGCTGGACATA c.180
R N L L Y E Y A R E G Y S A L P Q L D I p.60
. . . . . . g.5400
GAGCGGTTCTGCGCATGCCCAGAAGAGGCCGCACACGCCCTGGAGCTCCGCAAGGGGGAG c.240
E R F C A C P E E A A H A L E L R K G E p.80
. . | 02. . . . g.9629
CTGCGCTCGGCGGACCTGCCCGCGATC | ATCTCGACATGGCAGGAGCTGAGGCAGCTGCAG c.300
L R S A D L P A I | I S T W Q E L R Q L Q p.100
. . . . . . g.9689
GAGCAGATCCGGAGCCTGGAGGAAGAGAAGGCAGCTGTGACTGAGGCAGTGCGGGCCCTG c.360
E Q I R S L E E E K A A V T E A V R A L p.120
| 03 . . . | 04 . . . g.13845
CTG | GCAAACCAGGACAGTGGTGAAGTGCAGCAG | GTGCGGCTGGATCCAGGTGCTGGCTCC c.420
L | A N Q D S G E V Q Q | V R L D P G A G S p.140
. . . . . | 05 . g.14310
ATATTTGGTCCTACGTTCCTCCCATTCCCAGGCCAGCTTTCTCTCCTTGT | GGAGGCCCAG c.480
I F G P T F L P F P G Q L S L L V | E A Q p.160
. . . . . . g.14370
CTTGAGGAGCAGTTCTACCTGCAGGCGCTGAAGCTGCCCAACCAGACCCACCCAGACGTG c.540
L E E Q F Y L Q A L K L P N Q T H P D V p.180
| 06 . . . . . | 07 . g.15752
| CCCGTCGGGGATGAGAGCCAGGCTCGAGTGCTCCACATGGTCGGAGACAAGCCAG | TTTTC c.600
| P V G D E S Q A R V L H M V G D K P V | F p.200
. . . . . | 08 g.16030
TCCTTCCAACCTCGGGGCCACCTGGAAATTGGCGAGAAACTCGACATCATCCGTCAGAA | G c.660
S F Q P R G H L E I G E K L D I I R Q K | p.220
. . . . . . g.16090
CGCCTGTCCCACGTGTCTGGCCACCGGTCCTATTACCTGCGCGGGGCTGGAGCCCTCCTG c.720
R L S H V S G H R S Y Y L R G A G A L L p.240
. . . . | 09 . . g.17098
CAGCACGGCCTGGTCAACTTCACATTCAACAAGCTTCTCCGCCGG | GGCTTCACCCCCATG c.780
Q H G L V N F T F N K L L R R | G F T P M p.260
. . . | 10 . . . g.17393
ACGGTGCCAGACCTTCTCCGCGGAGCAGTGTTT | GAAGGCTGTGGGATGACACCAAATGCC c.840
T V P D L L R G A V F | E G C G M T P N A p.280
. . . . . . g.17453
AACCCATCCCAAATTTACAACATCGACCCTGCCCGCTTCAAAGATCTCAACCTGGCTGGA c.900
N P S Q I Y N I D P A R F K D L N L A G p.300
. . | 11 . . . . g.17795
ACAGCGGAGGTGGGGCTTGCAG | GCTACTTCATGGACCACACCGTGGCCTTCAGGGACCTG c.960
T A E V G L A G | Y F M D H T V A F R D L p.320
| 12 . . . . . g.17940
CCAGTCAG | GATGGTTTGCTCCAGCACCTGCTACCGGGCAGAGACAAACACGGGACAGGAA c.1020
P V R | M V C S S T C Y R A E T N T G Q E p.340
. . . | 13 . . . g.18087
CCCCGGGGGCTGTATCGAGTACACCACTTCACCAAG | GTGGAGATGTTTGGGGTGACAGGC c.1080
P R G L Y R V H H F T K | V E M F G V T G p.360
. . . . . . g.18147
CCTGGGCTGGAGCAGAGCTCACAGCTGCTGGAGGAGTTCCTGTCCCTTCAGATGGAGATC c.1140
P G L E Q S S Q L L E E F L S L Q M E I p.380
. . | 14 . . . . g.19616
TTGACAGAGCTGGGCTTGCACTTCCG | GGTCCTGGATATGCCCACCCAAGAACTGGGCCTC c.1200
L T E L G L H F R | V L D M P T Q E L G L p.400
. . . . . . g.19676
CCCGCCTACCGCAAGTTTGACATTGAGGCCTGGATGCCAGGCCGAGGCCGCTTTGGAGAG c.1260
P A Y R K F D I E A W M P G R G R F G E p.420
| 15 . . . . . . g.19827
| GTCACCAGTGCTTCCAACTGCACAGACTTCCAGAGCCGCCGCCTCCACATCATGTTCCAG c.1320
| V T S A S N C T D F Q S R R L H I M F Q p.440
. . . | 16 . . . g.20017
ACCGAGGCTGGGGAGCTGCAGTTTGCCCACACG | GTGAACGCCACCGCCTGTGCTGTCCCC c.1380
T E A G E L Q F A H T | V N A T A C A V P p.460
. . . | 17 . . g.20168
CGCCTTCTCATCGCGCTCCTGGAGAGTAACCAGCAGAAG | GACGGCTCAGTGCTCGTGCCC c.1440
R L L I A L L E S N Q Q K | D G S V L V P p.480
. . . . . . g.20228
CCTGCCCTCCAGTCCTACCTCGGCACTGATCGGATCACAGCCCCTACCCACGTGCCTCTC c.1500
P A L Q S Y L G T D R I T A P T H V P L p.500
. . . . . . g.20288
CAGTACATCGGCCCCAACCAGCCCCGGAAGCCTGGGCTGCCTGGCCAGCCTGCTGTAAGC c.1560
Q Y I G P N Q P R K P G L P G Q P A V S p.520
g.20291
TAA c.1563
X p.520
. . . . . . g.20351
gaacccacccacagcagccctcgggggtgtcactgcttcctggagttcaggagaccccgg c.*60
. . . . . . g.20411
acacctgggacctgtgttgctgagcccgtcctgacatctgtgttcttcctgtcagctcca c.*120
. . . . . . g.20471
cgcccgggcccctggaccacggggtccacctctcctctgtccttgctgcctcagagtcag c.*180
. . . . . . g.20531
tcactgaccctgttatcattgagggtcccagtgggaagcaggacgtctgggctttacggt c.*240
. . . . . . g.20591
tctagggacaggagaagcagaggaagaggcttccatccctccttccttctttcctcctac c.*300
. . . . g.20633
agtgctgagcaaaaagtccccaataaatggtcaggacaaagg c.*342
(downstream sequence)
Legend:
Powered by LOVD v.3.0 Build 15
©2004-2016 Leiden University Medical Center