Skip to content

Commit e35a0b6

Browse files
Ebola Zaire first attempt
chore: rebuild [skip ci] Add Ebola Sudan chore: rebuild [skip ci] Fewer examples, adapted readme chore: rebuild [skip ci] test-wnv
1 parent 0400bd4 commit e35a0b6

29 files changed

Lines changed: 5679 additions & 221 deletions

data/nextstrain/collection.json

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -51,6 +51,10 @@
5151
"nextstrain/flu/h3n2/pa",
5252
"nextstrain/flu/h1n1pdm/pb2",
5353
"nextstrain/flu/h1n1pdm/pb1",
54-
"nextstrain/flu/h3n2/pb2"
54+
"nextstrain/flu/h3n2/pb2",
55+
"nextstrain/ebola/zaire",
56+
"nextstrain/ebola/sudan",
57+
"nextstrain/wnv/lineage-2",
58+
"nextstrain/wnv/lineage-1"
5559
]
5660
}
Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
## Unreleased
2+
3+
Initial release of this dataset.
Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,9 @@
1+
# Nextclade dataset for "Sudan Ebolavirus"
2+
3+
| Key | Value |
4+
| ---------------------- | ------------------------------------------------------------------------------- |
5+
| authors | [Cornelius Roemer](https://neherlab.org), [Richard Neher](https://neherlab.org) |
6+
| data source | Genbank |
7+
| nextclade dataset path | nextstrain/ebola/sudan |
8+
| annotation | [NC_006432.1](https://www.ncbi.nlm.nih.gov/nuccore/NC_006432) |
9+
| related datasets | Zaire Ebola virus: `nextstrain/ebola/zaire` |

data/nextstrain/wnv/lineage-1/examples.fasta

Lines changed: 317 additions & 0 deletions
Large diffs are not rendered by default.
Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,23 @@
1+
##gff-version 3
2+
#!gff-spec-version 1.21
3+
#!processor NCBI annotwriter
4+
##sequence-region NC_009942.1 1 11029
5+
##species https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?id=11082
6+
NC_009942.1 RefSeq region 1 11029 . + . ID=NC_009942.1:1..11029;Dbxref=taxon:11082;country=USA;gb-acronym=WNV;gbkey=Src;genome=genomic;isolate=385-99;mol_type=genomic RNA;note=lineage 1%3B Vero cell passage 2 after isolation;strain=NY99
7+
NC_009942.1 RefSeq gene 97 10395 . + . gene=POLY;ID=gene-WNVNY99_gp1;gbkey=Prot;product=anchored capsid protein ancC;protein_id=YP_005097850.1
8+
NC_009942.1 RefSeq CDS 6916 7311 . + . ID=id-YP_009246337;Parent=gene-WNVNY99_gp1;gene=NS4BWARF4;Note=The mature peptide consists of the N-terminal portion of NS4B and the translation of the WARF4 open reading frame%2C which is only found in lineage 1 of West Nile virus%3B The existence of the protein has been shown indirectly%3B WARF4 ORF has been added and annotated with the kind help of Dr. David Karlin (Department of Zoology%2C University of Oxford%2C UK);gbkey=Prot;product=N-NS4B/WARF4;protein_id=YP_009246339.1
9+
NC_009942.1 RefSeq CDS 7311 7754 . + . ID=id-YP_009246337;Parent=gene-WNVNY99_gp1;gene=NS4BWARF4;Note=The mature peptide consists of the N-terminal portion of NS4B and the translation of the WARF4 open reading frame%2C which is only found in lineage 1 of West Nile virus%3B The existence of the protein has been shown indirectly%3B WARF4 ORF has been added and annotated with the kind help of Dr. David Karlin (Department of Zoology%2C University of Oxford%2C UK);gbkey=Prot;product=N-NS4B/WARF4;protein_id=YP_009246339.1
10+
NC_009942.1 RefSeq CDS 2470 3552 . + . ID=id-YP_006485883;Parent=gene-WNVNY99_gp1;gene=NS1prime;Note=NS1'%3B transframe fusion protein expressed via programmed ribosomal frameshifting;gbkey=Prot;product=nonstructural protein NS1 prime;protein_id=YP_009246338.1
11+
NC_009942.1 RefSeq CDS 3552 3680 . + . ID=id-YP_006485883;Parent=gene-WNVNY99_gp1;gene=NS1prime;Note=NS1'%3B transframe fusion protein expressed via programmed ribosomal frameshifting;gbkey=Prot;product=nonstructural protein NS1 prime;protein_id=YP_009246338.1
12+
NC_009942.1 RefSeq CDS 97 465 . + . gene=ancC;Parent=gene-WNVNY99_gp1;gbkey=Prot;product=anchored capsid protein ancC;protein_id=YP_005097850.1
13+
NC_009942.1 RefSeq CDS 466 741 . + . gene=pr;Parent=gene-WNVNY99_gp1;gbkey=Prot;product=protein pr;protein_id=YP_009164953.1
14+
NC_009942.1 RefSeq CDS 742 966 . + . gene=M;Parent=gene-WNVNY99_gp1;gbkey=Prot;product=membrane glycoprotein M;protein_id=YP_001527879.1
15+
NC_009942.1 RefSeq CDS 967 2469 . + . gene=E;Parent=gene-WNVNY99_gp1;gbkey=Prot;product=envelope protein E;protein_id=YP_001527880.1
16+
NC_009942.1 RefSeq CDS 2470 3525 . + . gene=NS1;Parent=gene-WNVNY99_gp1;gbkey=Prot;product=nonstructural protein NS1;protein_id=YP_001527881.1
17+
NC_009942.1 RefSeq CDS 3526 4218 . + . gene=NS2A;Parent=gene-WNVNY99_gp1;gbkey=Prot;product=nonstructural protein NS2A;protein_id=YP_001527882.1
18+
NC_009942.1 RefSeq CDS 4219 4611 . + . gene=NS2B;Parent=gene-WNVNY99_gp1;gbkey=Prot;product=nonstructural protein NS2B;protein_id=YP_001527883.1
19+
NC_009942.1 RefSeq CDS 4612 6468 . + . gene=NS3;Parent=gene-WNVNY99_gp1;gbkey=Prot;product=nonstructural protein NS3;protein_id=YP_001527884.1
20+
NC_009942.1 RefSeq CDS 6469 6846 . + . gene=NS4A;Parent=gene-WNVNY99_gp1;gbkey=Prot;product=nonstructural protein NS4A;protein_id=YP_001527885.1
21+
NC_009942.1 RefSeq CDS 6847 6915 . + . gene=2K;Parent=gene-WNVNY99_gp1;gbkey=Prot;product=protein 2K;protein_id=YP_001527885.1
22+
NC_009942.1 RefSeq CDS 6916 7680 . + . gene=NS4B;Parent=gene-WNVNY99_gp1;gbkey=Prot;product=nonstructural protein NS4B;protein_id=YP_001527886.1
23+
NC_009942.1 RefSeq CDS 7681 10395 . + . gene=NS5;Parent=gene-WNVNY99_gp1;gbkey=Prot;product=RNA-dependent RNA polymerase NS5;protein_id=YP_001527887.1
Lines changed: 71 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,71 @@
1+
{
2+
"alignmentParams": {
3+
"excessBandwidth": 100,
4+
"terminalBandwidth": 300,
5+
"allowedMismatches": 10,
6+
"windowSize": 40,
7+
"minSeedCover": 0.03,
8+
"gapAlignmentSide": "left"
9+
},
10+
"attributes": {
11+
"name": "West Nile Virus Lineage 1",
12+
"reference accession": "NC_009942.1",
13+
"reference name": "NY99"
14+
},
15+
"compatibility": {
16+
"cli": "3.0.0-alpha.0",
17+
"web": "3.0.0-alpha.0"
18+
},
19+
"deprecated": false,
20+
"enabled": true,
21+
"experimental": true,
22+
"files": {
23+
"changelog": "CHANGELOG.md",
24+
"examples": "examples.fasta",
25+
"genomeAnnotation": "genome_annotation.gff3",
26+
"pathogenJson": "pathogen.json",
27+
"readme": "README.md",
28+
"reference": "reference.fasta"
29+
},
30+
"official": true,
31+
"qc": {
32+
"frameShifts": {
33+
"enabled": true,
34+
"ignoredFrameShifts": [
35+
],
36+
"scoreWeight": 20
37+
},
38+
"missingData": {
39+
"enabled": true,
40+
"missingDataThreshold": 3000,
41+
"scoreBias": 500
42+
},
43+
"mixedSites": {
44+
"enabled": true,
45+
"mixedSitesThreshold": 40
46+
},
47+
"privateMutations": {
48+
"cutoff": 300,
49+
"enabled": true,
50+
"typical": 50,
51+
"weightLabeledSubstitutions": 6,
52+
"weightReversionSubstitutions": 6,
53+
"weightUnlabeledSubstitutions": 1
54+
},
55+
"snpClusters": {
56+
"clusterCutOff": 10,
57+
"enabled": false,
58+
"scoreWeight": 10,
59+
"windowSize": 100
60+
},
61+
"stopCodons": {
62+
"enabled": true,
63+
"ignoredStopCodons": [
64+
],
65+
"scoreWeight": 20
66+
}
67+
},
68+
"schemaVersion": "3.0.0",
69+
"shortcuts": [
70+
]
71+
}
Lines changed: 159 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,159 @@
1+
>NC_009942.1 West Nile virus lineage 1, complete genome
2+
AGTAGTTCGCCTGTGTGAGCTGACAAACTTAGTAGTGTTTGTGAGGATTAACAACAATTAACACAGTGCG
3+
AGCTGTTTCTTAGCACGAAGATCTCGATGTCTAAGAAACCAGGAGGGCCCGGCAAGAGCCGGGCTGTCAA
4+
TATGCTAAAACGCGGAATGCCCCGCGTGTTGTCCTTGATTGGACTGAAGAGGGCTATGTTGAGCCTGATC
5+
GACGGCAAGGGGCCAATACGATTTGTGTTGGCTCTCTTGGCGTTCTTCAGGTTCACAGCAATTGCTCCGA
6+
CCCGAGCAGTGCTGGATCGATGGAGAGGTGTGAACAAACAAACAGCGATGAAACACCTTCTGAGTTTTAA
7+
GAAGGAACTAGGGACCTTGACCAGTGCTATCAATCGGCGGAGCTCAAAACAAAAGAAAAGAGGAGGAAAG
8+
ACCGGAATTGCAGTCATGATTGGCCTGATCGCCAGCGTAGGAGCAGTTACCCTCTCTAACTTCCAAGGGA
9+
AGGTGATGATGACGGTAAATGCTACTGACGTCACAGATGTCATCACGATTCCAACAGCTGCTGGAAAGAA
10+
CCTATGCATTGTCAGAGCAATGGATGTGGGATACATGTGCGATGATACTATCACTTATGAATGCCCAGTA
11+
CTGTCGGCTGGTAATGATCCAGAAGACATCGACTGTTGGTGCACAAAGTCAGCAGTCTACGTCAGGTATG
12+
GAAGATGCACCAAGACACGCCACTCAAGACGCAGTCGGAGGTCACTGACAGTGCAGACACACGGAGAAAG
13+
CACTCTAGCGAACAAGAAGGGGGCTTGGATGGACAGCACCAAGGCCACAAGGTATTTGGTAAAAACAGAA
14+
TCATGGATCTTGAGGAACCCTGGATATGCCCTGGTGGCAGCCGTCATTGGTTGGATGCTTGGGAGCAACA
15+
CCATGCAGAGAGTTGTGTTTGTCGTGCTATTGCTTTTGGTGGCCCCAGCTTACAGCTTCAACTGCCTTGG
16+
AATGAGCAACAGAGACTTCTTGGAAGGAGTGTCTGGAGCAACATGGGTGGATTTGGTTCTCGAAGGCGAC
17+
AGCTGCGTGACTATCATGTCTAAGGACAAGCCTACCATCGATGTGAAGATGATGAATATGGAGGCGGCCA
18+
ACCTGGCAGAGGTCCGCAGTTATTGCTATTTGGCTACCGTCAGCGATCTCTCCACCAAAGCTGCGTGCCC
19+
GACCATGGGAGAAGCTCACAATGACAAACGTGCTGACCCAGCTTTTGTGTGCAGACAAGGAGTGGTGGAC
20+
AGGGGCTGGGGCAACGGCTGCGGACTATTTGGCAAAGGAAGCATTGACACATGCGCCAAATTTGCCTGCT
21+
CTACCAAGGCAATAGGAAGAACCATCTTGAAAGAGAATATCAAGTACGAAGTGGCCATTTTTGTCCATGG
22+
ACCAACTACTGTGGAGTCGCACGGAAACTACTCCACACAGGTTGGAGCCACTCAGGCAGGGAGACTCAGC
23+
ATCACTCCTGCGGCGCCTTCATACACACTAAAGCTTGGAGAATATGGAGAGGTGACAGTGGACTGTGAAC
24+
CACGGTCAGGGATTGACACCAATGCATACTACGTGATGACTGTTGGAACAAAGACGTTCTTGGTCCATCG
25+
TGAGTGGTTCATGGACCTCAACCTCCCTTGGAGCAGTGCTGGAAGTACTGTGTGGAGGAACAGAGAGACG
26+
TTAATGGAGTTTGAGGAACCACACGCCACGAAGCAGTCTGTGATAGCATTGGGCTCACAAGAGGGAGCTC
27+
TGCATCAAGCTTTGGCTGGAGCCATTCCTGTGGAATTTTCAAGCAACACTGTCAAGTTGACGTCGGGTCA
28+
TTTGAAGTGTAGAGTGAAGATGGAAAAATTGCAGTTGAAGGGAACAACCTATGGCGTCTGTTCAAAGGCT
29+
TTCAAGTTTCTTGGGACTCCCGCAGACACAGGTCACGGCACTGTGGTGTTGGAATTGCAGTACACTGGCA
30+
CGGATGGACCTTGCAAAGTTCCTATCTCGTCAGTGGCTTCATTGAACGACCTAACGCCAGTGGGCAGATT
31+
GGTCACTGTCAACCCTTTTGTTTCAGTGGCCACGGCCAACGCTAAGGTCCTGATTGAATTGGAACCACCC
32+
TTTGGAGACTCATACATAGTGGTGGGCAGAGGAGAACAACAGATCAATCACCATTGGCACAAGTCTGGAA
33+
GCAGCATTGGCAAAGCCTTTACAACCACCCTCAAAGGAGCGCAGAGACTAGCCGCTCTAGGAGACACAGC
34+
TTGGGACTTTGGATCAGTTGGAGGGGTGTTCACCTCAGTTGGGAAGGCTGTCCATCAAGTGTTCGGAGGA
35+
GCATTCCGCTCACTGTTCGGAGGCATGTCCTGGATAACGCAAGGATTGCTGGGGGCTCTCCTGTTGTGGA
36+
TGGGCATCAATGCTCGTGATAGGTCCATAGCTCTCACGTTTCTCGCAGTTGGAGGAGTTCTGCTCTTCCT
37+
CTCCGTGAACGTGCACGCTGACACTGGGTGTGCCATAGACATCAGCCGGCAAGAGCTGAGATGTGGAAGT
38+
GGAGTGTTCATACACAATGATGTGGAGGCTTGGATGGACCGGTACAAGTATTACCCTGAAACGCCACAAG
39+
GCCTAGCCAAGATCATTCAGAAAGCTCATAAGGAAGGAGTGTGCGGTCTACGATCAGTTTCCAGACTGGA
40+
GCATCAAATGTGGGAAGCAGTGAAGGACGAGCTGAACACTCTTTTGAAGGAGAATGGTGTGGACCTTAGT
41+
GTCGTGGTTGAGAAACAGGAGGGAATGTACAAGTCAGCACCTAAACGCCTCACCGCCACCACGGAAAAAT
42+
TGGAAATTGGCTGGAAGGCCTGGGGAAAGAGTATTTTATTTGCACCAGAACTCGCCAACAACACCTTTGT
43+
GGTTGATGGTCCGGAGACCAAGGAATGTCCGACTCAGAATCGCGCTTGGAATAGCTTAGAAGTGGAGGAT
44+
TTTGGATTTGGTCTCACCAGCACTCGGATGTTCCTGAAGGTCAGAGAGAGCAACACAACTGAATGTGACT
45+
CGAAGATCATTGGAACGGCTGTCAAGAACAACTTGGCGATCCACAGTGACCTGTCCTATTGGATTGAAAG
46+
CAGGCTCAATGATACGTGGAAGCTTGAAAGGGCAGTTCTGGGTGAAGTCAAATCATGTACGTGGCCTGAG
47+
ACGCATACCTTGTGGGGCGATGGAATCCTTGAGAGTGACTTGATAATACCAGTCACACTGGCGGGACCAC
48+
GAAGCAATCACAATCGGAGACCTGGGTACAAGACACAAAACCAGGGCCCATGGGACGAAGGCCGGGTAGA
49+
GATTGACTTCGATTACTGCCCAGGAACTACGGTCACCCTGAGTGAGAGCTGCGGACACCGTGGACCTGCC
50+
ACTCGCACCACCACAGAGAGCGGAAAGTTGATAACAGATTGGTGCTGCAGGAGCTGCACCTTACCACCAC
51+
TGCGCTACCAAACTGACAGCGGCTGTTGGTATGGTATGGAGATCAGACCACAGAGACATGATGAAAAGAC
52+
CCTCGTGCAGTCACAAGTGAATGCTTATAATGCTGATATGATTGACCCTTTTCAGTTGGGCCTTCTGGTC
53+
GTGTTCTTGGCCACCCAGGAGGTCCTTCGCAAGAGGTGGACAGCCAAGATCAGCATGCCAGCTATACTGA
54+
TTGCTCTGCTAGTCCTGGTGTTTGGGGGCATTACTTACACTGATGTGTTACGCTATGTCATCTTGGTGGG
55+
GGCAGCTTTCGCAGAATCTAATTCGGGAGGAGACGTGGTACACTTGGCGCTCATGGCGACCTTCAAGATA
56+
CAACCAGTGTTTATGGTGGCATCGTTTCTCAAAGCGAGATGGACCAACCAGGAGAACATTTTGTTGATGT
57+
TGGCGGCTGTTTTCTTTCAAATGGCTTATCACGATGCCCGCCAAATTCTGCTCTGGGAGATCCCTGATGT
58+
GTTGAATTCACTGGCGGTAGCTTGGATGATACTGAGAGCCATAACATTCACAACGACATCAAACGTGGTT
59+
GTTCCGCTGCTAGCCCTGCTAACACCCGGGCTGAGATGCTTGAATCTGGATGTGTACAGGATACTGCTGT
60+
TGATGGTCGGAATAGGCAGCTTGATCAGGGAGAAGAGGAGTGCAGCCGCAAAAAAGAAAGGAGCAAGTCT
61+
GCTATGCTTGGCTCTAGCCTCAACAGGACTTTTCAACCCCATGATCCTTGCTGCTGGACTGATTGCATGT
62+
GATCCCAACCGTAAACGCGGATGGCCCGCAACTGAAGTGATGACAGCTGTCGGCCTAATGTTTGCCATCG
63+
TCGGAGGGCTGGCAGAGCTTGACATTGACTCCATGGCCATTCCAATGACTATCGCGGGGCTCATGTTTGC
64+
TGCTTTCGTGATTTCTGGGAAATCAACAGATATGTGGATTGAGAGAACGGCGGACATTTCCTGGGAAAGT
65+
GATGCAGAAATTACAGGCTCGAGCGAAAGAGTTGATGTGCGGCTTGATGATGATGGAAACTTCCAGCTCA
66+
TGAATGATCCAGGAGCACCTTGGAAGATATGGATGCTCAGAATGGTCTGTCTCGCGATTAGTGCGTACAC
67+
CCCCTGGGCAATCTTGCCCTCAGTAGTTGGATTTTGGATAACTCTCCAATACACAAAGAGAGGAGGCGTG
68+
TTGTGGGACACTCCCTCACCAAAGGAGTACAAAAAGGGGGACACGACCACCGGCGTCTACAGGATCATGA
69+
CTCGTGGGCTGCTCGGCAGTTATCAAGCAGGAGCGGGCGTGATGGTTGAAGGTGTTTTCCACACCCTTTG
70+
GCATACAACAAAAGGAGCCGCTTTGATGAGCGGAGAGGGCCGCCTGGACCCATACTGGGGCAGTGTCAAG
71+
GAGGATCGACTTTGTTACGGAGGACCCTGGAAATTGCAGCACAAGTGGAACGGGCAGGATGAGGTGCAGA
72+
TGATTGTGGTGGAACCTGGCAAGAACGTTAAGAACGTCCAGACGAAACCAGGGGTGTTCAAAACACCTGA
73+
AGGAGAAATCGGGGCCGTGACTTTGGACTTCCCCACTGGAACATCAGGCTCACCAATAGTGGACAAAAAC
74+
GGTGATGTGATTGGGCTTTATGGCAATGGAGTCATAATGCCCAACGGCTCATACATAAGCGCGATAGTGC
75+
AGGGTGAAAGGATGGATGAGCCAATCCCAGCCGGATTCGAACCTGAGATGCTGAGGAAAAAACAGATCAC
76+
TGTACTGGATCTCCATCCCGGCGCCGGTAAAACAAGGAGGATTCTGCCACAGATCATCAAAGAGGCCATA
77+
AACAGAAGACTGAGAACAGCCGTGCTAGCGCCAACCAGGGTTGTGGCTGCTGAGATGGCTGAAGCACTGA
78+
GAGGACTGCCCATCCGGTACCAGACATCCGCAGTGCCCAGAGAACATAATGGAAATGAGATTGTTGATGT
79+
CATGTGTCATGCTACCCTCACCCACAGGCTGATGTCTCCTCACAGGGTGCCGAACTACAACCTGTTCGTG
80+
ATGGATGAGGCTCATTTCACCGACCCAGCTAGCATTGCAGCAAGAGGTTACATTTCCACAAAGGTCGAGC
81+
TAGGGGAGGCGGCGGCAATATTCATGACAGCCACCCCACCAGGCACTTCAGATCCATTCCCAGAGTCCAA
82+
TTCACCAATTTCCGACTTACAGACTGAGATCCCGGATCGAGCTTGGAACTCTGGATACGAATGGATCACA
83+
GAATACACCGGGAAGACGGTTTGGTTTGTGCCTAGTGTCAAGATGGGGAATGAGATTGCCCTTTGCCTAC
84+
AACGTGCTGGAAAGAAAGTAGTCCAATTGAACAGAAAGTCGTACGAGACGGAGTACCCAAAATGTAAGAA
85+
CGATGATTGGGACTTTGTTATCACAACAGACATATCTGAAATGGGGGCTAACTTCAAGGCGAGCAGGGTG
86+
ATTGACAGCCGGAAGAGTGTGAAACCAACCATCATAACAGAAGGAGAAGGGAGAGTGATCCTGGGAGAAC
87+
CATCTGCAGTGACAGCAGCTAGTGCCGCCCAGAGACGTGGACGTATCGGTAGAAATCCGTCGCAAGTTGG
88+
TGATGAGTACTGTTATGGGGGGCACACGAATGAAGACGACTCGAACTTCGCCCATTGGACTGAGGCACGA
89+
ATCATGCTGGACAACATCAACATGCCAAACGGACTGATCGCTCAATTCTACCAACCAGAGCGTGAGAAGG
90+
TATATACCATGGATGGGGAATACCGGCTCAGAGGAGAAGAGAGAAAAAACTTTCTGGAACTGTTGAGGAC
91+
TGCAGATCTGCCAGTTTGGCTGGCTTACAAGGTTGCAGCGGCTGGAGTGTCATACCACGACCGGAGGTGG
92+
TGCTTTGATGGTCCTAGGACAAACACAATTTTAGAAGACAACAACGAAGTGGAAGTCATCACGAAGCTTG
93+
GTGAAAGGAAGATTCTGAGGCCGCGCTGGATTGATGCCAGGGTGTACTCGGATCACCAGGCACTAAAGGC
94+
GTTCAAGGACTTCGCCTCGGGAAAACGTTCTCAGATAGGGCTCATTGAGGTTCTGGGAAAGATGCCTGAG
95+
CACTTCATGGGGAAGACATGGGAAGCACTTGACACCATGTACGTTGTGGCCACTGCAGAGAAAGGAGGAA
96+
GAGCTCACAGAATGGCCCTGGAGGAACTGCCAGATGCTCTTCAGACAATTGCCTTGATTGCCTTATTGAG
97+
TGTGATGACCATGGGAGTATTCTTCCTCCTCATGCAGCGGAAGGGCATTGGAAAGATAGGTTTGGGAGGC
98+
GCTGTCTTGGGAGTCGCGACCTTTTTCTGTTGGATGGCTGAAGTTCCAGGAACGAAGATCGCCGGAATGT
99+
TGCTGCTCTCCCTTCTCTTGATGATTGTGCTAATTCCTGAGCCAGAGAAGCAACGTTCGCAGACAGACAA
100+
CCAGCTAGCCGTGTTCCTGATTTGTGTCATGACCCTTGTGAGCGCAGTGGCAGCCAACGAGATGGGTTGG
101+
CTAGATAAGACCAAGAGTGACATAAGCAGTTTGTTTGGGCAAAGAATTGAGGTCAAGGAGAATTTCAGCA
102+
TGGGAGAGTTTCTTCTGGACTTGAGGCCGGCAACAGCCTGGTCACTGTACGCTGTGACAACAGCGGTCCT
103+
CACTCCACTGCTAAAGCATTTGATCACGTCAGATTACATCAACACCTCATTGACCTCAATAAACGTTCAG
104+
GCAAGTGCACTATTCACACTCGCGCGAGGCTTCCCCTTCGTCGATGTTGGAGTGTCGGCTCTCCTGCTAG
105+
CAGCCGGATGCTGGGGACAAGTCACCCTCACCGTTACGGTAACAGCGGCAACACTCCTTTTTTGCCACTA
106+
TGCCTACATGGTTCCCGGTTGGCAAGCTGAGGCAATGCGCTCAGCCCAGCGGCGGACAGCGGCCGGAATC
107+
ATGAAGAACGCTGTAGTGGATGGCATCGTGGCCACGGACGTCCCAGAATTAGAGCGCACCACACCCATCA
108+
TGCAGAAGAAAGTTGGACAGATCATGCTGATCTTGGTGTCTCTAGCTGCAGTAGTAGTGAACCCGTCTGT
109+
GAAGACAGTACGAGAAGCCGGAATTTTGATCACGGCCGCAGCGGTGACGCTTTGGGAGAATGGAGCAAGC
110+
TCTGTTTGGAACGCAACAACTGCCATCGGACTCTGCCACATCATGCGTGGGGGTTGGTTGTCATGTCTAT
111+
CCATAACATGGACACTCATAAAGAACATGGAAAAACCAGGACTAAAAAGAGGTGGGGCAAAAGGACGCAC
112+
CTTGGGAGAGGTTTGGAAAGAAAGACTCAACCAGATGACAAAAGAAGAGTTCACTAGGTACCGCAAAGAG
113+
GCCATCATCGAAGTCGATCGCTCAGCGGCAAAACACGCCAGGAAAGAAGGCAATGTCACTGGAGGGCATC
114+
CAGTCTCTAGGGGCACAGCAAAACTGAGATGGCTGGTCGAACGGAGGTTTCTCGAACCGGTCGGAAAAGT
115+
GATTGACCTTGGATGTGGAAGAGGCGGTTGGTGTTACTATATGGCAACCCAAAAAAGAGTCCAAGAAGTC
116+
AGAGGGTACACAAAGGGCGGTCCCGGACATGAAGAGCCCCAACTAGTGCAAAGTTATGGATGGAACATTG
117+
TCACCATGAAGAGTGGAGTGGATGTGTTCTACAGACCTTCTGAGTGTTGTGACACCCTCCTTTGTGACAT
118+
CGGAGAGTCCTCGTCAAGTGCTGAGGTTGAAGAGCATAGGACGATTCGGGTCCTTGAAATGGTTGAGGAC
119+
TGGCTGCACCGAGGGCCAAGGGAATTTTGCGTGAAGGTGCTCTGTCCCTACATGCCGAAAGTCATAGAGA
120+
AGATGGAGCTGCTCCAACGCCGGTATGGGGGGGGACTGGTCAGAAACCCACTCTCACGGAATTCCACGCA
121+
CGAGATGTATTGGGTGAGTCGAGCTTCAGGCAATGTGGTACATTCAGTGAATATGACCAGCCAGGTGCTC
122+
CTAGGAAGAATGGAAAAAAGGACCTGGAAGGGACCCCAATACGAGGAAGATGTAAACTTGGGAAGTGGAA
123+
CCAGGGCGGTGGGAAAACCCCTGCTCAACTCAGACACCAGTAAAATCAAGAACAGGATTGAACGACTCAG
124+
GCGTGAGTACAGTTCGACGTGGCACCACGATGAGAACCACCCATATAGAACCTGGAACTATCACGGCAGT
125+
TATGATGTGAAGCCCACAGGCTCCGCCAGTTCGCTGGTCAATGGAGTGGTCAGGCTCCTCTCAAAACCAT
126+
GGGACACCATCACGAATGTTACCACCATGGCCATGACTGACACTACTCCCTTCGGGCAGCAGCGAGTGTT
127+
CAAAGAGAAGGTGGACACGAAAGCTCCTGAACCGCCAGAAGGAGTGAAGTACGTGCTCAACGAGACCACC
128+
AACTGGTTGTGGGCGTTTTTGGCCAGAGAAAAACGTCCCAGAATGTGCTCTCGAGAGGAATTCATAAGAA
129+
AGGTCAACAGCAATGCAGCTTTGGGTGCCATGTTTGAAGAGCAGAATCAATGGAGGAGCGCCAGAGAAGC
130+
AGTTGAAGATCCAAAATTTTGGGAGATGGTGGATGAGGAGCGCGAGGCACATCTGCGGGGGGAATGTCAC
131+
ACTTGCATTTACAACATGATGGGAAAGAGAGAGAAAAAACCCGGAGAGTTCGGAAAGGCCAAGGGAAGCA
132+
GAGCCATTTGGTTCATGTGGCTCGGAGCTCGCTTTCTGGAGTTCGAGGCTCTGGGTTTTCTCAATGAAGA
133+
CCACTGGCTTGGAAGAAAGAACTCAGGAGGAGGTGTCGAGGGCTTGGGCCTCCAAAAACTGGGTTACATC
134+
CTGCGTGAAGTTGGCACCCGGCCTGGGGGCAAGATCTATGCTGATGACACAGCTGGCTGGGACACCCGCA
135+
TCACGAGAGCTGACTTGGAAAATGAAGCTAAGGTGCTTGAGCTGCTTGATGGGGAACATCGGCGTCTTGC
136+
CAGGGCCATCATTGAGCTCACCTATCGTCACAAAGTTGTGAAAGTGATGCGCCCGGCTGCTGATGGAAGA
137+
ACCGTCATGGATGTTATCTCCAGAGAAGATCAGAGGGGGAGTGGACAAGTTGTCACCTACGCCCTAAACA
138+
CTTTCACCAACCTGGCCGTCCAGCTGGTGAGGATGATGGAAGGGGAAGGAGTGATTGGCCCAGATGATGT
139+
GGAGAAACTCACAAAAGGGAAAGGACCCAAAGTCAGGACCTGGCTGTTTGAGAATGGGGAAGAAAGACTC
140+
AGCCGCATGGCTGTCAGTGGAGATGACTGTGTGGTAAAGCCCCTGGACGATCGCTTTGCCACCTCGCTCC
141+
ACTTCCTCAATGCTATGTCAAAGGTTCGCAAAGACATCCAAGAGTGGAAACCGTCAACTGGATGGTATGA
142+
TTGGCAGCAGGTTCCATTTTGCTCAAACCATTTCACTGAATTGATCATGAAAGATGGAAGAACACTGGTG
143+
GTTCCATGCCGAGGACAGGATGAATTGGTAGGCAGAGCTCGCATATCTCCAGGGGCCGGATGGAACGTCC
144+
GCGACACTGCTTGTCTGGCTAAGTCTTATGCCCAGATGTGGCTGCTTCTGTACTTCCACAGAAGAGACCT
145+
GCGGCTCATGGCCAACGCCATTTGCTCCGCTGTCCCTGTGAATTGGGTCCCTACCGGAAGAACCACGTGG
146+
TCCATCCATGCAGGAGGAGAGTGGATGACAACAGAGGACATGTTGGAGGTCTGGAACCGTGTTTGGATAG
147+
AGGAGAATGAATGGATGGAAGACAAAACCCCAGTGGAGAAATGGAGTGACGTCCCATATTCAGGAAAACG
148+
AGAGGACATCTGGTGTGGCAGCCTGATTGGCACAAGAGCCCGAGCCACGTGGGCAGAAAACATCCAGGTG
149+
GCTATCAACCAAGTCAGAGCAATCATCGGAGATGAGAAGTATGTGGACTACATGAGTTCACTAAAGAGAT
150+
ATGAAGACACAACTTTGGTTGAGGACACAGTACTGTAGATATTTAATCAATTGTAAATAGACAATATAAG
151+
TATGCATAAAAGTGTAGTTTTATAGTAGTATTTAGTGGTGTTAGTGTAAATAGTTAAGAAAATTTTGAGG
152+
AGAAAGTCAGGCCGGGAAGTTCCCGCCACCGGAAGTTGAGTAGACGGTGCTGCCTGCGACTCAACCCCAG
153+
GAGGACTGGGTGAACAAAGCCGCGAAGTGATCCATGTAAGCCCTCAGAACCGTCTCGGAAGGAGGACCCC
154+
ACATGTTGTAACTTCAAAGCCCAATGTCAGACCACGCTACGGCGTGCTACTCTGCGGAGAGTGCAGTCTG
155+
CGATAGTGCCCCAGGAGGACTGGGTTAACAAAGGCAAACCAACGCCCCACGCGGCCCTAGCCCCGGTAAT
156+
GGTGTTAACCAGGGCGAAAGGACTAGAGGTTAGAGGAGACCCCGCGGTTTAAAGTGCACGGCCCAGCCTG
157+
GCTGAAGCTGTAGGTCAGGGGAAGGACTAGAGGTTAGTGGAGACCCCGTGCCACAAAACACCACAACAAA
158+
ACAGCATATTGACACCTGGGATAGACTAGGAGATCTTCTGCTCTGCACAACCAGCCACACGGCACAGTGC
159+
GCCGACAATGGTGGCTGGTGGTGCGAGAACACAGGATCT
Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
## Unreleased
2+
3+
Initial release of this dataset.
Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,9 @@
1+
# Nextclade dataset for "Zaire Ebolavirus"
2+
3+
| Key | Value |
4+
| ---------------------- | ------------------------------------------------------------------------------- |
5+
| authors | [Cornelius Roemer](https://neherlab.org), [Richard Neher](https://neherlab.org) |
6+
| data source | Genbank |
7+
| nextclade dataset path | nextstrain/ebola/zaire |
8+
| annotation | [NC_002549.1](https://www.ncbi.nlm.nih.gov/nuccore/NC_002549) |
9+
| related datasets | Sudan Ebolavirus: `nextstrain/ebola/zaire` |

0 commit comments

Comments
 (0)