Skip to content

Commit 399dbed

Browse files
authored
Merge pull request #454 from nextstrain/bdbv-update
bdbv: update tree and qc params
2 parents ac26df5 + 010ad58 commit 399dbed

12 files changed

Lines changed: 3368 additions & 22 deletions

File tree

data/nextstrain/orthoebolavirus/bdbv/CHANGELOG.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,9 @@
1+
## Unreleased
2+
3+
- adjust QC param settings to reudce private mutation threshold (outbreak genomes should be very similar)
4+
- add SNP cluster QC rule to trigger on stretches of high private mutation density
5+
- update tree
6+
17
## 2026-05-18T20:09:34Z
28

39
- Add outbreak annotation

data/nextstrain/orthoebolavirus/bdbv/pathogen.json

Lines changed: 11 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -33,7 +33,7 @@
3333
"frameShifts": {
3434
"enabled": true,
3535
"ignoredFrameShifts": [],
36-
"scoreWeight": 20
36+
"scoreWeight": 50
3737
},
3838
"missingData": {
3939
"enabled": true,
@@ -42,21 +42,21 @@
4242
},
4343
"mixedSites": {
4444
"enabled": true,
45-
"mixedSitesThreshold": 40
45+
"mixedSitesThreshold": 20
4646
},
4747
"privateMutations": {
48-
"cutoff": 300,
48+
"cutoff": 10,
4949
"enabled": true,
50-
"typical": 50,
51-
"weightLabeledSubstitutions": 6,
52-
"weightReversionSubstitutions": 6,
50+
"typical": 3,
51+
"weightLabeledSubstitutions": 3,
52+
"weightReversionSubstitutions": 3,
5353
"weightUnlabeledSubstitutions": 1
5454
},
5555
"snpClusters": {
56-
"clusterCutOff": 10,
57-
"enabled": false,
58-
"scoreWeight": 10,
59-
"windowSize": 100
56+
"clusterCutOff": 3,
57+
"enabled": true,
58+
"scoreWeight": 50,
59+
"windowSize": 50
6060
},
6161
"stopCodons": {
6262
"enabled": true,
@@ -67,13 +67,5 @@
6767
"schemaVersion": "3.0.0",
6868
"shortcuts": [
6969
"nextstrain/ebola/bdbv"
70-
],
71-
"version": {
72-
"updatedAt": "2026-04-14T11:55:23Z",
73-
"tag": "2026-04-14--11-55-23Z",
74-
"compatibility": {
75-
"cli": "3.0.0-alpha.0",
76-
"web": "3.0.0-alpha.0"
77-
}
78-
}
70+
]
7971
}

data/nextstrain/orthoebolavirus/bdbv/tree.json

Lines changed: 1 addition & 1 deletion
Large diffs are not rendered by default.

data_output/index.json

Lines changed: 9 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2343,10 +2343,18 @@
23432343
"missingData",
23442344
"mixedSites",
23452345
"privateMutations",
2346+
"snpClusters",
23462347
"stopCodons"
23472348
]
23482349
},
23492350
"versions": [
2351+
{
2352+
"tag": "unreleased",
2353+
"compatibility": {
2354+
"cli": "3.0.0-alpha.0",
2355+
"web": "3.0.0-alpha.0"
2356+
}
2357+
},
23502358
{
23512359
"updatedAt": "2026-05-18T20:09:34Z",
23522360
"tag": "2026-05-18--20-09-34Z",
@@ -2365,8 +2373,7 @@
23652373
}
23662374
],
23672375
"version": {
2368-
"updatedAt": "2026-05-18T20:09:34Z",
2369-
"tag": "2026-05-18--20-09-34Z",
2376+
"tag": "unreleased",
23702377
"compatibility": {
23712378
"cli": "3.0.0-alpha.0",
23722379
"web": "3.0.0-alpha.0"
Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,16 @@
1+
## Unreleased
2+
3+
- adjust QC param settings to reudce private mutation threshold (outbreak genomes should be very similar)
4+
- add SNP cluster QC rule to trigger on stretches of high private mutation density
5+
- update tree
6+
7+
## 2026-05-18T20:09:34Z
8+
9+
- Add outbreak annotation
10+
- add GP_003:367 to known stop codons
11+
- Include 2026 genomes
12+
13+
14+
## 2026-05-15T16:16:45Z
15+
16+
Initial release of this dataset.
Lines changed: 15 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,15 @@
1+
# Nextclade dataset for Bundibugyo virus (Orthoebolavirus bundibugyoense)
2+
3+
| Key | Value |
4+
| ---------------------- | ------------------------------------------------------------------------------- |
5+
| authors | [Richard Neher](https://neherlab.org) |
6+
| data source | Genbank |
7+
| nextclade dataset path | nextstrain/orthoebolavirus/bdbv |
8+
| annotation | [NC_014373.1](https://www.ncbi.nlm.nih.gov/nuccore/NC_014373.1) |
9+
10+
This Nextclade dataset for Bundibugyo virus [(Orthoebolavirus bundibugyoense)](https://ictv.global/report/chapter/filoviridae/filoviridae/orthoebolavirus) aligns to the reference sequence [NC_014373.1](https://www.ncbi.nlm.nih.gov/nuccore/NC_014373) and translates major CDS. It scores the sequence with respect to unexpected frameshifts or stop codons, missing sequence (in form of `NNN`s) and mixed bases.
11+
12+
Data from the 2026 outbreak were generously shared by the groups of Prof. Placide Mbala-Kingebeni (INRB, DRC) and Dr Isaac Ssewanyana (CPHL, Uganda) to facilitate the public health response and containment of the virus. These data are described in a post on [Virological.org](https://virological.org/t/initial-genomes-from-may-2026-bundibugyo-virus-disease-outbreak-in-the-democratic-republic-of-the-congo-and-uganda/1032) and were deposited in Pathoplexus under [Restricted Data-Use terms](https://pathoplexus.org/about/terms-of-use/restricted-data). Please consult the authors and the [data-use terms](https://pathoplexus.org/about/terms-of-use/restricted-data) before using these sequences.
13+
14+
15+
Binary file not shown.

0 commit comments

Comments
 (0)