You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This dataset uses the historical G-10 prototype sequence ([U05876.1](https://www.ncbi.nlm.nih.gov/nuccore/U05876)), which may differ from contemporary global CVA16 strains. It is intended for broad subgenogroup classification, mutation quality control, and phylogenetic analysis of CVA16 diversity.
15
+
16
+
*Note: The G-10 reference differs substantially from currently circulating strains.* This is common for enterovirus datasets, in contrast to some other virus datasets (e.g., seasonal influenza), where the reference is updated more frequently to reflect recent lineages.
17
+
18
+
To address this, the dataset is *rooted* on a Static Inferred Ancestor — a phylogenetically reconstructed ancestral sequence near the tree root. This provides a stable reference point that can be used, optionally, as an alternative for mutation calling.
19
+
20
+
## Features
21
+
22
+
This dataset supports:
23
+
24
+
- Assignment of subgenotypes
25
+
- Phylogenetic placement
26
+
- Sequence quality control (QC)
27
+
28
+
## Subgenogroups of Coxsackievirus A16
29
+
30
+
Subgenogroups B1a, B1b and B1c are the major phylogenetic divisions of CVA16 and are commonly used in virological surveillance and literature. They are defined by phylogenetic clustering and do not necessarily indicate antigenic differences. In recent years, recombinant forms were identified and labeled C-F (also known as B2, B3, and D). These recombinant forms cluster with the prototype strain, clade A.
31
+
32
+
These designations are based on the phylogenetic structure and mutations, and are widely used in molecular epidemiology, similar to subgenotype systems for other enteroviruses. Unlike influenza (H1N1, H3N2) or SARS-CoV-2, there is no universal, standardized global lineage nomenclature for enteroviruses. Naming follows conventions from published studies and surveillance practices.
33
+
34
+
## Reference types
35
+
36
+
This dataset includes several reference points used in analyses:
37
+
-*Reference:* RefSeq or similarly established reference sequence. Here G-10.
38
+
39
+
-*Parent:* The nearest ancestral node of a sample in the tree, used to infer branch-specific mutations.
40
+
41
+
-*Clade founder:* The inferred ancestral node defining a clade (e.g., B1a, B2). Mutations "since clade founder" describe changes that define that clade.
42
+
43
+
-*Static Inferred Ancestor:* Reconstructed ancestral sequence inferred with an outgroup, representing the likely founder of CVA16. Serves as a stable reference.
44
+
45
+
-*Tree root:* Corresponds to the root of the tree, it may change in future updates as more data become available.
46
+
47
+
All references use the coordinate system of the G-10 sequence.
48
+
49
+
## Issues & Contact
50
+
- For questions or suggestions, please [open an issue](https://github.com/enterovirus-phylo/nextclade_a16/issues) or email: eve-group[at]swisstph.ch
51
+
52
+
## What is a Nextclade dataset?
53
+
54
+
A Nextclade dataset includes the reference sequence, genome annotations, tree, clade definitions, and QC rules. Learn more in the [Nextclade documentation](https://docs.nextstrain.org/projects/nextclade/en/stable/user/datasets.html).
0 commit comments