Skip to content

Commit 56ff3bc

Browse files
authored
B Fu and K Woo Pub update
1 parent a5fd3b1 commit 56ff3bc

2 files changed

Lines changed: 61 additions & 33 deletions

File tree

publication-data.json

Lines changed: 61 additions & 33 deletions
Original file line numberDiff line numberDiff line change
@@ -34,6 +34,67 @@
3434
}
3535
],
3636
"publications": [
37+
{
38+
"author": [
39+
{
40+
"name": "Fu BJ",
41+
"lab_member": true
42+
},
43+
{
44+
"name": "Viswanadham VV",
45+
"lab_member": true
46+
},
47+
{
48+
"name": "Maziec D",
49+
"lab_member": true
50+
},
51+
{
52+
"name": "Jin H",
53+
"lab_member": true
54+
},
55+
{
56+
"name": "Park PJ",
57+
"lab_member": true
58+
}
59+
],
60+
"title": "A recurrent sequencing artifact on Illumina sequencers with two-color fluorescent dye chemistry and its impact on somatic variant detection",
61+
"title_link": "https://link.springer.com/article/10.1186/s13059-026-04081-3",
62+
"abstract": "Background: The sequencing-by-synthesis technology by Illumina, Inc. enables efficient and scalable readouts of mutations from genomic data. To enhance sequencing speed and efficiency, Illumina has shifted from the four-color base calling chemistry of the HiSeq series to a two-color fluorescent dye chemistry in the NovaSeq series. Benchmarking sequencing artifacts due to biases in the newer chemistry is important to evaluate the quality of identified mutations. Results: We re-analyze a series of whole-genome sequencing experiments in which the same samples were sequenced on the NovaSeq 6000 (two-color) and HiSeq X10 (four-color) platforms by independent groups. In several samples, we observe a higher frequency of T-to-G and A-to-C putative substitutions (“T > G”) at the read level for NovaSeq 6000 versus HiSeq X10. As the per-base error rate is still low, the artifactual substitutions have a negligible effect in identifying germline or high variant allele frequency (VAF) somatic mutations. However, such errors can confound the detection of low-VAF somatic variants in high-depth sequencing samples, particularly in studies of mosaic mutations in normal tissues, where variants have low read support and are called without a matched normal. The artifactual T > G variant calls disproportionately occur at NT[TG] trinucleotides, and we leverage this observation to bioinformatically reduce the T > G excess in somatic mutation callsets. Conclusions: We identified a recurrent artifact specific to the Illumina two-color chemistry platform on the NovaSeq 6000 with the potential to contaminate low-VAF somatic mutation calls. Thus, an unexpected enrichment of T > G mutations in mosaicism studies warrants caution.",
63+
"abstract_link": "https://compbio.hms.harvard.edu/publications/a-recurrent-sequencing-artifact-on-illumina-sequencers-with-two-color-fluorescent-dye-chemistry-and-its-impacts-on-somatic-variant-detection",
64+
"year": "2026",
65+
"type": "2026",
66+
"journal": "Genome Biology",
67+
"journal_location": "Genome Biol (2026)",
68+
"pdf_link": "/publications/pdf/s13059-026-04081-3_reference.pdf"
69+
},
70+
{
71+
"author": [
72+
{
73+
"name": "Woo KA",
74+
"lab_member": true
75+
},
76+
{
77+
"name": "Jang Y",
78+
"lab_member": false
79+
},
80+
{
81+
"name": "Kim H",
82+
"lab_member": false
83+
},
84+
{
85+
"name": "Park PJ",
86+
"lab_member": true
87+
}
88+
],
89+
"title": "GLP1R expression and parkinson’s disease and related disorders in GLP-1RA-treated type 2 diabetes",
90+
"title_link": "https://academic.oup.com/brain/advance-article-abstract/doi/10.1093/brain/awag130/8651260?redirectedFrom=fulltext&login=false",
91+
"abstract": "Preclinical and epidemiologic data suggest that glucagon-like peptide-1 receptor agonists (GLP-1RAs) may reduce the risk of Parkinson's disease and related disorders (PDRD), yet clinical trial findings have been mixed, raising the possibility of biologically meaningful heterogeneity. We tested whether interindividual differences in systemic GLP1R expression are associated with PDRD risk among adults with type 2 diabetes initiating GLP-1RAs. We conducted an electronic health record (EHR)-based new-user cohort study in the National Institutes of Health All of Us Research Program, focusing on adults with type 2 diabetes aged 50 years or older who initiated a GLP-1RA between 2005 and 2023 and had no prior PDRD or Alzheimer's disease and related dementias (ADRD). Systemic GLP1R expression was genetically proxied by a 15-variant cis-expression quantitative trait locus (cis-eQTL) genetic risk score dichotomized at the cohort median. Follow-up ended at the earliest of incident PDRD, death, or October 1, 2023. Hazard ratios were estimated using multivariable Cox proportional hazards models adjusting for baseline covariates, and incidence rates and rate differences per 1,000 person-years were estimated using Poisson models. Prespecified sensitivity analyses excluded early outcome events and short drug exposure; an exploratory analysis evaluated incident ADRD. The cohort included 7,039 initiators (3,520 high vs 3,519 low genetic score; mean age 61.7 ± 8.4 years; 59.8% female; mean follow-up 3.8 ± 3.1 years). Compared with the low-score group, the high-score group had a lower hazard of incident PDRD (hazard ratio 0.78; 95% confidence interval 0.62 to 0.98) and a lower incidence rate (rate difference -1.36 per 1,000 person-years; 95% confidence interval -2.51 to -0.20). Associations were consistent across sensitivity analyses. No association was observed for incident ADRD (hazard ratio 1.02; 95% confidence interval 0.81 to 1.28). These findings link a mechanism-anchored genetic proxy for higher systemic GLP1R expression to a lower risk of incident PDRD among GLP-1RA-treated adults with type 2 diabetes. Systemic GLP1R-mediated pathways may be relevant to interindividual differences in observed PDRD risk, and genetic instruments for GLP1R expression may inform hypotheses for future PDRD trial design.",
92+
"abstract_link": "https://compbio.hms.harvard.edu/publications/GLP1R-expression-and-parkinsons-disease-and-related-disorders-in-GLP-1RA-treatedtype-2-diabetes",
93+
"year": "2026",
94+
"type": "2026",
95+
"journal": "Brain",
96+
"journal_location": "Brain, 2026;, awag130"
97+
},
3798
{
3899
"author": [
39100
{
@@ -1743,39 +1804,6 @@
17431804
"journal_location": "bioRxiv",
17441805
"pdf_link": "/publications/pdf/zhang_2025_biorxiv_benchmarking_somatic_sv.pdf"
17451806
},
1746-
{
1747-
"author": [
1748-
{
1749-
"name": "Fu BJ",
1750-
"lab_member": true
1751-
},
1752-
{
1753-
"name": "Viswanadham VV",
1754-
"lab_member": true
1755-
},
1756-
{
1757-
"name": "Maziec D",
1758-
"lab_member": true
1759-
},
1760-
{
1761-
"name": "Jin H",
1762-
"lab_member": true
1763-
},
1764-
{
1765-
"name": "Park PJ",
1766-
"lab_member": true
1767-
}
1768-
],
1769-
"title": "A recurrent sequencing artifact on Illumina sequencers with two-color fluorescent dye chemistry and its impact on somatic variant detection",
1770-
"title_link": "https://www.biorxiv.org/content/10.1101/2025.09.27.678978v1",
1771-
"abstract": "Background: The sequencing-by-synthesis technology by Illumina, Inc. enables efficient and scalable readouts of mutations from genomic data. To enhance sequencing speed and efficiency, Illumina has shifted from the four-color base calling chemistry of the HiSeq series to a two-color fluorescent dye chemistry in the NovaSeq series. Benchmarking sequencing artifacts due to biases in the newer chemistry is important to evaluate the quality of identified mutations. Results: We re-analyzed a series of whole-genome sequencing experiments in which the same samples were sequenced on the NovaSeq 6000 (two-color) and HiSeq X10 (four-color) platforms by independent groups. In several samples, we observed a higher frequency of T-to-G and A-to-C substitutions ('T>G') at the read level for NovaSeq 6000 versus HiSeq X10. As the per-base error rate is still low, the artifactual substitutions have a negligible effect in identifying germline or high variant allele frequency (VAF) somatic mutations. However, such errors can confound the detection of low-VAF somatic variants in high-depth sequencing samples, particularly in studies of mosaic mutations in normal tissues, where variants have low read support and are called without a matched normal. The artifactual T>G variant calls disproportionately occur at NT[TG] trinucleotides, and we leveraged this observation to bioinformatically reduce the T>G excess in somatic mutation callsets. Conclusions: We identified a recurrent artifact specific to the Illumina two-color chemistry platform on the NovaSeq 6000 with the potential to contaminate low-VAF somatic mutation calls. Thus, an unexpected enrichment of T>G mutations in mosaicism studies warrants caution. Keywords: Illumina NovaSeq 6000; Next-generation sequencing; mosaic mutations; sequencing artifacts; somatic mutations.",
1772-
"abstract_link": "https://compbio.hms.harvard.edu/publications/a-recurrent-sequencing-artifact-on-illumina-sequencers-with-two-color-fluorescent-dye-chemistry-and-its-impacts-on-somatic-variant-detection",
1773-
"year": "2025",
1774-
"type": "2025",
1775-
"journal": "bioRxiv",
1776-
"journal_location": "bioRxiv",
1777-
"pdf_link": "/publications/pdf/fu_2025_biorxiv_recurrent_sequencing_artifact.pdf"
1778-
},
17791807
{
17801808
"author": [
17811809
{
1.51 MB
Binary file not shown.

0 commit comments

Comments
 (0)