Skip to content

Commit 5979c43

Browse files
realmarcinclaude
andauthored
#30 backfill batch 2: 4 metals + 3 gut/rhizosphere communities (#83)
* #30 backfill batch 2: 4 metals + 3 gut/rhizosphere communities Continues the SPRUCE/wetland dogfood pattern from PRs #79/#80/#81. Each entry uses CHEBI terms with snippets taken verbatim from cached PMID/DOI abstracts; no cross-repo IDs (MIM IDs haven't been minted). AMD/biomining/REE (4 of 16 remaining): | Community | Ingredients | Source | |---|---|---| | Cyprus_Copper_Sulphide_Bioleaching_Consortium | chalcopyrite (Cu(II) surrogate), chalcocite (Cu(I) sulfide), iron(2+) | PMID:41381092 | | Ferroplasma_Leptospirillum_Syntrophy | iron(2+), pyrite | PMID:16104851 | | Iberian_Pit_Lake_Stratified_Community | sulfate, iron(2+) | PMID:23840525 | | Ewaste_Bioleaching_Consortium | glycine (10 g/L cyanide substrate), hydrogen cyanide (gold lixiviant) | PMID:26704063 | Gut/rhizosphere (3 of ~13 remaining): | Community | Ingredients | Source | |---|---|---| | Bacteroides_Eubacterium_Gnotobiotic_Gut_Model | acetate, butyrate, host-derived mucin glycans | PMID:19321416 | | Brachypodium_Young_Root_Rhizosphere_EcoFAB_Community | root exudates, labile root carbon | PMID:37280433 | | ORNL_PMI_Populus_PD10_SynCom | glucose (minimal-medium axis) | PMID:33995895 | #30 related_ingredients adoption: 12/265 -> 19/265. Test plan: just test (136 passed), all 7 modified files validate clean against the schema. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * Address Copilot review on #83 Five findings, all valid: 1. Cyprus chalcopyrite was mapped to CHEBI:30074 / "copper(2+)", which is wrong on both axes. Updated to CHEBI:50885 / "chalcopyrite" — the mapping the repo already uses (Copper_Biomining_Heap_Leach metabolites). 2. Ewaste cyanide entry's `chebi_term.label` said "hydrogen cyanide" but CHEBI:17514's canonical label is "cyanide". Aligned label. 3. Ewaste cyanide entry's snippet ("This gold complexing agent was used…") did not literally mention cyanide. Replaced with the more direct adjacent abstract sentence ("cyanide-producing heterotrophic Pseudomonas fluorescens and Pseudomonas putida were used") and moved the gold-complexing context into the explanation field. 4. Iberian Pit Lake relevance text described an Fe(II)/Fe(III) cycle across the chemocline but only iron(2+) was listed. Added a separate iron(3+) related_ingredient with its own snippet anchoring the bottom-layer iron-reducing guild (Acidiphilium / Ferroplasma / Acidithiobacillus ferrooxidans in reducing mode); split the original Fe(II) relevance text to reference only the oxidising guild. 5. Ewaste "gold-mobilisation" -> "gold-mobilization" for spelling consistency with the rest of the repo (American spelling). 136 tests still pass; all 3 modified YAMLs validate clean. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent c62b137 commit 5979c43

7 files changed

Lines changed: 283 additions & 0 deletions

kb/communities/Bacteroides_Eubacterium_Gnotobiotic_Gut_Model.yaml

Lines changed: 55 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -219,6 +219,61 @@ environmental_factors:
219219
evidence_source: IN_VIVO
220220
snippet: in combination under 3 dietary conditions
221221
explanation: Supports diet as a controlled environmental factor.
222+
related_ingredients:
223+
- preferred_term: acetate
224+
chebi_term:
225+
id: CHEBI:30089
226+
label: acetate
227+
relevance: Acetate is the cross-feeding currency between B. thetaiotaomicron
228+
and E. rectale in this gnotobiotic model — B. theta produces acetate
229+
that E. rectale consumes to generate butyrate. Cecal acetate is
230+
significantly lower in cocolonised mice, confirming the in vivo cross-
231+
feeding; any environment-analog medium must support an acetate gradient
232+
rather than a fixed pool.
233+
evidence:
234+
- reference: PMID:19321416
235+
supports: SUPPORT
236+
evidence_source: IN_VIVO
237+
snippet: cecal acetate levels are significantly lower in cocolonized
238+
mice compared with B. thetaiotaomicron monoassociated animals
239+
explanation: Anchors acetate as the in vivo cross-feeding intermediate
240+
whose pool shifts predictably when E. rectale is added to the
241+
community.
242+
- preferred_term: butyrate
243+
chebi_term:
244+
id: CHEBI:17968
245+
label: butyrate
246+
relevance: Butyrate is the SCFA output of the consortium and the
247+
epithelium-relevant endpoint; cecal butyrate levels stay similar between
248+
E. rectale mono- and biassociated mice, indicating the coculture
249+
maintains butyrate flux on a smaller acetate pool — a key behavior any
250+
cultivation medium must support measuring.
251+
evidence:
252+
- reference: PMID:19321416
253+
supports: SUPPORT
254+
evidence_source: IN_VIVO
255+
snippet: cecal butyrate levels are similar in E. rectale mono- and
256+
biassociated animals
257+
explanation: Anchors butyrate as the maintained-flux output that
258+
distinguishes this coculture from monocolonised controls.
259+
- preferred_term: host-derived mucin glycans
260+
chebi_term:
261+
id: CHEBI:24400
262+
label: glycoprotein
263+
relevance: Mucin glycans are the host-derived substrate that B. theta
264+
up-regulates when E. rectale is present — switching its glycan usage
265+
away from what E. rectale also exploits. Any defined medium for this
266+
coculture should include a mucin glycan source (or its surrogates) so
267+
that the strain-level adaptation is reproducible.
268+
evidence:
269+
- reference: PMID:19321416
270+
supports: SUPPORT
271+
evidence_source: IN_VIVO
272+
snippet: B. thetaiotaomicron adapts to the presence of E. rectale by
273+
up-regulating a variety of loci specific for host-derived mucin
274+
glycans that E. rectale is unable to use
275+
explanation: Anchors host-derived mucin glycans as the niche-partitioning
276+
substrate whose use is selected by the coculture context.
222277
associated_datasets:
223278
- name: Exact-composition publication - Bacteroides thetaiotaomicron VPI-5482 and Eubacterium rectale ATCC 33656
224279
dataset_type: PHENOTYPE

kb/communities/Brachypodium_Young_Root_Rhizosphere_EcoFAB_Community.yaml

Lines changed: 37 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -147,6 +147,43 @@ environmental_factors:
147147
and base) in young Brachypodium distachyon
148148
explanation: Supports the spatial zone sampling design.
149149
growth_media: []
150+
related_ingredients:
151+
- preferred_term: root exudates
152+
chebi_term:
153+
id: CHEBI:46662
154+
label: organic matter
155+
relevance: Root exudates differ across the developing primary root (tip vs
156+
base) and define the labile-carbon source that distinguishes the
157+
rhizosphere niche from bulk soil; an environment-analog medium must
158+
supply a tip-vs-base-distinguishable exudate mixture, not a uniform
159+
background.
160+
evidence:
161+
- reference: PMID:37280433
162+
supports: SUPPORT
163+
evidence_source: IN_VIVO
164+
snippet: Root exudation patterns are known to vary along distinct parts
165+
of the root even in juvenile plants giving rise to spatially distinct
166+
microbial niches
167+
explanation: Anchors variable root exudation as the substrate gradient
168+
generating the spatially distinct rhizosphere niches sampled in the
169+
EcoFAB design.
170+
- preferred_term: labile root carbon
171+
chebi_term:
172+
id: CHEBI:46662
173+
label: organic matter
174+
relevance: The absence of easily available, labile carbon and nutrients
175+
in bulk soil — as opposed to the labile pool concentrated near roots —
176+
is the explicit functional contrast the paper identifies; any
177+
cultivation medium for this rhizosphere community must supply a
178+
labile-carbon-rich phase distinct from a nutrient-limited control.
179+
evidence:
180+
- reference: PMID:37280433
181+
supports: SUPPORT
182+
evidence_source: COMPUTATIONAL
183+
snippet: the absence of easily available, labile carbon and nutrients
184+
in bulk soil relative to roots
185+
explanation: Anchors labile root carbon as the dimension separating
186+
rhizosphere from bulk-soil microbial function.
150187
external_resources:
151188
- name: Primary publication for the Brachypodium young-root rhizosphere community
152189
repository: OTHER

kb/communities/Cyprus_Copper_Sulphide_Bioleaching_Consortium.yaml

Lines changed: 52 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -192,6 +192,58 @@ external_resources:
192192
resource_id: doi:10.1111/1758-2229.70261
193193
url: https://doi.org/10.1111/1758-2229.70261
194194
description: DOI link to the Environmental Microbiology Reports paper.
195+
related_ingredients:
196+
- preferred_term: chalcopyrite
197+
chebi_term:
198+
id: CHEBI:50885
199+
label: chalcopyrite
200+
relevance: Chalcopyrite (CuFeS2) is the primary copper-sulphide ore the
201+
Cyprus consortium was enriched on; an environment-analog cultivation
202+
medium would need chalcopyrite (or a soluble Cu(II)+Fe(II)+sulphide
203+
surrogate) as the central solid substrate driving consortium
204+
composition and surface attachment.
205+
evidence:
206+
- reference: PMID:41381092
207+
supports: SUPPORT
208+
evidence_source: IN_VITRO
209+
snippet: Copper bioleaching is a green technology for the recovery of
210+
copper from chalcopyrite (CuFeS2) and chalcocite (Cu2S) ores
211+
explanation: Anchors chalcopyrite (the Cu(II)/Fe(II) sulphide source
212+
species) as the central substrate of this bioleaching consortium.
213+
- preferred_term: chalcocite
214+
chebi_term:
215+
id: CHEBI:33415
216+
label: copper(I) sulfide
217+
relevance: Chalcocite (Cu2S) was the second mineral the consortium was
218+
sub-cultured to; switching between chalcopyrite and chalcocite is what
219+
the study used to investigate how community composition responds to
220+
mineral structure and the absence of mineral-derived Fe, so a medium
221+
designed to recapitulate that experiment needs both phases.
222+
evidence:
223+
- reference: PMID:41381092
224+
supports: SUPPORT
225+
evidence_source: IN_VITRO
226+
snippet: established a microbial consortium from a copper bioleaching
227+
column in Cyprus on chalcopyrite and then sub-cultured it to chalcocite
228+
explanation: Anchors chalcocite as the second mineral substrate used to
229+
probe consortium dependence on mineral-derived Fe.
230+
- preferred_term: iron(2+)
231+
chebi_term:
232+
id: CHEBI:29033
233+
label: iron(2+)
234+
relevance: Mineral-derived Fe (released from chalcopyrite into solution as
235+
Fe(II)) is the explicit variable the study manipulated by switching to
236+
chalcocite (which lacks Fe); Fe(II) is the substrate for Leptospirillum
237+
ferrodiazotrophum and Acidithiobacillus iron-oxidisers central to the
238+
consortium.
239+
evidence:
240+
- reference: PMID:41381092
241+
supports: SUPPORT
242+
evidence_source: IN_VITRO
243+
snippet: to investigate how the community composition shifts due to
244+
changes in mineral structure and the absence of mineral-derived Fe
245+
explanation: Anchors mineral-derived Fe(II) as the controllable substrate
246+
whose absence (in chalcocite enrichments) restructures the consortium.
195247
associated_datasets: []
196248
metals_present:
197249
- COPPER

kb/communities/Ewaste_Bioleaching_Consortium.yaml

Lines changed: 34 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -563,6 +563,40 @@ environmental_factors:
563563
snippet: the inhibiting effect of PCBs limited the microbial activity by delaying the onset of the exponential iron oxidation
564564
explanation: Documents PCB-driven inhibition of microbial activity through
565565
delayed onset of exponential iron oxidation.
566+
related_ingredients:
567+
- preferred_term: glycine
568+
chebi_term:
569+
id: CHEBI:15428
570+
label: glycine
571+
relevance: Glycine is the substrate fed at 10 g/L to P. putida WCS361 to
572+
drive cyanide production at 21.5 mg/L; a two-step bioleaching medium
573+
for this consortium must include glycine at the cyanide-induction
574+
concentration to enable the gold-mobilization step.
575+
evidence:
576+
- reference: PMID:26704063
577+
supports: SUPPORT
578+
evidence_source: IN_VITRO
579+
snippet: 21.5 (±1.5)mg/L cyanide with 10g/L glycine as the substrate
580+
explanation: Anchors glycine at 10 g/L as the curator-supported substrate
581+
for cyanide biosynthesis in the second bioleaching step.
582+
- preferred_term: cyanide
583+
chebi_term:
584+
id: CHEBI:17514
585+
label: cyanide
586+
relevance: Cyanide is the gold-complexing lixiviant produced by
587+
P. fluorescens / P. putida in the second bioleaching step; the medium
588+
has to be designed so cyanide accumulates to alkaline-stage
589+
concentrations (~21.5 mg/L) without inhibiting the heterotrophic
590+
Pseudomonas producers.
591+
evidence:
592+
- reference: PMID:26704063
593+
supports: SUPPORT
594+
evidence_source: IN_VITRO
595+
snippet: cyanide-producing heterotrophic Pseudomonas fluorescens and
596+
Pseudomonas putida were used
597+
explanation: Anchors cyanide production by the second-step Pseudomonas
598+
members of the consortium; the abstract goes on to identify cyanide
599+
as the gold-complexing agent in the subsequent step.
566600
metals_present:
567601
- COPPER
568602
- GOLD

kb/communities/Ferroplasma_Leptospirillum_Syntrophy.yaml

Lines changed: 33 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -469,6 +469,39 @@ environmental_factors:
469469
of the response surface methodologies (RSM) which yielded optimal performance at 80 min contact
470470
time, pH 9.0, and 0.15 g L-1 ZMR dose, achieving 98.84 % Cr(VI) removal
471471
explanation: Quantifies organic matter stress condition
472+
related_ingredients:
473+
- preferred_term: iron(2+)
474+
chebi_term:
475+
id: CHEBI:29033
476+
label: iron(2+)
477+
relevance: Fe(II) is the principal energy substrate for the ferrous-iron
478+
oxidising Ferroplasma acidiphilum and partner Leptospirillum
479+
iron-oxidisers; any environment-analog cultivation medium for this
480+
syntrophic pair must supply Fe(II) at extremely low pH.
481+
evidence:
482+
- reference: PMID:16104851
483+
supports: SUPPORT
484+
evidence_source: REVIEW
485+
snippet: acidophilic, ferrous-iron oxidizing Ferroplasma acidiphilum
486+
explanation: Anchors Fe(II) oxidation as the defining energy metabolism
487+
of Ferroplasma in the syntrophic consortium.
488+
- preferred_term: pyrite
489+
chebi_term:
490+
id: CHEBI:46627
491+
label: pyrite
492+
relevance: Pyrite (FeS2) and related sulphide ores are the upstream
493+
substrate that releases Fe(II) and sulphide species into the
494+
Ferroplasma-Leptospirillum habitat; a cultivation medium that
495+
recapitulates the syntrophy needs a pyrite-class mineral substrate or
496+
a soluble surrogate.
497+
evidence:
498+
- reference: PMID:16104851
499+
supports: SUPPORT
500+
evidence_source: REVIEW
501+
snippet: able to mobilize metals from sulfide ores, e.g. pyrite,
502+
arsenopyrite and copper-containing sulfides
503+
explanation: Anchors pyrite (and the sulfide-ore class) as the upstream
504+
substrate the Ferroplasma-led consortium mobilises.
472505
metals_present:
473506
- COPPER
474507
- IRON

kb/communities/Iberian_Pit_Lake_Stratified_Community.yaml

Lines changed: 53 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -722,6 +722,59 @@ environmental_factors:
722722
a taxonomically resolved analysis of microbial contributions to carbon, sulfur, iron, and nitrogen
723723
cycling
724724
explanation: Quantifies diversity increase with depth
725+
related_ingredients:
726+
- preferred_term: sulfate
727+
chebi_term:
728+
id: CHEBI:16189
729+
label: sulfate
730+
relevance: Sulfate is the quantitatively dominant anion in the Iberian
731+
Pyrite Belt pit lakes — present at high concentrations alongside
732+
dissolved heavy metals — and is the central electron-acceptor pool
733+
structuring the acidic stratified community; an environment-analog
734+
medium must supply sulfate at concentrations matching the IPB regime.
735+
evidence:
736+
- reference: PMID:23840525
737+
supports: SUPPORT
738+
evidence_source: IN_VIVO
739+
snippet: Both pit lakes are acidic and showed high concentrations of
740+
sulfate and dissolved metals
741+
explanation: Anchors sulfate as the central anion of the pit lake
742+
hydrochemistry.
743+
- preferred_term: iron(2+)
744+
chebi_term:
745+
id: CHEBI:29033
746+
label: iron(2+)
747+
relevance: Fe(II) is the substrate the IPB iron-oxidising bacteria
748+
(Leptospirillum, Acidithiobacillus ferrooxidans) consume in the upper
749+
oxic layers; a cultivation medium for the oxidising guild must supply
750+
Fe(II) at low pH.
751+
evidence:
752+
- reference: PMID:23840525
753+
supports: SUPPORT
754+
evidence_source: IN_VIVO
755+
snippet: iron oxidizing bacteria (Leptospirillum, Acidithiobacillus
756+
ferrooxidans) and facultative iron reducing bacteria and archaea
757+
explanation: Anchors Fe(II) oxidation as one half of the redox axis
758+
structuring the stratified pit lake community.
759+
- preferred_term: iron(3+)
760+
chebi_term:
761+
id: CHEBI:29034
762+
label: iron(3+)
763+
relevance: Fe(III) — produced by upper-layer oxidisers — is the electron
764+
acceptor the facultative iron-reducers (Acidiphilium, Ferroplasma,
765+
Acidithiobacillus ferrooxidans in reducing mode) in the bottom layer
766+
consume, closing the iron cycle across the chemocline; the medium
767+
needs both oxidation states available to recapitulate the stratified
768+
redox structure.
769+
evidence:
770+
- reference: PMID:23840525
771+
supports: SUPPORT
772+
evidence_source: IN_VIVO
773+
snippet: facultative iron reducing bacteria and archaea
774+
(Acidithiobacillus ferrooxidans, Acidiphilium, Actinobacteria,
775+
Acidimicrobiales, Ferroplasma) detected in the bottom layer
776+
explanation: Anchors Fe(III) reduction as the bottom-layer half of the
777+
redox axis closing the stratified iron cycle.
725778
metals_present:
726779
- COPPER
727780
- IRON

kb/communities/ORNL_PMI_Populus_PD10_SynCom.yaml

Lines changed: 19 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -497,6 +497,25 @@ environmental_factors:
497497
snippet: The membership and relative abundances of the strains stabilized after around 5 growth cycles
498498
and resulted in just a few dominant strains that depended on the medium
499499
explanation: Documents the timeframe to stable community structure
500+
related_ingredients:
501+
- preferred_term: glucose
502+
chebi_term:
503+
id: CHEBI:17234
504+
label: glucose
505+
relevance: Glucose is the central carbon source for the minimal-medium
506+
arm of this passaging study; the community-assembly outcome is
507+
explicitly different between complex and minimal-glucose media, so a
508+
medium designed to recapitulate this Populus PD10 community must define
509+
the glucose-vs-complex axis as a primary independent variable.
510+
evidence:
511+
- reference: PMID:33995895
512+
supports: SUPPORT
513+
evidence_source: IN_VITRO
514+
snippet: co-cultured in either complex or minimal glucose media and
515+
serially transferred until a stable community structure formed
516+
explanation: Anchors glucose as the defining minimal-medium carbon
517+
source under which a distinct stable consortium emerges from the
518+
same 10-strain inoculum.
500519
associated_datasets:
501520
- name: Populus PD10 16S rRNA gene amplicon data
502521
dataset_type: AMPLICON_16S

0 commit comments

Comments
 (0)