Drive `just validate-references-all` errors from 43 to 39 (down from
101 at session start) by adding a last-resort DOI page scraper to the
literature fetcher and refreshing the three Springer caches it
unblocks.
Fetcher (src/communitymech/literature.py):
- fetch_publisher_meta_abstract(): GET https://doi.org/<DOI>, follow
redirects, and pull the abstract excerpt out of the page's
twitter:description / og:description / description meta tag. Springer
publishes the first ~200 characters of the abstract in
twitter:description even for paywalled articles where Crossref /
OpenAlex / Semantic Scholar / Europe PMC have no abstract. Includes
on-disk caching as publisher_<safe-doi>.txt and strips the
"Journal Name - " prefix Springer adds to that field. Elsevier
ScienceDirect intentionally serves a bot-detection page and yields
nothing - that's the residual cap.
- fetch_paper() fallback chain now: CrossRef -> PMID -> PMC -> OpenAlex
-> Semantic Scholar -> Europe PMC -> publisher meta scrape.
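
A minimal sketch of the scrape step described above, not the code in
literature.py. It assumes the page HTML has already been fetched, and
every name below (extract_meta_abstract, strip_journal_prefix,
cache_filename, first_abstract) is hypothetical. The journal name is
passed in explicitly because the commit does not say how the prefix is
detected, and the safe-DOI scheme is a guess at the cache naming.

```python
import re
from html.parser import HTMLParser

# Tag preference order as listed in the commit message.
META_KEYS = ("twitter:description", "og:description", "description")


class _MetaCollector(HTMLParser):
    """Collect the first non-empty content for each description meta tag."""

    def __init__(self):
        super().__init__()
        self.found = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        a = {k: (v or "") for k, v in attrs}
        # Publishers use either name= or property= for these tags.
        key = (a.get("name") or a.get("property") or "").lower()
        content = a.get("content", "").strip()
        if key in META_KEYS and content and key not in self.found:
            self.found[key] = content


def extract_meta_abstract(html):
    """Return the best available description excerpt, or None."""
    collector = _MetaCollector()
    collector.feed(html)
    for key in META_KEYS:
        if key in collector.found:
            return collector.found[key]
    return None


def strip_journal_prefix(text, journal):
    """Drop a leading "Journal Name - " prefix if present (assumed format)."""
    prefix = f"{journal} - "
    return text[len(prefix):] if text.startswith(prefix) else text


def cache_filename(doi):
    """Build publisher_<safe-doi>.txt; the sanitising rule is an assumption."""
    safe = re.sub(r"[^A-Za-z0-9._-]", "_", doi)
    return f"publisher_{safe}.txt"


def first_abstract(doi, sources):
    """Try each fetcher in order and return the first non-empty result."""
    for fetch in sources:
        text = fetch(doi)
        if text:
            return text
    return None
```

Putting the scraper last in the list passed to first_abstract mirrors the
fallback chain: it only runs when every aggregator returns nothing, and a
bot-detection page with no description tags falls through as None.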
Cache refresh (recovers 4 ERROR rows):
- DOI_10.1007_s10311-019-00911-y (e-waste copper bioleaching, Springer)
- DOI_10.1007_s10230-008-0059-z (Iberian meromictic pit lakes, Springer)
- DOI_10.1007_BF02106205 (Acidobacterium taxonomy paper, Current
Microbiology / Springer; cited 2x in AMD_Acidophile_Heterotroph_Network)
Snippet repairs:
- Ewaste_Bioleaching_Consortium: replace title quote with the abstract's
verbatim e-waste bioleaching framing.
- Iberian_Pit_Lake_Stratified_Community: upgrade PARTIAL to SUPPORT and
expand the snippet to the abstract's vertical-gradient quote.
- AMD_Acidophile_Heterotroph_Network: replace two title quotes with the
abstract's verbatim genus proposal.
The remaining 39 "No content available" errors come from Elsevier
2024-2025 papers (j.jece.2025.120403, j.cej.2024.153492,
j.ibiod.2025.106190, 10889868.2024.2407240) plus one ResearchGate
preprint. Their abstracts are not in any aggregator we can query, and
the publisher pages serve bot-detection HTML.
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Acidobacterium is proposed as a new genus for the acidophilic, chemoorganotrophic bacteria containing menaquinone isolated from acidic mineral environments. Acidobacterium
A marked vertical trend of increasing temperature and dissolved metal concentrations is observed in the monimolimnia of some meromictic pit lakes of the Iberian
Computer circuit boards are a major electronic waste containing higher concentrations of copper, gold and silver. These metals may be recovered by bioleaching, an