You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- Remove IdentifierModule._check_doi_content_consistency and the consistency_score / low_consistency warning path. Fuzzy string-similarity was empirically unable to detect subtle LLM-hallucinated references and only surfaced as a logger.warning that downstream tools could not observe. Citation-authenticity belongs at the semantic abstract-vs-claim layer in the consuming tool (e.g. the sci skill), not at the bibliographic-string layer here.
- Add DOI-only abstract fallback cascade: CrossRef -> Semantic Scholar (/paper/DOI:) -> PubMed (ESearch DOI->PMID, EFetch PMID->abstract). Gated on raw input carrying a DOI; DOIs inferred by fuzzy search do not trigger it. Title-based PubMed fallback is removed because it empirically returned the abstract of an unrelated paper for at least one DOI, which is strictly worse than returning None for downstream semantic checks.
- Rename _complete_fields(..., allow_pubmed_fallback=...) to allow_abstract_fallback. Old name kept as deprecated alias for one release cycle (emits DeprecationWarning).
- journal_article_full template declares abstract as an optional field to match what the enricher emits. journal_article_with_abstract retained as compatibility alias.
- Regression test test_enrich_single_entry_no_doi_in_raw_skips_abstract_fallback pins the no-DOI-in-raw => no Semantic-Scholar/PubMed network call guarantee at the _enrich_single_entry layer.
- Sync CHANGELOG.md and docs/changelog.rst for 0.1.1, bump 4 version sites (pyproject.toml, __init__.py, CITATION.cff, enricher.py User-Agent), and add truncated abstract line to README / quick_start.rst / output_formats.rst sample outputs.
|**Multi-Source Lookup**| Queries CrossRef, arXiv, PubMed, Semantic Scholar, Google Books, and others for every entry. |
71
71
|**Many Identifier Types**| Accepts DOI, PMID, arXiv ID, ISBN, GitHub URL, Zenodo DOI, or plain text queries. |
@@ -143,6 +143,7 @@ Your `results.bib` file now contains entries of different types.
143
143
publisher = "Springer Science and Business Media LLC",
144
144
url = "https://doi.org/10.1038/nature14539",
145
145
type = "journal-article",
146
+
abstract = "Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction...",
Copy file name to clipboardExpand all lines: docs/output_formats.rst
+1Lines changed: 1 addition & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -25,6 +25,7 @@ Format Specification
25
25
doi = "10.1038/nature14539",
26
26
title = "Deep Learning",
27
27
author = "LeCun, Yann and Bengio, Yoshua and Hinton, Geoffrey",
28
+
abstract = "Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction...",
Copy file name to clipboardExpand all lines: docs/quick_start.rst
+1Lines changed: 1 addition & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -51,6 +51,7 @@ Your ``results.bib`` file now contains entries in BibTeX format::
51
51
doi = "10.1038/nature14539",
52
52
title = "Deep learning",
53
53
author = "LeCun, Yann and Bengio, Yoshua and Hinton, Geoffrey",
54
+
abstract = "Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction...",
0 commit comments