Skip to content

Commit 5137c46

Browse files
authored
Fixing retrieval bugs (#94)
* upgrade chroma to fix dropped SelfQueryRetrievals * add minimal case-normalization for BM25Retriever * wordaround weird 1st-message bug * improved word tokenization for BM25Retriever
1 parent f846fca commit 5137c46

5 files changed

Lines changed: 138 additions & 62 deletions

File tree

Dockerfile

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -27,6 +27,9 @@ RUN poetry config virtualenvs.in-project true
2727
# Install dependencies without dev dependencies
2828
RUN poetry install --no-root --without dev
2929

30+
# Download NLTK data
31+
RUN poetry run python -m nltk.downloader punkt_tab
32+
3033
# Adjust PATH to include the virtual environment's bin directory
3134
ENV PATH="${VENV_PATH}/bin:${PATH}"
3235

0 commit comments

Comments
 (0)