Fix remote-ingest worker: GPU acceleration docs, Celery prereqs, embedder env-var bug #2069
JSv4 wants to merge 1 commit into
…var bug

- Fix silent embedding failure: `API_KEY` env var renamed to `VECTOR_EMBEDDER_API_KEY` in `scripts/remote_ingest/remote_worker.yml` to match what the embedder image actually reads (the mismatch caused HTTP 401 on every embed, landing docs with no embeddings and no obvious error)
- New doc `docs/upload_methods/worker_celery_setup.md`: ops reference for the Celery workers + queues a target must run so worker uploads become documents (the #1 silent failure: uploads accepted as 202 but stay PENDING forever when no worker consumes the `worker_uploads` queue)
- Add GPU acceleration section to `remote_ingest_worker.md` and `scripts/remote_ingest/README.md`: how to merge the accel override, NVIDIA/ROCm device-passthrough instructions (commented stanzas in both `remote_worker.accel.yml` and `compose/accelerated/accel.override.yml`), and a reminder to benchmark on your specific hardware
- RAM sizing guidance: CPU OCR is serial and uses 3-6 GB per in-flight parse; size `--max-workers` to available RAM, not CPU count
- Fix `mint_worker_token` examples to show it runs inside the Django container (`docker compose run --rm django python manage.py …`)
- Clarify `VECTOR_EMBEDDER_API_KEY`: one value wired to both the embedder service and the worker; mismatch → HTTP 401; any value works as long as both sides match
- Document `--insecure` and `--max-attempts` flags in the flag reference
- Add troubleshooting tables to both READMEs and `remote_ingest_worker.md`
- Fix `bench_parse.py` build-context comment (correct `-f` + context path)
- Add Intel NPU `/dev/accel/accel0` note to `compose/accelerated/README.md`

Closes #2067
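To make the RAM sizing guidance concrete, here is a minimal sketch of the arithmetic, not code from this PR. The function name `max_workers_for_ram`, the 2 GB headroom for the OS and other services, and the choice to budget the upper end of the 3-6 GB range are all assumptions for illustration; measure on your own hardware before settling on a value.

```python
import math


def max_workers_for_ram(
    available_gb: float,
    per_parse_gb: float = 6.0,  # assumption: budget the top of the 3-6 GB range
    reserve_gb: float = 2.0,    # assumption: headroom for the OS and other services
) -> int:
    """Suggest a --max-workers value from available RAM, not CPU count.

    Each in-flight parse is budgeted at per_parse_gb. The result never
    drops below 1 so the worker can still make progress on a small host.
    """
    if per_parse_gb <= 0:
        raise ValueError("per_parse_gb must be positive")
    usable_gb = available_gb - reserve_gb
    return max(1, math.floor(usable_gb / per_parse_gb))


if __name__ == "__main__":
    for ram in (8, 16, 32, 64):
        print(f"{ram} GB RAM -> --max-workers {max_workers_for_ram(ram)}")
```

A host with many cores but 16 GB of RAM would get `--max-workers 2` under these assumptions, which is the point of sizing by memory rather than CPU count.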
Code Review
This PR does three things: fixes the …
1. Empty-string default for …
Codecov Report
✅ All modified and coverable lines are covered by tests.
Summary
- `API_KEY` → `VECTOR_EMBEDDER_API_KEY` in `scripts/remote_ingest/remote_worker.yml`: the embedder image reads `VECTOR_EMBEDDER_API_KEY` but the compose file was setting `API_KEY`, causing HTTP 401 on every embedding call. Documents landed with no embeddings and no obvious error.
- `docs/upload_methods/worker_celery_setup.md`: ops reference for the Celery workers + queues a target must run so worker uploads become documents. Covers the #1 silent failure (uploads accepted as 202 but stay `PENDING` forever when no worker consumes `worker_uploads`), Beat schedule, scaling/HA, key settings, verification commands.
- GPU acceleration section: how to merge the accel override, NVIDIA/ROCm device-passthrough instructions (commented stanzas in `remote_worker.accel.yml` and `compose/accelerated/accel.override.yml`), note to benchmark on your specific hardware.
- RAM sizing guidance: CPU OCR is serial and uses 3-6 GB per in-flight parse; size `--max-workers` to available RAM, not CPU count.
- `mint_worker_token` examples fixed to show it runs inside the Django container (`docker compose run --rm django python manage.py mint_worker_token …`).
- Troubleshooting tables added to both READMEs and `remote_ingest_worker.md`.
- `--insecure` / `--max-attempts` flags documented, `bench_parse.py` build-context comment fixed, Intel NPU device note added.

Test plan
- Verify the `remote_worker.yml` embedder service picks up `VECTOR_EMBEDDER_API_KEY` (not `API_KEY`)
- Review `worker_celery_setup.md` for accuracy against the `config/settings/base.py` Beat schedule and task routes
- Confirm the `mkdocs.yml` nav entry and the `docs/upload_methods/index.md` link resolve correctly in the docs build
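The first check can be sketched as a small script that compares the `environment` blocks of the two services once the compose file has been parsed. This is an illustrative sketch, not part of the PR: the helper names `env_to_dict` and `check_embedder_key` and the labels "embedder" and "worker" are hypothetical, and the compose file is assumed to be already loaded into plain Python dicts or lists by whatever YAML parser you use.

```python
from collections.abc import Mapping

KEY = "VECTOR_EMBEDDER_API_KEY"


def env_to_dict(env) -> dict:
    """Normalise a compose `environment` block (mapping or KEY=VALUE list)."""
    if isinstance(env, Mapping):
        return {str(k): "" if v is None else str(v) for k, v in env.items()}
    result = {}
    for item in env:
        name, _, value = str(item).partition("=")
        result[name] = value
    return result


def check_embedder_key(embedder_env, worker_env, name: str = KEY) -> list:
    """Return human-readable problems; an empty list means both sides agree."""
    embedder = env_to_dict(embedder_env)
    worker = env_to_dict(worker_env)
    problems = []
    for label, env in (("embedder", embedder), ("worker", worker)):
        if not env.get(name):
            problems.append(f"{label}: {name} is missing or empty")
    if not problems and embedder[name] != worker[name]:
        problems.append(f"{name} differs between embedder and worker (expect HTTP 401)")
    # The bug fixed in this PR: the old variable name set instead of the new one.
    if "API_KEY" in embedder and name not in embedder:
        problems.append(f"embedder: sets API_KEY, but the image reads {name}")
    return problems


if __name__ == "__main__":
    old_embedder = {"API_KEY": "secret"}        # pre-fix shape
    worker = ["VECTOR_EMBEDDER_API_KEY=secret"]  # list form is also valid compose
    for problem in check_embedder_key(old_embedder, worker):
        print(problem)
```

Treating an empty value as a problem is a deliberate choice in this sketch, since an unset variable with an empty-string default would otherwise pass silently.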