### Implementation Use an internal `asyncio.Queue` to separate the **Fetch** (fast) from the **Vectorization** (slow/API-bound). ### Why Allows fetching the next batch while the current one is still waiting for an LLM response, maximizing throughput.
Implementation
Use an internal
asyncio.Queueto separate the Fetch (fast) from the Vectorization (slow/API-bound).Why
Allows fetching the next batch while the current one is still waiting for an LLM response, maximizing throughput.