LLM Batch Pipeline

llm-batch-pipeline is a generic LLM batch processing pipeline. It discovers and parses input files via a plugin system, renders OpenAI Batch API (or Ollama) requests, validates structured JSON outputs with a Pydantic schema, evaluates predictions against ground truth, and exports results to XLSX/JSON.

See the Getting Started Guide for a tested end-to-end walkthrough with OpenAI Batch API and a 3-way sharded Ollama setup (docs/getting-started.md). See the User Guide for installation and CLI reference (docs/user-guide.md). See the Admin Guide for installation/deployment, and the Developer Guide for how to extend the pipeline with custom plugins, prompts, schemas, and evaluation.

Workflow

flowchart TD
    A[Input Files] --> B[1. Discover]
    B --> C[2. Filter - pre]
    C --> D[3. Transform]
    D --> E[4. Filter - post]
    E --> F[5. Render JSONL]
    F --> G{6. Human Review}
    G -->|Approved| H[7. Submit to Backend]
    G -->|--auto-approve| H
    G -->|Rejected| Z[Abort]
    H --> I[8. Validate Results]
    I --> J[9. Evaluate]
    J --> K[10. Export]

    subgraph Backends
        H --> H1[OpenAI Batch API]
        H --> H2[Ollama Local]
    end

    H1 --> I
    H2 --> I

    subgraph Outputs
        K --> K1[results.xlsx]
        K --> K2[evaluation.xlsx]
        K --> K3[evaluation.json]
        K --> K4[metrics.json]
    end

Admin / Install

Admin guide: docs/admin-guide.md
Install (example): uv sync

Requirements

OpenAI backend: set OPENAI_API_KEY.
Local LLM via Ollama: run an Ollama server (pull the model), then use --backend ollama --base-url http://HOST:11434 (repeat --base-url for multi-server sharding).
OpenAI API compatible local server (if supported by your server): use --backend openai and configure the OpenAI SDK base URL (commonly via OPENAI_BASE_URL).

Getting Started

End-to-end walkthrough: docs/getting-started.md
The getting-started guide was tested against live OpenAI Batch and Ollama services.

Quick Test (offline)

Run the unit test suite (no external LLM services):

uv sync --group dev
uv run pytest -q

Plugins

List registered plugins:

uv run llm-batch-pipeline list

The built-in examples include spam_detection and gdpr_detection.

Test / Benchmark

SpamAssassin corpus reference run: docs/benchmark-run.md

Architecture

docs/architecture.md

Extend (plugins)

docs/developer-guide.md
Developer guide covers: custom prompt, custom Pydantic schema, and custom evaluation.

Monitor

Prometheus metrics + Grafana: docs/admin-guide.md

License

EUPL (v1.2): see LICENSE

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.github/workflows		.github/workflows
docs		docs
src/llm_batch_pipeline		src/llm_batch_pipeline
tests		tests
.gitignore		.gitignore
.pylintrc		.pylintrc
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

LLM Batch Pipeline

Workflow

Admin / Install

Requirements

Getting Started

Quick Test (offline)

Plugins

Test / Benchmark

Architecture

Extend (plugins)

Monitor

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

LLM Batch Pipeline

Workflow

Admin / Install

Requirements

Getting Started

Quick Test (offline)

Plugins

Test / Benchmark

Architecture

Extend (plugins)

Monitor

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages