ChicagoHAI
diff --git a/‎.github/workflows/test.yml‎
Lines changed: 2 additions & 2 deletions b/‎.github/workflows/test.yml‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎README.md‎
Lines changed: 20 additions & 98 deletions b/‎README.md‎
Lines changed: 20 additions & 98 deletions
diff --git a/‎README.pypi.md‎
Lines changed: 13 additions & 94 deletions b/‎README.pypi.md‎
Lines changed: 13 additions & 94 deletions
diff --git a/‎autochecklist/__init__.py‎
Lines changed: 6 additions & 1 deletion b/‎autochecklist/__init__.py‎
Lines changed: 6 additions & 1 deletion
diff --git a/‎autochecklist/cli.py‎
Lines changed: 17 additions & 4 deletions b/‎autochecklist/cli.py‎
Lines changed: 17 additions & 4 deletions
diff --git a/‎docs/frames.gif‎
887 KB b/‎docs/frames.gif‎
887 KB
diff --git a/‎docs/index.md‎
Lines changed: 1 addition & 0 deletions b/‎docs/index.md‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎pyproject.toml‎
Lines changed: 2 additions & 1 deletion b/‎pyproject.toml‎
Lines changed: 2 additions & 1 deletion
@@ -1,9 +1,9 @@
 name: Tests
 on:
   push:
-    branches: [main]
+    branches: [main, dev]
   pull_request:
-    branches: [main]
+    branches: [main, dev]
 
 jobs:
   test:
 
@@ -4,11 +4,18 @@
 
 ---
 
-[![GitHub Stars](https://img.shields.io/github/stars/ChicagoHAI/AutoChecklist?style=flat-square)](https://github.com/ChicagoHAI/AutoChecklist)
-[![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg?style=flat-square)](https://www.python.org/downloads/)
-[![License](https://img.shields.io/badge/License-Apache%202.0-green.svg?style=flat-square)](LICENSE)
+<p align="center">
+  <a href="https://github.com/ChicagoHAI/AutoChecklist"><img src="https://img.shields.io/github/stars/ChicagoHAI/AutoChecklist?style=flat-square" alt="GitHub Stars"></a>
+  <a href="https://www.python.org/downloads/"><img src="https://img.shields.io/badge/python-3.10+-blue.svg?style=flat-square" alt="Python 3.10+"></a>
+  <a href="LICENSE"><img src="https://img.shields.io/badge/License-Apache%202.0-green.svg?style=flat-square" alt="License"></a>
+  <a href="https://autochecklist.github.io/"><img src="https://img.shields.io/badge/site-autochecklist.github.io-purple?style=flat-square" alt="Site"></a>
+</p>
+
+`AutoChecklist` is an open-source library that unifies LLM-based checklist evaluation into composable pipelines, in a `pip`-installable Python package (`autochecklist`) with CLI and UI features.
 
-`AutoChecklist` is an open-source library that unifies LLM-based checklist evaluation into composable pipelines, in a `pip`-installable Python package (`autochecklist`) with CLI and UI features. 
+<p align="center">
+  <img src="docs/frames.gif" alt="AutoChecklist demo" width="700">
+</p>
 
 ### Features
 -  **Five checklist generator abstractions** that organize methods from research by their reasoning strategies for deriving evaluation criteria
@@ -114,124 +121,39 @@ Refiners are pipeline stages that clean up raw checklists before scoring. They'r
 
 
 
-## Using the Package
-
-### Custom Prompts
-
-Write a prompt template and generate a checklist:
-
-```python
-from autochecklist import DirectGenerator, ChecklistScorer
-
-gen = DirectGenerator(
-    custom_prompt="You are an expert evaluator. Generate yes/no checklist questions to score:\n\n{input}",
-    model="openai/gpt-5-mini",
-)
-checklist = gen.generate(input="Write a haiku about autumn.")
-
-scorer = ChecklistScorer(mode="batch", model="openai/gpt-5-mini")
-score = scorer.score(checklist, target="Leaves fall gently down...")
-print(f"Pass rate: {score.pass_rate:.0%}")
-```
-
-Scorers also take custom prompts. Prompts can also be loaded from `.md` files — see [Custom Prompts](docs/user-guide/custom-prompts.md) for the full guide (placeholders, custom scorers, registration).
-
-### Custom Pipelines
-
-Register a custom pipeline (generator + scorer + prompts) as a reusable unit:
-
-```python
-from autochecklist import register_custom_pipeline, pipeline
-
-# Register from config
-register_custom_pipeline(
-    "my_eval",
-    generator_prompt="Generate yes/no questions for:\n\n{input}",
-    scorer="weighted",
-)
-pipe = pipeline("my_eval", generator_model="openai/gpt-5-mini")
-
-# Or register from an existing pipeline instance
-register_custom_pipeline("my_eval_v2", pipe)
-
-# Save/load pipeline configs as JSON
-from autochecklist import save_pipeline_config, load_pipeline_config
-save_pipeline_config("my_eval", "my_eval.json")
-load_pipeline_config("my_eval.json")  # registers and returns the name
-```
-
-### Built-in Pipelines
-
-The library includes pipelines implementing methods from research papers. Use them via `method_name` or the `pipeline()` shorthand:
+## Quick Start
 
 ```python
 from autochecklist import pipeline
 
 pipe = pipeline("tick", generator_model="openai/gpt-5-mini", scorer_model="openai/gpt-5-mini")
-result = pipe(input="Write a haiku about autumn", target="Leaves fall gently...")
+result = pipe(input="Write a haiku about autumn.", target="Leaves fall gently down...")
 print(f"Pass rate: {result.pass_rate:.0%}")
 ```
 
-See [Supported Pipelines](docs/user-guide/supported-pipelines.md) for the full list of pipelines, paper details, and configuration options.
-
-### Batch Evaluation
-
-```python
-data = [
-    {"input": "Write a haiku", "target": "Leaves fall..."},
-    {"input": "Write a limerick", "target": "There once was..."},
-]
-result = pipe.run_batch(data, show_progress=True)
-print(f"Macro pass rate: {result.macro_pass_rate:.0%}")
-```
-
-For pipeline composition, provider configuration, and the full API, see the [Pipeline Guide](docs/user-guide/pipeline.md).
+See the [Quick Start guide](https://autochecklist.github.io/getting-started/quickstart/) for custom prompts, batch evaluation, and more.
 
-### Command-Line Interface
-
-Run evaluations directly from the terminal:
+### CLI
 
 ```bash
-# Full evaluation (generate + score)
 autochecklist run --pipeline tick --data eval_data.jsonl -o results.jsonl \
   --generator-model openai/gpt-4o-mini --scorer-model openai/gpt-4o-mini
-
-# Generate checklists only
-autochecklist generate --pipeline tick --data inputs.jsonl -o checklists.jsonl \
-  --generator-model openai/gpt-4o-mini
-
-# Score with existing checklist
-autochecklist score --data eval_data.jsonl --checklist checklist.json \
-  -o results.jsonl --scorer-model openai/gpt-4o-mini
-
-# List available pipelines
-autochecklist list
 ```
 
-API keys can be set via `--api-key`, environment variables (`OPENROUTER_API_KEY`), or a `.env` file. See the [CLI Guide](docs/user-guide/cli.md) for full details.
-
-### Examples
-
-Detailed examples with runnable code:
-
-- **[custom_components_tutorial.ipynb](examples/custom_components_tutorial.ipynb)** - Create your own generators, scorers, and refiners
-- **[pipeline_demo.ipynb](examples/pipeline_demo.ipynb)** - Pipeline API, registry, batch evaluation, export
-- **[instance_level_demo.ipynb](examples/instance_level_demo.ipynb)** - DirectGenerator, ContrastiveGenerator (per-input checklists)
-- **[corpus_level_demo.ipynb](examples/corpus_level_demo.ipynb)** - InductiveGenerator, DeductiveGenerator, InteractiveGenerator (per-dataset checklists)
+See the [CLI guide](https://autochecklist.github.io/user-guide/cli/) for all commands.
 
 
 ## UI
 
 A web interface for demonstrating `autochecklist` methods. See [ui/README.md](ui/README.md) for details.
 
-**Quick Start:**
 ```bash
-cd ui
-./launch_ui.sh
-# Frontend: http://localhost:7860
-# Backend:  http://localhost:7861
+autochecklist ui          # or: cd ui && ./launch_ui.sh
+autochecklist ui --dev    # development mode (hot-reload)
 ```
 
+> The `ui` subcommand is only available from a source checkout.
+
 ## Testing
 
 > [!WARNING]
 
@@ -1,8 +1,11 @@
 # AutoChecklist
 
-[![GitHub Stars](https://img.shields.io/github/stars/ChicagoHAI/AutoChecklist?style=flat-square)](https://github.com/ChicagoHAI/AutoChecklist)
-[![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg?style=flat-square)](https://www.python.org/downloads/)
-[![License](https://img.shields.io/badge/License-Apache%202.0-green.svg?style=flat-square)](LICENSE)
+<p align="center">
+  <a href="https://github.com/ChicagoHAI/AutoChecklist"><img src="https://img.shields.io/github/stars/ChicagoHAI/AutoChecklist?style=flat-square" alt="GitHub Stars"></a>
+  <a href="https://www.python.org/downloads/"><img src="https://img.shields.io/badge/python-3.10+-blue.svg?style=flat-square" alt="Python 3.10+"></a>
+  <a href="LICENSE"><img src="https://img.shields.io/badge/License-Apache%202.0-green.svg?style=flat-square" alt="License"></a>
+  <a href="https://autochecklist.github.io/"><img src="https://img.shields.io/badge/site-autochecklist.github.io-purple?style=flat-square" alt="Site"></a>
+</p>
 
 A library of composable pipelines for generating and scoring checklist criteria.
 
@@ -33,7 +36,7 @@ Each generator is customizable via prompt templates (`.md` files with `{input}`,
 
 ### Built-in Pipelines
 
-The library includes built-in pipelines implementing methods from research papers ([TICK](https://arxiv.org/abs/2410.03608), [RocketEval](https://arxiv.org/abs/2503.05142), [RLCF](https://arxiv.org/abs/2507.18624), [CheckEval](https://arxiv.org/abs/2403.18771), [InteractEval](https://arxiv.org/abs/2409.07355), and more). See [Supported Pipelines](https://github.com/ChicagoHAI/AutoChecklist/blob/main/docs/user-guide/pipelines.md) for the full list and configuration details.
+The library includes built-in pipelines implementing methods from research papers ([TICK](https://arxiv.org/abs/2410.03608), [RocketEval](https://arxiv.org/abs/2503.05142), [RLCF](https://arxiv.org/abs/2507.18624), [OpenRubrics](https://arxiv.org/abs/2510.07743), [CheckEval](https://arxiv.org/abs/2403.18771), [InteractEval](https://arxiv.org/abs/2409.07355), and more). See [Supported Pipelines](https://autochecklist.github.io/user-guide/supported-pipelines/) for the full list and configuration details.
 
 ### Scoring
 
@@ -78,114 +81,30 @@ pip install "autochecklist[all]"
 
 For development installation from source, see the [GitHub repository](https://github.com/ChicagoHAI/AutoChecklist).
 
-## Using the Package
-
-### Custom Prompts
-
-Write a prompt template and generate a checklist:
-
-```python
-from autochecklist import DirectGenerator, ChecklistScorer
-
-gen = DirectGenerator(
-    custom_prompt="You are an expert evaluator. Generate yes/no checklist questions to score:\n\n{input}",
-    model="openai/gpt-5-mini",
-)
-checklist = gen.generate(input="Write a haiku about autumn.")
-
-scorer = ChecklistScorer(mode="batch", model="openai/gpt-5-mini")
-score = scorer.score(checklist, target="Leaves fall gently down...")
-print(f"Pass rate: {score.pass_rate:.0%}")
-```
-
-Scorers also take custom prompts. Prompts can also be loaded from `.md` files — see [Custom Prompts](https://github.com/ChicagoHAI/AutoChecklist/blob/main/docs/user-guide/custom-prompts.md) for the full guide (placeholders, custom scorers, registration).
-
-### Custom Pipelines
-
-Register a custom pipeline (generator + scorer + prompts) as a reusable unit:
-
-```python
-from autochecklist import register_custom_pipeline, pipeline
-
-# Register from config
-register_custom_pipeline(
-    "my_eval",
-    generator_prompt="Generate yes/no questions for:\n\n{input}",
-    scorer="weighted",
-)
-pipe = pipeline("my_eval", generator_model="openai/gpt-5-mini")
-
-# Or register from an existing pipeline instance
-register_custom_pipeline("my_eval_v2", pipe)
-
-# Save/load pipeline configs as JSON
-from autochecklist import save_pipeline_config, load_pipeline_config
-save_pipeline_config("my_eval", "my_eval.json")
-load_pipeline_config("my_eval.json")  # registers and returns the name
-```
-
-### Built-in Pipelines
-
-The library includes pipelines implementing methods from research papers. Use them via `method_name` or the `pipeline()` shorthand:
+## Quick Start
 
 ```python
 from autochecklist import pipeline
 
 pipe = pipeline("tick", generator_model="openai/gpt-5-mini", scorer_model="openai/gpt-5-mini")
-result = pipe(input="Write a haiku about autumn", target="Leaves fall gently...")
+result = pipe(input="Write a haiku about autumn.", target="Leaves fall gently down...")
 print(f"Pass rate: {result.pass_rate:.0%}")
 ```
 
-See [Supported Pipelines](https://github.com/ChicagoHAI/AutoChecklist/blob/main/docs/user-guide/pipelines.md) for the full list of pipelines, paper details, and configuration options.
-
-### Batch Evaluation
-
-```python
-data = [
-    {"input": "Write a haiku", "target": "Leaves fall..."},
-    {"input": "Write a limerick", "target": "There once was..."},
-]
-result = pipe.run_batch(data, show_progress=True)
-print(f"Macro pass rate: {result.macro_pass_rate:.0%}")
-```
-
-For pipeline composition, provider configuration, and the full API, see the [Pipeline Guide](https://github.com/ChicagoHAI/AutoChecklist/blob/main/docs/user-guide/pipeline.md).
+See the [Quick Start guide](https://autochecklist.github.io/getting-started/quickstart/) for custom prompts, batch evaluation, and more.
 
-### Command-Line Interface
-
-Run evaluations directly from the terminal:
+### CLI
 
 ```bash
-# Full evaluation (generate + score)
 autochecklist run --pipeline tick --data eval_data.jsonl -o results.jsonl \
   --generator-model openai/gpt-4o-mini --scorer-model openai/gpt-4o-mini
-
-# Generate checklists only
-autochecklist generate --pipeline tick --data inputs.jsonl -o checklists.jsonl \
-  --generator-model openai/gpt-4o-mini
-
-# Score with existing checklist
-autochecklist score --data eval_data.jsonl --checklist checklist.json \
-  -o results.jsonl --scorer-model openai/gpt-4o-mini
-
-# List available pipelines
-autochecklist list
 ```
 
-API keys can be set via `--api-key`, environment variables (`OPENROUTER_API_KEY`), or a `.env` file. See the [CLI Guide](https://github.com/ChicagoHAI/AutoChecklist/blob/main/docs/user-guide/cli.md) for full details.
-
-### Examples
-
-Detailed examples with runnable code:
-
-- **[custom_components_tutorial.ipynb](https://github.com/ChicagoHAI/AutoChecklist/blob/main/examples/custom_components_tutorial.ipynb)** - Create your own generators, scorers, and refiners
-- **[pipeline_demo.ipynb](https://github.com/ChicagoHAI/AutoChecklist/blob/main/examples/pipeline_demo.ipynb)** - Pipeline API, registry, batch evaluation, export
-- **[instance_level_demo.ipynb](https://github.com/ChicagoHAI/AutoChecklist/blob/main/examples/instance_level_demo.ipynb)** - DirectGenerator, ContrastiveGenerator (per-input checklists)
-- **[corpus_level_demo.ipynb](https://github.com/ChicagoHAI/AutoChecklist/blob/main/examples/corpus_level_demo.ipynb)** - InductiveGenerator, DeductiveGenerator, InteractiveGenerator (per-dataset checklists)
+See the [CLI guide](https://autochecklist.github.io/user-guide/cli/) for all commands.
 
 ## Links
 
-<!-- - [Full Documentation](https://autochecklist.github.io) -->
+- [Documentation](https://autochecklist.github.io)
 - [GitHub Repository](https://github.com/ChicagoHAI/AutoChecklist) — contributing, UI, dev setup
 - [Bug Tracker](https://github.com/ChicagoHAI/AutoChecklist/issues)
 
 
@@ -49,7 +49,8 @@
     list_refiners_with_info,
 )
 
-__version__ = "0.1.0"
+from importlib.metadata import version as _pkg_version
+__version__ = _pkg_version("autochecklist")
 
 __all__ = [
     # Models
@@ -63,6 +64,10 @@
     "DeductiveInput",
     "FeedbackInput",
     "InteractiveInput",
+    "ChecklistResponse",
+    "WeightedChecklistResponse",
+    "CategorizedChecklistResponse",
+    "GeneratedCategorizedQuestion",
     # Config
     "configure",
     "get_config",
 
@@ -185,12 +185,27 @@ def cmd_list(args: argparse.Namespace) -> None:
             print(f"{r['name']:<20} {r.get('description', '')}")
 
 
+def _find_repo_root() -> Path | None:
+    """Find the repo root by walking up from the package dir."""
+    _dir = Path(__file__).resolve().parent
+    for _ in range(5):
+        _dir = _dir.parent
+        if (_dir / "ui" / "launch_ui.sh").exists() and (_dir / "pyproject.toml").exists():
+            return _dir
+    return None
+
+
 def cmd_ui(args: argparse.Namespace) -> None:
     """Launch the AutoChecklist UI."""
     import os
     import subprocess
 
-    repo_root = Path(__file__).resolve().parent.parent
+    repo_root = _find_repo_root()
+    if repo_root is None:
+        print("Error: could not find AutoChecklist source tree. "
+              "The 'ui' command is only available from a source checkout.", file=sys.stderr)
+        sys.exit(1)
+
     cmd = [str(repo_root / "ui" / "launch_ui.sh")]
     if args.dev:
         cmd.append("--dev")
@@ -281,9 +296,7 @@ def main(argv: list[str] | None = None) -> None:
     list_parser.set_defaults(func=cmd_list)
 
     # --- ui (only available in source checkout) ---
-    pkg_dir = Path(__file__).resolve().parent
-    ui_script = pkg_dir.parent / "ui" / "launch_ui.sh"
-    if ui_script.exists():
+    if _find_repo_root() is not None:
         ui_parser = subparsers.add_parser("ui", help="Launch the AutoChecklist UI")
         ui_parser.add_argument("--dev", action="store_true", help="Run in development mode (hot-reload)")
         ui_parser.set_defaults(func=cmd_ui)
 
@@ -3,6 +3,7 @@
 [![GitHub Stars](https://img.shields.io/github/stars/ChicagoHAI/AutoChecklist?style=flat-square)](https://github.com/ChicagoHAI/AutoChecklist)
 [![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg?style=flat-square)](https://www.python.org/downloads/)
 [![License](https://img.shields.io/badge/License-Apache%202.0-green.svg?style=flat-square)](LICENSE)
+[![Site](https://img.shields.io/badge/site-autochecklist.github.io-purple?style=flat-square)](https://autochecklist.github.io/)
 
 `AutoChecklist` is an open-source library that unifies LLM-based checklist evaluation into composable pipelines, in a `pip`-installable Python package (`autochecklist`) with CLI and UI features.
 
 
@@ -1,6 +1,6 @@
 [project]
 name = "autochecklist"
-version = "0.2.0"
+version = "0.2.1"
 description = "A library of checklist generation and scoring methods for LLM evaluation"
 authors = [{name = "ChicagoHAI"}]
 readme = "README.pypi.md"
@@ -75,6 +75,7 @@ dev = [
     "nbconvert>=7.17.0",
     "pytest>=9.0.2",
     "pytest-asyncio>=1.3.0",
+    "ruff>=0.14.14",
 ]
 
 [tool.pytest.ini_options]