This directory generates executable Python verifier scripts from formatted objective tasks. It is the bridge between accepted objective tasks and task-specific verification code.
Input files should be formatted objective-task JSON files containing:
proposed_question
rubric_tree_analysis_refined.formatted_tree
Typical input comes from the objective task pipeline:
task/obj_task/outputs/objective_trajectories/formatted/refined/
If the refined directory does not exist, use the original formatted/ directory.
From the repository root:
python task/obj_eval/obj_eval_generation.py \
--input /path/to/formatted_tasks \
--template task/obj_eval/generation_prompt.md \
--output /path/to/objective_verifiers \
--model openai/gpt-5 \
--concurrency 20--input can be either a single formatted JSON file or a directory containing
many formatted JSON files.
The generation model can also be configured with:
OBJ_EVAL_MODEL_NAME
The output directory contains one verifier script per input task:
tree2py_<input_json_stem>.py
If the model response does not contain usable Python code, the raw response is saved as:
tree2script_formatted_<input_json_stem>_raw.md
The generator prints total, success, failed, and token-usage counts at the end of the run.
| File | Purpose |
|---|---|
obj_eval_generation.py |
Batch generator for objective verifier scripts |
generation_prompt.md |
Prompt template for verifier generation |
utils/ |
Shared verifier runtime and examples used in the prompt |