Skip to content

Latest commit

 

History

History
70 lines (49 loc) · 1.65 KB

File metadata and controls

70 lines (49 loc) · 1.65 KB

Objective Verifier Generation

This directory generates executable Python verifier scripts from formatted objective tasks. It is the bridge between accepted objective tasks and task-specific verification code.

Overview

Input files should be formatted objective-task JSON files containing:

proposed_question
rubric_tree_analysis_refined.formatted_tree

Typical input comes from the objective task pipeline:

task/obj_task/outputs/objective_trajectories/formatted/refined/

If the refined directory does not exist, use the original formatted/ directory.

Run Generation

From the repository root:

python task/obj_eval/obj_eval_generation.py \
  --input /path/to/formatted_tasks \
  --template task/obj_eval/generation_prompt.md \
  --output /path/to/objective_verifiers \
  --model openai/gpt-5 \
  --concurrency 20

--input can be either a single formatted JSON file or a directory containing many formatted JSON files.

The generation model can also be configured with:

OBJ_EVAL_MODEL_NAME

Outputs

The output directory contains one verifier script per input task:

tree2py_<input_json_stem>.py

If the model response does not contain usable Python code, the raw response is saved as:

tree2script_formatted_<input_json_stem>_raw.md

The generator prints total, success, failed, and token-usage counts at the end of the run.

Main Files

File Purpose
obj_eval_generation.py Batch generator for objective verifier scripts
generation_prompt.md Prompt template for verifier generation
utils/ Shared verifier runtime and examples used in the prompt