SuperagenticAI
diff --git a/‎.github/ISSUE_TEMPLATE/bug_report.md‎
Lines changed: 35 additions & 0 deletions b/‎.github/ISSUE_TEMPLATE/bug_report.md‎
Lines changed: 35 additions & 0 deletions
diff --git a/‎.github/ISSUE_TEMPLATE/feature_request.md‎
Lines changed: 23 additions & 0 deletions b/‎.github/ISSUE_TEMPLATE/feature_request.md‎
Lines changed: 23 additions & 0 deletions
diff --git a/‎.github/pull_request_template.md‎
Lines changed: 19 additions & 0 deletions b/‎.github/pull_request_template.md‎
Lines changed: 19 additions & 0 deletions
diff --git a/‎CHANGELOG.md‎
Lines changed: 36 additions & 0 deletions b/‎CHANGELOG.md‎
Lines changed: 36 additions & 0 deletions
diff --git a/‎CONTRIBUTING.md‎
Lines changed: 24 additions & 0 deletions b/‎CONTRIBUTING.md‎
Lines changed: 24 additions & 0 deletions
diff --git a/‎SECURITY.md‎
Lines changed: 19 additions & 0 deletions b/‎SECURITY.md‎
Lines changed: 19 additions & 0 deletions
diff --git a/‎docs/codex-users.md‎
Lines changed: 227 additions & 0 deletions b/‎docs/codex-users.md‎
Lines changed: 227 additions & 0 deletions
@@ -0,0 +1,35 @@
+---
+name: Bug report
+about: Report a reproducible bug in CodexOpt
+title: "[Bug] "
+labels: bug
+assignees: ""
+---
+
+## Summary
+
+Describe the bug clearly.
+
+## Steps To Reproduce
+
+1. 
+2. 
+3. 
+
+## Expected Behavior
+
+What should have happened?
+
+## Actual Behavior
+
+What happened instead?
+
+## Environment
+
+- OS:
+- Python:
+- CodexOpt version:
+
+## Logs / Output
+
+Add relevant terminal output, stack traces, or run artifacts.
@@ -0,0 +1,23 @@
+---
+name: Feature request
+about: Suggest an improvement for CodexOpt
+title: "[Feature] "
+labels: enhancement
+assignees: ""
+---
+
+## Problem
+
+What problem are you trying to solve?
+
+## Proposed Solution
+
+Describe the feature and expected behavior.
+
+## Alternatives Considered
+
+What alternatives did you evaluate?
+
+## Additional Context
+
+Include examples, references, or related issues.
@@ -0,0 +1,19 @@
+## Summary
+
+Describe what changed and why.
+
+## Checklist
+
+- [ ] I ran lint checks.
+- [ ] I ran tests.
+- [ ] I updated docs/changelog if needed.
+- [ ] I verified this does not introduce unrelated changes.
+
+## Validation
+
+Paste relevant command output (or summarize key results):
+
+```bash
+uv run --no-sync ruff check src tests
+PYTEST_DISABLE_PLUGIN_AUTOLOAD=1 uv run --no-sync pytest -q
+```
@@ -0,0 +1,36 @@
+# Changelog
+
+All notable changes to this project will be documented in this file.
+
+## [Unreleased]
+
+## [0.2.0] - 2026-05-26
+
+### Added
+- Added `codexopt improve` as the one-command Codex workflow for discovery, task mining, reflective optimization, preview, and apply.
+- Added `codexopt improve --live` to opt into Codex-backed optimizer and judge runs.
+- Added the `reflective` engine for SkillOpt and GEPA inspired optimization of `SKILL.md` and `AGENTS.md`.
+- Added tiered rewards with verifier, judge, and static fallback modes.
+- Added Codex rollout parsing for `codex exec --json` trajectories.
+- Added `codexopt tasks init` to mine starter optimization tasks from git history, skills, and issues.
+- Added `skillopt` as a SKILL.md optimization engine with train/validation evidence splits.
+- Added validation-gated candidate acceptance with configurable edit budget and validation delta.
+- Added optional executable rollout tasks from JSON `evidence.task_files`.
+- Added temporary-repo rollout execution for candidate skills, including pass/fail artifact metadata.
+- Added SkillOpt metadata to `optimize.json`, CLI summaries, and markdown reports.
+- Added support for `.agents/skills/**/SKILL.md` discovery.
+- Documented progressive Codex user workflows, reflective optimization, rollout configuration, task format, artifacts, and current boundaries.
+
+### Changed
+- Made offline preview the default for `codexopt improve` so Codex and API budget are only used when explicitly requested.
+- Deprecated the legacy `--engine gepa` path in favor of the maintained `reflective` engine.
+- Updated package description to emphasize Codex and SkillOpt-style validation.
+
+## [0.1.0] - 2026-03-09
+
+### Added
+- Initial open-source release of CodexOpt.
+- CLI workflow for `init`, `scan`, `benchmark`, `optimize`, `apply`, and `report`.
+- Heuristic optimization engine and optional GEPA integration path.
+- Run artifacts and markdown reporting.
+- `uv`-first CI pipeline for lint, test, and build.
@@ -0,0 +1,24 @@
+# Contributing
+
+Thanks for contributing to CodexOpt.
+
+## Development Setup
+
+```bash
+uv lock
+uv sync --extra dev
+```
+
+## Run Checks
+
+```bash
+uv run --no-sync ruff check src tests
+PYTEST_DISABLE_PLUGIN_AUTOLOAD=1 uv run --no-sync pytest -q
+uv build
+```
+
+## Pull Requests
+
+- Keep changes scoped and include tests when behavior changes.
+- Update `README.md` and/or `CHANGELOG.md` when relevant.
+- Ensure CI passes before requesting review.
@@ -0,0 +1,19 @@
+# Security Policy
+
+## Reporting a Vulnerability
+
+Please report security issues privately to:
+
+- `shashi@super-agentic.ai`
+
+Include:
+
+- A clear description of the issue.
+- Reproduction steps or proof of concept.
+- Affected versions and environment details.
+
+## Response
+
+- We will acknowledge reports as quickly as possible.
+- We will work on validation, mitigation, and a fix timeline.
+- Please avoid public disclosure until a fix is available.
@@ -0,0 +1,227 @@
+# Using CodexOpt with Codex
+
+Use this guide when your repo already has Codex instruction files and you want
+CodexOpt to improve them safely.
+
+CodexOpt works with the same files Codex loads:
+
+- `AGENTS.md`
+- `.codex/skills/**/SKILL.md`
+- `.agents/skills/**/SKILL.md`
+
+## Start With A Preview
+
+Run this from the repo where you use Codex:
+
+```bash
+uv run codexopt improve
+```
+
+This command:
+
+1. finds `AGENTS.md` and `SKILL.md` files
+2. mines starter tasks from git history and skill descriptions
+3. runs the reflective optimizer in preview mode
+4. shows what would change
+5. writes review artifacts under `.codexopt/`
+
+The default preview stays offline. It does not spend Codex or API budget unless
+you ask it to.
+
+## Run The Live Codex Loop
+
+Use live mode when you want CodexOpt to evaluate actual Codex behavior:
+
+```bash
+uv run codexopt improve --live
+```
+
+Live mode uses `codex exec` as the optimizer and judge. CodexOpt evaluates the
+candidate instruction file, captures feedback from the run, proposes a focused
+rewrite, and keeps the rewrite only when it improves held-out tasks.
+
+## Apply The Result
+
+After reviewing the preview, apply validated changes:
+
+```bash
+uv run codexopt improve --live --apply
+```
+
+CodexOpt writes backups before changing files.
+
+## Review The Report
+
+Write a markdown report after any run:
+
+```bash
+uv run codexopt report --output codexopt-report.md
+```
+
+The report shows:
+
+- files found
+- files improved
+- validation score movement
+- accepted reflective edits
+- sampled feedback that led to the edit
+- fallback notes when CodexOpt had to use a weaker signal
+
+## Step By Step Workflow
+
+Use this flow when you want more control than `improve`:
+
+```bash
+uv run codexopt init
+uv run codexopt scan
+uv run codexopt benchmark
+uv run codexopt optimize skills --engine reflective
+uv run codexopt apply --kind skills --dry-run
+uv run codexopt report --output codexopt-report.md
+```
+
+Review the dry-run diff, then apply:
+
+```bash
+uv run codexopt apply --kind skills
+```
+
+For `AGENTS.md`:
+
+```bash
+uv run codexopt optimize agents --engine reflective --file AGENTS.md
+uv run codexopt apply --kind agents --dry-run
+```
+
+## Add Simple Task Evidence
+
+Task evidence tells CodexOpt what “better” means for your repo.
+
+Create `tasks.md`:
+
+```md
+- Update changelog entries for patch releases.
+- Add regression tests before changing parser behavior.
+- Summarize risky changes in the final response.
+```
+
+Reference it in `codexopt.yaml`:
+
+```yaml
+evidence:
+  task_files:
+    - tasks.md
+```
+
+Then run:
+
+```bash
+uv run codexopt improve
+```
+
+CodexOpt uses these tasks for train and validation splits. A candidate must
+improve held-out validation score before it can win.
+
+## Mine Starter Tasks
+
+If you do not have task evidence yet, generate a starter file:
+
+```bash
+uv run codexopt tasks init
+```
+
+Review the generated `codexopt-tasks.json`, trim anything noisy, then add it to
+`evidence.task_files`.
+
+## Add Command Rollouts
+
+Use command rollouts when a deterministic verifier can decide whether a skill
+supports a workflow.
+
+Create `skill-rollouts.json`:
+
+```json
+[
+  {
+    "name": "release-skill-smoke",
+    "description": "Verify the release skill mentions changelog and tests.",
+    "command": "python scripts/verify_release_skill.py",
+    "timeout_seconds": 30,
+    "expected_stdout_contains": "ok"
+  }
+]
+```
+
+Reference it:
+
+```yaml
+evidence:
+  task_files:
+    - skill-rollouts.json
+```
+
+Run:
+
+```bash
+uv run codexopt improve
+```
+
+CodexOpt copies the repo to a temporary directory, writes the candidate
+`SKILL.md`, runs the verifier, and uses pass rate as a strong reward signal.
+
+## Add Codex Rollouts
+
+Use Codex rollouts when you want to test how Codex behaves with a candidate
+skill.
+
+Create `codex-rollouts.json`:
+
+```json
+[
+  {
+    "name": "codex-release-notes",
+    "backend": "codex",
+    "description": "Ask Codex to use the candidate release skill on a release-note task.",
+    "codex_prompt": "Use the local release skill to update CHANGELOG.md for a patch release.",
+    "timeout_seconds": 120,
+    "expected_final_response_contains": "CHANGELOG.md",
+    "expected_command_contains": "git status",
+    "expected_file_change": "CHANGELOG.md",
+    "expected_file_contains": {
+      "path": "CHANGELOG.md",
+      "contains": "Patch"
+    }
+  }
+]
+```
+
+Run live mode:
+
+```bash
+uv run codexopt improve --live
+```
+
+CodexOpt runs `codex exec --json` in a temporary repo copy and records the
+trajectory:
+
+- final response
+- command executions
+- file changes
+- token usage
+- errors
+
+## What SkillOpt Means In CodexOpt
+
+CodexOpt now includes SkillOpt-style discipline in the Codex workflow:
+
+- train and validation task splits
+- bounded edits
+- validation-gated acceptance
+- rollout-based reward when available
+- textual feedback that drives reflective mutation
+
+For most users, the entry point is still simple:
+
+```bash
+uv run codexopt improve --live
+```