This repository uses OpenSpec and is currently being normalized through the project-closeout change. Every assistant working here—including Claude, Copilot, Codex, and other LLM-based tools—should optimize for stability, simplification, and high-signal cleanup, not feature expansion.
| File or directory | Purpose |
|---|---|
openspec/specs/ |
Active requirements and capability definitions |
openspec/changes/archive/2026-04-23-project-closeout/ |
Governing closeout change (archived reference) |
AGENTS.md |
Unified AI workflow contract (this file) |
CLAUDE.md |
Claude-specific tooling notes (thin delta to AGENTS.md) |
.github/copilot-instructions.md |
Copilot-specific tooling notes (thin delta to AGENTS.md) |
.claude/CLAUDE.md |
Mirror pointer to CLAUDE.md |
bf16-supportflashattention-backward
These changes are deferred backlog. Do not treat them as release-critical unless the user explicitly re-prioritizes them.
- Keep the shipped kernels, bindings, docs, and automation coherent.
- Remove stale or redundant structure rather than preserving low-value scaffolding.
- Fix real defects uncovered by verification or cleanup.
- Keep repo presentation strong: README, Pages, and GitHub About should align.
- Read the relevant OpenSpec specs and active change artifacts.
- If work changes behavior or scope, update/create OpenSpec artifacts first.
- Implement only the tasks that belong to the active change.
- Keep edits deletion-first and repository-specific; avoid generic process bloat.
- After each meaningful cleanup slice, run validation and do a review pass before merge.
- Use
/reviewor an equivalent code-review pass before merging substantial cleanup. - Prefer short-lived branches scoped to one OpenSpec change.
- Avoid long-lived local/cloud branch drift; merge coherent slices promptly.
- Prefer longer focused sessions over
/fleet-style bursty runs.
All tools should run these checks before claiming changes are ready:
ruff check cuda_llm_ops/ tests/ benchmarks/
pytest tests/ -v -m "not cuda"
pre-commit run --all-filespython3 -m venv .venv
. .venv/bin/activate
pip install -r requirements.txt pytest hypothesis ruff pre-commit- C/CUDA LSP:
clangdwithcmake --preset defaultto generatecompile_commands.json - Python LSP:
pyrightorbasedpyrightusingpyrightconfig.json - Git and task tracking:
ghCLI, OpenSpec commands, targeted subagents (prefer over heavyweight MCP) - Pre-commit hooks: Run
pre-commit run --all-filesbefore merge
Keep MCP minimal and repository-specific. Prefer gh, OpenSpec commands, and targeted subagents over generic plugin infrastructure.
README.md/README.zh-CN.md: concise project entry pointsdocs/: durable user docs only.github/workflows/: only meaningful maintenance checksCONTRIBUTING.md: human workflow, kept aligned with OpenSpec and CLI-first usage
All tools (Claude, Copilot, Codex, etc.): Follow the unified contract above. Do not create tool-specific parallel planning documents or processes.
See CLAUDE.md and .github/copilot-instructions.md for thin delta guidance on LSP/tooling preferences.