AGENTS.md

Shared repository guidance for AI assistants.

Repository posture

This project is in stabilization / close-out mode. Prefer simplification, consolidation, and trustworthiness over feature expansion. Delete or archive low-value material once a stronger canonical replacement exists.

Project-specific context

Implementation status: GEMM Steps 1-5 are fully implemented. Step 6 (MMA PTX) delegates to Step 5 for stability. Step 7 (Software Pipelining) is planned for future implementation.
Type support: CUTLASS baseline only supports float. INT8 GEMM has complete SharedMemTiling optimization; other optimization levels delegate to SharedMemTiling.
Python bindings: Currently expose elementwise, reduction, and gemm only. Not all C++ modules have Python bindings.
CI limitation: GPU validation requires local execution or self-hosted infrastructure; GitHub-hosted runners do not provide CUDA support.

Canonical sources of truth

Active work: openspec/changes/<change>/
Historical decisions: openspec/archive/
Shared assistant workflow: AGENTS.md
Tool adapters: CLAUDE.md, .github/copilot-instructions.md
Public project entry: README.md

Do not create overlapping active OpenSpec changes for the same scope.

Required OpenSpec workflow

For any non-trivial repository change:

/opsx:explore to investigate and narrow scope
/opsx:propose to create a bounded change
/opsx:apply to implement from tasks
/review before merge
/opsx:archive after completion

Additional rules:

Prefer serial, bounded changes over concurrent long-lived branches.
Prefer one long autopilot session over /fleet unless parallelism is clearly worth the added drift.
Keep changes small enough to review and archive cleanly.
Update OpenSpec artifacts before expanding implementation scope.

Project-specific cleanup rules

openspec/ is the only active planning system. Remove or rewrite legacy specs/ references.
Keep the repository root high-signal. Generic summaries, duplicated reports, and filler docs should not stay active.
Public-facing copy must avoid unverifiable claims, stale completion percentages, and speculative roadmap noise.
GitHub Pages should act as a focused landing page, not a verbatim README mirror.
Preserve user-authored in-progress edits; do not revert or overwrite unclear work without understanding it first.

Technical profile

Language / runtime: C++20, CUDA 12.4+, optional Python bindings via nanobind
Build: CMake 3.24+ with presets in CMakePresets.json
Tests: GoogleTest + RapidCheck
Primary code intelligence: clangd using generated compile_commands.json
Preferred GitHub interface: gh

Validation baseline:

cmake --preset default
cmake --build --preset default
ctest --preset default

If the configured build tree exposes zero tests or stale results, reconfigure before trusting it.

Code style and conventions

CUDA kernel organization: One kernel per file under src/<module>/
Error handling: Use CUDA_CHECK macro for all CUDA API calls; throw std::invalid_argument for invalid parameters
Documentation sync: When updating code, ensure README, docs, and API reference reflect the changes
Performance claims: Only state TFLOPS numbers that have been measured; mark projected or estimated values clearly

Tooling and automation posture

Keep hooks and automation narrow; every retained check should protect a real recurring failure mode.
Prefer native tooling, built-in skills, and gh before adding MCP servers or plugins.
Only commit editor or AI-tool config that improves repository-wide consistency. Personal preferences stay local.

Review guidance

Use /review for:

broad refactors
workflow changes
deletion-heavy cleanup
GitHub-facing copy or policy changes

Use subagents only for independent investigation threads. Do not delegate work that is tightly coupled to an active local edit sequence.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AGENTS.md

Repository posture

Project-specific context

Canonical sources of truth

Required OpenSpec workflow

Project-specific cleanup rules

Technical profile

Code style and conventions

Tooling and automation posture

Review guidance

FilesExpand file tree

AGENTS.md

Latest commit

History

AGENTS.md

File metadata and controls

AGENTS.md

Repository posture

Project-specific context

Canonical sources of truth

Required OpenSpec workflow

Project-specific cleanup rules

Technical profile

Code style and conventions

Tooling and automation posture

Review guidance