Thank you for your interest in contributing! This guide covers development setup, code standards, and contribution workflow.
This project strictly follows the Spec-Driven Development (SDD) paradigm. All code implementations must use the specification documents in the /specs directory as the Single Source of Truth (SSOT).
Core Principle: Specs first, code second. Never write code without a corresponding spec.
| Directory | Purpose |
|---|---|
| `/specs/product/` | Product requirements, user stories, and acceptance criteria |
| `/specs/rfc/` | Technical design documents and architecture decisions |
| `/specs/api/` | API interface definitions (OpenAPI, etc.) |
| `/specs/db/` | Database and schema specifications |
| `/specs/testing/` | BDD test case specifications |
- Review Existing Specs: Before starting work, read the relevant specifications in `/specs/`.
- Update Spec First: If changing interfaces, adding features, or modifying behavior, update the corresponding spec document first.
- Implement Against Spec: Write code that 100% complies with the spec definitions.
- Test Against Spec: Ensure tests cover all acceptance criteria defined in the specs.
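The "test against spec" step can be audited mechanically. One loose sketch of such a convention is to embed the acceptance-criterion ID in each test's name so spec coverage can be extracted from the test suite; the ID format and test names below are purely illustrative, not an established project convention:

```python
# Hypothetical convention: encode spec acceptance-criterion IDs (e.g. AC01)
# in test names, then extract them to audit which criteria have coverage.
import re

TEST_NAMES = [
    "test_ac01_softmax_is_numerically_stable",
    "test_ac02_causal_mask_zeroes_future_positions",
]

def covered_criteria(names):
    """Extract acceptance-criterion IDs like 'AC01' from test names."""
    ids = set()
    for name in names:
        m = re.search(r"_ac(\d+)_", name)
        if m:
            ids.add(f"AC{m.group(1)}")
    return ids

assert covered_criteria(TEST_NAMES) == {"AC01", "AC02"}
```

Comparing the extracted set against the IDs listed in `/specs/testing/` would then flag criteria with no corresponding test.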
For detailed AI agent workflow instructions, see AGENTS.md.
| Task | Command |
|---|---|
| Install dependencies | `pip install -r requirements.txt` |
| Build extension | `pip install -e .` |
| Run all tests | `pytest tests/ -v` |
| Run CPU-safe tests | `pytest tests/ -v -m "not cuda"` |
| Lint check | `ruff check python/ tests/ benchmarks/` |
| Format code | `ruff format python/ tests/ benchmarks/` |
| Component | Minimum | Recommended |
|---|---|---|
| CUDA Toolkit | 11.0 | 12.0+ |
| Python | 3.8 | 3.10+ |
| PyTorch | 2.0 | 2.1+ |
| CMake | 3.18 | 3.25+ |
| GCC/G++ | 9.0 | 11+ |
| NVIDIA GPU | SM 7.0 | SM 8.0+ |
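When checking your toolchain against these minimums, note that dotted version strings must be compared numerically, not lexically ("3.8" sorts *after* "3.10" as a string). A minimal sketch of such a check; the helper names are ours, not part of the project:

```python
def _parse(version: str) -> tuple:
    """Turn a dotted version string into a comparable (major, minor) tuple."""
    return tuple(int(p) for p in version.split(".")[:2])

def meets_minimum(version: str, minimum: str) -> bool:
    """True if `version` satisfies `minimum`, comparing components numerically."""
    return _parse(version) >= _parse(minimum)

print(meets_minimum("12.0", "11.0"))  # CUDA 12.0 vs minimum 11.0 -> True
print(meets_minimum("3.8", "3.10"))   # Python 3.8 vs recommended 3.10 -> False
```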
```bash
# Clone repository
git clone https://github.com/LessUp/llm-speed.git
cd llm-speed

# Create virtual environment
python -m venv venv
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt
pip install pytest hypothesis ruff

# Build and install
pip install -e . --verbose
```

VS Code Extensions:
- ms-vscode.cpptools
- ms-vscode.cmake-tools
- ms-python.python
- ms-python.pytest
```cpp
// Constants: UPPER_SNAKE_CASE
constexpr int BLOCK_SIZE = 256;

// Templates: PascalCase
template<typename T>

// Functions: snake_case
__global__ void attention_kernel(...)

// Variables: snake_case
int thread_idx;

// Structs: PascalCase
struct AttentionConfig { int batch_size; };
```

- PEP 8 compliance
- 4-space indentation
- 100 character max line length
- f-strings for formatting
```python
def compute_flops(batch: int, heads: int, seq_len: int) -> int:
    """Compute FLOPs for attention.

    Args:
        batch: Batch size
        heads: Number of attention heads
        seq_len: Sequence length

    Returns:
        Total FLOPs
    """
    return batch * heads * seq_len * seq_len * 4
```

```bash
# Check code style
ruff check python/ tests/ benchmarks/

# Auto-fix issues
ruff check --fix python/ tests/ benchmarks/

# Format code
ruff format python/ tests/ benchmarks/
```

| Prefix | Purpose | Example |
|---|---|---|
| `feature/` | New feature | `feature/grouped-query-attention` |
| `fix/` | Bug fix | `fix/shared-memory-overflow` |
| `perf/` | Performance | `perf/register-tiling` |
| `docs/` | Documentation | `docs/api-reference` |
| `test/` | Testing | `test/boundary-cases` |
```bash
# Create feature branch
git checkout main
git pull origin main
git checkout -b feature/your-feature

# Make changes and test
pip install -e . --verbose
pytest tests/ -v -k "test_your_feature"

# Commit with conventional format
git add <files>
git commit -m "feat: add grouped query attention support"

# Push and create PR
git push origin feature/your-feature
```

| Marker | Purpose | Command |
|---|---|---|
| `cuda` | Requires GPU | `pytest -m cuda` |
| `property` | Hypothesis tests | `pytest -m property` |
| `slow` | Long-running | `pytest -m "not slow"` |
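Custom markers should be registered so `pytest -m cuda` and friends run without "unknown mark" warnings. A `conftest.py` sketch of how that registration, plus the `device` fixture the tests use, might look; the exact file contents are an assumption, not the project's actual conftest:

```python
# conftest.py sketch (assumed): register the custom markers and provide
# a `device` fixture that falls back to CPU when no GPU is present.
import pytest

def pytest_configure(config):
    for line in (
        "cuda: requires an NVIDIA GPU",
        "property: property-based Hypothesis tests",
        "slow: long-running tests",
    ):
        config.addinivalue_line("markers", line)

@pytest.fixture
def device():
    import torch  # imported lazily so collection works without a GPU build
    return "cuda" if torch.cuda.is_available() else "cpu"
```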
```python
import pytest
import torch
from torch.testing import assert_close
from hypothesis import given, settings, strategies as st

# flash_attention: this project's kernel entry point
# (import it from the installed package's public API)

class TestFlashAttention:
    @pytest.mark.cuda
    def test_correctness(self, device):
        """Verify output matches PyTorch reference."""
        q = torch.randn(2, 4, 64, 32, device=device)
        k = torch.randn(2, 4, 64, 32, device=device)
        v = torch.randn(2, 4, 64, 32, device=device)
        output = flash_attention(q, k, v)
        reference = torch.nn.functional.scaled_dot_product_attention(q, k, v)
        assert_close(output, reference)

    @pytest.mark.cuda
    @pytest.mark.property
    @settings(max_examples=100, deadline=None)
    @given(batch=st.integers(1, 4), seq_len=st.integers(16, 256))
    def test_property(self, batch, seq_len, device):
        """Property-based testing with Hypothesis."""
        pass
```

```bash
pip install pytest-cov
pytest tests/ --cov=python --cov-report=html
```

Use Conventional Commits:

```
<type>(<scope>): <subject>

[optional body]
```
| Type | Description | Example |
|---|---|---|
| `feat` | New feature | `feat: add sliding window attention` |
| `fix` | Bug fix | `fix: resolve divide-by-zero in softmax` |
| `perf` | Performance | `perf: optimize register allocation` |
| `docs` | Documentation | `docs: add tensor core examples` |
| `test` | Testing | `test: add causal mask tests` |
| `refactor` | Refactoring | `refactor: extract reduction logic` |
| `style` | Formatting | `style: fix clang-format` |
| `chore` | Maintenance | `chore: update cuda arch flags` |

Scopes:

- `attention` — Attention kernels
- `gemm` — GEMM kernels
- `bindings` — Python bindings
- `test` — Testing
- `docs` — Documentation
- `ci` — CI/CD
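A commit-msg hook could enforce this format mechanically. The regex below is a rough sketch assembled from the type and scope lists above, not the project's actual tooling:

```python
import re

# Types and scopes mirror the convention tables above (illustrative only).
TYPES = "feat|fix|perf|docs|test|refactor|style|chore"
SCOPES = "attention|gemm|bindings|test|docs|ci"
SUBJECT_RE = re.compile(rf"^(?:{TYPES})(?:\((?:{SCOPES})\))?: \S.*$")

def is_valid_subject(line: str) -> bool:
    """Check the first line of a commit message against the convention."""
    return SUBJECT_RE.match(line) is not None

print(is_valid_subject("feat: add sliding window attention"))            # True
print(is_valid_subject("perf(attention): optimize register allocation")) # True
print(is_valid_subject("updated some files"))                            # False
```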
- Code passes `ruff check`
- Code passes `ruff format --check`
- Tests pass: `pytest tests/ -v`
- Commit messages follow convention
- Documentation updated if needed
- Spec documents updated if applicable (for SDD compliance)
For bug reports, include:
- Environment (OS, CUDA, Python, PyTorch, GPU)
- Reproduction steps with code
- Expected vs actual behavior
- Error messages/logs
For feature requests, include:
- Use case description
- Expected benefits
- Implementation suggestions (optional)