Skip to content

fix: stabilize CUDA build and validation#1

Merged
LessUp merged 1 commit into
masterfrom
fix/cuda-validation-stability
May 25, 2026
Merged

fix: stabilize CUDA build and validation#1
LessUp merged 1 commit into
masterfrom
fix/cuda-validation-stability

Conversation

@LessUp
Copy link
Copy Markdown
Owner

@LessUp LessUp commented May 25, 2026

Summary

  • stabilize the local CUDA/CMake toolchain defaults and architecture handling for nvcc 12
  • fix Tensor Core fallback/test source issues and add a shared CUDA-aware gtest environment
  • correct theoretical peak GFLOPS math and make performance regression thresholds realistic for the current teaching kernels

Validation

  • cmake -S . -B build -DCMAKE_BUILD_TYPE=Release
  • cmake --build build -j$(nproc)
  • ctest --test-dir build --output-on-failure
  • npm install -g @fission-ai/openspec@latest && openspec validate --all
  • npm --prefix docs ci && npm --prefix docs test && npm --prefix docs run build

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@LessUp LessUp merged commit 8e77671 into master May 25, 2026
1 check failed
@LessUp LessUp deleted the fix/cuda-validation-stability branch May 25, 2026 18:30
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant