Commit 0517a06
authored
Introduce Quadtrix benchmark suite with Python and C++ support (#44)
## Summary
- Project Versioning: Sets the starting project version to 0.1.0.
- Code Shortcuts (Macros): Creates clean shorthand terms for CUDA
keywords (like wrapping __device__ into QX_DEVICE) to make writing GPU
kernels cleaner.
- Math & Memory Utilities: Adds fast math helpers for aligning memory,
rounding numbers, and calculating power-of-two boundaries quickly.
- Memory Optimization: Forces a 128-byte memory alignment to ensure the
GPU can read data as fast as possible (coalesced memory access).
- Automatic Error Checking: Introduces safety wrappers (CUDA_CHECK,
CUBLAS_CHECK, NCCL_CHECK) that instantly watch for crashes or failures
in Nvidia's core hardware and math libraries, making debugging much
easier.8 files changed
Lines changed: 883 additions & 5 deletions
File tree
- asstes
- benchmark
- cuda/includes
- engine/iGPU
- frontend
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
| 2 | + | |
| 3 | + | |
2 | 4 | | |
3 | 5 | | |
4 | 6 | | |
| |||
15 | 17 | | |
16 | 18 | | |
17 | 19 | | |
18 | | - | |
19 | | - | |
20 | | - | |
21 | | - | |
22 | 20 | | |
23 | 21 | | |
24 | 22 | | |
| |||
File renamed without changes.
File renamed without changes.
0 commit comments