Commit cc60b9b
committed
feat(burn): wire SIMD sin/cos via hpc::vml (eliminate f64 roundtrip)
float_sin → ndarray::hpc::vml::vssin (F32x16 direct, no f64 conversion)
float_cos → ndarray::hpc::vml::vscos (F32x16 direct, no f64 conversion)
Original burn-ndarray: cast f32→f64, compute sin/cos, cast f64→f32.
Our path: operate directly on f32 via SIMD polynomial approximation.
Total SIMD-wired ops: exp, log, sqrt, abs, sin, cos, sigmoid (7 ops).
30 tests passing.
https://claude.ai/code/session_01Y69Vnw751w75iVSBRws7o71 parent 8d3f6bc commit cc60b9b
1 file changed
Lines changed: 10 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
561 | 561 | | |
562 | 562 | | |
563 | 563 | | |
| 564 | + | |
| 565 | + | |
| 566 | + | |
| 567 | + | |
| 568 | + | |
564 | 569 | | |
565 | 570 | | |
566 | 571 | | |
| |||
577 | 582 | | |
578 | 583 | | |
579 | 584 | | |
| 585 | + | |
| 586 | + | |
| 587 | + | |
| 588 | + | |
| 589 | + | |
580 | 590 | | |
581 | 591 | | |
582 | 592 | | |
| |||
0 commit comments