Use fp32 accumulation in SkipLayerNorm/EmbedLayerNorm CUDA kernels #28682
+654 −99
Azure Pipelines / Linux Android Emulator QNN CI Pipeline
succeeded
May 27, 2026 in 13m 6s
Build #20260526.39 succeeded