Skip to content

Use fp32 accumulation in SkipLayerNorm/EmbedLayerNorm CUDA kernels#28682

Open
tianleiwu wants to merge 4 commits into
mainfrom
tlwu/sln_fp32_compute_type
Open

Use fp32 accumulation in SkipLayerNorm/EmbedLayerNorm CUDA kernels#28682
tianleiwu wants to merge 4 commits into
mainfrom
tlwu/sln_fp32_compute_type

Commits

Commits on May 26, 2026

Commits on May 27, 2026