Make float16/bfloat16 distinct types by cyyever · Pull Request #5736 · pytorch/FBGEMM

cyyever · 2026-05-05T11:20:08Z

This PR is the beginning of a series of works to create formal float16 and bfloat16 types, which can be used to simplify the templates by reducing the parameter number and using compile-time computing.

meta-codesync · 2026-05-05T17:37:40Z

@q10 has imported this pull request. If you are a Meta employee, you can view this in D103884301.

q10 · 2026-05-05T18:30:34Z

@cyyever Looks like there are undefined symbol errors

cyyever · 2026-05-05T23:30:35Z

@q10 fixed

cyyever · 2026-05-06T05:53:25Z

@q10 fixed

q10 · 2026-05-06T23:05:35Z

@cyyever looks like there are test errors on aarch64

cyyever · 2026-05-06T23:36:57Z

@q10 fixed the tests by restoring some uint16_t branches to keep this PR small, I will clean up them in later PRs

The new struct types for fbgemm::float16/bfloat16 change symbol mangling of Float16ToFloat_avx2 etc., exposing a latent gap where the FbgemmFloat16Convert{,Avx2,Avx512}.cc and bf16 counterparts were never linked into fbgemm_gpu's fbgemm.so. CI now hits an undefined-symbol error at import time. Add the missing sources to the cmake source list.

Switch from CHECK_SOURCE_RUNS to CHECK_SOURCE_COMPILES so CXX_AVX*_FOUND reflects compiler capability rather than build-host execution capability. Runtime dispatch (fbgemmHasAvx512Support / isZmm) already gates execution on the target CPU, so the runs-based probe was over-strict and skipped the AVX-512 sources on builders without AVX-512+BF16 hardware, leaving fbgemm_gpu's fbgemm.so with an unresolved FloatToFloat16_avx512 at load time.

meta-cla Bot added the cla signed label May 5, 2026

cyyever force-pushed the bf16-distinct-types branch 2 times, most recently from e706f4c to 620fb4a Compare May 5, 2026 12:39

cyyever force-pushed the bf16-distinct-types branch from 620fb4a to 2b887bf Compare May 5, 2026 23:30

cyyever force-pushed the bf16-distinct-types branch from 2b887bf to 5b37adc Compare May 6, 2026 03:57

cyyever force-pushed the bf16-distinct-types branch from 5b37adc to 947754b Compare May 6, 2026 08:33

cyyever force-pushed the bf16-distinct-types branch 2 times, most recently from 00b67be to f8664a3 Compare May 30, 2026 06:25

cyyever added 4 commits May 30, 2026 14:26

Make float16/bfloat16 distinct types

3d80b70

Treat uint16_t as fp16 in EmbeddingSpMDMRowWiseSparse_{ref,autovec}

31dae49

cyyever force-pushed the bf16-distinct-types branch from f8664a3 to 41a247e Compare May 30, 2026 06:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make float16/bfloat16 distinct types#5736

Make float16/bfloat16 distinct types#5736
cyyever wants to merge 4 commits into
pytorch:mainfrom
cyyever:bf16-distinct-types

cyyever commented May 5, 2026 •

edited

Loading

Uh oh!

meta-codesync Bot commented May 5, 2026

Uh oh!

q10 commented May 5, 2026

Uh oh!

cyyever commented May 5, 2026

Uh oh!

cyyever commented May 6, 2026

Uh oh!

q10 commented May 6, 2026

Uh oh!

cyyever commented May 6, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cyyever commented May 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

meta-codesync Bot commented May 5, 2026

Uh oh!

q10 commented May 5, 2026

Uh oh!

cyyever commented May 5, 2026

Uh oh!

cyyever commented May 6, 2026

Uh oh!

q10 commented May 6, 2026

Uh oh!

cyyever commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cyyever commented May 5, 2026 •

edited

Loading

cyyever commented May 6, 2026 •

edited

Loading