feat: add high-level DispatchFunc() interface for multi-type and mixed dispatch (#29)
* feat: add a convenient interface for any `int64_t`-convertible types and use `DispatchFunc()` to dispatch `DataType` and block sizes with a single call.
- add a convenience interface for any `int64_t`-convertible type, used mainly for multi-type dispatch and mixed dispatch
- use `DispatchFunc()` to dispatch `DataType` and block sizes with a single function call in the implementations of various kernels
- remove the `CUDA_BLOCK_SIZE_XXX` macros and use numeric literals instead
* style: fix the styling issue by adding a period to the TODO comment
* fix: fix rebase error
* style: fix the styling issues for comments in `dispatcher.h` and `cuda/causal_softmax/kernel.h`
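
For illustration only, the single-call dispatch of a `DataType` and a block size described above could look roughly like the C++17 sketch below. It is not the code from `dispatcher.h`: the names `DispatchFuncSketch`, `DispatchOne`, `ValueList`, and `RunSketch`, the two-member `DataType` enum, and the candidate block sizes are all assumptions made for this example, and the real `DispatchFunc()` signature may differ.

```cpp
#include <cstdint>
#include <stdexcept>
#include <type_traits>

// Hypothetical stand-in for the project's `DataType` enum.
enum class DataType : std::int64_t { kFloat32 = 0, kFloat64 = 1 };

// Compile-time list of candidate values, all stored as `int64_t`.
template <std::int64_t... kValues>
struct ValueList {};

// Matches a runtime value against the candidates and calls `f` with the
// matching value wrapped in `std::integral_constant`. Throws if none match.
template <std::int64_t... kValues, typename Functor>
void DispatchOne(std::int64_t value, ValueList<kValues...>, Functor&& f) {
  const bool matched =
      ((value == kValues
            ? (f(std::integral_constant<std::int64_t, kValues>{}), true)
            : false) ||
       ...);
  if (!matched) {
    throw std::invalid_argument("unsupported value in dispatch");
  }
}

// Mixed dispatch: resolves two runtime values (of any `int64_t`-convertible
// type, including scoped enums) to compile-time constants in a single call.
template <typename A, typename B, std::int64_t... kAs, std::int64_t... kBs,
          typename Functor>
void DispatchFuncSketch(A a, ValueList<kAs...> as, B b, ValueList<kBs...> bs,
                        Functor&& f) {
  DispatchOne(static_cast<std::int64_t>(a), as, [&](auto a_tag) {
    DispatchOne(static_cast<std::int64_t>(b), bs,
                [&](auto b_tag) { f(a_tag, b_tag); });
  });
}

// Hypothetical call site: picks an element type and a block size together.
// The return value only encodes which instantiation was selected.
std::int64_t RunSketch(DataType dtype, int block_size) {
  std::int64_t result = 0;
  DispatchFuncSketch(
      dtype,
      ValueList<static_cast<std::int64_t>(DataType::kFloat32),
                static_cast<std::int64_t>(DataType::kFloat64)>{},
      block_size, ValueList<128, 256, 512>{},
      [&](auto dtype_tag, auto block_tag) {
        constexpr auto kDtype =
            static_cast<DataType>(decltype(dtype_tag)::value);
        constexpr std::int64_t kBlockSize = decltype(block_tag)::value;
        using T =
            std::conditional_t<kDtype == DataType::kFloat32, float, double>;
        // A real kernel launch templated on T and kBlockSize would go here.
        result = static_cast<std::int64_t>(sizeof(T)) * 1000 + kBlockSize;
      });
  return result;
}
```

In this sketch the block sizes appear as plain numbers in the candidate list, in the spirit of replacing the `CUDA_BLOCK_SIZE_XXX` macros, and an unsupported combination fails with an exception rather than silently selecting a default.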