Commit 12a8a1c

[Optimization] optimize tritonmoe_preprocess op (#7687)
* opt moe_align_kernel
1 parent 8790cec commit 12a8a1c

5 files changed

Lines changed: 1035 additions & 130 deletions

custom_ops/gpu_ops/helper.h

Lines changed: 2 additions & 0 deletions
@@ -75,6 +75,8 @@ namespace cub = hipcub;
 using json = nlohmann::json;
 #endif
 
+#define CEILDIV(a, b) (((a + b - 1) / b))
+
 #define CUDA_CHECK(call) \
 do { \
 const cudaError_t error_code = call; \
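The `CEILDIV` macro added in this hunk rounds an integer division up, which is typically how a launch grid is sized to cover `n` elements with fixed-size blocks. The sketch below copies the macro verbatim and shows one behavior worth knowing: its arguments are not individually parenthesized, so passing an expression such as `2 + 2` can expand differently than intended. `CEILDIV_SAFE` and `num_blocks` are hypothetical names introduced here for illustration only; they are not part of the commit. The sketch also assumes positive integer operands whose sum does not overflow.

```cpp
#include <cassert>
#include <cstdint>

// Macro as added in the hunk above (copied verbatim).
#define CEILDIV(a, b) (((a + b - 1) / b))

// Hypothetical hardened variant (NOT part of the commit): every argument is
// parenthesized so that expression arguments expand as a single operand.
#define CEILDIV_SAFE(a, b) ((((a) + (b) - 1) / (b)))

// Hypothetical helper (NOT part of the commit): number of blocks needed to
// cover `n` elements with `block` elements per block. Assumes n > 0, block > 0.
constexpr std::int64_t num_blocks(std::int64_t n, std::int64_t block) {
  return CEILDIV(n, block);
}

// With plain variables or literals the macro behaves as a ceiling division.
static_assert(num_blocks(1000, 256) == 4, "1000 elements need 4 blocks of 256");
static_assert(num_blocks(1024, 256) == 4, "an exact multiple needs no extra block");
static_assert(num_blocks(1025, 256) == 5, "one leftover element needs one more block");

// With an expression argument the two variants differ:
//   CEILDIV(10, 2 + 2)      expands to ((10 + 2 + 2 - 1) / 2 + 2)
//   CEILDIV_SAFE(10, 2 + 2) expands to (((10) + (2 + 2) - 1) / (2 + 2))
static_assert(CEILDIV(10, 2 + 2) != CEILDIV_SAFE(10, 2 + 2),
              "unparenthesized arguments change the result");
```

Call sites that pass only plain variables or literals are unaffected by the parenthesization issue. It matters only when an argument is itself an expression containing a lower-precedence operator.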
