[Contrib] Fix CUDA contrib build after FFI/header cleanups #19539
Conversation
Code Review
This pull request adds logging headers to several CUTLASS and vLLM source files and refactors the vLLM kernel registrations to use the Tensor type. The reviewer identified that the unqualified Tensor type in attention_kernels.cu is ambiguous and will likely cause compilation errors, because tvm::runtime::Tensor lacks the GetDLTensorPtr() method. The recommendation is to use ffi::Tensor explicitly to resolve the ambiguity.
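A minimal sketch of the reviewer's suggested fix; the wrapper name and signature are illustrative, and the header path is an assumption:

```cpp
#include <tvm/ffi/container/tensor.h>  // assumed header for tvm::ffi::Tensor

namespace ffi = tvm::ffi;

// Hypothetical helper: qualifying the parameter as ffi::Tensor keeps the
// name from resolving to tvm::runtime::Tensor, which has no GetDLTensorPtr().
const DLTensor* QueryPtr(const ffi::Tensor& query) {
  return query.GetDLTensorPtr();
}
```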
Code Review
This pull request updates several CUDA contrib modules (CUTLASS, Thrust, and vLLM) by adding the tvm/runtime/logging.h header and migrating the vLLM kernels to the new FFI Tensor and ffi::Array types. The changes update function signatures and registration logic to handle the transition from raw DLTensor pointers. Feedback suggests investigating whether the Tensor class provides a more direct way to obtain a mutable DLTensor pointer, which would avoid the verbose const_cast currently used in the attention kernels.
Six CUDA sources in src/runtime/contrib used LOG(FATAL) via transitive includes that #19483 trimmed; add the explicit <tvm/runtime/logging.h> include to thrust.cu, attention_kernels.cu, and the four cutlass kernel headers (fp16/fp8 sm90/sm100, gemm_runner, fp8_groupwise_scaled_gemm).
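For reference, a minimal sketch of the include fix as it would look at the top of thrust.cu; the guard function is illustrative:

```cpp
// LOG(FATAL) previously compiled only because some other TVM header pulled
// in logging.h transitively; after #19483 the include must be explicit.
#include <tvm/runtime/logging.h>

// Illustrative use: the macro aborts with a message on unsupported input.
void CheckDType(bool supported) {
  if (!supported) {
    LOG(FATAL) << "thrust: unsupported data type";
  }
}
```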
cache_kernels.cu used the bare Array{...} alias that #19483 removed; switch to ffi::Array<Tensor>{...}.
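A sketch of that change; the function and variable names are illustrative, and the header paths are assumptions:

```cpp
#include <tvm/ffi/container/array.h>   // assumed header for ffi::Array
#include <tvm/ffi/container/tensor.h>  // assumed header for ffi::Tensor

namespace ffi = tvm::ffi;
using ffi::Tensor;

// Before #19483 a bare `Array{key, value}` alias compiled; now the FFI
// container and its element type must be spelled out.
ffi::Array<Tensor> MakeCachePair(Tensor key, Tensor value) {
  return ffi::Array<Tensor>{key, value};
}
```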
attention_kernels.cu registered FFI functions whose parameters were raw DLTensor*; the new reflection registry requires TypeSchema, so wrap both TVM_FFI_STATIC_INIT_BLOCK registrations to take Tensor and forward to the unchanged launchers via GetDLTensorPtr() (with const_cast for the output tensors, matching the mt_random_engine / cudnn pattern).
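A minimal sketch of that wrapping pattern, assuming a `reflection::GlobalDef().def(...)` registration style; the global name, launcher signature, and parameter list are illustrative:

```cpp
#include <tvm/ffi/container/tensor.h>     // assumed header for ffi::Tensor
#include <tvm/ffi/reflection/registry.h>  // assumed header for the registry

namespace ffi = tvm::ffi;

// Hypothetical launcher kept unchanged: raw DLTensor* parameters,
// mutable for the output, const for the input.
void AttentionLauncher(DLTensor* out, const DLTensor* query);

TVM_FFI_STATIC_INIT_BLOCK({
  namespace refl = tvm::ffi::reflection;
  refl::GlobalDef().def(
      "tvm.contrib.vllm.attention_example",  // illustrative global name
      [](ffi::Tensor out, ffi::Tensor query) {
        // GetDLTensorPtr() is assumed to return const DLTensor*; the output
        // needs a mutable pointer, hence the const_cast (the same pattern
        // the description cites from mt_random_engine / cudnn).
        AttentionLauncher(const_cast<DLTensor*>(out.GetDLTensorPtr()),
                          query.GetDLTensorPtr());
      });
});
```

Registering a typed lambda like this lets the reflection registry derive the TypeSchema from the signature, which the raw DLTensor* launchers could not provide.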