GSoC 2026 — Unified Host/Device CIR for CUDA & HIP

This is a development fork of llvm/llvm-project hosting the GSoC 2026 project "Combine/Split CIR for CUDA & HIP offloading".

Mentors: Konstantinos Parasyris, Joseph Huber
Contributor: David Rivera (@RiverDave)

Background

LLVM's offload pipeline keeps host and device code separate until it's too late for cross-boundary optimization. CIR Combine fixes this with a merge-optimize-split stage: both modules are lowered to CIR, merged into a single heterogeneous translation unit, optimized together (constant propagation across launch sites, dead kernel elimination, launch dimensionality inference), then split back into their respective backend pipelines.

The intended flow looks like (We depict tool invocations in this context):

flowchart LR
    SRC[".cu / .cpp"]

    PRE_H["cc1 (host)<br/>emit pre-lowering CIR"]
    PRE_70["cc1 (sm_70)<br/>emit pre-lowering CIR"]
    PRE_90["cc1 (sm_90)<br/>emit pre-lowering CIR"]

    COMBINE["cir-combine-bundler<br/>--combine"]
    BUNDLE[("combined.cir<br/>cir.offload.container")]
    UNBUNDLE["cir-combine-bundler<br/>--unbundle"]

    POST_H["cc1 (host)<br/>post-lowering"]
    POST_70["cc1 (sm_70)<br/>post-lowering"]
    POST_90["cc1 (sm_90)<br/>post-lowering"]

    OBJ["host.o"]
    F70["fatbin_sm_70"]
    F90["fatbin_sm_90"]

    SRC --> PRE_H & PRE_70 & PRE_90
    PRE_H & PRE_70 & PRE_90 --> COMBINE --> BUNDLE --> UNBUNDLE
    UNBUNDLE --> POST_H --> OBJ
    UNBUNDLE --> POST_70 --> F70
    UNBUNDLE --> POST_90 --> F90

    classDef action fill:#bbdefb,stroke:#1565c0,color:#000;
    classDef artifact fill:#d1c4e9,stroke:#512da8,color:#000;
    classDef tool fill:#fff9c4,stroke:#f57f17,color:#000;
    class PRE_H,PRE_70,PRE_90,POST_H,POST_70,POST_90 action;
    class BUNDLE,OBJ,F70,F90 artifact;
    class COMBINE,UNBUNDLE tool;

Status

Updated 2026-05-23. Bootstrapping. RFC draft depicting intended driver semantics in-progress bundler tool and new driver actions not yet committed.

Name		Name	Last commit message	Last commit date
Latest commit History 581,642 Commits
.ci		.ci
.github		.github
bolt		bolt
clang-tools-extra		clang-tools-extra
clang		clang
cmake		cmake
compiler-rt		compiler-rt
cross-project-tests		cross-project-tests
flang-rt		flang-rt
flang		flang
libc		libc
libclc		libclc
libcxx		libcxx
libcxxabi		libcxxabi
libsycl		libsycl
libunwind		libunwind
lld		lld
lldb		lldb
llvm-libgcc		llvm-libgcc
llvm		llvm
mlir		mlir
offload		offload
openmp		openmp
orc-rt		orc-rt
polly		polly
runtimes		runtimes
third-party		third-party
utils/bazel		utils/bazel
.clang-format		.clang-format
.clang-format-ignore		.clang-format-ignore
.clang-tidy		.clang-tidy
.git-blame-ignore-revs		.git-blame-ignore-revs
.gitattributes		.gitattributes
.gitignore		.gitignore
.mailmap		.mailmap
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.TXT		LICENSE.TXT
README.md		README.md
SECURITY.md		SECURITY.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GSoC 2026 — Unified Host/Device CIR for CUDA & HIP

Background

Status

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GSoC 2026 — Unified Host/Device CIR for CUDA & HIP

Background

Status

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages