Add CUDA Examples/Improve CUDA robustness by SousaTrashBin · Pull Request #202 · alcides/aeon

SousaTrashBin · 2026-04-27T23:39:24Z

the only example that is not working properly is histogram.ae, I'm pretty sure it's cause I have a kernel inside another kernel. LLVM/CPU was not tested and needs to be verified posteriorly.

…r not

… normal/core ast and gpu ast (not sure if this is the smartest way of doing this)

… before finishing this)

…e GPU AST and llvm instructions

…hings are more isolated and less likely to break

…and LLVM IR generation

…r not

… normal/core ast and gpu ast (not sure if this is the smartest way of doing this)

… before finishing this)

…e GPU AST and llvm instructions

…nagement on vectors

…CD pipeline

…rations (still didn't test if this works on gpu)

…werer When _extract_call_info returns prev_args from a partial application, eff_ty only contains the remaining parameter types. Using eff_ty with offset=len(prev_args) caused incorrect type expectations and premature full-application detection, dropping trailing arguments. This fix uses target.type (the full function type) when prev_args exist, matching what _lower_builtin_call already does. Fixes histogram.ae producing all-zero word counts due to the wordcount call being silently dropped during lowering. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…nodes - Remove stale gpu/llvm imports from decorators/__init__.py - Add missing LLVMVectorGet, LLVMVectorSet, LLVMVectorSize classes to llvm_ast.py - Add find_calls() method to LLVMTerm and all compound term subclasses - Fix target_machine attribute error in CPULLVMExecutionEngine - Update old import "X.ae" syntax to open X in all LLVM examples Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

@cpu

The e2e tests use @llvm which was renamed to @cpu but never aliased. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

The LLVM test examples use Vector_size which is handled natively by the LLVM backend but was missing from the Vector library, causing KeyError when the evaluator falls back. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…g LLVMVectorSize alias - Remove leftover `print(f"DEBUG: ...")` in lowerer.py - Upgrade silent fallback/disable log messages from debug to warning level - Add `LLVMVectorSize = Any` to runtime else-branch in core.py for consistency Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

CPULLVMPipeline was never imported anywhere. MultiBackendPipeline in aeon/llvm/pipeline.py handles CPU-only as its default and is the one used by the driver. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

alcides

A bit of code cleaning would make this good to be merged.

alcides · 2026-05-01T15:27:19Z

        if ir_hash not in self._module_cache:
-            ptx = self._compile_to_ptx(llvm_ir)
-            self._module_cache[ir_hash] = self._load_module(ptx)
+            with open("debug.ll", "w") as f:


Isto pode ir para uma função auxiliar e estar atrás de uma flag?

só a parte de escrever para um ficheiro? ou toda a parte de compilação do ptx?

Escrever para ficheiro.

resolvido no commit bf44b74

alcides · 2026-05-01T15:28:22Z

-        finally:
-            for d_ptr in device_ptrs:
-                self.libcuda.cuMemFree_v2(d_ptr)
+        if isinstance(ret_type, LLVMPointerType) or "Vector" in str(ret_type):


Em vez de Vector in str(...), devíamos ter uma resolução de nomes para colocar tudo qualified, e comparar com o qualified name.

qualified ou neste caso unqualified? assumo que a parte do "qualified" seja à esquerda do '.' e à direita seja a parte unqualified, adiciono dois fields aos LLVMTypes, um para a qualified e outro para a unqualified? (e vejo se qualified == "Vector" e unqualified == "Vector")

resolvido nos commits bf44b74 1001175 e 05473ec (agora o Vector é um tipo built-in e a conversão é feita de forma correta sem "work-arounds" pela função que transforma um tipo nativo para o de LLVM)

…er by extracting multiple implementation steps to simpler individual functions

…ilt-in)

SousaTrashBin requested a review from alcides April 27, 2026 23:39

alcides force-pushed the add_some_cuda_examples branch from 152e24b to 54a8846 Compare April 29, 2026 13:45

SousaTrashBin added 28 commits April 30, 2026 09:54

add gpu decorator (@gpu)

53595cb

add some gpu helper functions such as Tensor_length

f5a6687

add some gpu tests (for now uses CPU but should work nonetheless)

2e9a1b4

remove gpu tests, still need to verify if syntax will stay the same o…

637fee7

…r not

add gpu subset code validation (including recursion checking)

21d0c43

add verification to types and ops (for now only supports builtins)

4dd1947

add gpu subset ast representation as well as a way to convert between…

6131a23

… normal/core ast and gpu ast (not sure if this is the smartest way of doing this)

not sure why this is needed (reminder to ask)

36a6560

add some gpu subset testing (still need to study a bit of llvm syntax…

10cd390

… before finishing this)

eventually this will be the file that hosts the conversion between th…

4a06aad

…e GPU AST and llvm instructions

still need to add the generation of the kernel itself

14996ea

fix, was using a raw str but should instead use the Name dataclass

842013e

forgot to setup auto ruff

8cfe97a

add llvm decorator

8d0fd05

remove unused files

d647c6d

add Vector library, technically we could use List, but in this way, t…

5fa3671

…hings are more isolated and less likely to break

add CPULLVMPipeline implementation, integrate function compilation …

17d6bb1

…and LLVM IR generation

add gpu decorator (@gpu)

72268f1

add some gpu helper functions such as Tensor_length

f5298c5

add some gpu tests (for now uses CPU but should work nonetheless)

76b04dc

remove gpu tests, still need to verify if syntax will stay the same o…

67311bc

…r not

add gpu subset code validation (including recursion checking)

1b79599

add verification to types and ops (for now only supports builtins)

4035bd8

add gpu subset ast representation as well as a way to convert between…

c76d6dd

… normal/core ast and gpu ast (not sure if this is the smartest way of doing this)

not sure why this is needed (reminder to ask)

2a6891c

add some gpu subset testing (still need to study a bit of llvm syntax…

50a6d60

… before finishing this)

eventually this will be the file that hosts the conversion between th…

27a9265

…e GPU AST and llvm instructions

still need to add the generation of the kernel itself

82adb86

SousaTrashBin and others added 14 commits April 30, 2026 09:54

fix(cuda): specify speed level

7bf013b

feat(cpu): add Vector_size implementation, add header-based size ma…

ce6a70e

…nagement on vectors

feat(llvm): add some CPU and GPU simple tests that execute on the CI/…

c19ece6

…CD pipeline

fix: change func name

1c1a89e

fix: remove ';' from imports

65a24d8

feat(llvm): improve backend execution robustness and error logging

5859259

feat(llvm): fix infinite recursion fallback for CPU/GPU execution

b74e4bb

fix(cpu): prevent redundant block creation in function conversion

c2d3fe1

fix(llvm): some fixes according to ruff and mypy

dc71683

feat(cpu): add some cpu test examples

cd2749e

feat(llvm): add LLVMVector class to handle pointer-based vector ope…

49166f2

…rations (still didn't test if this works on gpu)

feat(llvm): add back the gpu busy example

94fd754

alcides force-pushed the add_some_cuda_examples branch from 6345117 to 2dbc106 Compare April 30, 2026 08:54

alcides and others added 5 commits April 30, 2026 12:44

fix: register @llvm as backward-compatible alias for @cpu decorator

75fc079

The e2e tests use @llvm which was renamed to @cpu but never aliased. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat(llvm): squash cuda/cpu backend development commits

6e59205

chore: remove dead CPULLVMPipeline (superseded by MultiBackendPipeline)

bfffb3b

CPULLVMPipeline was never imported anywhere. MultiBackendPipeline in aeon/llvm/pipeline.py handles CPU-only as its default and is the one used by the driver. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

alcides requested changes May 1, 2026

View reviewed changes

SousaTrashBin added 5 commits May 1, 2026 17:59

refactor(llvm): clean up function names, make invoke function clean…

dcf69f3

…er by extracting multiple implementation steps to simpler individual functions

feat(llvm): add debug LLVM IR/PTX file writing

bf44b74

feat(core): add Vector type support and remove GPU/Tensor-related code

0474b2e

refactor(core): remove unused Vector type definition (now it's a bu…

1001175

…ilt-in)

feat(llvm): improve type validation and resolution

05473ec

SousaTrashBin requested a review from alcides May 6, 2026 10:46

SousaTrashBin added 3 commits May 13, 2026 00:00

feat(llvm): add raw vector runtime helpers

6cbcd30

refactor(llvm): simplify backend metadata

9b3abef

feat(llvm): add planned CUDA vector execution

5cac56e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add CUDA Examples/Improve CUDA robustness#202

Add CUDA Examples/Improve CUDA robustness#202
SousaTrashBin wants to merge 104 commits into
masterfrom
add_some_cuda_examples

SousaTrashBin commented Apr 27, 2026

Uh oh!

alcides left a comment

Uh oh!

alcides May 1, 2026

Uh oh!

SousaTrashBin May 1, 2026 •

edited

Loading

Uh oh!

alcides May 4, 2026

Uh oh!

SousaTrashBin May 6, 2026

Uh oh!

alcides May 1, 2026

Uh oh!

SousaTrashBin May 1, 2026

Uh oh!

SousaTrashBin May 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

SousaTrashBin commented Apr 27, 2026

Uh oh!

alcides left a comment

Choose a reason for hiding this comment

Uh oh!

alcides May 1, 2026

Choose a reason for hiding this comment

Uh oh!

SousaTrashBin May 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alcides May 4, 2026

Choose a reason for hiding this comment

Uh oh!

SousaTrashBin May 6, 2026

Choose a reason for hiding this comment

Uh oh!

alcides May 1, 2026

Choose a reason for hiding this comment

Uh oh!

SousaTrashBin May 1, 2026

Choose a reason for hiding this comment

Uh oh!

SousaTrashBin May 6, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SousaTrashBin May 1, 2026 •

edited

Loading