Skip to content

Commit 9746ca5

Browse files
committed
Add 1.2.0 relesae notes
Signed-off-by: Jay Gu <jagu@nvidia.com>
1 parent 8ce0189 commit 9746ca5

17 files changed

+46
-80
lines changed

CHANGELOG.md

Lines changed: 46 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -3,6 +3,52 @@
33

44
Release Notes
55
=============
6+
1.2.0 (2026-03-05)
7+
------------------
8+
### CTK 13.2 features
9+
- Support Ampere and Ada (sm80 family) GPUs.
10+
- Support `pip install cuda-tile[tileiras]` to use `tileiras` from Python environment
11+
without system-wide CTK installation.
12+
- Add `ct.atan2(y, x)` operation for computing the arctangent of y/x.
13+
- Add optional `rounding_mode` parameter for `ct.tanh()`, supporting `RoundingMode.FULL` and
14+
`RoundingMode.APPROX`.
15+
- Compiling FP8 operations for sm80 family GPUs will raise `TileUnsupportedFeatureError`.
16+
- Setting `opt_level=0` on `ct.kernel` is no longer required for `ct.printf()` and `ct.print()`.
17+
18+
19+
### Features
20+
- Add `ct.static_iter` keyword that enables compile-time `for` loops.
21+
- Add `ct.static_assert` keyword that can be used to assert that a condition is true at compile time.
22+
- Add `ct.static_eval` keyword that enables compile-time evaluation using the host Python interpreter.
23+
- Add `ct.scan()` for custom scan.
24+
- Add `ct.isnan()`.
25+
- Add `print()` and `ct.print()` that supports python-style print and f-strings.
26+
- Add optional `mask` parameter to `ct.gather()` and `ct.scatter()` for custom boolean masking.
27+
- Operator `+` can now be used to concatenate tuples.
28+
- Support unpacking nested tuples (e.g., `a, (b, c) = t`) and using square brackets
29+
for unpacking (e.g., `[a, b] = 1, 2`).
30+
- Add bytecode-to-cubin disk cache to avoid recompilation of unchanged kernels.
31+
Controlled by `CUDA_TILE_CACHE_DIR` and `CUDA_TILE_CACHE_SIZE`.
32+
33+
### Bug Fixes
34+
- Fix a bug where `nan != nan` returns False.
35+
- Fix "potentially undefined variable `$retval`" error when a helper function
36+
returns after a `while` loop that contains no early return.
37+
- Fix the missing column indicator in error messages when the underlined text is only one
38+
character wide.
39+
- Add a missing check for unpacking a tuple with too many values. For example, `a, b = 1, 2, 3`
40+
now raises an error instead of silently discarding the extra value.
41+
- Fix a bug where the promoted dtype of uint16 and uint64 was incorrectly set to uint32.
42+
43+
44+
### Enhancements
45+
- Erase the distinction between scalars and zero-dimensional tiles.
46+
They are now completely interchangeable.
47+
- `~x` for const boolean `x` will raise a TypeError to prevent inconsistent
48+
results compared to `~x` on a boolean Tile.
49+
- Add `TileUnsupportedFeatureError` to the public API.
50+
51+
652
1.1.0 (2026-01-30)
753
------------------
854
### Features

changelog.d/ct-print.md

Lines changed: 0 additions & 4 deletions
This file was deleted.

changelog.d/custom-scan.md

Lines changed: 0 additions & 4 deletions
This file was deleted.

changelog.d/disk-cache.md

Lines changed: 0 additions & 5 deletions
This file was deleted.

changelog.d/fix-dtype-promotion.md

Lines changed: 0 additions & 1 deletion
This file was deleted.

changelog.d/gather-scatter-mask.md

Lines changed: 0 additions & 4 deletions
This file was deleted.

changelog.d/improve-nan-support.md

Lines changed: 0 additions & 7 deletions
This file was deleted.

changelog.d/pip-install-with-tileiras.md

Lines changed: 0 additions & 4 deletions
This file was deleted.

changelog.d/raise-fp8-unsupported-error-for-sm80.md

Lines changed: 0 additions & 5 deletions
This file was deleted.

changelog.d/remove-opt-level-print-note.md

Lines changed: 0 additions & 4 deletions
This file was deleted.

0 commit comments

Comments
 (0)