|
3 | 3 |
|
4 | 4 | Release Notes |
5 | 5 | ============= |
| 6 | +1.2.0 (2026-03-05) |
| 7 | +------------------ |
| 8 | +### CTK 13.2 features |
| 9 | +- Support Ampere and Ada (sm80 family) GPUs. |
| 10 | +- Support `pip install cuda-tile[tileiras]` to use `tileiras` from Python environment |
| 11 | + without system-wide CTK installation. |
| 12 | +- Add `ct.atan2(y, x)` operation for computing the arctangent of y/x. |
| 13 | +- Add optional `rounding_mode` parameter for `ct.tanh()`, supporting `RoundingMode.FULL` and |
| 14 | + `RoundingMode.APPROX`. |
| 15 | +- Compiling FP8 operations for sm80 family GPUs will raise `TileUnsupportedFeatureError`. |
| 16 | +- Setting `opt_level=0` on `ct.kernel` is no longer required for `ct.printf()` and `ct.print()`. |
| 17 | + |
| 18 | + |
| 19 | +### Features |
| 20 | +- Add `ct.static_iter` keyword that enables compile-time `for` loops. |
| 21 | +- Add `ct.static_assert` keyword that can be used to assert that a condition is true at compile time. |
| 22 | +- Add `ct.static_eval` keyword that enables compile-time evaluation using the host Python interpreter. |
| 23 | +- Add `ct.scan()` for custom scan. |
| 24 | +- Add `ct.isnan()`. |
| 25 | +- Add `print()` and `ct.print()` that supports python-style print and f-strings. |
| 26 | +- Add optional `mask` parameter to `ct.gather()` and `ct.scatter()` for custom boolean masking. |
| 27 | +- Operator `+` can now be used to concatenate tuples. |
| 28 | +- Support unpacking nested tuples (e.g., `a, (b, c) = t`) and using square brackets |
| 29 | + for unpacking (e.g., `[a, b] = 1, 2`). |
| 30 | +- Add bytecode-to-cubin disk cache to avoid recompilation of unchanged kernels. |
| 31 | + Controlled by `CUDA_TILE_CACHE_DIR` and `CUDA_TILE_CACHE_SIZE`. |
| 32 | + |
| 33 | +### Bug Fixes |
| 34 | +- Fix a bug where `nan != nan` returns False. |
| 35 | +- Fix "potentially undefined variable `$retval`" error when a helper function |
| 36 | + returns after a `while` loop that contains no early return. |
| 37 | +- Fix the missing column indicator in error messages when the underlined text is only one |
| 38 | + character wide. |
| 39 | +- Add a missing check for unpacking a tuple with too many values. For example, `a, b = 1, 2, 3` |
| 40 | + now raises an error instead of silently discarding the extra value. |
| 41 | +- Fix a bug where the promoted dtype of uint16 and uint64 was incorrectly set to uint32. |
| 42 | + |
| 43 | + |
| 44 | +### Enhancements |
| 45 | +- Erase the distinction between scalars and zero-dimensional tiles. |
| 46 | + They are now completely interchangeable. |
| 47 | +- `~x` for const boolean `x` will raise a TypeError to prevent inconsistent |
| 48 | + results compared to `~x` on a boolean Tile. |
| 49 | +- Add `TileUnsupportedFeatureError` to the public API. |
| 50 | + |
| 51 | + |
6 | 52 | 1.1.0 (2026-01-30) |
7 | 53 | ------------------ |
8 | 54 | ### Features |
|
0 commit comments