Skip to content

Commit 799a8ab

Browse files
committed
Update torch dependency to 2.10
- update docs Signed-off-by: Jay Gu <jagu@nvidia.com>
1 parent a13e4e2 commit 799a8ab

5 files changed

Lines changed: 16 additions & 4 deletions

File tree

docs/source/index.rst

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -54,3 +54,4 @@ Proceed to the :ref:`quickstart` page for installation instructions and a comple
5454
performance
5555
operations
5656
debugging
57+
known_issues

docs/source/known_issues.rst

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
.. SPDX-FileCopyrightText: Copyright (c) <2025> NVIDIA CORPORATION & AFFILIATES. All rights reserved.
2+
..
3+
.. SPDX-License-Identifier: Apache-2.0
4+
5+
6+
Known Issues
7+
============
8+
9+
1. FP8 Torch Tensor requires `torch>=2.10`. Older version of PyTorch does not support converting fp8
10+
datatype through `dlpack` protocol and will `leak memory <https://github.com/pytorch/pytorch/issues/171820>`__
11+
when conversion to dlpack tensor fails.

docs/source/quickstart.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -116,5 +116,5 @@ Developer Tools
116116
This profile can then be loaded in a graphical instance of Nsight Compute and the kernel ``vector_add`` selected to see statistics about the kernel.
117117

118118
.. note::
119-
Capturing detailed statistics for cuTile Python kernels requires running on NVIDIA Driver r590 or later.
119+
Capturing detailed statistics for cuTile Python kernels requires running on NVIDIA Driver equals or later than r580.126.09 (linux) or r582.16 (windows).
120120

test/requirements.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,4 +15,4 @@ numba-cuda[cu13]==0.20.0
1515
cupy-cuda13x==13.6.0
1616

1717
--extra-index-url https://download.pytorch.org/whl/cu130
18-
torch==2.9.0
18+
torch==2.10.0

test/test_mma.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -121,7 +121,7 @@ def test_mma_fp8(tile_size, case):
121121
scale = torch.tensor([1.0], dtype=torch.float32, device="cuda")
122122
try:
123123
ref = torch._scaled_mm(A, B.T, scale, scale, out_dtype=C.dtype) + C
124-
except RuntimeError as e:
124+
except (RuntimeError, ValueError) as e:
125125
assert 'Multiplication of two Float8_e5m2 matrices is not supported' in str(e)
126126
ref = None
127127
ct.launch(torch.cuda.current_stream(), (1,), mma_kernel,
@@ -270,7 +270,7 @@ def test_matmul_fp8(tile_size, dtype):
270270
scale = torch.tensor([1.0], dtype=torch.float32, device="cuda")
271271
try:
272272
ref = torch._scaled_mm(A, B.T, scale, scale, out_dtype=torch.float16).to(dtype)
273-
except RuntimeError as e:
273+
except (RuntimeError, ValueError) as e:
274274
assert 'Multiplication of two Float8_e5m2 matrices is not supported' in str(e)
275275
ref = None
276276
ct.launch(torch.cuda.current_stream(), (1,), matmul_kernel,

0 commit comments

Comments
 (0)