4 changes: 4 additions & 0 deletions cuda_bindings/DESCRIPTION.rst
Original file line number Diff line number Diff line change
@@ -5,6 +5,10 @@
cuda-bindings: Low-level CUDA interfaces
****************************************

.. image:: https://img.shields.io/badge/NVIDIA-black?logo=nvidia
   :target: https://www.nvidia.com/
   :alt: NVIDIA

`cuda.bindings <https://nvidia.github.io/cuda-python/cuda-bindings/>`_ is a standard set of low-level interfaces, providing full coverage of and 1:1 access to the CUDA host APIs from Python. Check out the `Overview <https://nvidia.github.io/cuda-python/cuda-bindings/latest/overview.html>`_ for the workflow and performance results.

* `Repository <https://github.com/NVIDIA/cuda-python/tree/main/cuda_bindings>`_
17 changes: 16 additions & 1 deletion cuda_bindings/docs/source/install.rst
@@ -10,7 +10,7 @@ Runtime Requirements
``cuda.bindings`` supports the same platforms as CUDA. Runtime dependencies are:

* Linux (x86-64, arm64) and Windows (x86-64)
* Python 3.9 - 3.13
* Python 3.9 - 3.14
* Driver: Linux (580.65.06 or later), Windows (580.88 or later)
* Optionally, NVRTC, nvJitLink, NVVM, and cuFile from CUDA Toolkit 13.x
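
The interpreter and platform requirements listed above can be checked before installing. The following is a minimal sketch using only the standard library; the helper name ``is_supported_environment`` is hypothetical and not part of ``cuda.bindings``, the architecture aliases are assumptions about what ``platform.machine()`` reports, and the driver version is not checked.

```python
import platform
import sys

# Supported Python range, taken from the requirements list above.
MIN_PYTHON = (3, 9)
MAX_PYTHON = (3, 14)

# (OS, architecture) pairs, taken from the requirements list above.
SUPPORTED_PLATFORMS = {
    ("Linux", "x86-64"),
    ("Linux", "arm64"),
    ("Windows", "x86-64"),
}

# Assumed spellings returned by platform.machine(), mapped to the names used above.
_ARCH_ALIASES = {
    "x86_64": "x86-64",
    "amd64": "x86-64",
    "aarch64": "arm64",
    "arm64": "arm64",
}


def is_supported_environment(python_version, system, machine):
    """Hypothetical helper: True if interpreter and platform are in the supported set."""
    arch = _ARCH_ALIASES.get(machine.lower())
    version_ok = MIN_PYTHON <= tuple(python_version[:2]) <= MAX_PYTHON
    return version_ok and (system, arch) in SUPPORTED_PLATFORMS


if __name__ == "__main__":
    ok = is_supported_environment(
        sys.version_info, platform.system(), platform.machine()
    )
    print("supported" if ok else "unsupported")
```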

@@ -20,6 +20,21 @@ Runtime Requirements

Starting from v12.8.0, ``cuda-python`` becomes a meta package which currently depends only on ``cuda-bindings``; in the future more sub-packages will be added to ``cuda-python``. In the instructions below, we still use ``cuda-python`` as an example to serve existing users, but everything is applicable to ``cuda-bindings`` as well.


Free-threading Build Support
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

As of cuda-bindings 13.0.2 and 12.9.3, **experimental** packages for the `free-threaded interpreter`_ are shipped.

1. Support for these builds is best effort, due to heavy use of `built-in
   modules that are known to be thread-unsafe`_, such as ``ctypes``.
2. For now, you are responsible for making sure that calls into ``cuda-bindings``
   libraries are thread-safe. This is subject to change.

.. _built-in modules that are known to be thread-unsafe: https://github.com/python/cpython/issues/116738
.. _free-threaded interpreter: https://docs.python.org/3/howto/free-threading-python.html
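
Before relying on the experimental packages, it can help to confirm which kind of interpreter is actually running. The sketch below uses only the standard library; the two function names are illustrative, not part of ``cuda.bindings``. It checks both the build flag and the runtime state because a free-threaded build can still run with the GIL re-enabled, for example after importing an extension module that does not declare free-threading support.

```python
import sys
import sysconfig


def is_free_threaded_build():
    """True if this interpreter was built with free-threading support."""
    # Py_GIL_DISABLED is 1 on free-threaded builds and 0 or None otherwise.
    return bool(sysconfig.get_config_var("Py_GIL_DISABLED"))


def is_gil_enabled():
    """True if the GIL is active in the running process."""
    # sys._is_gil_enabled() exists on Python 3.13+; older interpreters always have the GIL.
    check = getattr(sys, "_is_gil_enabled", None)
    return True if check is None else bool(check())


if __name__ == "__main__":
    print(f"free-threaded build: {is_free_threaded_build()}")
    print(f"GIL enabled: {is_gil_enabled()}")
```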


Installing from PyPI
--------------------

4 changes: 2 additions & 2 deletions cuda_bindings/docs/source/release.rst
@@ -7,10 +7,10 @@ Release Notes
.. toctree::
   :maxdepth: 3

   13.X.Y <release/13.X.Y-notes.rst>
   13.0.2 <release/13.0.2-notes.rst>
   13.0.1 <release/13.0.1-notes.rst>
   13.0.0 <release/13.0.0-notes.rst>
   12.9.X <release/12.9.X-notes.rst>
   12.9.3 <release/12.9.3-notes.rst>
   12.9.2 <release/12.9.2-notes.rst>
   12.9.1 <release/12.9.1-notes.rst>
   12.9.0 <release/12.9.0-notes.rst>
10 changes: 7 additions & 3 deletions cuda_bindings/docs/source/release/12.9.3-notes.rst
Review comment from the PR author:

Note: Anything cuda-toolkit related that is mentioned in the 13.0.2 release note is not added here, because the PR(s) were not backported and I am not sure I want to block on this.

@@ -3,20 +3,24 @@

.. module:: cuda.bindings

``cuda-bindings`` 12.9.X Release notes
``cuda-bindings`` 12.9.3 Release notes
======================================

Released on TBD
Released on Oct 9, 2025


Highlights
----------

* This is the last release that officially supports Python 3.9.
* Python 3.14 is supported.
* **Experimental** free-threaded builds for Python 3.13/3.14 are made available. Any bugs can be reported to `our GitHub repo <https://github.com/NVIDIA/cuda-python>`_. More details are available in our :ref:`support` docs.
* Automatic CUDA library path detection based on ``CUDA_HOME``, eliminating the need to manually set ``LIBRARY_PATH`` environment variables for installation.
* The Python overhead of calling functions in CUDA bindings in ``driver``, ``runtime`` and ``nvrtc`` has been reduced by approximately 30%.
* On Windows, the ``pywin32`` dependency has been removed. The necessary Windows API functions are now accessed directly.
* Updated the ``cuda.bindings.runtime`` module to statically link against the CUDA Runtime library from CUDA Toolkit 12.9.1.
* ``cyruntime.getLocalRuntimeVersion`` now uses pathfinder to find the CUDA runtime.
* Experimental free-threaded builds are available on PyPI. More details are available in our :ref:`support` docs.
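
To illustrate the idea behind ``CUDA_HOME``-based library path detection, here is a minimal sketch. It is not the project's build logic: the helper name ``candidate_library_dirs`` is hypothetical, and the subdirectory layout (``lib64``, ``lib``, ``lib/x64``) is an assumption about typical CUDA Toolkit installations.

```python
import os
from pathlib import Path


def candidate_library_dirs(environ=None):
    """Hypothetical helper: list existing library directories under CUDA_HOME."""
    env = os.environ if environ is None else environ
    cuda_home = env.get("CUDA_HOME")
    if not cuda_home:
        # Nothing to detect; the caller would have to fall back to manual configuration.
        return []
    root = Path(cuda_home)
    # Assumed layout: lib64 or lib on Linux, lib/x64 on Windows.
    subdirs = ("lib64", "lib", os.path.join("lib", "x64"))
    return [root / sub for sub in subdirs if (root / sub).is_dir()]


if __name__ == "__main__":
    for directory in candidate_library_dirs():
        print(directory)
```

Passing a mapping instead of reading ``os.environ`` directly keeps the helper easy to test without touching the real environment.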


Known issues
------------
8 changes: 5 additions & 3 deletions cuda_bindings/docs/source/release/13.0.2-notes.rst
@@ -3,23 +3,25 @@

.. module:: cuda.bindings

``cuda-bindings`` 13.X.Y Release notes
``cuda-bindings`` 13.0.2 Release notes
======================================

Released on TBD
Released on Oct 9, 2025


Highlights
----------

* This is the last release that officially supports Python 3.9.
* Python 3.14 is supported.
* **Experimental** free-threaded builds for Python 3.13/3.14 are made available. Any bugs can be reported to `our GitHub repo <https://github.com/NVIDIA/cuda-python>`_. More details are available in our :ref:`support` docs.
* Migrated wheel dependencies from individual NVIDIA packages to the ``cuda-toolkit`` metapackage for improved dependency resolution and version constraints.
* Automatic CUDA library path detection based on ``CUDA_HOME``, eliminating the need to manually set ``LIBRARY_PATH`` environment variables for installation.
* The ``[all]`` optional dependencies now use ``cuda-toolkit`` with appropriate extras instead of individual packages. The NVCC compiler is no longer automatically installed with ``pip install cuda-python[all]`` as it was previously included only to access the NVVM library, which now has its own dedicated wheel. Users who need the NVCC compiler should explicitly install it with ``pip install cuda-toolkit[nvcc]==X.Y`` with the appropriate version for their needs.
* The Python overhead of calling functions in CUDA bindings in ``driver``, ``runtime`` and ``nvrtc`` has been reduced by approximately 30%.
* On Windows, the ``pywin32`` dependency has been removed. The necessary Windows API functions are now accessed directly.
* Updated the ``cuda.bindings.runtime`` module to statically link against the CUDA Runtime library from CUDA Toolkit 13.0.1.
* ``cyruntime.getLocalRuntimeVersion`` now uses pathfinder to find the CUDA runtime.
* Experimental free-threaded builds are available on PyPI. More details are available in our :ref:`support` docs.


Bug fixes
4 changes: 4 additions & 0 deletions cuda_core/DESCRIPTION.rst
@@ -5,6 +5,10 @@
cuda-core: Pythonic access to CUDA core functionalities
*******************************************************

.. image:: https://img.shields.io/badge/NVIDIA-black?logo=nvidia
   :target: https://www.nvidia.com/
   :alt: NVIDIA

`cuda.core <https://nvidia.github.io/cuda-python/cuda-core/>`_ bridges Python's productivity with CUDA's performance through intuitive and Pythonic APIs. The mission is to provide users full access to all of the core CUDA features in Python, such as runtime control, compiler and linker.

* `Repository <https://github.com/NVIDIA/cuda-python/tree/main/cuda_core>`_
29 changes: 17 additions & 12 deletions cuda_core/docs/source/install.rst
@@ -26,7 +26,22 @@ dependencies are as follows:
.. [#f1] Including ``cuda-python``.


``cuda.core`` supports Python 3.9 - 3.13, on Linux (x86-64, arm64) and Windows (x86-64).
``cuda.core`` supports Python 3.9 - 3.14, on Linux (x86-64, arm64) and Windows (x86-64). **Experimental** free-threaded builds for Python 3.13 & 3.14 are also provided.


Free-threading Build Support
~~~~~~~~~~~~~~~~~~~~~~~~~~~~

As of cuda-core 0.4.0, **experimental** packages for the `free-threaded interpreter`_ are shipped.
Review comment from the PR author:

I moved this section up and removed the mention of PyPI, because it should be (hopefully) straightforward to build FT support on conda-forge.


1. Support for these builds is best effort, due to heavy use of `built-in
   modules that are known to be thread-unsafe`_, such as ``ctypes``.
2. For now, you are responsible for making sure that calls into ``cuda-core``
   libraries are thread-safe. This is subject to change.

.. _built-in modules that are known to be thread-unsafe: https://github.com/python/cpython/issues/116738
.. _free-threaded interpreter: https://docs.python.org/3/howto/free-threading-python.html


Installing from PyPI
--------------------
@@ -42,17 +57,6 @@ and likewise use ``[cu13]`` for CUDA 13.
Note that using ``cuda.core`` with NVRTC installed from PyPI via ``pip install`` requires
``cuda.bindings`` 12.8.0+. Likewise, with nvJitLink it requires 12.8.0+.
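
The 12.8.0+ requirement can be verified at runtime from package metadata. This is a minimal sketch using only the standard library; the helper names are hypothetical, the distribution name ``cuda-bindings`` is taken from the package name used in these docs, and the parser only handles simple numeric versions (a dedicated version-parsing library would be more robust).

```python
from importlib import metadata


def parse_release(version):
    """Parse leading numeric components, e.g. '12.8.0' -> (12, 8, 0)."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
        if len(digits) != len(piece):
            # Stop at a suffix such as 'rc1'; later components are not part of the release.
            break
    return tuple(parts)


def meets_minimum(installed, minimum=(12, 8, 0)):
    """True if the installed version string is at least the minimum release."""
    release = parse_release(installed)
    # Pad short versions such as '12.8' so tuple comparison behaves as expected.
    release += (0,) * (len(minimum) - len(release))
    return release >= minimum


def cuda_bindings_is_new_enough():
    """Hypothetical check: is an installed cuda-bindings at least 12.8.0?"""
    try:
        installed = metadata.version("cuda-bindings")
    except metadata.PackageNotFoundError:
        return False
    return meets_minimum(installed)


if __name__ == "__main__":
    print(cuda_bindings_is_new_enough())
```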

Free-threading Build Support
~~~~~~~~~~~~~~~~~~~~~~~~~~~~
As of cuda-core 0.4.0, wheels for the `free-threaded interpreter`_ are shipped to PyPI.

1. Support for these builds is best effort, due to heavy use of `built-in
   modules that are known to be thread-unsafe`_, such as ``ctypes``.
2. For now, you are responsible for making sure that calls into ``cuda-core``
   libraries are thread-safe. This is subject to change.

.. _built-in modules that are known to be thread-unsafe: https://github.com/python/cpython/issues/116738
.. _free-threaded interpreter: https://docs.python.org/3/howto/free-threading-python.html

Installing from Conda (conda-forge)
-----------------------------------
@@ -67,6 +71,7 @@ and likewise use ``cuda-version=13`` for CUDA 13.

Note that using ``cuda.core`` with nvJitLink installed from conda-forge requires ``cuda.bindings`` 12.8.0+.


Installing from Source
----------------------

7 changes: 4 additions & 3 deletions cuda_core/docs/source/release/0.4.0-notes.rst
@@ -3,17 +3,18 @@

.. currentmodule:: cuda.core.experimental

``cuda.core`` 0.X.Y Release Notes
``cuda.core`` 0.4.0 Release Notes
=================================

Released on TBD
Released on Oct 9, 2025


Highlights
----------

- This is the last release that officially supports Python 3.9.
- Fix for :class:`LaunchConfig` grid parameter unit conversion when thread block clusters are used.
- Python 3.14 is supported.
- **Experimental** free-threaded builds for Python 3.13/3.14 are made available. Any bugs can be reported to `our GitHub repo <https://github.com/NVIDIA/cuda-python>`_.


Breaking Changes