Skip to content

Commit 5f31dd8

Browse files
committed
Add release notes for PRs identified by at-leofang:
#519 (comment)
1 parent 147064a commit 5f31dd8

File tree

1 file changed

+18
-6
lines changed

1 file changed

+18
-6
lines changed

cuda_core/docs/source/release/0.2.0-notes.rst

Lines changed: 18 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -27,18 +27,30 @@ New features
2727
- Expose :class:`ObjectCode` as a public API, which allows loading cubins from memory or disk. For loading other kinds of code types, please continue using :class:`Program`.
2828
- A C++ helper function ``get_cuda_native_handle()`` is provided in the new ``include/utility.cuh`` header to retrive the underlying CUDA C objects (ex: ``CUstream``) from a Python object returned by the ``.handle`` attribute (ex: :attr:`Stream.handle`).
2929
- For objects such as :class:`Program` and :class:`Linker` that could dispatch to different backends, a new ``.backend`` attribute is provided to query this information.
30-
- Support CUDA event timing. (#481, #498, #508)
31-
- An :class:`~_event.Event` may now be created without recording it to a :class:`~_stream.Stream` using the :meth:`Device.create_event` method.
30+
- Support CUDA :class:`Event` timing. (#481, #498, #508)
31+
- An :class:`Event` may now be created without recording it to a :class:`~_stream.Stream` using the :meth:`Device.create_event` method.
32+
- :class:`Program` now supports the additional ``PTX`` code type. (#317)
33+
- :meth:`Linker.link` exceptions now include the original error log. (#423)
34+
- In a systematic sweep through the cuda.core implementations, many exceptions messages were made more consistent and informative. (#458)
35+
36+
New examples
37+
------------
38+
- ``jit_lto_fractal.py`` — Demonstrates just-in-time link-time optimization for fractal generation. (:class:`Device`, :class:`LaunchConfig`, :class:`Linker`, :class:`LinkerOptions`, :class:`Program`, :class:`ProgramOptions`) (#475)
39+
- ``simple_multi_gpu_example.py`` — Example of using multiple GPUs. (:class:`Device`, :class:`Program`, :class:`LaunchConfig`) (#304)
40+
- ``show_device_properties.py`` — Displays detailed device properties. (:class:`Device`) (#474)
3241

3342
Minor fixes and enhancements
3443
----------------------------
35-
- Fix a dangling pointer problem in _linker.py (#516)
36-
- Add ``@functools.lru_cache`` decorator for ``get_binding_version()`` (#512)
37-
- Change selected ``.decode()`` to ``.decode("utf-8", errors="backslashreplace")`` (#510)
44+
- A dangling pointer problem in ``_linker.py`` was fixed. (#516)
45+
- Add ``@functools.lru_cache`` decorator for :func:`get_binding_version`. (#512)
46+
- Selected ``.decode()`` were changed to ``.decode("utf-8", errors="backslashreplace")`` to ensure that decoding error messages does not abort the process. (#510)
47+
- The performance of :meth:`Device.compute_capability` was improved. (#459)
48+
- The :class:`Program` constructor now issues a warning when falling back to :func:`cuLink`. (#315)
49+
- To avoid deprecation warnings, the cuda.bindings imports in the cuda.core implementations were cleaned up. (#404)
3850

3951
Test fixes
4052
----------
41-
- Clean up device initialization in test (#507)
53+
- Clean up device initialization in some tests. (#507)
4254

4355
Limitations
4456
-----------

0 commit comments

Comments
 (0)