| 1 | 1 | # TensorRT OSS Release Changelog |
| 2 | +## 10.16 GA - 2026-3-24 |
| 3 | + |
| 4 | +- General |
| 5 | + - Default CUDA version updated to CUDA 13.2. |
| 6 | + |
| 7 | +- Samples |
| 8 | + - Added sampleDistCollective sample to showcase multi-device execution in TensorRT. |
| 9 | + |
| 10 | +- Parsers |
| 11 | + - Added kADJUST_FOR_DLA flag to adjust parsing behavior so that ONNX models are more amenable to DLA hardware execution. |
| 12 | + - Added DistCollective operator support for multi-device execution in TensorRT. |
| 13 | + |
| 14 | +## 10.15 GA - 2026-2-2 |
| 15 | + |
| 16 | +- Sample changes |
| 17 | + - Added 2 safety samples, sampleSafeMNIST and sampleSafePluginV3, to demonstrate how to use TensorRT with the safety workflow. |
| 18 | + - Added trtSafeExec to accompany the safety workflow release. |
| 19 | + - Added python/stream_writer to showcase how to serialize a TensorRT engine directly to a custom stream using the IStreamWriter interface, rather than writing to a file or a contiguous memory buffer. |
| 20 | + - Added python/strongly_type_autocast to demonstrate how to convert FP32 ONNX models to mixed precision (FP32-FP16) using ModelOpt's AutoCast tool and subsequently building the engine with TensorRT's Strong Typing mode. |
| 21 | + - Added sampleCudla to demonstrate how to use the cuDLA API to run TensorRT engines on the Deep Learning Accelerator (DLA) hardware, which is available on NVIDIA Jetson and DRIVE platforms. |
| 22 | + - Deprecated sampleCharRNN. |
| 23 | + |
| 24 | +- Plugin changes |
| 25 | + - Deprecated bertQKVToContextPlugin; it will be removed in a future release. No alternatives are planned. |
| 26 | + |
| 27 | +- Parser changes |
| 28 | + - Added support for `RotaryEmbedding`, `RMSNormalization` and `TensorScatter` for improved LLM model support |
| 29 | + - Added more specialized quantization ops for models quantized through TensorRT ModelOptimizer. |
| 30 | + - Added `kREPORT_CAPABILITY_DLA` flag to enable per-node validation when building DLA engines through TensorRT. |
| 31 | + - Added `kENABLE_PLUGIN_OVERRIDE` flag to enable TensorRT plugin override for nodes that share names with user plugins. |
| 32 | + - Improved error reporting for models with multiple subgraphs, such as `Loop` or `Scan` nodes. |
| 33 | + |
| 34 | +- Demo changes |
| 35 | + - demoDiffusion: Stable Diffusion 1.5, 2.0 and 2.1 pipelines have been deprecated and removed. |
| 36 | + - Added support for Wan2.2-T2V-A14B Text to Video pipeline |
| 37 | + |
| 38 | +## 10.14 GA - 2025-11-7 |
| 39 | +- Sample changes |
| 40 | + - Replaced all pycuda usages with cuda-python APIs |
| 41 | + - Removed the efficientnet samples |
| 42 | + - Deprecated tensorflow_object_detection and efficientdet samples |
| 43 | + - Samples will no longer be released with the packages. The TensorRT GitHub repository will be the single source. |
| 44 | + |
| 45 | + |
| 46 | +- Parsers: |
| 47 | + - Added support for the `Attention` operator |
| 48 | + - Improved refit for `ConstantOfShape` nodes |
| 49 | + |
| 50 | +- Demos |
| 51 | + - demoDiffusion: |
| 52 | + - Added support for the Cosmos-Predict2 text2image and video2world pipelines |
| 53 | + |
2 | 54 |
|
3 | 55 | ## 10.13.3 GA - 2025-9-8 |
4 | 56 | - Added support for the TensorRT API Capture and Replay feature. See the [developer guide](https://docs.nvidia.com/deeplearning/tensorrt/latest/inference-library/advanced.html) for more information. |
5 | 57 | - Demo changes |
6 | 58 | - Added support for Flux Kontext pipeline. |
7 | 59 |
|
8 | | - |
9 | 60 | ## 10.13.2 GA - 2025-8-18 |
10 | 61 | - Added support for CUDA 13.0, dropped support for CUDA 11.X |
11 | 62 | - Dropped support for Ubuntu 20.04 |
|
24 | 75 | - Added `loadModelProto`, `loadInitializer` and `refitModelProto` APIs for IParserRefitter. These APIs are meant to be used to load user initializers when refitting ONNX models. |
25 | 76 | - Deprecated `IParser::parseWithWeightDescriptors`. |
26 | 77 |
|
27 | | - |
28 | 78 | ## 10.12.0 GA - 2025-6-10 |
29 | 79 | - Plugin changes |
30 | 80 | - Migrated `IPluginV2`-descendent version 1 of `cropAndResizeDynamic`, to version 2, which implements `IPluginV3`. |