Skip to content

Commit 1f4a6d1

Browse files
add attention docs
Signed-off-by: Sudhakar Singh <sudhakars@nvidia.com>
1 parent 5535b09 commit 1f4a6d1

3 files changed

Lines changed: 752 additions & 3 deletions

File tree

docs/envvars.rst

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -143,8 +143,8 @@ Attention Backend Selection
143143
.. envvar:: NVTE_FUSED_ATTN_BACKEND
144144

145145
:Type: ``int`` (1 or 2)
146-
:Default: Auto-selected
147-
:Description: Force a specific FusedAttention backend. ``1`` = F16_arbitrary_seqlen (cuDNN, any seq len), ``2`` = FP8 backend. If not set, the backend is automatically selected based on the input configuration.
146+
:Default: Ignored
147+
:Description: Historical option for forcing a FusedAttention sub-backend. The current PyTorch DotProductAttention selector does not read this variable; the FusedAttention sub-backend is selected automatically from the input configuration and cuDNN support.
148148

149149
.. envvar:: NVTE_FUSED_ATTN_FORCE_WORKSPACE_OPT
150150

0 commit comments

Comments
 (0)