Commit c8a3ff2
zhangyue
docs(paged_attention): explain why
The rationale (CANN CPU-tensor contract + NPUGraph capturability) was
only documented in the Ascend ATB kernel header. Surface it on the base
class where the API contract lives, so any future backend implementor
understands why the optional host tensors are part of the signature.seq_lens_host / block_table_host exist1 parent 1f4c15e commit c8a3ff2
1 file changed
Lines changed: 13 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
32 | 32 | | |
33 | 33 | | |
34 | 34 | | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
35 | 48 | | |
36 | 49 | | |
37 | 50 | | |
| |||
0 commit comments