@@ -2542,12 +2542,10 @@ extern "C" {
     // TODO: add ggml_gated_delta_net_set_bcast() to be able to configure Q, K broadcast type: tiled vs interleaved [TAG_GGML_GDN_BCAST]
     // ref: https://github.com/ggml-org/llama.cpp/pull/19468#discussion_r2786394306
     //
-    // state is a 3D tensor of shape (S_v*S_v*H, K, n_seqs). K is the snapshot slot count:
-    //   K == 1 → output carries the final state only.
-    //   K > 1  → output carries K snapshot slots; the kernel writes the last min(n_tokens, K)
-    //            per-token snapshots into the trailing slots (earlier slots are left untouched
-    //            when n_tokens < K).
-    // Only slot 0 (state[:, 0, :]) is read as the initial state; the rest is shape signal.
+    // state is a 3D tensor of shape (S_v*S_v*H, K, n_seqs):
+    //   K == 1: output carries the final state only.
+    //   K > 1: output carries K snapshot slots; the kernel writes the last min(n_tokens, K)
+    //          per-token snapshots into the trailing slots
     GGML_API struct ggml_tensor * ggml_gated_delta_net (
             struct ggml_context * ctx,
             struct ggml_tensor * q,