feat(ggml): add opt-in tree attention verify paths #25

Draft
davide221 wants to merge 3 commits into luce-dflash from codex/ddtree-ggml-graph-opt

@davide221

Draft PR for the DDTree/DFlash verify graph work. It adds an opt-in ggml tree attention carrier, CUDA/HIP-facing fallback paths, and KV_min tile-skip plumbing, along with experiments in masked vector routing and Q8 tree attention. The work is built on lucebox and benchmarked with Laguna. The default dense fallback is retained unless the env flags are enabled.
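For readers unfamiliar with the pattern, here is a minimal sketch of the two ideas named above: an env-gated opt-in that leaves the dense path as the default, and an ancestor-only mask, which is one common way to express tree attention when verifying a draft tree. It is not taken from this PR's diff. The flag name `EXAMPLE_TREE_ATTN`, both function names, and the parent-array layout are assumptions made up for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <cstring>
#include <vector>

// Hypothetical flag name, for illustration only; the PR's real env flags are not listed here.
static const char * const kTreeAttnEnv = "EXAMPLE_TREE_ATTN";

// Opt-in check: true only when the variable is set to a non-empty value other than "0".
// When it returns false, a caller would keep using the default dense path.
bool tree_attn_enabled(const char * name = kTreeAttnEnv) {
    const char * v = std::getenv(name);
    return v != nullptr && v[0] != '\0' && std::strcmp(v, "0") != 0;
}

// Build an n x n row-major mask for a draft tree (assumed layout).
// parent[i] is the index of node i's parent, or -1 for a root.
// Parents must appear before their children (parent[i] < i).
// mask[i * n + j] == 1 means node i may attend to node j,
// i.e. j is node i itself or one of its ancestors.
std::vector<uint8_t> build_tree_mask(const std::vector<int> & parent) {
    const std::size_t n = parent.size();
    std::vector<uint8_t> mask(n * n, 0);
    for (std::size_t i = 0; i < n; ++i) {
        assert(parent[i] >= -1 && parent[i] < static_cast<int>(i));
        mask[i * n + i] = 1;  // every node attends to itself
        if (parent[i] >= 0) {
            const std::size_t p = static_cast<std::size_t>(parent[i]);
            // Inherit the parent's row: the parent's ancestors are also ours.
            for (std::size_t j = 0; j <= p; ++j) {
                mask[i * n + j] |= mask[p * n + j];
            }
        }
    }
    return mask;
}
```

A caller would check `tree_attn_enabled()` once at graph-build time and only then build and pass the mask. With the flag unset, nothing changes and the dense verify path is used. Because sibling branches never see each other, whole blocks of such a mask can be empty, which is the kind of structure a tile-skip optimization can exploit.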
