Skip to content

hexagon: make vmem and buffer-size configurable#22487

Merged
max-krasnyansky merged 9 commits intoggml-org:masterfrom
qualcomm:hexagon-auto-vmem
Apr 29, 2026
Merged

hexagon: make vmem and buffer-size configurable#22487
max-krasnyansky merged 9 commits intoggml-org:masterfrom
qualcomm:hexagon-auto-vmem

Conversation

@max-krasnyansky
Copy link
Copy Markdown
Member

Overview

This PR adds two new knobs to the Hexagon backend

  • GGML_HEXAGON_VMEM
    Allows for overriding default VMEM limit. The default is the same as before (around 3.2GB)
    If set to 0 the backend will try to measure it by pre-mmaping the buffers
  • GGML_HEXAGON_MBUF
    Allows for overriding default buffer. The default is the same as before (1GB)
    This might be handy on the IOT devices where the allocator might struggle with 1GB DMA buffers

I also streamlined mapping management a bit further (pinned mappings are now managed directly by the host, etc) and updated logging in the related areas.

Requirements

@max-krasnyansky max-krasnyansky requested a review from a team as a code owner April 28, 2026 21:21
@github-actions github-actions Bot added script Script related ggml changes relating to the ggml tensor library for machine learning Hexagon labels Apr 28, 2026
@max-krasnyansky
Copy link
Copy Markdown
Member Author

@ggml-org/maintainers can I get a second approval please.

Comment thread scripts/snapdragon/adb/run-cli.sh
@max-krasnyansky
Copy link
Copy Markdown
Member Author

@lhez can you please re-approve (stale again after merging suggestions)

@max-krasnyansky max-krasnyansky merged commit 41a63be into ggml-org:master Apr 29, 2026
50 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ggml changes relating to the ggml tensor library for machine learning Hexagon script Script related

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants