feat: add coordinate space abstraction for open weights LLM support #282

Draft
philipph-askui wants to merge 1 commit into main from feat/llm_coordinate_system

@philipph-askui

Contributor

Introduces a VlmCoordinateSpace strategy (pixel, scaled, normalized) so that agentOS facades can map model-emitted coordinates to screen pixels.
Adds auto-detection in OllamaVlmProvider for Qwen and Holo (0-1000 grid) and Kimi (0.0-1.0 floats).
Appends coordinate-space information to the system prompt for OpenAI-compatible providers.
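
For reviewers, here is a minimal sketch of the mapping described above. It is not the code in this PR: `CoordinateSpace`, `Screen`, `to_pixels`, `detect_space`, and `coordinate_hint` are hypothetical names, and the substring-based detection, the pixel fallback for unknown models, the clamping to the screen bounds, and the prompt wording are all assumptions made for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class CoordinateSpace(Enum):
    """Hypothetical stand-in for the three strategies named in the description."""
    PIXEL = "pixel"            # model emits screen pixels directly
    SCALED = "scaled"          # model emits values on a 0-1000 grid
    NORMALIZED = "normalized"  # model emits floats in 0.0-1.0


@dataclass(frozen=True)
class Screen:
    width: int
    height: int


def to_pixels(x, y, space, screen, grid=1000):
    """Map a model-emitted point to integer screen pixels."""
    if space is CoordinateSpace.PIXEL:
        px, py = x, y
    elif space is CoordinateSpace.SCALED:
        px, py = x / grid * screen.width, y / grid * screen.height
    else:  # CoordinateSpace.NORMALIZED
        px, py = x * screen.width, y * screen.height
    # Assumption: clamp so the far edge of the grid maps to the last valid pixel.
    cx = min(max(round(px), 0), screen.width - 1)
    cy = min(max(round(py), 0), screen.height - 1)
    return cx, cy


def detect_space(model_name):
    """Guess the coordinate space from a model name (assumed substring match)."""
    name = model_name.lower()
    if "qwen" in name or "holo" in name:
        return CoordinateSpace.SCALED
    if "kimi" in name:
        return CoordinateSpace.NORMALIZED
    return CoordinateSpace.PIXEL  # assumed fallback for unrecognized models


def coordinate_hint(space, screen):
    """Build a sentence that could be appended to a system prompt (wording assumed)."""
    if space is CoordinateSpace.SCALED:
        return "Report coordinates as integers on a 0-1000 grid."
    if space is CoordinateSpace.NORMALIZED:
        return "Report coordinates as floats between 0.0 and 1.0."
    return f"Report coordinates in pixels on a {screen.width}x{screen.height} screen."
```

Under these assumptions, a caller would resolve the space once per model with `detect_space(...)` and then pass every model-emitted point through `to_pixels(...)` before handing it to the agentOS facade.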
