This repository contains two Home Assistant custom integrations, installable via HACS:
| Integration | Purpose |
|---|---|
| LLM Assistant (`llm_assistant`) | OpenAI-compatible voice conversation agent with MCP support |
| Kokoro TTS (`kokoro_tts`) | Local text-to-speech via Kokoro-FastAPI with ROCm (AMD GPU) support |
A Docker setup for running Kokoro-FastAPI with AMD GPU (ROCm) acceleration is included in `docker/kokoro-rocm/`.
A conversation agent that connects to any OpenAI-compatible LLM endpoint (LM Studio, Ollama, llama.cpp, vLLM, OpenRouter, etc.) with full MCP (Model Context Protocol) server support.
- 🔌 OpenAI-compatible endpoint – works with any server exposing `/v1/chat/completions`
- 🤖 Model selection – fetches available models dynamically and lets you choose from a dropdown
- 🔧 MCP server configuration – supports both ephemeral MCP (URL-based) and LM Studio plugin references (`mcp/...`)
- 🏠 Home Assistant tool calling – full integration with HA's LLM API (control lights, switches, etc.)
- 🎙️ Voice/TTS streaming – streamed responses for low-latency Assist pipelines
- 🧠 LM Studio Native API – optional mode that calls `/api/v1/chat` directly so LM Studio can orchestrate MCP tool execution end-to-end
- ⚙️ Per-agent configuration – multiple agents on the same server, each with its own model and MCP setup
- Open HACS → Integrations
- Click the ⋮ menu → Custom repositories
- Add `https://github.com/rwfsmith/llm_assistant` as type Integration
- Search for LLM Assistant and click Download
- Restart Home Assistant
Copy the `custom_components/llm_assistant` folder to your HA `custom_components` directory and restart.
- Go to Settings → Devices & Services → Add Integration
- Search for LLM Assistant
- Enter the server URL (e.g. `http://192.168.1.100:1234/v1`)
  - For LM Studio: `http://localhost:1234/v1`
  - For Ollama: `http://localhost:11434/v1`
- Optionally enter an API key (leave blank for local servers)
- Optionally enable LM Studio Native API to use LM Studio's `/api/v1/chat` endpoint for MCP orchestration
Once the server entry is saved, add one or more Conversation Agents via the subentry flow:
- Click Add Agent on the integration card
- Choose a model from the dropdown (fetched live from your server)
- Configure the system prompt, temperature, HA LLM APIs, etc.
- Add MCP servers (optional):
| Type | When to use | Required fields |
|---|---|---|
| `ephemeral_mcp` | Remote/HTTP MCP servers, one-off requests | Label, URL |
| `plugin` | LM Studio pre-configured servers in `mcp.json` | Plugin ID (e.g. `mcp/playwright`) |
Note: Ephemeral MCP requires "Allow per-request MCPs" enabled in LM Studio Server Settings.
Plugin MCP requires "Allow calling servers from mcp.json" enabled.
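Once an agent is created, it can also be invoked outside the voice pipeline, for example from an automation via Home Assistant's `conversation.process` action. A minimal sketch (the `agent_id` shown is a placeholder; use the entity ID HA assigns to your agent):

```yaml
service: conversation.process
data:
  text: "Turn off the kitchen lights"
  agent_id: conversation.llm_assistant  # placeholder; find the real ID under Settings → Voice assistants
```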
When Use LM Studio Native API is enabled on the server entry, the integration switches from the standard OpenAI `/v1/chat/completions` endpoint to LM Studio's `/api/v1/chat` endpoint. In this mode:
- MCP servers are passed as `integrations` in the request body (sketched below)
- LM Studio executes MCP tool calls internally before returning the final response
- The response format uses LM Studio's `output` array (handled transparently by this integration)
- HA tool calling still works in parallel alongside MCP tools
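For reference, a request in this mode looks roughly like the sketch below. It is shown as YAML for readability; the actual payload is JSON, the model name is hypothetical, and the entry layout inside `integrations` is an assumption for illustration (only the field name itself comes from the description above):

```yaml
# Rough shape of an /api/v1/chat request (illustrative; sent as JSON on the wire)
model: qwen2.5-7b-instruct  # hypothetical model name
messages:
  - role: system
    content: You are a Home Assistant voice assistant.
  - role: user
    content: Open example.com and summarise the page.
integrations:               # MCP servers; this entry layout is assumed
  - type: plugin
    id: mcp/playwright
```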
- Install playwright-mcp and add it to LM Studio's `mcp.json` (see the sketch after these steps)
- Enable "Allow calling servers from mcp.json" in LM Studio settings
- In the agent config, add an MCP server with:
  - Type: `plugin`
  - Plugin ID: `mcp/playwright`
The agent can now browse the web on behalf of the user.
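For step 1, a matching `mcp.json` entry might look like the sketch below (assuming LM Studio's `mcpServers` layout; the `npx` invocation is one common way to run playwright-mcp):

```json
{
  "mcpServers": {
    "playwright": {
      "command": "npx",
      "args": ["@playwright/mcp@latest"]
    }
  }
}
```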
- Inspired by hass_local_openai_llm by @skye-harris
- Forked from the OpenRouter integration
- MCP support based on the LM Studio MCP API
A Home Assistant TTS (Text-to-Speech) platform that uses Kokoro-FastAPI — a local, high-quality neural TTS server with an OpenAI-compatible API.
- 🗣️ High-quality local TTS – Kokoro-82M model, no cloud dependency
- 🎙️ 50+ voices – American English, British English, Japanese, Mandarin Chinese and more
- 🔀 Voice blending – mix voices (e.g. `af_bella+af_sky`) for unique timbres
- ⚡ ROCm support – AMD GPU acceleration via the included Docker setup
- 🎚️ Per-call options – override voice and speed from automations or the voice pipeline
- 🔊 Multiple formats – MP3, WAV, FLAC, Opus, PCM
- 🔌 Wyoming protocol – built-in proxy on port `10200` for native HA voice pipeline integration
Same HACS custom repository: https://github.com/rwfsmith/llm_assistant → install Kokoro TTS.
See `docker/kokoro-rocm/README.md` for the full AMD GPU setup guide.
Quick start (on a Linux host with ROCm drivers installed):
```bash
cd docker/kokoro-rocm
bash setup.sh
```

The TTS API will be available at `http://<HOST_IP>:8880`, and the Wyoming proxy on port `10200`.
Wyoming (recommended): Settings → Devices & Services → Add Integration → Wyoming → host IP + port 10200.
HTTP (Kokoro TTS custom integration): Settings → Devices & Services → Add Integration → Kokoro TTS → enter `http://<HOST_IP>:8880`, then choose a default voice, speed, and format from the live dropdowns.
```yaml
service: tts.speak
target:
  entity_id: tts.kokoro_tts
data:
  media_player_entity_id: media_player.living_room
  message: "The front door has been opened."
  options:
    voice: af_bella
    speed: 1.1
```

| Prefix | Language |
|---|---|
| `af_*` | American English (female) |
| `am_*` | American English (male) |
| `bf_*` | British English (female) |
| `bm_*` | British English (male) |
| `jf_*` | Japanese (female) |
| `jm_*` | Japanese (male) |
| `zf_*` | Mandarin Chinese (female) |
| `zm_*` | Mandarin Chinese (male) |
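Voice blending (from the feature list above) uses the same `options` field: pass a combined voice name instead of a single one. A sketch reusing the entities from the example above:

```yaml
service: tts.speak
target:
  entity_id: tts.kokoro_tts
data:
  media_player_entity_id: media_player.living_room
  message: "Good evening. All doors are locked."
  options:
    voice: af_bella+af_sky  # blend two voices, as described in the feature list
```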