docs: refresh LightRAG model setup examples by OnlyTerp · Pull Request #17 · OnlyTerp/hermes-optimization-guide

OnlyTerp · 2026-05-27T16:37:07Z

Summary

Replaces #1 with a current-main-safe LightRAG setup refresh. The original PR had the right idea (show multiple LightRAG model options), but the exact examples are now stale:

kimi-2.5 is no longer the recommended current Kimi slug; Kimi's public docs now show kimi-k2.6.
https://api.moonshot.cn/v1 is the China-region endpoint; the international Kimi API docs use https://api.moonshot.ai/v1.
Cerebras qwen-3-32b is deprecated; the current production speed default is gpt-oss-120b.

Changes

Updates both part3-lightrag-setup.md and the combined README.md.
Adds three LightRAG .env options:
- Kimi K2.6 + Fireworks Qwen3-Embedding-8B (quality default)
- Cerebras GPT OSS 120B + Fireworks Qwen3-Embedding-8B (speed default)
- local Ollama LLM + local nomic-embed-text embeddings (free/private)
Keeps LightRAG's real LLM_BINDING_HOST / EMBEDDING_BINDING_HOST env vars.
Updates the entity-extraction model table and slow-ingestion recommendation.
Adds a changelog entry.

Verification

git diff --check clean.
LightRAG docs confirm LLM_BINDING_HOST is the correct env var for OpenAI-compatible and Ollama endpoints.
Kimi API docs confirm https://api.moonshot.ai/v1 and kimi-k2.6.
Cerebras docs mark qwen-3-32b deprecated and list gpt-oss-120b as a current production model.

Supersedes #1.

Replaces the stale LightRAG model guidance from #1 with current endpoints and slugs: - Kimi K2.6 via the international Moonshot API at https://api.moonshot.ai/v1 for the quality-oriented setup. - Cerebras gpt-oss-120b for the speed-oriented setup, replacing the deprecated qwen-3-32b example. - Fireworks Qwen3-Embedding-8B remains the high-quality embedding default. - Local Ollama example now includes both LLM_BINDING_HOST and EMBEDDING_BINDING_HOST. Also keeps README.md and part3-lightrag-setup.md in sync and updates the slow-ingestion recommendation. Supersedes #1.

devin-ai-integration

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 3 additional findings.

OnlyTerp merged commit 2e3e076 into main May 27, 2026
4 of 5 checks passed

OnlyTerp mentioned this pull request May 27, 2026

Update LightRAG setup: Kimi 2.5 entity extraction & Cerebras+Qwen3 options #1

Closed

4 tasks

devin-ai-integration Bot reviewed May 27, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: refresh LightRAG model setup examples#17

docs: refresh LightRAG model setup examples#17
OnlyTerp merged 1 commit into
mainfrom
docs/lightrag-current-model-options

OnlyTerp commented May 27, 2026 •

edited by devin-ai-integration Bot

Loading

Uh oh!

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

OnlyTerp commented May 27, 2026 • edited by devin-ai-integration Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Verification

Uh oh!

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

✅ Devin Review: No Issues Found

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

OnlyTerp commented May 27, 2026 •

edited by devin-ai-integration Bot

Loading