Skip to content

docs: refresh LightRAG model setup examples#17

Merged
OnlyTerp merged 1 commit into
mainfrom
docs/lightrag-current-model-options
May 27, 2026
Merged

docs: refresh LightRAG model setup examples#17
OnlyTerp merged 1 commit into
mainfrom
docs/lightrag-current-model-options

Conversation

@OnlyTerp

@OnlyTerp OnlyTerp commented May 27, 2026

Copy link
Copy Markdown
Owner

Summary

Replaces #1 with a current-main-safe LightRAG setup refresh. The original PR had the right idea (show multiple LightRAG model options), but the exact examples are now stale:

  • kimi-2.5 is no longer the recommended current Kimi slug; Kimi's public docs now show kimi-k2.6.
  • https://api.moonshot.cn/v1 is the China-region endpoint; the international Kimi API docs use https://api.moonshot.ai/v1.
  • Cerebras qwen-3-32b is deprecated; the current production speed default is gpt-oss-120b.

Changes

  • Updates both part3-lightrag-setup.md and the combined README.md.
  • Adds three LightRAG .env options:
    • Kimi K2.6 + Fireworks Qwen3-Embedding-8B (quality default)
    • Cerebras GPT OSS 120B + Fireworks Qwen3-Embedding-8B (speed default)
    • local Ollama LLM + local nomic-embed-text embeddings (free/private)
  • Keeps LightRAG's real LLM_BINDING_HOST / EMBEDDING_BINDING_HOST env vars.
  • Updates the entity-extraction model table and slow-ingestion recommendation.
  • Adds a changelog entry.

Verification

  • git diff --check clean.
  • LightRAG docs confirm LLM_BINDING_HOST is the correct env var for OpenAI-compatible and Ollama endpoints.
  • Kimi API docs confirm https://api.moonshot.ai/v1 and kimi-k2.6.
  • Cerebras docs mark qwen-3-32b deprecated and list gpt-oss-120b as a current production model.

Supersedes #1.


Open in Devin Review

Replaces the stale LightRAG model guidance from #1 with current endpoints
and slugs:

- Kimi K2.6 via the international Moonshot API at
  https://api.moonshot.ai/v1 for the quality-oriented setup.
- Cerebras gpt-oss-120b for the speed-oriented setup, replacing the
  deprecated qwen-3-32b example.
- Fireworks Qwen3-Embedding-8B remains the high-quality embedding default.
- Local Ollama example now includes both LLM_BINDING_HOST and
  EMBEDDING_BINDING_HOST.

Also keeps README.md and part3-lightrag-setup.md in sync and updates the
slow-ingestion recommendation.

Supersedes #1.
@OnlyTerp OnlyTerp merged commit 2e3e076 into main May 27, 2026
4 of 5 checks passed

@devin-ai-integration devin-ai-integration Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 3 additional findings.

Open in Devin Review

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant