rag-optimization

Here are 11 public repositories matching this topic...

neomatrix369 / rag-params-finder

RAG parameter sweep experimentation tool — systematically evaluate vector databases, embedding models, chunking strategies, and retrieval methods using any Vector database. Supports both Voyage AI (hosted) and local sentence-transformers models.

evaluation embedding-models hyperparameter-tuning similarity-score rag vector-search pre-evaluation rag-optimization retrieval-optimization no-mcp zero-llm no-agents relevancy-score chunking-strategies

Updated Jun 9, 2026
Python

umitkacar / llm-context-optimizer

Star

Biological code organization system with 1,029+ production-ready snippets - 95% token reduction for Claude/GPT with AI-powered discovery & offline packs

Updated Nov 10, 2025
Python

snowfoxHQ / vecRecall

Star

VecRecall 是一个改进版的 AI 长期记忆系统。它基于对原版 MemPalace 的深度分析重新构建，核心设计理念是将“信息检索”与“信息组织”彻底解耦。通过纯向量检索路径和独立的 SQLite UI 层，VecRecall 在保持灵活组织的同时，将召回率（R@5）从原版的 84% 提升至 96.6%+，为 AI Agent 提供更精准、更高效的上下文记忆支持。

python-library local-storage developer-tools ai-agents high-recall privacy-first long-term-memory vector-search chromadb local-llm context-management rag-optimization mcp-server claude-code semantic-recall

Updated May 1, 2026
Python

Kaos599 / BetterRAG

Star

BetterRAG: Powerful RAG evaluation toolkit for LLMs. Measure, analyze, and optimize how your AI processes text chunks with precision metrics. Perfect for RAG systems, document processing, and embedding quality assessment.

optimization evaluation embeddings evaluation-framework rag embeddings-extraction rag-evaluation rag-application rag-optimization chunking-optimization embeddings-optimization

Updated Mar 26, 2025
Python

Pablocheee / aio-project

Star

Official repository for AIO.CORE: The autonomous protocol for AI Optimization (AIO). Eliminates Semantic Drift and Context Fragmentation in LLMs through high-density vector indexing.

aio semantic-search vector-embeddings ai-optimization ai-agent llmo semantic-indexing rag-optimization context-preservation llm-visibility aio-core

Updated Mar 22, 2026
JavaScript

fabivsy / Tu-Mapa-IA-Data-Node

Star

Official Data Node for the Tu Mapa IA technical ecosystem. Features machine-readable manifests (llms.txt) and RAG-optimized entity data for 530+ curated AI tools across 23 professional verticals. Managed under the FixGeo Protocol.

rag-optimization llms-txt generative-engine-optimization ai-curation technical-sovereignty

Updated Jun 5, 2026

annpetrosiann / YSU_DSB_thesis

Star

This repo contains the full pipeline for my Master's thesis at Yerevan State University (YSU), developed as part of the Data Science for Business master's program. The goal of this project is to build an end-to-end Retrieval-Augmented Generation (RAG) system using semantic search, LLMs, and fine-tuned embeddings on Armenian banks’ financial PDFs.

data-science chatbot rag llm-training finetuning-llms datapre-processing rag-optimization

Updated Apr 28, 2025
Python

0xgetz / awesome-token-saving

Star

3,300 practical techniques to cut LLM token usage by up to 90% — 300 core principles + 3,000 context-specific applications.

developer-tools awesome-list prompt-engineering llm-ops rag-optimization token-efficiency ai-cost-reduction llm-cost-optimization api-cost-management

Updated Jun 19, 2026

jchuthemyth / Agize-GEO-Framework

Star

Technical framework for Generative Engine Optimization (GEO) and Share of Synthesis (SoS). Defines the 6-node Immutable Graph for latent space indexing.

geo llm-optimization rag-optimization ai-visibility semantic-architecture share-of-synthesis

Updated Mar 30, 2026

Ariyan-Pro / RAG-Latency-Optimization

Star

CPU-optimized RAG pipeline reducing latency 2.7× (247ms → 92ms). Implements caching, filtering, quantization for production. Complete with FastAPI, Docker, benchmarks, investor materials. The engineering showcase that sells itself.

docker caching dockerfile sales-engineering sqlite showcase embeddings low-latency production-ready demonstration semantic-search faiss fastapi retrieval-augmented-generation cpu-only rag-optimization ai-ml-performance-tuning becnhmarking

Updated May 6, 2026
Python

dfavenfre / RAG-Optimization

Star

optimization embedding-models large-language-models retrieval-augmented-generation rag-evaluation rag-optimization

Updated Oct 1, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the rag-optimization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the rag-optimization topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rag-optimization

Here are 11 public repositories matching this topic...

neomatrix369 / rag-params-finder

umitkacar / llm-context-optimizer

snowfoxHQ / vecRecall

Kaos599 / BetterRAG

Pablocheee / aio-project

fabivsy / Tu-Mapa-IA-Data-Node

annpetrosiann / YSU_DSB_thesis

0xgetz / awesome-token-saving

jchuthemyth / Agize-GEO-Framework

Ariyan-Pro / RAG-Latency-Optimization

dfavenfre / RAG-Optimization

Improve this page

Add this topic to your repo