Project Instructions for Codex Agents

This repository is a research benchmark for Romanized Nepali customer-support retrieval-augmented generation. It is not a production chatbot.

Scope Control

Keep baseline systems simple and controlled.
Phase 1 is only Traditional Semantic RAG.
Do not mix Phase 2 Agentic RAG behavior into Phase 1.
Do not mix Phase 3 Agentic GraphRAG behavior into Phase 1.
Do not add query rewriting, intent routing, verifier agents, graph extraction, or graph lookup to Phase 1 code.

Do not hardcode API keys or provider secrets.
Read LLM provider keys and model names from the environment.
Supported Phase 1 LLM providers are Google Gemini and NVIDIA NIM only.
The benchmark must support retrieval-only runs when no LLM API key is available.