Think AI: A privacy-first autonomous reasoning engine. This architecture employs two distinct LLMs that continuously cross-examine each other to produce the most accurate results, ensuring your data remains entirely isolated in local storage. It delivers a production-ready infrastructure with full Docker containerization and Redis cache integration.
Standard language models can be prone to hallucinations during complex reasoning processes. Think AI aims to minimize this issue through its BinaryMind Architecture. Instead of presenting a single model's output directly to the user, the system operates on a "thesis-antithesis" principle:
- The Analyst (Qwen 2.5): Gathers data, scans web resources, and proposes an initial solution.
- The Critic (Gemma 2): Audits the proposed solution for logical consistency, identifies gaps, and assigns a truth probability score.
- Synthesis: A refined conclusion is presented to the user, formed by the consensus of these two models.
This approach ensures more reliable outcomes, especially in tasks requiring high accuracy such as technical analysis, code review, and deep research.
- Multi-Agent Orchestration: Qwen and Gemma models operate asynchronously. While one generates content, the other acts as an oversight mechanism.
- Live Confidence Score: The level of agreement or conflict between models is visualized via a dynamic bar on the interface, providing transparency on how much the AI trusts its own answer.
- Real-Time RAG: Thanks to DuckDuckGo integration, models rely not only on training data but also analyze current internet data.
- Semantic Memory: The system stores past conversations using vector embeddings, maintaining context for more consistent responses.
- Professional Reporting: Analysis processes and results can be exported as Markdown (.md) reports suitable for corporate use.
- Privacy-First: Powered by the Ollama infrastructure, all data and processing power remain on your local machine; no data is sent to the cloud.
- Docker Containerization: The entire system runs isolated in containers. Offers single-command setup, cross-platform compatibility, and production-ready deployment.
- Redis Caching: Repetitive queries are answered instantly. Achieved an 88% cache hit rate and a 35x performance boost.
- PII Security: Sensitive data (emails, phone numbers, ID numbers) is automatically detected and masked/redacted.
The project is built on a modern and scalable technology stack:
- Core: Python 3.x, Asyncio (Asynchronous Architecture)
- LLM Runtime: Ollama (Localhost)
- Models: Qwen 2.5 (Analyst) & Gemma 2 (Critic)
- AI & NLP: PyTorch, Scikit-learn
- Interface: Gradio
- Search & RAG: DuckDuckGo Search, Sentence Transformers, NumPy
- Container: Docker
- Cache: Redis 7
Follow the steps below to run the project on your local machine.
- Docker Desktop must be installed.
- Minimum 16 GB RAM recommended.
- 15 GB free disk space.
Windows:
git clone [https://github.com/canberkdumann/Think_AI_Main.git](https://github.com/canberkdumann/Think_AI_Main.git)
cd Think_AI_Main
.\start.ps1