Skip to content

Commit 9ae98f8

Browse files
committed
Update README.md and added LICENSE.md
1 parent 5137f52 commit 9ae98f8

2 files changed

Lines changed: 46 additions & 16 deletions

File tree

LICENSE.md

Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
# MIT License
2+
3+
Copyright (c) 2025 barrulus
4+
5+
Permission is hereby granted, free of charge, to any person obtaining a copy
6+
of this software and associated documentation files (the "Software"), to deal
7+
in the Software without restriction, including without limitation the rights
8+
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9+
copies of the Software, and to permit persons to whom the Software is
10+
furnished to do so, subject to the following conditions:
11+
12+
The above copyright notice and this permission notice shall be included in all
13+
copies or substantial portions of the Software.
14+
15+
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16+
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17+
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18+
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19+
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20+
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21+
SOFTWARE.

README.md

Lines changed: 25 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -50,7 +50,7 @@ Edit `.env` to configure your setup:
5050
```bash
5151
# Ollama Configuration
5252
OLLAMA_HOST=http://localhost:11434
53-
OLLAMA_MODEL=dolphincoder:15b
53+
OLLAMA_MODEL=qwen3:8b
5454
OLLAMA_EMBEDDING_MODEL=nomic-embed-text:latest
5555

5656
# Embedding Configuration
@@ -167,7 +167,7 @@ python ask.py my_queries.md
167167
| Variable | Description | Default |
168168
|----------|-------------|---------|
169169
| `OLLAMA_HOST` | Ollama server URL | `http://localhost:11434` |
170-
| `OLLAMA_MODEL` | LLM model for responses | `dolphincoder:15b` |
170+
| `OLLAMA_MODEL` | LLM model for responses | `qwen3:8b` |
171171
| `OLLAMA_EMBEDDING_MODEL` | Ollama embedding model | `nomic-embed-text:latest` |
172172
| `EMBEDDING_SERVER` | Remote embedding server URL | `http://localhost:5000` |
173173
| `EMBEDDING_MODEL` | Embedding model name | `nomic-ai/nomic-embed-text-v1.5` |
@@ -184,7 +184,7 @@ python ask.py my_queries.md
184184
#### Indexing (`index.py`)
185185

186186
```bash
187-
python index.py <repository> [options]
187+
python index.py /path/to/files/ [options]
188188

189189
Options:
190190
--local-embeddings Use local SentenceTransformer (default)
@@ -293,16 +293,16 @@ python embedding_server.py \
293293
--batch-size 64
294294
```
295295

296-
### Multiple Repositories
296+
### Multiple Folders
297297

298-
Index multiple repositories to the same database:
298+
Index multiple folders to the same database:
299299

300300
```bash
301-
# Index first repo
302-
python index.py /path/to/logs1 --chroma-path ./shared_db
301+
# Index first folder
302+
python index.py /path/to/files1 --chroma-path ./shared_db
303303

304-
# Add second repo to same database
305-
python index.py /path/to/logs2 --chroma-path ./shared_db
304+
# Add second folder to same database
305+
python index.py /path/to/files2 --chroma-path ./shared_db
306306
```
307307

308308
### Ollama Integration
@@ -311,13 +311,25 @@ Ensure Ollama is running with required models:
311311

312312
```bash
313313
# Install Ollama models
314-
ollama pull dolphincoder:15b
315-
ollama pull nomic-embed-text
314+
ollama pull qwen3:8b
315+
ollama pull nomic-embed-text:latest
316316

317317
# Start Ollama (usually runs as service)
318318
ollama serve
319319
```
320320

321+
#### Useful Ollama ENV variables
322+
323+
```bash
324+
OLLAMA_CONTEXT_LENGTH:8192
325+
OLLAMA_DEBUG:INFO
326+
OLLAMA_HOST:http://0.0.0.0:11434
327+
OLLAMA_INTEL_GPU:false
328+
OLLAMA_KEEP_ALIVE:3h
329+
OLLAMA_MAX_LOADED_MODELS:1
330+
OLLAMA_MAX_QUEUE:512
331+
```
332+
321333
## API Endpoints
322334

323335
The embedding server provides RESTful endpoints:
@@ -361,6 +373,7 @@ The system uses a single standardized configuration:
361373
- **tiktoken**: Token counting utilities
362374
- **einops**: Tensor operations for advanced models
363375
- **requests**: HTTP client for remote services
376+
- **pypdf***: PDF parsing library
364377

365378
## File Structure
366379

@@ -374,12 +387,8 @@ The system uses a single standardized configuration:
374387
└── chroma_db/ # Default ChromaDB storage (created after indexing)
375388
```
376389

377-
## License
378-
379-
This project is designed for local development and research use. Please ensure compliance with the terms of service for any external models or APIs used.
380-
381390
## Contributions
382391

383392
I welcome any assistance on this project, especially around trying new models for better performance and testing against ore logs than I have at my disposal!
384393

385-
Please just fork off of dev and then submit a PR
394+
Please just fork off of dev and then submit a PR

0 commit comments

Comments
 (0)