Skip to content

Commit fd61190

Browse files
authored
Merge pull request #20 from rostilos/0.1.2-rc
0.1.2 rc
2 parents 562cb5a + 6ea7127 commit fd61190

96 files changed

Lines changed: 8532 additions & 9604 deletions

File tree

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

README.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -6,23 +6,23 @@
66

77
CodeCrow supports multiple version control systems with varying levels of integration. Below is the current feature matrix:
88

9-
| Feature | Bitbucket | GitHub | GitLab |
10-
| :--- | :---: | :---: | :---: |
11-
| PR Analysis | + | + | - |
12-
| Branch Analysis | + | + | - |
9+
| Feature | Bitbucket | GitHub | GitLab |
10+
|:-----------------------| :---: | :---: | :---: |
11+
| PR Analysis | + | + | - |
12+
| Branch Analysis | + | + | - |
1313
| Task Context Retrieval | - | - | - |
14-
| /ask | + | + | - |
15-
| /analyze | + | + | - |
16-
| /review | + | + | - |
17-
| Continuous Analysis | + | + | - |
18-
| RAG Pipeline | + | + | - |
14+
| /ask | + | + | - |
15+
| /analyze | + | + | - |
16+
| /summarize | + | + | - |
17+
| Continuous Analysis | + | + | - |
18+
| RAG Pipeline | + | + | - |
1919

2020
## Key Features
2121

2222
- **Context-Aware Reviews**: Powered by a custom RAG (Retrieval-Augmented Generation) pipeline using Qdrant vector storage.
2323
- **Incremental Analysis**: Only scans changed code to keep feedback fast and cost-efficient.
2424
- **Multi-Tenant Architecture**: Securely manage multiple teams and projects from a single dashboard.
25-
- **Interactive Commands**: Command CodeCrow directly from PR comments using `/ask`, `/analyze`, and `/review`.
25+
- **Interactive Commands**: Command CodeCrow directly from PR comments using `/ask`, `/analyze`, and `/summarize`.
2626

2727
## Documentation
2828

deployment/config/mcp-client/.env.sample

Lines changed: 57 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,5 +6,60 @@ RAG_API_URL=http://host.docker.internal:8001
66
CODECROW_API_URL=http://codecrow-web-application:8081
77

88
# Platform MCP JAR path
9-
PLATFORM_MCP_JAR=/app/codecrow-platform-mcp-1.0.jar\
10-
INTERNAL_API_SECRET=codecrow-internal-secret-change-me
9+
PLATFORM_MCP_JAR=/app/codecrow-platform-mcp-1.0.jar
10+
INTERNAL_API_SECRET=codecrow-internal-secret-change-me
11+
12+
# === LLM Settings ===
13+
# Temperature for code review (0.0 = deterministic, 0.1-0.3 = more creative)
14+
LLM_TEMPERATURE=0.0
15+
16+
# === Direct Review Mode ===
17+
# When rawDiff is provided in request, bypasses MCP agent for faster review
18+
# This is controlled by pipeline-agent sending rawDiff field
19+
# No configuration needed - automatic when rawDiff is present
20+
21+
# === Diff Processing Limits (matching MCP server LargeContentFilter) ===
22+
# Maximum file size in bytes (default: 25KB - same as LargeContentFilter.DEFAULT_SIZE_THRESHOLD_BYTES)
23+
DIFF_MAX_FILE_SIZE=25600
24+
# Maximum files to include in review (default: 100)
25+
DIFF_MAX_FILES=100
26+
# Maximum total diff size in bytes (default: 500KB)
27+
DIFF_MAX_TOTAL_SIZE=500000
28+
# Maximum lines per file (default: 1000)
29+
DIFF_MAX_LINES_PER_FILE=1000
30+
31+
# === LLM Reranking (for large PRs) ===
32+
# Enable LLM-based reranking for large PRs
33+
LLM_RERANK_ENABLED=true
34+
# Minimum files to trigger LLM reranking (default: 20)
35+
LLM_RERANK_THRESHOLD=20
36+
# Max RAG items to rerank via LLM
37+
LLM_RERANK_MAX_ITEMS=15
38+
39+
# === RAG Context Settings ===
40+
# Minimum relevance score for RAG results (0.0-1.0)
41+
RAG_MIN_RELEVANCE_SCORE=0.7
42+
# Default number of RAG chunks to retrieve
43+
RAG_DEFAULT_TOP_K=15
44+
# Cache TTL in seconds
45+
RAG_CACHE_TTL_SECONDS=300
46+
# Max cache entries
47+
RAG_CACHE_MAX_SIZE=100
48+
49+
# === Context Budget (token allocation percentages, must sum to 1.0) ===
50+
CONTEXT_BUDGET_HIGH_PRIORITY_PCT=0.30
51+
CONTEXT_BUDGET_MEDIUM_PRIORITY_PCT=0.40
52+
CONTEXT_BUDGET_LOW_PRIORITY_PCT=0.20
53+
CONTEXT_BUDGET_RAG_PCT=0.10
54+
55+
# === Prompt Logging (Debug) ===
56+
# Enable prompt logging for debugging
57+
PROMPT_LOG_ENABLED=true
58+
# Log full prompts to file
59+
PROMPT_LOG_TO_FILE=true
60+
# Also log to console (verbose)
61+
PROMPT_LOG_TO_CONSOLE=false
62+
# Directory for prompt logs
63+
PROMPT_LOG_DIR=/tmp/codecrow_prompts
64+
# Max log files to keep (auto-cleanup)
65+
PROMPT_LOG_MAX_FILES=50

deployment/config/rag-pipeline/.env.sample

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ QDRANT_COLLECTION_PREFIX=codecrow
55
# OpenRouter Configuration
66
# Get your API key from https://openrouter.ai/
77
OPENROUTER_API_KEY=sk-or-v1-your-api-key-here
8-
OPENROUTER_MODEL=openai/text-embedding-3-small
8+
OPENROUTER_MODEL=qwen/qwen3-embedding-8b
99

1010
#Additional Environment Variables
1111
HOME=/tmp
@@ -14,6 +14,10 @@ TRANSFORMERS_CACHE=/tmp/.transformers_cache
1414
HF_HOME=/tmp/.huggingface
1515
LLAMA_INDEX_CACHE_DIR=/tmp/.llama_index
1616

17+
# Free Plan Limits (Optional - set to 0 to disable limits)
18+
RAG_MAX_CHUNKS_PER_INDEX=70000
19+
RAG_MAX_FILES_PER_INDEX=40000
20+
1721
# Alternative OpenRouter models (use full format with provider prefix):
1822
# OPENROUTER_MODEL=openai/text-embedding-3-large # Higher quality, more expensive
1923
# OPENROUTER_MODEL=openai/text-embedding-ada-002 # Legacy model

docs/01-overview.md

Lines changed: 0 additions & 101 deletions
This file was deleted.

0 commit comments

Comments
 (0)