You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Refine README with modern design and improved structure
- Add centered header with styled logo
- Improve badge layout with flat-square design
- Streamline Quick Start section into 3 clear steps
- Better organize CLI commands with subheadings
- Add How It Works section with architecture diagram
- Enhance language support display with emojis
- Improve overall visual hierarchy and readability
- Maintain all technical content and experimental results
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
Given a large and evolving codebase, the ability to automatically generate holistic, architecture-aware documentation that captures not only individual functions but also their cross-file, cross-module, and system-level interactions remains an open challenge. We present **CodeWiki**, a unified framework for automated repository-level documentation across seven programming languages. CodeWiki introduces three key innovations: (i) hierarchical decomposition that preserves architectural context across multiple levels of granularity, (ii) recursive multi-agent processing with dynamic task delegation for scalable generation, and (iii) multi-modal synthesis that integrates textual descriptions with visual artifacts such as architecture diagrams and data-flow representations.
**That's it!** Your documentation will be generated in `./docs/` with comprehensive repository-level analysis.
27
71
28
72
---
29
73
30
-
## Overview
74
+
## What is CodeWiki?
31
75
32
-
CodeWiki addresses the challenge of comprehensive documentation for large-scale repositories through three core innovations:
76
+
CodeWiki is an open-source framework for **automated repository-level documentation** across seven programming languages. It generates holistic, architecture-aware documentation that captures not only individual functions but also their cross-file, cross-module, and system-level interactions.
33
77
34
78
### Key Innovations
35
79
36
80
| Innovation | Description | Impact |
37
81
|------------|-------------|--------|
38
-
|**Hierarchical Decomposition**| Dynamic programming-inspired strategy that partitions repositories into coherent modules while preserving architectural context | Handles codebases of arbitrary size (86K-1.4M LOC tested) |
39
-
|**Recursive Agentic System**| Adaptive multi-agent processing with dynamic delegation capabilities for complex modules | Maintains quality while scaling to repository-level scope |
82
+
|**Hierarchical Decomposition**| Dynamic programming-inspired strategy that preserves architectural context | Handles codebases of arbitrary size (86K-1.4M LOC tested) |
83
+
|**Recursive Agentic System**| Adaptive multi-agent processing with dynamic delegation capabilities | Maintains quality while scaling to repository-level scope |
40
84
|**Multi-Modal Synthesis**| Generates textual documentation, architecture diagrams, data flows, and sequence diagrams | Comprehensive understanding from multiple perspectives |
**View comprehensive results:** See [paper](https://arxiv.org/abs/2510.24428) for complete evaluation on 21 repositories spanning all supported languages.
**View comprehensive results:** See [paper](https://arxiv.org/abs/2510.24428) for complete evaluation on 21 repositories spanning all supported languages.
195
+
196
+
---
197
+
198
+
## How It Works
199
+
200
+
### Architecture Overview
201
+
202
+
CodeWiki employs a three-stage process for comprehensive documentation generation:
203
+
204
+
1.**Hierarchical Decomposition**: Uses dynamic programming-inspired algorithms to partition repositories into coherent modules while preserving architectural context across multiple granularity levels.
205
+
206
+
2.**Recursive Multi-Agent Processing**: Implements adaptive multi-agent processing with dynamic task delegation, allowing the system to handle complex modules at scale while maintaining quality.
207
+
208
+
3.**Multi-Modal Synthesis**: Integrates textual descriptions with visual artifacts including architecture diagrams, data-flow representations, and sequence diagrams for comprehensive understanding.
0 commit comments