Commit 7a17ef9

docs: update papers
1 parent ec3e389 commit 7a17ef9

12 files changed: +66 -23 lines changed

README.md

Lines changed: 6 additions & 3 deletions
@@ -12,7 +12,7 @@
 [![Hugging Face](https://img.shields.io/badge/HF_Paper-2601.11655-FFD21E?style=for-the-badge&logo=huggingface&logoColor=000)](https://huggingface.co/papers/2601.11655)
 [![Tables](https://img.shields.io/badge/TABLES-Statistics-blue?style=for-the-badge&logo=databricks)](https://deepsoftwareanalytics.github.io/Awesome-Issue-Resolution/tables/)
 [![Contributors](https://img.shields.io/github/contributors/DeepSoftwareAnalytics/Awesome-Issue-Resolution?style=for-the-badge&color=green&logo=github)](https://github.com/DeepSoftwareAnalytics/Awesome-Issue-Resolution/graphs/contributors)
-![Papers Count](https://img.shields.io/badge/papers-178-green?style=for-the-badge&logo=googlescholar&logoColor=white)
+![Papers Count](https://img.shields.io/badge/papers-181-green?style=for-the-badge&logo=googlescholar&logoColor=white)
 
 [**📖 Documentation Website**](https://deepsoftwareanalytics.github.io/Awesome-Issue-Resolution/) | [**📄 Full Paper**](https://deepsoftwareanalytics.github.io/Awesome-Issue-Resolution/paper/) | [**📋 Tables & Resources**](https://deepsoftwareanalytics.github.io/Awesome-Issue-Resolution/tables/)
 
@@ -32,7 +32,7 @@
 
 ## 📖 Abstract
 
-Based on a systematic review of **178 papers and online resources**, this survey establishes a holistic theoretical framework for Issue Resolution in software engineering. We examine how **Large Language Models (LLMs)** are transforming the automation of GitHub issue resolution. Beyond the theoretical analysis, we have curated a comprehensive collection of datasets and model training resources, which are continuously synchronized with our GitHub repository and project documentation website.
+Based on a systematic review of **181 papers and online resources**, this survey establishes a holistic theoretical framework for Issue Resolution in software engineering. We examine how **Large Language Models (LLMs)** are transforming the automation of GitHub issue resolution. Beyond the theoretical analysis, we have curated a comprehensive collection of datasets and model training resources, which are continuously synchronized with our GitHub repository and project documentation website.
 
 <!-- START EXPLORE -->
 **🔍 Explore This Survey:**
@@ -67,7 +67,7 @@ Based on a systematic review of **178 papers and online resources**, this survey
 ## 📚 Complete Paper List
 
 
-> **Total: 178 works** across 14 categories
+> **Total: 181 works** across 14 categories
 
 
 ### 📊 Evaluation Datasets
@@ -118,6 +118,7 @@ Based on a systematic review of **178 papers and online resources**, this survey
 - **RepoForge**: RepoForge: Training a SOTA Fast-thinking SWE Agent with an End-to-End Data Curation Pipeline Synergizing SFT and RL at Scale (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2508.01550)
 - **SWE-Mirror**: SWE-Mirror: Scaling Issue-Resolving Datasets by Mirroring Issues Across Repositories (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2509.08724)
 - **SWE-Lego**: SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2601.01426)
+- **SWE-Universe**: SWE-Universe: Scale Real-World Verifiable Environments to Millions (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://www.arxiv.org/abs/2602.02361)
 
 ### 🤖 Single-Agent Systems
 
@@ -211,6 +212,7 @@ Based on a systematic review of **178 papers and online resources**, this survey
 - **InfCode**: InfCode: Adversarial Iterative Refinement of Tests and Patches for Reliable Software Issue Resolution (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2511.16004)
 - **GraphLocator**: GraphLocator: Graph-guided Causal Reasoning for Issue Localization (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2512.22469)
 - **SWE-Tester**: SWE-Tester: Training Open-Source LLMs for Issue Reproduction in Real-World Repositories (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2601.13713)
+- **Closing the Loop**: Closing the Loop: Universal Repository Representation with RPG-Encoder (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2602.02084) [![Website](https://img.shields.io/badge/Website-paper-5B9BD5?logo=googlechrome&logoColor=white)](https://ayanami2003.github.io/RPG-Encoder/) [![GitHub](https://img.shields.io/badge/GitHub-repo-24292F?logo=github&logoColor=white)](https://github.com/microsoft/RPG-ZeroRepo)
 
 ### 🧠 Memory-Enhanced Methods
 
@@ -311,6 +313,7 @@ Based on a systematic review of **178 papers and online resources**, this survey
 - **SWE-MERA**: SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2507.11059)
 - **RepoForge**: RepoForge: Training a SOTA Fast-thinking SWE Agent with an End-to-End Data Curation Pipeline Synergizing SFT and RL at Scale (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2508.01550)
 - **Multi-Docker-Eval**: Multi-Docker-Eval: A `Shovel of the Gold Rush' Benchmark on Automatic Environment Building for Software Engineering (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2512.06915)
+- **DockSmith**: DockSmith: Scaling Reliable Coding Environments via an Agentic Docker Builder (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2602.00592) [![HuggingFace](https://img.shields.io/badge/HuggingFace-dataset-ff7e21?logo=huggingface&logoColor=white)](https://huggingface.co/collections/8sj7df9k8m5x8/docksmith)
 
 ### 🔬 Data Synthesis Methods
 
data/papers_data_collection.yaml

Lines changed: 10 additions & 3 deletions
@@ -1,6 +1,3 @@
-# Data Collection
-# Auto-generated from taxonomy.tex and BibTeX file
-
 - short_name: SWE-rebench
   title: 'SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated
     Evaluation of Software Engineering Agents'
@@ -56,3 +53,13 @@
   venue: arXiv preprint arXiv:2512.06915 2025
   links:
     arxiv: https://arxiv.org/abs/2512.06915
+- short_name: DockSmith
+  title: 'DockSmith: Scaling Reliable Coding Environments via an Agentic Docker Builder'
+  authors: Jiaran Zhang, Luck Ma, Yanhao Li, Fanqi Wan, Di Qi, Xu Zhao, Jieyi Hou,
+    Zhe Xie, Mengqiang Ren, Xin Wu, Zhewei Huang, Liangyu Chen, Yingwei Ma, Qi Han,
+    Xiangyu Zhang
+  venue: arXiv preprint arXiv:2602.00592
+  year: '2026'
+  links:
+    arxiv: https://arxiv.org/abs/2602.00592
+    huggingface: https://huggingface.co/collections/8sj7df9k8m5x8/docksmith

data/papers_tool.yaml

Lines changed: 10 additions & 0 deletions
@@ -300,3 +300,13 @@
   year: '2026'
   links:
     arxiv: https://arxiv.org/abs/2601.13713
+- short_name: Closing the Loop
+  title: 'Closing the Loop: Universal Repository Representation with RPG-Encoder'
+  authors: Jane Luo, Chengyu Yin, Xin Zhang, Qingtao Li, Steven Liu, Yiming Huang,
+    Jie Wu, Hao Liu, Yangyu Huang, Yu Kang, Fangkai Yang, Ying Xin, Scarlett Li
+  venue: arXiv preprint arXiv:2602.02084
+  year: '2026'
+  links:
+    arxiv: https://arxiv.org/abs/2602.02084
+    github: https://github.com/microsoft/RPG-ZeroRepo
+    website: https://ayanami2003.github.io/RPG-Encoder/

data/papers_training_datasets.yaml

Lines changed: 9 additions & 3 deletions
@@ -1,6 +1,3 @@
-# Training Datasets
-# Auto-generated from taxonomy.tex and BibTeX file
-
 - short_name: SWE-bench-extra
   title: 'SWE-bench: Can Language Models Resolve Real-world Github Issues?'
   authors: Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei,
@@ -123,3 +120,12 @@
   venue: arXiv preprint arXiv:2601.01426 2026
   links:
     arxiv: https://arxiv.org/abs/2601.01426
+- short_name: SWE-Universe
+  title: 'SWE-Universe: Scale Real-World Verifiable Environments to Millions'
+  authors: Mouxiang Chen, Lei Zhang, Yunlong Feng, Xuwu Wang, Wenting Zhao, Ruisheng
+    Cao, Jiaxi Yang, Jiawei Chen, Mingze Li, Zeyao Ma, Hao Ge, Zongmeng Zhang, Zeyu
+    Cui, Dayiheng Liu, Jingren Zhou, Jianling Sun, Junyang Lin, Binyuan Hui
+  venue: arXiv preprint arXiv:2602.02361
+  year: '2026'
+  links:
+    arxiv: https://www.arxiv.org/abs/2602.02361

docs/about.md

Lines changed: 1 addition & 1 deletion
@@ -2,7 +2,7 @@
 
 ## About This Project
 
-Based on a systematic review of 178 papers and online resources, this project establishes a holistic theoretical framework for Issue Resolution in software engineering. This website is designed to facilitate efficient literature retrieval and exploration.
+Based on a systematic review of 181 papers and online resources, this project establishes a holistic theoretical framework for Issue Resolution in software engineering. This website is designed to facilitate efficient literature retrieval and exploration.
 
 
 ---

docs/index.md

Lines changed: 5 additions & 2 deletions
@@ -30,7 +30,7 @@
 <a href="https://github.com/DeepSoftwareAnalytics/Awesome-Issue-Resolution/graphs/contributors" target="_blank">
 <img src="https://img.shields.io/github/contributors/DeepSoftwareAnalytics/Awesome-Issue-Resolution?style=for-the-badge&color=green&logo=github" alt="Contributors">
 </a>
-<img src="https://img.shields.io/badge/papers-178-green?style=for-the-badge&logo=googlescholar&logoColor=white" alt="Papers Count">
+<img src="https://img.shields.io/badge/papers-181-green?style=for-the-badge&logo=googlescholar&logoColor=white" alt="Papers Count">
 </div>
 
 <!-- Interactive Exploration Badges -->
@@ -57,7 +57,7 @@
 
 <div class="abstract-content" markdown="1">
 
-Based on a systematic review of 178 papers and online resources, this survey establishes a holistic theoretical framework for Issue Resolution in software engineering. We examine how Large Language Models (LLMs) are transforming the automation of GitHub issue resolution. Beyond the theoretical analysis, we have curated a comprehensive collection of datasets and model training resources, which are continuously synchronized with our [GitHub repository](https://github.com/DeepSoftwareAnalytics/Awesome-Issue-Resolution) and project documentation website.
+Based on a systematic review of 181 papers and online resources, this survey establishes a holistic theoretical framework for Issue Resolution in software engineering. We examine how Large Language Models (LLMs) are transforming the automation of GitHub issue resolution. Beyond the theoretical analysis, we have curated a comprehensive collection of datasets and model training resources, which are continuously synchronized with our [GitHub repository](https://github.com/DeepSoftwareAnalytics/Awesome-Issue-Resolution) and project documentation website.
 
 **🔍 Explore This Survey:**
 
@@ -128,6 +128,7 @@ This section covers the datasets used for evaluation and training, as well as me
 * **RepoForge**: RepoForge: Training a SOTA Fast-thinking SWE Agent with an End-to-End Data Curation Pipeline Synergizing SFT and RL at Scale (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2508.01550){: target="_blank" }
 * **SWE-Mirror**: SWE-Mirror: Scaling Issue-Resolving Datasets by Mirroring Issues Across Repositories (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2509.08724){: target="_blank" }
 * **SWE-Lego**: SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2601.01426){: target="_blank" }
+* **SWE-Universe**: SWE-Universe: Scale Real-World Verifiable Environments to Millions (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://www.arxiv.org/abs/2602.02361){: target="_blank" }
 <!-- END PAPERS:training_datasets -->
 
 ### Data Collection
@@ -139,6 +140,7 @@ This section covers the datasets used for evaluation and training, as well as me
 * **SWE-MERA**: SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2507.11059){: target="_blank" }
 * **RepoForge**: RepoForge: Training a SOTA Fast-thinking SWE Agent with an End-to-End Data Curation Pipeline Synergizing SFT and RL at Scale (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2508.01550){: target="_blank" }
 * **Multi-Docker-Eval**: Multi-Docker-Eval: A `Shovel of the Gold Rush' Benchmark on Automatic Environment Building for Software Engineering (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2512.06915){: target="_blank" }
+* **DockSmith**: DockSmith: Scaling Reliable Coding Environments via an Agentic Docker Builder (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2602.00592){: target="_blank" } [![HuggingFace](https://img.shields.io/badge/HuggingFace-dataset-ff7e21?logo=huggingface&logoColor=white)](https://huggingface.co/collections/8sj7df9k8m5x8/docksmith){: target="_blank" }
 <!-- END PAPERS:data_collection -->
 
 ### Data Synthesis
@@ -251,6 +253,7 @@ This section covers both training-free and training-based methods for issue reso
 * **InfCode**: InfCode: Adversarial Iterative Refinement of Tests and Patches for Reliable Software Issue Resolution (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2511.16004){: target="_blank" }
 * **GraphLocator**: GraphLocator: Graph-guided Causal Reasoning for Issue Localization (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2512.22469){: target="_blank" }
 * **SWE-Tester**: SWE-Tester: Training Open-Source LLMs for Issue Reproduction in Real-World Repositories (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2601.13713){: target="_blank" }
+* **Closing the Loop**: Closing the Loop: Universal Repository Representation with RPG-Encoder (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2602.02084){: target="_blank" } [![Website](https://img.shields.io/badge/Website-paper-5B9BD5?logo=googlechrome&logoColor=white)](https://ayanami2003.github.io/RPG-Encoder/){: target="_blank" } [![GitHub](https://img.shields.io/badge/GitHub-repo-24292F?logo=github&logoColor=white)](https://github.com/microsoft/RPG-ZeroRepo){: target="_blank" }
 <!-- END PAPERS:tool -->
 
 #### Memory

site/about/index.html

Lines changed: 1 addition & 1 deletion
@@ -704,7 +704,7 @@
 
 <h1 id="about">About<a class="headerlink" href="#about" title="Permanent link">&para;</a></h1>
 <h2 id="about-this-project">About This Project<a class="headerlink" href="#about-this-project" title="Permanent link">&para;</a></h2>
-<p>Based on a systematic review of 178 papers and online resources, this project establishes a holistic theoretical framework for Issue Resolution in software engineering. This website is designed to facilitate efficient literature retrieval and exploration.</p>
+<p>Based on a systematic review of 181 papers and online resources, this project establishes a holistic theoretical framework for Issue Resolution in software engineering. This website is designed to facilitate efficient literature retrieval and exploration.</p>
 <hr />
 <h2 id="key-features">Key Features<a class="headerlink" href="#key-features" title="Permanent link">&para;</a></h2>
 <ul>
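
Review note: this commit bumps the paper count from 178 to 181 in several places by hand-editable text (the shields.io badge URL, the abstract sentence, and the "Total: N works" line). A minimal sketch of a consistency check follows, assuming the three wordings shown in the README.md hunks above. The names `COUNT_PATTERNS`, `stated_counts`, and `counts_agree` are hypothetical and are not part of the repository's tooling.

```python
import re

# Hypothetical patterns for the three places the README states the paper count.
COUNT_PATTERNS = [
    re.compile(r"badge/papers-(\d+)-"),                   # shields.io badge URL
    re.compile(r"systematic review of \**(\d+) papers"),  # abstract sentence (bold optional)
    re.compile(r"Total: (\d+) works"),                    # paper-list summary line
]


def stated_counts(text: str) -> list[int]:
    """Return every paper count stated in `text`, grouped pattern by pattern."""
    return [int(m.group(1)) for p in COUNT_PATTERNS for m in p.finditer(text)]


def counts_agree(text: str) -> bool:
    """True when no two stated counts differ (text with no counts also passes)."""
    return len(set(stated_counts(text))) <= 1
```

Running `counts_agree` over each edited file (README.md, docs/about.md, docs/index.md) before committing would flag a file where one occurrence was left at the old value.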
