Skip to content

Commit bceb025

Browse files
committed
docs: update papers
1 parent cb2e9c3 commit bceb025

File tree

11 files changed

+47
-18
lines changed

11 files changed

+47
-18
lines changed

README.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@
1212
[![Hugging Face](https://img.shields.io/badge/HF_Paper-2601.11655-FFD21E?style=for-the-badge&logo=huggingface&logoColor=000)](https://huggingface.co/papers/2601.11655)
1313
[![Tables](https://img.shields.io/badge/TABLES-Statistics-blue?style=for-the-badge&logo=databricks)](https://deepsoftwareanalytics.github.io/Awesome-Issue-Resolution/tables/)
1414
[![Contributors](https://img.shields.io/github/contributors/DeepSoftwareAnalytics/Awesome-Issue-Resolution?style=for-the-badge&color=green&logo=github)](https://github.com/DeepSoftwareAnalytics/Awesome-Issue-Resolution/graphs/contributors)
15-
![Papers Count](https://img.shields.io/badge/papers-183-green?style=for-the-badge&logo=googlescholar&logoColor=white)
15+
![Papers Count](https://img.shields.io/badge/papers-186-green?style=for-the-badge&logo=googlescholar&logoColor=white)
1616

1717
[**📖 Documentation Website**](https://deepsoftwareanalytics.github.io/Awesome-Issue-Resolution/) | [**📄 Full Paper**](https://deepsoftwareanalytics.github.io/Awesome-Issue-Resolution/paper/) | [**📋 Tables & Resources**](https://deepsoftwareanalytics.github.io/Awesome-Issue-Resolution/tables/)
1818

@@ -32,7 +32,7 @@
3232

3333
## 📖 Abstract
3434

35-
Based on a systematic review of **183 papers and online resources**, this survey establishes a holistic theoretical framework for Issue Resolution in software engineering. We examine how **Large Language Models (LLMs)** are transforming the automation of GitHub issue resolution. Beyond the theoretical analysis, we have curated a comprehensive collection of datasets and model training resources, which are continuously synchronized with our GitHub repository and project documentation website.
35+
Based on a systematic review of **186 papers and online resources**, this survey establishes a holistic theoretical framework for Issue Resolution in software engineering. We examine how **Large Language Models (LLMs)** are transforming the automation of GitHub issue resolution. Beyond the theoretical analysis, we have curated a comprehensive collection of datasets and model training resources, which are continuously synchronized with our GitHub repository and project documentation website.
3636

3737
<!-- START EXPLORE -->
3838
**🔍 Explore This Survey:**
@@ -67,7 +67,7 @@ Based on a systematic review of **183 papers and online resources**, this survey
6767
## 📚 Complete Paper List
6868

6969

70-
> **Total: 183 works** across 14 categories
70+
> **Total: 186 works** across 14 categories
7171
7272

7373
### 📊 Evaluation Datasets
@@ -98,6 +98,7 @@ Based on a systematic review of **183 papers and online resources**, this survey
9898
- **SWE-fficiency**: SWE-fficiency: Can Language Models Optimize Real-World Repositories on Real Workloads? (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2511.06090)
9999
- **SWE-Compass**: SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2511.05459)
100100
- **SWE-EVO**: SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2512.18470)
101+
- **SWE Context Bench**: SWE Context Bench: A Benchmark for Context Learning in Coding (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/pdf/2602.08316)
101102

102103
### 🎯 Training Datasets
103104

@@ -251,6 +252,7 @@ Based on a systematic review of **183 papers and online resources**, this survey
251252
- **SWE-Lego**: SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2601.01426)
252253
- **Agentic Rubrics**: Agentic Rubrics as Contextual Verifiers for SWE Agents (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2601.04171)
253254
- **CGM**: Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2505.16901) [![GitHub](https://img.shields.io/badge/GitHub-repo-24292F?logo=github&logoColor=white)](https://github.com/codefuse-ai/CodeFuse-CGM) [![HuggingFace](https://img.shields.io/badge/HuggingFace-dataset-ff7e21?logo=huggingface&logoColor=white)](https://huggingface.co/codefuse-ai/CodeFuse-CGM-72B)
255+
- **SWE-Replay**: SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2601.22129)
254256

255257
### 🎮 Reinforcement Learning (RL)
256258

@@ -293,6 +295,7 @@ Based on a systematic review of **183 papers and online resources**, this survey
293295
- **LongCat-Flash-Think**: Introducing LongCat-Flash-Thinking: A Technical Report (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2509.18883)
294296
- **MiMo-V2-Flash**: MiMo-V2-Flash Technical Report (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2601.02780)
295297
- **SWE-Master**: SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2602.03411) [![GitHub](https://img.shields.io/badge/GitHub-repo-24292F?logo=github&logoColor=white)](https://github.com/RUCAIBox/SWE-Master)
298+
- **SWE-Protégé**: SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2602.22124)
296299

297300
### ⚡ Inference-Time Scaling
298301

data/papers_evaluation_datasets.yaml

Lines changed: 7 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,3 @@
1-
# Evaluation Datasets
2-
# Auto-generated from taxonomy.tex and BibTeX file
3-
41
- short_name: SWE-bench Lite
52
title: 'SWE-bench: Can Language Models Resolve Real-world Github Issues?'
63
authors: Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei,
@@ -181,3 +178,10 @@
181178
venue: arXiv preprint arXiv:2512.18470 2025
182179
links:
183180
arxiv: https://arxiv.org/abs/2512.18470
181+
- short_name: SWE Context Bench
182+
title: 'SWE Context Bench: A Benchmark for Context Learning in Coding'
183+
authors: Jared Zhu, Minhao Hu, Junde Wu
184+
venue: arxiv
185+
year: '2026'
186+
links:
187+
arxiv: https://arxiv.org/pdf/2602.08316

data/papers_rl.yaml

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -306,3 +306,12 @@
306306
links:
307307
arxiv: https://arxiv.org/abs/2602.03411
308308
github: https://github.com/RUCAIBox/SWE-Master
309+
- short_name: SWE-Protégé
310+
title: 'SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks
311+
Small Language Models as Software Engineering Agents'
312+
authors: Patrick Tser Jern Kon, Archana Pradeep, Ang Chen, Alexander P. Ellis, Warren
313+
Hunt, Zijian Wang, John Yang, Samuel Thompson
314+
venue: arxiv
315+
year: '2026'
316+
links:
317+
arxiv: https://arxiv.org/abs/2602.22124

data/papers_sft.yaml

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -129,3 +129,10 @@
129129
arxiv: https://arxiv.org/abs/2505.16901
130130
github: https://github.com/codefuse-ai/CodeFuse-CGM
131131
huggingface: https://huggingface.co/codefuse-ai/CodeFuse-CGM-72B
132+
- short_name: SWE-Replay
133+
title: 'SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents'
134+
authors: Yifeng Ding, Lingming Zhang
135+
venue: arxiv
136+
year: '2026'
137+
links:
138+
arxiv: https://arxiv.org/abs/2601.22129

docs/about.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22

33
## About This Project
44

5-
Based on a systematic review of 183 papers and online resources, this project establishes a holistic theoretical framework for Issue Resolution in software engineering. This website is designed to facilitate efficient literature retrieval and exploration.
5+
Based on a systematic review of 186 papers and online resources, this project establishes a holistic theoretical framework for Issue Resolution in software engineering. This website is designed to facilitate efficient literature retrieval and exploration.
66

77

88
---

docs/index.md

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -30,7 +30,7 @@
3030
<a href="https://github.com/DeepSoftwareAnalytics/Awesome-Issue-Resolution/graphs/contributors" target="_blank">
3131
<img src="https://img.shields.io/github/contributors/DeepSoftwareAnalytics/Awesome-Issue-Resolution?style=for-the-badge&color=green&logo=github" alt="Contributors">
3232
</a>
33-
<img src="https://img.shields.io/badge/papers-183-green?style=for-the-badge&logo=googlescholar&logoColor=white" alt="Papers Count">
33+
<img src="https://img.shields.io/badge/papers-186-green?style=for-the-badge&logo=googlescholar&logoColor=white" alt="Papers Count">
3434
</div>
3535

3636
<!-- Interactive Exploration Badges -->
@@ -57,7 +57,7 @@
5757

5858
<div class="abstract-content" markdown="1">
5959

60-
Based on a systematic review of 183 papers and online resources, this survey establishes a holistic theoretical framework for Issue Resolution in software engineering. We examine how Large Language Models (LLMs) are transforming the automation of GitHub issue resolution. Beyond the theoretical analysis, we have curated a comprehensive collection of datasets and model training resources, which are continuously synchronized with our [GitHub repository](https://github.com/DeepSoftwareAnalytics/Awesome-Issue-Resolution) and project documentation website.
60+
Based on a systematic review of 186 papers and online resources, this survey establishes a holistic theoretical framework for Issue Resolution in software engineering. We examine how Large Language Models (LLMs) are transforming the automation of GitHub issue resolution. Beyond the theoretical analysis, we have curated a comprehensive collection of datasets and model training resources, which are continuously synchronized with our [GitHub repository](https://github.com/DeepSoftwareAnalytics/Awesome-Issue-Resolution) and project documentation website.
6161

6262
**🔍 Explore This Survey:**
6363

@@ -108,6 +108,7 @@ This section covers the datasets used for evaluation and training, as well as me
108108
* **SWE-fficiency**: SWE-fficiency: Can Language Models Optimize Real-World Repositories on Real Workloads? (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2511.06090){: target="_blank" }
109109
* **SWE-Compass**: SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2511.05459){: target="_blank" }
110110
* **SWE-EVO**: SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2512.18470){: target="_blank" }
111+
* **SWE Context Bench**: SWE Context Bench: A Benchmark for Context Learning in Coding (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/pdf/2602.08316){: target="_blank" }
111112
<!-- END PAPERS:evaluation_datasets -->
112113

113114
### Training Datasets
@@ -305,6 +306,7 @@ This section covers both training-free and training-based methods for issue reso
305306
* **SWE-Lego**: SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2601.01426){: target="_blank" }
306307
* **Agentic Rubrics**: Agentic Rubrics as Contextual Verifiers for SWE Agents (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2601.04171){: target="_blank" }
307308
* **CGM**: Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2505.16901){: target="_blank" } [![GitHub](https://img.shields.io/badge/GitHub-repo-24292F?logo=github&logoColor=white)](https://github.com/codefuse-ai/CodeFuse-CGM){: target="_blank" } [![HuggingFace](https://img.shields.io/badge/HuggingFace-dataset-ff7e21?logo=huggingface&logoColor=white)](https://huggingface.co/codefuse-ai/CodeFuse-CGM-72B){: target="_blank" }
309+
* **SWE-Replay**: SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2601.22129){: target="_blank" }
308310
<!-- END PAPERS:sft -->
309311

310312
#### RL-based Methods
@@ -347,6 +349,7 @@ This section covers both training-free and training-based methods for issue reso
347349
* **LongCat-Flash-Think**: Introducing LongCat-Flash-Thinking: A Technical Report (2025) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2509.18883){: target="_blank" }
348350
* **MiMo-V2-Flash**: MiMo-V2-Flash Technical Report (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2601.02780){: target="_blank" }
349351
* **SWE-Master**: SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2602.03411){: target="_blank" } [![GitHub](https://img.shields.io/badge/GitHub-repo-24292F?logo=github&logoColor=white)](https://github.com/RUCAIBox/SWE-Master){: target="_blank" }
352+
* **SWE-Protégé**: SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents (2026) [![arXiv](https://img.shields.io/badge/arXiv-paper-B31B1B?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2602.22124){: target="_blank" }
350353
<!-- END PAPERS:rl -->
351354

352355
---

site/about/index.html

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -704,7 +704,7 @@
704704

705705
<h1 id="about">About<a class="headerlink" href="#about" title="Permanent link">&para;</a></h1>
706706
<h2 id="about-this-project">About This Project<a class="headerlink" href="#about-this-project" title="Permanent link">&para;</a></h2>
707-
<p>Based on a systematic review of 183 papers and online resources, this project establishes a holistic theoretical framework for Issue Resolution in software engineering. This website is designed to facilitate efficient literature retrieval and exploration.</p>
707+
<p>Based on a systematic review of 186 papers and online resources, this project establishes a holistic theoretical framework for Issue Resolution in software engineering. This website is designed to facilitate efficient literature retrieval and exploration.</p>
708708
<hr />
709709
<h2 id="key-features">Key Features<a class="headerlink" href="#key-features" title="Permanent link">&para;</a></h2>
710710
<ul>

0 commit comments

Comments
 (0)