@@ -12,7 +12,7 @@
 [](https://huggingface.co/papers/2601.11655)
 [](https://deepsoftwareanalytics.github.io/Awesome-Issue-Resolution/tables/)
 [](https://github.com/DeepSoftwareAnalytics/Awesome-Issue-Resolution/graphs/contributors)
-
+
 
 [**📖 Documentation Website**](https://deepsoftwareanalytics.github.io/Awesome-Issue-Resolution/) | [**📄 Full Paper**](https://deepsoftwareanalytics.github.io/Awesome-Issue-Resolution/paper/) | [**📋 Tables & Resources**](https://deepsoftwareanalytics.github.io/Awesome-Issue-Resolution/tables/)
 
@@ -32,7 +32,7 @@
 
 ## 📖 Abstract
 
-Based on a systematic review of **183 papers and online resources**, this survey establishes a holistic theoretical framework for Issue Resolution in software engineering. We examine how **Large Language Models (LLMs)** are transforming the automation of GitHub issue resolution. Beyond the theoretical analysis, we have curated a comprehensive collection of datasets and model training resources, which are continuously synchronized with our GitHub repository and project documentation website.
+Based on a systematic review of **186 papers and online resources**, this survey establishes a holistic theoretical framework for Issue Resolution in software engineering. We examine how **Large Language Models (LLMs)** are transforming the automation of GitHub issue resolution. Beyond the theoretical analysis, we have curated a comprehensive collection of datasets and model training resources, which are continuously synchronized with our GitHub repository and project documentation website.
 
 <!-- START EXPLORE -->
 **🔍 Explore This Survey:**
@@ -67,7 +67,7 @@ Based on a systematic review of **183 papers and online resources**, this survey
 ## 📚 Complete Paper List
 
 
-> **Total: 183 works** across 14 categories
+> **Total: 186 works** across 14 categories
 
 
 ### 📊 Evaluation Datasets
@@ -98,6 +98,7 @@ Based on a systematic review of **183 papers and online resources**, this survey
 - **SWE-fficiency**: SWE-fficiency: Can Language Models Optimize Real-World Repositories on Real Workloads? (2025) [](https://arxiv.org/abs/2511.06090)
 - **SWE-Compass**: SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models (2025) [](https://arxiv.org/abs/2511.05459)
 - **SWE-EVO**: SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios (2025) [](https://arxiv.org/abs/2512.18470)
+- **SWE Context Bench**: SWE Context Bench: A Benchmark for Context Learning in Coding (2026) [](https://arxiv.org/pdf/2602.08316)
 
 ### 🎯 Training Datasets
 
@@ -251,6 +252,7 @@ Based on a systematic review of **183 papers and online resources**, this survey
 - **SWE-Lego**: SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving (2026) [](https://arxiv.org/abs/2601.01426)
 - **Agentic Rubrics**: Agentic Rubrics as Contextual Verifiers for SWE Agents (2026) [](https://arxiv.org/abs/2601.04171)
 - **CGM**: Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks (2025) [](https://arxiv.org/abs/2505.16901) [](https://github.com/codefuse-ai/CodeFuse-CGM) [](https://huggingface.co/codefuse-ai/CodeFuse-CGM-72B)
+- **SWE-Replay**: SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents (2026) [](https://arxiv.org/abs/2601.22129)
 
 ### 🎮 Reinforcement Learning (RL)
 
@@ -293,6 +295,7 @@ Based on a systematic review of **183 papers and online resources**, this survey
 - **LongCat-Flash-Think**: Introducing LongCat-Flash-Thinking: A Technical Report (2025) [](https://arxiv.org/abs/2509.18883)
 - **MiMo-V2-Flash**: MiMo-V2-Flash Technical Report (2026) [](https://arxiv.org/abs/2601.02780)
 - **SWE-Master**: SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training (2026) [](https://arxiv.org/abs/2602.03411) [](https://github.com/RUCAIBox/SWE-Master)
+- **SWE-Protégé**: SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents (2026) [](https://arxiv.org/abs/2602.22124)
 
 ### ⚡ Inference-Time Scaling
 