You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -392,6 +392,11 @@ This section includes research works that provide in-depth analysis and discussi
392
392
***Agents in the Wild** (2025) [](https://insights.logicstar.ai/){: target="_blank" }
Copy file name to clipboardExpand all lines: app/docs/news.md
+3-4Lines changed: 3 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,16 +6,15 @@
6
6
7
7
<!-- START_RECENT_PAPERS -->
8
8
-**BeyondSWE**: BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing? [](https://arxiv.org/abs/2603.03194)[](https://github.com/AweAI-Team/BeyondSWE)[](https://huggingface.co/datasets/AweAI-Team/BeyondSWE)[](https://aweai-team.github.io/BeyondSWE/)
9
+
-**MobileDev-Bench**: MobileDev-Bench: A Comprehensive Benchmark for Evaluating Language Models on Mobile Application Development [](https://arxiv.org/abs/2603.24946)
9
10
-**RepoRepair**: RepoRepair: Leveraging Code Documentation for Repository-Level Automated Program Repair [](https://arxiv.org/abs/2603.01048)[](https://github.com/ZhongQiangDev/RepoRepair)
10
11
-**SWE-Adept**: SWE-Adept: An LLM-Based Agentic Framework for Deep Codebase Analysis and Structured Issue Resolution [](https://arxiv.org/abs/2603.01327)
-**SWE-CI**: SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration [](https://arxiv.org/abs/2603.03823)[](https://github.com/SKYLENAGE-AI/SWE-CI)[](https://huggingface.co/datasets/skylenage/SWE-CI)
14
+
-**SWE-Fuse**: SWE-Fuse: Empowering Software Agents via Issue-free Trajectory Learning and Entropy-aware RLVR Training [](https://arxiv.org/abs/2603.07927)
15
+
-**SWE-Next**: SWE-Next: Scalable Real-World Software Engineering Tasks for Agents [](https://arxiv.org/abs/2603.20691)[](https://github.com/TIGER-AI-Lab/SWE-Next)
13
16
-**SWE-Skills-Bench**: SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering? [](https://arxiv.org/abs/2603.15401)[](https://github.com/GeniusHTX/SWE-Skills-Bench)
14
17
-**OpenSWE**: daVinci-Env: Open SWE Environment Synthesis at Scale [](https://arxiv.org/abs/2603.13023)[](https://github.com/GAIR-NLP/OpenSWE)[](https://huggingface.co/datasets/GAIR/OpenSWE)
15
-
-**DockSmith**: DockSmith: Scaling Reliable Coding Environments via an Agentic Docker Builder [](https://arxiv.org/abs/2602.00592)[](https://huggingface.co/collections/8sj7df9k8m5x8/docksmith)
16
-
-**SWE Context Bench**: SWE Context Bench: A Benchmark for Context Learning in Coding [](https://arxiv.org/pdf/2602.08316)
17
-
-**SWE-Master**: SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training [](https://arxiv.org/abs/2602.03411)[](https://github.com/RUCAIBox/SWE-Master)
18
-
-**SWE-World**: SWE-World: Building Software Engineering Agents in Docker-Free Environments [](https://arxiv.org/abs/2602.03419)[](https://github.com/RUCAIBox/SWE-World)
0 commit comments