Skip to content

Commit afabc28

Browse files
committed
refactor: migrate 15 papers from deprecated categories to functional categories
Migrated from benchmarks/surveys/empirical_studies into: - issue_resolution, code_generation, environment_building, terminal, feature_development, automated_data_science, code_executing_game 38 papers skipped (already in other categories) 12 papers filtered (not relevant by LLM)
1 parent eece243 commit afabc28

7 files changed

Lines changed: 1678 additions & 1671 deletions
Lines changed: 37 additions & 27 deletions
Original file line numberDiff line numberDiff line change
@@ -1,31 +1,41 @@
1-
- title: "DeepAnalyze: Agentic Large Language Models for Autonomous Data Science"
2-
authors: "Shaolei Zhang, Ju Fan, Meihao Fan, Guoliang Li, Xiaoyong Du"
3-
venue: "arXiv 2025"
1+
- title: 'LLM-Based Data Science Agents: A Survey of Capabilities, Challenges, and Future Directions'
2+
authors: Mizanur Rahman, Amran Bhuiyan, Mohammed Saidul Islam, Md Tahmid Rahman Laskar, Ridwan Mahbub, Ahmed Masry, Shafiq
3+
Joty, Enamul Hoque
4+
venue: arXiv 2025
5+
tags:
6+
- survey
47
links:
5-
paper: "https://arxiv.org/abs/2510.16872"
6-
github: "https://github.com/ruc-datalab/DeepAnalyze"
7-
website: "https://ruc-deepanalyze.github.io/"
8-
9-
- title: "WebDS: An End-to-End Benchmark for Web-based Data Science"
10-
authors: "Ethan Hsu, Hong Meng Yam, Ines Bouissou, Aaron Murali John, Raj Thota, Josh Koe, Vivek Sarath Putta, G K Dharesan, Alexander Spangher, Shikhar Murty, Tenghao Huang, Christopher D. Manning"
11-
venue: "arXiv 2025"
8+
paper: https://arxiv.org/abs/2510.04023
9+
github: ''
10+
website: ''
11+
- title: 'DeepAnalyze: Agentic Large Language Models for Autonomous Data Science'
12+
authors: Shaolei Zhang, Ju Fan, Meihao Fan, Guoliang Li, Xiaoyong Du
13+
venue: arXiv 2025
1214
links:
13-
paper: "https://arxiv.org/abs/2508.01222"
14-
github: ""
15-
website: ""
16-
17-
- title: "AutoMind: Adaptive Knowledgeable Agent for Automated Data Science"
18-
authors: "Yixin Ou, Yujie Luo, Jingsheng Zheng, Lanning Wei, Zhuoyun Yu, Shuofei Qiao, Jintian Zhang, Da Zheng, Yuren Mao, Yunjun Gao, Huajun Chen, Ningyu Zhang"
19-
venue: "arXiv 2025"
15+
paper: https://arxiv.org/abs/2510.16872
16+
github: https://github.com/ruc-datalab/DeepAnalyze
17+
website: https://ruc-deepanalyze.github.io/
18+
- title: 'WebDS: An End-to-End Benchmark for Web-based Data Science'
19+
authors: Ethan Hsu, Hong Meng Yam, Ines Bouissou, Aaron Murali John, Raj Thota, Josh Koe, Vivek Sarath Putta, G K Dharesan,
20+
Alexander Spangher, Shikhar Murty, Tenghao Huang, Christopher D. Manning
21+
venue: arXiv 2025
2022
links:
21-
paper: "https://arxiv.org/abs/2506.10974"
22-
github: "https://github.com/innovatingAI/AutoMind"
23-
website: "https://innovatingai.github.io/"
24-
25-
- title: "DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models"
26-
authors: "Yiming Huang, Jianwen Luo, Yan Yu, Yitong Zhang, Fangyu Lei, Yifan Wei, Shizhu He, Lifu Huang, Xiao Liu, Jun Zhao, Kang Liu"
27-
venue: "EMNLP 2024"
23+
paper: https://arxiv.org/abs/2508.01222
24+
github: ''
25+
website: ''
26+
- title: 'AutoMind: Adaptive Knowledgeable Agent for Automated Data Science'
27+
authors: Yixin Ou, Yujie Luo, Jingsheng Zheng, Lanning Wei, Zhuoyun Yu, Shuofei Qiao, Jintian Zhang, Da Zheng, Yuren Mao,
28+
Yunjun Gao, Huajun Chen, Ningyu Zhang
29+
venue: arXiv 2025
2830
links:
29-
paper: "https://aclanthology.org/2024.emnlp-main.748/"
30-
github: "https://github.com/yiyihum/da-code"
31-
website: "https://da-code-bench.github.io/"
31+
paper: https://arxiv.org/abs/2506.10974
32+
github: https://github.com/innovatingAI/AutoMind
33+
website: https://innovatingai.github.io/
34+
- title: 'DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models'
35+
authors: Yiming Huang, Jianwen Luo, Yan Yu, Yitong Zhang, Fangyu Lei, Yifan Wei, Shizhu He, Lifu Huang, Xiao Liu, Jun Zhao,
36+
Kang Liu
37+
venue: EMNLP 2024
38+
links:
39+
paper: https://aclanthology.org/2024.emnlp-main.748/
40+
github: https://github.com/yiyihum/da-code
41+
website: https://da-code-bench.github.io/
Lines changed: 36 additions & 27 deletions
Original file line numberDiff line numberDiff line change
@@ -1,31 +1,40 @@
1-
- title: "Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents"
2-
authors: "Zihao Wang, Xujing Li, Yining Ye, Junjie Fang, Haoming Wang, Longxiang Liu, Shihao Liang, Junting Lu, Zhiyong Wu, Jiazhan Feng, Wanjun Zhong, Zili Li, Yu Wang, Yu Miao, Bo Zhou, Yuanfan Li, Hao Wang, Zhongkai Zhao, Faming Wu, Zhengxuan Jiang, Weihao Tan, Heyuan Yao, Shi Yan, Xiangyang Li, Yitao Liang, Yujia Qin, Guang Shi"
3-
venue: "arXiv 2025"
1+
- title: Develop AI Agents for System Engineering in Factorio
2+
authors: Neel Kant
3+
venue: arXiv 2025
4+
tags:
5+
- position
6+
- survey
47
links:
5-
paper: "https://arxiv.org/abs/2510.23691"
6-
github: ""
7-
website: ""
8-
9-
- title: "One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration"
10-
authors: "Zaid Khan, Archiki Prasad, Elias Stengel-Eskin, Jaemin Cho, Mohit Bansal"
11-
venue: "arXiv 2025"
8+
paper: https://arxiv.org/abs/2502.01492
9+
github: ''
10+
website: ''
11+
- title: 'Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents'
12+
authors: Zihao Wang, Xujing Li, Yining Ye, Junjie Fang, Haoming Wang, Longxiang Liu, Shihao Liang, Junting Lu, Zhiyong Wu,
13+
Jiazhan Feng, Wanjun Zhong, Zili Li, Yu Wang, Yu Miao, Bo Zhou, Yuanfan Li, Hao Wang, Zhongkai Zhao, Faming Wu, Zhengxuan
14+
Jiang, Weihao Tan, Heyuan Yao, Shi Yan, Xiangyang Li, Yitao Liang, Yujia Qin, Guang Shi
15+
venue: arXiv 2025
1216
links:
13-
paper: "https://arxiv.org/abs/2510.12088"
14-
github: ""
15-
website: ""
16-
17-
- title: "PoE-World: Compositional World Modeling with Products of Programmatic Experts"
18-
authors: "Wasu Top Piriyakulkij, Yichao Liang, Hao Tang, Adrian Weller, Marta Kryven, Kevin Ellis"
19-
venue: "arXiv 2025"
17+
paper: https://arxiv.org/abs/2510.23691
18+
github: ''
19+
website: ''
20+
- title: 'One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration'
21+
authors: Zaid Khan, Archiki Prasad, Elias Stengel-Eskin, Jaemin Cho, Mohit Bansal
22+
venue: arXiv 2025
2023
links:
21-
paper: "https://arxiv.org/abs/2505.10819"
22-
github: ""
23-
website: ""
24-
25-
- title: "WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment"
26-
authors: "Hao Tang, Darren Yan Key, Kevin Ellis"
27-
venue: "NeurIPS 2024"
24+
paper: https://arxiv.org/abs/2510.12088
25+
github: ''
26+
website: ''
27+
- title: 'PoE-World: Compositional World Modeling with Products of Programmatic Experts'
28+
authors: Wasu Top Piriyakulkij, Yichao Liang, Hao Tang, Adrian Weller, Marta Kryven, Kevin Ellis
29+
venue: arXiv 2025
2830
links:
29-
paper: "https://openreview.net/forum?id=QGJSXMhVaL"
30-
github: "https://github.com/haotang1995/WorldCoder"
31-
website: "https://haotang1995.github.io/projects/worldcoder"
31+
paper: https://arxiv.org/abs/2505.10819
32+
github: ''
33+
website: ''
34+
- title: 'WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment'
35+
authors: Hao Tang, Darren Yan Key, Kevin Ellis
36+
venue: NeurIPS 2024
37+
links:
38+
paper: https://openreview.net/forum?id=QGJSXMhVaL
39+
github: https://github.com/haotang1995/WorldCoder
40+
website: https://haotang1995.github.io/projects/worldcoder

0 commit comments

Comments
 (0)