|
91 | 91 | <!-- START PAPERS SUMMARY --> |
92 | 92 | 🔥 **We are actively tracking the frontier research of code agents.**<br> |
93 | 93 | 🧹 *We periodically curate our collection, retaining only published papers and interesting arXiv preprints from the last six months.*<br> |
94 | | -📚 *Currently collected:* **`494` papers** — *(Last update: 2026-06-01)* |
| 94 | +📚 *Currently collected:* **`498` papers** — *(Last update: 2026-06-02)* |
95 | 95 | <!-- END PAPERS SUMMARY --> |
96 | 96 |
|
97 | 97 | <!-- - [🚀 Products & Tools](#-products--tools) |
@@ -1464,6 +1464,14 @@ This includes OS kernel code, runtime systems, device drivers, and system-level |
1464 | 1464 | > Autonomous agents for solving SQL challenges in real-world database systems (_e.g_., query generation and optimization, issue resolution). |
1465 | 1465 |
|
1466 | 1466 | <!-- START PAPERS:sql_engineering --> |
| 1467 | +- **BADGER: Bridging Agentic and Deterministic Evaluation for Generative Enterprise Reasoning.** |
| 1468 | + _Shannon Serrao, Soumitra Chatterjee, Dorina Strori, Abhishek Sharma, Nathan Miller._ arXiv 2026/06. |
| 1469 | + [](https://arxiv.org/abs/2606.02109)  |
| 1470 | + |
| 1471 | +- **Learning to Retrieve: Dual-Level Long-Term Memory for Text-to-SQL Agents.** |
| 1472 | + _Yibo Wang, Nikki Lijing Kuang, Philip S. Yu, Zhewei Yao, Yuxiong He._ arXiv 2026/05. |
| 1473 | + [](https://arxiv.org/abs/2606.00547) |
| 1474 | + |
1467 | 1475 | - **Rethinking Agentic Workflows: Evaluating Inference-Based Test-Time Scaling Strategies in Text2SQL Tasks.** |
1468 | 1476 | _Jiajing Guo, Kenil Patel, Jorge Piazentin Ono, Wenbin He, Liu Ren._ arXiv 2025/10. |
1469 | 1477 | [](https://arxiv.org/abs/2510.10885)  |
@@ -1517,6 +1525,10 @@ This category covers Verilog/VHDL/RTL, FPGA kernels, and hardware–software co- |
1517 | 1525 | > Code agents for the automated creation and maintenance of web interfaces and front-end components. |
1518 | 1526 |
|
1519 | 1527 | <!-- START PAPERS:website_generation --> |
| 1528 | +- **I-WebGenBench : Evaluating Interactivity in LLM-Generated Scientific Web Applications.** |
| 1529 | + _Dasen Dai, Biao Wu, Meng Fang, Shuoqi Li, Wenhao Wang._ arXiv 2026/05. |
| 1530 | + [](https://arxiv.org/abs/2606.00750)  |
| 1531 | + |
1520 | 1532 | - **ReLook: Vision-Grounded RL with a Multimodal LLM Critic for Agentic Web Coding.** |
1521 | 1533 | _Yuhang Li, Chenchen Zhang, Ruilin Lv, Ao Liu, Ken Deng, Yuanxing Zhang, Jiaheng Liu, Wiggin Zhou, Bo Zhou._ arXiv 2025/10. |
1522 | 1534 | [](https://arxiv.org/abs/2510.11498)  |
@@ -1803,6 +1815,10 @@ This category covers Verilog/VHDL/RTL, FPGA kernels, and hardware–software co- |
1803 | 1815 | > Agents designed to autonomously utilize specialized scientific software—such as simulation engines, data analysis suites, and visualization platforms—to automate and enhance domain-specific scientific workflows. |
1804 | 1816 |
|
1805 | 1817 | <!-- START PAPERS:scientific_workflows --> |
| 1818 | +- **MPMWorlds: Material-Point-Method Simulations for Inferring and Extrapolating Physical Dynamics.** |
| 1819 | + _Žiga Kovačič, Kevin Ellis._ arXiv 2026/06. |
| 1820 | + [](https://arxiv.org/abs/2606.01538)   |
| 1821 | + |
1806 | 1822 | - **ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows.** |
1807 | 1823 | _Qiushi Sun, Zhoumianze Liu, Chang Ma, Zichen Ding, Fangzhi Xu, Zhangyue Yin, Haiteng Zhao, Zhenyu Wu, Kanzhi Cheng, Zhaoyang Liu, et al._ arXiv 2025/06. |
1808 | 1824 | [](https://arxiv.org/abs/2505.19897) [](https://github.com/OS-Copilot/ScienceBoard) [](https://qiushisun.github.io/ScienceBoard-Home/)  |
|
0 commit comments