Skip to content

Pull requests: AISBench/benchmark

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

refactor(swebp): 重构swebp镜像操作逻辑,优化镜像管理粒度 refactor
#385 opened Jun 30, 2026 by ivanbao9783 Collaborator Loading…
1 of 15 tasks
bugfix: Support for extracting answers expressed in thousand separato… bugfix
#360 opened Jun 24, 2026 by yejj710 Collaborator Loading…
1 of 9 tasks
test ai
#352 opened Jun 23, 2026 by wenba0 Collaborator Loading…
15 tasks
[Docs] Add recommand instruction for custom configs docs
#349 opened Jun 22, 2026 by SJTUyh Collaborator Loading…
1 of 15 tasks
Update answer pattern for COT chat prompt
#347 opened Jun 18, 2026 by F0undLinks Loading…
7 tasks
Update cmmlu_gen_5_shot_cot_chat_prompt.py
#346 opened Jun 18, 2026 by F0undLinks Loading…
7 tasks
【Feature】Supports rerunning specified use cases on SWE dataset
#331 opened Jun 8, 2026 by yejj710 Collaborator Loading…
6 tasks
Fix the issue where TTFT and TPOT have no data when running Kimi2.5 i…
#210 opened Mar 21, 2026 by GaoHuaZhang Collaborator Loading…
15 tasks
[UT] Add new UT for Gedit feature test-cases
#163 opened Mar 5, 2026 by SJTUyh Collaborator Loading…
1 of 15 tasks
[feature] [sub feature 2] Dependency for qwen image edit run feature
#151 opened Feb 13, 2026 by SJTUyh Collaborator Loading…
1 of 15 tasks
[feature] [sub feature 3] Support qwen Image edit infer with gedit dataset feature
#150 opened Feb 13, 2026 by SJTUyh Collaborator Loading…
1 of 15 tasks
【TEST】补充math和agieval数据集的冒烟用例 test-cases
#145 opened Feb 11, 2026 by GaoHuaZhang Collaborator Loading…
1 of 15 tasks
local eval add mindformers model
#110 opened Jan 15, 2026 by muqing-li Loading…
1 of 15 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.