Skip to content

Commit eefd274

Browse files
erwardenaarclaude
andcommitted
Strip size suffixes from model names before arXiv search
Queries like all:"OLMo 2 32B" return no results because papers use the base name ("OLMo 2"), not the size-qualified form. A regex now strips trailing parameter/context suffixes (7B, 32B, 1.4B, 4K, 160M) while leaving MoE names like "Mixtral 8x7B" intact. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
1 parent 2352d76 commit eefd274

1 file changed

Lines changed: 6 additions & 1 deletion

File tree

src/openllm_selector/database.py

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,7 @@
22

33
import json
44
import pathlib
5+
import re
56
import xml.etree.ElementTree as ET
67

78
import requests
@@ -508,10 +509,14 @@ def fetch_recent_papers(model_name: str, max_results: int = 3) -> list[dict]:
508509
>>> papers[0].keys()
509510
dict_keys(['title', 'authors', 'summary', 'published', 'arxiv_url'])
510511
"""
512+
# Strip trailing size/context suffixes (e.g. "7B", "32B", "1.4B", "4K") so
513+
# "OLMo 2 32B" searches as "OLMo 2". Mixture-of-experts names like "8x7B"
514+
# are kept because they are part of the canonical model identity.
515+
search_name = re.sub(r"\s+\d+(\.\d+)?[BbMmKk]$", "", model_name)
511516
response = requests.get(
512517
"https://export.arxiv.org/api/query",
513518
params={
514-
"search_query": f'all:"{model_name}"',
519+
"search_query": f'all:"{search_name}"',
515520
"sortBy": "submittedDate",
516521
"sortOrder": "descending",
517522
"max_results": max_results,

0 commit comments

Comments
 (0)