Skip to content

Commit e0536a9

Browse files
committed
autosave: 2026-02-07 11:28:01
1 parent 1854f06 commit e0536a9

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

posts/2026-01-29-knowledge-creating-llms.qmd

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -100,7 +100,7 @@ Knowledge-creating LLMs.
100100

101101
Knowledge-creating LLMs will differ from knowledge-sharing LLMs in a number of ways:
102102

103-
- Knowledge-creating LLMs will have qualitatively different benchmarks: instead of seeing if they can answer questions which we already know the answer to (most existing benchmarks), we want them to answer *new* questions, e.g. solve an unsolved mathematical problem ([FrontierMath Open Problems](https://epoch.ai/frontiermath/open-problems)) or set a new record on an optimization problem (e.g. GSO-bench, @shetty2025gso). We can use these new frontier benchmarks are indices for capability, but they are more challenging to interpret because the frontier is always moving.
103+
- Knowledge-creating LLMs will have qualitatively different benchmarks: instead of seeing if they can answer questions which we already know the answer to (most existing benchmarks), we want them to answer *new* questions, e.g. solve an unsolved mathematical problem ([FrontierMath Open Problems](https://epoch.ai/frontiermath/open-problems)) or set a new record on an optimization problem (e.g. GSO-bench, @shetty2025gso). We can use these new frontier benchmarks as indices for capability, but they are more challenging to interpret because the frontier is always moving.
104104

105105
- Knowledge-creating LLMs have high returns to compute on individual problems, unlike knowledge-sharing LLMs for which returns asymptote quickly. It can be worth spending billions of tokens to solve a single problem if the solution is generally applicable.
106106

0 commit comments

Comments
 (0)