Commit 01e9a5a

fix(compiler): don't cap concepts-plan output at max_tokens=2048 (#90)
Reasoning/thinking models can exhaust a 2048-token budget before emitting the JSON plan, yielding empty output -> unparseable plan -> zero concept pages (silently). Remove the cap so concepts-plan matches the uncapped summary call.
1 parent 85dbcb8 commit 01e9a5a
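
The failure mode described in the commit message can be sketched with a toy model. This is an illustration only, not code from openkb: `visible_output`, `parse_plan`, the `"concepts"` key, and the token counts are all hypothetical, and the sketch assumes a reasoning model spends its budget on hidden reasoning tokens before emitting any visible answer.

```python
import json
from typing import List, Optional


def visible_output(reasoning_tokens: int, answer: List[str],
                   max_tokens: Optional[int] = None) -> List[str]:
    """Toy model (assumption): reasoning tokens are spent first, and the
    visible answer only gets whatever budget is left over."""
    if max_tokens is None:
        return answer  # no cap: the full answer is emitted
    remaining = max(0, max_tokens - reasoning_tokens)
    return answer[:remaining]


def parse_plan(raw: str) -> list:
    """Hypothetical parser: return the planned concepts, or an empty list
    when the plan cannot be parsed (the silent failure path)."""
    try:
        plan = json.loads(raw)
    except json.JSONDecodeError:
        return []
    if not isinstance(plan, dict):
        return []
    concepts = plan.get("concepts", [])
    return concepts if isinstance(concepts, list) else []


# The JSON plan, split into three pretend tokens.
answer = ['{"concepts":', ' ["caching"]', '}']

# Capped: 3000 reasoning tokens exceed the 2048 budget, so nothing is emitted.
capped = parse_plan("".join(visible_output(3000, answer, max_tokens=2048)))

# Truncated: only two answer tokens fit, so the JSON is incomplete.
truncated = parse_plan("".join(visible_output(2046, answer, max_tokens=2048)))

# Uncapped: the whole plan is emitted and parses normally.
uncapped = parse_plan("".join(visible_output(3000, answer)))
```

In this sketch both `capped` and `truncated` come back as empty lists without raising, which mirrors the "zero concept pages (silently)" symptom, while `uncapped` yields the one planned concept.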

1 file changed

Lines changed: 1 addition & 1 deletion

openkb/agent/compiler.py

@@ -1414,7 +1414,7 @@ async def _compile_concepts(
 concept_briefs=concept_briefs,
 entity_briefs=entity_briefs,
 ).replace("__ENTITY_TYPES__", types_str)},
-], "concepts-plan", max_tokens=2048, response_format=_JSON_RESPONSE_FORMAT)
+], "concepts-plan", response_format=_JSON_RESPONSE_FORMAT)
 
 def _write_v1_summary_stripped() -> None:
     """Fallback writer for the v1 summary on early-return paths.
