feat: Add LLMSimulationTraceCompressor for token-efficient AI summaries#112
feat: Add LLMSimulationTraceCompressor for token-efficient AI summaries#112dyrpsf wants to merge 2 commits into
Conversation
|
I would likely not frame this specifically around LLMs but more generally as a summary string mechanism. As for the test, I am uncertain whether there is enough covered with that. At least there is a test though, thank you! |
You make a great point about keeping the core library naming generic. I have renamed it to |
draeger
left a comment
There was a problem hiding this comment.
Minor note; I like to advice using 1d, 1f, 1L, etc. to indicate the type of numeric value when working with literals.
| Mockito.when(mockTable.getColumnCount()).thenReturn(2); | ||
|
|
||
| // Mock time points | ||
| Mockito.when(mockTable.getTimePoint(0)).thenReturn(0.0); |
There was a problem hiding this comment.
I think it is more precise to write 1d or 1f, etc. rather than 1.0. The latter will certainly be interpreted as a double but the first two versions give us developers freedom to specify how much memory we want to spend.
Overview
This PR introduces the
LLMSimulationTraceCompressorto theorg.simulator.mathpackage. This is a foundational utility for the upcominggsoc-sysbio-llm-toolsarchitecture, designed to specifically solve the LLM token-limit bottleneck when handling massive simulation outputs.Changes Made
MultiTabletime-series data and algorithmically extract critical biological waypoints (initial states, peak concentrations, peak times, and final steady states).LLMSimulationTraceCompressorTest.javautilizing Mockito to mockMultiTablebehaviors and verify the statistical extraction logic.Validation
sbsclmodule.