Skip to content

Commit 353db74

Browse files
committed
Bump SessionConcurrencyTest ctx size to 4096
The previous ctxSize=512 was too tight for the concurrent-send test: 4 threads x 3 calls accumulate into a single Session transcript and each send() resubmits the full history, so by ~call 10 the prompt exceeds 512 tokens and the native side throws LlamaException. Bumping to 4096 (well under the 16384 trained ctx) keeps the whole 12-turn transcript well within bounds.
1 parent 8677fa9 commit 353db74

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

src/test/java/net/ladenthin/llama/SessionConcurrencyTest.java

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -57,7 +57,7 @@ public static void setup() {
5757
int gpuLayers = Integer.getInteger(TestConstants.PROP_TEST_NGL, TestConstants.DEFAULT_TEST_NGL);
5858
model = new LlamaModel(
5959
new ModelParameters()
60-
.setCtxSize(512)
60+
.setCtxSize(4096)
6161
.setModel(TestConstants.MODEL_PATH)
6262
.setGpuLayers(gpuLayers)
6363
.setFit(false)

0 commit comments

Comments
 (0)