55Comprehensive end-to-end test of all GitMem Pro features on a fresh Supabase project. Simulates 5 days of real interactive usage with session cycling, data accumulation, and cross-session persistence verification.
66
77** Test file:** ` pro-stress-test.mjs `
8- ** Last run:** 2026-05-25 — 141/141 PASS
8+ ** Last run:** 2026-05-25 — 147/147 PASS (all canonical tools covered)
99
1010## Prerequisites
1111
@@ -55,18 +55,19 @@ su developer -c "cd /home/developer/my-project && node /tmp/test-harness/stress-
5555- ` list_threads ` — verify all 10 visible
5656- ` session_close ` — with closing reflection
5757
58- ### Day 2: Recall, confirm, resolve (18 tests)
58+ ### Day 2: Recall, confirm, resolve (19 tests)
5959- ` session_start ` — loads day 1 context, verifies threads carry over
6060- 5x ` recall ` — diverse queries (deploy, auth, cache, frontend, security)
6161- ` confirm_scars ` — acknowledge recalled scars with APPLYING/N_A decisions
6262- 3x ` resolve_thread ` — close completed work items
6363- ` list_threads ` — verify 7 open, 3 resolved
6464- ` reflect_scars ` — end-of-session scar reflection
6565- ` record_scar_usage ` — track scar application
66+ - ` record_scar_usage_batch ` — batch scar tracking
6667- ` session_refresh ` — re-surface context mid-session
6768- ` session_close `
6869
69- ### Day 3: Docs, search, graph, analytics (23 tests)
70+ ### Day 3: Docs, search, graph, analytics, transcripts (26 tests)
7071- ` session_start ` — loads 2 days of history
7172- Write 3 markdown docs (architecture, deployment, API reference — 1000+ words total)
7273- ` index_docs ` — embed and index the docs
@@ -77,13 +78,18 @@ su developer -c "cd /home/developer/my-project && node /tmp/test-harness/stress-
7778- ` analyze ` — session analytics summary
7879- 3x ` prepare_context ` — compact, gate, and full sub-agent briefings
7980- ` absorb_observations ` — capture sub-agent findings (4 observations)
81+ - ` save_transcript ` — save session conversation
82+ - ` get_transcript ` — retrieve saved transcript
83+ - ` search_transcripts ` — semantic search over transcript chunks
8084- ` session_close `
8185
82- ### Day 4: Cache, health, archive, threads (15 tests)
86+ ### Day 4: Cache, health, archive, suggestions, threads (17 tests)
8387- ` session_start `
8488- 4x cache management — status, health, flush, status-after
8589- ` health ` — write operation success rates
8690- ` archive_learning ` — soft-delete a scar
91+ - ` promote_suggestion ` — promote a suggested thread
92+ - ` dismiss_suggestion ` — dismiss a suggested thread
8793- ` create_thread ` + dedup test (similar text → returns existing)
8894- ` list_threads ` , ` cleanup_threads `
8995- 3x ` resolve_thread ` — close more work items
@@ -114,26 +120,32 @@ su developer -c "cd /home/developer/my-project && node /tmp/test-harness/stress-
114120| confirm_scars | 1 | 2 |
115121| reflect_scars | 1 | 2 |
116122| record_scar_usage | 1 | 2 |
123+ | record_scar_usage_batch | 1 | 2 |
117124| search | 4 | 3 |
118125| log | 4 | 3, 5 |
119126| create_thread | 12 | 1, 4 |
120127| list_threads | 5 | 1-5 |
121128| resolve_thread | 6 | 2, 4 |
122129| cleanup_threads | 1 | 4 |
130+ | promote_suggestion | 1 | 4 |
131+ | dismiss_suggestion | 1 | 4 |
123132| index_docs | 1 | 3 |
124133| search_docs | 5 | 3, 5 |
125134| graph_traverse | 2 | 3 |
126135| analyze | 2 | 3, 5 |
127136| prepare_context | 3 | 3 |
128137| absorb_observations | 1 | 3 |
138+ | save_transcript | 1 | 3 |
139+ | get_transcript | 1 | 3 |
140+ | search_transcripts | 1 | 3 |
129141| archive_learning | 1 | 4 |
130142| health | 1 | 4 |
131143| cache-status | 2 | 4 |
132144| cache-health | 1 | 4 |
133145| cache-flush | 1 | 4 |
134146| contribute_feedback | 1 | 4 |
135147| gitmem-help | 1 | 5 |
136- | ** TOTAL** | ** 141 ** | |
148+ | ** TOTAL** | ** 147 ** | |
137149
138150## What this test validates
139151
@@ -156,3 +168,4 @@ su developer -c "cd /home/developer/my-project && node /tmp/test-harness/stress-
156168| Version | Date | Tests | Result |
157169| ---------| ------| -------| --------|
158170| v1.0 | 2026-05-25 | 141 | 141 PASS |
171+ | v1.1 | 2026-05-25 | 147 | 147 PASS — added record_scar_usage_batch, transcripts, promote/dismiss_suggestion |
0 commit comments