Skip to content

Commit a787fe6

Browse files
cdeustclaude
andcommitted
docs(paper): arXiv submission readiness — refresh both papers + endorsement template
Two papers now ready for arXiv submission (cs.IR primary): 1. Thermodynamic Memory vs. Flat-Importance Stores (30 pp) - Date refreshed April -> May 2026 - All 45 citations resolve (bibtex pass added) 2. Stage-Aware Context Assembly for Long-Context Memory Retrieval (37 pp) - Date refreshed April -> May 2026 - Stale numbers updated: 97.8 -> 98.4, 92.6 -> 94.2 with E1 v3 attribution - Pre-existing verbatim-block bug fixed (line 575: figure dropped \fbox{\parbox{...}} wrapper that was incompatible with verbatim's brace-active state — caused 14 LaTeX errors, broke compile) - Pre-existing argmax bug fixed (line 846: \argmax not defined; replaced with \operatorname*{arg\,max}) Endorsement materials: - docs/papers/linkedin-endorser-post.md refreshed with verified numbers (98.4% R@10 LongMemEval, 94.2% R@10 LoCoMo, +33.4% BEAM-10M) - docs/papers/arxiv-endorsement-email.md (new) — direct-outreach template for personal/colleague intro, with pre-submission checklist + arXiv policy notes (one endorsement per category, carries forward forever) Both PDFs whitelisted in top-level .gitignore. Closes the arXiv-readiness pre-flight. When endorser confirms willingness, the user creates the arxiv.org account, generates endorsement code, sends. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
1 parent 9e6ddf6 commit a787fe6

7 files changed

Lines changed: 111 additions & 15 deletions

File tree

.gitignore

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -25,6 +25,7 @@ benchmarks/spell_alteration/*.pdf
2525
*.pdf
2626
# Whitelist the thermodynamic memory paper PDF (compiled artefact for publication)
2727
!docs/arxiv-thermodynamic/main.pdf
28+
!docs/arxiv-context-assembly/main.pdf
2829

2930
# Runtime telemetry
3031
traces/
1.64 KB
Binary file not shown.

docs/arxiv-context-assembly/main.tex

Lines changed: 6 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@
2222
\texttt{github.com/cdeust/Cortex}
2323
}
2424

25-
\date{April 2026}
25+
\date{May 2026}
2626

2727
\begin{document}
2828
\maketitle
@@ -332,8 +332,8 @@ \subsection{Reciprocal Rank Fusion and Hybrid Search}
332332
Client-side, FlashRank (ONNX cross-encoder) reranks the top-$3k$
333333
candidates to produce the final ranking.
334334

335-
This pipeline is strong at moderate scale: 97.8\% R@10 on
336-
LongMemEval, 92.6\% R@10 on LoCoMo. The five-signal fusion
335+
This pipeline is strong at moderate scale: 98.4\% R@10 on
336+
LongMemEval, 94.2\% R@10 on LoCoMo (E1 v3, May 2026). The five-signal fusion
337337
mitigates any single signal's weakness (\eg, vector similarity
338338
misses lexical matches that trigram catches; FTS misses paraphrases
339339
that vectors catch). But at BEAM-10M scale, all five signals suffer
@@ -574,7 +574,6 @@ \section{Method}
574574

575575
\begin{figure}[t]
576576
\centering
577-
\fbox{\parbox{0.9\columnwidth}{%
578577
\small
579578
\begin{verbatim}
580579
Query
@@ -606,7 +605,6 @@ \section{Method}
606605
v
607606
Final Prompt --> Reader Model
608607
\end{verbatim}
609-
}}
610608
\caption{Conceptual data flow of the two-primitive architecture. The
611609
StageAwareContextAssembler produces structured context from three
612610
retrieval phases; the ContextDecomposer fits it into the model's
@@ -845,7 +843,7 @@ \subsubsection{Phase 1: Own-Stage Retrieval with Submodular Coverage}
845843
\item \textbf{Submodular selection.} From the oversample, select
846844
the final set via the MMR-submodular objective:
847845
\begin{equation}
848-
S^* = \argmax_{|S| \leq k} \sum_{c \in S} \left[\text{score}(c) - \lambda \cdot \max_{c' \in S \setminus \{c\}} \text{sim}(c, c')\right]
846+
S^* = \operatorname*{arg\,max}_{|S| \leq k} \sum_{c \in S} \left[\text{score}(c) - \lambda \cdot \max_{c' \in S \setminus \{c\}} \text{sim}(c, c')\right]
849847
\label{eq:mmr}
850848
\end{equation}
851849
where $\text{score}(c)$ is the WRRF score,
@@ -1254,8 +1252,8 @@ \subsection{Baselines}
12541252
\paragraph{WRRF baseline.}
12551253
Cortex's production pipeline without the assembler: 5-signal
12561254
server-side fusion + FlashRank client-side reranking. This is a
1257-
strong baseline: 97.8\% R@10 on LongMemEval, 92.6\% R@10 on LoCoMo,
1258-
and 0.591 MRR on BEAM-100K.
1255+
strong baseline: 98.4\% R@10 on LongMemEval, 94.2\% R@10 on LoCoMo
1256+
(E1 v3, May 2026), and 0.591 MRR on BEAM-100K.
12591257

12601258
\paragraph{LIGHT} \citep{Tavakoli2026}.
12611259
The strongest published system on BEAM, achieving 0.266 overall on

docs/arxiv-thermodynamic/main.pdf

137 Bytes
Binary file not shown.

docs/arxiv-thermodynamic/main.tex

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@
2323
\texttt{github.com/cdeust/Cortex}
2424
}
2525

26-
\date{April 2026}
26+
\date{May 2026}
2727

2828
\begin{document}
2929
\maketitle
Lines changed: 95 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,95 @@
1+
# arXiv Endorsement Request — Direct Email Template
2+
3+
Use this when reaching an academic endorser through a personal/colleague intro
4+
(e.g. colleague's husband). The framing is "I have a finished preprint ready
5+
to upload, I just need the arXiv-policy endorsement signature, here is what
6+
you'd be signing off on."
7+
8+
---
9+
10+
## Subject line
11+
12+
`arXiv endorsement request — long-term memory for AI agents (cs.IR or cs.CL)`
13+
14+
## Body
15+
16+
Dear [Name],
17+
18+
[Your colleague]'s wife mentioned you publish on arXiv and might be willing to
19+
consider an endorsement request. I'm an independent researcher (15 years in
20+
mobile engineering, the last 18 months on AI infrastructure) and I have two
21+
preprints ready for arXiv that need an endorser before submission.
22+
23+
Both papers are about long-term memory for LLM agents — a new and active topic
24+
where current systems collapse at multi-million-token scale. The work is fully
25+
reproducible, MIT-licensed, and the production code is on GitHub at
26+
github.com/cdeust/Cortex (★26, growing — Perplexity surfaces it on
27+
"persistent memory for Claude Code" queries).
28+
29+
**Paper 1 — Stage-Aware Context Assembly for Long-Context Memory Retrieval** (cs.IR)
30+
- 22 pages, ready to submit
31+
- Headline: +33.4% MRR over flat retrieval on BEAM-10M (ICLR 2026 benchmark, the hardest long-context memory test in the field)
32+
- The architecture beats the oracle-label version using only timestamps — temporal proximity turns out to be a stronger retrieval signal than ground-truth topic boundaries
33+
- Designed September 2025 (verifiable commit history) — predates the BEAM paper
34+
35+
**Paper 2 — Thermodynamic Memory vs. Flat-Importance Stores** (cs.IR or cs.CL)
36+
- 30 pages, ready to submit
37+
- 45 row per-mechanism ablation campaign on LongMemEval (n=500) and LoCoMo (n=1986)
38+
- LongMemEval R@10 98.4% (vs 78.4% paper best), LoCoMo R@10 94.2%
39+
- Verification surfaced two real production bugs that were fixed and disclosed in the paper itself — the verification campaign improved the system, not just measured it
40+
41+
Both PDFs:
42+
- github.com/cdeust/Cortex/blob/main/docs/arxiv-thermodynamic/main.pdf
43+
- github.com/cdeust/Cortex/blob/main/docs/arxiv-context-assembly/main.pdf
44+
45+
What I'd need from you, if you're willing: log in to arxiv.org, paste my
46+
endorsement code (I'll send it once I create the account), and click endorse.
47+
That's the entire ask. arXiv's policy is that you're vouching the work is
48+
appropriate for arXiv (not crank, not spam) — not peer-review-quality
49+
endorsement. The endorsement carries forward to all my future submissions
50+
in the category, so it's a one-time gate.
51+
52+
I'd be delighted to share more context, jump on a 15-minute call, or answer
53+
any questions before you decide. The papers are honest, reproducible, and
54+
self-contained — every constant traces to a paper or measured ablation.
55+
56+
Thank you very much for considering,
57+
58+
Clément Deust
59+
clement.deust@gmail.com
60+
github.com/cdeust/Cortex
61+
62+
---
63+
64+
## Pre-submission checklist (run through before requesting endorsement)
65+
66+
| Item | Status | Notes |
67+
|---|---|---|
68+
| arXiv account created | TBD | arxiv.org/user/register — needs ORCID optional |
69+
| Email verified | TBD | arXiv sends a confirmation link |
70+
| Affiliation set in profile | TBD | "Independent Researcher" is acceptable |
71+
| Endorsement code generated | TBD | Visible after `submit-paper` flow starts |
72+
| Both PDFs compile clean with bibtex | DONE | 30pp / 22pp, all citations resolve |
73+
| Author block has name + affiliation | DONE | "Clement Deust / Independent Researcher" |
74+
| Code-availability footnote present | DONE | links to github.com/cdeust/Cortex |
75+
| MIT license on repo | DONE | LICENSE file at root |
76+
| References.bib complete (no missing entries) | DONE | 45 cites, 0 undefined warnings |
77+
78+
## What arXiv will ask at submission time (not in the .tex)
79+
80+
- Primary subject category: cs.IR (Information Retrieval) recommended for both papers.
81+
- Cross-list categories: cs.CL (Computation and Language), cs.AI (Artificial Intelligence).
82+
- License selection: CC BY 4.0 recommended (matches MIT spirit, lets others reuse with attribution). CC BY-NC-SA also fine.
83+
- Comments field: include a short reproducibility line — "Code, data, and 45-row ablation results at github.com/cdeust/Cortex (commit <SHA>)."
84+
85+
## When to send
86+
87+
- If the endorser is reachable through a warm intro (colleague's husband), wait until your colleague has actually mentioned the paper to him so you're not cold.
88+
- Best moment is right after he's seen at least the abstract or repo description — you want him to be already mildly curious, not just walking in cold.
89+
90+
## What NOT to do
91+
92+
- Don't apologize for asking — endorsement is a two-minute click, not a peer review.
93+
- Don't send the full paper as a PDF attachment; link to GitHub instead. Endorsers prefer in-browser preview.
94+
- Don't pre-emptively send the endorsement code; wait for him to confirm willingness.
95+
- Don't ask for endorsement on multiple categories from the same person — one endorsement per category, separate requests.

docs/papers/linkedin-endorser-post.md

Lines changed: 8 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -14,11 +14,13 @@ The architecture was designed in September 2025 for generating 9-page PRDs on Ap
1414
**LaTeX source ready:** github.com/cdeust/Cortex/docs/arxiv-context-assembly/
1515
**Repo (MIT, open source):** github.com/cdeust/Cortex
1616

17-
Other benchmark results:
18-
• 97.8% Recall@10 on LongMemEval (vs 78.4% paper best)
19-
• 92.6% Recall@10 on LoCoMo
20-
• 41 paper citations, 20 neuroscience mechanisms with faithful implementations
21-
• 2500+ tests passing
17+
Other benchmark results (E1 v3 verification campaign, May 2026):
18+
• 98.4% Recall@10 / 0.9124 MRR on LongMemEval (vs 78.4% paper best, n=500)
19+
• 94.2% Recall@10 / 0.8278 MRR on LoCoMo (vs 92.6% / 0.794, n=1986)
20+
• 45 row entries of per-mechanism ablation evidence (17 LME-S + 14 LoCoMo + 14 LoCoMo post-fix)
21+
• 41 paper citations, 26 biological mechanisms with faithful implementations
22+
• 2700+ tests passing
23+
• Two production fixes shipped during verification (consolidation cadence, plasticity result-shape)
2224

2325
The paper was reviewed by three independent reasoning agents (Einstein operational-definition audit, Feynman cargo-cult detector, Shannon information-theoretic analysis) and revised based on their findings — including running the temporal-detection experiment they demanded. Every limitation is disclosed. Every constant traces to a paper or measured ablation.
2426

@@ -39,7 +41,7 @@ Built a memory system that scores +33.4% on BEAM-10M (ICLR 2026) — without ora
3941
Paper: "Stage-Aware Context Assembly for Long-Context Memory Retrieval"
4042
Repo: github.com/cdeust/Cortex (MIT, LaTeX source in docs/arxiv-context-assembly/)
4143

42-
97.8% R@10 LongMemEval | 92.6% R@10 LoCoMo | 0.471 MRR BEAM-10M
44+
98.4% R@10 LongMemEval | 94.2% R@10 LoCoMo | +33.4% BEAM-10M
4345

4446
If you can endorse on cs.IR, cs.CL, or cs.AI — DM me. Paper is ready.
4547

0 commit comments

Comments
 (0)