MaxCode: Add RAG knowledge base — JAX/Flax docs and targeted migration rules by gvanica · Pull Request #23 · AI-Hypercomputer/accelerator-agents

gvanica · 2026-04-22T02:31:59Z

Summary

Adds 55 new RAG source documents to the MaxCode knowledge base, significantly expanding the corpus available for retrieval-augmented generation during PyTorch-to-JAX migrations. Also updates the RAG agent and vector DB to support the expanded corpus with targeted retrieval.

Generic sources (22 files)

Reference implementations and documentation that the LLM retrieves as context during conversion:

JAX/Flax documentation — module API, layers API, setup vs compact, common gotchas, jax.lax primitives
Flash Linear Attention (FLA) library — gated delta-net layers/models, L2 norm, gated layer norm, rotary embeddings, short convolution, naive delta-rule ops
Flax attention patterns — example attention, linen attention implementation
MaxText reference layers — attentions, embeddings, linears, normalizations, Qwen3 and DeepSeek model implementations

Targeted sources (30 files)

Focused migration rules that address specific conversion pitfalls discovered during iterative testing:

Category	Rules
Numerics and dtypes	buffer dtype fidelity, mixed precision, float32 softmax upcast, integer dtype casting
Caching	KV cache prefill/decode, encoder-decoder cache, causal conv1d prefill/decode
Projections and init	fused QKV projection, linear init consistency, weight init patterns, no explicit init for bare layers
Structure	config dataclasses, class hierarchy preservation, default value preservation, source faithfulness
Operations	stop_gradient mapping, triangular masking, WY representation, scan vs for-loop, reduction axis preservation, sum/div vs mean, load balancing loss, MoE capacity routing, cosine similarity
Flax specifics	checkpoint API, train/eval mode, Pallas kernel opportunities, no invented attributes, QKVZ interleaved ordering, dead code helpers

Infrastructure changes

rag/rag_agent.py — Updated to support targeted RAG retrieval alongside generic retrieval
rag/vector_db.py — Minor updates for expanded corpus handling

Test plan

Verify RAG agent loads all 55 new source documents without errors
Confirm targeted retrieval returns relevant rules for a sample conversion query
Validate vector DB indexing completes for the expanded corpus

Split from #17 — PR 1 of 8

google-cla · 2026-04-22T02:32:22Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

…n rules Adds 55 new RAG source documents to improve migration quality: Generic sources (22 files): - JAX/Flax documentation: module API, layers API, setup vs compact, gotchas, lax primitives, attention patterns - Flash Linear Attention (FLA) library references: gated delta net layers/models, l2norm, layernorm gated, rotary, short conv, ops - MaxText reference implementations: attentions, embeddings, linears, normalizations, Qwen3/DeepSeek model layers Targeted sources (30 files): - Migration-specific rules covering: dtype fidelity, causal conv1d, config dataclasses, stop_gradient, mixed precision, KV cache, encoder-decoder cache, Flax checkpoint API, train/eval mode, float32 softmax upcast, fused QKV projection, weight init patterns, class hierarchy preservation, source faithfulness, triangular masking, WY representation, and more Also updates rag_agent.py and vector_db.py to support the expanded corpus with targeted RAG retrieval. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

gvanica force-pushed the split/1-rag-knowledge-base branch from d1c6e43 to 53a3c6d Compare April 22, 2026 02:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MaxCode: Add RAG knowledge base — JAX/Flax docs and targeted migration rules#23

MaxCode: Add RAG knowledge base — JAX/Flax docs and targeted migration rules#23
gvanica wants to merge 1 commit intomainfrom
split/1-rag-knowledge-base

gvanica commented Apr 22, 2026 •

edited

Loading

Uh oh!

google-cla Bot commented Apr 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

gvanica commented Apr 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Generic sources (22 files)

Targeted sources (30 files)

Infrastructure changes

Test plan

Uh oh!

google-cla Bot commented Apr 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

gvanica commented Apr 22, 2026 •

edited

Loading