Feature/telemetry demo notebook by jonathanrbelanger-lang · Pull Request #1308 · TransformerLensOrg/TransformerLens

jonathanrbelanger-lang · 2026-05-17T13:34:07Z

Description

Adds a new educational demo notebook (demos/TL_Demo_RT_Viz.ipynb) that provides a lightweight, zero-dependency bridge to extract and visualize mechanistic telemetry (Attention Coherence and Head Agreement) during a training loop.

Motivation and Context:

Optimization: The training loop is intentionally branched so model.run_with_cache is only called at log intervals, saving roughly 10x memory/compute overhead compared to naive caching loops.
Scalability: The dynamic dictionary logging and visualization matrix automatically scale to adapt to n_layers, making it highly forkable for users experimenting with larger architectures.
Linting: The notebook has been run through ruff and passes all modern syntax and formatting checks cleanly.

Fixes # N/A

Type of change

New feature (non-breaking change which adds functionality)

Screenshots

Checklist:

I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation (N/A - standalone demo)
My changes generate no new warnings (ruff checked)
I have added tests that prove my fix is effective or that my feature works (N/A - standalone demo)
New and existing unit tests pass locally with my changes
I have not rewritten tests relating to key interfaces which would affect backward compatibility

* Fix type of HookedTransformerConfig.device This is typed as `Optional[str]` but sometimes returns `torch.device`. Updated the code to just return the `str` instead of wrapping with a device. I'm not confident that every function which takes a device will always be passed a string, so I didn't change functions like warn_if_mps. Found while working on TransformerLensOrg#1219 * more cleanup * 3.0 CI Bugs (TransformerLensOrg#1261) * Fixing `utils` imports * skip gated notebooks on PR from forks * Updating notebooks * Ensure LLaMA only runs when HF_TOKEN is available --------- Co-authored-by: jlarson4 <jonahalarson@comcast.net>

TransformerLens 3.1.0

Release v3.2.0

Release v3.2.1

jonathanrbelanger-lang · 2026-05-17T13:42:09Z

Hey @jlarson4 -- I've opened this PR to address the task (1148) assigned to me.

You'll notice a few minor changes from my initial concept and code. These updates focus specifically on streamlining the loop and eliminating caching overhead, but the final result fully aligns with the original submission goals.

I'll be available this week to tweak or refactor anything based on your critical review. Thanks!

jlarson4 · 2026-05-18T14:42:21Z

Thank you for putting this together @jonathanrbelanger-lang, it looks awesome. I should have time today to give it a thorough review & send over any comments if I have them.

jonathanrbelanger-lang · 2026-05-23T00:35:20Z

Hey - I've pushed the revised Realtime_Training_Telemetry_Demo.ipynb to align with your review.

I know it is a busy weekend for everyone, so please no rush at all on reviewing this, take your time. I am completely prepared to make additional adjustments if you find them necessary.

Here is a quick breakdown of what was updated:

Changelog: Real-time Telemetry Demo Refactor
Migrated Visualizations to Plotly: Completely removed matplotlib dependencies, replacing them with a dual-trace Plotly subplot (Loss/Coherence Line Graph + Layer Depth Heatmap) to perfectly align with the TransformerLens ecosystem.

Cross-Platform Environment Detection: Implemented the standard IN_COLAB try/except block to dynamically assign the correct Plotly renderer ("colab" vs "notebook_connected") and handle pip installations seamlessly across standard Jupyter, VS Code, and Google Colab.

Real-Time Rendering Optimization: Replaced static plotting with a highly optimized real-time loop. To prevent browser DOM crashes and memory leaks, the loop pre-allocates numpy arrays and directly mutates the fig.data traces in-place, relying on IPython.display.clear_output(wait=True) for smooth frame redraws.

Compute Efficiency (Selective Caching): Modified the training loop to only invoke model.run_with_cache during defined logging intervals (every 10 steps), ensuring the extraction process does not suffocate local CPU/GPU memory bandwidth.

Self-Contained Execution: Added a localized synthetic induction data generator (get_batch()) directly above the training loop to guarantee the cell runs cleanly without state leakage or variable dependency issues from upstream cells.

Pedagogical Markdown Overhaul: Rewrote all cell headers to match the rigorous, educational tone of Main_Demo.ipynb. Added specific architectural context detailing:

Why the model uses 2 layers/2 heads (minimum depth for induction).

Why special tokens (BOS) act as attention sinks and how they skew coherence metrics.

Explicit instructions on how to revert to static rendering for users prioritizing raw compute speed.

jlarson4 · 2026-05-26T13:46:22Z

Hey @jonathanrbelanger-lang! Just a couple small additional notes:

Can you remove the old notebook file (TL_Demo_RT_Viz)?
Can you move all of your commented # @markdown comments out of the code cells and into their own text/markdown cells?

Other than that, it looks great! Once those edits come through I will merge and get this released.

jonathanrbelanger-lang · 2026-05-26T14:39:56Z

Absolutely, this will be done by EOD.

jlarson4 · 2026-05-26T19:17:28Z

Excellent, thank you for the update @jonathanrbelanger-lang

Removed initial version.

…lemetry_Demo_xOLD.ipynb Changed name before push of fixed version to ensure separation.

jonathanrbelanger-lang · 2026-05-27T02:49:29Z

@jlarson4 - separated the instructional portions, now shows explanation and clean code cells separately. One open question regarding the HookedTransformer change.

Changelog

Removed unused imports (os, torch.nn.functional, torch.optim, collections) from setup cell
Added # noqa: F811 to numpy re-import in training cell (intentional standalone-cell portability pattern)
Removed deprecated demo notebooks (TL_Demo_RT_Viz.ipynb, prior Realtime_Training_Telemetry_Demo.ipynb)
Note: HookedTransformer/HookedTransformerConfig flagged as deprecated in TL 3.0 — TransformerBridge does not appear to cover from-scratch toy model instantiation; flagging for maintainer confirmation before addressing

jlarson4 · 2026-05-27T12:48:04Z

Hi @jonathanrbelanger-lang! Thanks for getting this wrapped up.

Good question on toy models. This is a known TransformerBridge coverage gap, it doesn't currently have a from-scratch / config-only constructor, so HookedTransformer(cfg) is still the right tool for toy models in training demos. Please keep it as-is. From-scratch instantiation is on my roadmap for future bridge work; in the meantime, no action needed here.

brendanlong and others added 5 commits April 20, 2026 14:50

Merge pull request TransformerLensOrg#1277 from TransformerLensOrg/dev

6f56518

TransformerLens 3.1.0

Merge pull request TransformerLensOrg#1294 from TransformerLensOrg/dev

31d4f6a

Release v3.2.0

Merge pull request TransformerLensOrg#1295 from TransformerLensOrg/dev

5f7b02e

Release v3.2.1

add: interactive attention telemetry and phase transition demo notebook

6d2899f

jlarson4 reviewed May 18, 2026

View reviewed changes

Comment thread demos/TL_Demo_RT_Viz.ipynb Outdated

jlarson4 force-pushed the dev branch from da370b8 to f6192fa Compare May 22, 2026 21:12

Add Real-time Telemetry Demo per maintainer feedback

7592a4f

jonathanrbelanger-lang added 4 commits May 26, 2026 17:10

Delete demos/TL_Demo_RT_Viz.ipynb

aec9890

Removed initial version.

Rename Realtime_Training_Telemetry_Demo.ipynb to Realtime_Training_Te…

56494a1

…lemetry_Demo_xOLD.ipynb Changed name before push of fixed version to ensure separation.

Delete demos/Realtime_Training_Telemetry_Demo_xOLD.ipynb

c62b4e5

add ruff-clean real-time training telemetry demo

8fd31c3

jlarson4 merged commit 7329b20 into TransformerLensOrg:dev May 27, 2026
24 checks passed

jonathanrbelanger-lang mentioned this pull request May 27, 2026

[Proposal] Tutorial for "Real-Time Training Dynamics" (VSM Telemetry) #1148

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/telemetry demo notebook#1308

Feature/telemetry demo notebook#1308
jlarson4 merged 10 commits into
TransformerLensOrg:devfrom
jonathanrbelanger-lang:feature/telemetry-demo-notebook

jonathanrbelanger-lang commented May 17, 2026 •

edited

Loading

Uh oh!

jonathanrbelanger-lang commented May 17, 2026 •

edited

Loading

Uh oh!

jlarson4 commented May 18, 2026

Uh oh!

Uh oh!

jonathanrbelanger-lang commented May 23, 2026

Uh oh!

jlarson4 commented May 26, 2026

Uh oh!

jonathanrbelanger-lang commented May 26, 2026

Uh oh!

jlarson4 commented May 26, 2026

Uh oh!

jonathanrbelanger-lang commented May 27, 2026

Uh oh!

jlarson4 commented May 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jonathanrbelanger-lang commented May 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

Screenshots

Checklist:

Uh oh!

jonathanrbelanger-lang commented May 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jlarson4 commented May 18, 2026

Uh oh!

Uh oh!

jonathanrbelanger-lang commented May 23, 2026

Uh oh!

jlarson4 commented May 26, 2026

Uh oh!

jonathanrbelanger-lang commented May 26, 2026

Uh oh!

jlarson4 commented May 26, 2026

Uh oh!

jonathanrbelanger-lang commented May 27, 2026

Uh oh!

jlarson4 commented May 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jonathanrbelanger-lang commented May 17, 2026 •

edited

Loading

jonathanrbelanger-lang commented May 17, 2026 •

edited

Loading