Implement tied LM head & word embeddings for Qwen3 by finbarrtimbers · Pull Request #686 · allenai/OLMo-core

finbarrtimbers · 2026-05-26T22:16:02Z

Implements tied LM head & word embeddings for Qwen3. The three sizes that Qwen ships tied (0.6B, 1.7B, 4B) now default to tying; 8B/14B/32B stay untied. The HF import path is tie-aware.

…aude Opus 4.7 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b1bb3368c5

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

…Claude Opus 4.7 <noreply@anthropic.com>

… 4.7 <noreply@anthropic.com>

… Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 0ce116d97b

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

… Claude Opus 4.8 <noreply@anthropic.com>

Implement tied LM head & word embeddings for Qwen3 Co-Authored-By: Cl…

b1bb336

…aude Opus 4.7 <noreply@anthropic.com>

chatgpt-codex-connector Bot reviewed May 26, 2026

View reviewed changes

Comment thread src/olmo_core/nn/transformer/model.py

finbarrtimbers added 4 commits May 26, 2026 18:05

Support tensor parallelism with tied word embeddings Co-Authored-By: …

6ff9f89

…Claude Opus 4.7 <noreply@anthropic.com>

Parametrize HF roundtrip conversion tests Co-Authored-By: Claude Opus…

0074050

… 4.7 <noreply@anthropic.com>

Skip re-tying word embeddings on pipeline stages missing tied modules…

96b0d7d

… Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Simplify HF convert: drop redundant getattr and tied-embedding guards…

0ce116d

… Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

finbarrtimbers requested a review from AkshitaB May 28, 2026 15:07

chatgpt-codex-connector Bot reviewed May 28, 2026

View reviewed changes

Comment thread src/olmo_core/nn/transformer/model.py

Reject pipeline parallelism with tied word embeddings Co-Authored-By:…

fcd2735

… Claude Opus 4.8 <noreply@anthropic.com>

AkshitaB approved these changes May 29, 2026

View reviewed changes

AkshitaB merged commit 0021abd into finbarr/fix-conversion May 29, 2026
1 check passed

AkshitaB deleted the finbarr/tie-lm branch May 29, 2026 20:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement tied LM head & word embeddings for Qwen3#686

Implement tied LM head & word embeddings for Qwen3#686
AkshitaB merged 6 commits into
finbarr/fix-conversionfrom
finbarr/tie-lm

finbarrtimbers commented May 26, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

finbarrtimbers commented May 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

finbarrtimbers commented May 26, 2026 •

edited

Loading