Skip to content

refactor: patch sharding state dict and warn#151

Merged
dushyantbehl merged 1 commit intofoundation-model-stack:mainfrom
kmehant:nit-patch-sd
Sep 22, 2025
Merged

refactor: patch sharding state dict and warn#151
dushyantbehl merged 1 commit intofoundation-model-stack:mainfrom
kmehant:nit-patch-sd

Conversation

@kmehant
Copy link
Copy Markdown
Collaborator

@kmehant kmehant commented Sep 18, 2025

Patch the sharding strategy instead of erroring out, this would help us reduce the scale of failing training runs we have to instead adapt to the right sharding strategy and continue.

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
Copy link
Copy Markdown
Collaborator

@dushyantbehl dushyantbehl left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @kmehant

@dushyantbehl dushyantbehl merged commit c3bb0cf into foundation-model-stack:main Sep 22, 2025
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants