Clarify 32D base action space padding#939
Open
taivu1998 wants to merge 1 commit into
Open
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
dim_16:dim_31as reserved padding in the provided base action-space convention.pi05_basefine-tuning configs load compatible normalization stats from the pi0.5 base assets, without overclaiming every listed asset is available for every checkpoint.Root Cause
docs/norm_stats.mddescribed the leading robot action dimensions throughdim_15, but the released base model interface can expose a 32D action width. Without an explicit explanation ofdim_16:dim_31, users could reasonably infer those dimensions encode hidden robot controls.User Impact
This makes the action-space documentation self-contained for users interpreting model outputs or wiring custom robot adapters. It makes clear that undefined tail dimensions should not be sent to robot controllers unless a custom adapter, dataset, and matching normalization stats define them.
Validation
PadStatesAndActions, normalization/unnormalization, and ALOHA/DROID/LIBERO output slicing.git diff --check.rgchecks for the newdim_16:dim_31docs and source references.jax-cuda12-plugin==0.5.3because it only ships Linux wheels.Closes #748