[26.04_linux-nvidia] NVIDIA: SAUCE: ovl: keep err zero after successful ovl_cache_get() #425
Closed
nirmoy wants to merge 1 commit into
Contributor
PR Validation Report
PR Lint ✅ All checks passed
Details
Checking 1 commits...
Cherry-pick digest:
┌──────────────┬──────────────────────────────────────────────────────────────────┬────────────┬─────────┬───────────────────────────┐
│ Local        │ Referenced upstream / Patch subject                              │ Patch-ID   │ Subject │ SoB chain                 │
├──────────────┼──────────────────────────────────────────────────────────────────┼────────────┼─────────┼───────────────────────────┤
│ b6d5ce0fbbd5 │ ovl: keep err zero after successful ovl_cache_get()              │ match      │ found   │ ok, backporter: nirmoyd   │
└──────────────┴──────────────────────────────────────────────────────────────────┴────────────┴─────────┴───────────────────────────┘
Lint: all checks passed.
Collaborator
Author
Boro review
Latest watcher review: open review
Head:
This comment is maintained by nv-pr-bot. It is updated when the GitHub watcher publishes a newer review.
Collaborator
@nirmoy Sounds like this should be in -next tomorrow and make next week's 7.1-rc. Our target build is for Thursday, so we should be able to pick it from -next tomorrow (assuming it shows up).
Collaborator
@nirmoy Can you fix up the trailers in this commit?
f3e1344 to
e7767e5
Collaborator
All of these should be removed, no? I would think the trailers would look like:
Then once we review, Brad will add the Acks and his sign-off when applying the patch.
BugLink: https://bugs.launchpad.net/bugs/2150640

ovl_iterate_merged() stores PTR_ERR(cache) in err before checking IS_ERR(cache). On success err holds the truncated cache pointer and can be returned as a bogus non-zero error.

The syzbot reproducer reaches this through overlay-on-overlay readdir:

  getdents64
    iterate_dir(outer overlay file)
      ovl_iterate_merged()
        ovl_cache_get()
          ovl_dir_read_merged()
            ovl_dir_read()
              iterate_dir(inner overlay file)
                ovl_iterate_merged()

Only compute PTR_ERR(cache) on the error path.

Fixes: d25e4b7 ("ovl: refactor ovl_iterate() and port to cred guard")
Reported-by: syzbot+a16fb0cce329a320661c@syzkaller.appspotmail.com
Closes: https://syzkaller.appspot.com/bug?extid=a16fb0cce329a320661c
Cc: stable@vger.kernel.org
Signed-off-by: Nirmoy Das <nirmoyd@nvidia.com>
(backported from https://lore.kernel.org/r/20260514144258.3068715-1-nirmoyd@nvidia.com)
Signed-off-by: Nirmoy Das <nirmoyd@nvidia.com>
e7767e5 to
b6d5ce0
Collaborator
Thanks Nirmoy, no further issues from me!
clsotog
approved these changes
May 20, 2026
clsotog
left a comment
Collaborator
Acked-by: Carol L Soto <csoto@nvidia.com>
Collaborator
Merged, closing PR.
Summary
Fix NVBug 6144764 on 26.04_linux-nvidia by keeping err zero after a successful ovl_cache_get() in ovl_iterate_merged().

The installer crash is an overlayfs readdir failure while rsync reads through overlayfs during BaseOS/DGX OS installation. The bad path is the same as syzbot a16fb0cce329a320661c: a successful cache pointer is passed to PTR_ERR(), truncating pointer bits into a bogus int that can later be returned as a non-errno value.

Sibling BOS PR: #423
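For reviewers who want to see the failure mode in isolation, here is a minimal userspace sketch of the pattern described above. It is not the kernel patch itself: ERR_PTR(), PTR_ERR() and IS_ERR() are simplified stand-ins for the helpers in include/linux/err.h, and fake_cache_get(), iterate_buggy() and iterate_fixed() are hypothetical names made up for illustration. The "valid" cache pointer is a fixed fake address that is never dereferenced, so the result is deterministic.

```c
#include <errno.h>
#include <stdint.h>

#define MAX_ERRNO 4095

/* Simplified userspace stand-ins for the kernel's err.h helpers (assumption). */
static inline void *ERR_PTR(long error) { return (void *)error; }
static inline long PTR_ERR(const void *ptr) { return (long)ptr; }
static inline int IS_ERR(const void *ptr)
{
	/* Error pointers live in the top MAX_ERRNO values of the address space. */
	return (uintptr_t)ptr >= (uintptr_t)-MAX_ERRNO;
}

/*
 * Hypothetical stand-in for ovl_cache_get(): returns either an error
 * pointer or a "valid" pointer. The valid value is a fake address used
 * only as a number; it is never dereferenced.
 */
static void *fake_cache_get(int fail)
{
	return fail ? ERR_PTR(-ENOMEM) : (void *)(uintptr_t)0x12345678;
}

/* Buggy shape: err is assigned before the IS_ERR() check. */
static int iterate_buggy(int fail)
{
	void *cache = fake_cache_get(fail);
	int err = (int)PTR_ERR(cache);	/* on success: pointer bits, not 0 */

	if (IS_ERR(cache))
		return err;

	/* ... later code assumes err is still 0 unless something failed ... */
	return err;
}

/* Fixed shape: PTR_ERR() is only evaluated on the error path. */
static int iterate_fixed(int fail)
{
	int err = 0;
	void *cache = fake_cache_get(fail);

	if (IS_ERR(cache))
		return (int)PTR_ERR(cache);

	/* ... err stays 0 unless a later step sets it ... */
	return err;
}
```

In this sketch both variants return -ENOMEM when the lookup fails, but on success only iterate_fixed() returns 0; iterate_buggy() returns the pointer bits, which is neither zero nor a valid negative errno.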
Bug Links
Validation
upstream/26.04_linux-nvidia.
git show --check --format=short HEAD: clean.
scripts/checkpatch.pl --strict --ignore COMMIT_LOG_USE_LINK,COMMIT_LOG_LONG_LINE --git HEAD: 0 errors, 0 warnings.
OVL_SYZ_DONE rc=0 and no Oops/KASAN/panic markers.