Add PTX code for (highest) CUDA compute capability in PyTorch & torchvision#4129
Open
Flamefire wants to merge 1 commit into
Open
Add PTX code for (highest) CUDA compute capability in PyTorch & torchvision#4129Flamefire wants to merge 1 commit into
Flamefire wants to merge 1 commit into
Conversation
…vision E.g. "5.0+PTX" makes PyTorch add PTX code. We can simply add it to the list in `TORCH_CUDA_ARCH_LIST` to add PTX code for the last architecture.
687e6c2 to
060e0a5
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
E.g. "5.0+PTX" makes PyTorch add PTX code.
We can simply add it to the list in
TORCH_CUDA_ARCH_LISTto add PTX code for the last architecture.Uses parts of #4092. So with easybuilders/easybuild-framework#5144 it will add PTX code for the highest arch