Pinned
- Megatron-Energon (Public, forked from NVIDIA/Megatron-Energon)
Megatron's multi-modal data loader
Python
- flash-attention (Public, forked from Dao-AILab/flash-attention)
Fast and memory-efficient exact attention
Python
- Megatron-LM (Public, forked from NVIDIA/Megatron-LM)
Ongoing research training transformer models at scale
Python
- TransformerEngine (Public, forked from NVIDIA/TransformerEngine)
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
Python
- MMseqs2 (Public, forked from soedinglab/MMseqs2)
MMseqs2: ultra fast and sensitive search and clustering suite
C



