This repository was archived by the owner on Nov 19, 2025. It is now read-only.
Is there a limit on the largest PPO model size and the maximum context length that the package supports out of the box? Does it scale when we add more GPU nodes to the training job? Thanks! #87
On Jan 17, 2024, at 11:12 AM, HeyyyyyyG wrote:
Hi, we ran PPO on a Llama-70B model with a 4k context length. In terms of GPU count, we used 32x8 GPUs for the actor and 8x8 GPUs for the critic. Try our NV-Llama2-70B-RLHF model on NVIDIA AI Foundation for free.
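For a rough sense of scale, the GPU counts quoted above can be totaled as follows. This is only a sketch: it assumes "32x8" means 32 nodes with 8 GPUs each (and likewise for "8x8"), which the comment does not spell out, and the helper name `gpu_count` is hypothetical.

```python
# Assumption: "AxB" in the comment means A nodes with B GPUs per node.
def gpu_count(nodes: int, gpus_per_node: int) -> int:
    """Return the total number of GPUs for a group of identical nodes."""
    return nodes * gpus_per_node

actor_gpus = gpu_count(32, 8)    # actor: 32x8
critic_gpus = gpu_count(8, 8)    # critic: 8x8
total_gpus = actor_gpus + critic_gpus

print(f"actor={actor_gpus}, critic={critic_gpus}, total={total_gpus}")
# prints "actor=256, critic=64, total=320"
```

Under that reading, the reported 70B run used 320 GPUs in total, with the actor taking four times as many as the critic.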
Originally posted by @panyi121 in #70 (comment)