This is a multi-part patch that allows the CPU to prepare a data copy
mapped onto a device.
1. The first question is how such a device is selected.
The allocation of such a copy happens well before the scheduler is
invoked for a task, in fact before the task is even ready. Thus, we
need to decide on the location of this copy based only on static
information, such as the task affinity. Therefore, this approach only
works for owner-compute tasks, where the task is executed on the
device that owns the data used for the task affinity.
2. Pass the correct data copy across the entire system, instead of
falling back to the data copy of device 0 (CPU memory).
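
The two points above can be illustrated with a minimal sketch. It is
not the code in this patch: every type and function name below
(data_t, data_copy_t, task_t, select_device_for_copy, get_copy) is
hypothetical and only assumes that each data item records an owner
device and holds at most one copy per device, with device 0 being CPU
memory.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical types for illustration only; not the runtime's real structures. */
#define MAX_DEVICES 4

typedef struct data_copy {
    int   device_index;   /* 0 denotes CPU memory */
    void *ptr;            /* storage backing this copy */
} data_copy_t;

typedef struct data {
    int          owner_device;          /* device that owns this data */
    data_copy_t *copies[MAX_DEVICES];   /* at most one copy per device */
} data_t;

typedef struct task {
    const data_t *affinity;   /* data used for the task affinity */
} task_t;

/* Point 1: pick the device from static information only (the task
 * affinity), so the choice can be made before the task is ready and
 * before the scheduler runs. Assumes owner-compute tasks. */
static int select_device_for_copy(const task_t *task)
{
    if (task == NULL || task->affinity == NULL)
        return 0;                        /* no affinity: stay on the CPU */
    int dev = task->affinity->owner_device;
    if (dev < 0 || dev >= MAX_DEVICES)
        return 0;                        /* invalid owner: stay on the CPU */
    return dev;
}

/* Point 2: hand out the copy that lives on the selected device, and
 * use the device 0 copy only when no such copy exists. */
static data_copy_t *get_copy(const data_t *d, int dev)
{
    assert(d != NULL);
    if (dev >= 0 && dev < MAX_DEVICES && d->copies[dev] != NULL)
        return d->copies[dev];
    return d->copies[0];
}
```

In this sketch a caller would first call select_device_for_copy() on
the task and then pass the result to get_copy() for each data item, so
the device 0 copy is used only as a last resort.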
Signed-off-by: George Bosilca <gbosilca@nvidia.com>