NODES offers a mechanism to parallelize inner runtime code using OmpSs-2 without having to pass through the compiler. Below we list the main features of this mechanism:
- It is enabled through the
runtime.enable_inner_parallelismconfigure variable. - At the moment, only specific parts of the
taskitercode leverage its potential. InnerParallelism::taskLoopallows parallelizing an inner loop within the NODES runtime as-is.InnerParallelism::taskWaitallows to synchronize the internally created tasks up until that point.- Task synchronization falls entirely upon the developer, as the implicit taskwait in programs will not enforce their finalization.