Commit d9c556d
ptx: Fix synchronisation of stencil border kernels
The border kernels did synchronise with the parent stream at the _end_
of the border kernel (so that the Future that contains the result of the
stencil operation is dependent on the borders too), but not at the
_start_ of the border kernel (meaning that the borders can start
executing in parallel with the computation of the argument of the
stencil operation). The latter is clearly wrong, and this commit fixes
it. This fixes a stencil nondeterminism bug that we were having
(nondet-stencil in https://github.com/tomsmeding/accelerate-tests).

1 parent dca0f75 commit d9c556d
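The ordering requirement described in the message can be sketched in plain Haskell. This is only an illustrative model, not the code changed by this commit: threads stand in for CUDA streams, an `MVar ()` stands in for an event, and every name here (`Event`, `signal`, `wait`, `stencilWithBorder`) is hypothetical. The border "kernel" waits for the argument before it starts, and the parent waits for the border before it returns the result.

```haskell
import Control.Concurrent (forkIO)
import Control.Concurrent.MVar (MVar, newEmptyMVar, putMVar, readMVar)
import Data.IORef (newIORef, readIORef, writeIORef)

-- Hypothetical stand-in for a CUDA event: empty until signalled.
type Event = MVar ()

newEvent :: IO Event
newEvent = newEmptyMVar

-- Mark the event as complete (call at most once per event).
signal :: Event -> IO ()
signal ev = putMVar ev ()

-- Block until the event has been signalled.
wait :: Event -> IO ()
wait = readMVar

-- Toy "stencil": the parent thread computes the argument and the interior,
-- a forked thread plays the role of the border kernel.
stencilWithBorder :: Int -> IO (Int, Maybe Int)
stencilWithBorder x = do
  argument   <- newIORef Nothing
  border     <- newIORef Nothing
  argReady   <- newEvent
  borderDone <- newEvent

  -- Child "stream": the border kernel.
  _ <- forkIO $ do
    wait argReady                       -- start-side sync: do not read the
                                        -- argument before it has been computed
    arg <- readIORef argument
    writeIORef border (fmap (* 2) arg)  -- pretend border computation
    signal borderDone                   -- end-side sync: result depends on us

  -- Parent "stream": compute the argument, then the interior.
  let arg = x + 1
  writeIORef argument (Just arg)
  signal argReady
  let interior = arg * 10               -- pretend interior computation
  wait borderDone                       -- the result includes the borders
  b <- readIORef border
  pure (interior, b)

main :: IO ()
main = stencilWithBorder 1 >>= print
```

Without the `wait argReady` line, the forked thread could read `argument` while it is still `Nothing`, which is the same kind of race the message describes for the border kernels.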
1 file changed
Lines changed: 7 additions & 2 deletions