Skip to content

Commit b37ce07

Browse files
committed
Also vectorize on GPU to make sure we test that.
1 parent ce6fe63 commit b37ce07

1 file changed

Lines changed: 2 additions & 1 deletion

File tree

test/correctness/fast_function_approximations.cpp

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -324,7 +324,8 @@ int main(int argc, char **argv) {
324324
if (target.has_gpu_feature()) {
325325
Var io, ii;
326326
ref_func.never_partition_all();
327-
ref_func.gpu_tile(i, io, ii, 256, TailStrategy::ShiftInwards);
327+
// also vectorize to make sure that works on GPU as well...
328+
ref_func.gpu_tile(i, io, ii, 256, TailStrategy::ShiftInwards).vectorize(ii, 2);
328329
ref_func.realize(out_approx);
329330
out_approx.copy_to_host();
330331

0 commit comments

Comments
 (0)