@@ -345,6 +345,7 @@ Index is 1-indexed. Shape must be compile-time constant.
 - `order`: Optional tuple specifying the logical-to-physical dimension mapping (1-indexed).
   For example, `order=(2, 1)` indicates dimension 2 is contiguous in memory,
   enabling coalesced loads from transposed/permuted arrays.
+  `index[i]` and `shape[i]` describe tile dim `i`, which maps to source dim `order[i]`.
   Default: `nothing` → identity `(1, 2, ..., N)`.
 
 # Padding Modes
@@ -366,10 +367,10 @@ outside the array, the behavior is undefined regardless of `padding_mode`.
 
 # Example
 ```julia
-tile = ct.load(arr, (bid,), (TILE_N[],); padding_mode=ct.PaddingMode.Zero, latency=3)
+tile = ct.load(arr, (bid,), (TILE_N,); padding_mode=ct.PaddingMode.Zero, latency=3)
 
 # Load from a transposed array with coalesced access
-tile = ct.load(arr, (bidx, bidy), (TM, TN); order=(2, 1))
+tile = ct.load(arr, (bidy, bidx), (TN, TM); order=(2, 1))
 ```
 """
 @inline function load(arr::TileArray, index, shape::NTuple{<:Any, Int};
@@ -442,6 +443,7 @@ behavior is undefined.
 # Dimension Ordering
 - `order`: Optional tuple specifying the logical-to-physical dimension mapping (1-indexed).
   Must match the `order` used in the corresponding `load` for permuted arrays.
+  `index[i]` and `shape[i]` describe tile dim `i`, which maps to destination dim `order[i]`.
   Default: `nothing` → identity `(1, 2, ..., N)`.
 
 # Optimization Hints
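To make the `order` mapping in the docstrings above concrete, here is a minimal plain-Julia sketch of how `index[i]` and `shape[i]` could translate into element ranges of the source array. It does not call the library. `source_ranges` is a hypothetical helper, and it assumes that `index` counts whole tiles (1-indexed) and that no padding or bounds handling is involved.

```julia
# Hypothetical helper, not part of the library: for each source dimension,
# return the element range covered by the tile.
# Assumption: index[i] counts whole tiles along tile dim i (1-indexed).
function source_ranges(index::NTuple{N,Int}, shape::NTuple{N,Int},
                       order::NTuple{N,Int}=ntuple(identity, N)) where {N}
    ranges = Vector{UnitRange{Int}}(undef, N)
    for i in 1:N
        first_elem = (index[i] - 1) * shape[i] + 1
        # Tile dim i maps to source dim order[i].
        ranges[order[i]] = first_elem:(first_elem + shape[i] - 1)
    end
    return Tuple(ranges)
end

# Tile dim 1 (index 3, extent 4) lands on source dim 2;
# tile dim 2 (index 1, extent 8) lands on source dim 1.
source_ranges((3, 1), (4, 8), (2, 1))  # (1:8, 9:12)
```

Under these assumptions, swapping both `index` and `shape` together with `order=(2, 1)`, as the corrected example in the diff does, keeps each tile extent paired with the source dimension it actually spans.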