Feature/add oversampled polyphase channelizer by tbensonatl · Pull Request #1151 · NVIDIA/MatX

tbensonatl · 2026-04-11T20:42:48Z

Add support for oversampling to the polyphase channelizer.

This update adds support for decimation factors (D) lower than the number of channels (M) in the polyphase channelizer. In all cases, the channelizer generates M outputs for every D inputs, and any trailing partial set of inputs is zero-padded to D elements. D == M is the maximally decimated, or critically sampled, case, which was already supported. With maximal decimation, the channel frequency bands partition the frequency space, so the frequency support of neighboring channels is adjacent but not overlapping. The oversampled cases (D < M) keep the same channel center frequencies, but the channels have some overlap.
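To make the M-outputs-per-D-inputs relationship concrete, here is a minimal direct-form sketch in Python. It is a textbook analysis filter bank (mix each channel to baseband, low-pass filter, keep every D-th sample), not the polyphase kernel from this PR. Its sample alignment and phase convention are assumptions and may differ from the Harris convention that MatX follows. `channelize_direct` and the boxcar prototype filter are hypothetical choices made only for illustration.

```python
import cmath

def channelize_direct(x, h, M, D):
    """Direct-form M-channel analysis bank with decimation D.

    Hypothetical reference helper, not the MatX kernel: channel k is mixed
    to baseband, filtered with h, and sampled every D inputs.
    """
    n_out = (len(x) + D - 1) // D              # per-channel output length
    out = [[0j] * n_out for _ in range(M)]
    for k in range(M):
        for n in range(n_out):
            acc = 0j
            for p, tap in enumerate(h):
                i = n * D - p                  # input index; out-of-range samples count as zero
                if 0 <= i < len(x):
                    acc += tap * x[i] * cmath.exp(-2j * cmath.pi * k * i / M)
            out[k][n] = acc
    return out

M, D = 8, 4                                    # D < M: 2x oversampled
x = [cmath.exp(2j * cmath.pi * 3 * i / M) for i in range(32)]  # tone at channel 3's center
h = [1.0 / M] * M                              # crude boxcar prototype, illustration only
y = channelize_direct(x, h, M, D)
print(len(y), len(y[0]))                       # M channels, ceil(32 / D) samples each
print(abs(y[3][5]), abs(y[2][5]))              # channel 3 is close to 1, channel 2 close to 0
```

Changing D from 4 to 8 in this sketch leaves the channel centers at k/M untouched and only halves the number of output samples per channel.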

The per-channel length of the output tensor is (input_len + D - 1) / D using integer division, i.e. ceil(input_len / D). Thus, for example, with M=20 and D=10, the oversampling factor is 2x and the output tensor has twice as many samples per channel as with M=20 and D=20.
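The length formula and the M=20 example can be checked with a few lines of Python. This is a sketch only; `channelizer_output_len` is a hypothetical helper name, and the input length of 1000 is an arbitrary value chosen for the example.

```python
def channelizer_output_len(input_len, D):
    """Per-channel output length: ceil(input_len / D) via integer arithmetic."""
    return (input_len + D - 1) // D

M = 20
input_len = 1000                               # arbitrary example length
critical = channelizer_output_len(input_len, 20)     # D == M
oversampled = channelizer_output_len(input_len, 10)  # D == M / 2
print(critical, oversampled)                   # 50 100
print(oversampled / critical)                  # 2.0, the M / D oversampling factor

# A trailing partial block still produces one output per channel,
# consistent with zero-padding the last block to D elements.
print(channelizer_output_len(1001, 10))        # 101
```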

Add support for channelize_poly with decimation factors (D) less than the number of output channels (M). The case with D < M corresponds to oversampling where a set of M outputs is produced for each D inputs. Signed-off-by: Thomas Benson <tbenson@nvidia.com>

copy-pr-bot · 2026-04-11T20:42:53Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

tbensonatl · 2026-04-11T20:58:56Z

/build

greptile-apps · 2026-04-11T21:19:45Z

Greptile Summary

This PR adds oversampling support to the polyphase channelizer, enabling decimation factors D < M. It introduces a new tiled shared-memory kernel (ChannelizePoly1D_SmemTiled) supporting both the maximally-decimated and oversampled paths, a fused complex MAC helper (channelize_cmac), comprehensive tests covering integer/rational oversampling, large-M tiling, and batch modes, and a Python reference generator for cross-validation. Previously flagged P0/P1 issues (division-by-zero, alignment, loop-invariant s, misleading comment) are all addressed in the current HEAD.
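The signature of `channelize_cmac` is not shown on this page, so the following is only a sketch of what a fused complex multiply-accumulate generally computes. Each of the four update lines has the `acc ± a * b` shape that maps to a single fused multiply-add on hardware that provides one; plain Python arithmetic is used here and does not itself fuse the operations. The name `cmac` and the tuple representation are hypothetical.

```python
def cmac(acc, a, b):
    """Return acc + a * b for complex values stored as (re, im) tuples.

    Written as four multiply-add steps to mirror how a fused complex MAC
    is typically decomposed; not the MatX helper itself.
    """
    acc_re, acc_im = acc
    a_re, a_im = a
    b_re, b_im = b
    acc_re = acc_re + a_re * b_re   # real part, first product
    acc_re = acc_re - a_im * b_im   # real part, second product
    acc_im = acc_im + a_re * b_im   # imaginary part, first product
    acc_im = acc_im + a_im * b_re   # imaginary part, second product
    return (acc_re, acc_im)

# (1 + 2j) * (3 + 4j) = -5 + 10j
print(cmac((0.0, 0.0), (1.0, 2.0), (3.0, 4.0)))   # (-5.0, 10.0)
```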

Confidence Score: 5/5

Safe to merge; all previously flagged P0/P1 issues are resolved, and the remaining findings are P2 style/comment improvements.

All prior P0/P1 concerns (division-by-zero, smem alignment, loop-invariant hoisting, misleading inequality comment) are addressed. The new kernel logic, phase-rotation math, and dispatch hierarchy are sound. Remaining findings are a silently-ignored size parameter in the Python test generator and a misleading test comment — neither affects correctness of the production code.

test/test_vectors/generators/00_transforms.py — harris2003_oversampled_operators::channelize() ignores self.size

Important Files Changed

| Filename | Overview |
| --- | --- |
| include/matx/kernels/channelize_poly.cuh | Adds MaximallyDecimated template parameter to ChannelizePoly1D, new ChannelizePoly1D_SmemTiled kernel (D==M and D<M paths), and channelize_cmac FMA helper; logic appears correct with proper circular-buffer and phase-rotation handling |
| include/matx/transforms/channelize_poly.h | Dispatch logic updated to route oversampled inputs through SmemTiled or generic kernels; fused-DFT and Smem kernels correctly gated to D==M only |
| include/matx/operators/channelize_poly.h | Output dimension formula correctly updated from num_channels to decimation_factor; documentation substantially expanded with Harris convention explanation and reference |
| test/00_transform/ChannelizePoly.cu | Thorough new test suite (identity filter, integer/rational oversampling, large-M tiling, batched, complex filter, fallback path); one test comment incorrectly describes what triggers the fallback |
| test/test_vectors/generators/00_transforms.py | Adds channelize_oversampled reference and harris2003_oversampled_operators; harris class hardcodes M/D/input_len constants and silently ignores the size parameter passed from C++ |
| examples/channelize_poly_bench.cu | Refactored to accept explicit -M/-D flags with proper validation guards; atol() used for parsing without detecting invalid string inputs |
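The `atol()` remark in the last row is about silent acceptance: in C, `atol("abc")` returns 0 and `atol("12abc")` returns 12, so a malformed flag value goes unnoticed. As an illustration only, here is how strict parsing combined with a D <= M guard might look in Python. `parse_channelizer_args` is a hypothetical helper; the benchmark itself is C++/CUDA and its actual guards are not shown on this page.

```python
def parse_channelizer_args(m_text, d_text):
    """Parse and validate M (channels) and D (decimation factor).

    int() raises ValueError on non-numeric or partially numeric text,
    unlike C's atol(), which silently returns a partial or zero result.
    """
    M = int(m_text)
    D = int(d_text)
    if M < 1:
        raise ValueError(f"M must be positive, got {M}")
    if not 1 <= D <= M:
        raise ValueError(f"D must satisfy 1 <= D <= M, got D={D}, M={M}")
    return M, D

print(parse_channelizer_args("20", "10"))      # (20, 10)

for bad in (("12abc", "4"), ("8", "16"), ("8", "0")):
    try:
        parse_channelizer_args(*bad)
    except ValueError as err:
        print("rejected:", err)
```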

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A["channelize_poly_impl(in, filter, M, D)"] --> B{"real input\n& real filter?"}
    B -- yes --> C{"D==M and M<=6?"}
    B -- no --> D{"D==M and M<=6?"}
    C -- yes --> E["FusedChan kernel"]
    C -- no --> F{"D==M and Smem fits?"}
    F -- yes --> G["Smem kernel"]
    F -- no --> H{"SmemTiled eligible?"}
    H -- yes --> I["SmemTiled kernel"]
    H -- no --> J["Generic ChannelizePoly1D"]
    D -- yes --> E
    D -- no --> K{"D==M and Smem fits?"}
    K -- yes --> G
    K -- no --> L{"SmemTiled eligible?"}
    L -- yes --> I
    L -- no --> J
    I --> M{"MaximallyDecimated\nD==M?"}
    M -- yes --> N["Fixed phase per channel\nincremental buf_row advance"]
    M -- no --> O["phase = (c + t*D) % M\nK-rotation filter cache"]
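The phase expression in the oversampled branch of the flowchart, `phase = (c + t*D) % M`, can be explored with a short Python sketch. The reading of `c` as the channel (branch) index and `t` as the output time index is an assumption, as is the link between the sequence's period and the "K" in "K-rotation filter cache". The period M / gcd(M, D) itself is simply a property of the formula. Both function names are hypothetical.

```python
from math import gcd

def filter_phase(c, t, M, D):
    """Filter phase for branch c at output index t, per the flowchart formula."""
    return (c + t * D) % M

def rotation_period(M, D):
    """Number of output steps before the phase sequence repeats."""
    return M // gcd(M, D)

# Maximally decimated: t * D is a multiple of M, so the phase stays at c.
print([filter_phase(3, t, 20, 20) for t in range(4)])   # [3, 3, 3, 3]

# 2x oversampled (M=20, D=10): the phase alternates between two values.
print([filter_phase(3, t, 20, 10) for t in range(4)])   # [3, 13, 3, 13]
print(rotation_period(20, 10))                          # 2

# Rational oversampling (M=8, D=6): four distinct phases before repeating.
print([filter_phase(0, t, 8, 6) for t in range(5)])     # [0, 6, 4, 2, 0]
print(rotation_period(8, 6))                            # 4
```

This matches the split in the flowchart: with D == M a fixed phase per channel suffices, while D < M needs a small set of rotated phases per channel.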

Reviews (5): Last reviewed commit: "Make ::detail constants visible for host..."

tbensonatl · 2026-04-11T22:23:06Z

/build

tbensonatl · 2026-04-11T22:38:14Z

/build

tbensonatl · 2026-04-12T16:35:53Z

/build

coveralls · 2026-04-12T22:14:41Z

Coverage is 91.829% — feature/add-oversampled-polyphase-channelizer into main. No base build found for main.

cliffburdick

Awesome work!

tbensonatl added 10 commits March 29, 2026 17:15

Add constexpr path for M==D case in generic kernel

31609ad

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

Directly declare dynamic shared memory type as filter_t

139d98e

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

First version with working SmemTiled; still needs cleanup

70ede9d

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

Remove filter access fast-path

90c24da

Using a direct pointer for dense filter tensors offers a performance benefit, but adds complexity. It would be better to pursue adding traits to MatX to optimize ALU usage for memory access. Signed-off-by: Thomas Benson <tbenson@nvidia.com>

Cleanup and optimization path

b6a8cf9

Added flags to control the phase rotation assumption for the first output in the oversampled case. Signed-off-by: Thomas Benson <tbenson@nvidia.com>

Add branch remapping to more closely match the Harris convention

2b52af9

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

Remove FIRST_PHASE_OFFSET constant; stick with Harris convention

096379b

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

Update documentation and comments

def1f8a

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

Update channelize_poly_bench

2d1bb01

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

tbensonatl self-assigned this Apr 11, 2026

greptile-apps Bot reviewed Apr 11, 2026

Comment thread examples/channelize_poly_bench.cu Outdated

Comment thread include/matx/kernels/channelize_poly.cuh Outdated

Comment thread include/matx/kernels/channelize_poly.cuh

Comment thread include/matx/kernels/channelize_poly.cuh

Addressed greptile comments

7dc7cc8

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

greptile-apps Bot reviewed Apr 11, 2026

Comment thread examples/channelize_poly_bench.cu Outdated

Update channelize_poly_bench to explicitly take M/D arguments

e33e550

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

Tighten the use_32bit checks in channelize_poly

0ad4739

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

Make ::detail constants visible for host compilation

36f4940

Signed-off-by: Thomas Benson <tbenson@nvidia.com>

tbensonatl requested a review from cliffburdick April 12, 2026 22:14

cliffburdick reviewed Apr 13, 2026

Comment thread examples/channelize_poly_bench.cu

cliffburdick reviewed Apr 13, 2026

Comment thread include/matx/kernels/channelize_poly.cuh

cliffburdick approved these changes Apr 13, 2026

tbensonatl merged commit 013c856 into main Apr 13, 2026
1 check passed

cliffburdick deleted the feature/add-oversampled-polyphase-channelizer branch April 13, 2026 16:13

tbensonatl merged 14 commits into main from feature/add-oversampled-polyphase-channelizer
