Simplify GPU quantile tests and tighten sketch container ownership by RAMitchell · Pull Request #12160 · dmlc/xgboost

RAMitchell · 2026-04-13T12:06:46Z

Summary

This continues the GPU quantile test rewrite and tightens the GPU SketchContainer
interface so the sketch owns more of its own sizing and maintenance behavior.

What Changed

simplify the low-level GPU sketch tests in tests/cpp/common/test_quantile.cu
- use direct SketchContainer::Push(...) input for push/prune/merge coverage
- remove tests that belonged at the wrapper/cut layer instead of the sketch layer
- keep explicit duplicate/rank coverage for Push(...)
align GPU property-style cut tests with the shared CPU helper layer
- shared container cases
- shared cut invariant validation
tighten the GPU sketch container interface
- SketchContainer::Push(...) now computes its own batch-local cut layout
- internal maintenance prune uses a per-feature bound
- remove external post-batch prune from QuantileDMatrix
remove redundant GPU wrapper tests from tests/cpp/common/test_hist_util.cu
simplify categorical dedup bookkeeping by updating the new column scan buffer directly

Testing

Built testxgboost with CUDA and ran focused GPU coverage, including:

GPUQuantile.*
GPUQuantileProperty.Invariants
MGPUQuantileTest.SameOnAllWorkers
focused remaining HistUtil GPU tests

Local result:

focused GPU low-level slice passed
SameOnAllWorkers still skips locally without federated/NCCL support

…-strategy

…-strategy # Conflicts: # tests/cpp/common/test_hist_util.cu

Copilot

Pull request overview

This PR refactors GPU quantile sketch tests and tightens the GPU SketchContainer interface so the container computes its own per-batch cut layout and owns more of its maintenance behavior.

Changes:

Simplify/realign GPU quantile tests to exercise SketchContainer::Push/Prune/Merge/AllReduce more directly and share CPU/GPU cut invariant validation via helpers.
Update GPU sketch container API so Push(...) computes batch-local cut pointers internally; remove redundant external prune in QuantileDMatrix.
Simplify categorical dedup bookkeeping by updating the column scan buffer directly; remove redundant GPU wrapper tests from test_hist_util.cu.

Reviewed changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
tests/cpp/common/test_quantile.h	Updates helper header guard and seed generation used by quantile tests.
tests/cpp/common/test_quantile.cu	Major GPU quantile test rewrite: new synthetic batch helpers + property-style tests; updated calls to new `SketchContainer::Push` signature.
tests/cpp/common/test_quantile.cc	Replaces local container-invariant logic with shared `ValidateContainerCuts` helper.
tests/cpp/common/test_quantile_helpers.h	Adds shared `ValidateContainerCuts(...)` helper used by both CPU/GPU tests.
tests/cpp/common/test_hist_util.cu	Removes redundant DeviceSketch wrapper tests; updates categorical dedup test for new API.
src/data/quantile_dmatrix.cu	Removes per-batch external prune now handled internally by the sketch container.
src/common/quantile.cuh	Tightens `SketchContainer::Push` signature; renames intermediate sizing helper to per-feature semantics.
src/common/quantile.cu	Implements internal batch-local cut pointer construction and prunes using a per-feature bound.
src/common/hist_util.cuh	Removes external cut-pointer plumbing; relies on `SketchContainer::Push` to compute cut layout.
src/common/hist_util.cu	Simplifies categorical dedup to update column scan only; updates weighted sketch path to new Push API.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot

Pull request overview

Copilot reviewed 11 out of 11 changed files in this pull request and generated 4 comments.

Comments suppressed due to low confidence (1)

src/common/quantile.cuh:157

SketchContainer::Prune(Context const* ctx, size_t to) treats to as a per-feature cap (it does length = std::min(length, to) for each column). The current docstring says "maximum size of pruned quantile" which reads like a global/total cap. Updating the comment to explicitly say "maximum entries per feature" would make the API contract clearer, especially now that Push() prunes internally using IntermediateCutsPerFeature().

  /**
   * @brief Prune the quantile structure.
   *
   * @param to The maximum size of pruned quantile.  If the size of quantile structure is
   *           already less than `to`, then no operation is performed.
   */
  void Prune(Context const* ctx, size_t to);

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

RAMitchell added 10 commits April 10, 2026 07:43

Add GPU quantile property tests

676aebc

Merge remote-tracking branch 'upstream/master' into quantile-gpu-test…

c3b9812

…-strategy

Share quantile cut invariants across CPU and GPU tests

f236a49

Simplify GPU quantile tests

b722265

Tighten GPU sketch container interface

73a8c12

Simplify GPU quantile sketch tests

90ebca5

Merge remote-tracking branch 'upstream/master' into quantile-gpu-test…

8a87d06

…-strategy # Conflicts: # tests/cpp/common/test_hist_util.cu

Adjust GPU sketch tests after upstream merge

c24a0d0

Restore duplicate coverage in GPU push test

1d7fe85

Simplify GPU categorical scan update

15d0c98

RAMitchell requested a review from Copilot April 13, 2026 12:07

Copilot started reviewing on behalf of RAMitchell April 13, 2026 12:08 View session

Copilot AI reviewed Apr 13, 2026

View reviewed changes

Comment thread tests/cpp/common/test_quantile.cu Outdated

RAMitchell added 4 commits April 13, 2026 05:23

Fix GPU merge duplicated test dimensions

28dbfb7

Fix CUDA min usage in sketch cut sizing

16d339d

Relax EllpackPageExt buffer equality

2192a8b

Use emplace_back in GPU quantile test helper

b6eefee

RAMitchell requested a review from Copilot April 14, 2026 09:00

Copilot started reviewing on behalf of RAMitchell April 14, 2026 09:01 View session

Fold quantile helpers into test_quantile header

a4d2487

Copilot AI reviewed Apr 14, 2026

View reviewed changes

Comment thread tests/cpp/data/test_sparse_page_dmatrix.cu

Comment thread tests/cpp/common/test_quantile.cu

Comment thread src/common/hist_util.cu

Comment thread src/common/hist_util.cuh

RAMitchell added 2 commits April 14, 2026 03:48

Fix FreeBSD include and Windows CUDA warning

8d1fdf8

Remove dead cut budget variables

989cf1f

RAMitchell requested a review from trivialfis April 15, 2026 07:14

RAMitchell marked this pull request as ready for review April 15, 2026 07:14

Avoid temporary in quantile test emplace_back

848402c

trivialfis approved these changes Apr 15, 2026

View reviewed changes

RAMitchell merged commit d6fbb76 into dmlc:master Apr 15, 2026
78 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Simplify GPU quantile tests and tighten sketch container ownership#12160

Simplify GPU quantile tests and tighten sketch container ownership#12160
RAMitchell merged 18 commits into
dmlc:masterfrom
RAMitchell:quantile-gpu-test-strategy

RAMitchell commented Apr 13, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Uh oh!

Conversation

RAMitchell commented Apr 13, 2026

Summary

What Changed

Testing

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants