fix(ml): use SafeInt checked arithmetic in ML operator coefficient size validation by tianleiwu · Pull Request #28001 · microsoft/onnxruntime

tianleiwu · 2026-04-07T18:50:29Z

Replaces unchecked static_cast<size_t> multiplications in coefficient/support-vector size guards with SafeInt<size_t> arithmetic. Without this, a crafted model with adversarial dimension values could cause the product to silently wrap around, bypassing the validation entirely before downstream GEMM/dot-product reads.

Also improves the SVMClassifier feature-count mismatch error to include expected feature_count, actual num_features, input_rank, and num_batches for easier diagnosis.

Changes

linearclassifier.cc — SafeInt<size_t>(class_count_) * SafeInt<size_t>(num_features) with try/catch returning INVALID_ARGUMENT on overflow
linearregressor.cc — Same pattern for num_targets_ * num_features
svmclassifier.cc — SafeInt overflow protection for both the linear (class_count_ * num_features) and non-linear (vector_count_ * num_features) branches; descriptive error message on feature-count mismatch
svmclassifier_test.cc — Update assertion substring to match the new error message

Motivation and Context

Unchecked integer multiplication in size-validation guards can be bypassed by a crafted model whose dimension values wrap around size_t, effectively disabling the bounds check before out-of-bounds memory reads occur downstream. This hardens the existing validation added for LinearClassifier, LinearRegressor, SVMClassifier, and SVMRegressor.

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds runtime shape/attribute validation for CPU ML operators so malformed model metadata (e.g., undersized coefficients/support vectors) is rejected early, preventing potential out-of-bounds reads. Also adds regression tests for the new validation paths.

Changes:

Add runtime checks for coefficient and support-vector buffer sizes based on runtime num_features for Linear* and SVM* operators.
Tighten SVMRegressor coefficient validation against n_supports and validate input rank before reading feature dimension.
Add negative tests covering invalid coefficient sizes and feature-count mismatches.

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
onnxruntime/core/providers/cpu/ml/linearclassifier.cc	Validates `coefficients_` matches `class_count_ * num_features` at runtime.
onnxruntime/core/providers/cpu/ml/linearregressor.cc	Validates `coefficients_` matches `num_targets_ * num_features` before output/GEMM.
onnxruntime/core/providers/cpu/ml/svmclassifier.cc	Validates input rank/features and validates attribute buffer sizes against runtime feature count.
onnxruntime/core/providers/cpu/ml/svmregressor.cc	Tightens `coefficients_` size check and validates input rank before feature extraction.
onnxruntime/test/providers/cpu/ml/linearclassifer_test.cc	Adds failing LinearClassifier case for undersized coefficients.
onnxruntime/test/providers/cpu/ml/linearregressor_test.cc	Adds failing LinearRegressor case for undersized coefficients.
onnxruntime/test/providers/cpu/ml/svmclassifier_test.cc	Adds failing SVMClassifier case for input feature-count mismatch.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

tianleiwu · 2026-04-07T22:53:11Z

@copilot apply changes based on the comments in this thread

Copilot · 2026-04-07T22:58:42Z

@copilot apply changes based on the comments in this thread

Applied all four review suggestions in commit 480b3d7:

linearclassifier.cc, linearregressor.cc, svmclassifier.cc (linear/SVM size checks): Replaced unchecked static_cast<size_t> multiplication with SafeInt<size_t> wrapped in try/catch(...), returning INVALID_ARGUMENT with a descriptive message on overflow.
svmclassifier.cc feature-count check: Expanded the error message to include expected feature_count, actual num_features, input_rank, and num_batches.
svmclassifier_test.cc: Updated SVMClassifierInvalidInputFeatureCount test to assert on the new "Invalid input for SVMClassifier" substring.

Note: catch (...) is used (matching the reviewer's own suggestion) because ORT's safeint.h overrides the SafeInt exception handler to throw OnnxRuntimeException — there is no standalone SafeIntException type in this codebase.

vraspar

SVMClassifier constructor still has unchecked overflow

The constructor computescoefficient and classifier counts with bare static_cast<size_t> multiplications:

const size_t expected_coefficients = static_cast<size_t>(class_count_ - 1) * static_cast<size_t>(vector_count_);
const size_t num_classifiers = static_cast<size_t>(class_count_) * static_cast<size_t>(class_count_ - 1) / 2;

In SVC mode, Compute relies on the constructor's validation of coefficients_, rho_, prob_a_, and prob_b_ sizes. The new SafeInt checks in Compute only cover support_vectors_ for SVC mode. If an attacker crafts class_count_ and vector_count_ values that overflow expected_coefficients to a small number, the constructor check passes, and Compute later does OOB reads on coefficients_.

Fix is straightforward -- wrap both multiplications in SafeInt the same way the Compute methods do it:

const size_t expected_coefficients = SafeInt<size_t>(class_count_ - 1) * static_cast<size_t>(vector_count_);
const size_t num_classifiers = SafeInt<size_t>(class_count_) * static_cast<size_t>(class_count_ - 1) / 2;

This needs to land in the same PR since the constructor path is part of the same attack surface.

No overflow regression tests

The new negative tests all check "wrong size" mismatches, which is useful but doesn't exercise the overflow path. That's the actual security scenario this PR is fixing. At least one test should use dimension values large enough to overflow size_t multiplication without requiring a huge allocation. For example, for LinearClassifier you could set coefficients to a small vector but set dimensions such that class_count * feature_count overflows to match that small size. The test passes if the operator rejects the model with an error instead of silently accepting the overflowed value.

Without this, we don't have regression coverage for the specific bug being fixed.

Minor notes

SVMRegressor still uses a generic "Invalid argument" message in its size check, while LinearClassifier, LinearRegressor, and SVMClassifier now have descriptive messages. Not blocking, but it would be nice to make it consistent in the same pass.

Separately, rank validation is inconsistent across the four operators. SVMClassifier/SVMRegressor check input_rank > 0 && input_rank <= 2, LinearClassifier only checks dims == 0, and LinearRegressor checks > 2 but not == 0. This is pre-existing and out of scope for this PR, just flagging it.

…cient size validation Agent-Logs-Url: https://github.com/microsoft/onnxruntime/sessions/bfad8ac3-b618-42ef-9653-527b2df8e313 Co-authored-by: tianleiwu <30328909+tianleiwu@users.noreply.github.com>

…dation

Copilot

Pull request overview

Copilot reviewed 7 out of 7 changed files in this pull request and generated 5 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

ONNX models may have more coefficients than needed for a given input dimension (e.g. logreg_iris.onnx has 12 coefficients for 3 classes but input has only 2 features). The safety check only needs to ensure the coefficient array is large enough to prevent OOB reads, not exactly equal.

- Replace try/catch SafeInt with SafeMultiply for builds where exceptions are disabled (minimal build). - Fix wasm -Wshorten-64-to-32 by using SafeInt<size_t> for resize args. - Add explicit safeint.h include.

tianleiwu requested review from Copilot, vraspar and xadupre and removed request for vraspar and xadupre April 7, 2026 18:51

Copilot AI reviewed Apr 7, 2026

View reviewed changes

Copilot started reviewing on behalf of tianleiwu April 7, 2026 19:11 View session

Copilot started work on behalf of tianleiwu April 7, 2026 22:53 View session

Copilot AI changed the title ~~Validate ML model coefficient sizes~~ fix(ml): use SafeInt checked arithmetic in ML operator coefficient size validation Apr 7, 2026

Copilot finished work on behalf of tianleiwu April 7, 2026 22:59

vraspar requested changes Apr 9, 2026

View reviewed changes

tianleiwu and others added 4 commits April 13, 2026 13:46

fix(ml): validate linear model coefficient sizes

a4e78df

fix(ml): use SafeInt checked arithmetic to prevent overflow in coeffi…

31bb123

…cient size validation Agent-Logs-Url: https://github.com/microsoft/onnxruntime/sessions/bfad8ac3-b618-42ef-9653-527b2df8e313 Co-authored-by: tianleiwu <30328909+tianleiwu@users.noreply.github.com>

review feedback

7048557

fix: resolve conflict with #27989 - remove duplicate coefficient vali…

cc19972

…dation

tianleiwu force-pushed the tlwu/20260407/fix_linear_classifier branch from fa5b942 to cc19972 Compare April 13, 2026 20:52

tianleiwu requested review from Copilot and vraspar April 13, 2026 20:53

Copilot AI reviewed Apr 13, 2026

View reviewed changes

Copilot started reviewing on behalf of tianleiwu April 13, 2026 21:02 View session

tianleiwu added 3 commits April 13, 2026 15:39

fix: use SafeMultiply instead of try/catch for exception-disabled builds

631d62e

- Replace try/catch SafeInt with SafeMultiply for builds where exceptions are disabled (minimal build). - Fix wasm -Wshorten-64-to-32 by using SafeInt<size_t> for resize args. - Add explicit safeint.h include.

Merge branch 'main' into tlwu/20260407/fix_linear_classifier

b9dbf03

vraspar approved these changes Apr 16, 2026

View reviewed changes

vraspar reviewed Apr 16, 2026

View reviewed changes

Comment thread onnxruntime/core/providers/cpu/ml/svmclassifier.cc

Comment thread onnxruntime/test/providers/cpu/ml/svmclassifier_test.cc

tianleiwu merged commit 7063e74 into main Apr 17, 2026
96 checks passed

tianleiwu deleted the tlwu/20260407/fix_linear_classifier branch April 17, 2026 01:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(ml): use SafeInt checked arithmetic in ML operator coefficient size validation#28001

fix(ml): use SafeInt checked arithmetic in ML operator coefficient size validation#28001
tianleiwu merged 7 commits intomainfrom
tlwu/20260407/fix_linear_classifier

tianleiwu commented Apr 7, 2026 •

edited by Copilot AI

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tianleiwu commented Apr 7, 2026

Uh oh!

Copilot AI commented Apr 7, 2026

Uh oh!

vraspar left a comment

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

tianleiwu commented Apr 7, 2026 • edited by Copilot AI Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Motivation and Context

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tianleiwu commented Apr 7, 2026

Uh oh!

Copilot AI commented Apr 7, 2026

Uh oh!

vraspar left a comment

Choose a reason for hiding this comment

SVMClassifier constructor still has unchecked overflow

No overflow regression tests

Minor notes

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

tianleiwu commented Apr 7, 2026 •

edited by Copilot AI

Loading