Better handle types and overflows in softmax (especially lagacy and stable) by jmitrevs · Pull Request #1476 · fastmachinelearning/hls4ml

jmitrevs · 2026-05-20T21:54:48Z

Description

The legacy softmax was prone to overflow, so this PR made it use an accumulator type. Also, for stable, the default types are better chosen. There are some changes in latency, as well, to avoid overflow, but my feeling is that latency can often behave badly, so this doesn't fix that.

Type of change

Bug fix (non-breaking change that fixes an issue)
New feature (non-breaking change which adds functionality)

Tests

The standard softmax tests should still pass. One could consider adding specific overflow tests, like we have in the current version of the brevitas tutorial.

Checklist

I have read the guidelines for contributing.
I have commented my code, particularly in hard-to-understand areas.
I have made corresponding changes to the documentation.
My changes generate no new warnings.
I have installed and run pre-commit on the files I edited or added.
I have added tests that prove my fix is effective or that my feature works.

Copilot

Pull request overview

This PR focuses on making the Vivado softmax implementations more robust against overflow by improving internal type handling (notably for legacy and latency variants), while also improving default type choices for stable softmax and adding auto-precision inference support for Softmax accumulators.

Changes:

Updated Vivado softmax templates to use accumulator types for summation / inversion-table addressing and to split exp/inv table types (legacy).
Updated Vivado stream softmax templates accordingly (latency/stable/legacy).
Added Softmax-specific accum_t inference in InferPrecisionTypes and adjusted FPGA backend Softmax default type attributes (unsigned table defaults, new inp_norm_t attribute).

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 4 comments.

File	Description
hls4ml/templates/vivado/nnet_utils/nnet_activation.h	Aligns softmax (latency/stable/legacy) internal types to reduce overflow risk and clarifies exp/inv table type usage.
hls4ml/templates/vivado/nnet_utils/nnet_activation_stream.h	Mirrors softmax type changes for streaming implementations (but currently has inconsistencies that need fixing).
hls4ml/model/optimizer/passes/infer_precision.py	Adds Softmax-specific accumulator type inference when precision is set to `auto`.
hls4ml/backends/fpga/fpga_backend.py	Updates Softmax type attribute defaults (unsigned tables, adds `inp_norm_t`, adjusts accumulator handling).

Comments suppressed due to low confidence (1)

hls4ml/templates/vivado/nnet_utils/nnet_activation_stream.h:295

This legacy stream implementation still uses CONFIG_T::table_size to compute/clamp index, but exp_table is now allocated with CONFIG_T::exp_table_size. If these differ (and they can, since softmax exposes separate exp/inv table sizes), index can go out of bounds or the scaling will be wrong. Use exp_table_size consistently for exp table addressing here.

                    auto data_round = (data_cache[j] - data_cache[i]) * CONFIG_T::table_size / 16;
                    int index = data_round + 8 * CONFIG_T::table_size / 16;
                    if (index < 0)
                        index = 0;
                    if (index > CONFIG_T::table_size - 1)

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

jmitrevs · 2026-06-08T21:58:27Z

I should also update the oneAPI backend. Truthfully that is a bit more involved. I will look into it this month, but have to work some hardware tests the next two weeks, so this for the moment will be lower priority.

jmitrevs · 2026-06-09T17:04:24Z

On the oneAPI (and Quartus) side there are already some softmax changes in PR #1432 so we need to coordinate.

jmitrevs added 3 commits May 20, 2026 12:08

fix overflow for legacy softmax, start looking at others

091cd8e

fix stable and streaming legacy softmax

6c9f9a7

minor softmax latency fixes

7a004b2

jmitrevs added the please test Trigger testing by creating local PR branch label May 20, 2026

jmitrevs mentioned this pull request May 23, 2026

Fix stable softmax exp_table indexing sign-bit inefficiency #1480

Closed

jmitrevs requested a review from Copilot June 6, 2026 20:31

Copilot started reviewing on behalf of jmitrevs June 6, 2026 20:31 View session

Copilot AI reviewed Jun 6, 2026

View reviewed changes

Comment thread hls4ml/templates/vivado/nnet_utils/nnet_activation_stream.h

Comment thread hls4ml/templates/vivado/nnet_utils/nnet_activation_stream.h Outdated

Comment thread hls4ml/model/optimizer/passes/infer_precision.py Outdated

Comment thread hls4ml/model/optimizer/passes/infer_precision.py

Merge branch 'main' into sofmax_fix

9d588f2

jmitrevs marked this pull request as draft June 8, 2026 21:48

fix copilot review issues (other than adding a test)

d5e3493

jmitrevs added please test Trigger testing by creating local PR branch and removed please test Trigger testing by creating local PR branch labels Jun 9, 2026

add test for softmax auto inferrence

a56a8d6

jmitrevs added please test Trigger testing by creating local PR branch and removed please test Trigger testing by creating local PR branch labels Jun 9, 2026

jmitrevs mentioned this pull request Jun 25, 2026

Softmax update #1494

Open

12 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Better handle types and overflows in softmax (especially lagacy and stable)#1476

Better handle types and overflows in softmax (especially lagacy and stable)#1476
jmitrevs wants to merge 6 commits into
fastmachinelearning:mainfrom
jmitrevs:sofmax_fix

jmitrevs commented May 20, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jmitrevs commented Jun 8, 2026 •

edited

Loading

Uh oh!

jmitrevs commented Jun 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

jmitrevs commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

Tests

Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jmitrevs commented Jun 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jmitrevs commented Jun 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jmitrevs commented May 20, 2026 •

edited

Loading

jmitrevs commented Jun 8, 2026 •

edited

Loading