Softmax update by bugracyln · Pull Request #1494 · fastmachinelearning/hls4ml

bugracyln · 2026-06-25T18:50:11Z

Description

📝 Please include a summary of the change.

The softmax table generation logic was updated. The implementation for writing the softmax tables was revised, and memory attributes were added to enable a more efficient FPGA compilation flow. In addition, the templates were modified to use weights directly from the configuration.

Please also include relevant motivation and context.

The primary motivation for these changes was to bring the oneAPI backend closer to the Vivado backend in terms of implementation.

Memory attributes were added to enable memory banking on the FPGA, allowing for more efficient memory access. The weights are now copied directly into the configuration so that the compiler can recognise the entire table as a set of fixed values. This enables the memory to be implemented more efficiently, resulting in improved resource utilisation during FPGA compilation.
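For readers unfamiliar with table-based softmax, the idea of precomputing the exp and inv tables and emitting them as fixed values can be sketched as follows. This is an illustrative sketch only, not the code in this PR: the helper names `build_table` and `render_cpp_table`, the table size, the input ranges, and the emitted array layout are all assumptions.

```python
import math


def build_table(fn, lo, hi, size):
    """Sample fn at `size` evenly spaced points starting at lo (hi is excluded)."""
    step = (hi - lo) / size
    return [fn(lo + i * step) for i in range(size)]


def render_cpp_table(name, values, ctype="float"):
    """Render values as a C++ constant array initializer (hypothetical layout)."""
    body = ", ".join(f"{v:.8g}" for v in values)
    return f"static constexpr {ctype} {name}[{len(values)}] = {{{body}}};"


# Assumed ranges: exp inputs are shifted by the row maximum, so they are <= 0;
# the inv table covers the possible sums of the exp outputs.
exp_table = build_table(math.exp, -8.0, 0.0, 8)
inv_table = build_table(lambda s: 1.0 / s, 0.5, 4.5, 8)

print(render_cpp_table("exp_table", exp_table))
print(render_cpp_table("inv_table", inv_table))
```

Because every entry is a literal in the generated source, a compiler can in principle treat the whole table as constant data, which is the effect described above.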

List any dependencies that are required for this change.

N/A

Type of change

For a new feature or function, please create an issue first to discuss it
with us before submitting a pull request.

Note: Please delete options that are not relevant.

Bug fix (non-breaking change that fixes an issue)
Documentation update
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality not to work as expected)
A new research paper code implementation
Other (Specify)

Tests

📝 Please describe the tests that you ran to verify your changes.

The changes were primarily verified using black-box tests on an isolated softmax unit. Testing covered both the quantised and non-quantised implementations. For the quantised version, configurations both with and without explicit exp and inv table quantisers (QuantiserConfig(...)) were tested.

Additional testing included:

Generating FPGA RTL reports.
Building the emulator.
Performing a hardware compilation using the new Intel oneAPI compiler.

This PR currently supports only the Intel oneAPI compiler. Support for the Altera HLS compiler will be added in a future PR.

The implementation was also evaluated with different table sizes, and the resulting RTL reports were inspected to verify improvements in resource utilisation.

Provide instructions so we can reproduce.

A Python test file and a Keras model containing only a single softmax layer (Softmax or QSoftmax) were used. For the quantised implementation, the input and output quantisers for the exp and inv lookup tables were configured using QuantiserConfig(...). Tests were run with both the quantisers enabled and disabled.

The test configuration included:

Standard Softmax and QSoftmax models.
Explicit exp and inv table input/output quantisation.
FPGA RTL generation.
Emulator build.
Hardware compilation with the Altera HLS (newer version of Intel oneAPI) compiler.
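A black-box check of the kind listed above can be sketched in plain Python by comparing a table-based softmax against a floating-point reference for several table sizes. This is a stand-in for the actual test file, which is not included here: `table_softmax`, `max_abs_error`, the input ranges, and the table sizes are assumptions, and no hls4ml or Keras calls are involved.

```python
import math


def reference_softmax(xs):
    """Floating-point softmax with the usual max subtraction."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def table_softmax(xs, size, x_lo=-8.0, s_lo=0.5, s_hi=4.5):
    """Softmax via exp and inv lookup tables with `size` entries each (assumed ranges)."""
    exp_step = (0.0 - x_lo) / size
    inv_step = (s_hi - s_lo) / size
    exp_table = [math.exp(x_lo + i * exp_step) for i in range(size)]
    inv_table = [1.0 / (s_lo + i * inv_step) for i in range(size)]

    def index(value, lo, step):
        # Truncate to a table index and clamp it to the valid range.
        return max(0, min(size - 1, int((value - lo) / step)))

    m = max(xs)
    exps = [exp_table[index(x - m, x_lo, exp_step)] for x in xs]
    inv = inv_table[index(sum(exps), s_lo, inv_step)]
    return [e * inv for e in exps]


def max_abs_error(xs, size):
    """Largest element-wise deviation from the reference softmax."""
    ref = reference_softmax(xs)
    approx = table_softmax(xs, size)
    return max(abs(a - b) for a, b in zip(ref, approx))


inputs = [1.0, 2.0, 3.0, 0.5]
for size in (64, 256, 1024):
    print(size, max_abs_error(inputs, size))
```

The same comparison would apply to the output of a compiled model; only the source of the approximate values changes.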

Please also list any relevant details for your test configuration.

Test Configuration:

Checklist

I have read the guidelines for contributing.
I have commented my code, particularly in hard-to-understand areas.
I have made corresponding changes to the documentation.
My changes generate no new warnings.
I have installed and run pre-commit on the files I edited or added.
I have added tests that prove my fix is effective or that my feature works.

jmitrevs · 2026-06-25T18:56:40Z

We should probably squash the commits when merging. This should also be coordinated with #1476. We should see how best to do that.

JanFSchulte · 2026-06-25T19:03:00Z

Could we also get any form of description of what this is? I was about to close it as spam before I noticed that this was related to work by Lauri.

calad0i · 2026-06-25T21:07:07Z

I was also about to close it as spam until I saw the please-test tag... Could you add a description?

laurilaatu and others added 30 commits January 26, 2026 20:37

weights for dense

3d463b3

hgq2 homogeneous quant fix

d678573

Merge branch 'hgq2_homo_quant' of github.com:calad0i/hls4ml into oneapi_weights

77258bc

Changes required for oneAPI MHA

59bd96f

Original weight implementation

dbb207b

Merge branch 'main' of github.com:fastmachinelearning/hls4ml into oneapi_qmha

0c59255

Restore oneAPI weight placement

51efff0

pre-commit

6067bea

Merge branch 'main' into oneapi_qmha

06fda4e

Merge branch 'main' into oneapi_qmha

bf38a6b

Merge branch 'main' into oneapi_qmha

e27fd11

Merge branch 'main' into oneapi_qmha

9f4a448

softmax multidim templates

16ca197

Merge branch 'oneapi_qmha' of github.com:laurilaatu/hls4ml into oneapi_qmha

564b692

pre-commit

974e75a

uncomment

060c398

Merge branch 'main' into oneapi_qmha

f78558c

int_inp_t to config

772b93a

Merge branch 'oneapi_qmha' of github.com:laurilaatu/hls4ml into oneapi_qmha

d2b8921

Merge branch 'main' into oneapi_qmha

a1ad891

Merge branch 'main' into oneapi_qmha

d65544d

Merge branch 'main' into oneapi_qmha

2d6a5cc

softmax fixed

c3a4584

Merge branch 'main' into oneapi_qmha

9b1cf17

table generation cleanup

31b7ad6

Merge pull request fastmachinelearning#4 from bugracyln/smax_fix

70b19d1

Merge branch 'main' into oneapi_qmha

29bdbb3

Fix formatting of inp_norm_t name string

cab4cbc

pre-commit for core templates

42ece34

pre-commit all

7e2798a

bugracyln and others added 3 commits June 25, 2026 19:15

softmax update

bd4778e

minor syntax fix

3946858

Merge branch 'oneapi_qmha' into softmax_updated

be76917

jmitrevs added the "please test" label (Trigger testing by creating local PR branch) Jun 25, 2026

Merge branch 'main' into softmax_updated

189f64a

jmitrevs self-requested a review June 25, 2026 18:58

jmitrevs added and removed the "please test" label (Trigger testing by creating local PR branch) Jun 25, 2026

jmitrevs added and removed the "please test" label (Trigger testing by creating local PR branch) Jun 26, 2026


bugracyln wants to merge 34 commits into fastmachinelearning:main from bugracyln:softmax_updated


5 participants
