fix: LRL1 results now reproducible by immu4989 · Pull Request #1546 · microsoft/FLAML

immu4989 · 2026-05-11T18:09:58Z

Why are these changes needed?

Summary

Seed random_state on LRL1Classifier so LogisticRegression(solver="saga", penalty="l1") produces deterministic results across runs.
Uses the same defensive pattern as fix: SGD results now reproducible #1541: pop the FLAML-internal random_seed key from self.params, and only set random_state when the caller has not already provided one.
Uncomment "lrl1" in both classification reproducibility test parametrize lists (it was previously disabled).

Why

LRL1Classifier defaults to solver="saga", a stochastic-gradient solver that shuffles samples each pass. Without random_state, identical fits produce different results — same root cause as SGD (#1541). LRL2Classifier is unaffected since it uses the deterministic lbfgs solver.

Test plan

pytest test/automl/test_classification.py -k "reproducibility and lrl1" — both wrapper and underlying-model tests pass
pre-commit run --files flaml/automl/model.py test/automl/test_classification.py — all hooks pass
CI green on the PR

Related issue number

Follows the same pattern as LGBM (#1369), CatBoost (#1364), ElasticNet (#1374), LinearSVC (#1376), and SGD (#1541).

Checks

I've used pre-commit to lint the changes in this PR (note the same in integrated in our CI checks).
I've included any doc changes needed for https://microsoft.github.io/FLAML/. See https://microsoft.github.io/FLAML/docs/Contribute#documentation to build and test documentation locally.
I've added tests (if relevant) corresponding to the changes introduced in this PR.
I've made sure all auto checks have passed.

Copilot

Pull request overview

This PR improves determinism for FLAML’s lrl1 estimator by ensuring LRL1Classifier seeds scikit-learn’s stochastic LogisticRegression(solver="saga", penalty="l1"), and re-enables reproducibility coverage for lrl1 in the classification test suite.

Changes:

Set LRL1Classifier.params["random_state"] from FLAML’s internal random_seed (defaulting to 10242048) when the caller hasn’t explicitly provided random_state.
Remove (pop) random_seed from LRL1Classifier.params to avoid passing an unsupported parameter into scikit-learn constructors.
Re-enable lrl1 in both classification reproducibility parametrized test lists.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

File	Description
`flaml/automl/model.py`	Seeds `LRL1Classifier`’s underlying `LogisticRegression` via `random_state` for deterministic `saga` behavior.
`test/automl/test_classification.py`	Re-enables `lrl1` in reproducibility test parametrizations to prevent regressions.

fix: LRL1 results now reproducible

98f064f

thinkall requested a review from Copilot May 12, 2026 01:34

Copilot started reviewing on behalf of thinkall May 12, 2026 01:34 View session

Copilot AI reviewed May 12, 2026

View reviewed changes

thinkall approved these changes May 12, 2026

View reviewed changes

thinkall merged commit 4dc2de8 into microsoft:main May 12, 2026
17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: LRL1 results now reproducible#1546

fix: LRL1 results now reproducible#1546
thinkall merged 1 commit into
microsoft:mainfrom
immu4989:flaml-fix-lrl1-reproducibility

immu4989 commented May 11, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

immu4989 commented May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why are these changes needed?

Summary

Why

Test plan

Related issue number

Checks

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

immu4989 commented May 11, 2026 •

edited

Loading