[FEATURE] Audio normalization by Gautzilla · Pull Request #268 · Project-OSmOSE/OSEkit

Gautzilla · 2025-08-29T10:42:59Z

⚠️ Obsolete description, see the updated comment below

🐳 What's new?

The fetched audio data can now be normalized according to 3 presets:

Preset	Definition
raw	$x$
dc_reject	$x-\overline{x}$
zscore	$\frac{x-\overline{x}}{\sigma (x)}$

The automatic DC rejection in the AudioData.get_value() method was removed, as it is now controllable through the normalization property.

🐳 How to use it?

The normalization property on AudioData, AudioDataset and Analysis can be set to either "raw", "dc_reject" or "zscore".
The fetched audio data will then be normalized accordingly when AudioData.get_value() is called.

🐬 Core API

Simply set the normalization property of AudioData or AudioDataset objects:

from osekit.core_api.audio_dataset import AudioDataset

ads = AudioDataset(
    ...,
    normalization="zscore"
)

ads.write(...)  # The written audio files will contain z-score-normalized data.

🐬 Public API

Simply set the normalization property of the Analysis object:

from osekit.public_api.dataset import Dataset
from osekit.public_api.analysis import Analysis

dataset = Dataset(...)

analysis = Analysis(
    ...,
    normalization="zscore",
)

dataset.run_analysis(analysis=analysis)  # The audio data is converted to z-scores during the analysis

Gautzilla · 2025-08-29T10:43:42Z

The PR is still in draft mode as I have to include normalization in the docs.

mathieudpnt · 2025-09-02T09:31:46Z

As we discussed, this might benefit from adding a "full-scale" normalization mode

Gautzilla · 2025-09-03T12:39:55Z

🐳 What's new?

The fetched audio data can now be normalized according to 4 presets given by the osekit.utils.audio_utils.Normalization flag:

Preset	Definition
Normalization.RAW	$x$
Normalization.DC_REJECT	$x-\overline{x}$
Normalization.PEAK	$\frac{x}{x_\text{max}}$
Normalization.ZSCORE	$\frac{x-\overline{x}}{\sigma (x)}$

The automatic DC rejection in the AudioData.get_value() method was removed, as it is now controllable through the normalization property.
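To make the definitions in the table above concrete, here is a minimal sketch of the three non-trivial presets on plain Python lists. It is not OSEkit's implementation: the function names are hypothetical, $\sigma(x)$ is assumed to be the population standard deviation, and the peak is assumed to be the largest absolute sample value (which is what the "fix peak normalization with negative values" commit seems to suggest).

```python
from statistics import fmean, pstdev


def dc_reject(x):
    """Subtract the mean so the signal is centered on zero."""
    m = fmean(x)
    return [v - m for v in x]


def peak(x):
    """Divide by the peak value (assumed here: largest absolute sample)."""
    p = max(abs(v) for v in x)
    return [v / p for v in x]


def zscore(x):
    """Center on zero, then divide by the (population) standard deviation."""
    m, s = fmean(x), pstdev(x)
    return [(v - m) / s for v in x]


samples = [1.0, 2.0, 3.0, 6.0]
print(dc_reject(samples))  # mean is 3.0 → [-2.0, -1.0, 0.0, 3.0]
print(peak([-4.0, 2.0]))   # peak is 4.0 → [-1.0, 0.5]
print(zscore(samples))     # zero mean, unit standard deviation
```

The sketch does not guard against silent or constant signals, for which the peak or the standard deviation is zero.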

🐳 How to use it?

The normalization property on AudioData, AudioDataset and Analysis can be set to any Normalization flag.
The fetched audio data will then be normalized accordingly when AudioData.get_value() is called.

⚠️ Normalization.DC_REJECT can be combined with any other (single) normalization, but any other combination will raise a ValueError:

from osekit.utils.audio_utils import Normalization

n = Normalization.DC_REJECT | Normalization.PEAK  # OK
n = Normalization.DC_REJECT | Normalization.ZSCORE  # OK
n = Normalization.ZSCORE | Normalization.PEAK  # raises a ValueError
n = Normalization.DC_REJECT | Normalization.ZSCORE | Normalization.PEAK  # raises a ValueError
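The combination rule above can be sketched with a standard-library enum.Flag. This is only an illustration: Norm and check_combination are hypothetical names, the bit values are assumptions, and the check is done in a helper function, whereas the PR validates combinations through a metaclass.

```python
from enum import Flag


class Norm(Flag):
    """Hypothetical stand-in for osekit.utils.audio_utils.Normalization."""

    RAW = 0
    DC_REJECT = 1
    PEAK = 2
    ZSCORE = 4


def check_combination(n: Norm) -> Norm:
    """Return n if it is DC_REJECT plus at most one other normalization."""
    # Mask out the DC_REJECT bit, then count the flags that are still set.
    remaining = n.value & ~Norm.DC_REJECT.value
    if bin(remaining).count("1") > 1:
        raise ValueError(f"Invalid normalization combination: {n}")
    return n


check_combination(Norm.DC_REJECT | Norm.PEAK)  # accepted
try:
    check_combination(Norm.ZSCORE | Norm.PEAK)  # two scaling modes at once
except ValueError as err:
    print(err)
```

Masking out DC_REJECT first keeps the rule to a single bit count: whatever remains must be at most one scaling mode.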

🐬 Core API

Simply set the normalization property of AudioData or AudioDataset objects:

from osekit.core_api.audio_dataset import AudioDataset
from osekit.utils.audio_utils import Normalization

ads = AudioDataset(
    ...,
    normalization=Normalization.ZSCORE
)

ads.write(...)  # The written audio files will contain z-score-normalized data.

🐬 Public API

Simply set the normalization property of the Analysis object:

from osekit.public_api.dataset import Dataset
from osekit.public_api.analysis import Analysis
from osekit.utils.audio_utils import Normalization

dataset = Dataset(...)

analysis = Analysis(
    ...,
    normalization=Normalization.ZSCORE,
)

dataset.run_analysis(analysis=analysis)  # The audio data is converted to z-scores during the analysis

mathieudpnt · 2025-09-08T10:01:47Z

Gautzilla · 2025-09-11T16:01:04Z

Resolving conflicts in notebooks looks buggy as f, so I had to make a few tweaks and now I need your doppelganger to approve the PR

Gautzilla added 10 commits August 28, 2025 11:37

add normalization functions to audio_utils module

5f2755d

add normalization util tests

05afcc1

add AudioData.normalization property

a0b9c91

remove AudioData.get_value reject_dc parameter

454f972

add AudioData normalization serialization

2980418

add AudioData normalization serialization tests

6b88cab

add normalization to AudioDataset

c4b2752

add AudioData and AudioDataset normalization tests

8f2f6b3

add AudioDataset normalization serialization tests

b74a395

add public_api normalization in analysis

0ec5eba

Gautzilla added 6 commits September 1, 2025 09:30

add AudioData.normalization in the docs

1db6340

add AudioDataset.normalization in the docs

640ff7f

add public API normalization in doc

fe4a0d0

add AudioDataset.from_folder sample_rate parameter

65e0aa9

add normalization in doc notebooks

76b7a40

remove reset cell from public LTAS notebook

cb04f13

Gautzilla marked this pull request as ready for review September 1, 2025 08:43

mathieudpnt mentioned this pull request Sep 1, 2025

normalize audio segments for APLOSE campaign so the annotator is able to clearly hear the wav segment. Useful for annotation from exterior users #271

Closed

Gautzilla added 10 commits September 2, 2025 17:18

change normalization to a Flag

0861c58

adapt normalization test to new normalization system

cf32e72

add combined normalization test

9cd31f2

use metaclass to check normalization validity on call

3df6e01

use new Normalization flag in AudioData

bf0a84d

use new Normalization flag in AudioDataset

9147277

use Normalization flag in the public API

0392ee4

use Normalization flag in example notebooks

b99efcf

update docs with Normalization flag

93f82b4

add Normalization flag to API doc

54f8ccd

Gauthier BERTHOMIEU added 2 commits September 5, 2025 11:36

add negative peak tests

f62386c

fix peak normalization with negative values

c04c0f5

Merge branch 'main' into feature/audio-normalization

9259f42

mathieudpnt assigned Gautzilla Sep 11, 2025

Gautzilla requested a review from mathieudpnt September 11, 2025 08:21

mathieudpnt approved these changes Sep 11, 2025

Gauthier BERTHOMIEU added 2 commits September 11, 2025 17:12

move Normalization import to idoine cell

60c0634

Merge branch 'main' into feature/audio-normalization

009c6f3

Gautzilla requested a review from mathieudpnt September 11, 2025 15:59

mathieudpnt approved these changes Sep 15, 2025

Merge branch 'main' into feature/audio-normalization

3760c7e

mathieudpnt merged commit e1627eb into Project-OSmOSE:main Sep 15, 2025
1 check passed

Gautzilla deleted the feature/audio-normalization branch September 15, 2025 13:23

mathieudpnt merged 32 commits into Project-OSmOSE:main from Gautzilla:feature/audio-normalization