- `wav2mel.pt` is the preprocessing module, which is composed of two modules:
    - `sox_effects.pt` normalizes volume, removes silence, resamples audio to 16 kHz at 16 bits, and remixes all channels into a single channel
    - `log_melspectrogram.pt` transforms waveforms into log mel spectrograms
- `dvector.pt` is the speaker encoder

Since all the modules are compiled with [TorchScript](https://pytorch.org/docs/stable/jit.html), you can simply load them and use them anywhere **without any dependencies**.
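
As a rough illustration of why a TorchScript file can be used without the code that defined it, the sketch below compiles a small module, saves it, and loads it back with `torch.jit.load`. The `StandInEncoder` class, its layer sizes, and the file name `stand_in.pt` are hypothetical placeholders made up for this example; they are not the architecture or interface of `dvector.pt`.

```python
import os
import tempfile

import torch


class StandInEncoder(torch.nn.Module):
    """Hypothetical stand-in for a compiled module; NOT the real d-vector model."""

    def __init__(self, n_mels: int = 40, dim: int = 8):
        super().__init__()
        self.proj = torch.nn.Linear(n_mels, dim)

    def forward(self, mel: torch.Tensor) -> torch.Tensor:
        # Project each frame, average over time, then scale to unit L2 norm.
        emb = self.proj(mel).mean(dim=0)
        return emb / torch.sqrt((emb * emb).sum())


def roundtrip(path: str) -> torch.Tensor:
    # Compile the module with TorchScript and write it to a single file.
    torch.jit.script(StandInEncoder()).save(path)
    # Loading needs only the file, not the Python class definition above.
    loaded = torch.jit.load(path).eval()
    with torch.no_grad():
        # Dummy input shaped (frames, mel bins); the shape is an assumption.
        return loaded(torch.randn(100, 40))


with tempfile.TemporaryDirectory() as tmp:
    embedding = roundtrip(os.path.join(tmp, "stand_in.pt"))
```

With the released files, the loading step would presumably be the same call, for example `torch.jit.load("wav2mel.pt")`, assuming the file has been downloaded to the working directory. The inputs each module expects are not described in this section, so check the rest of the README before calling them.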

### Pretrained models & preprocessing modules

You can download them from the [*Releases*](https://github.com/yistLin/dvector/releases) page.