-An improvised Residual FSQ, for an attempt to improve audio encoding.
+An improvised Residual FSQ, for an attempt to improve audio encoding.

 Credit goes to [@sekstini](https://github.com/sekstini) for originally incepting the idea [here](https://github.com/lucidrains/vector-quantize-pytorch/pull/74#issuecomment-1742048597)
@@ -506,7 +506,7 @@ from vector_quantize_pytorch import LatentQuantize
 quantizer = LatentQuantize(
     levels= [5, 5, 8], # number of levels per codebook dimension
     dim=16, # input dim
-    commitment_loss_weight=0.1,
+    commitment_loss_weight=0.1,
     quantization_loss_weight=0.1,
 )
@@ -530,7 +530,7 @@ from vector_quantize_pytorch import LatentQuantize
 quantizer = LatentQuantize(
     levels= [5, 5, 8],
     dim=16,
-    commitment_loss_weight=0.1,
+    commitment_loss_weight=0.1,
     quantization_loss_weight=0.1,
 )
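For readers of the two hunks above, here is a minimal sketch of what `levels= [5, 5, 8]` implies. It assumes, per the docstring in `latent_quantization.py`, that `codebook_dim` equals the length of `levels` and that `keep_num_codebooks_dim` falls back to `num_codebooks > 1` when not provided. The helper `describe_levels` and the key `grid_points` are hypothetical names used only for illustration and are not part of `vector_quantize_pytorch`; the grid-point count is simply the product of the per-dimension level counts.

```python
from math import prod
from typing import Optional, Sequence


def describe_levels(
    levels: Sequence[int],
    num_codebooks: int = 1,
    keep_num_codebooks_dim: Optional[bool] = None,
) -> dict:
    """Hypothetical helper (not a library API) summarising a `levels` list."""
    # Assumed default, taken from the docstring: True only with several codebooks.
    if keep_num_codebooks_dim is None:
        keep_num_codebooks_dim = num_codebooks > 1
    return {
        # One quantized dimension per entry in `levels`.
        "codebook_dim": len(levels),
        # Number of distinct points on the per-dimension grid.
        "grid_points": prod(levels),
        "keep_num_codebooks_dim": keep_num_codebooks_dim,
    }


summary = describe_levels([5, 5, 8])
print(summary)
# {'codebook_dim': 3, 'grid_points': 200, 'keep_num_codebooks_dim': False}
```

With the values used in the README examples, this gives three quantized dimensions and 5 * 5 * 8 = 200 grid points.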
@@ -720,7 +720,7 @@ assert loss.item() >= 0

 ```bibtex
 @misc{hsu2023disentanglement,
-    title = {Disentanglement via Latent Quantization},
+    title = {Disentanglement via Latent Quantization},
     author = {Kyle Hsu and Will Dorrell and James C. R. Whittington and Jiajun Wu and Chelsea Finn},
vector_quantize_pytorch/latent_quantization.py (1 addition, 1 deletion)
@@ -52,7 +52,7 @@ def __init__(
         (default is 1)
     codebook_dim (int): the dimension of the codebook.
         If levels is a list, codebook_dim is the length of the list.
-        (default to -1)
+        (default to -1)
     keep_num_codebooks_dim (Optional[bool]): Whether to keep the number of codebooks dimension in the output tensor. If not provided, it is set to True if num_codebooks > 1, otherwise False.
     optimize_values (Optional[bool]): Whether to optimize the values of the codebook. If not provided, it is set to True.