Skip to content

Commit f8ffc22

Browse files
authored
docs: Replace OpenReview links with open-access alternatives (#766)
1 parent f30118e commit f8ffc22

5 files changed

Lines changed: 6 additions & 6 deletions

File tree

CHANGELOG.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -59,7 +59,7 @@ changelog does not include internal changes that do not affect the user.
5959
### Added
6060

6161
- Added `IMTL-L` (the loss-balancing variant of Impartial Multi-Task Learning) from [Towards
62-
Impartial Multi-Task Learning](https://openreview.net/pdf?id=IMPnRXEWpvr) (ICLR 2021), a stateful
62+
Impartial Multi-Task Learning](https://www.semanticscholar.org/paper/Towards-Impartial-Multi-task-Learning-Liu-Li/45c0828baec1dd53b81f1b2635788fdf27d0792d) (ICLR 2021), a stateful
6363
`Scalarizer` that learns a per-task scale `s_i` and combines the values as
6464
`Σ (exp(s_i) · L_i − s_i)`.
6565
- Added `UW` (Uncertainty Weighting) from [Multi-Task Learning Using Uncertainty to Weigh Losses
@@ -74,7 +74,7 @@ changelog does not include internal changes that do not affect the user.
7474
### Added
7575

7676
- Added `STCH` from [Smooth Tchebycheff Scalarization for Multi-Objective
77-
Optimization](https://openreview.net/pdf?id=m4dO5L6eCp), a `Scalarizer` that combines the input
77+
Optimization](https://arxiv.org/abs/2402.19078), a `Scalarizer` that combines the input
7878
tensor of values into a smooth approximation of their (weighted, shifted) maximum.
7979
- Added `MoDoWeighting` from [Three-Way Trade-Off in Multi-Objective Learning: Optimization, Generalization and Conflict-Avoidance](https://www.jmlr.org/papers/volume25/23-1287/23-1287.pdf) (JMLR 2024). It is a stateful `Weighting` that maintains task weights across calls via a simplex-projected gradient step on a cross-batch matrix `G = J_1 @ J_2.T`, computed from two independent mini-batches using `autojac.jac`.
8080
- Added `GeometricMean` (also known as GLS) studied in [MultiNet++: Multi-Stream Feature

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -235,7 +235,7 @@ TorchJD provides many existing aggregators from the literature, listed in the fo
235235
| [FairGrad](https://torchjd.org/stable/docs/aggregation/fairgrad#torchjd.aggregation.FairGrad) | [FairGradWeighting](https://torchjd.org/stable/docs/aggregation/fairgrad#torchjd.aggregation.FairGradWeighting) | [Fair Resource Allocation in Multi-Task Learning](https://arxiv.org/pdf/2402.15638) |
236236
| [GradDrop](https://torchjd.org/stable/docs/aggregation/graddrop#torchjd.aggregation.GradDrop) | - | [Just Pick a Sign: Optimizing Deep Multitask Models with Gradient Sign Dropout](https://arxiv.org/pdf/2010.06808) |
237237
| [GradVac](https://torchjd.org/stable/docs/aggregation/gradvac#torchjd.aggregation.GradVac) | [GradVacWeighting](https://torchjd.org/stable/docs/aggregation/gradvac#torchjd.aggregation.GradVacWeighting) | [Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multilingual Models](https://arxiv.org/pdf/2010.05874) |
238-
| [IMTLG](https://torchjd.org/stable/docs/aggregation/imtl_g#torchjd.aggregation.IMTLG) | [IMTLGWeighting](https://torchjd.org/stable/docs/aggregation/imtl_g#torchjd.aggregation.IMTLGWeighting) | [Towards Impartial Multi-task Learning](https://discovery.ucl.ac.uk/id/eprint/10120667/) |
238+
| [IMTLG](https://torchjd.org/stable/docs/aggregation/imtl_g#torchjd.aggregation.IMTLG) | [IMTLGWeighting](https://torchjd.org/stable/docs/aggregation/imtl_g#torchjd.aggregation.IMTLGWeighting) | [Towards Impartial Multi-task Learning](https://www.semanticscholar.org/paper/Towards-Impartial-Multi-task-Learning-Liu-Li/45c0828baec1dd53b81f1b2635788fdf27d0792d) |
239239
| [Krum](https://torchjd.org/stable/docs/aggregation/krum#torchjd.aggregation.Krum) | [KrumWeighting](https://torchjd.org/stable/docs/aggregation/krum#torchjd.aggregation.KrumWeighting) | [Machine Learning with Adversaries: Byzantine Tolerant Gradient Descent](https://proceedings.neurips.cc/paper/2017/file/f4b9ec30ad9f68f89b29639786cb62ef-Paper.pdf) |
240240
| [Mean](https://torchjd.org/stable/docs/aggregation/mean#torchjd.aggregation.Mean) | [MeanWeighting](https://torchjd.org/stable/docs/aggregation/mean#torchjd.aggregation.MeanWeighting) | - |
241241
| [MGDA](https://torchjd.org/stable/docs/aggregation/mgda#torchjd.aggregation.MGDA) | [MGDAWeighting](https://torchjd.org/stable/docs/aggregation/mgda#torchjd.aggregation.MGDAWeighting) | [Multiple-gradient descent algorithm (MGDA) for multiobjective optimization](https://comptes-rendus.academie-sciences.fr/mathematique/articles/10.1016/j.crma.2012.03.014/) |

src/torchjd/aggregation/_gradvac.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -135,7 +135,7 @@ class GradVac(GramianWeightedAggregator, Stateful, _NonDifferentiable):
135135
:class:`~torchjd.aggregation.GramianWeightedAggregator` implementing the aggregation step of
136136
Gradient Vaccine (GradVac) from `Gradient Vaccine: Investigating and Improving Multi-task
137137
Optimization in Massively Multilingual Models (ICLR 2021 Spotlight)
138-
<https://openreview.net/forum?id=F1vEjWK-lH_>`_.
138+
<https://arxiv.org/abs/2010.05874>`_.
139139
140140
For each task :math:`i`, the order in which other tasks :math:`j` are visited is drawn at
141141
random. For each pair :math:`(i, j)`, the cosine similarity :math:`\phi_{ij}` between the

src/torchjd/scalarization/_imtl_l.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@ class IMTLL(Scalarizer, Stateful):
1414
:class:`~torchjd.scalarization.Scalarizer` that combines the input tensor of values using learned
1515
per-task scales. ``IMTL-L`` is the loss-balancing variant of Impartial
1616
Multi-Task Learning, proposed in `Towards Impartial Multi-Task Learning
17-
<https://openreview.net/pdf?id=IMPnRXEWpvr>`_.
17+
<https://www.semanticscholar.org/paper/Towards-Impartial-Multi-task-Learning-Liu-Li/45c0828baec1dd53b81f1b2635788fdf27d0792d>`_.
1818
1919
Each value :math:`L_i` is assigned a learnable scale parameter :math:`s_i`, and the values are
2020
combined as

src/torchjd/scalarization/_stch.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ class STCH(Scalarizer):
88
r"""
99
:class:`~torchjd.scalarization.Scalarizer` that combines the input tensor of values using smooth
1010
Tchebycheff scalarization, as defined in `Smooth Tchebycheff Scalarization for Multi-Objective
11-
Optimization <https://openreview.net/pdf?id=m4dO5L6eCp>`_.
11+
Optimization <https://arxiv.org/abs/2402.19078>`_.
1212
1313
It returns
1414

0 commit comments

Comments
 (0)