Skip to content

Commit 16ff7e6

Browse files
authored
Merge branch 'main' into KhusPatel4450-patch-1
2 parents f44422e + f8ffc22 commit 16ff7e6

11 files changed

Lines changed: 16 additions & 16 deletions

File tree

.github/workflows/build-deploy-docs.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@ jobs:
2222
contents: write
2323
steps:
2424
- name: Checkout repository
25-
uses: actions/checkout@v6
25+
uses: actions/checkout@v7
2626

2727
- name: Set up uv
2828
uses: astral-sh/setup-uv@v7

.github/workflows/check-links.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ jobs:
1616
runs-on: ubuntu-latest
1717
steps:
1818
- name: Checkout code
19-
uses: actions/checkout@v6
19+
uses: actions/checkout@v7
2020

2121
# This will restore the cache for the current commit if it exists, or the most recent lychee
2222
# cache otherwise (including those saved for the main branch). It will also save the cache for
@@ -25,7 +25,7 @@ jobs:
2525
# temporary (rate limiting, network issue, etc.), and we always want to retry those links
2626
# everytime this action is run.
2727
- name: Restore lychee cache
28-
uses: actions/cache@v5
28+
uses: actions/cache@v6
2929
with:
3030
path: .lycheecache
3131
key: cache-lychee-${{ github.sha }}

.github/workflows/checks.yml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -44,7 +44,7 @@ jobs:
4444

4545
steps:
4646
- name: Checkout repository
47-
uses: actions/checkout@v6
47+
uses: actions/checkout@v7
4848

4949
- name: Set up uv
5050
uses: astral-sh/setup-uv@v7
@@ -63,7 +63,7 @@ jobs:
6363
PYTEST_TORCH_DTYPE: ${{ matrix.dtype || 'float32' }}
6464

6565
- name: Upload results to Codecov
66-
uses: codecov/codecov-action@v6
66+
uses: codecov/codecov-action@v7
6767
with:
6868
token: ${{ secrets.CODECOV_TOKEN }}
6969

@@ -72,7 +72,7 @@ jobs:
7272
runs-on: ubuntu-latest
7373
steps:
7474
- name: Checkout repository
75-
uses: actions/checkout@v6
75+
uses: actions/checkout@v7
7676

7777
- name: Set up uv
7878
uses: astral-sh/setup-uv@v7
@@ -96,7 +96,7 @@ jobs:
9696
runs-on: ubuntu-latest
9797
steps:
9898
- name: Checkout repository
99-
uses: actions/checkout@v6
99+
uses: actions/checkout@v7
100100

101101
- name: Set up uv
102102
uses: astral-sh/setup-uv@v7

.github/workflows/opencode.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,7 @@ jobs:
2525
issues: write
2626
steps:
2727
- name: Checkout repository
28-
uses: actions/checkout@v6
28+
uses: actions/checkout@v7
2929
with:
3030
fetch-depth: 1
3131
persist-credentials: false

.github/workflows/release.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@ jobs:
1414
id-token: write
1515
steps:
1616
- name: Checkout repository
17-
uses: actions/checkout@v6
17+
uses: actions/checkout@v7
1818

1919
- name: Set up uv
2020
uses: astral-sh/setup-uv@v7

.pre-commit-config.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ repos:
1010
- id: check-merge-conflict # Check for files that contain merge conflict strings.
1111

1212
- repo: https://github.com/astral-sh/ruff-pre-commit
13-
rev: v0.15.15
13+
rev: v0.15.20
1414
hooks:
1515
- id: ruff-check
1616
args: [ --fix, --ignore, FIX ] # Allow committing with TODOs. Only CI checks should prevent merging with TODOs.

CHANGELOG.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -59,7 +59,7 @@ changelog does not include internal changes that do not affect the user.
5959
### Added
6060

6161
- Added `IMTL-L` (the loss-balancing variant of Impartial Multi-Task Learning) from [Towards
62-
Impartial Multi-Task Learning](https://openreview.net/pdf?id=IMPnRXEWpvr) (ICLR 2021), a stateful
62+
Impartial Multi-Task Learning](https://www.semanticscholar.org/paper/Towards-Impartial-Multi-task-Learning-Liu-Li/45c0828baec1dd53b81f1b2635788fdf27d0792d) (ICLR 2021), a stateful
6363
`Scalarizer` that learns a per-task scale `s_i` and combines the values as
6464
`Σ (exp(s_i) · L_i − s_i)`.
6565
- Added `UW` (Uncertainty Weighting) from [Multi-Task Learning Using Uncertainty to Weigh Losses
@@ -74,7 +74,7 @@ changelog does not include internal changes that do not affect the user.
7474
### Added
7575

7676
- Added `STCH` from [Smooth Tchebycheff Scalarization for Multi-Objective
77-
Optimization](https://openreview.net/pdf?id=m4dO5L6eCp), a `Scalarizer` that combines the input
77+
Optimization](https://arxiv.org/abs/2402.19078), a `Scalarizer` that combines the input
7878
tensor of values into a smooth approximation of their (weighted, shifted) maximum.
7979
- Added `MoDoWeighting` from [Three-Way Trade-Off in Multi-Objective Learning: Optimization, Generalization and Conflict-Avoidance](https://www.jmlr.org/papers/volume25/23-1287/23-1287.pdf) (JMLR 2024). It is a stateful `Weighting` that maintains task weights across calls via a simplex-projected gradient step on a cross-batch matrix `G = J_1 @ J_2.T`, computed from two independent mini-batches using `autojac.jac`.
8080
- Added `GeometricMean` (also known as GLS) studied in [MultiNet++: Multi-Stream Feature

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -162,7 +162,7 @@ TorchJD provides many existing aggregators from the literature, listed in the fo
162162
| [FairGrad](https://torchjd.org/stable/docs/aggregation/fairgrad#torchjd.aggregation.FairGrad) | [FairGradWeighting](https://torchjd.org/stable/docs/aggregation/fairgrad#torchjd.aggregation.FairGradWeighting) | [Fair Resource Allocation in Multi-Task Learning](https://arxiv.org/pdf/2402.15638) |
163163
| [GradDrop](https://torchjd.org/stable/docs/aggregation/graddrop#torchjd.aggregation.GradDrop) | - | [Just Pick a Sign: Optimizing Deep Multitask Models with Gradient Sign Dropout](https://arxiv.org/pdf/2010.06808) |
164164
| [GradVac](https://torchjd.org/stable/docs/aggregation/gradvac#torchjd.aggregation.GradVac) | [GradVacWeighting](https://torchjd.org/stable/docs/aggregation/gradvac#torchjd.aggregation.GradVacWeighting) | [Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multilingual Models](https://arxiv.org/pdf/2010.05874) |
165-
| [IMTLG](https://torchjd.org/stable/docs/aggregation/imtl_g#torchjd.aggregation.IMTLG) | [IMTLGWeighting](https://torchjd.org/stable/docs/aggregation/imtl_g#torchjd.aggregation.IMTLGWeighting) | [Towards Impartial Multi-task Learning](https://discovery.ucl.ac.uk/id/eprint/10120667/) |
165+
| [IMTLG](https://torchjd.org/stable/docs/aggregation/imtl_g#torchjd.aggregation.IMTLG) | [IMTLGWeighting](https://torchjd.org/stable/docs/aggregation/imtl_g#torchjd.aggregation.IMTLGWeighting) | [Towards Impartial Multi-task Learning](https://www.semanticscholar.org/paper/Towards-Impartial-Multi-task-Learning-Liu-Li/45c0828baec1dd53b81f1b2635788fdf27d0792d) |
166166
| [Krum](https://torchjd.org/stable/docs/aggregation/krum#torchjd.aggregation.Krum) | [KrumWeighting](https://torchjd.org/stable/docs/aggregation/krum#torchjd.aggregation.KrumWeighting) | [Machine Learning with Adversaries: Byzantine Tolerant Gradient Descent](https://proceedings.neurips.cc/paper/2017/file/f4b9ec30ad9f68f89b29639786cb62ef-Paper.pdf) |
167167
| [Mean](https://torchjd.org/stable/docs/aggregation/mean#torchjd.aggregation.Mean) | [MeanWeighting](https://torchjd.org/stable/docs/aggregation/mean#torchjd.aggregation.MeanWeighting) | - |
168168
| [MGDA](https://torchjd.org/stable/docs/aggregation/mgda#torchjd.aggregation.MGDA) | [MGDAWeighting](https://torchjd.org/stable/docs/aggregation/mgda#torchjd.aggregation.MGDAWeighting) | [Multiple-gradient descent algorithm (MGDA) for multiobjective optimization](https://comptes-rendus.academie-sciences.fr/mathematique/articles/10.1016/j.crma.2012.03.014/) |

src/torchjd/aggregation/_gradvac.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -135,7 +135,7 @@ class GradVac(GramianWeightedAggregator, Stateful, _NonDifferentiable):
135135
:class:`~torchjd.aggregation.GramianWeightedAggregator` implementing the aggregation step of
136136
Gradient Vaccine (GradVac) from `Gradient Vaccine: Investigating and Improving Multi-task
137137
Optimization in Massively Multilingual Models (ICLR 2021 Spotlight)
138-
<https://openreview.net/forum?id=F1vEjWK-lH_>`_.
138+
<https://arxiv.org/abs/2010.05874>`_.
139139
140140
For each task :math:`i`, the order in which other tasks :math:`j` are visited is drawn at
141141
random. For each pair :math:`(i, j)`, the cosine similarity :math:`\phi_{ij}` between the

src/torchjd/scalarization/_imtl_l.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@ class IMTLL(Scalarizer, Stateful):
1414
:class:`~torchjd.scalarization.Scalarizer` that combines the input tensor of values using learned
1515
per-task scales. ``IMTL-L`` is the loss-balancing variant of Impartial
1616
Multi-Task Learning, proposed in `Towards Impartial Multi-Task Learning
17-
<https://openreview.net/pdf?id=IMPnRXEWpvr>`_.
17+
<https://www.semanticscholar.org/paper/Towards-Impartial-Multi-task-Learning-Liu-Li/45c0828baec1dd53b81f1b2635788fdf27d0792d>`_.
1818
1919
Each value :math:`L_i` is assigned a learnable scale parameter :math:`s_i`, and the values are
2020
combined as

0 commit comments

Comments
 (0)