Port distributed training support from existing PR #1315

@lrzpellegrini

I'm opening this issue to keep track of the work needed to port the content of PR #996 to the main branch.

The idea is to split that PR (which is huge and based on a fairly old version of the codebase) and, starting from the current state of the main branch, port its main elements in smaller PRs.
I'll keep this issue updated as I work on this.

Many changes are not strictly related to supporting distributed training but may benefit Avalanche in general.

  1. I'm starting with porting the modernized object detection/segmentation dataset, strategies, and metrics. I'll also port the generalized batch collate functionality.

Changes in Distributed Training PR #996:

Legend:

  • 🔲 Not ported
  • ⌛ Work in progress
  • 💬 PR opened, discussion in progress
  • ✔️ Merged into main branch

Base elements

Strategy and plugins

Models

Detection

Data Loader

Loggers and metrics

Unit tests

Typing
