Skip to content

Issue with Non-stop increasing GPU memory consumption #24

@eddiegaoo

Description

@eddiegaoo

I found that training TGAT model (default parameter setting in config/TGAT.yml) would incur a non-stop increasing GPU memory consumption. But for the other models, such issue does not exist.

I assume it might be related with Python garbage collection mechanism, so I add torch.cuda.empty_cache() at the end of every training batch. Now such issue is solved, but I find the empty_cache operation is somewhat time-consuming.

I wonder if you notice the same issue and how it can be solved in a better way.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions