Skip to content

Conversation

@Capricorn231
Copy link

The method _update_stats() is called with val, which is a Tensor with gradients. When use with AverageMeter, the gradient graph will be accumulated during training and won't be released and collected by GC, causing significant memory leak.

@martin-danelljan
Copy link
Contributor

val is not necessarily always a tensor. So we need to have a check if its actually a tensor first.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants