@pthombre
Contributor

No description provided.

@copy-pr-bot

copy-pr-bot bot commented Oct 17, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

is_rank0 = (not dist.is_initialized()) or dist.get_rank() == 0

if is_rank0:
    training_state = {
Contributor

@adil-a, can you provide guidance with respect to checkpointing?
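
For context on the pattern under review, here is a minimal sketch of a rank-0 guarded save. It is not the code from this PR: the contents of `training_state` are truncated in the hunk above, so the keys used here (`step`, `epoch`), the helper names `is_rank0` and `save_training_state`, and the JSON file format are all assumptions chosen for illustration. Only `dist.is_available()`, `dist.is_initialized()`, `dist.get_rank()`, and `dist.barrier()` are real `torch.distributed` calls.

```python
import json
import os
import tempfile

try:
    import torch.distributed as dist
except ImportError:  # allow the sketch to run without PyTorch installed
    dist = None


def _dist_active() -> bool:
    """True only when torch.distributed is importable, available, and initialized."""
    return dist is not None and dist.is_available() and dist.is_initialized()


def is_rank0() -> bool:
    """Same check as in the hunk: single-process runs count as rank 0."""
    return (not _dist_active()) or dist.get_rank() == 0


def save_training_state(state: dict, path: str) -> None:
    """Hypothetical helper: write a small training-state dict from rank 0 only."""
    if is_rank0():
        directory = os.path.dirname(os.path.abspath(path))
        os.makedirs(directory, exist_ok=True)
        # Write to a temporary file first, then rename, so a crash mid-write
        # does not leave a truncated checkpoint at the final path.
        fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
        with os.fdopen(fd, "w") as f:
            json.dump(state, f)
        os.replace(tmp_path, path)
    if _dist_active():
        # Keep other ranks from racing ahead before the file exists.
        dist.barrier()
```

Usage would look like `save_training_state({"step": 100, "epoch": 2}, "ckpt/training_state.json")`, called by every rank. Whether model and optimizer state should also go through a rank-0 path like this, or through a sharded checkpointing API, is the open question in the comment above.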

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>
…peline

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>
@akoumpa force-pushed the pranav/wan21_finetuning branch from 2376f55 to 4873cc3 on October 21, 2025 20:47
@akoumpa
Contributor

akoumpa commented Oct 21, 2025

/ok to test 4873cc3

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>
@pthombre
Contributor Author

/ok to test 21bc81e

Signed-off-by: Pranav Prashant Thombre <pthombre@nvidia.com>
@pthombre
Contributor Author

/ok to test 93e8b00

@linnanwang
Contributor

Everything looks good to me.

Labels

Run CICD Trigger Testing CICD

4 participants