Question about training scripts

The default model path in the training scripts is Nickyang/FastCuRL-1.5B-Preview. To reproduce training, should we instead load DeepSeek-R1-Distill-Qwen-1.5B in stage 1, and then load the previous stage's weights in each subsequent stage?
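
To make the question concrete, here is a rough sketch of the setup I have in mind. It is only my assumption of how the stages chain together, not something taken from the repo: the function name `model_path_for_stage`, the `checkpoints/stageN` directory layout, and the `deepseek-ai/` prefix on the base model ID are all hypothetical choices for illustration.

```python
from pathlib import Path

# Assumed Hugging Face ID for the base model; the actual scripts may
# point at a local copy or a different path.
BASE_MODEL = "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"


def model_path_for_stage(stage: int, ckpt_root: str = "checkpoints") -> str:
    """Return the model path to load at the start of a given training stage.

    Stage 1 starts from the base model; every later stage starts from the
    checkpoint saved by the stage before it. The "stage{N}" directory
    naming is a hypothetical layout, not the repo's convention.
    """
    if stage < 1:
        raise ValueError("stage numbering starts at 1")
    if stage == 1:
        return BASE_MODEL
    # Load the output of the previous stage.
    return (Path(ckpt_root) / f"stage{stage - 1}").as_posix()


if __name__ == "__main__":
    for s in (1, 2, 3):
        print(s, model_path_for_stage(s))
```

If this is the intended workflow, each stage's script would be given `model_path_for_stage(n)` as its model path in place of the default Nickyang/FastCuRL-1.5B-Preview.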