Skip to content

[Executor] Adjust signal sending order in RL training#3773

Merged
Jiang-Jia-Jun merged 7 commits into
PaddlePaddle:developfrom
gongshaotian:rl_multi_device
Sep 10, 2025
Merged

[Executor] Adjust signal sending order in RL training#3773
Jiang-Jia-Jun merged 7 commits into
PaddlePaddle:developfrom
gongshaotian:rl_multi_device

Conversation

@gongshaotian

Copy link
Copy Markdown
Collaborator

In RL training, CUDAGraph capture can only be performed after the weight is updated, and the capture needs to last for a period of time. The status of the worker will be updated immediately after the weight is updated in the original implementation, but the worker may not be ready at this time.

@paddle-bot

paddle-bot Bot commented Sep 1, 2025

Copy link
Copy Markdown

Thanks for your contribution!

@SigureMo SigureMo left a comment

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTMeow 🐾

@Jiang-Jia-Jun Jiang-Jia-Jun merged commit d3e4ae3 into PaddlePaddle:develop Sep 10, 2025
23 of 28 checks passed
qingqing01 pushed a commit that referenced this pull request Sep 11, 2025
* Adjust processing order

* fix bug

* fix update_parameters bug

* refine code
Jiang-Jia-Jun pushed a commit that referenced this pull request Sep 22, 2025
…#4178)

* Adjust processing order

* fix bug

* fix update_parameters bug

* refine code
@gongshaotian gongshaotian deleted the rl_multi_device branch November 3, 2025 06:49
xiaoguoguo626807 pushed a commit to xiaoguoguo626807/FastDeploy that referenced this pull request May 7, 2026
)

* Adjust processing order

* fix bug

* fix update_parameters bug

* refine code
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants