Skip to content

Continuous Batching support for AsyncGRPO#5781

Draft
qgallouedec wants to merge 1 commit into
mainfrom
cb-async-grpo
Draft

Continuous Batching support for AsyncGRPO#5781
qgallouedec wants to merge 1 commit into
mainfrom
cb-async-grpo

Conversation

@qgallouedec
Copy link
Copy Markdown
Member

What does this PR do?

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

AI writing disclosure

We welcome the use of AI tools to help with contributions. For transparency and to help us improve our review process, please indicate the level of AI involvement in this PR.

  • No AI usage: the PR was written entirely by a human.
  • AI-assisted: some parts were suggested or improved by AI, but the PR was written and reviewed by a human.
  • AI-generated: the PR was mostly or fully generated by an AI tool.

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.

Copy link
Copy Markdown
Member

@sergiopaniego sergiopaniego left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

small pass
it'd nice to have an example script + update the docs for the new feature.

> Parameters that control the rollout backend

backend (`str`, *optional*, defaults to `"vllm"`):
Generation backend. One of `"vllm"` (use an external vLLM server, default) or `"cb"` (use
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

backend param needs a validation check ("vllm", "cb")

# Use the injected worker (e.g. a stub in tests). The queue is owned by the worker.
self.rollout_worker = rollout_worker
elif self.args.backend == "cb":
from .cb_rollout_worker import CBRolloutWorker
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

cb_rollout_worker.py missing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants