Support broadcasting just lora weights and drop support for merged weights #1320

Merged
Jackmin801 merged 44 commits into main from feat-update-lora on Nov 21, 2025

Conversation

@Jackmin801 Jackmin801 (Member) commented Nov 20, 2025

Note

Adds adapter-only filesystem weight broadcasting and runtime LoRA loading, wires LoRA names through the orchestrator and scheduler, removes the merged-weight path, and updates tests/CI accordingly.

  • LoRA broadcasting and loading:
    • Add an adapter_only option to the weight broadcast configs (trainer/rl/config.py); the filesystem broadcaster now saves only the adapter weights and writes adapter_config.json, while the NCCL path disallows adapter-only broadcasts.
    • Introduce get_adapter_state_dict and adapter filename handling in trainer/weights.py; drop the merged-LoRA gathering path and simplify gather_weights_on_master (see the adapter-extraction sketch below).
    • Pass lora_config into the broadcaster setup and persist it with adapter saves.
  • Orchestrator & Scheduler:
    • Add OrchestratorConfig.lora_name and thread it through the Scheduler; use the LoRA name as model_name after updates.
    • update_weights now accepts lora_name and calls the new /v1/load_lora_adapter endpoint, avoiding a base-model reload when using LoRA (see the adapter-load sketch below).
  • Inference (vLLM server):
    • Inject a hack that bypasses the adapter already-loaded check so LoRA adapters can be updated in place.
  • Trainer:
    • Broadcast weights each step using the adapter_only flag; the checkpoint manager is unchanged but aligned with the new adapter flow.
  • Tests/CI:
    • Add integration tests for LoRA RL with dynamic adapter loading; run the LoRA tests separately in CI; capture vLLM/stdout logs for debugging.

Written by Cursor Bugbot for commit 7857428.
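The broadcasting bullets above describe saving only the adapter tensors plus an adapter_config.json that the inference server can consume. Below is a minimal sketch of that flow, assuming PEFT-style lora_ parameter names and safetensors output; get_adapter_state_dict matches the name in the summary, but save_adapter_snapshot, the file names, and passing lora_config as a plain dict are illustrative assumptions rather than the actual trainer/weights.py or filesystem broadcaster code.

```python
import json
from pathlib import Path

import torch
from safetensors.torch import save_file


def get_adapter_state_dict(model: torch.nn.Module) -> dict[str, torch.Tensor]:
    """Return only the LoRA adapter tensors (assumes PEFT-style 'lora_' key names)."""
    return {
        name: param.detach().cpu()
        for name, param in model.state_dict().items()
        if "lora_" in name  # keep lora_A / lora_B (and lora_embedding_*) weights only
    }


def save_adapter_snapshot(model: torch.nn.Module, lora_config: dict, step_dir: Path) -> None:
    """Hypothetical adapter-only save: adapter weights plus adapter_config.json."""
    step_dir.mkdir(parents=True, exist_ok=True)
    save_file(get_adapter_state_dict(model), str(step_dir / "adapter_model.safetensors"))
    (step_dir / "adapter_config.json").write_text(json.dumps(lora_config, indent=2))
```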

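On the orchestrator side, the summary notes that update_weights now points vLLM at the freshly broadcast adapter via /v1/load_lora_adapter instead of reloading the base model. A minimal sketch of that call, assuming vLLM's OpenAI-compatible server with runtime LoRA updating enabled (VLLM_ALLOW_RUNTIME_LORA_UPDATING=True); the function name and call site are illustrative, not the actual orchestrator code.

```python
import httpx


async def load_lora_adapter(server_url: str, lora_name: str, adapter_path: str) -> None:
    """Ask a running vLLM server to load a LoRA adapter from a directory on disk."""
    async with httpx.AsyncClient() as client:
        resp = await client.post(
            f"{server_url}/v1/load_lora_adapter",
            json={"lora_name": lora_name, "lora_path": adapter_path},
            timeout=60.0,
        )
        resp.raise_for_status()
```

Once the adapter is loaded, requests can pass lora_name as the model name, matching the orchestrator's use of the LoRA name as model_name after updates.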
@Jackmin801 Jackmin801 marked this pull request as ready for review November 20, 2025 18:34
Review threads: src/prime_rl/trainer/ckpt.py, tests/integration/lora/test_rl.py (outdated), src/prime_rl/orchestrator/orchestrator.py, tests/conftest.py (outdated)
Jackmin801 and others added 4 commits November 21, 2025 11:13
Comment out the unloading of the LoRA adapter in orchestrator.py.

Signed-off-by: Jackmin801 <56836461+Jackmin801@users.noreply.github.com>
Review threads: src/prime_rl/trainer/rl/broadcast/__init__.py, src/prime_rl/trainer/rl/broadcast/filesystem.py
@mikasenghaas mikasenghaas (Member) left a comment

very clean, i love it. nice that we decouple the full vs adapter only weight saving as well, has been bugging me for a while how entangled these were

Review threads: .github/workflows/gpu_tests.yaml (two threads), src/prime_rl/orchestrator/config.py, src/prime_rl/orchestrator/scheduler.py, src/prime_rl/trainer/rl/config.py (two threads, outdated), src/prime_rl/trainer/rl/train.py
@Jackmin801 Jackmin801 merged commit 2aa47ae into main Nov 21, 2025
5 checks passed