chore: default weight_decay to 0 in BaseOptimizerConfig by samsja · Pull Request #2422 · PrimeIntellect-ai/prime-rl

samsja · 2026-05-05T20:51:07Z

Summary

Flip the BaseOptimizerConfig.weight_decay default from 0.01 → 0.0.

Most RL runs in this repo either don't want weight decay or override it explicitly. Leaving the default at 0.01 silently penalizes any run whose config omits the field. Configs that genuinely want 0.01 already set it explicitly:

examples/Intellect-3.1/rl.toml
examples/minimax_m2.5_swe/rl.toml

So no existing run changes behavior unless its author was relying on the implicit default.

🤖 Generated with Claude Code

Note

Low Risk
Low risk: a one-line config default change, but it will alter training behavior for any runs that relied on the previous implicit 0.01 weight decay.

Overview
Updates BaseOptimizerConfig.weight_decay default in configs/trainer.py from 0.01 to 0.0, so optimizer creation (e.g., AdamW/SGD/SignSGD/Muon) no longer applies weight decay unless explicitly configured.

^{Reviewed by Cursor Bugbot for commit 12359e9. Bugbot is set up for automated code reviews on this repo. Configure here.}

Most prime-rl RL runs explicitly disable weight decay anyway (or want to); 0.01 as the default silently penalizes runs whose configs don't override it. Existing configs that want 0.01 (Intellect-3.1, minimax_m2.5_swe) already set it explicitly, so they're unaffected. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

samsja marked this pull request as ready for review May 5, 2026 21:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: default weight_decay to 0 in BaseOptimizerConfig#2422

chore: default weight_decay to 0 in BaseOptimizerConfig#2422
samsja wants to merge 1 commit intomainfrom
chore/default-weight-decay-zero

samsja commented May 5, 2026 •

edited by cursor Bot

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

samsja commented May 5, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

samsja commented May 5, 2026 •

edited by cursor Bot

Loading