
[feat] enhance export efficiency by restoring state dict directly instead of copying and gathering #177

Merged
tiankongdeguiji merged 7 commits into alibaba:master from tiankongdeguiji:features/opt_export on May 8, 2025

Conversation

tiankongdeguiji (Collaborator) commented May 7, 2025

When using state_dict_gather with the gloo backend, export can run out of memory if an embedding tensor is very large. This PR restores the state dict directly instead of copying and gathering it.
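To make the memory argument concrete, here is a toy sketch in plain Python. It is not the code from this PR and it does not call state_dict_gather or any real library. The function names `export_by_gather` and `export_by_direct_restore` are hypothetical, shards are modelled as lists of rows, and the number of rows held in a transit buffer stands in for memory use. Under those assumptions it shows why gathering every shard onto one process can peak at the full table size, while restoring shard by shard peaks at the largest shard.

```python
from typing import Dict, List, Tuple

Row = List[float]


def export_by_gather(shards: Dict[int, List[Row]]) -> Tuple[List[Row], int]:
    """Toy model of gather-then-copy: one buffer collects every rank's shard."""
    gathered: List[Row] = []
    for rank in sorted(shards):
        gathered.extend(shards[rank])  # buffer grows toward the full table
    peak_buffer_rows = len(gathered)  # the whole table sits in the buffer
    table = [list(row) for row in gathered]  # then it is copied into the export table
    return table, peak_buffer_rows


def export_by_direct_restore(checkpoint: Dict[int, List[Row]]) -> Tuple[List[Row], int]:
    """Toy model of direct restore: each shard is written to its destination in turn."""
    table: List[Row] = []
    peak_buffer_rows = 0
    for rank in sorted(checkpoint):
        shard = checkpoint[rank]  # stands in for reading one saved shard
        peak_buffer_rows = max(peak_buffer_rows, len(shard))
        table.extend(list(row) for row in shard)  # only this shard is in transit
    return table, peak_buffer_rows


if __name__ == "__main__":
    # Hypothetical layout: three ranks holding 2, 1 and 3 embedding rows.
    shards = {0: [[0.0], [1.0]], 1: [[2.0]], 2: [[3.0], [4.0], [5.0]]}
    print(export_by_gather(shards)[1], export_by_direct_restore(shards)[1])
```

Both functions produce the same table; only the size of the intermediate buffer differs. In a real export the exact savings would depend on how the checkpoint is sharded and read, which this sketch does not model.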

tiankongdeguiji changed the title from "[feat] enhance export efficiency by eestoring state dict directly instead of copying and gathering" to "[feat] enhance export efficiency by restoring state dict directly instead of copying and gathering" May 7, 2025
chengaofei previously approved these changes May 7, 2025
tiankongdeguiji merged commit 4b77160 into alibaba:master May 8, 2025
5 checks passed


2 participants