fix(embeddings): respect token limits in EmbeddingsBuilder batching#1221

Open
godnight10061 wants to merge 1 commit into 0xPlaygrounds:main from godnight10061:fix-embeddingbuilder-token-batching

Conversation

godnight10061 commented Jan 5, 2026

Fixes #462

Summary

EmbeddingsBuilder::build() previously batched only by M::MAX_DOCUMENTS. For providers such as OpenAI embeddings, a request can also fail when the combined input exceeds the provider's per-request token budget.

Changes

  • Add EmbeddingModel::max_tokens_per_request() (default None) so providers can expose a per-request token budget.
  • Batch by both M::MAX_DOCUMENTS and max_tokens_per_request() (uses text.len() as a conservative proxy for tokens to avoid adding a tokenizer dependency).
  • Set OpenAI embedding models to Some(300_000) (based on the provider error in the issue).
  • Add a regression test covering token-budget batching.
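The dual-constraint batching described above can be sketched as follows. This is an illustrative standalone function, not the crate's actual implementation; the name `batch_by_limits` and its signature are hypothetical, and byte length (`text.len()`) stands in for tokens as a conservative proxy, as noted in the changes:

```rust
/// Greedily split `texts` into batches that respect both a maximum
/// document count and an optional per-request byte budget (a conservative
/// proxy for tokens that avoids a tokenizer dependency).
fn batch_by_limits(
    texts: Vec<String>,
    max_docs: usize,
    max_bytes: Option<usize>,
) -> Vec<Vec<String>> {
    let mut batches = Vec::new();
    let mut current: Vec<String> = Vec::new();
    let mut current_bytes = 0usize;

    for text in texts {
        let len = text.len();
        // Start a new batch when adding this text would exceed either limit.
        let over_docs = current.len() >= max_docs;
        let over_bytes = max_bytes
            .map_or(false, |budget| !current.is_empty() && current_bytes + len > budget);
        if over_docs || over_bytes {
            batches.push(std::mem::take(&mut current));
            current_bytes = 0;
        }
        current_bytes += len;
        current.push(text);
    }
    if !current.is_empty() {
        batches.push(current);
    }
    batches
}
```

Note that a single document larger than the byte budget still forms its own one-element batch (the budget check requires a non-empty current batch), so the builder never silently drops input; the oversized request is left to fail at the provider.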

Test

  • cargo test -p rig-core --lib



Development

Successfully merging this pull request may close these issues.

bug: EmbeddingBuilder::build() Exceeds OpenAI Token Limit Due to Lack of Token-Based Chunking
