feat: scope trust_remote_code to Kimi-K2 family with pinned revisions#7

Merged
hallerite merged 1 commit into `main` from `feat/scoped-trust-remote-code` on May 7, 2026
Conversation

@hallerite
Member

Summary

Centralise tokenizer loading through a new `renderers.base.load_tokenizer` helper. Default: `trust_remote_code=False`. Opt-in only for the Moonshot Kimi-K2 family, and even then with `revision` pinned to a reviewed sha so a future malicious push to the upstream repo cannot auto-propagate to anyone calling `create_renderer_pool`.

Why

Empirical audit of every model in `MODEL_RENDERER_MAP`:

| family | needs `trust_remote_code`? |
| --- | --- |
| Qwen3 / 3.5 / 3.6 / 3-VL | no |
| Qwen2.5 (DefaultRenderer fixture) | no |
| GLM-5 / 5.1 / 4.7-Flash / 4.5-Air | no |
| MiniMax-M2 / M2.5 | no |
| DeepSeek-V3 / V3-Base | no |
| Nemotron-3 Nano / Super | no |
| GPT-OSS 20b / 120b | no |
| Kimi-K2-Instruct / K2.5 / K2.6 | **YES** |

Only 3 of 32 entries actually need it. The previous unconditional `trust_remote_code=True` in `create_renderer_pool` granted arbitrary-Python-on-`from_pretrained` for every supported model.

The Kimi requirement is real: `tokenizer_config.json` has `auto_map.AutoTokenizer = ["tokenization_kimi.TikTokenTokenizer", null]`, which makes transformers download and `import` a 353-line tiktoken wrapper shipped in-repo. Pinning the `revision` keeps the trust narrow — even with `trust_remote_code=True`, transformers executes the tokenizer Python from that exact commit only.
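For reference, the relevant fragment of Kimi's `tokenizer_config.json` has roughly this shape (reconstructed from the `auto_map` entry quoted above; surrounding fields omitted):

```
{
  "auto_map": {
    "AutoTokenizer": ["tokenization_kimi.TikTokenTokenizer", null]
  }
}
```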

Pinned revisions (current as of 2026-05-07)

```
moonshotai/Kimi-K2-Instruct: fd1984e2b7a3350dbf7305fe73a4ede25c14de50
moonshotai/Kimi-K2.5: 4d01dfe0332d63057c186e0b262165819efb6611
moonshotai/Kimi-K2.6: 2755962d07cb42aa2d988a35bcb65cd4a9c2de82
```

Bumping requires deliberate review of the upstream diff.
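A minimal sketch of the allow-list mechanism (names mirror the PR's `TRUSTED_REVISIONS` and `load_tokenizer`, but the exact signature in `renderers/base.py` may differ; the deferred transformers import is illustrative):

```python
# Sketch of the scoped-trust policy: remote code runs only for models in
# the allow-list, and only at the reviewed commit pinned there.

# model path -> reviewed commit sha
TRUSTED_REVISIONS = {
    "moonshotai/Kimi-K2-Instruct": "fd1984e2b7a3350dbf7305fe73a4ede25c14de50",
    "moonshotai/Kimi-K2.5": "4d01dfe0332d63057c186e0b262165819efb6611",
    "moonshotai/Kimi-K2.6": "2755962d07cb42aa2d988a35bcb65cd4a9c2de82",
}

def load_tokenizer(model_path: str, **kwargs):
    # deferred import so the policy itself is importable without transformers
    from transformers import AutoTokenizer

    revision = TRUSTED_REVISIONS.get(model_path)
    if revision is not None:
        # Kimi family: remote code allowed, but pinned to the reviewed sha
        return AutoTokenizer.from_pretrained(
            model_path, trust_remote_code=True, revision=revision, **kwargs
        )
    # everyone else: safe default, no remote code execution
    return AutoTokenizer.from_pretrained(model_path, trust_remote_code=False, **kwargs)
```

Unknown model paths never hit the `trust_remote_code=True` branch, so a new entry in `MODEL_RENDERER_MAP` gets the safe default unless its sha is deliberately added to the allow-list.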

Changes

  • `renderers/base.py`: add `TRUSTED_REVISIONS` allow-list and `load_tokenizer` helper; `create_renderer_pool`'s factory uses it.
  • `tests/conftest.py` + every test that loaded tokenizers directly (`test_bridge`, `test_message_indices`, `test_parse_response`, `test_parsers`, `test_preserve_thinking`, `test_render_ids`, `test_roundtrip`): route through `load_tokenizer`. No more ad-hoc `trust_remote_code=True`.
  • `tests/test_load_tokenizer.py` (new, 7 tests): unit-tests the policy itself — allow-list shape (Kimi-only), revision is a 40-char sha (rejects branch names like `main`), `AutoTokenizer.from_pretrained` call shape per model class, unknown paths fall through to no-trust, real Qwen + Kimi smoke loads.
  • Bump 0.1.6 → 0.1.7.
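The sha-shape check from the new policy tests can be sketched like this (a standalone illustration, not the actual code in `tests/test_load_tokenizer.py`):

```python
import re

# A pinned revision must be a full 40-char lowercase hex commit sha;
# branch names like "main" and abbreviated shas must be rejected.
_FULL_SHA = re.compile(r"^[0-9a-f]{40}$")

def is_pinned_sha(revision: str) -> bool:
    return _FULL_SHA.fullmatch(revision) is not None

assert is_pinned_sha("fd1984e2b7a3350dbf7305fe73a4ede25c14de50")
assert not is_pinned_sha("main")     # branch name
assert not is_pinned_sha("fd1984e")  # abbreviated sha
```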

Downstream

  • verifiers / prime-rl: do not need code changes. They build pools via `create_renderer_pool(...)` — the tokenizer loading is internal. Once this lands they bump the renderers pin and inherit the safer default for free.
  • Callers loading tokenizers themselves: if anything in the wider org calls `AutoTokenizer.from_pretrained(model, trust_remote_code=True)` directly, they keep doing what they're doing — this PR doesn't restrict the transformers API, it just makes `renderers` the principled-default entry point.

Test plan

  • `pytest tests/` — 902 passed, 48 skipped, 1 xfailed (no parity regressions; +7 are the new policy tests).
  • All trust_remote_code call sites in tests/ + renderers/ removed except inside `load_tokenizer` itself.
  • Pre-commit / ruff format clean.

🤖 Generated with Claude Code

@hallerite hallerite merged commit 9cc0d28 into main May 7, 2026
6 checks passed
@hallerite hallerite deleted the feat/scoped-trust-remote-code branch May 7, 2026 16:30