add moonshot/kimi-k2.6 to model registry #26200
Greptile Summary
This PR adds moonshot/kimi-k2.6 to the model registry.
Confidence Score: 5/5
Safe to merge; all findings are P2 clarification items on optional registry fields.
The core change is a well-formed JSON model registry addition with correct pricing math and a thorough test suite. The two open questions (missing cache_read_input_token_cost and supports_reasoning compared to kimi-k2.5) are P2: they won't break routing but could silently under-report capabilities or miss a caching discount. No logic errors, security issues, or test-integrity problems detected.
model_prices_and_context_window.json: verify cache pricing and reasoning support against the official kimi-k2.6 docs.

| Filename | Overview |
|---|---|
| model_prices_and_context_window.json | Adds moonshot/kimi-k2.6 entry; missing cache_read_input_token_cost and supports_reasoning compared to sibling kimi-k2.5 model |
| litellm/model_prices_and_context_window_backup.json | Backup JSON mirrors main JSON — same entry with same omissions as above |
| tests/test_litellm/llms/moonshot/test_moonshot_chat_transformation.py | Adds TestKimiK26ModelRegistry with 5 unit tests verifying pricing, context window, capabilities, and provider — no real network calls, complies with test rules |
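
The registry checks summarized in the table could look roughly like the sketch below. It is not the PR's test file: the entry is inlined from the diff (without the source URL) rather than read from `model_prices_and_context_window.json`, and the function names are hypothetical.

```python
import json

# Inline copy of the entry from the diff. The PR's tests presumably load the
# registry file itself; inlining keeps this sketch self-contained.
REGISTRY_JSON = """
{
  "moonshot/kimi-k2.6": {
    "input_cost_per_token": 6e-07,
    "litellm_provider": "moonshot",
    "max_input_tokens": 262144,
    "max_output_tokens": 262144,
    "max_tokens": 262144,
    "mode": "chat",
    "output_cost_per_token": 2.8e-06,
    "supports_function_calling": true,
    "supports_tool_choice": true,
    "supports_video_input": true,
    "supports_vision": true
  }
}
"""

ENTRY = json.loads(REGISTRY_JSON)["moonshot/kimi-k2.6"]


def test_pricing():
    # Per-token prices as written in the diff.
    assert ENTRY["input_cost_per_token"] == 6e-07
    assert ENTRY["output_cost_per_token"] == 2.8e-06


def test_context_window():
    # All three limits are set to 262144 tokens in the diff.
    for key in ("max_input_tokens", "max_output_tokens", "max_tokens"):
        assert ENTRY[key] == 262144


def test_capabilities():
    for key in (
        "supports_function_calling",
        "supports_tool_choice",
        "supports_video_input",
        "supports_vision",
    ):
        assert ENTRY[key] is True


def test_provider_and_mode():
    assert ENTRY["litellm_provider"] == "moonshot"
    assert ENTRY["mode"] == "chat"
```

None of these checks touches the network, which is consistent with the "no real network calls" note in the table.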
Flowchart
%%{init: {'theme': 'neutral'}}%%
flowchart TD
A[litellm.completion\nmodel='moonshot/kimi-k2.6'] --> B[model_prices_and_context_window.json\nlookup]
B --> C{Entry found?}
C -- Yes --> D[Resolve litellm_provider\n= 'moonshot']
D --> E[Apply pricing\n$0.60/M input · $2.80/M output]
D --> F[Apply limits\n262K ctx · 262K output]
D --> G[Enable capabilities\nfunction calling · vision · video]
E & F & G --> H[Route to Moonshot API]
C -- No --> I[KeyError / fallback]
Reviews (1): Last reviewed commit: "add tests for moonshot/kimi-k2.6 model r..."
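
The lookup path in the flowchart can be approximated in a few lines. This is a simplified sketch, not LiteLLM's actual routing or cost code: `REGISTRY` is an inline subset of the entry from the diff, and `resolve` and `estimate_cost` are hypothetical helpers.

```python
# Subset of the registry entry from the diff, inlined for illustration.
REGISTRY = {
    "moonshot/kimi-k2.6": {
        "input_cost_per_token": 6e-07,
        "output_cost_per_token": 2.8e-06,
        "litellm_provider": "moonshot",
        "max_input_tokens": 262144,
        "max_output_tokens": 262144,
        "supports_function_calling": True,
        "supports_vision": True,
        "supports_video_input": True,
    }
}


def resolve(model: str) -> dict:
    """Hypothetical helper: summarize what a registry entry implies."""
    entry = REGISTRY[model]  # unknown models raise KeyError (the "No" branch)
    return {
        "provider": entry["litellm_provider"],
        "limits": (entry["max_input_tokens"], entry["max_output_tokens"]),
        "capabilities": sorted(
            key
            for key, value in entry.items()
            if key.startswith("supports_") and value is True
        ),
    }


def estimate_cost(model: str, prompt_tokens: int, completion_tokens: int) -> float:
    """Hypothetical helper: tokens times per-token price, in USD."""
    entry = REGISTRY[model]
    return (
        prompt_tokens * entry["input_cost_per_token"]
        + completion_tokens * entry["output_cost_per_token"]
    )
```

Under these assumptions, one million prompt tokens come to about $0.60 and one million completion tokens to about $2.80, matching the per-million figures in the flowchart.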
"moonshot/kimi-k2.6": {
    "input_cost_per_token": 6e-07,
    "litellm_provider": "moonshot",
    "max_input_tokens": 262144,
    "max_output_tokens": 262144,
    "max_tokens": 262144,
    "mode": "chat",
    "output_cost_per_token": 2.8e-06,
    "source": "https://platform.kimi.ai/docs/guide/kimi-k2-6-quickstart",
    "supports_function_calling": true,
    "supports_tool_choice": true,
    "supports_video_input": true,
    "supports_vision": true
},
Missing cache_read_input_token_cost
Every other moonshot/kimi-k2* model in the registry includes cache_read_input_token_cost (e.g., kimi-k2.5 has 1e-07, kimi-k2-0905-preview has 1.5e-07). If prompt caching is supported for kimi-k2.6, omitting this field means LiteLLM will never apply the discounted rate, silently over-billing users on cached tokens. If the model genuinely doesn't support caching yet, a brief source note explaining that would help future readers.
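
To make the concern concrete, here is a rough sketch of how a missing cache rate changes the computed prompt cost. It is a simplified model, not LiteLLM's cost calculator: `prompt_cost` is a hypothetical helper, and the `1e-07` cache rate is borrowed from the kimi-k2.5 figure quoted above purely as a placeholder, since no cache price for kimi-k2.6 appears in this PR.

```python
def prompt_cost(entry: dict, prompt_tokens: int, cached_tokens: int = 0) -> float:
    """Hypothetical helper: price cached tokens at the cache rate if one is set.

    When the entry has no cache_read_input_token_cost, cached tokens fall back
    to the full input rate, which is the situation the comment warns about.
    """
    input_rate = entry["input_cost_per_token"]
    cache_rate = entry.get("cache_read_input_token_cost", input_rate)
    uncached_tokens = prompt_tokens - cached_tokens
    return uncached_tokens * input_rate + cached_tokens * cache_rate


# Entry as submitted in the PR: no cache rate.
k26_as_submitted = {"input_cost_per_token": 6e-07}

# Same entry with a PLACEHOLDER cache rate (kimi-k2.5's value, not confirmed
# for kimi-k2.6).
k26_with_placeholder_rate = {
    "input_cost_per_token": 6e-07,
    "cache_read_input_token_cost": 1e-07,
}

without_rate = prompt_cost(k26_as_submitted, prompt_tokens=1000, cached_tokens=800)
with_rate = prompt_cost(k26_with_placeholder_rate, prompt_tokens=1000, cached_tokens=800)
```

With 800 of 1,000 prompt tokens cached, the entry without a cache rate prices all 1,000 tokens at the input rate, while the placeholder variant prices the cached 800 at the lower rate, so `with_rate` is smaller than `without_rate`.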
supports_reasoning not set — verify vs kimi-k2.5
moonshot/kimi-k2.5 (the predecessor in the registry) has "supports_reasoning": true, but this entry omits it. If kimi-k2.6 also surfaces a <thinking> block or extended reasoning output, leaving this flag out will prevent LiteLLM from applying reasoning-aware handling to it. Please verify against the official docs and add the flag if applicable.
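
A quick way to surface this kind of omission is to diff the keys of a sibling entry against the new one. The sketch below is illustrative only: `missing_keys` is a hypothetical helper, and `K25_PARTIAL` contains just the two kimi-k2.5 fields quoted in these review comments, not the full registry entry.

```python
# Only the two kimi-k2.5 fields quoted in the review comments; the rest of
# that entry is not reproduced here.
K25_PARTIAL = {
    "cache_read_input_token_cost": 1e-07,
    "supports_reasoning": True,
}

# Keys of the kimi-k2.6 entry as submitted in the diff.
K26 = {
    "input_cost_per_token": 6e-07,
    "litellm_provider": "moonshot",
    "max_input_tokens": 262144,
    "max_output_tokens": 262144,
    "max_tokens": 262144,
    "mode": "chat",
    "output_cost_per_token": 2.8e-06,
    "supports_function_calling": True,
    "supports_tool_choice": True,
    "supports_video_input": True,
    "supports_vision": True,
}


def missing_keys(reference: dict, candidate: dict) -> list:
    """Hypothetical helper: keys present in reference but absent from candidate."""
    return sorted(set(reference) - set(candidate))


gaps = missing_keys(K25_PARTIAL, K26)
```

A key showing up in `gaps` only means the sibling has it; whether kimi-k2.6 should have it too still depends on the official docs.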
Codecov Report: ✅ All modified and coverable lines are covered by tests.
Refiling with correct base branch (litellm_internal_staging) and branch naming convention.
Relevant Issues / Related PRs
Adds Kimi K2.6 (released April 20, 2026) to the LiteLLM model registry.
Pre-Submission Checklist
tests/litellm/
make test-unit passes for the modified tests
Changes
Add moonshot/kimi-k2.6 to model_prices_and_context_window.json and backup
Usage