
add moonshot/kimi-k2.6 to model registry#26200

Closed
ishaan-berri wants to merge 3 commits into main from worktree-golden-yawning-hammock

Conversation

@ishaan-berri
Contributor

Relevant Issues / Related PRs

Adds Kimi K2.6 (released April 20, 2026) to the LiteLLM model registry.

Pre-Submission Checklist

  • I have added tests in tests/litellm/
  • make test-unit passes for the modified tests

Changes

  • Added moonshot/kimi-k2.6 to model_prices_and_context_window.json and backup
  • Pricing: $0.60/M input, $2.80/M output (from https://platform.kimi.ai/docs/guide/kimi-k2-6-quickstart)
  • 262K context window, supports function calling, vision, and video input
  • Tests for model registry entry

Usage

import litellm

response = litellm.completion(
    model="moonshot/kimi-k2.6",
    messages=[{"role": "user", "content": "Hello!"}],
)
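The listed rates make per-request pricing simple arithmetic. A minimal sketch in plain Python, restating the PR's numbers (this is not LiteLLM's own cost helper, just the same rates applied by hand):

```python
# Registry rates for moonshot/kimi-k2.6 as listed in this PR.
INPUT_COST_PER_TOKEN = 6e-07     # $0.60 per 1M input tokens
OUTPUT_COST_PER_TOKEN = 2.8e-06  # $2.80 per 1M output tokens

def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the USD cost of one request at the registry rates."""
    return (prompt_tokens * INPUT_COST_PER_TOKEN
            + completion_tokens * OUTPUT_COST_PER_TOKEN)

# A 100K-token prompt with a 2K-token completion:
print(f"${estimate_cost(100_000, 2_000):.4f}")  # $0.0656
```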

@CLAassistant

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
You have signed the CLA already but the status is still pending? Let us recheck it.

@codspeed-hq
Contributor

codspeed-hq Bot commented Apr 21, 2026

Merging this PR will not alter performance

✅ 16 untouched benchmarks


Comparing worktree-golden-yawning-hammock (240c061) with main (26fcbc9)

Open in CodSpeed

@greptile-apps
Contributor

greptile-apps Bot commented Apr 21, 2026

Greptile Summary

This PR adds moonshot/kimi-k2.6 to the LiteLLM model registry with pricing ($0.60/M input, $2.80/M output), a 262K context window, and capability flags for function calling, vision, and video input. Both JSON files are kept in sync and the accompanying tests correctly read from the in-memory model cost map without making real network calls.

Confidence Score: 5/5

Safe to merge; all findings are P2 clarification items on optional registry fields

The core change is a well-formed JSON model registry addition with correct pricing math and a thorough test suite. The two open questions (missing cache_read_input_token_cost and supports_reasoning compared to kimi-k2.5) are P2 — they won't break routing but could silently under-report capabilities or miss a caching discount. No logic errors, security issues, or test-integrity problems detected.

model_prices_and_context_window.json — verify cache pricing and reasoning support against official kimi-k2.6 docs

Important Files Changed

Filename Overview
model_prices_and_context_window.json Adds moonshot/kimi-k2.6 entry; missing cache_read_input_token_cost and supports_reasoning compared to sibling kimi-k2.5 model
litellm/model_prices_and_context_window_backup.json Backup JSON mirrors main JSON — same entry with same omissions as above
tests/test_litellm/llms/moonshot/test_moonshot_chat_transformation.py Adds TestKimiK26ModelRegistry with 5 unit tests verifying pricing, context window, capabilities, and provider — no real network calls, complies with test rules

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[litellm.completion\nmodel='moonshot/kimi-k2.6'] --> B[model_prices_and_context_window.json\nlookup]
    B --> C{Entry found?}
    C -- Yes --> D[Resolve litellm_provider\n= 'moonshot']
    D --> E[Apply pricing\n$0.60/M input · $2.80/M output]
    D --> F[Apply limits\n262K ctx · 262K output]
    D --> G[Enable capabilities\nfunction calling · vision · video]
    E & F & G --> H[Route to Moonshot API]
    C -- No --> I[KeyError / fallback]

Reviews (1): Last reviewed commit: "add tests for moonshot/kimi-k2.6 model r..."

Comment on lines +22875 to +22888
"moonshot/kimi-k2.6": {
"input_cost_per_token": 6e-07,
"litellm_provider": "moonshot",
"max_input_tokens": 262144,
"max_output_tokens": 262144,
"max_tokens": 262144,
"mode": "chat",
"output_cost_per_token": 2.8e-06,
"source": "https://platform.kimi.ai/docs/guide/kimi-k2-6-quickstart",
"supports_function_calling": true,
"supports_tool_choice": true,
"supports_video_input": true,
"supports_vision": true
},
Contributor


P2 Missing cache_read_input_token_cost

Every other moonshot/kimi-k2* model in the registry includes cache_read_input_token_cost (e.g., kimi-k2.5 has 1e-07, kimi-k2-0905-preview has 1.5e-07). If prompt caching is supported for kimi-k2.6, omitting this field means LiteLLM will never apply the discounted rate, silently over-billing users on cached tokens. If the model genuinely doesn't support caching yet, a brief source note explaining that would help future readers.
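One way to catch this class of omission mechanically is a small consistency check over the registry dict. A hypothetical sketch, where the helper name and the inline sample data are illustrative stand-ins rather than LiteLLM code:

```python
# Hypothetical lint: list kimi-k2* entries that omit cache_read_input_token_cost.
def missing_cache_price(registry: dict, prefix: str = "moonshot/kimi-k2") -> list:
    """Return model names under `prefix` that lack a cached-read rate."""
    return [
        name for name, entry in registry.items()
        if name.startswith(prefix) and "cache_read_input_token_cost" not in entry
    ]

# Sample data mirroring the fields discussed in this comment:
sample = {
    "moonshot/kimi-k2.5": {"input_cost_per_token": 6e-07,
                           "cache_read_input_token_cost": 1e-07},
    "moonshot/kimi-k2.6": {"input_cost_per_token": 6e-07},
}
print(missing_cache_price(sample))  # ['moonshot/kimi-k2.6']
```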

Comment on lines +22875 to +22888
"moonshot/kimi-k2.6": {
"input_cost_per_token": 6e-07,
"litellm_provider": "moonshot",
"max_input_tokens": 262144,
"max_output_tokens": 262144,
"max_tokens": 262144,
"mode": "chat",
"output_cost_per_token": 2.8e-06,
"source": "https://platform.kimi.ai/docs/guide/kimi-k2-6-quickstart",
"supports_function_calling": true,
"supports_tool_choice": true,
"supports_video_input": true,
"supports_vision": true
},
Contributor


P2 supports_reasoning not set — verify vs kimi-k2.5

moonshot/kimi-k2.5 (the predecessor in the registry) has "supports_reasoning": true, but this entry omits it. If kimi-k2.6 also surfaces a <thinking> block or extended reasoning output, leaving this flag out will prevent LiteLLM from routing reasoning-aware handling. Please verify against the official docs and add the flag if applicable.
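The practical effect of the omission: any consumer that reads capability flags with a falsy default will treat the model as non-reasoning. A minimal illustration, where the dict literal is an abridged stand-in for the registry entry:

```python
# Abridged stand-in for the kimi-k2.6 registry entry in this PR.
entry = {
    "supports_function_calling": True,
    "supports_vision": True,
    # "supports_reasoning" is omitted, unlike kimi-k2.5
}

# A lookup with a falsy default silently reports no reasoning support:
print(entry.get("supports_reasoning", False))  # False
```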

@codecov

codecov Bot commented Apr 21, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.


@ishaan-berri ishaan-berri temporarily deployed to integration-postgres April 21, 2026 23:09 — with GitHub Actions Inactive
@ishaan-berri
Contributor Author

Refiling with correct base branch (litellm_internal_staging) and branch naming convention.

2 participants