fix(reasoning): support models with reasoning without starting thinking tag #8132

mudler · 2026-01-20T17:33:03Z

Description

This PR fixes models that have as part of the template the thinking token start, especially the imported models which do not ship a specific template from LocalAI. This makes it hard to discover it while processing the LLM streaming, and this PR tries to workaround that by detecting if the model has thinking tags in the template and in the first model replies.

What it does:

Reasoning is still extracted by expecting the full output to contain start and ending thinking tag, for instance <thinking> ... </thinking>
When in the template or in the messages we detect thinking start tags, we decide if we need to inject it before extracting reasoning (if it's present already we don't, if it's not present and it should, we inject it)
When guessing options from GGUF files, it also reads the jinja template and loads it in the model
It is backward compatible, in worse case models with other backends that do not provide jinja templates, won't split the reasoning from the response
Users can disable this behavior by setting reasoning.disable_reasoning_tag_prefill = false in the YAML configuration file of the model

Notes for Reviewers

This fixes models that are reported in #8036 #7944 and mades GLM work with LocalAI as expected.

Signed commits

Yes, I signed my commits.

Signed-off-by: Ettore Di Giacinto <[email protected]>

netlify · 2026-01-20T17:33:09Z

✅ Deploy Preview for localai ready!

Name	Link
🔨 Latest commit	`c8484b3`
🔍 Latest deploy log	https://app.netlify.com/projects/localai/deploys/696fbc53df282500081193b2
😎 Deploy Preview	https://deploy-preview-8132--localai.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

mudler added 3 commits January 20, 2026 14:59

chore: extract reasoning to its own package

9f0623a

Signed-off-by: Ettore Di Giacinto <[email protected]>

make sure we detect thinking tokens from template

ebbca8a

Signed-off-by: Ettore Di Giacinto <[email protected]>

Allow to override via config, add tests

c8484b3

Signed-off-by: Ettore Di Giacinto <[email protected]>

mudler added the bug Something isn't working label Jan 20, 2026

This was referenced Jan 20, 2026

Handling of models pulled from HF does not properly handle thinking #7944

Closed

feat(openresponses): Support reasoning blocks #8133

Merged

mudler merged commit 34e054f into master Jan 20, 2026
38 checks passed

mudler deleted the fix/reasoning-template branch January 20, 2026 20:08

BrewTestBot mentioned this pull request Jan 23, 2026

localai 3.10.1 Homebrew/homebrew-core#264198

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(reasoning): support models with reasoning without starting thinking tag #8132

fix(reasoning): support models with reasoning without starting thinking tag #8132

Uh oh!

mudler commented Jan 20, 2026 •

edited

Loading

Uh oh!

netlify bot commented Jan 20, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

fix(reasoning): support models with reasoning without starting thinking tag #8132

fix(reasoning): support models with reasoning without starting thinking tag #8132

Uh oh!

Conversation

mudler commented Jan 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

netlify bot commented Jan 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for localai ready!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

mudler commented Jan 20, 2026 •

edited

Loading

netlify bot commented Jan 20, 2026 •

edited

Loading