feat: Add Ollama reasoning mode support for deepseek-r1 and similar models #916
Conversation
- Add llm.ollama.reasoning setting (checkbox, default: true)
- When enabled, separates thinking/reasoning content from main response
- Reasoning content is stored in additional_kwargs["reasoning_content"]
- Fix progress display to cap at 100% in page title

This enables proper handling of reasoning models like deepseek-r1 that output structured thinking blocks that should be excluded from final answers.

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
Claude finished @tombii's task
Pull Request Review
Reviewing PR #916: feat: Add Ollama reasoning mode support for deepseek-r1 and similar models
Todo List:
- Read repository guidelines (CLAUDE.md)
- Analyze version file changes
- Review LLM config implementation
- Examine default settings structure
- Review progress display fix
- Check for potential bugs and security issues
- Assess performance considerations
- Evaluate test coverage needs
- Provide comprehensive feedback
Review Feedback
✅ Code Quality & Implementation
Strong Points:
- Clean, well-documented implementation that follows existing patterns
- Proper error handling with debug logging in `llm_config.py:719-721`
- Consistent with existing Ollama parameter handling
- Good separation of concerns - reasoning logic is contained to the Ollama provider
Settings Structure:
- New `llm.ollama.reasoning` setting follows the established pattern in `default_settings.json`
- Appropriate UI element (checkbox) and sensible default (`true`)
- Good description explaining the feature's purpose
🔧 Technical Implementation
LLM Config Changes (`src/local_deep_research/config/llm_config.py:709-721`):

```python
reasoning_mode = get_setting_from_snapshot(
    "llm.ollama.reasoning",
    True,  # Default to True to exclude thinking from responses
    settings_snapshot=settings_snapshot,
)
if reasoning_mode is not None:
    ollama_params["reasoning"] = reasoning_mode
```
- Implementation is correct and follows existing parameter handling patterns
- Good default value (`true`) to prevent verbose thinking blocks in responses
- Proper integration with the settings system
Progress Display Fix (`src/local_deep_research/web/static/js/components/progress.js:592-593`):

```javascript
const cappedProgress = Math.max(0, Math.min(100, Math.floor(data.progress)));
```
- Simple and effective fix for the progress display bug
- Prevents display values exceeding 100%
🚨 Potential Issues
- Missing Category in Settings: The new setting has `"category": null` in `default_settings.json`. Consider adding it to a relevant category like `"llm"` for better organization in the UI.
- No Input Validation: While `reasoning_mode` is checked for `None`, there's no validation that it's a boolean value. Consider adding type validation.
Test Coverage: No tests are included for the new functionality. Consider adding:
- Unit tests for the reasoning parameter being passed correctly
- Integration tests with a mock Ollama response containing reasoning content
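As a starting point, a hedged pytest sketch for the first of these — `get_llm` and the snapshot shape are assumptions about this repo's API, so adjust to match `llm_config.py`:

```python
from unittest.mock import patch

# Hypothetical test: assumes llm_config exposes a get_llm() entry point
# that builds ChatOllama from the settings snapshot; adjust the names
# and arguments to the actual API in llm_config.py.
@patch("local_deep_research.config.llm_config.ChatOllama")
def test_reasoning_setting_reaches_chat_ollama(mock_chat_ollama):
    from local_deep_research.config.llm_config import get_llm

    snapshot = {"llm.ollama.reasoning": True}  # assumed snapshot shape
    get_llm(provider="ollama", settings_snapshot=snapshot)

    # The reasoning flag should be forwarded to the ChatOllama constructor
    _, kwargs = mock_chat_ollama.call_args
    assert kwargs.get("reasoning") is True
```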
🔒 Security Considerations
- Low Risk: The reasoning parameter is a boolean setting that doesn't involve user input or file operations
- Settings are properly retrieved through the established `get_setting_from_snapshot` mechanism
- No new attack vectors introduced
⚡ Performance Considerations
- Minimal Impact: Adding one boolean parameter to Ollama requests has negligible performance impact
- Positive Impact: When enabled, this actually improves performance by reducing response size (excluding thinking content)
- Settings retrieval is cached appropriately
📋 Suggested Improvements
- Add setting category:
"llm.ollama.reasoning": { "category": "llm", // Add this "description": "...", // ... rest of config }
- Add type validation in `llm_config.py`:

```python
reasoning_mode = get_setting_from_snapshot(
    "llm.ollama.reasoning",
    True,
    settings_snapshot=settings_snapshot,
)
if reasoning_mode is not None and isinstance(reasoning_mode, bool):
    ollama_params["reasoning"] = reasoning_mode
```
- Consider adding tests for the new functionality, especially testing that:
- The reasoning parameter is correctly passed to ChatOllama
- The setting appears in the UI as expected
- Default behavior works correctly
🎯 Overall Assessment
This is a well-implemented feature that addresses a specific need for reasoning models like deepseek-r1. The code is clean, follows existing patterns, and includes a useful bug fix for progress display.
Recommendation: ✅ Approve with minor suggestions
The implementation is solid and ready for merge. The suggested improvements are minor enhancements that could be addressed in follow-up PRs if desired.
- Add category "llm" to llm.ollama.reasoning setting for proper UI grouping
- Add type validation to ensure reasoning_mode is boolean before use

This prevents potential type errors if the setting gets corrupted or set to an unexpected value type.

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
Isn't this done in the wrapper already?
Is the PR still a draft?
Add checkbox setting to enable/disable thinking for Ollama reasoning models like deepseek-r1 and qwen3. When enabled, models perform reasoning but thinking content is automatically discarded - only final answers are shown.

Changes:
- Add llm.ollama.enable_thinking checkbox setting (default: true)
- Pass reasoning parameter to ChatOllama based on user setting
- LangChain converts reasoning=true to think=true in Ollama API
- Thinking separated to additional_kwargs["reasoning_content"] (discarded)
- Enhanced remove_think_tags() to clean orphaned <think> tags from answers

When thinking enabled (default):
- Model performs reasoning for smarter responses
- Thinking content automatically discarded
- Users only see clean final answers

When thinking disabled:
- Model gives direct answers without reasoning
- Faster but potentially less thorough responses

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
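For context, a minimal sketch of what cleaning orphaned tags can look like — the regexes here are an illustration, not the repo's actual `remove_think_tags()` implementation:

```python
import re

def remove_think_tags(text: str) -> str:
    """Illustrative sketch: drop complete <think>...</think> blocks,
    then strip any orphaned opening/closing tags left behind."""
    # Remove well-formed thinking blocks together with their content
    text = re.sub(r"<think>.*?</think>", "", text, flags=re.DOTALL)
    # Remove orphaned tags, e.g. a stray </think> with no opener
    text = re.sub(r"</?think>", "", text)
    return text.strip()
```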
Update the description for llm.ollama.enable_thinking setting to accurately reflect its behavior:
- Clarify that reasoning content is automatically separated and excluded from final response
- Explain the accuracy vs speed trade-off between enabled/disabled modes
- Update model reference from qwen3 to qwen2.5
- Add recommendation to enable for better accuracy

The previous description incorrectly stated that the model "shows its reasoning process" when enabled, but the reasoning is actually discarded by LDR and only the final answer is returned.

🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
Force-pushed from c9ce0b7 to 062c366.
> Isn't this done in the wrapper already?

`"""Create a wrapper class that processes LLM outputs with remove_think_tags and token counting"""`
Yes, you are right, but with Ollama I was still seeing `</think>` tags that we need to filter out.
Is this a thing that can happen?
Yeah, it came up in one of my tests, where I could see it in the answer (line 9 was already there; I only added the removal of orphaned tags).
Summary
Adds support for Ollama's `reasoning` parameter to properly handle reasoning/thinking models like deepseek-r1.

Changes

New setting:
- `llm.ollama.reasoning` (checkbox, default: `true`)
- Reasoning content is separated into `additional_kwargs["reasoning_content"]`
Code changes:
- Updated `llm_config.py` to pass the `reasoning` parameter to `ChatOllama`
- Added the setting to `default_settings.json`

Bug fix:
- Progress display capped at 100% in the page title
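For illustration, a small sketch of the resulting behavior — this assumes a langchain-ollama version that supports the `reasoning` flag, and the model name is only an example:

```python
from langchain_ollama import ChatOllama

# Assumes a langchain-ollama release supporting the `reasoning` flag;
# "deepseek-r1" is an example model name.
llm = ChatOllama(model="deepseek-r1", reasoning=True)
response = llm.invoke("What is 17 * 23?")

print(response.content)  # final answer, without the <think> block
# chain-of-thought, separated out by LangChain (discarded by LDR)
print(response.additional_kwargs.get("reasoning_content"))
```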
Why This Matters
Models like deepseek-r1 output structured thinking blocks (chain-of-thought reasoning) that should be excluded from final answers. Without this parameter, the thinking blocks appear in research results, making them verbose and harder to read.
With `reasoning=True` (recommended default), the thinking is separated and the user gets clean, concise answers.

Test Plan
🤖 Generated with Claude Code