Context aware filtering with research_with_chunking method (issue #647) #915

Open

devtcu wants to merge 1 commit into LearningCircuit:dev

from devtcu:feature/context-window-chunking

Conversation

devtcu

Collaborator

@devtcu devtcu commented Oct 7, 2025

Adds a research_with_chunking() method to LDRClient that automatically limits the number of sources when too many are found, with the aim of preventing context window crashes.

Limits to top 10 sources by default (configurable)
Only applies chunking when needed
Shows when chunking was used
Drop-in replacement for quick_research()
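
As a rough illustration of the behaviour listed above (not the code in this PR), the limiting step might look like the sketch below. It assumes the research result is a dict whose "sources" list is already ordered best-first; the function name `limit_sources` and the metadata keys `chunking_applied` and `original_source_count` are hypothetical and chosen only for this example.

```python
from typing import Any, Dict


def limit_sources(result: Dict[str, Any], max_sources: int = 10) -> Dict[str, Any]:
    """Return a copy of `result` holding at most `max_sources` sources.

    Assumption: result["sources"] is already sorted best-first, so keeping
    the head of the list keeps the "top" sources.
    """
    sources = result.get("sources", [])
    limited = dict(result)  # shallow copy so the caller's dict is not mutated

    if len(sources) > max_sources:
        # Only touch the result when the limit is actually exceeded.
        limited["sources"] = sources[:max_sources]
        limited["chunking_applied"] = True  # hypothetical metadata key
        limited["original_source_count"] = len(sources)  # hypothetical key
    else:
        limited["chunking_applied"] = False

    return limited
```

Note that a step like this runs after the research has finished, so it only shortens what is returned to the caller.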

@devtcu


 Add research_with_chunking method to prevent context window crashes.

4e2d1e3

@Copilot Copilot AI review requested due to automatic review settings

October 7, 2025 23:01

@devtcu devtcu requested review from HashedViking, LearningCircuit and djpetti as code owners

October 7, 2025 23:01

Copilot

Copilot AI reviewed

Oct 7, 2025

Contributor

@Copilot Copilot AI left a comment

Pull Request Overview

This PR adds a new research_with_chunking() method to the LDRClient class to prevent context window crashes by automatically limiting the number of sources processed when research returns too many results.

Adds research_with_chunking() method that acts as a drop-in replacement for quick_research()
Implements automatic source limiting to top 10 sources (configurable) when needed
Provides feedback when chunking is applied through result metadata

src/local_deep_research/api/client.py

self, query: str, max_chunk_size: int = 10, **kwargs

) -> Dict[str, Any]:

"""

Trying detect situations in which context will be a problem, and automatically break the operation up into multiple separate filtering operation

Copilot AI Oct 7, 2025

Fixed grammatical errors in the docstring.

Suggested change

Trying detect situations in which context will be a problem, and automatically break the operation up into multiple separate filtering operation

Attempts to detect situations in which the context window would be exceeded, and automatically breaks the operation into multiple separate filtering operations.

src/local_deep_research/api/client.py

if not self.logged_in:

raise RuntimeError("Not logged in. Call login() first.")

# lets get initial results

Copilot AI Oct 7, 2025

Fixed capitalization and apostrophe in comment.

Suggested change

# lets get initial results

# Let's get initial results

src/local_deep_research/api/client.py

sources = result.get("sources", [])

# if there too many sources, keep only the best ones..

Copilot AI Oct 7, 2025

Fixed grammatical error in comment.

Suggested change

# if there too many sources, keep only the best ones..

# if there are too many sources, keep only the best ones.

src/local_deep_research/api/client.py

Comment on lines +377 to +379

def research_with_chunking(

self, query: str, max_chunk_size: int = 10, **kwargs

) -> Dict[str, Any]:

Copilot AI Oct 7, 2025

The method name and parameter 'max_chunk_size' are misleading. This method doesn't actually chunk sources into multiple requests, it simply truncates to the top N sources. Consider renaming to 'research_with_limit' and the parameter to 'max_sources' for clarity.

@devtcu devtcu changed the title from "Fix context window crashes with research_with_chunking method (fixes #647)" to "Fix context window crashes with research_with_chunking method (issue #647)"

Oct 7, 2025

@devtcu devtcu changed the title from "Fix context window crashes with research_with_chunking method (issue #647)" to "Context aware filtering with research_with_chunking method (issue #647)"

Oct 7, 2025

@devtcu devtcu linked an issue that may be closed by this pull request

Oct 7, 2025

Context-Aware Filtering #647

Open

djpetti

djpetti requested changes

Oct 8, 2025

Collaborator

@djpetti djpetti left a comment

The overall idea is reasonable, but I don't see how the provided code actually solves the problem it's purporting to solve. AFAIK, it runs a normal research and then limits the sources after the fact. I don't see how that will help with context window issues.

As a side note, the proper way to deal with this is probably to modify CrossEngineFilter to filter the sources in chunks instead of all at once. @devtcu You're welcome to attempt that change if you want 😄. I can help you if you get stuck!
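
To make the suggestion concrete, here is a minimal sketch of filtering sources in chunks rather than all at once. The real CrossEngineFilter interface is not shown in this thread, so `filter_fn` below is a hypothetical stand-in for whatever per-batch relevance filter it uses, and `iter_chunks` and `filter_in_chunks` are names invented for this example.

```python
from typing import Callable, Dict, Iterator, List

Source = Dict[str, str]


def iter_chunks(items: List[Source], chunk_size: int) -> Iterator[List[Source]]:
    """Yield consecutive slices of `items`, each at most `chunk_size` long."""
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    for start in range(0, len(items), chunk_size):
        yield items[start:start + chunk_size]


def filter_in_chunks(
    sources: List[Source],
    filter_fn: Callable[[List[Source]], List[Source]],
    chunk_size: int = 10,
) -> List[Source]:
    """Apply `filter_fn` to one chunk at a time and merge what it keeps.

    `filter_fn` is a hypothetical stand-in for a context-limited filter
    (for example, an LLM relevance check). It never sees more than
    `chunk_size` sources in a single call.
    """
    kept: List[Source] = []
    for chunk in iter_chunks(sources, chunk_size):
        kept.extend(filter_fn(chunk))
    return kept


if __name__ == "__main__":
    docs = [{"title": f"doc {i}"} for i in range(25)]
    # Toy filter: keep sources whose number is even.
    evens = filter_in_chunks(
        docs, lambda chunk: [s for s in chunk if int(s["title"].split()[1]) % 2 == 0]
    )
    print(len(evens))
```

One design caveat with this approach: each call judges only its own chunk, so the merged list is not a global ranking. If a combined ordering matters, a second pass over the kept sources would probably be needed.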

@LearningCircuit LearningCircuit added the discussion label

Oct 11, 2025

Labels

discussion

3 participants

@devtcu @djpetti @LearningCircuit
