fix: engine cannot give response to the second user request with stre... #701


Open
lihaiyin88 wants to merge 1 commit into mlc-ai:main from lihaiyin88:main

Conversation

@lihaiyin88 commented Jun 24, 2025 (edited)

see title

Member

thanks a lot for the contribution. Would it be possible for you to provide a script for reproducing the issue / elaborate on the issue? Thank you!


@Copilot (Copilot AI) left a comment


Pull Request Overview

This PR restructures the streaming generation method in MLCEngine to ensure the model lock is always released once, fixing an issue where the engine could not handle a second streaming request.

  • Flattened multiple nested try/catch blocks into a single try + finally around the main logic.
  • Converted inner helper functions (_countTrailingReplacementChar, _getChunk) to arrow function expressions.
  • Removed redundant lock.release() calls spread across catches; now released once in finally.
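The restructured control flow can be sketched as follows. This is a simplified stand-in, not the actual MLCEngine source: the `Lock` class here is a hypothetical minimal mutex, and `streamChunks` stands in for the engine's streaming generator. The point it illustrates is the one the PR makes: with a single `try`/`finally` around the generator body, the lock is released exactly once whether the stream completes or throws, so a second streaming request can acquire it.

```typescript
// Hypothetical minimal async mutex (the real engine has its own lock type).
class Lock {
  private locked = false;
  private waiters: Array<() => void> = [];

  async acquire(): Promise<void> {
    if (!this.locked) {
      this.locked = true;
      return;
    }
    await new Promise<void>((resolve) => this.waiters.push(resolve));
  }

  release(): void {
    const next = this.waiters.shift();
    if (next) next(); // hand the lock to the next waiter
    else this.locked = false;
  }
}

const lock = new Lock();

// Async generator guarded by the lock. `finally` runs when the generator
// finishes, throws, or is closed early, so the lock is released exactly once.
async function* streamChunks(chunks: string[]): AsyncGenerator<string> {
  await lock.acquire();
  try {
    for (const c of chunks) {
      yield c;
    }
  } finally {
    lock.release();
  }
}

// Two consecutive streaming requests: the second can only proceed
// because the first released the lock in its `finally` block.
async function demo(): Promise<[string, string]> {
  const first: string[] = [];
  for await (const c of streamChunks(["a", "b"])) first.push(c);
  const second: string[] = [];
  for await (const c of streamChunks(["c", "d"])) second.push(c);
  return [first.join(""), second.join("")];
}

demo().then(([a, b]) => console.log(a, b));
```

With the previous structure of scattered `release()` calls inside nested catches, an error path could miss the release (or release twice), leaving the second request waiting on `acquire()` forever.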
Comments suppressed due to low confidence (2)

src/engine.ts:706

  • This TODO is still open—either implement the usage support for non-chat completions or file a tracking issue to ensure it isn't overlooked.
 // TODO(Charlie): support usage for completion

src/engine.ts:483

  • Add a test case that performs two consecutive streaming requests (with and without errors) to verify the lock is always released and reacquired correctly.
 genConfig: GenerationConfig,

// Since it is an async generator, we need to do fine-grained try-catch to ensure lock is
// released only when errors occur. Then release at the very end when no error occurs.
// TODO: This makes code less readable, is there a better way to do this?
const lock = this.loadedModelIdToLock.get(model)!;
Copilot AI Jun 24, 2025


Consider adding a brief comment above this try/finally to clarify its scope is for lock acquisition and release, improving future readability.

Suggested change:
- const lock = this.loadedModelIdToLock.get(model)!;
+ const lock = this.loadedModelIdToLock.get(model)!;
+ // Acquire the lock and ensure its release in the `finally` block.


Author

> thanks a lot for the contribution. Would it be possible for you to provide a script for reproducing the issue / elaborate on the issue? Thank you!

Hey there! So sorry for the super late reply! 😊
I've just pushed all the updates to the forked project right here: https://github.com/lihaiyin88/web-llm/tree/demo_for_issue
If you have any questions at all, don't hesitate to let me know! I'm all ears! 😎
