Pull requests: InternLM/lmdeploy
[Add] add Qwen3-8B accuracy evaluation in llm_compressor.md
#4319, opened Feb 3, 2026 by 43758726
Negative KV sequence length error in Attention op
#4316, opened Feb 2, 2026 by jinminxi104
Compatible with transformers 5.0 at TurboMind side [improvement]
#4304, opened Jan 28, 2026 by lvhan028
fix rotary embedding for transformers v5 [improvement]
#4303, opened Jan 28, 2026 by grimoire
change ascend paged attention from BSH format to TND format for better performance
#4295, opened Jan 27, 2026 by jinminxi104 (Draft)
Support ignore layers in quant config for qwen3 models [improvement]
#4293, opened Jan 26, 2026 by RunningLeon
feat: implement online bf16-to-fp8 conversion and inference in TurboMind [improvement]
#4237, opened Dec 25, 2025 by 43758726
Support fp32 head for qwen and internlm models [improvement]
#4160, opened Nov 27, 2025 by RunningLeon
Add step_map to track token decoding order in DLLM
#4057, opened Oct 21, 2025 by Auraithm (4 tasks done)
quant blocked fp8 [enhancement: New feature or request]
#4018, opened Sep 29, 2025 by CUHKSZzxy (4 of 5 tasks)
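For readers unfamiliar with the technique named in #4018, blocked (per-block) fp8 quantization assigns each tile of a weight matrix its own scale so every tile fits the narrow fp8 dynamic range. The sketch below is illustrative only and is not taken from the PR: the 128x128 block size is an assumption, and the final rounding to fp8 e4m3 bits is elided (values are kept in float to show just the scaling scheme).

```python
import numpy as np

FP8_E4M3_MAX = 448.0  # largest finite value in the fp8 e4m3 format


def quantize_blocked_fp8(w: np.ndarray, block: int = 128):
    """Scale each (block x block) tile of `w` into the fp8 e4m3 range.

    Returns the scaled tensor (same shape as `w`) and one scale per tile.
    Block size 128 is an illustrative assumption, not the PR's choice.
    """
    rows, cols = w.shape
    assert rows % block == 0 and cols % block == 0
    # View the matrix as a grid of tiles: (n_row_tiles, block, n_col_tiles, block)
    tiles = w.reshape(rows // block, block, cols // block, block)
    # Per-tile absolute maximum determines the per-tile scale.
    amax = np.abs(tiles).max(axis=(1, 3), keepdims=True)
    scale = amax / FP8_E4M3_MAX
    scale = np.where(scale == 0, 1.0, scale)  # avoid dividing by zero for all-zero tiles
    q = np.clip(tiles / scale, -FP8_E4M3_MAX, FP8_E4M3_MAX)
    # A real kernel would now round `q` to fp8 e4m3 bit patterns; omitted here.
    return q.reshape(rows, cols), scale.squeeze(axis=(1, 3))
```

Because the rounding step is omitted, multiplying each tile of the output back by its scale recovers the input exactly, which makes the scaling scheme easy to verify in isolation.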
add ppu quick start doc [documentation: Improvements or additions to documentation]
#3841, opened Aug 14, 2025 by guozixu2001