Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Pull requests: vectorch-ai/ScaleLLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

refactor: clean up load_state_dict for quant linear
#504 opened Oct 8, 2025 by guocuimi Loading...
kernel: support fp8 kv cache
#381 opened Jan 21, 2025 by guocuimi Loading...
[WIP] Llava support
#352 opened Nov 20, 2024 by guocuimi Loading...
feat: added marlin qlinear support
#303 opened Aug 9, 2024 by guocuimi Loading...
bugfix: fix multiple definition issue.
#261 opened Jul 3, 2024 by liutongxuan Loading...
[wip] feat: add embeddings support
#246 opened Jun 20, 2024 by guocuimi Loading...
[model] add support for mixtral moe model
#128 opened Apr 16, 2024 by 936187425 Loading...
benchmark test script
#124 opened Apr 13, 2024 by ShijiaTang Loading...
ProTip! Exclude everything labeled bug with -label:bug.

AltStyle によって変換されたページ (->オリジナル) /