Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

alibaba/EfficientAI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

12 Commits

Repository files navigation

๐Ÿš€ EfficientAI

Efficient Inference for LLMs & MLLMs
An open-source research project from Alibaba Cloud dedicated to efficient large language model inference.

EfficientAI Banner

License Papers Stars Issues


๐Ÿ“‹ Table of Contents


โœจ Key Features

EfficientAI focuses on inference-time optimizations for LLMs and MLLMs:

Feature Description Status
๐Ÿ”น Activation Sparsity Dynamic sparsity methods for faster inference โœ… LaRoSa (ICML 2025)
๐Ÿ”น Quantization Post-training & quantization-aware techniques for MLLMs โœ… MASQuant (CVPR 2026)
๐Ÿ”น Agentic Reasoning Efficient tool-use and reasoning frameworks โœ… D-CORE ( ICML 2026)
๐Ÿ”น Reproducible Benchmarks Standardized eval pipelines for research & production ๐Ÿ”„ In Progress

๐Ÿ”ฅ Latest Updates

๐Ÿ“ฐ Changelog (Click to expand)

๐Ÿ“ฆ Installation

# Clone the repository
git clone https://github.com/alibaba/EfficientAI.git
cd EfficientAI
# Install dependencies (recommended: use conda)
pip install -r requirements.txt
# Optional: Install with specific module support
# pip install -e ".[larosa]" # for LaRoSa
# pip install -e ".[masquant]" # for MASQuant

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

Contributors

Languages

AltStyle ใซใ‚ˆใฃใฆๅค‰ๆ›ใ•ใ‚ŒใŸใƒšใƒผใ‚ธ (->ใ‚ชใƒชใ‚ธใƒŠใƒซ) /