Open
@reneleonhardt
Description
🚀 The feature, motivation and pitch
IBM is at the forefront of building highly efficient LLMs for edge devices, for example Granite-4.0-H-Micro.
Could Granite models be supported? They would offer responsible, high-throughput, high-quality inference with extremely low memory consumption.
Alternatives
Qwen3-Next achieves many similar breakthroughs, but it does not offer a small model yet.
https://qwen3-next.com/
Additional context
RFC (Optional)
No response