Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Releases: google-ai-edge/LiteRT-LM

v0.13.1

03 Jun 20:52
@ztenghui ztenghui

Choose a tag to compare

Bug fixes based on 0.13.0

Assets 4
Loading
MarcinTZuraw reacted with thumbs up emoji DJDrama, MarcinTZuraw, and oliver021 reacted with heart emoji
3 people reacted

v0.13.0

02 Jun 17:48
@ztenghui ztenghui

Choose a tag to compare

🔥 What's New (v0.13.0)

  • 🚀 Agent skill support: Support creating a standalone LiteRT-LM Android demo app with backend selection and multi-modality support. See README.md for the example prompt.
  • LiteRT LM CLI update: Support OpenAI API Compatible server (doc)
  • Swift package for MacOS: Swift package supports MacOS besides iOS now.
Loading
Junzz0, nt-dongjun-lee, yoshi-m-ai, getStRiCtd, and komodgn reacted with hooray emoji
5 people reacted

v0.12.0

18 May 20:53
@ztenghui ztenghui

Choose a tag to compare

🔥 What's New (v0.12.0)

  • 🚀 Swift APIs: Natively integrate LiteRT-LM into iOS applications with Metal GPU acceleration.
  • 🚀 Web JavaScript APIs: Run models inside web browsers with high performance via web GPU/CPU.
  • LiteRT-LM CLI Update: The command-line interface now supports NPU, besides CPU and GPU backends across Linux, macOS, and Windows.
  • 🚀 Community-Maintained Flutter APIs: Build cross-platform Flutter applications using the community flutter_gemma package.

Features and bug fixes:

CLI

  • [feature] NPU support for Intel OpenVINO with --backend=npu.
  • [feature] Add --max-num-tokens (context length) to benchmark
  • [bugfix] Pin CLI version with API version. (0.12.0 CLI uses 0.12.0 API)

Python API

  • [feature] NPU support for Intel OpenVINO.
  • [feature] New API to construct Message object.
  • [bugfix] Correct the GPU activation type. Prefill speed back to normal (was limited to 50%).
  • [bugfix] Propagate cache_dir to vision and audio backend.
Loading
Junzz0, timwang-heidi, shahizat, and lin72h reacted with thumbs up emoji dolphinDoReMi, shahizat, and lin72h reacted with laugh emoji ken0nek, nt9142, whhone, SimonSchubert, Monkeygamer2010, helenkwok, Herklos, J-CP-Betclic, mhbl3, shahizat, and 3 more reacted with rocket emoji
16 people reacted

v0.11.0

07 May 03:35
@ztenghui ztenghui

Choose a tag to compare

🔥 What's New: v0.11.0

  • Gemma 4 Multi-token Prediction (MTP) Support: Supercharge Gemma 4 on-device inference with Single Position Multi Token Prediction (MTP), delivering >2x faster decode speeds on mobile GPUs with zero quality degradation (blog, documentation).

  • Windows Native Support: The LiteRT-LM CLI now runs natively on Windows with both CPU and GPU backend support.

Loading
4ntoine, prosperenergy, Junzz0, Monkeygamer2010, chamav, Ashu135, wrakrnphumhan63, condr-at, hskim084, and MahadMuhammad reacted with thumbs up emoji Klrfl, chris-hatton, prosperenergy, Monkeygamer2010, and MahadMuhammad reacted with heart emoji chris-hatton, prosperenergy, Monkeygamer2010, LaKanDoR, Heitorkk2, condr-at, and MahadMuhammad reacted with rocket emoji
14 people reacted

v0.11.0-rc.1

30 Apr 20:42
@ztenghui ztenghui

Choose a tag to compare

v0.11.0-rc.1 Pre-release
Pre-release

Release candidate for 0.11.0
Bug fixes

Loading
titatheking and Monkeygamer2010 reacted with thumbs up emoji Monkeygamer2010 and vinodmamilla reacted with eyes emoji
3 people reacted

v0.10.2

14 Apr 23:57
@ztenghui ztenghui

Choose a tag to compare

  • Various Bug fixes
  • Improve the UI smoothness
Loading
Nico-las3, Monkeygamer2010, VITALFORCE757, Giuseppe-Toscano, surajkant89, shitaoliu, skundu42, jshptai, masud7777r, and Seth-crypto reacted with heart emoji Monkeygamer2010, fikrikarim, Minatozaki-A, Nico-las3, VITALFORCE757, surajkant89, iamPulakesh, shitaoliu, skundu42, wanfuton, and 4 more reacted with rocket emoji
18 people reacted

v0.10.1

03 Apr 01:51
@advaitjain advaitjain

Choose a tag to compare

🔥 Gemma 4 support

Deploy Gemma 4 across a broad range of hardware with stellar performance (blog).

👉 Try on Linux, macOS, Windows (WSL) or Raspberry Pi with the
LiteRT-LM CLI:

litert-lm run \
 --from-huggingface-repo=litert-community/gemma-4-E2B-it-litert-lm \
 gemma-4-E2B-it.litertlm \
 --prompt="What is the capital of France?"

Release Notes

  • CLI Enhancements & Migration: Migrated the CLI from fire to click, adding features like --verbose, --version, improved help formatting, and enhanced terminal output styling (#1784, #1733, #1791, #1792).
  • Hugging Face Integration: Added support for importing models directly from Hugging Face and implemented auto-conversion for missing models during "run" commands (#1797, #1735).
  • Core Performance & Features: Introduced a LiteRT-based KV cache implementation, speculative decoding support, and improved context merging for conversation history (#1601, #1793, #1742).
  • Platform & Build Improvements: Refactored CMake for better Android/cross-compilation support, updated the Windows build with a CPU sampler workaround, and transitioned nightly releases to Ubuntu-22.04 (#1741, #1734, #1772).
  • API & Documentation: Expanded the Kotlin API for response channel configuration and launched new Python API resources, including a "Getting Started" guide and a Colab notebook (#1724, #1737, #1757).
Loading
Monkeygamer2010, ZaurezTimmapuri, sadbinsiddique, Harshsharma-0, and kostyabelousov001-hue reacted with hooray emoji junior4bayano-cloud, Monkeygamer2010, Proximusomega, and Harshsharma-0 reacted with heart emoji josephbou, ZibanPirate, ibodev1, whhone, ptitjes, Xlone9773, Monkeygamer2010, oliverne, tyabu12, mohamedragui1-art, and 2 more reacted with rocket emoji
17 people reacted

v0.9.0

27 Mar 20:44
@ztenghui ztenghui

Choose a tag to compare

Android & iOS Update

  • Performance Optimizations: Significant improvements to app initialization speed and memory management.

  • Bug Fixes: General stability enhancements for a smoother user experience.

Loading
hung-yueh, ptitjes, gynsolomon, sean-noh239, jshptai, DevMaker-face, and Proximusomega reacted with heart emoji
7 people reacted

v0.9.0-rc

25 Mar 21:26
@ztenghui ztenghui

Choose a tag to compare

v0.9.0-rc Pre-release
Pre-release

Android / iOS release
With many bug fixes and performance improvements.

Loading
ptitjes, nricklin, hung-yueh, and raghavs4589-stack reacted with heart emoji
4 people reacted

v0.9.0-beta

13 Mar 00:18
@ztenghui ztenghui

Choose a tag to compare

v0.9.0-beta Pre-release
Pre-release

Beta release for v0.9.0.

Loading
raghavs4589-stack reacted with hooray emoji shanta765 reacted with rocket emoji shanta765 reacted with eyes emoji
2 people reacted
Previous 1 3
Previous

AltStyle によって変換されたページ (->オリジナル) /