AirLLM 70B inference with single 4GB GPU
-
Updated
Mar 10, 2026 - Jupyter Notebook
AirLLM 70B inference with single 4GB GPU
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
An open-source educational chat model from ICALK, East China Normal University. 开源中英教育对话大模型。(通用基座模型,GPU部署,数据清理) 致敬: LLaMA, MOSS, BELLE, Ziya, vLLM
⚡ Zero-Stall MoE Inference via Lookahead Prediction & Async DMA Prefetching. Optimized for SSD I/O with Hybrid MLA+Sliding Window Attention.
Formal Psychological Models of Categorization and Learning
A simple prompt-based approach to detecting prompt injection and jailbreaking attempts using small, self-hosted language models.
Brick of Knowledge on Open Models : Open Source, Open Science, Open Education, Open Collaboration, Open Hardware...
🇫🇷 parler: Multilingual voice intelligence built on Mistral Voxtral model — decision logs from French/English meetings
Air.rs 70B+ inference on consumer GPU, LLM inference in Rust
A framework that enables consistent assessment across environmental claims.
Desktop AI coding-agent platform (Tauri + Rust) for racing, evaluating & orchestrating LLMs across Anthropic and open models (Kimi, MiniMax, DeepSeek, GLM, Qwen) — with a deterministic Axolotl Civilization arena the models compete in.
A simple Python project that brings Google Gemini to your terminal using a free Google Gemini API key.
Systematic scoping review paper for healthcare DES model sharing.
Space and tools for the digital world.
Open-source, locally runnable text-to-video generation — inspired by Sora, built for everyone. Run cinematic video synthesis offline, with full control and transparency.
🚀 Optimize memory for large language models, enabling 70B models on a 4GB GPU and 405B Llama3.1 on 8GB VRAM without compression techniques.
Monte Carlo scenario engine modeling UAE post-conflict recovery. Duration as regime variable. 1,500 paths ×ばつ 7 sectors ×ばつ 48 months. Open, falsifiable, forkable.
Markdown benchmark-card generator for open model evaluation results.
Browse license metadata for popular open models, sourced from Hugging Face model cards, tags, and repo files.
Herramienta de diagnóstico y análisis de sistemas comunitarios, no técnicos, sino sociales, productivos y organizativos.
Add a description, image, and links to the open-models topic page so that developers can more easily learn about it.
To associate your repository with the open-models topic, visit your repo's landing page and select "manage topics."