Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Stable Diffusion web UI
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Robust Speech Recognition via Large-Scale Weak Supervision
FastAPI framework, high performance, easy to learn, fast to code, ready for production
🧑‍🏫 60+ implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
A lightweight coding agent for open models like DeepSeek, Kimi, and Qwen
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
LlamaIndex is the leading document agent and OCR platform
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i...
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Making large AI models cheaper, faster and more accessible
Official inference framework for 1-bit LLMs
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
DSPy: The framework for programming—not prompting—language models
A modular graph-based Retrieval-Augmented Generation (RAG) system
ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Generative Models by Stability AI