Overview of self-supervised learning of tiny models, including distillation-based methods (aks. self-supervised distillation) and non-distillation methods.
-
Updated
Nov 13, 2022
Overview of self-supervised learning of tiny models, including distillation-based methods (aks. self-supervised distillation) and non-distillation methods.
π Explore recursive reasoning with TinyRecursiveModels, a compact 7M parameter neural network achieving high scores on tough tasks without massive resources.
π Streamline parallel development with Ralph MCP: run multiple PRDs in isolated workspaces, ensuring quality and efficient merging.
First 1-bit (BitNet b1.58) recursive reasoner for Sudoku-Extreme - distilled from a 7M-param FP TRM teacher into a 1.4 MB ternary student
Phi-3-Vision model test - running locally
Local, project-scoped memory system for LLMs with evidence-based truth validation. Provides reliable long-term context via OpenAI-compatible Proxy and MCP, using Chain-of-Verification (CoVe) to eliminate hallucinations and the Ralph Loop for autonomous codebase repair.
Testing the Moondream tiny vision model
Train the smallest LM you can that fits in 16MB. Best model wins!
π Implement the Tiny Recursive Model (TRM) for improved performance in recursive tasks, building on the HRM framework by Sapient AI.
Add a description, image, and links to the tiny-models topic page so that developers can more easily learn about it.
To associate your repository with the tiny-models topic, visit your repo's landing page and select "manage topics."