rithvik Rithvikcc
Stars
LLM Council works together to answer your hardest questions
Analyzing Hacker News discussions from a decade ago in hindsight with LLMs
Perplexica is an AI-powered answering engine. It is an Open source alternative to Perplexity AI
A minimal GPU design in Verilog to learn how GPUs work from the ground up
lightweight, standalone C++ inference engine for Google's Gemma models.
The official PyTorch implementation of Google's Gemma models
A high-throughput and memory-efficient inference and serving engine for LLMs
Code and documentation to train Stanford's Alpaca models, and generate the data.
RuLES: a benchmark for evaluating rule-following in language models
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
π¦π The platform for reliable agents.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Implementation of Diffusion Transformer (DiT) in JAX
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A lightweight library for portable low-level GPU computation using WebGPU.
the scott CPU from "But How Do It Know?" by J. Clark Scott
Official inference repo for FLUX.1 models
Official repository for our work on micro-budget training of large-scale diffusion models.
Implementing DeepSeek R1's GRPO algorithm from scratch
The simplest, fastest repository for training/finetuning small-sized VLMs.
Simple MPI implementation for prototyping or learning
A huge chunk of my personal notes since I started playing CTFs and working as a Red Teamer.
A guide to LLM hacking: fundamentals, prompt injection, offense, and defense
A curated list of data science blogs