Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
@kdexd
kdexd
Follow

Karan Desai kdexd

💎
🙌
I do computer vision. Prev: CS PhD at the University of Michigan.

Block or report kdexd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Minimal Claude Code alternative. Single Python file, zero dependencies, ~250 lines.

Python 1,991 175 Updated Jan 14, 2026

✨ An advanced 3D Gaussian Splatting renderer for THREE.js

TypeScript 1,731 157 Updated Feb 17, 2026

Improved sampling via learned diffusions (ICLR2024) and an optimal control perspective on diffusion-based generative modeling (TMLR2024)

Python 72 10 Updated Mar 14, 2025

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 5,284 399 Updated Apr 21, 2025

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a...

Rust 6,060 549 Updated Feb 17, 2026

Official inference repo for FLUX.1 models

Python 25,209 1,854 Updated Jul 31, 2025

Utilities intended for use with Llama models.

Python 7,482 1,332 Updated Feb 11, 2026

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,986 135 Updated Nov 7, 2025

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 4,363 373 Updated Oct 19, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,234 1,286 Updated May 23, 2024

The official Meta Llama 3 GitHub site

Python 29,250 3,512 Updated Jan 26, 2025

Lightplane implements a highly memory-efficient differentiable radiance field renderer, and a module for unprojecting features from images to 3D grids.

Python 285 9 Updated Aug 6, 2024

A PyTorch native platform for training generative AI models

Python 5,076 707 Updated Feb 17, 2026

Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...

Python 320 26 Updated Dec 9, 2023

Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.

Python 102 2 Updated Mar 23, 2025

NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024

Python 1,813 75 Updated Nov 27, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,318 1,004 Updated Jul 1, 2024

Fast bare-bones BPE for modern tokenizer training

Python 176 6 Updated Jun 23, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 17,299 1,379 Updated Feb 8, 2026

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,893 245 Updated Feb 17, 2026

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

1,099 45 Updated Sep 27, 2024

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,399 69 Updated Aug 4, 2025

Building blocks for foundation models.

602 29 Updated Jan 3, 2024

MLX: An array framework for Apple silicon

C++ 23,964 1,515 Updated Feb 17, 2026

A batched offline inference oriented version of segment-anything

Python 1,322 80 Updated Aug 22, 2025

Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"

Jupyter Notebook 962 95 Updated Jun 22, 2024

The Modular Platform (includes MAX & Mojo)

Mojo 25,591 2,775 Updated Feb 17, 2026

Fast Implementation of Generalised Geodesic Distance Transform for CPU (OpenMP) and GPU (CUDA)

C++ 106 16 Updated Feb 9, 2024

Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023

Python 197 23 Updated Aug 23, 2023

Fast and memory-efficient exact attention

Python 22,272 2,388 Updated Feb 16, 2026
Next

AltStyle によって変換されたページ (->オリジナル) /