Ruman Kim rumaniel
- Seoul, Korea
- https://rumaniel.github.io/
Highlights
- Pro
Organizations
@mit-gamesML
FILM: Frame Interpolation for Large Motion, In ECCV 2022.
COYO-700M: Large-scale Image-Text Pair Dataset
π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
An easy 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, a...
PyTorch reimplementation of Diffusion Models
stable diffusion finetuned on weeb stuff
Automatically create masks for Stable Diffusion inpainting using natural language.
Robust Speech Recognition via Large-Scale Weak Supervision
A neat Discord bot to run Stable Diffusion locally
Cross-platform, customizable ML solutions for live and streaming media.
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
Stable Diffusion web UI
π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The ...
notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references und...
A simple notebook demonstrating prompt-based music generation via Mubert API
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Yet another PyTorch implementation of Stable Diffusion (probably easy to read)
High Resolution Depth Maps for Stable Diffusion WebUI
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Stable Diffusion with Core ML on Apple Silicon