View enjoyyi00's full-sized avatar
🎯
Focusing
Yi Jiang enjoyyi00
🎯
Focusing
Large Language Model & Generative Models
-
Bytedance Seed
- San Jose
- https://enjoyyi.github.io/
- @Enjoy_Yi
Pinned Loading
-
FoundationVision/VAR
FoundationVision/VAR Public[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A...
-
FoundationVision/Waver
FoundationVision/Waver PublicIndustry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.
-
FoundationVision/Liquid
FoundationVision/Liquid Public(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
-
FoundationVision/Groma
FoundationVision/Groma Public[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
-
FoundationVision/UniTok
FoundationVision/UniTok Public[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
-
FoundationVision/Infinity
FoundationVision/Infinity Public[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.