π
Through it all
Yuqian Yuan CircleRadon
Computer Vision; PhD student @ ZJU
- Zhejiang University
- Hangzhou
- https://yuqianyuan.github.io/
Highlights
- Pro
Pinned
- TokenPacker (Public): The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV 2025
- DAMO-NLP-SG/PixelRefer (Public): [CVPR 2025] The code for "VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM"
- DAMO-NLP-SG/VideoLLaMA3 (Public): Frontier Multimodal Foundation Models for Image and Video Understanding
- alibaba-damo-academy/RynnEC (Public): RynnEC: Bringing MLLMs into Embodied World
- EvolvingLMMs-Lab/lmms-eval (Public): One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks