🌱 Interests: computer vision and multimodal large language model
🔭 Publications: Google scholar
📬 Reach out to me: lyuwenyu@foxmail.com
🌱 Interests: computer vision and multimodal large language model
🔭 Publications: Google scholar
📬 Reach out to me: lyuwenyu@foxmail.com
Instance Capability Tagger(InsCapTagger) is a multimodal data capability tagging model. 多模态数据能力标签模型,可用于图文数据分析和处理(e.g. 基于信息密度的数据过滤方案、基于模型能力的数据配比方案)。 🔥 🔥 🔥
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high ...