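For the 2025 survey literature, click here ↘️ 2025-CV-Surveys
Computer vision surveys published in 2025, covering object detection, tracking, and more.
📗📗📗 Reply "CV综述" in the backend of the WeChat official account 【我爱计算机视觉】 to receive a bundled download of all papers listed here. As of October 31, 429 surveys have been published.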
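January: 36 surveys.
February: 50 surveys.
March: 45 surveys.
April: 41 surveys.
May: 56 surveys.
June: 39 surveys.
July: 49 surveys (running total: 316).
August: 36 surveys (running total: 352).
September: 31 surveys (running total: 383).
October: 46 surveys (running total: 429).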
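The running totals follow directly from the monthly counts. A minimal Python sketch of that arithmetic, using the figures given above (the variable names are illustrative and not part of this repository):

```python
from itertools import accumulate

# Monthly survey counts for January through October 2025, copied from the list above.
monthly_counts = [36, 50, 45, 41, 56, 39, 49, 36, 31, 46]

# Running total after each month.
running_totals = list(accumulate(monthly_counts))

for month, (count, total) in enumerate(zip(monthly_counts, running_totals), start=1):
    print(f"month {month:2d}: {count} surveys, {total} so far")
```

The totals after July, August, September, and October match the figures 316, 352, 383, and 429.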
| 🐱 | 🐶 | 🐯 | 🐺 |
|---|---|---|---|
| 1. Unknown (uncategorized) | | | |
- Recent Advances in Out-of-Distribution Detection with CLIP-Like Models: A Survey [2025-05-06]
- From Pixel to Mask: A Survey of Out-of-Distribution Segmentation [2025-08-18]

- Reconstructing 4D Spatial Intelligence: A Survey ⭐code [2025-07-29]
- 3D and 4D World Modeling: A Survey ⭐code [2025-09-11]
- Advances in 4D Representation: Geometry, Motion, and Interaction ⭐code [2025-10-23]

- Computational Imaging for Enhanced Computer Vision [2025-09-11]

- Deep Learning for Crack Detection: A Review of Learning Paradigms, Generalizability, and Datasets ⭐code [2025-08-18]

- Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques [2025-07-31]

- Machine Learning Applications to Diffuse Reflectance Spectroscopy in Optical Diagnosis; A Systematic Review [2025-03-06]
- Prompt Mechanisms in Medical Imaging: A Comprehensive Survey [2025-07-03]
- Reinforcement learning
- Contrastive learning
- Continual learning
- Class-incremental learning
- Adversarial

- Unmasking Synthetic Realities in Generative AI: A Comprehensive Review of Adversarially Robust Deepfake Detection Systems ⭐code [2025-07-30]

- A survey of datasets for computer vision in agriculture ⭐code [2025-02-25]
- Advancing Wheat Crop Analysis: A Survey of Deep Learning Approaches Using Hyperspectral Imaging ⭐code [2025-05-05]
- Vision Transformers in Precision Agriculture: A Comprehensive Survey [2025-05-01]
- Domain Adaptation in Agricultural Image Analysis: A Comprehensive Review from Shallow Models to Deep Learning [2025-06-09]
- AI in Agriculture: A Survey of Deep Learning Techniques for Crops, Fisheries and Livestock ⭐code [2025-07-31]

- Palmprint recognition

- Neural Radiance Fields for the Real World: A Survey [2025-01-23]

- Text-driven Motion Generation: Overview, Challenges and Directions [2025-05-15]
- Motion Generation: A Survey of Generative Approaches and Benchmarks [2025-07-09]
- Human Motion Video Generation: A Survey ⭐code [2025-09-05]

- Semantic Mapping in Indoor Embodied AI – A Comprehensive Survey and Future Directions [2025-01-13]
- OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation ⭐code [2025-05-08]
- Embodied AI with Foundation Models for Mobile Service Robots: A Systematic Review [2025-05-28]
- Multimodal Data Storage and Retrieval for Embodied AI: A Survey [2025-08-20]
- Large VLM-based Vision-Language-Action Models for Robotic Manipulation: A Survey ⭐code [2025-09-04]
- The Safety Challenge of World Models for Embodied AI Agents: A Review [2025-10-08]
- Vision-Language-Action Models for Robotics: A Review Towards Real-World Applications ⭐code [2025-10-09]
- A Comprehensive Survey on World Models for Embodied AI ⭐code [2025-10-21]
- Place recognition
- Navigation

- Anomaly Detection for Industrial Applications, Its Challenges, Solutions, and Future Directions: A Review [2025-01-22]
- A Survey on Industrial Anomalies Synthesis ⭐code [2025-02-25]
- A Survey on Foundation-Model-Based Industrial Defect Detection [2025-02-27]
- A Comprehensive Survey for Real-World Industrial Defect Detection: Challenges, Approaches, and Prospects [2025-07-21]
- Anomaly detection

- A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems [2025-02-11]
- Survey of Video Diffusion Models: Foundations, Implementations, and Applications ⭐code [2025-04-23]
- Video analytics
- Video understanding
  - VideoLLM Benchmarks and Evaluation: A Survey [2025-05-08]
  - Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision ⭐code [2025-06-09]
  - Looking Beyond the Obvious: A Survey on Abstract Concept Recognition for Video Understanding [2025-08-29]
  - Video Understanding by Design: How Datasets Shape Architectures and Insights [2025-09-12]
- Video surveillance
- Video frame interpolation
- Video anomaly detection
- Long-video narrative generation

- Action Valuation in Sports: A Survey [2025-04-09]
- Action Spotting and Precise Event Detection in Sports: Datasets, Methods, and Challenges [2025-05-08]
- 3D Skeleton-Based Action Recognition: A Review [2025-06-03]

- Recent Deep Learning in Crowd Behaviour Analysis: A Brief Review [2025-05-27]
- Causality and "In-the-Wild" Video-Based Person Re-ID: A Survey [2025-05-28]
- Domain Generalization for Person Re-identification: A Survey Towards Domain-Agnostic Person Matching ⭐code [2025-06-17]
- A review of Recent Techniques for Person Re-Identification [2025-09-30]
- Behavior detection

- A Survey of World Models for Autonomous Driving [2025-01-22]
- The Role of World Models in Shaping Autonomous Driving: A Comprehensive Survey ⭐code [2025-02-18]
- 4D mmWave Radar in Adverse Environments for Autonomous Driving: A Survey [2025-04-01]
- Systematic Literature Review on Vehicular Collaborative Perception -- A Computer Vision Perspective [2025-04-08]
- Adversarial Examples in Environment Perception for Automated Driving (Review) [2025-04-14]
- Collaborative Perception Datasets for Autonomous Driving: A Review ⭐code [2025-04-18]
- Multimodal Large Language Models for Enhanced Traffic Safety: A Comprehensive Review and Future Trends [2025-04-24]
- Wireless Communication as an Information Sensor for Multi-agent Cooperative Perception: A Survey [2025-05-05]
- Generative AI for Autonomous Driving: A Review [2025-05-23]
- A Survey on Vision-Language-Action Models for Autonomous Driving ⭐code [2025-07-01]
- Towards Autonomous Riding: A Review of Perception, Planning, and Control in Intelligent Two-Wheelers [2025-07-17]
- A Survey on Deep Multi-Task Learning in Connected Autonomous Vehicles [2025-08-05]
- Progressive Bird's Eye View Perception for Safety-Critical Autonomous Driving: A Comprehensive Survey [2025-08-12]
- Hyperspectral Sensors and Autonomous Driving: Technologies, Limitations, and Opportunities [2025-08-28]
- To New Beginnings: A Survey of Unified Perception in Autonomous Vehicle Software [2025-08-29]
- Foundation Models for Autonomous Driving Perception: A Survey Through Core Capabilities [2025-09-11]
- Maps for Autonomous Driving: Full-process Survey and Frontiers [2025-09-17]
- From Static to Dynamic: a Survey of Topology-Aware Perception in Autonomous Driving [2025-09-30]
- Lane detection
- Distracted driving detection
- Traffic accident prediction
- Panoramic vision

- A Systematic Review of Machine Learning Methods for Multimodal EEG Data in Clinical Application [2025-01-16]

- Zero-shot
  - Compositional Zero-Shot Learning: A Survey ⭐code [2025-10-14]
- Domain generalization
- Non-Transferable Learning (anti-transfer learning)

- Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook ⭐code [2025-03-25]

- Large Vision-Language Model Alignment and Misalignment: A Survey Through the Lens of Explainability [2025-01-03]
- Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey ⭐code [2025-01-07]
- Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches [2025-01-07]
- Visual Large Language Models for Generalized and Specialized Applications ⭐code [2025-01-07]
- When Data Manipulation Meets Attack Goals: An In-depth Survey of Attacks for VLMs ⭐code [2025-02-11]
- Survey on Vision-Language-Action Models [2025-02-12]
- Vision-Language Models for Edge Networks: A Comprehensive Survey [2025-02-13]
- Harnessing Vision Models for Time Series Analysis: A Survey [2025-02-14]
- A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations ⭐code [2025-02-24]
- Multi-Modal Foundation Models for Computational Pathology: A Survey [2025-03-13]
- Small Vision-Language Models: A Survey on Compact Architectures and Techniques [2025-03-17]
- A Survey on Efficient Vision-Language Models ⭐code [2025-04-15]
- Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models ⭐code [2025-05-09]
- Generalizing Vision-Language Models to Novel Domains: A Comprehensive Survey [2025-06-24]
- Continual Learning for VLMs: A Survey and Taxonomy Beyond Forgetting ⭐code [2025-08-07]
- Review of Hallucination Understanding in Large Language and Vision Models [2025-10-02]
- Vision Language Models: A Survey of 26K Papers [2025-10-13]
- Towards General Urban Monitoring with Vision-Language Models: A Review, Evaluation, and a Research Agenda [2025-10-15]
- Survey of Multimodal Geospatial Foundation Models: Techniques, Applications, and Challenges [2025-10-28]
- Cross-view Localization and Synthesis -- Datasets, Challenges and Opportunities ⭐code [2025-10-28]
- LLM
  - Leveraging Large Language Models For Scalable Vector Graphics Processing: A Review [2025-03-10]
  - A Review on Large Language Models for Visual Analytics [2025-03-20]
  - Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions [2025-03-24]
  - How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM [2025-04-09]
  - PEFT A2Z: Parameter-Efficient Fine-Tuning Survey for Large Language and Vision Models ⭐code [2025-04-22]
  - A Survey on (M)LLM-Based GUI Agents [2025-04-22]
  - Towards Transparent AI: A Survey on Explainable Large Language Models [2025-06-30]
  - Speed Always Wins: A Survey on Efficient Architectures for Large Language Models ⭐code [2025-08-14]
- MLLM
  - Multimodal Large Language Models for Text-rich Image Understanding: A Comprehensive Review [2025-02-25]
  - Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey ⭐code [2025-03-18]
  - Aligning Multimodal LLM with Human Preference: A Survey ⭐code [2025-03-19]
  - Survey of Adversarial Robustness in Multimodal Large Language Models [2025-03-19]
  - A Survey of Multimodal Hallucination Evaluation and Detection [2025-07-28]
  - When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios ⭐code [2025-07-29]
  - OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use [2025-08-07]
  - A Survey on Agentic Multimodal Large Language Models ⭐code [2025-10-14]
  - Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks ⭐code [2025-10-30]
- Foundation models
  - Vision Generalist Model: A Survey [2025-06-12]
- Visual grounding
- Multimodal reasoning
- VLA
  - A Survey on Efficient Vision-Language-Action Models ⭐code [2025-10-30]

- Generative AI for Cel-Animation: A Survey ⭐code [2025-01-14]
- Generative Physical AI in Vision: A Survey ⭐code [2025-01-22]
- Survey on AI-Generated Media Detection: From Non-MLLM to MLLM [2025-02-11]
- A Survey on Text-Driven 360-Degree Panorama Generation ⭐code [2025-02-21]
- Methods and Trends in Detecting Generated Images: A Comprehensive Review [2025-02-24]
- Simulating the Real World: A Unified Survey of Multimodal Generative Models [2025-03-07]
- Generative AI for Film Creation: A Survey of Recent Advances [2025-04-14]
- Erasing Concepts, Steering Generations: A Comprehensive Survey of Concept Suppression [2025-05-27]
- A Survey of Automatic Evaluation Methods on Text, Visual and Speech Generations [2025-06-13]
- GAN
  - Image Inversion: A Survey from GANs to Diffusion and Beyond ⭐code [2025-02-18]
  - Generative Adversarial Networks with Limited Data: A Survey and Benchmarking [2025-04-09]
  - A Review on Domain Adaption and Generative Adversarial Networks(GANs) [2025-10-15]
  - Generative AI in Depth: A Survey of Recent Advances, Model Variants, and Real-World Applications [2025-10-28]
- Image generation
  - Preference Alignment on Diffusion Model: A Comprehensive Survey for Image Generation and Editing [2025-02-13]
  - Personalized Image Generation with Deep Generative Models: A Decade Survey ⭐code [2025-02-19]
  - SoK: Can Synthetic Images Replace Real Data? A Survey of Utility and Privacy of Synthetic Image Generation [2025-06-25]
- AIGC
  - Grounding Creativity in Physics: A Brief Survey of Physical Priors in AIGC [2025-02-12]
  - Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions ⭐code [2025-04-29]
  - Secure and Robust Watermarking for AI-generated Images: A Comprehensive Survey [2025-10-06]
- Image-to-image translation
- Text-image
  - A Comprehensive Survey on Concept Erasure in Text-to-Image Diffusion Models [2025-02-24]
  - A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images [2025-03-03]
  - A Systematic Review of Open Datasets Used in Text-to-Image (T2I) Gen AI Model Safety [2025-03-04]
  - A Survey on Self-supervised Contrastive Learning for Multimodal Text-Image Analysis [2025-03-17]
  - A Comprehensive Survey on Visual Concept Mining in Text-to-image Diffusion Models [2025-03-19]
  - Text to Image Generation and Editing: A Survey [2025-05-06]
- Video generation
  - A Survey: Spatiotemporal Consistency in Video Generation [2025-02-26]
  - Exploring the Evolution of Physics Cognition in Video Generation: A Survey ⭐code [2025-03-28]
  - A Survey of Interactive Generative Video [2025-05-01]
  - Controllable Video Generation: A Survey ⭐code [2025-07-24]
  - Bridging Text and Video Generation: A Survey [2025-10-07]
- 4D generation
  - Advances in 4D Generation: A Survey ⭐code [2025-03-19]
- 3D generation
- Vision-to-music generation
  - Vision-to-Music Generation: A Survey ⭐code [2025-03-28]
- Scene generation
  - 3D Scene Generation: A Survey ⭐code [2025-05-09]
- Transfer

- A Survey on Dynamic Neural Networks: from Computer Vision to Multi-modal Sensor Fusion [2025-01-14]
- Vision Transformers on the Edge: A Comprehensive Survey of Model Compression and Acceleration Strategies [2025-03-06]
- Image Recognition with Online Lightweight Vision Transformer: A Survey ⭐code [2025-05-07]
- Token Compression Meets Compact Vision Transformers: A Survey and Comparative Evaluation for Edge AI [2025-07-15]
- Quantization
  - Zero-shot Quantization: A Comprehensive Survey [2025-05-15]
- KD
  - A Comprehensive Survey on Knowledge Distillation ⭐code [2025-03-18]

- Visual question answering: from early developments to recent advances -- a survey [2025-01-08]
- The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering [2025-01-14]
- A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task [2025-04-25]
- Multimedia-Aware Question Answering: A Review of Retrieval and Cross-Modal Reasoning Architectures [2025-10-24]

- In the Picture: Medical Imaging Datasets, Artifacts, and their Living Review [2025-01-22]
- Foundation Models in Computational Pathology: A Review of Challenges, Opportunities, and Impact [2025-02-13]
- A Survey of LLM-based Agents in Medicine: How far are we from Baymax? [2025-02-18]
- Denoising, segmentation and volumetric rendering of optical coherence tomography angiography (OCTA) image using deep learning techniques: a review [2025-02-24]
- The Impact of Artificial Intelligence on Emergency Medicine: A Review of Recent Advances [2025-03-20]
- Comprehensive Review of Reinforcement Learning for Medical Ultrasound Imaging [2025-03-24]
- Deep Learning Approaches for Medical Imaging Under Varying Degrees of Label Availability: A Comprehensive Survey [2025-04-17]
- A Comprehensive Review on RNA Subcellular Localization Prediction [2025-04-25]
- A Methodological and Structural Review of Parkinsons Disease Detection Across Diverse Data Modalities [2025-05-02]
- From Pixels to Polygons: A Survey of Deep Learning Approaches for Medical Image-to-Mesh Reconstruction [2025-05-07]
- Physical foundations for trustworthy medical imaging: a review for artificial intelligence researchers [2025-05-07]
- The Eye as a Window to Systemic Health: A Survey of Retinal Imaging from Classical Techniques to Oculomics [2025-05-08]
- The Application of Deep Learning for Lymph Node Segmentation: A Systematic Review [2025-05-12]
- Computationally Efficient Diffusion Models in Medical Imaging: A Comprehensive Review [2025-05-14]
- Generative Models in Computational Pathology: A Comprehensive Survey on Methods, Applications, and Challenges [2025-05-19]
- A Narrative Review on Large AI Models in Lung Cancer Screening, Diagnosis, and Treatment Planning [2025-06-10]
- Foundation Models in Medical Imaging -- A Review and Outlook [2025-06-12]
- Brain Imaging Foundation Models, Are We There Yet? A Systematic Review of Foundation Models for Brain Imaging and Biomedical Research [2025-06-17]
- Taming Vision-Language Models for Medical Image Analysis: A Comprehensive Review [2025-06-24]
- Systematic Review of Pituitary Gland and Pituitary Adenoma Automatic Segmentation Techniques in Magnetic Resonance Imaging [2025-06-25]
- Handcrafted vs. Deep Radiomics vs. Fusion vs. Deep Learning: A Comprehensive Review of Machine Learning -Based Cancer Outcome Prediction in PET and SPECT Imaging [2025-07-23]
- Harmonization in Magnetic Resonance Imaging: A Survey of Acquisition, Image-level, and Feature-level Methods [2025-07-24]
- Review of Deep Learning Applications to Structural Proteomics Enabled by Cryogenic Electron Microscopy and Tomography [2025-07-29]
- Medical Reasoning in the Era of LLMs: A Systematic Review of Enhancement Techniques and Applications [2025-08-04]
- A Survey of Multimodal Ophthalmic Diagnostics: From Task-Specific Approaches to Foundational Models [2025-08-07]
- A Survey of Medical Point Cloud Shape Learning: Registration, Reconstruction and Variation [2025-08-06]
- Federated Learning for Large Models in Medical Imaging: A Comprehensive Review [2025-08-29]
- Deep Learning in Dental Image Analysis: A Systematic Review of Datasets, Methodologies, and Emerging Challenges [2025-10-24]
- Medical image segmentation
  - A Comprehensive Review of U-Net and Its Variants: Advances and Applications in Medical Image Segmentation [2025-02-12]
  - Recent Advances in Medical Imaging Segmentation: A Survey ⭐code [2025-05-15]
  - Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches [2025-06-13]
  - Is the medical image segmentation problem solved? A survey of current developments and future directions [2025-08-29]
  - Advances in Medical Image Segmentation: A Comprehensive Survey with a Focus on Lumbar Spine Applications [2025-10-07]
- Medical image fusion
- Medical image classification
- Medical image analysis
- Medical image enhancement
- Surgical scene understanding
- Surgical video segmentation
- Image registration
- MRI reconstruction
  - A Survey of fMRI to Image Reconstruction [2025-02-25]
  - A Comprehensive Survey on Magnetic Resonance Image Reconstruction [2025-03-11]
  - A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli ⭐code [2025-03-21]
  - Systematic Review and Meta-analysis of AI-driven MRI Motion Artifact Detection and Correction [2025-09-06]
  - Evaluation of Machine Learning Reconstruction Techniques for Accelerated Brain MRI Scans [2025-09-10]
  - From 2D to 3D, Deep Learning-based Shape Reconstruction in Magnetic Resonance Imaging: A Review [2025-10-03]
  - A Structured Review and Quantitative Profiling of Public Brain MRI Datasets for Foundation Model Development [2025-10-24]
- VQA
- CT
- Report generation
- Anomaly detection

- Handwritten Text Recognition: A Survey [2025-02-13]
- Visual Text Processing: A Comprehensive Review and Unified Evaluation ⭐code [2025-05-01]
- A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions [2025-06-06]
- Advancing Offline Handwritten Text Recognition: A Systematic Review of Data Augmentation and Generation Techniques [2025-07-10]
- Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis [2025-07-17]
- Automatic Intermodal Loading Unit Identification using Computer Vision: A Scoping Review [2025-09-23]
- Ancient script image recognition
- Document understanding
- Chinese font generation

- Advancing Earth Observation: A Survey on AI-Powered Image Processing in Satellites [2025-01-22]
- Plantation Monitoring Using Drone Images: A Dataset and Performance Review [2025-02-13]
- A Survey on Remote Sensing Foundation Models: From Vision to Multimodality [2025-03-31]
- A Decade of Deep Learning for Remote Sensing Spatiotemporal Fusion: Advances, Challenges, and Opportunities ⭐code [2025-04-02]
- MIMRS: A Survey on Masked Image Modeling in Remote Sensing [2025-04-07]
- A comprehensive review of remote sensing in wetland classification and mapping [2025-04-16]
- Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook ⭐code [2025-05-02]
- Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives [2025-05-21]
- A Compendium of Autonomous Navigation using Object Detection and Tracking in Unmanned Aerial Vehicles [2025-06-09]
- Advancements in Weed Mapping: A Systematic Review [2025-07-03]
- From Physics to Foundation Models: A Review of AI-Driven Quantitative Remote Sensing Inversion [2025-07-15]
- Hyper-spectral Unmixing algorithms for remote compositional surface mapping: a review of the state of the art [2025-07-22]
- Sun sensor calibration algorithms: A systematic mapping and survey [2025-07-30]
- A Comprehensive Review of Agricultural Parcel and Boundary Delineation from Remote Sensing Images: Recent Progress and Future Perspectives [2025-08-21]
- Deep Learning Based Domain Adaptation Methods in Remote Sensing: A Comprehensive Survey [2025-10-20]
- Dimensionality Reduction for Remote Sensing Data Analysis: A Systematic Review of Methods and Applications [2025-10-23]
- Object detection
- Anti-UAV
- Change detection
- Ship classification
  - A Survey on SAR ship classification using Deep Learning [2025-03-18]
- Fire and smoke
  - Fire and Smoke Datasets in 20 Years: An In-depth Review [2025-03-20]
- Wildlife monitoring
- Remote sensing image segmentation
- Remote sensing super-resolution

- Deep Learning-Based Multi-Object Tracking: A Comprehensive Survey from Foundations to State-of-the-Art [2025-06-17]
- A Deep Dive into Generic Object Tracking: A Survey [2025-08-01]
- Omni Survey for Multimodality Analysis in Visual Object Tracking ⭐code [2025-08-19]

- Context in object detection: a systematic literature review [2025-04-01]
- Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation ⭐code [2025-04-15]
- Architectural Insights into Knowledge Distillation for Object Detection: A Comprehensive Review [2025-08-06]
- Object Detection with Multimodal Large Vision-Language Models: An In-depth Review [2025-08-28]
- Explaining What Machines See: XAI Strategies in Deep Object Detection Models [2025-09-03]
- Line detection
- Small object detection
- 3D object detection
- Underwater monitoring
  - AI-Driven Marine Robotics: Emerging Trends in Underwater Perception and Ecosystem Monitoring [2025-09-03]
  - A Structured Review of Underwater Object Detection Challenges and Solutions: From Traditional to Large Vision Language Models [2025-09-11]
  - Sonar Image Datasets: A Comprehensive Survey of Resources, Challenges, and Applications [2025-10-07]
- YOLO
  - YOLOv8 to YOLO11: A Comprehensive Architecture In-depth Comparative Review [2025-01-24]
  - A Review of YOLOv12: Attention-Based Enhancements vs. Previous Versions [2025-04-17]
  - A Decade of You Only Look Once (YOLO) for Object Detection [2025-04-29]
  - YOLOv1 to YOLOv11: A Comprehensive Survey of Real-Time Object Detection Innovations and Challenges [2025-08-05]
  - Ultralytics YOLO Evolution: An Overview of YOLO26, YOLO11, YOLOv8 and YOLOv5 Object Detectors for Computer Vision and Pattern Recognition [2025-10-14]

- 3D Human Interaction Generation: A Survey [2025-03-18]
- A Survey on Human Interaction Motion Generation ⭐code [2025-03-18]

- Trajectory Prediction Meets Large Language Models: A Survey ⭐code [2025-06-05]
- Recent Advances in Multi-Agent Human Trajectory Prediction: A Comprehensive Review [2025-06-19]

- Survey on Hand Gesture Recognition from Visual Input [2025-01-22]
- Emotion Recognition from Skeleton Data: A Comprehensive Survey [2025-07-25]
- Gesture recognition
- 3D HPE

- Implicit Guidance and Explicit Representation of Semantic Information in Points Cloud: A Survey [2025-01-13]
- Point Cloud Based Scene Segmentation: A Survey [2025-03-18]
- Point Cloud Compression and Objective Quality Assessment: A Survey [2025-07-01]
- Deep Learning For Point Cloud Denoising: A Survey [2025-08-19]
- Deep learning for 3D point cloud processing - from approaches, tasks to its implications on urban and environmental applications [2025-09-17]

- Deep Learning Reforms Image Matching: A Survey and Outlook [2025-06-06]
- R3eVision: A Survey on Robust Rendering, Restoration, and Enhancement for 3D Low-Level Vision ⭐code [2025-06-23]
- From 2D to 3D Cognition: A Brief Survey of General World Models [2025-06-26]
- Out-of-distribution detection in 3D applications: a review [2025-07-02]
- 3D reconstruction
  - Cutting-edge 3D reconstruction solutions for underwater coral reef images: A review and comparison [2025-02-28]
  - Learning-based 3D Reconstruction in Autonomous Driving: A Comprehensive Survey [2025-03-20]
  - A Survey on Event-driven 3D Reconstruction: Development under Different Categories [2025-03-26]
  - Explicit and Implicit Representations in AI-based 3D Reconstruction for Radiology: A systematic literature review ⭐code [2025-04-16]
  - A Survey on 3D Reconstruction Techniques in Plant Phenotyping: From Classical Methods to Neural Radiance Fields (NeRF), 3D Gaussian Splatting (3DGS), and Beyond [2025-05-05]
  - A Survey of 3D Reconstruction with Event Cameras: From Event-based Geometry to Neural 3D Rendering [2025-05-14]
  - Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT [2025-07-14]
  - Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey ⭐code [2025-07-22]
  - Event Camera Guided Visual Media Restoration & 3D Reconstruction: A Survey [2025-09-15]
- Depth estimation
- 3D shape generation
  - 3D Shape Generation: A Survey [2025-07-01]
- 3DGS
  - A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation ⭐code [2025-08-14]
  - From Fields to Splats: A Cross-Domain Survey of Real-Time Neural Scene Representations [2025-09-30]
  - From Volume Rendering to 3D Gaussian Splatting: Theory and Applications [2025-10-22]
  - The Impact and Outlook of 3D Gaussian Splatting [2025-10-31]

- A Survey on Facial Image Privacy Preservation in Cloud-Based Services [2025-01-16]
- Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities [2025-02-12]
- Face Deepfakes - A Comprehensive Review [2025-02-17]
- Generative Models at the Frontier of Compression: A Survey on Generative Face Video Coding [2025-06-10]
- Inclusive Review on Advances in Masked Human Face Recognition Technologies [2025-08-05]
- Deep Data Hiding for ICAO-Compliant Face Images: A Survey [2025-08-28]
- Sentiment analysis
- Emotion recognition
- Talking head

- [Compressed Video Quality Enhancement: Classifying and Benchmarking over Standards](https://arxiv.org/abs/2509.10407) [2025-09-15]

- Attention-based transformer models for image captioning across languages: An in-depth survey and evaluation [2025-06-09]

- A Comparative Review of the Histogram-based Image Segmentation Methods [2025-02-27]
- SAM2 for Image and Video Segmentation: A Comprehensive Survey [2025-03-18]
- Self-Supervised Learning for Image Segmentation: A Comprehensive Survey [2025-05-21]
- Reasoning Segmentation for Images and Videos: A Survey [2025-05-27]
- Image Segmentation with Large Language Models: A Survey with Perspectives for Intelligent Transportation Systems [2025-06-18]
- Multimodal Referring Segmentation: A Survey ⭐code [2025-08-04]
- Semantic segmentation
- Scene parsing
- Scene understanding
- VOS

- Learning-Based Hashing for ANN Search: Foundations and Early Advances [2025-10-07]
- A Comprehensive Survey on Composed Image Retrieval [2025-02-27]
- Composed Multi-modal Retrieval: A Survey of Approaches and Applications [2025-03-04]

- Plant Leaf Disease Detection and Classification Using Deep Learning: A Review and A Proposed System on Bangladesh's Perspective [2025-01-08] (deep-learning-based plant leaf disease detection and classification)
- Crop pest classification
- CD
  - Category Discovery: An Open-World Perspective ⭐code [2025-09-29]

- State-of-the-Art Transformer Models for Image Super-Resolution: Techniques, Challenges, and Applications [2025-01-15]
- Deep Learning Empowered Super-Resolution: A Comprehensive Survey and Future Prospects ⭐code [2025-09-30]
- VSR
  - A Survey of Deep Learning Video Super-Resolution [2025-06-05]

- Fuzzy Theory in Computer Vision: A Review [2025-07-28]
- Image restoration
- Underwater image enhancement
- Image quality assessment/enhancement
  - Fundus Image Quality Assessment and Enhancement: a Systematic Review [2025-01-22]
  - A Comprehensive Survey on Image Signal Processing Approaches for Low-Illumination Image Enhancement [2025-02-11]
  - A Survey on Image Quality Assessment: Insights, Analysis, and Future Outlook [2025-02-13]
  - A review of advancements in low-light image enhancement using deep learning [2025-05-12]
  - Recent Advancements in Microscopy Image Enhancement using Deep Learning: A Survey [2025-09-22]
  - Diffusion Models for Low-Light Image Enhancement: A Multi-Perspective Taxonomy and Performance Analysis ⭐code [2025-10-08]
- Reflection removal
- Video restoration

- Visualizing Uncertainty in Image Guided Surgery a Review [2025-01-14]
- A Preliminary Survey of Semantic Descriptive Model for Images [2025-01-16]
- New Fashion Products Performance Forecasting: A Survey on Evolutions, Models and Emerging Trends [2025-01-20]
- Explainable artificial intelligence (XAI): from inherent explainability to large language models [2025-01-20]
- Explainability for Vision Foundation Models: A Survey [2025-01-22]
- Advanced technology in railway track monitoring using the GPR Technique: A Review [2025-01-22]
- Reproducibility review of "Why Not Other Classes": Towards Class-Contrastive Back-Propagation Explanations [2025-01-22]
- Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation [2025-02-10]
- Diffusion Models for Computational Neuroimaging: A Survey ⭐code [2025-02-11]
- Safety at Scale: A Comprehensive Survey of Large Model Safety [2025-02-11]
- Event Vision Sensor: A Review [2025-02-11]
- A Survey on Mamba Architecture for Vision Applications [2025-02-12]
- A Survey of Representation Learning, Optimization Strategies, and Applications for Omnidirectional Vision ⭐code [2025-02-18]
- Event-based Solutions for Human-centered Applications: A Comprehensive Review ⭐code [2025-02-27]
- A Survey on Ordinal Regression: Applications, Advances and Prospects [2025-03-04]
- Lossy Neural Compression for Geospatial Analytics: A Review [2025-03-04]
- A Review on Geometry and Surface Inspection in 3D Concrete Printing [2025-03-11]
- A Systematic Review of ECG Arrhythmia Classification: Adherence to Standards, Fair Evaluation, and Embedded Feasibility [2025-03-11]
- A Survey on Wi-Fi Sensing Generalizability: Taxonomy, Techniques, Datasets, and Future Research Prospects [2025-03-12]
- Challenges and Trends in Egocentric Vision: A Survey [2025-03-20]
- A Comprehensive Survey on Architectural Advances in Deep CNNs: Challenges, Applications, and Emerging Research Directions [2025-03-24]
- Hybrid Multi-Stage Learning Framework for Edge Detection: A Survey [2025-03-31]
- Towards Mobile Sensing with Event Cameras on High-mobility Resource-constrained Devices: A Survey [2025-04-01]
- Foundation Models For Seismic Data Processing: An Extensive Review [2025-04-01]
- A Survey of Pathology Foundation Model: Progress and Future Directions ⭐code [2025-04-08]
- Attention in Diffusion Model: A Survey [2025-04-08]
- Loss Functions in Deep Learning: A Comprehensive Review [2025-04-08]
- Hardware, Algorithms, and Applications of the Neuromorphic Vision Sensor: a Review [2025-04-14]
- Computer-Aided Layout Generation for Building Design: A Review ⭐code [2025-04-15]
- Digital Twin Generation from Visual Data: A Survey ⭐code [2025-04-18]
- A Survey on Small Sample Imbalance Problem: Metrics, Feature Analysis, and Solutions [2025-04-22]
- Unsupervised Time-Series Signal Analysis with Autoencoders and Vision Transformers: A Review of Architectures and Applications [2025-04-25]
- A Survey on Event-based Optical Marker Systems [2025-04-30]
- Diffusion Model Quantization: A Review ⭐code [2025-05-09]
- From Events to Enhancement: A Survey on Event-Based Imaging Technologies ⭐code [2025-05-12]
- Towards Artificial General or Personalized Intelligence? A Survey on Foundation Models for Personalized Federated Intelligence [2025-05-13]
- A Survey on the Safety and Security Threats of Computer-Using Agents: JARVIS or Ultron? [2025-05-19]
- Diffusion Model in Hyperspectral Image Processing and Analysis: A Review [2025-05-19]
- Plane Geometry Problem Solving with Multi-modal Reasoning: A Survey [2025-05-21]
- Semantic Correspondence: Unified Benchmarking and a Strong Baseline ⭐code [2025-05-26]
- Camera Trajectory Generation: A Comprehensive Survey of Methods, Metrics, and Future Directions [2025-06-03]
- Towards Geometry Problem Solving in the Large Model Era: A Survey [2025-06-04]
- A Comprehensive Survey on Deep Learning Solutions for 3D Flood Mapping [2025-06-17]
- Style-based Composer Identification and Attribution of Symbolic Music Scores: a Systematic Survey [2025-06-17]
- Integrating Multi-Modal Sensors: A Review of Fusion Techniques for Intelligent Vehicles [2025-06-30]
- A Survey on Interpretability in Visual Recognition [2025-07-16]
- A Survey of Deep Learning for Geometry Problem Solving ⭐code [2025-07-17]
- Transformer-based Spatial Grounding: A Comprehensive Survey [2025-07-18]
- Agentic Design Review System [2025-08-18]
- Towards Open World Detection: A Survey [2025-08-25]
- Responsible Diffusion: A Comprehensive Survey on Safety, Ethics, and Trust in Diffusion Models [2025-09-30]
- Color Models in Image Processing: A Review and Experimental Comparison [2025-10-02]
- Unsupervised Transformer Pre-Training for Images: Self-Distillation, Mean Teachers, and Random Crops [2025-10-07]
- A Comprehensive Review on Artificial Intelligence Empowered Solutions for Enhancing Pedestrian and Cyclist Safety [2025-10-07]
- Quantum-enhanced Computer Vision: Going Beyond Classical Algorithms [2025-10-09]
- Multi Camera Connected Vision System with Multi View Analytics: A Comprehensive Survey [2025-10-14]
- Prompt-based Adaptation in Large-scale Vision Models: A Survey [2025-10-16]
- A Survey on Cache Methods in Diffusion Models: Toward Efficient Multi-Modal Generation [2025-10-23]
- Vision-Based Mistake Analysis in Procedural Activities: A Review of Advances and Challenges [2025-10-23]