[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
- 
 Updated
 Aug 7, 2025 
- Python
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)
Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
reproduction of semantic segmentation using masked autoencoder (mae)
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
[ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
[CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling & Bootstrap Masked Visual Modeling via Hard Patch Mining
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations
Unofficial PyTorch implementation of Masked Autoencoders that Listen
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
Official repo for Recursion's accepted spotlight paper at NeurIPS 2023 Generative AI & Biology workshop.
[SIGIR'2023] "MAERec: Graph Masked Autoencoder for Sequential Recommendation"
Add a description, image, and links to the masked-autoencoder topic page so that developers can more easily learn about it.
To associate your repository with the masked-autoencoder topic, visit your repo's landing page and select "manage topics."