data-efficiency

Star

Here are 13 public repositories matching this topic...

Language: All

Filter by language

All 13 Python 12 PowerShell 1

Sort: Most stars

Sort options

Most stars Fewest stars Most forks Fewest forks Recently updated Least recently updated

CSfufu / Revisual-R1

Star 185

🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement learning—to achieve faithful, concise, and self-reflective state-of-the-art performance in visual and textual reasoning.

reinforcement-learning visual-reasoning mathematical-reasoning data-efficiency multimodal-large-language-model prioritized-advantage-distillation cold-start-initialization efficient-length-reward open-source-7b-model self-reflective-chain-of-thought

Updated Oct 13, 2025
Python

amazon-science / mix-generation

Star 126

MixGen: A New Multi-Modal Data Augmentation

data-augmentation multimodal vision-language pretraining data-efficiency

Updated Jan 9, 2023
Python

UCSC-REAL / DS2

Star 97

[ICLR 2025] Official implementation of paper "Improving Data Efficiency via Curating LLM-Driven Rating Systems"

data-curation data-efficiency large-language-models instruction-tuning

Updated Mar 24, 2025
Python

BIT-DA / EADA

Star 89

[AAAI 2022] Official Implementation of Active Learning for Domain Adaptation: An Energy-based Approach https://arxiv.org/abs/2112.01406

active-learning domain-adaptation energy-based-model data-efficiency

Updated Nov 4, 2023
Python

encounter1997 / DE-DETRs

Star 79

Official Implementation of DE-DETR and DELA-DETR in "Towards Data-Efficient Detection Transformers"

object-detection detection-transformer data-efficiency

Updated Mar 10, 2024
Python

encounter1997 / DE-CondDETR

Star 46

Official Implementation of DE-CondDETR and DELA-CondDETR in "Towards Data-Efficient Detection Transformers"

object-detection detection-transformer data-efficiency

Updated Aug 25, 2022
Python

microsoft / DELT

Star 39

DELT: Data Efficacy for Language Model Training

data-efficiency llm-training data-ordering data-efficacy data-scoring

Updated Aug 31, 2025
Python

CameliaD / File-Organizer

Sponsor

Star 9

This Git repository contains a PowerShell script with a user-friendly interface to automatically organize cluttered files into folders by year and month. Ideal for individuals who struggle with file organization, the tool frees up time and simplifies finding and accessing files.