These models provide utility functions for working with media like images, audio, and video. They serve as convenient building blocks for media processing pipelines and workflows.
Some highlights:
Featured models
Automatically add captions to a video
Updated 2 years ago
72.6K runs
Add a watermark to your videos using the power of Replicate, brought to you by your friends at FullJourney.AI
Updated 2 years ago
1.3M runs
Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification
Updated 2 years, 1 month ago
70.1M runs
Recommended Models
If you need quick, low-overhead processing, such as extracting frames or audio, models like lucataco/frame-extractor and lucataco/extract-audio are among the faster options. These utilities perform simple transformations, so they typically run faster than more complex generation models.
Keep in mind that performance still depends on input file size and format.
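As a rough illustration of calling one of these lightweight extractors, here is a minimal sketch using the Replicate Python client. The input field name "video" is an assumption and is not taken from the model's schema, so check the model page before relying on it. Depending on your client version, you may also need to append a version hash to the model reference. The helper names (default_runner, extract_audio) are hypothetical.

```python
from typing import Any, Callable

# A runner takes (model_ref, input_dict) and returns the model output.
Runner = Callable[[str, dict], Any]


def default_runner(model_ref: str, model_input: dict) -> Any:
    """Run a model through the Replicate client (needs REPLICATE_API_TOKEN set)."""
    import replicate  # imported lazily so the helper can be exercised offline

    return replicate.run(model_ref, input=model_input)


def extract_audio(video_url: str, run: Runner = default_runner) -> Any:
    """Send a video URL to lucataco/extract-audio and return whatever it outputs."""
    # ASSUMPTION: the input field is named "video"; verify against the model schema.
    return run("lucataco/extract-audio", {"video": video_url})
```

Passing the runner as a parameter keeps the network call swappable, which makes it easy to try the helper with a stub before spending any API credits.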
For more advanced workflows, models like fictions-ai/autocaption, charlesmccarthy/addwatermark, and falcons-ai/nsfw_image_detection add extra functionality such as captioning, watermarking, or filtering content.
If your workflow involves bulk processing or automation, combining lightweight extractors with these more feature-rich utilities can give you a solid balance between speed and capability.
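One way to picture such a combination is the sketch below, which extracts a frame from each video and then passes it to the NSFW classifier, processing several videos concurrently. It is only an illustration under assumptions: the input field names ("video", "image") and the shape of each model's output are not taken from the model schemas, and screen_video and screen_videos are hypothetical helper names.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, Dict, Iterable, List

# A runner takes (model_ref, input_dict) and returns the model output.
Runner = Callable[[str, dict], Any]


def replicate_runner(model_ref: str, model_input: dict) -> Any:
    """Run a model through the Replicate client (needs REPLICATE_API_TOKEN set)."""
    import replicate  # imported lazily so the pipeline can be tested offline

    return replicate.run(model_ref, input=model_input)


def screen_video(video_url: str, run: Runner) -> Dict[str, Any]:
    """Extract one frame from a video, then classify that frame."""
    # ASSUMPTION: field names "video" and "image"; verify against each schema.
    frame = run("lucataco/frame-extractor", {"video": video_url})
    label = run("falcons-ai/nsfw_image_detection", {"image": frame})
    return {"video": video_url, "frame": frame, "label": label}


def screen_videos(
    video_urls: Iterable[str],
    run: Runner = replicate_runner,
    max_workers: int = 4,
) -> List[Dict[str, Any]]:
    """Screen many videos concurrently; results keep the input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(lambda url: screen_video(url, run), video_urls))
```

Threads are used here because each call mostly waits on the network. Keep max_workers modest to stay within your account's rate limits.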
For low-level media manipulation:
Utility models usually return:
You can package your own processing script or pipeline with Cog and publish it to Replicate under the Media Utilities collection.
Clearly define your input/output types (e.g., video → frames), set versioning, and configure sharing or pricing if needed.
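To make the "video → frames" example concrete, here is a minimal sketch of the kind of processing function you might package. It shells out to ffmpeg, which is assumed to be installed in the container. In a Cog project this function would typically be called from the predictor's predict method, with the video as a file input and the frames as the output; that wrapper is omitted here. The function names and the frame_%05d.png naming pattern are illustrative choices.

```python
import subprocess
from pathlib import Path
from typing import List


def build_frame_command(video: Path, out_dir: Path, fps: int = 1) -> List[str]:
    """Build the ffmpeg argument list that samples `fps` frames per second."""
    return [
        "ffmpeg",
        "-i", str(video),                  # input video file
        "-vf", f"fps={fps}",               # video filter: resample to a fixed frame rate
        str(out_dir / "frame_%05d.png"),   # numbered output images
    ]


def extract_frames(video: Path, out_dir: Path, fps: int = 1) -> List[Path]:
    """Input: a video path. Output: sorted paths of the extracted PNG frames."""
    out_dir.mkdir(parents=True, exist_ok=True)
    # check=True raises CalledProcessError if ffmpeg exits with an error.
    subprocess.run(build_frame_command(video, out_dir, fps), check=True)
    return sorted(out_dir.glob("frame_*.png"))
```

Keeping the command construction separate from the subprocess call makes the input/output contract easy to inspect and test without running ffmpeg.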
Many models in the Media Utilities collection support commercial use, but licenses vary. Check each model’s card for attribution requirements or restrictions before using them in production workflows.
Recommended Models
Updated 3 weeks, 6 days ago
8.5M runs
Color match and white balance fixes for images
Updated 4 months, 2 weeks ago
177.1K runs
Simple tool to extract audio from a video file
Updated 4 months, 3 weeks ago
3.9K runs
Simple tool to merge together separate video snippets
Updated 5 months, 3 weeks ago
15.7K runs
Extract the first or last frame from any video file as a high-quality image
Updated 9 months, 1 week ago
784K runs
Simple tool to merge a foreground and background image
Updated 11 months, 4 weeks ago
3K runs
Simple tool to split apart a video into snippets
Updated 1 year ago
154 runs
AI generated Normal maps, Displacement maps, and Roughness maps
Updated 1 year, 9 months ago
215 runs
Model for Sound demixing challenge 2023: Music Demixing Track - MDX'23
Updated 1 year, 9 months ago
22.6K runs
Depth Anything on full video files
Updated 1 year, 10 months ago
618 runs
Take an image and an audio file and create a video clip
Updated 1 year, 11 months ago
13.2K runs
Video toolkit – convert, make GIFs, extract audio
Updated 1 year, 11 months ago
16.7K runs
A pipeline for superfast video editing! Make cuts to a video by editing its transcript.
Updated 2 years ago
819 runs
Take a list of image URLs as frames and output a video
Updated 2 years, 1 month ago
1.2K runs
Canny, soft edge, depth, lineart, segmentation, pose, etc
Updated 2 years, 1 month ago
42.9K runs
Create a waveform video from audio
Updated 2 years, 6 months ago
383.9K runs
Split a video into frames
Updated 2 years, 6 months ago
25K runs
Convert a set of frames to a video
Updated 2 years, 6 months ago
1.7K runs