These models can generate and edit videos from text prompts and images. They use advanced AI techniques like diffusion models and latent space interpolation to create high-quality, controllable video content.
Key capabilities:
For most people looking to generate custom videos from text prompts, we recommend google/veo-3
The Wan video models model by Wan-AI is an excellent open-source option, competitive with the best proprietary video models. Try adjusting the number of steps used for each frame to trade off between generation speed and detail.
Generative video is a rapidly advancing field. Check out the arena and leaderboard at Artificial Analysis to see what's popular today.
Featured models
OpenAI's Flagship video generation with synced audio
Updated 1 week, 5 days ago
158.4K runs
Alibaba Wan 2.5 text to video generation model
Updated 3 weeks, 6 days ago
28.9K runs
Alibaba Wan 2.5 Image to video generation with background audio
Updated 3 weeks, 6 days ago
142.2K runs
New and improved version of Veo 3 Fast, with higher-fidelity video, context-aware audio and last frame support
Updated 1 month ago
199.4K runs
New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame support
Updated 1 month ago
243.6K runs
Create 5s-8s videos with enhanced character movement, visual effects, and exclusive 1080p-8s support. Optimized for anime characters and complex actions
Updated 1 month, 1 week ago
734.4K runs
Wan 2.5 text-to-video, optimized for speed
Updated 1 month, 1 week ago
32.1K runs
A faster and cheaper version of Seedance 1 Pro
Updated 1 month, 3 weeks ago
390.2K runs
Kling 2.5 Turbo Pro: Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and remarkable prompt adherence.
Updated 1 month, 3 weeks ago
1.5M runs
Wan 2.5 image-to-video, optimized for speed
Updated 1 month, 3 weeks ago
40.8K runs
A high-fidelity video generation model optimized for realistic human motion, cinematic VFX, expressive characters, and strong prompt and style adherence across both text-to-video and image-to-video workflows
Updated 1 month, 3 weeks ago
31.2K runs
A lower-latency image-to-video version of Hailuo 2.3 that preserves core motion quality, visual consistency, and stylization performance while enabling faster iteration cycles.
Updated 1 month, 3 weeks ago
24.9K runs
Recommended Models
The open-source Wan suite (like wan-video/wan-2.1-t2v-480p) is among the faster text-to-video options on Replicate, especially at lower resolutions and shorter durations. Many models also have "fast" variants, like google/veo-3-fast, designed for quicker turnaround.
Note: Faster runs usually mean lower resolution or simpler motion.
PixVerse v4 offers a strong balance for many use cases. It uses a unit-based system at 0ドル.01 per unit — for example, a 5-second, 360p video costs about 0ドル.30. Hailuo 02 is another good middle-ground option, with both standard and pro modes for different quality levels. Your ideal choice depends on how much resolution and runtime you need and how much you want to spend.
For short, stylized clips (5–10 seconds at lower resolution), PixVerse v4 and Wan models are great picks. They’re fast and relatively inexpensive, making them ideal for concept work, storyboarding, or rapid iteration.
If you want high-fidelity motion, longer clips, or more realistic physics, Veo 3 or Hailuo 02 Pro are better options. Hailuo 02 supports 768p in standard mode and 1080p in Pro mode, which makes it a solid choice for more polished results.
Most text-to-video models generate short video clips (5–10 seconds) at 24 or 30 fps. Supported resolutions range from 360p to 1080p, depending on the model. Some, like Veo 3, can include audio as part of the output.
Costs vary by model and resolution:
You can push your own model by packaging it with Cog and deploying it. If you’re working with open-source video models, you can also fine-tune them and publish your version for others to use.
Yes, but always check the model’s license. Most text-to-video models on Replicate are available for commercial use, but some authors include additional restrictions.
You can use the Replicate playground or run them programmatically.
Recommended Models
OpenAI's Most advanced synced-audio video generation
Updated 1 week, 5 days ago
66.7K runs
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B image-to-video
Updated 2 weeks, 1 day ago
5.1M runs
Sound on: Google’s flagship Veo 3 text to video model, with audio
Updated 1 month ago
209.2K runs
A faster and cheaper version of Google’s Veo 3 video model, with audio
Updated 1 month ago
149K runs
State of the art video generation model. Veo 2 can faithfully follow simple and complex instructions, and convincingly simulates real-world physics as well as a wide range of visual styles.
Updated 1 month ago
104.3K runs
Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p
Updated 1 month, 1 week ago
37K runs
Quickly make 5s or 8s videos at 540p, 720p or 1080p. It has enhanced motion, prompt coherence and handles complex actions well.
Updated 1 month, 1 week ago
226.9K runs
Accelerated inference for Wan 2.1 14B text to video with high resolution, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 1 month, 1 week ago
35.1K runs
Accelerated inference for Wan 2.1 14B image to video with high resolution, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 1 month, 1 week ago
85.8K runs
Accelerated inference for Wan 2.1 14B image to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 1 month, 1 week ago
432.9K runs
A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution
Updated 1 month, 3 weeks ago
1.4M runs
A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution
Updated 1 month, 3 weeks ago
2.3M runs
Create 5s 480p videos from a text prompt
Updated 1 month, 3 weeks ago
9.7K runs
Generate 5s and 10s videos in 720p resolution
Updated 1 month, 3 weeks ago
84.5K runs
Generate 5s and 10s videos in 1080p resolution
Updated 1 month, 3 weeks ago
795.9K runs
A premium version of Kling v2.1 with superb dynamics and prompt adherence. Generate 1080p 5s and 10s videos from text or an image
Updated 1 month, 3 weeks ago
81.5K runs
Generate 5s and 10s videos in 720p resolution at 30fps
Updated 1 month, 3 weeks ago
1.5M runs
Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)
Updated 1 month, 3 weeks ago
3.1M runs
Generate 5s and 9s 540p videos
Updated 1 month, 3 weeks ago
10.6K runs
Generate 5s and 9s 720p videos
Updated 1 month, 3 weeks ago
33.3K runs
Accelerated inference for Wan 2.1 14B text to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 1 month, 3 weeks ago
182.4K runs
Fast, high quality text-to-video and image-to-video (Also known as Dream Machine)
Updated 1 month, 3 weeks ago
67.2K runs
Generate 5s and 9s 720p videos, faster and cheaper than Ray 2
Updated 1 month, 3 weeks ago
41K runs
Generate 6s videos with prompts or images. (Also known as Hailuo). Use a subject reference to make a video with a character and the S2V-01 model.
Updated 1 month, 3 weeks ago
636.2K runs
Generate videos with specific camera movements
Updated 1 month, 3 weeks ago
72.5K runs
An image-to-video (I2V) model specifically trained for Live2D and general animation use cases
Updated 1 month, 3 weeks ago
172.5K runs
Generate 5s and 9s 540p videos, faster and cheaper than Ray 2
Updated 1 month, 3 weeks ago
56.3K runs
Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 768p (standard) or 1080p (pro). It excels at real world physics.
Updated 1 month, 3 weeks ago
286.6K runs
Image-to-video at 720p and 480p with Wan 2.2 A14B
Updated 4 months, 3 weeks ago
44.3K runs
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B text-to-video
Updated 4 months, 3 weeks ago
152.9K runs
Make a very realistic looking real-world AI video
Updated 5 months, 2 weeks ago
2.3K runs
Generate 5s 480p videos. Wan is an advanced and powerful visual generation model developed by Tongyi Lab of Alibaba Group
Updated 10 months ago
45.6K runs
A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions
Updated 11 months, 1 week ago
116K runs
LTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real-time. It produces 24 FPS videos at a 768x512 resolution faster than they can be watched.
Updated 11 months, 3 weeks ago
163.9K runs
A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions
Updated 1 year ago
2.9K runs
Mochi 1 preview is an open video generation model with high-fidelity motion and strong prompt adherence in preliminary evaluation
Updated 1 year, 1 month ago
3.1K runs
Text-to-Video + Image-to-Video: Pyramid Flow Autoregressive Video Generation method based on Flow Matching
Updated 1 year, 2 months ago
9.1K runs
Generate high quality videos from a prompt
Updated 1 year, 4 months ago
2.5K runs
SAM 2: Segment Anything v2 (for videos)
Updated 1 year, 4 months ago
52K runs
Create videos from illustrated input images
Updated 1 year, 5 months ago
64.5K runs
Generate a video that morphs between subjects, with an optional style
Updated 1 year, 8 months ago
15K runs
VideoCrafter2: Text-to-Video and Image-to-Video Generation and Editing
Updated 1 year, 11 months ago
138.6K runs
RESEARCH/NON-COMMERCIAL USE ONLY: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Updated 1 year, 11 months ago
128.2K runs
Personalized Image Animator
Updated 2 years ago
103.5K runs
Monster Labs' Controlnet QR Code Monster v2 For SD-1.5 on top of AnimateDiff Prompt Travel (Motion Module SD 1.5 v2)
Updated 2 years, 2 months ago
10.5K runs
😊 Hotshot-XL is an AI text-to-GIF model trained to work alongside Stable Diffusion XL
Updated 2 years, 2 months ago
883.3K runs
🎨AnimateDiff Prompt Travel🧭 Seamlessly Navigate and Animate Between Text-to-Image Prompts for Dynamic Visual Narratives
Updated 2 years, 2 months ago
5.7K runs
🎨 AnimateDiff (w/ MotionLoRAs for Panning, Zooming, etc): Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Updated 2 years, 2 months ago
59.1K runs
Animate Your Personalized Text-to-Image Diffusion Models
Updated 2 years, 3 months ago
324.2K runs
Zeroscope V2 XL & 576w
Updated 2 years, 5 months ago
301.1K runs
Training-free Controllable Text-to-Video Generation
Updated 2 years, 7 months ago
2.4K runs
Text-to-Image Diffusion Models are Zero-Shot Video Generators
Updated 2 years, 8 months ago
42K runs
Multi-stage text-to-video generation
Updated 2 years, 9 months ago
155K runs
Create tileable animations with seamless transitions
Updated 2 years, 10 months ago
529.4K runs
Add colours to old video footage.
Updated 2 years, 11 months ago
8.8K runs
RealBasicVSR: Investigating Tradeoffs in Real-World Video Super-Resolution
Updated 2 years, 11 months ago
9.3K runs
extract foreground of a video
Updated 3 years ago
71K runs
Use Runway's Stable-diffusion inpainting model to create an infinite loop video
Updated 3 years, 1 month ago
38.5K runs
Animate Stable Diffusion by interpolating between two prompts
Updated 3 years, 1 month ago
119.6K runs
Generate videos by interpolating the latent space of Stable Diffusion
Updated 3 years, 3 months ago
58.5K runs
Animating prompts with stable diffusion
Updated 3 years, 3 months ago
266.2K runs