Official AI models: Always available, stable, and predictably priced
Recommended Models
"Official" models are those maintained directly by Replicate or in close collaboration with trusted partners like openai/gpt-5, google/veo-3, bytedance/seedream-4, and black-forest-labs/flux-pro.
They’re kept warm, meaning they are:
These models represent the most production-ready versions for image, video, language, and audio generation.
Official models span across multiple domains:
In short, this collection gives you a reliable foundation for nearly every generative AI task—all powered by models that are always online.
The most widely used include:
These consistently rank among the highest-run models on Replicate and are trusted for commercial, creative, and research use.
By keeping them warm, Replicate ensures:
This is ideal for developers running production-level applications, where uptime and consistency matter as much as model quality.
Official models:
Community models, while diverse and experimental, can go offline, change parameters, or vary in speed and output quality.
For text-to-video and image-to-video:
For language understanding and complex reasoning:
These are ideal for soundtracks, content production, and creative prototyping.
Yes. All Official AI Models are available for commercial use, unless explicitly stated otherwise on their individual pages.
They are licensed, production-ready, and have predictable pricing structures designed for apps, agencies, and enterprise deployments.
Each official model includes a green "Official" tag on its page.
They are also grouped in this "Official AI Models" collection for easy discovery.
Recommended Models
Kling 2.6 Pro: Top-tier image-to-video with cinematic visuals, fluid motion, and native audio generation
Updated 21 hours ago
294 runs
Qwen Image 2512 is an improved version of Qwen Image with more realistic human generation, finer textures, and stronger text rendering
Updated 1 day, 4 hours ago
1.2K runs
A sub 1 second text-to-image model built for production use cases.
Updated 2 days, 9 hours ago
476.3K runs
Z-Image Turbo is a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.
Updated 6 days, 6 hours ago
3.8M runs
A joint audio-video model that accurately follows complex instructions.
Updated 1 week, 1 day ago
34.2K runs
An enhanced version over Qwen-Image-Edit-2509, featuring multiple improvements including notably better consistency
Updated 1 week, 2 days ago
36K runs
GPT-5 with support for structured outputs, web search and custom tools
Updated 1 week, 2 days ago
342.7K runs
OpenAI's new model excelling at coding, writing, and reasoning.
Updated 1 week, 2 days ago
849.4K runs
Fastest, most cost-effective GPT-5 model from OpenAI
Updated 1 week, 2 days ago
3.2M runs
Faster version of OpenAI's flagship GPT-5 model
Updated 1 week, 2 days ago
579K runs
Enables precise control of character actions and expressions from a reference image.
Updated 1 week, 3 days ago
2.4K runs
An AI system that can create realistic images and art from a description in natural language.
Updated 1 week, 5 days ago
90.9K runs
The original classic DALLᐧE 2
Updated 1 week, 5 days ago
829 runs
OpenAI's latest image generation model with better instruction following and adherence to prompts
Updated 1 week, 5 days ago
347.1K runs
A cost-efficient version of GPT Image 1
Updated 1 week, 5 days ago
319.2K runs
OpenAI's Most advanced synced-audio video generation
Updated 1 week, 5 days ago
66.8K runs
OpenAI's Flagship video generation with synced audio
Updated 1 week, 5 days ago
158.6K runs
Google's state of the art image generation and editing model 🍌🍌
Updated 1 week, 6 days ago
6.5M runs
Google's Imagen 4 flagship model
Updated 2 weeks, 2 days ago
6.9M runs
Upscale images 2x or 4x times
Updated 2 weeks, 2 days ago
81.5K runs
Google's latest image generation model in Gemini 2.5
Updated 2 weeks, 2 days ago
499.8K runs
Use this fast version of Imagen 4 when speed and cost are more important than quality
Updated 2 weeks, 2 days ago
3.2M runs
Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty
Updated 2 weeks, 2 days ago
1.9M runs
Google’s hybrid "thinking" AI model optimized for speed and cost-efficiency
Updated 2 weeks, 2 days ago
491.6K runs
A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality
Updated 2 weeks, 2 days ago
530.8K runs
Use this ultra version of Imagen 4 when quality matters more than speed and cost
Updated 2 weeks, 2 days ago
1.3M runs
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B image-to-video
Updated 2 weeks, 2 days ago
5.1M runs
The highest fidelity image model from Black Forest Labs
Updated 2 weeks, 2 days ago
93.4K runs
Alibaba Wan 2.6 text to video generation model
Updated 2 weeks, 2 days ago
1.9K runs
Alibaba Wan 2.6 image to video generation model
Updated 2 weeks, 2 days ago
6.5K runs
The fastest open source TTS model without sacrificing quality.
Updated 2 weeks, 3 days ago
15.6K runs
The best model for coding and agentic tasks across industries
Updated 3 weeks ago
144.3K runs
VEED Fabric 1.0 is an image-to-video API that turns any image into a talking video
Updated 3 weeks ago
930 runs
Realistic lipsync with refined human emotion capabilities
Updated 3 weeks ago
270 runs
This is an optimised version of the hidream-l1 model using the pruna ai optimisation toolkit!
Updated 3 weeks, 2 days ago
7M runs
Alibaba Wan 2.5 text to video generation model
Updated 4 weeks ago
28.9K runs
Alibaba Wan 2.5 Image to video generation with background audio
Updated 4 weeks ago
142.5K runs
Seedream 4.5: Upgraded Bytedance image model with stronger spatial understanding and world knowledge
Updated 4 weeks ago
918.3K runs
Bria Increase resolution upscales the resolution of any image. It increases resolution using a dedicated upscaling method that preserves the original image content without regeneration.
Updated 4 weeks ago
100.2K runs
Commercial-ready, trained entirely on licensed data, text-to-image model. With only 4B parameters provides exceptional aesthetics and text rendering. Evaluated to be on par to other leading models in the market
Updated 4 weeks ago
125.3K runs
SOTA Open source model trained on licensed data, transforming intent into structured control for precise, high-quality AI image generation in enterprise and agentic workflows.
Updated 4 weeks ago
9.3K runs
Bria Background Generation allows for efficient swapping of backgrounds in images via text prompts or reference image, delivering realistic and polished results. Trained exclusively on licensed data for safe and risk-free commercial use
Updated 4 weeks ago
40.9K runs
SOTA Object removal, enables precise removal of unwanted objects from images while maintaining high-quality outputs. Trained exclusively on licensed data for safe and risk-free commercial use
Updated 4 weeks ago
209.7K runs
Bria AI's remove background model
Updated 4 weeks ago
296.6K runs
Bria GenFill enables high-quality object addition or visual transformation. Trained exclusively on licensed data for safe and risk-free commercial use.
Updated 4 weeks ago
10.4K runs
A sub 1 second 0.01$ multi-image editing model built for production use cases. For image generation, check out p-image here: https://replicate.com/prunaai/p-image
Updated 4 weeks, 2 days ago
4M runs
A 17 billion parameter model with 16 experts
Updated 1 month ago
3.4M runs
Max-quality image generation and editing with support for ten reference images
Updated 1 month ago
62.1K runs
High-quality image generation and editing with support for eight reference images
Updated 1 month ago
882.7K runs
Google's latest image editing model in Gemini 2.5
Updated 1 month, 1 week ago
65.1M runs
New and improved version of Veo 3 Fast, with higher-fidelity video, context-aware audio and last frame support
Updated 1 month, 1 week ago
200.2K runs
New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame support
Updated 1 month, 1 week ago
244.1K runs
Sound on: Google’s flagship Veo 3 text to video model, with audio
Updated 1 month, 1 week ago
209.2K runs
A faster and cheaper version of Google’s Veo 3 video model, with audio
Updated 1 month, 1 week ago
149.1K runs
State of the art video generation model. Veo 2 can faithfully follow simple and complex instructions, and convincingly simulates real-world physics as well as a wide range of visual styles.
Updated 1 month, 1 week ago
104.3K runs
Lyria 2 is a music generation model that produces 48kHz stereo audio through text-based prompts
Updated 1 month, 1 week ago
51.7K runs
Google's most advanced reasoning Gemini model
Updated 1 month, 1 week ago
102.5K runs
Delivers high visual fidelity with fast turnaround. Great for daily content creation, marketing teams, and iterative creative workflows.
Updated 1 month, 1 week ago
12.1K runs
Ideal for rapid ideation and mobile workflows. Perfect for creators who need instant feedback, real-time previews, or high-throughput content.
Updated 1 month, 1 week ago
38.4K runs
Quality image generation and editing with support for reference images
Updated 1 month, 1 week ago
276K runs
Take any shot and edit specific sections. Rephrase, change the action, camera angles and more
Updated 1 month, 1 week ago
866 runs
Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p
Updated 1 month, 1 week ago
37K runs
Quickly make 5s or 8s videos at 540p, 720p or 1080p. It has enhanced motion, prompt coherence and handles complex actions well.
Updated 1 month, 1 week ago
227K runs
Create 5s-8s videos with enhanced character movement, visual effects, and exclusive 1080p-8s support. Optimized for anime characters and complex actions
Updated 1 month, 1 week ago
734.8K runs
Create videos in as little as 10 seconds. 5s or 8s videos at 360p, 540p, 720p or 1080p.
Updated 1 month, 1 week ago
2.7K runs
Generate realistic lipsync animations from audio for high-quality synchronization
Updated 1 month, 1 week ago
119.2K runs
Wan 2.5 text-to-video, optimized for speed
Updated 1 month, 1 week ago
32.2K runs
Accelerated inference for Wan 2.1 14B text to video with high resolution, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 1 month, 1 week ago
35.1K runs
Accelerated inference for Wan 2.1 14B image to video with high resolution, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 1 month, 1 week ago
85.8K runs
Accelerated inference for Wan 2.1 14B image to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 1 month, 1 week ago
432.9K runs
A 20B MMDiT model for next-gen text-to-image generation
Updated 1 month, 1 week ago
9.2K runs
4MP text-to-image generation with enhanced cinematic-quality image generation with precise style control, improved text rendering, and commercial design optimization.
Updated 1 month, 1 week ago
106.1K runs
High-precision image upscaler optimized for portraits, faces and products. One of the upscale modes powered by Clarity AI. X:https://x.com/philz1337x
Updated 1 month, 1 week ago
298K runs
Unified text-to-image generation and precise single-sentence editing at up to 4K resolution
Updated 1 month, 1 week ago
20.3M runs
Generate complex 3D models from images with Rodin Gen-2
Updated 1 month, 1 week ago
1.7K runs
All the tools you need for generating pixel art tilesets
Updated 1 month, 1 week ago
1.4K runs
High quality and authentic pixel art image generation
Updated 1 month, 1 week ago
3.9K runs
Fast pixel art image generation
Updated 1 month, 1 week ago
5.9K runs
Style consistent animated pixel art sprite generation
Updated 1 month, 1 week ago
1.2K runs
A film-grade digital human model that generates realistic video from a single image, audio clip, and optional text prompt.
Updated 1 month, 2 weeks ago
7.9K runs
120b open-weight language model from OpenAI
Updated 1 month, 2 weeks ago
137.2K runs
The best model for coding and agentic tasks with configurable reasoning effort.
Updated 1 month, 2 weeks ago
182.1K runs
Fusion – Product/object blending that fixes perspective and lighting so the subject melts into a new background via the Fusion LoRA.
Updated 1 month, 2 weeks ago
683 runs
Relight – Soft, curtain-filtered relighting that repaints the scene with golden-hour or moody tones using the Relight LoRA.
Updated 1 month, 2 weeks ago
1.3K runs
Upscale – Detail-loving upscale/restore pass that sharpens textures and color fidelity with the Upscale LoRA.
Updated 1 month, 2 weeks ago
1.1K runs
Next Scene – "Next beat" cinematic edits that keep subject identity while steering to the next camera move via the Next Scene LoRA
Updated 1 month, 2 weeks ago
4.2K runs
Skin – Natural beauty retouch that enhances pores and tonal variation (no plastic skin) via the Skin LoRA.
Updated 1 month, 2 weeks ago
11.9K runs
Photo to Anime – Stylized conversion that turns photos into crisp cel-shaded anime frames using the Photo-to-Anime LoRA.
Updated 1 month, 2 weeks ago
1K runs
Generate synced sounds for any video and return it with its new soundtrack - now enhanced in version 1.5 for improved sound synchronization and realism
Updated 1 month, 2 weeks ago
4.4K runs
Generate synced sounds for any video, and return it with its new sound track
Updated 1 month, 2 weeks ago
3.1K runs
an open-source, 2B-parameter model built for real-world applications
Updated 1 month, 2 weeks ago
2.1K runs
Kimi K2 Thinking is the latest, most capable version of an open-source thinking model.
Updated 1 month, 2 weeks ago
2.4K runs
Latest hybrid thinking model from Deepseek
Updated 1 month, 2 weeks ago
159.9K runs
Grok 4 is xAI’s most advanced reasoning model. Excels at logical thinking and in-depth analysis. Ideal for insightful discussions and complex problem-solving.
Updated 1 month, 2 weeks ago
22.8K runs
A reasoning model trained with reinforcement learning, on par with OpenAI o1
Updated 1 month, 2 weeks ago
2.2M runs
Meta's flagship 405 billion parameter language model, fine-tuned for chat completions
Updated 1 month, 2 weeks ago
6.9M runs
Qwen Image Edit 2509 LoRA explorer, uses HuggingFace URLs to load any safetensor
Updated 1 month, 2 weeks ago
117.5K runs
20b open-weight language model from OpenAI
Updated 1 month, 2 weeks ago
125.4K runs
Image generation model from Reve
Updated 1 month, 2 weeks ago
37.6K runs
Image editing model from Reve
Updated 1 month, 2 weeks ago
56.7K runs
Image generation model from Reve which handles multiple input reference images
Updated 1 month, 2 weeks ago
26.2K runs
Reve's fast image edit model at only 0ドル.01 per edit
Updated 1 month, 2 weeks ago
15.7K runs
An experimental FLUX Kontext model that can combine two input images
Updated 1 month, 2 weeks ago
213.6K runs
Become a character, in style
Updated 1 month, 2 weeks ago
72.8K runs
A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts
Updated 1 month, 2 weeks ago
9.4M runs
Quickly change someone's hair style and hair color, powered by FLUX.1 Kontext [pro]
Updated 1 month, 2 weeks ago
154.8K runs
Create a professional headshot photo from any single image
Updated 1 month, 2 weeks ago
67.1K runs
A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language
Updated 1 month, 2 weeks ago
42M runs
Use FLUX Kontext to restore, fix scratches and damage, and colorize old photos
Updated 1 month, 2 weeks ago
773.8K runs
Use flux-kontext-pro to change the first or last frame of a video. Useful to use as inputs for restyling an entire video in a certain way
Updated 1 month, 2 weeks ago
631 runs
Remove all text from an image with FLUX.1 Kontext
Updated 1 month, 2 weeks ago
76.3K runs
An experimental model with FLUX Kontext Pro that can combine two input images
Updated 1 month, 2 weeks ago
2.4M runs
Create a series of portrait photos from a single image
Updated 1 month, 2 weeks ago
79.5K runs
Bring your subjects into focus with FLUX.1 Kontext [pro]
Updated 1 month, 2 weeks ago
2.6K runs
Turn your image into a cartoon with FLUX.1 Kontext [pro]
Updated 1 month, 2 weeks ago
107.8K runs
Add simple filters to your images
Updated 1 month, 2 weeks ago
6.1K runs
FLUX Kontext max with list input for multiple images
Updated 1 month, 2 weeks ago
166.7K runs
Experience impossible adventures and extreme scenarios from a single image
Updated 1 month, 2 weeks ago
5.9K runs
Put yourself in an iconic location around the world from a single image
Updated 1 month, 2 weeks ago
8.9K runs
Camera-aware edits for Qwen/Qwen-Image-Edit-2509 with Lightning + multi-angle LoRA
Updated 1 month, 3 weeks ago
235.2K runs
Inference model for FLUX 1.1 [pro] Ultra using custom `finetune_id`. Supports 4MP images and raw mode for realism
Updated 1 month, 3 weeks ago
106.3K runs
FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.
Updated 1 month, 3 weeks ago
19.7M runs
Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.
Updated 1 month, 3 weeks ago
65.7M runs
Inference model for FLUX.1 [pro] using custom `finetune_id`
Updated 1 month, 3 weeks ago
10.7K runs
State-of-the-art image generation with top of the line prompt following, visual quality, image detail and output diversity.
Updated 1 month, 3 weeks ago
13.8M runs
Turns your audio/video/images into professional-quality animated videos
Updated 1 month, 3 weeks ago
148.9K runs
A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution
Updated 1 month, 3 weeks ago
1.4M runs
A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution
Updated 1 month, 3 weeks ago
2.3M runs
Text-guided image editing model that preserves original details while making targeted modifications like lighting changes, object removal, and style conversion
Updated 1 month, 3 weeks ago
735.5K runs
A text-to-image model with support for native high-resolution (2K) image generation
Updated 1 month, 3 weeks ago
3.1M runs
A faster and cheaper version of Seedance 1 Pro
Updated 1 month, 3 weeks ago
391.6K runs
This is the fastest Flux endpoint in the world.
Updated 1 month, 3 weeks ago
37.2M runs
Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.
Updated 1 month, 3 weeks ago
3.6M runs
A speech-to-text model that uses GPT-4o mini to transcribe audio
Updated 1 month, 3 weeks ago
10.4K runs
Generate 5s and 10s videos in 720p resolution at 30fps
Updated 1 month, 3 weeks ago
815 runs
Kling 2.5 Turbo Pro: Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and remarkable prompt adherence.
Updated 1 month, 3 weeks ago
1.5M runs
A speech-to-text model that uses GPT-4o to transcribe audio
Updated 1 month, 3 weeks ago
33.7K runs
Create 5s 480p videos from a text prompt
Updated 1 month, 3 weeks ago
9.7K runs
Generate 5s and 10s videos in 720p resolution
Updated 1 month, 3 weeks ago
84.6K runs
Studio-grade lipsync in minutes, not weeks
Updated 1 month, 3 weeks ago
8.1K runs
Leonardo AI’s first foundational model produces images up to 5 megapixels (fast, quality and ultra modes)
Updated 1 month, 3 weeks ago
30.1K runs
Artistic and high-quality visuals with improved prompt adherence, diversity, and definition
Updated 1 month, 3 weeks ago
208.1K runs
Affordable and fast vector images
Updated 1 month, 3 weeks ago
82.3K runs
Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis
Updated 1 month, 3 weeks ago
7.4M runs
Recraft V3 SVG (code-named red_panda) is a text-to-image model with the ability to generate high quality SVG images including logotypes, and icons. The model supports a wide list of styles.
Updated 1 month, 3 weeks ago
339.4K runs
Affordable and fast images
Updated 1 month, 3 weeks ago
300.1K runs
Generate realistic lipsyncs with Sync Labs' 2.0 model
Updated 1 month, 3 weeks ago
14.3K runs
Creative Upscale focuses on enhancing details and refining complex elements in the image. It doesn’t just increase resolution but adds depth by improving textures, fine details, and facial features.
Updated 1 month, 3 weeks ago
11.3K runs
Convert raster images to high-quality SVG format with precision and clean vector paths, perfect for logos, icons, and scalable graphics.
Updated 1 month, 3 weeks ago
98.4K runs
Automated background removal for images. Tuned for AI-generated content, product photos, portraits, and design workflows
Updated 1 month, 3 weeks ago
179.9K runs
Designed to make images sharper and cleaner, Crisp Upscale increases overall quality, making visuals suitable for web use or print-ready materials.
Updated 1 month, 3 weeks ago
1.3M runs
Generate 5s and 10s videos in 1080p resolution
Updated 1 month, 3 weeks ago
796K runs
A premium version of Kling v2.1 with superb dynamics and prompt adherence. Generate 1080p 5s and 10s videos from text or an image
Updated 1 month, 3 weeks ago
81.5K runs
Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles
Updated 1 month, 3 weeks ago
5.7M runs
Generate 5s and 10s videos in 1080p resolution at 30fps
Updated 1 month, 3 weeks ago
2.2K runs
Generate 5s and 10s videos in 720p resolution at 30fps
Updated 1 month, 3 weeks ago
1.5M runs
Like Ideogram v2 turbo, but now faster and cheaper
Updated 1 month, 3 weeks ago
375K runs
Generate consistent characters from a single reference image. Outputs can be in many styles. You can also use inpainting to add your character to an existing image.
Updated 1 month, 3 weeks ago
501.4K runs
Add lip-sync to any video with an audio file or text
Updated 1 month, 3 weeks ago
25.8K runs
Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)
Updated 1 month, 3 weeks ago
3.1M runs
An excellent image model with state of the art inpainting, prompt comprehension and text rendering
Updated 1 month, 3 weeks ago
2.6M runs
The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles
Updated 1 month, 3 weeks ago
2.1M runs
Like Ideogram v2, but faster and cheaper
Updated 1 month, 3 weeks ago
2M runs
Balance speed, quality and cost. Ideogram v3 creates images with stunning realism, creative designs, and consistent styles
Updated 1 month, 3 weeks ago
354.6K runs
A fast image model with state of the art inpainting, prompt comprehension and text rendering.
Updated 1 month, 3 weeks ago
2.8M runs
Translate videos into over 150 languages
Updated 1 month, 3 weeks ago
3.2K runs
Modify a video with style transfer and prompt-based editing
Updated 1 month, 3 weeks ago
5.3K runs
Generate 5s and 9s 540p videos
Updated 1 month, 3 weeks ago
10.6K runs
Generate 5s and 9s 720p videos
Updated 1 month, 3 weeks ago
33.3K runs
Wan 2.5 image-to-video, optimized for speed
Updated 1 month, 3 weeks ago
40.8K runs
Accelerated inference for Wan 2.1 14B text to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 1 month, 3 weeks ago
182.4K runs
2.5 billion parameter image model with improved MMDiT-X architecture
Updated 1 month, 3 weeks ago
93.3K runs
Fast, high quality text-to-video and image-to-video (Also known as Dream Machine)
Updated 1 month, 3 weeks ago
67.2K runs
Change the aspect ratio of any video up to 30 seconds long, outputs will be 720p
Updated 1 month, 3 weeks ago
29.4K runs
Generate 5s and 9s 720p videos, faster and cheaper than Ray 2
Updated 1 month, 3 weeks ago
41K runs
A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.
Updated 1 month, 3 weeks ago
1.8M runs
A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps
Updated 1 month, 3 weeks ago
866.8K runs
Gen-4 Image Turbo is cheaper and 2.5x faster than Gen-4 Image. An image model with references, use up to 3 reference images to create the exact image you need. Capture every angle.
Updated 1 month, 3 weeks ago
93.6K runs
Generate 6s videos with prompts or images. (Also known as Hailuo). Use a subject reference to make a video with a character and the S2V-01 model.
Updated 1 month, 3 weeks ago
636.3K runs
Generate videos with specific camera movements
Updated 1 month, 3 weeks ago
72.5K runs
A high-fidelity video generation model optimized for realistic human motion, cinematic VFX, expressive characters, and strong prompt and style adherence across both text-to-video and image-to-video workflows
Updated 1 month, 3 weeks ago
31.4K runs
A lower-latency image-to-video version of Hailuo 2.3 that preserves core motion quality, visual consistency, and stylization performance while enabling faster iteration cycles.
Updated 1 month, 3 weeks ago
25K runs
An image-to-video (I2V) model specifically trained for Live2D and general animation use cases
Updated 1 month, 3 weeks ago
172.5K runs
Low‐latency MiniMax Speech 2.6 Turbo brings multilingual, emotional text-to-speech to Replicate with 300+ voices and real-time friendly pricing
Updated 1 month, 3 weeks ago
56.9K runs
Runway's Gen-4 Image model with references. Use up to 3 reference images to create the exact image you need. Capture every angle.
Updated 1 month, 3 weeks ago
541.1K runs
A new way to edit, transform and generate video
Updated 1 month, 3 weeks ago
100.4K runs
Clone voices to use with Minimax's speech-02-hd and speech-02-turbo
Updated 1 month, 3 weeks ago
27.4K runs
Generate 5s and 10s 720p videos fast
Updated 1 month, 3 weeks ago
48.1K runs
Generate 5s and 9s 540p videos, faster and cheaper than Ray 2
Updated 1 month, 3 weeks ago
56.3K runs
MiniMax Speech 2.6 HD delivers studio-quality multilingual text-to-audio on Replicate with nuanced prosody, subtitle export, and premium voices
Updated 1 month, 3 weeks ago
97.2K runs
Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Designed for real-time applications with low latency
Updated 1 month, 3 weeks ago
7.3M runs
Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.
Updated 1 month, 3 weeks ago
1.3M runs
Upscale videos by 4x, up to a maximum of 4k
Updated 1 month, 3 weeks ago
24.5K runs
Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 768p (standard) or 1080p (pro). It excels at real world physics.
Updated 1 month, 3 weeks ago
286.9K runs
Music-1.5: Full-length songs (up to 4 mins) with natural vocals & rich instrumentation
Updated 1 month, 3 weeks ago
24.2K runs
Minimax's first image model, with character reference support
Updated 1 month, 3 weeks ago
2.4M runs
A low cost and fast version of Hailuo 02. Generate 6s and 10s videos in 512p
Updated 1 month, 3 weeks ago
36.5K runs
Quickly generate up to 1 minute of music with lyrics and vocals in the style of a reference track
Updated 1 month, 3 weeks ago
484.7K runs
Change the aspect ratio of any photo using AI (not cropping)
Updated 1 month, 3 weeks ago
112.7K runs
Accelerated variant of Photon prioritizing speed while maintaining quality
Updated 1 month, 3 weeks ago
216K runs
Generate high-quality music and sound from text prompts
Updated 1 month, 3 weeks ago
10.8K runs
Ultra fast flux kontext endpoint
Updated 1 month, 3 weeks ago
16.2M runs
Professional edge-guided image generation. Control structure and composition using Canny edge detection
Updated 2 months ago
401.3K runs
Professional depth-aware image generation. Edit images while preserving spatial relationships.
Updated 2 months ago
280.9K runs
Fine-tunable Qwen Image model with exceptional composition abilities - train custom LoRAs for any style or subject
Updated 2 months ago
313 runs
Real-ESRGAN with optional face correction and adjustable upscale
Updated 2 months ago
82.1M runs
Convert PDF to markdown + JSON quickly with high accuracy
Updated 2 months, 1 week ago
9.4K runs
Detect and transcribe text in images with accurate bounding boxes, layout analysis, reding order, and table recognition, in 90 languages
Updated 2 months, 1 week ago
6.2K runs
Claude Haiku 4.5 gives you similar levels of coding performance but at one-third the cost and more than twice the speed
Updated 2 months, 2 weeks ago
38.4K runs
Generate vivid, realistic images based on a text prompt. Excels at generating images for marketing, social media, and entertainment.
Updated 2 months, 2 weeks ago
2.2K runs
A powerful native multimodal model for image generation (PrunaAI squeezed)
Updated 2 months, 3 weeks ago
39.9K runs
Granite-4.0-H-Small is a 32B parameter long-context instruct model finetuned from Granite-4.0-H-Small-Base using a combination of open source instruction datasets with permissive license and internally collected synthetic datasets.
Updated 2 months, 3 weeks ago
100.2K runs
The smartest, fastest, most useful model yet, with built-in thinking that puts expert-level intelligence in everyone’s hands
Updated 2 months, 3 weeks ago
1.3K runs
Ovi: generate videos with audio from image and text inputs
Updated 2 months, 3 weeks ago
12.6K runs
Bria Expand expands images beyond their borders in high quality. Resizing the image by generating new pixels to expand to the desired aspect ratio. Trained exclusively on licensed data for safe and risk-free commercial use
Updated 2 months, 4 weeks ago
260.9K runs
Claude Sonnet 4.5 is the best coding model to date, with significant improvements across the entire development lifecycle
Updated 3 months ago
399.6K runs
Use Wan 2.2 Animate to replace a character in a video scene
Updated 3 months, 1 week ago
16K runs
Use Wan 2.2 Animate to copy the motion of a video to another scene
Updated 3 months, 1 week ago
10K runs
The latest Qwen-Image’s iteration with improved multi-image editing, single-image consistency, and native support for ControlNet
Updated 3 months, 1 week ago
7M runs
A multimodal image generation model that creates high-quality images. You need to bring your own verified OpenAI key to use this model. Your OpenAI account will be charged for usage.
Updated 3 months, 1 week ago
1.3M runs
Granite-3.3-8B-Instruct is a 8-billion parameter 128K context length language model fine-tuned for improved reasoning and instruction-following capabilities.
Updated 3 months, 1 week ago
1.6M runs
Updated 3 months, 2 weeks ago
380 runs
OpenAI's Flagship GPT model for complex tasks.
Updated 3 months, 2 weeks ago
268.8K runs
Fastest, most cost-effective GPT-4.1 model from OpenAI
Updated 3 months, 2 weeks ago
727K runs
Fast, affordable version of GPT-4.1
Updated 3 months, 2 weeks ago
1.3M runs
Generate high-quality 2K resolution images from text prompts
Updated 3 months, 2 weeks ago
10.8K runs
Generate a video from an audio clip and a reference image
Updated 3 months, 2 weeks ago
29.4K runs
Add consistent, customizable shadows to product cutouts for enhanced visual appeal
Updated 3 months, 3 weeks ago
373 runs
Transform any product photo into professional 2000x2000px packshots with optimal positioning
Updated 3 months, 3 weeks ago
553 runs
Precise AI-powered product cutout with 256-level transparency for eCommerce
Updated 3 months, 3 weeks ago
860 runs
OpenAI's high-intelligence chat model
Updated 3 months, 3 weeks ago
336.2K runs
An image generation foundation model in the Qwen series that achieves significant advances in complex text rendering.
Updated 3 months, 3 weeks ago
1.3M runs
Edit images using a prompt. This model extends Qwen-Image’s unique text rendering capabilities to image editing tasks, enabling precise text editing
Updated 4 months, 2 weeks ago
1.4M runs
Color match and white balance fixes for images
Updated 4 months, 2 weeks ago
176.6K runs
OpenAI's fast, lightweight reasoning model
Updated 4 months, 2 weeks ago
377.5K runs
A small model alternative to o1
Updated 4 months, 2 weeks ago
3.3K runs
OpenAI's first o-series reasoning model
Updated 4 months, 2 weeks ago
16.3K runs
Low latency, low cost version of OpenAI's GPT-4o model
Updated 4 months, 2 weeks ago
12.3M runs
Image-to-video at 720p and 480p with Wan 2.2 A14B
Updated 4 months, 3 weeks ago
44.3K runs
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B text-to-video
Updated 4 months, 3 weeks ago
153K runs
The fastest Wan 2.2 text-to-image and image-to-video model
Updated 4 months, 3 weeks ago
364.9K runs
Updated Qwen3 model for instruction following
Updated 4 months, 3 weeks ago
142.7K runs
Official CLIP models, generate CLIP (clip-vit-large-patch14) text & image embeddings
Updated 5 months ago
2.3M runs
An opinionated text-to-image model from Black Forest Labs in collaboration with Krea that excels in photorealism. Creates images that avoid the oversaturated "AI look".
Updated 5 months ago
2M runs
Granite-speech-3.3-8b is a compact and efficient speech-language model, specifically designed for automatic speech recognition (ASR) and automatic speech translation (AST).
Updated 5 months ago
16.7K runs
Granite-vision-3.3-2b is a compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.
Updated 5 months ago
121.7K runs
FLUX.1 Kontext[dev] image editing model for running lora finetunes
Updated 5 months, 1 week ago
167.3K runs
This model generates beautiful cinematic 2 megapixel images in 3-4 seconds and is derived from the Wan 2.2 model through optimisation techniques from the pruna package
Updated 5 months, 2 weeks ago
919.2K runs
Open-weight version of FLUX.1 Kontext
Updated 6 months ago
5.3M runs
A version of flux-dev, a text to image model, that supports fast fine-tuned lora inference
Updated 6 months ago
5.4M runs
A 12 billion parameter rectified flow transformer capable of generating images from text descriptions
Updated 6 months ago
36.2M runs
The fastest image generation model tailored for local development and personal use
Updated 6 months ago
582.8M runs
The fastest image generation model tailored for fine-tuned use
Updated 6 months, 1 week ago
3.6M runs
Updated 6 months, 1 week ago
24.7K runs
Generate expressive, natural speech. Features unique emotion control, instant voice cloning from short audio, and built-in watermarking.
Updated 6 months, 1 week ago
201.2K runs
Generate expressive, natural speech with Resemble AI's Chatterbox.
Updated 6 months, 2 weeks ago
16K runs
Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisely to your instructions
Updated 6 months, 2 weeks ago
1.5M runs
Turn yourself into a renaissance-era painting for those renaissance moments
Updated 6 months, 4 weeks ago
4K runs
Granite-Embedding-278M-Multilingual is a 278M parameter model from the Granite Embeddings suite that can be used to generate high quality text embeddings
Updated 7 months, 1 week ago
1.7K runs
Use one or two face images to create AI avatars
Updated 8 months ago
34.9K runs
Professional-grade image upscaling, from Topaz Labs
Updated 8 months, 1 week ago
1.1M runs
Video Upscaling from Topaz Labs
Updated 8 months, 1 week ago
800.7K runs
A 17 billion parameter model with 128 experts
Updated 8 months, 3 weeks ago
2M runs
Open-weight inpainting model for editing and extending images. Guidance-distilled from FLUX.1 Fill [pro].
Updated 9 months ago
1.3M runs
DeepSeek-V3-0324 is the leading non-reasoning model, a milestone for open source
Updated 9 months ago
4.4M runs
Fast, efficient image variation model for rapid iteration and experimentation.
Updated 9 months, 2 weeks ago
65.3K runs
Open-weight image variation model. Create new versions while preserving key elements of your original.
Updated 9 months, 2 weeks ago
278.3K runs
Open-weight depth-aware image generation. Edit images while preserving spatial relationships.
Updated 9 months, 2 weeks ago
964.8K runs
Open-weight edge-guided image generation. Control structure and composition using Canny edge detection.
Updated 9 months, 2 weeks ago
188.6K runs
Face swap one or two people into a target image
Updated 9 months, 3 weeks ago
169.3K runs
Granite-3.2-8B-Instruct is a 8-billion parameter 128K context length language model fine-tuned for reasoning and instruction-following capabilities.
Updated 9 months, 3 weeks ago
458.2K runs
Generate 5s 480p videos. Wan is an advanced and powerful visual generation model developed by Tongyi Lab of Alibaba Group
Updated 10 months ago
45.6K runs
The most intelligent Claude model and the first hybrid reasoning model on the market (claude-3-7-sonnet-20250219)
Updated 10 months, 1 week ago
3.5M runs
Anthropic's fastest, most cost-effective model, with a 200K token context window (claude-3-5-haiku-20241022)
Updated 10 months, 2 weeks ago
2.9M runs
Anthropic's most intelligent language model to date, with a 200K token context window and image understanding (claude-3-5-sonnet-20241022)
Updated 10 months, 2 weeks ago
604K runs
End-to-end AI speech model designed for natural-sounding conversational speech synthesis, with support for context-aware prosody, intonation, and emotional expression.
Updated 11 months, 2 weeks ago
26.9K runs
Granite-3.1-8B-Instruct is a lightweight and open-source 8B parameter model is designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.
Updated 1 year ago
772.8K runs
Granite-3.1-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.
Updated 1 year ago
9.1K runs
High-quality image generation model optimized for creative professional workflows and ultra-high fidelity outputs
Updated 1 year ago
3.1M runs
Granite-3.0-8B-Instruct is a lightweight and open-source 8B parameter model is designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.
Updated 1 year, 2 months ago
181.4K runs
Granite-3.0-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.
Updated 1 year, 2 months ago
420.3K runs
Join the Granite community where you can find numerous recipe workbooks to help you get started with a wide variety of use cases using this model. https://github.com/ibm-granite-community
Updated 1 year, 4 months ago
552.3K runs
Join the Granite community where you can find numerous recipe workbooks to help you get started with a wide variety of use cases using this model. https://github.com/ibm-granite-community
Updated 1 year, 4 months ago
110K runs
A text-to-image model with greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency
Updated 1 year, 5 months ago
1.8M runs
An efficient, intelligent, and truly open-source language model
Updated 1 year, 8 months ago
2M runs
Base version of Llama 3, a 70 billion parameter language model from Meta.
Updated 1 year, 8 months ago
852.5K runs
A 70 billion parameter language model from Meta, fine tuned for chat completions
Updated 1 year, 8 months ago
164.3M runs
An 8 billion parameter language model from Meta, fine tuned for chat completions
Updated 1 year, 8 months ago
395M runs
Base version of Llama 3, an 8 billion parameter language model from Meta.
Updated 1 year, 8 months ago
51.2M runs
Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification
Updated 2 years, 1 month ago
70M runs
A 7 billion parameter language model from Meta, fine tuned for chat completions
Updated 2 years, 1 month ago
18.4M runs
A 7 billion parameter language model from Mistral.
Updated 2 years, 3 months ago
1.9M runs
A 70 billion parameter language model from Meta, fine tuned for chat completions
Updated 2 years, 3 months ago
10M runs
Base version of Llama 2, a 70 billion parameter language model from Meta.
Updated 2 years, 3 months ago
359.8K runs
A 13 billion parameter language model from Meta, fine tuned for chat completions
Updated 2 years, 3 months ago
4.9M runs
Base version of Llama 2 13B, a 13 billion parameter language model
Updated 2 years, 3 months ago
209.1K runs
Base version of Llama 2 7B, a 7 billion parameter language model
Updated 2 years, 3 months ago
659.5K runs