Official AI models

Official AI models: Always available, stable, and predictably priced

Recommended Models

Frequently asked questions

What does "Official AI Models" mean on Replicate?

"Official" models are those maintained directly by Replicate or in close collaboration with trusted partners like openai/gpt-5, google/veo-3, bytedance/seedream-4, and black-forest-labs/flux-pro.
They’re kept warm, meaning they are:

Always available with minimal cold start times
Continuously monitored for uptime and reliability
Predictably priced and stable across API usage

These models represent the most production-ready versions for image, video, language, and audio generation.

Which categories of models are included?

Official models span across multiple domains:

Text-to-video and image-to-video – e.g. google/veo-3, luma/ray, bytedance/seedream-4, pixverse-ai/pixverse-v5, kling-ai/kling-v2.5, wan-ai/wan-2.2
Text-to-image and image editing – e.g. black-forest-labs/flux-pro, google/imagen-4, ideogram-ai/ideogram-v3, recraft-ai/recraft-v3, bria-ai/bria-background
Language and reasoning – e.g. openai/gpt-5, anthropic/claude-4.5-sonnet, xai/grok-4, meta/meta-llama-3.1-405b-instruct, deepseek-ai/deepseek-r1
Music and audio generation – e.g. stability-ai/stable-audio-2.5, google/lyria-2, minimax/music-1.5
Speech and transcription – e.g. openai/gpt-4o-transcribe, minimax/speech-02-turbo, resemble-ai/chatterbox
Image and video enhancement – e.g. nightmareai/real-esrgan, topazlabs/video-upscale, black-forest-labs/flux-fill-pro
Embeddings and search – e.g. openai/clip, ibm-granite/granite-embedding-278m-multilingual, lucataco/snowflake-arctic-embed-l

In short, this collection gives you a reliable foundation for nearly every generative AI task—all powered by models that are always online.

Which models are the most popular?

The most widely used include:

black-forest-labs/flux-pro and related Flux models
google/nano-banana and google/imagen-4
bytedance/seedream-4 and bytedance/seedance
openai/gpt-5 and anthropic/claude-4.5-sonnet
nightmareai/real-esrgan and topazlabs/video-upscale
ideogram-ai/ideogram-v3 and recraft-ai/recraft-v3

These consistently rank among the highest-run models on Replicate and are trusted for commercial, creative, and research use.

Why does Replicate keep these models "warm"?

By keeping them warm, Replicate ensures:

Instant availability (no waiting for cold starts)
Predictable pricing, so developers can estimate usage costs accurately
Stable performance across large workloads
Consistent APIs for long-term integrations

This is ideal for developers running production-level applications, where uptime and consistency matter as much as model quality.

What are the benefits of using Official models over community ones?

Official models:

Are monitored and load-balanced for uptime
Offer consistent latency and outputs
Are typically fine-tuned or hosted in collaboration with model creators (e.g. OpenAI, Google, Anthropic)
Include ongoing updates and performance improvements

Community models, while diverse and experimental, can go offline, change parameters, or vary in speed and output quality.

Which models are best for video generation?

For text-to-video and image-to-video:

luma/ray-2 / luma/ray-flash – fast and cinematic
google/veo-3 – high-fidelity with synced audio
bytedance/seedream-4, bytedance/seedance, bytedance/omni-human – realistic, production-grade human and scene rendering
pixverse-ai/pixverse-v5 – smooth motion and anime-style scenes
kling-ai/kling-v2.5, wan-ai/wan-2.2, minimax/hailuo-02 – top choices for physics realism and motion detail

Which models are best for text and reasoning?

For language understanding and complex reasoning:

openai/gpt-5 family – top-tier reasoning, coding, and writing performance
anthropic/claude-4.5-sonnet and anthropic/claude-4.5-haiku – strong instruction following and safety
xai/grok-4 – excels in logic and long-form reasoning
meta/meta-llama-3.1-405b-instruct – open-weight alternative for chat and analysis
deepseek-ai/deepseek-r1 and deepseek-ai/deepseek-v3.1 – great balance of speed and quality for reasoning tasks

Which models are best for image generation and editing?

black-forest-labs/flux-pro, black-forest-labs/flux-ultra, black-forest-labs/flux-kontext-pro – professional-grade, text-guided image creation and editing
google/imagen-4 – realism and aesthetic quality
ideogram-ai/ideogram-v3-quality and ideogram-ai/ideogram-v3-turbo – detailed, text-rich visuals
recraft-ai/recraft-v3 and bria-ai/bria-background – commercial-ready for brand design, upscaling, and background generation
google/nano-banana – fast, high-fidelity generation in Gemini 2.5

What about music and sound?

google/lyria-2 and minimax/music-1.5 for multi-instrument songs with vocals
stability-ai/stable-audio-2.5 for customizable text-to-music generation
mirelo/video-to-sfx-v1.5 for generating sound effects synced to video

These are ideal for soundtracks, content production, and creative prototyping.

Can I use these models commercially?

Yes. All Official AI Models are available for commercial use, unless explicitly stated otherwise on their individual pages.
They are licensed, production-ready, and have predictable pricing structures designed for apps, agencies, and enterprise deployments.

How do I know if a model is "official"?

Each official model includes a green "Official" tag on its page.
They are also grouped in this "Official AI Models" collection for easy discovery.

What should I know before running an Official model?

Models are optimized for consistent results rather than experimental flexibility.
Pricing and performance are kept stable through Replicate’s managed infrastructure.
Many official models (like openai/gpt-5, google/veo-3, or black-forest-labs/flux-pro) are updated regularly but retain backward-compatible interfaces.
You can use them programmatically via the Replicate API or directly on replicate.com.

Any collection-specific tips?

If you want reliability, use official models for production and community models for experimentation.
Combine official models across modalities — e.g. use bytedance/seedream-4 to generate video, then mirelo/video-to-sfx-v1.5 to add audio, and nightmareai/real-esrgan to upscale.
Official models are your best bet for long-term workflows, demos, or client-facing products where uptime, predictability, and support matter.