Gemini 3 Pro

Gemini 3 Pro is our most advanced reasoning Gemini model, capable of solving complex problems. Gemini 3 Pro can comprehend vast datasets and challenging problems from different information sources, including text, audio, images, video, PDFs, and even entire code repositories with its 1M token context window.

Gemini 3 Pro introduces several new features to improve performance, control, and multimodal fidelity:

  • Thinking level: Use the thinking_level parameter to control the amount of internal reasoning the model performs (low or high) to balance response quality, reasoning complexity, latency, and cost. The thinking_level parameter replaces thinking_budget for Gemini 3 models.
  • Media resolution: Use the media_resolution parameter (low, medium, or high) to control vision processing for multimodal inputs, impacting token usage and latency. See Get started with Gemini 3 for default resolution settings.
  • Thought signatures: Stricter validation of thought signatures improves reliability in multi-turn function calling.
  • Multimodal function responses: Function responses can now include multimodal objects like images and PDFs in addition to text.
  • Streaming Function calling: Stream partial function call arguments to improve user experience during tool use.

For more information on using these features, see Get started with Gemini 3 Pro.

Try in Vertex AI View in Model Garden (Preview) Deploy example app

Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and Vertex AI API enabled.
Model ID gemini-3-pro-preview
Supported inputs & outputs
  • Inputs:
    Text, Code, Images, Audio, Video, PDF
  • Outputs:
    Text
Token limits
  • Maximum input tokens: 1,048,576
  • Maximum output tokens: 65,536
Capabilities
Usage types
Technical specifications
Images
  • Maximum images per prompt: 900
  • Maximum image size: 7 MB
  • Default resolution tokens: 1120
  • Supported MIME types:
    image/png, image/jpeg, image/webp
Documents
  • Maximum number of files per prompt: 900
  • Maximum number of pages per file: 900
  • Maximum file size per file for the API or Cloud Storage imports: 50 MB
  • Maximum file size per file for direct uploads through the console: 7 MB
  • Default resolution tokens: 560
  • OCR for scanned PDFs: Not used by default
  • Supported MIME types:
    application/pdf, text/plain
Video
  • Maximum video length (with audio): Approximately 45 minutes
  • Maximum video length (without audio): Approximately 1 hour
  • Maximum number of videos per prompt: 10
  • Default resolution tokens per frame: 70
  • Supported MIME types:
    video/x-flv, video/quicktime, video/mpeg, video/mpegs, video/mpg, video/mp4, video/webm, video/wmv, video/3gpp
Audio
  • Maximum audio length per prompt: Approximately 8.4 hours, or up to 1 million tokens
  • Maximum number of audio files per prompt: 1
  • Speech understanding for: Audio summarization, transcription, and translation
  • Supported MIME types:
    audio/x-aac, audio/flac, audio/mp3, audio/m4a, audio/mpeg, audio/mpga, audio/mp4, audio/ogg, audio/pcm, audio/wav, audio/webm
Parameter defaults
  • Temperature: 0.0-2.0 (default 1.0)
  • topP: 0.0-1.0 (default 0.95)
  • topK: 64 (fixed)
  • candidateCount: 1–8 (default 1)
Supported regions

Model availability

(Includes dynamic shared quota & Provisioned Throughput)

  • Global
    • global
See Data residency for more information.
Knowledge cutoff date January 2025
Versions
  • gemini-3-pro-preview
    • Launch stage: Public preview
    • Release date: November 18, 2025
Security controls
See Security controls for more information.
Supported languages See Supported languages.
Pricing See Pricing.

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025年11月18日 UTC.