Gemini 2.0 Flash¶
Gemini 2.0 Flash delivers next-gen features and improved capabilities, including superior speed, built-in tool use, multimodal generation, and a 1M token context window.
2.0 Flash¶
Try in Vertex AI View model card in Model Garden (Preview) Deploy example app
Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and Vertex AI API enabled.
Model ID | gemini-2.0-flash |
|
Supported inputs & outputs | - Inputs: Text, Code, Images, Audio, Video - Outputs: Text | |
Token limits | - Maximum input tokens: 1,048,576 - Maximum output tokens: 8,192 | |
Capabilities | - Supported - Grounding with Google Search - Code execution - Tuning - System instructions - Controlled generation - Batch prediction - Function calling - Count Tokens - Context caching - Vertex AI RAG Engine - Chat completions - Not supported - Live API previewPreview feature - Thinking previewPreview feature | |
Usage types | - Supported - Provisioned Throughput - Dynamic shared quota - Not supported - Fixed quota | |
Technical specifications | ||
Images photo | - Maximum images per prompt: 3,000 - Maximum image size: 7 MB - Maximum tokens per minute (TPM) per project: - High/Medium/Default media resolution: - US/Asia: 40 M - EU: 10 M - Low media resolution: - US/Asia: 10 M - EU: 2.6 M - Supported MIME types: image/png , image/jpeg , image/webp |
|
Documents description | - Maximum number of files per prompt: 3,000 - Maximum number of pages per file: 1,000 - Maximum file size per file: 50 MB - Maximum tokens per minute (TPM) per project1: - US/Asia: 3.4 M - EU: 3.4 M - Supported MIME types: application/pdf , text/plain |
|
Video videocam | - Maximum video length (with audio): Approximately 45 minutes - Maximum video length (without audio): Approximately 1 hour - Maximum number of videos per prompt: 10 - Maximum tokens per minute (TPM): - High/Medium/Default media resolution: - US/Asia: 38 M - EU: 10 M - Low media resolution: - US/Asia: 10 M - EU: 2.5 M - Supported MIME types: video/x-flv , video/quicktime , video/mpeg , video/mpegs , video/mpg , video/mp4 , video/webm , video/wmv , video/3gpp |
|
Audio mic | - Maximum audio length per prompt: Appropximately 8.4 hours, or up to 1 million tokens - Maximum number of audio files per prompt: 1 - Speech understanding for: Audio summarization, transcription, and translation - Maximum tokens per minute (TPM): - US/Asia: 3.5 M - EU: 3.5 M - Supported MIME types: audio/x-aac , audio/flac , audio/mp3 , audio/m4a , audio/mpeg , audio/mpga , audio/mp4 , audio/opus , audio/pcm , audio/wav , audio/webm |
|
Parameter defaults tune | - Temperature: 0-2 - topP: 0.95 - topK: 64 (fixed) - candidateCount: 1-8 | |
Knowledge cutoff date | June 2024 | |
Versions | - gemini-2.0-flash-001 - Launch stage: Generally available - Release date: February 5, 2025 - Discontinuation date: February 5, 2026 |
|
Supported regions | ||
Model availability (Includes dynamic shared quota & Provisioned Throughput) | - Global - global - United States - us-central1 - us-east1 - us-east4 - us-east5 - us-south1 - us-west1 - us-west4 - Europe - europe-central2 - europe-north1 - europe-southwest1 - europe-west1 - europe-west4 - europe-west8 - europe-west9 | |
ML processing | - United States - Multi-region - Europe - Multi-region | |
See Data residency for more information. | ||
Security controls | ||
Online prediction | - Data residency (at rest) Supported - Customer-managed encryption keys (CMEK) Supported - VPC Service Controls Supported - Access Transparency (AXT) Supported | |
Batch prediction | - Data residency (at rest) Supported - Customer-managed encryption keys (CMEK) Not supported - VPC Service Controls Supported - Access Transparency (AXT) Not supported | |
Tuning | - Data residency (at rest) Supported - Customer-managed encryption keys (CMEK) Supported - VPC Service Controls Supported - Access Transparency (AXT) Not supported | |
See Security controls for more information. | ||
Pricing | See Pricing. |
Image generation¶
Preview
This feature is subject to the "Pre-GA Offerings Terms" in the General Service Terms section of the Service Specific Terms. Pre-GA features are available "as is" and might have limited support. For more information, see the launch stage descriptions.
Model ID | gemini-2.0-flash-preview-image-generation |
|
Supported inputs & outputs | - Inputs: Text, Code, Images, Audio, Video - Outputs: Text and image | |
Token limits | - Maximum input tokens: 32,768 - Maximum output tokens: 8,192 | |
Capabilities | - Supported - System instructions - Count Tokens - Not supported - Grounding with Google Search - Code execution - Tuning - Controlled generation - Batch prediction - Function calling - Live API previewPreview feature - Thinking previewPreview feature - Context caching - Vertex AI RAG Engine | |
Usage types | - Supported - Dynamic shared quota - Not supported - Fixed quota - Provisioned Throughput | |
Technical specifications | ||
Images photo | - Maximum images per prompt: 3,000 - Maximum image size: 7 MB - Maximum number of output images per prompt: 10 - Maximum tokens per minute (TPM) per project: - High/Medium/Default media resolution: - US/Asia: 40 M - EU: 10 M - Low media resolution: - US/Asia: 10 M - EU: 3 M - Supported MIME types: image/png , image/jpeg , image/webp |
|
Documents description | - Maximum number of files per prompt: 3,000 - Maximum number of pages per file: 1,000 - Maximum file size per file: 50 MB - Supported MIME types: application/pdf , text/plain |
|
Video videocam | - Maximum video length (with audio): Approximately 45 minutes - Maximum video length (without audio): Approximately 1 hour - Maximum number of videos per prompt: 10 - Maximum tokens per minute (TPM): - High/Medium/Default media resolution: - US/Asia: 37.9 M - EU: 9.5 M - Low media resolution: - US/Asia: 1 G - EU: 2.5 M - Supported MIME types: video/x-flv , video/quicktime , video/mpeg , video/mpegs , video/mpg , video/mp4 , video/webm , video/wmv , video/3gpp |
|
Audio mic | - Maximum audio length per prompt: Appropximately 8.4 hours, or up to 1 million tokens - Maximum number of audio files per prompt: 1 - Speech understanding for: Audio summarization, transcription, and translation - Maximum tokens per minute (TPM): - US/Asia: 1.7 M - EU: 0.4 M - Supported MIME types: audio/x-aac , audio/flac , audio/mp3 , audio/m4a , audio/mpeg , audio/mpga , audio/mp4 , audio/opus , audio/pcm , audio/wav , audio/webm |
|
Parameter defaults tune | - Temperature: 0-2 - topP: 0.95 - topK: 64 (fixed) - candidateCount: 1-8 | |
Knowledge cutoff date | August 2024 | |
Versions | - gemini-2.0-flash-preview-image-generation - Launch stage: Public preview - Release date: May 6, 2025 |
|
Supported regions | ||
Model availability | - global - global | |
See Data residency for more information. | ||
Security controls | ||
Online prediction | - Data residency (at rest) Not supported - Customer-managed encryption keys (CMEK) Not supported - VPC Service Controls Supported - Access Transparency (AXT) Supported | |
See Security controls for more information. | ||
Pricing | See Pricing. |
Live API¶
Preview
This feature is subject to the "Pre-GA Offerings Terms" in the General Service Terms section of the Service Specific Terms. Pre-GA features are available "as is" and might have limited support. For more information, see the launch stage descriptions.
Model ID | gemini-2.0-flash-live-preview-04-09 |
|
Supported inputs & outputs | - Inputs: Audio, Video - Outputs: Audio | |
Token limits | - Maximum input tokens: 32,768 - Maximum output tokens: 8,192 | |
Capabilities | - Supported - Grounding with Google Search - Code execution - System instructions - Function calling - Live API previewPreview feature - Context caching - Not supported - Tuning - Controlled generation - Batch prediction - Thinking previewPreview feature - Vertex AI RAG Engine | |
Usage types | - Supported - Dynamic shared quota - Not supported - Fixed quota - Provisioned Throughput | |
Technical specifications | ||
Video videocam | - Maximum video length (with audio): Approximately 45 minutes - Maximum video length (without audio): Approximately 1 hour - Maximum number of videos per prompt: 10 - Maximum tokens per minute (TPM): - High/Medium/Default media resolution: - US/Asia: 37.9 M - EU: 9.5 M - Low media resolution: - US/Asia: 1 G - EU: 2.5 M - Supported MIME types: video/x-flv , video/quicktime , video/mpeg , video/mpegs , video/mpg , video/mp4 , video/webm , video/wmv , video/3gpp |
|
Audio mic | - Maximum audio length per prompt: Appropximately 8.4 hours, or up to 1 million tokens - Maximum number of audio files per prompt: 1 - Speech understanding for: Audio summarization, transcription, and translation - Maximum tokens per minute (TPM): - US/Asia: 1.7 M - EU: 0.4 M - Supported MIME types: audio/x-aac , audio/flac , audio/mp3 , audio/m4a , audio/mpeg , audio/mpga , audio/mp4 , audio/opus , audio/pcm , audio/wav , audio/webm |
|
Parameter defaults tune | - Temperature: 0-2 - topP: 0.95 - topK: 64 (fixed) - candidateCount: 1-8 | |
Knowledge cutoff date | June 2024 | |
Versions | - gemini-2.0-flash-live-preview-04-09 - Launch stage: Public preview - Release date: April 9, 2025 |
|
Supported regions | ||
Model availability | - Global - global - United States - us-central1 | |
See Data residency for more information. | ||
Security controls | ||
Online prediction | - Data residency (at rest) Not supported - Customer-managed encryption keys (CMEK) Not supported - VPC Service Controls Supported - Access Transparency (AXT) Supported | |
See Security controls for more information. | ||
Pricing | See Pricing. |