The latest Gemini models, like Gemini 3.6 Flash, are available to use with Firebase AI Logic! Learn more.

All Imagen models will shut down as early as June 30, 2026. Learn about migrating your apps to use Nano Banana.

Gemini 2.5 models will shut down in October 2026. To avoid service disruptions, update to a newer model (like gemini-3.6-flash or gemini-3.1-flash-image). Any stable Gemini Live API 2.5 models are not impacted. Learn more.

Learn about supported models

For mobile and web apps, the Firebase AI Logic SDKs let you interact with the supported Gemini models directly from your app.

Gemini models are considered multimodal because they're capable of processing and even generating multiple modalities, including text, code, PDFs, images, video, and audio.

Also, review our FAQ about all the models that Firebase AI Logic supports and does not support.

Featured models

FAST AND INTELLIGENT

Gemini 3.6 Flash

gemini-3.6-flash

Frontier-class performance rivaling larger models at a fraction of the cost. (billing not required)

ULTRA FAST

Gemini 3.5 Flash-Lite

gemini-3.5-flash-lite

High-volume, cost-sensitive workhorse model with the performance and quality of the Gemini 3 series. (billing not required)

IMAGE GENERATING

Gemini 3.1 Flash Image (Nano Banana 2)

gemini-3.1-flash-image

Powerful, high-efficiency image generation and editing model, optimized for speed and high-volume use cases. (billing required)

General-use models

Gemini 3.x general-use models

Gemini 3.1 Pro

gemini-3.1-pro-preview

Advanced intelligence, complex problem-solving skills, and powerful agentic and vibe coding capabilities. (billing required)

Gemini 3.6 Flash

gemini-3.6-flash

Frontier-class performance rivaling larger models at a fraction of the cost. (billing not required)

Gemini 3.5 Flash-Lite

gemini-3.5-flash-lite

High-volume, cost-sensitive workhorse model with the performance and quality of the Gemini 3 series. (billing not required)

Older stable general-use models

Gemini 3.5 Flash (gemini-3.5-flash): Previous Gemini 3.x Flash model for frontier-class performance rivaling larger models at a fraction of the cost. (billing not required)
Gemini 3.1 Flash‑Lite (gemini-3.1-flash-lite): Previous Gemini 3.x Flash‑Lite model for high-volume, cost-sensitive workhorse tasks. (billing not required)

Image-generating models

Gemini 3.x image-generating models

Gemini 3 Pro Image (Nano Banana Pro)

gemini-3-pro-image

State-of-the-art image generation and editing model for highly contextual native image creation. (billing required)

Gemini 3.1 Flash Image (Nano Banana 2)

gemini-3.1-flash-image

Powerful, high-efficiency image generation and editing model, optimized for speed and high-volume use cases. (billing required)

Gemini 3.1 Flash-Lite Image (Nano Banana 2 Lite)

gemini-3.1-flash-lite-image

Ultra-low latency and cost-effective image generation and editing model, designed for high-volume interactive use cases. (billing required)

Audio-generating models

You can generate streamed audio with models that support the Gemini Live API.

Gemini 2.5 Flash with Gemini Live API native audio

Gemini Developer API: gemini-2.5-flash-native-audio-preview-12-2025

Agent Platform Gemini API: gemini-live-2.5-flash-native-audio

Enables low-latency, bidirectional, real-time voice and video interactions with a Gemini model. (billing not required)

The remainder of this page provides detailed information about the models supported by Firebase AI Logic.

- Compare models:
  - Supported input and output
  - High-level comparison of the supported capabilities
  - Specifications and limitations, for example max input tokens or max length of input video
- Description of how models are versioned, specifically their stable, preview, and experimental versions
- Lists of available model names to include in your code during initialization
- Lists of supported languages for the models

At the bottom of this page, you can view detailed information about previous generation models.

For details about the Gemini Live API models (like gemini-live-2.5-flash-native-audio), see Limits and specifications of the Live API.

Compare models

Each model has different capabilities to support various use cases. Note that each table in this section describes the models as used with Firebase AI Logic. A model might have additional capabilities that aren't available when using our SDKs.

If you can't find the information you're looking for in the following subsections, see your chosen API provider's documentation: Gemini Developer API or Agent Platform Gemini API (formerly Vertex AI).

Supported input and output

The following table lists the supported input and output types when using each model with Firebase AI Logic.

To learn about supported file types, see Supported input files and requirements.

	Gemini 3.x Pro, Flash, Flash‑Lite	Gemini 3.x Pro Image	Gemini 3.x Flash Image	Gemini 3.x Flash‑Lite Image
Input types
Text
Code
Documents (PDFs or plain-text)
Images
Video
Audio
Output types
Text
Text (streaming)
Code
Structured output (like JSON)
Images
Video
Audio

(Deprecated) Supported input and output (Gemini 2.5 models)

	Gemini 2.5 Pro, Flash, Flash‑Lite		Gemini 2.5 Flash Image
Input types
Text
Code
Documents (PDFs or plain-text)
Images
Video
Audio
Output types
Text
Text (streaming)
Code
Structured output (like JSON)
Images
Video
Audio

Supported capabilities and features

The following table lists the supported capabilities and features when using each model with Firebase AI Logic.

	Gemini 3.x Pro Image	Gemini 3.x Flash Image	Gemini 3.x Flash‑Lite Image
Thinking
Generate text from text-only or multimodal inputs	interleaved or as part of image	interleaved or as part of image	interleaved or as part of image
Generate images
Edit images
Generate audio
Generate structured output (like JSON)
Analyze documents (PDFs or plain-text) (text-output | image-output)
Analyze images (text-output | image-output)
Analyze video (text-output | image-output)
Analyze audio
Multi-turn chat
Bidirectional multimodal streaming
Supported tools
Function calling
Code execution
URL context
Grounding with Google Search
Grounding with Google Maps

(Deprecated) Supported capabilities and features (Gemini 2.5 models)

	Gemini 2.5 Pro, Flash, Flash‑Lite		Gemini 2.5 Flash Image
Thinking
Generate text from text-only or multimodal inputs			interleaved or as part of image
Generate images
Edit images
Generate audio
Generate structured output (like JSON)
Analyze documents (PDFs or plain-text) (text-output | image-output)
Analyze images (text-output | image-output)
Analyze video (text-output | image-output)
Analyze audio
Multi-turn chat
Bidirectional multimodal streaming
Supported tools
Function calling
Code execution
URL context
Grounding with Google Search
Grounding with Google Maps

Specifications and limitations

The following table lists the specifications and limitations when using each model with Firebase AI Logic.

Property	Gemini 3.x Pro, Flash, Flash‑Lite	Gemini 3.x Pro Image	Gemini 3.x Flash Image	Gemini 3.x Flash‑Lite Image
Input token limit *	1,048,576 tokens	65,536 tokens	131,072 tokens	65,536 tokens
Output token limit *	65,536 tokens	32,768 tokens	32,768 tokens	4,096 tokens
PDFs (per request)
Max number of input PDF files **	900 files	14 files	14 files	14 files
Max number of pages per input PDF file **	900 pages	14 pages	14 pages	14 pages
Max size per input PDF file	50 MB	50 MB	50 MB	50 MB
Images (per request)
Max number of input images	1,000 images	14 images	14 images	14 images
Max size per input base64-encoded image	7 MB	7 MB	7 MB	7 MB
Max number of output images	---	Up to output token limit	Up to output token limit	Up to output token limit
Video (per request)
Max number of input video files	10 files	---	Up to input token limit	Up to input token limit
Max length of all input video (frames only)	~60 minutes	---	~25 minutes	~12 minutes
Max length of all input video (frames+audio)	~45 minutes	---	---	---
Audio (per request)
Max number of input audio files	1 file	---	---	---
Max length of all input audio	~8.4 hours	---	---	---
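
The per-request limits in the table above can be checked on the client before a request is sent. The following is a minimal sketch in Python, used here only for illustration (it is not one of the Firebase AI Logic SDKs). The names `LIMITS`, `RequestSummary`, and `check_limits` are hypothetical, the dictionary keys are illustrative labels rather than model names, and treating a "---" cell as a limit of 0 is an assumption. The numeric values are copied from two columns of the table.

```python
from dataclasses import dataclass

# Per-request limits copied from two columns of the table above.
# Keys are illustrative labels (hypothetical), not model names.
LIMITS = {
    "gemini-3.x-general": {
        "input_tokens": 1_048_576,
        "pdf_files": 900,
        "images": 1_000,
        "video_files": 10,
        "audio_files": 1,
    },
    "gemini-3.x-pro-image": {
        "input_tokens": 65_536,
        "pdf_files": 14,
        "images": 14,
        "video_files": 0,  # "---" in the table, treated as 0 (assumption)
        "audio_files": 0,  # "---" in the table, treated as 0 (assumption)
    },
}


@dataclass
class RequestSummary:
    """Counts describing one planned request (hypothetical helper type)."""
    input_tokens: int = 0
    pdf_files: int = 0
    images: int = 0
    video_files: int = 0
    audio_files: int = 0


def check_limits(family: str, request: RequestSummary) -> list[str]:
    """Return a list of limit violations; the list is empty if none are found."""
    problems = []
    for field, maximum in LIMITS[family].items():
        actual = getattr(request, field)
        if actual > maximum:
            problems.append(f"{field}: {actual} exceeds limit of {maximum}")
    return problems
```

A check like this only catches count-based limits. Limits such as the maximum video length or file size would need the media metadata as well.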

(Deprecated) Specifications and limitations (Gemini 2.5 models)

	Gemini 2.5 Pro, Flash, Flash‑Lite	Gemini 2.5 Flash Image
Input token limit *	1,048,576 tokens	32,768 tokens
Output token limit *	65,536 tokens	8,192 tokens
PDFs (per request)
Max number of input PDF files **	3,000 files	3 files
Max number of pages per input PDF file **	1,000 pages	3 pages
Max size per input PDF file	50 MB	50 MB
Images (per request)
Max number of input images	3,000 images	3 images
Max size per input base64-encoded image	7 MB	7 MB
Max number of output images	---	Up to output token limit
Video (per request)
Max number of input video files	10 files	---
Max length of all input video (frames only)	~60 minutes	---
Max length of all input video (frames+audio)	~45 minutes	---
Audio (per request)
Max number of input audio files	1 file	---
Max length of all input audio	~8.4 hours	---

* For all Gemini models, a token is equivalent to about 4 characters, so 100 tokens are about 60-80 English words. For Gemini models, you can determine the total count of tokens in your requests using countTokens.

** PDFs are treated as images, so a single page of a PDF is treated as one image. The number of pages allowed in a request is limited to the number of images the model can support.
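
The two footnotes above translate into simple arithmetic. The following Python sketch is illustrative only: `estimate_tokens` and `pdf_pages_fit` are hypothetical helpers, the 4-characters-per-token ratio is the approximation from the footnote, and applying the image limit to the total page count across all PDFs in a request is an assumption. For an exact count, use countTokens.

```python
import math

CHARS_PER_TOKEN = 4  # approximation from the footnote, not an exact ratio


def estimate_tokens(text: str) -> int:
    """Rough token estimate based on about 4 characters per token."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def pdf_pages_fit(pdf_page_counts: list[int], max_images: int) -> bool:
    """Each PDF page counts as one image, so compare total pages to the image limit.

    Summing pages across all PDFs in the request is an assumption.
    """
    return sum(pdf_page_counts) <= max_images
```

An estimate like this is useful for a quick pre-check against the input token limit, but it can be noticeably off for code, non-English text, or multimodal input.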

Find additional detailed information

Quotas and pricing are different for each model. Pricing also depends on input and output.
Learn about supported input file types, how to specify MIME type, and how to make sure that your input files and multimodal requests meet the requirements and follow best practices in Supported input files and requirements.

Important: The total request size limit is 20 MB. To send large files, review the options for providing files in multimodal requests.
For details about the Gemini Live API models, see Limits and specifications of the Live API.
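
Because files sent inline are base64-encoded, their size in the request is larger than their size on disk. The following Python sketch estimates whether a set of files would fit under the 20 MB total request size limit. It is illustrative only: `base64_size` and `fits_inline` are hypothetical helpers, reading "20 MB" as 20 × 1024 × 1024 bytes is an assumption, and so is the idea that the limit applies to the encoded payload.

```python
import math

# 20 MB total request size, interpreted as MiB (assumption).
MAX_REQUEST_BYTES = 20 * 1024 * 1024


def base64_size(raw_bytes: int) -> int:
    """Size in bytes of data after standard padded base64 encoding."""
    return 4 * math.ceil(raw_bytes / 3)


def fits_inline(raw_file_sizes: list[int], other_bytes: int = 0) -> bool:
    """Check whether base64-encoded files plus other payload bytes fit the limit."""
    total = other_bytes + sum(base64_size(n) for n in raw_file_sizes)
    return total <= MAX_REQUEST_BYTES
```

Base64 encoding grows data by roughly one third, so a file of about 15 MB on disk already approaches the limit once encoded.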

Model versioning and naming patterns

Models are offered in stable, preview, and experimental versions. For convenience, aliases without explicit version values are supported.

To find specific model names to use in your code, see the "available model names" section later on this page.

Version type / Release stage	Description	Model name pattern
Stable	*Stable* versions are available and supported for production use starting on the release date. A stable model version is typically released with a retirement date, which indicates the last day that the model is available. After this date, the model is no longer accessible or supported by Google.	Gemini 2.5 and later models: model names of stable versions have no suffix. Example: `gemini-3.6-flash`
Preview	*Preview* versions have new capabilities and are not considered stable. These models are not recommended for production use, come with more restrictive rate limits, and may have billing requirements. These models are shut down (retired) within a few weeks or months after their associated stable version is released. For the Agent Platform Gemini API (formerly Vertex AI), preview models are only available in the `global` location.	Model names of preview versions are appended with `-preview` and often the model's release date (`-MM-DD` for older models or `-MM-YYYY` for newer models). Examples: `gemini-2.5-flash-preview-04-17` (released on April 17, 2025), `gemini-2.5-flash-preview-09-2025` (released in September 2025), or `gemini-3-pro-preview` (released in November 2025)
Experimental	*Experimental* versions have new capabilities and are not considered stable. These models are not recommended for production use and come with more restrictive rate limits. Experimental models are intended for gathering feedback and enabling experimentation with our latest features. These models are shut down (retired) within a few weeks or months after their associated stable version is released. For the Agent Platform Gemini API (formerly Vertex AI), experimental models are only available in the `global` location.	Model names of experimental versions are appended with `-exp` along with the model's release date (`-MM-DD`). Example: `gemini-2.5-pro-exp-03-25` (released on March 25, 2025)
Shutdown (retired)	*Shutdown (retired)* versions are past their shutdown (retirement) date and have been permanently deactivated. Shutdown (retired) models are no longer accessible or supported by Google, and a request using a retired model name returns a 404 error.	---
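
The naming patterns in the table above can be recognized with a pair of regular expressions. The following Python sketch is illustrative only: `release_stage` is a hypothetical helper, and it assumes the name follows the Gemini 2.5 and later conventions described in the table. A name alone cannot reveal whether a model has been shut down.

```python
import re

# "-preview", optionally followed by "-MM-DD" or "-MM-YYYY".
_PREVIEW = re.compile(r"-preview(-\d{2}-\d{2}|-\d{2}-\d{4})?$")
# "-exp" followed by "-MM-DD".
_EXPERIMENTAL = re.compile(r"-exp-\d{2}-\d{2}$")


def release_stage(model_name: str) -> str:
    """Classify a model name as "experimental", "preview", or "stable" by its suffix."""
    if _EXPERIMENTAL.search(model_name):
        return "experimental"
    if _PREVIEW.search(model_name):
        return "preview"
    # No recognized suffix: stable, per the table's rule for Gemini 2.5 and later.
    return "stable"
```

For example, an app could call `release_stage` on its configured model name at startup and log a warning when the result is not `"stable"`.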

Available model names

Model names are the explicit values that you include in your code during initialization of the model.

General-use models (like gemini-3.6-flash)
Image-generating models (like gemini-3.1-flash-image, aka the "Nano Banana" models)
Audio-generating models (like gemini-live-2.5-flash-native-audio)

For initialization examples for your platform, see the getting started guide.

For details about the release stages (especially for use cases, billing, and shutdown), see model versioning and naming patterns.

Programmatically list all available models

You can list all available model names using the REST API:

Gemini Developer API: Call the models.list endpoint
Agent Platform Gemini API (formerly Vertex AI): Call the publishers.models.list endpoint

Note that this returned list will include all models supported by the API providers, but Firebase AI Logic only supports the Gemini models described on this page.
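
As a rough illustration of the note above, the following Python sketch calls the Gemini Developer API `models.list` endpoint and keeps only names that appear on this page. Several details are assumptions to verify against the API reference: the endpoint URL, the `key` and `pageToken` query parameters, and the response fields `models`, `name`, and `nextPageToken`. The helper names and the `NAMES_ON_THIS_PAGE` set are hypothetical, and the set covers only the general-use and image-generating names listed earlier on this page.

```python
import json
import urllib.parse
import urllib.request

# Assumed URL of the Gemini Developer API models.list endpoint.
MODELS_LIST_URL = "https://generativelanguage.googleapis.com/v1beta/models"

# General-use and image-generating model names taken from this page.
NAMES_ON_THIS_PAGE = {
    "gemini-3.1-pro-preview", "gemini-3.6-flash", "gemini-3.5-flash-lite",
    "gemini-3.5-flash", "gemini-3.1-flash-lite", "gemini-3-pro-image",
    "gemini-3.1-flash-image", "gemini-3.1-flash-lite-image",
}


def extract_names(payload: dict) -> list[str]:
    """Pull model names out of one response page, dropping a "models/" prefix."""
    return [m["name"].removeprefix("models/") for m in payload.get("models", [])]


def only_listed_here(names: list[str]) -> list[str]:
    """Keep only the names that appear in NAMES_ON_THIS_PAGE."""
    return [n for n in names if n in NAMES_ON_THIS_PAGE]


def fetch_model_names(api_key: str) -> list[str]:
    """Call models.list, following pagination, and return all model names."""
    names, page_token = [], None
    while True:
        params = {"key": api_key}
        if page_token:
            params["pageToken"] = page_token
        url = f"{MODELS_LIST_URL}?{urllib.parse.urlencode(params)}"
        with urllib.request.urlopen(url) as response:
            payload = json.load(response)
        names.extend(extract_names(payload))
        page_token = payload.get("nextPageToken")
        if not page_token:
            return names
```

Usage would look like `only_listed_here(fetch_model_names(api_key))`, where `api_key` is your own Gemini API key. The filtering step matters because the endpoint returns every model the provider offers, not only those supported by Firebase AI Logic.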

General-use models