Build hybrid experiences in Web apps with on-device and cloud-hosted models


Build AI-powered web apps and features with hybrid inference using Firebase AI Logic. Hybrid inference runs inference with an on-device model when it's available and seamlessly falls back to a cloud-hosted model otherwise (and vice versa).

This page describes how to get started using the client SDK. After completing this standard setup, check out the additional configuration options and capabilities (like structured output).

Note that on-device inference is supported only for web apps running in Chrome on desktop.


Recommended use cases and supported capabilities

Recommended use cases:

  • Using an on-device model for inference offers:

    • Enhanced privacy
    • Local context
    • Inference at no cost
    • Offline functionality
  • Using hybrid functionality lets you:

    • Reach 100% of your audience, regardless of on-device model availability or internet connectivity

Supported capabilities and features for on-device inference:

On-device inference supports single-turn text generation only (not multi-turn chat), with streaming or non-streaming output. You can also generate structured output, including JSON and enums.

Before you begin

Take note of the following:

Get started on localhost

These steps describe the general setup required for any supported prompt request that you want to send.

Step 1: Set up Chrome and the Prompt API for on-device inference

  1. Make sure you're using a recent version of Chrome. Update in chrome://settings/help.
    On-device inference is available from Chrome v139 and higher.

  2. Enable the on-device multimodal model by setting the following flag to Enabled:

    • chrome://flags/#prompt-api-for-gemini-nano-multimodal-input
  3. Restart Chrome.

  4. (Optional) Download the on-device model before the first request.

    The Prompt API is built into Chrome; however, the on-device model isn't available by default. If you haven't yet downloaded the model before your first request for on-device inference, the request will automatically start the model download in the background.
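If you'd rather trigger the download ahead of the first request, you can check the model's state first. Below is a minimal sketch, assuming the current shape of Chrome's Prompt API (a global `LanguageModel` with `availability()` and `create()`); the `describeAvailability` helper is a hypothetical name used only for illustration:

```javascript
// Hypothetical helper: maps a Prompt API availability state to a message.
// The four states are assumed from Chrome's Prompt API.
function describeAvailability(state) {
  const messages = {
    unavailable: "On-device inference is not supported on this device.",
    downloadable: "The on-device model can be downloaded.",
    downloading: "The on-device model is downloading.",
    available: "The on-device model is ready.",
  };
  return messages[state] ?? `Unknown availability state: ${state}`;
}

// Guarded so this sketch is a no-op outside Chrome.
async function warmUpOnDeviceModel() {
  if (typeof LanguageModel === "undefined") return;

  const state = await LanguageModel.availability();
  console.log(describeAvailability(state));

  if (state === "downloadable") {
    // Creating a session starts the background download; the monitor
    // callback reports progress so you can surface it in your UI.
    await LanguageModel.create({
      monitor(m) {
        m.addEventListener("downloadprogress", (e) => {
          console.log(`Downloaded ${Math.round(e.loaded * 100)}%`);
        });
      },
    });
  }
}

warmUpOnDeviceModel();
```

Downloading ahead of time keeps the first user-facing request from stalling while the model downloads.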

Step 2: Set up a Firebase project and connect your app to Firebase

  1. Sign in to the Firebase console, and then select your Firebase project.

  2. In the Firebase console, go to the Firebase AI Logic page.

  3. Click Get started to launch a guided workflow that helps you set up the required APIs and resources for your project.

  4. Set up your project to use a "Gemini API" provider.

    We recommend getting started with the Gemini Developer API. You can set up the Vertex AI Gemini API (which requires billing) at any time.

    For the Gemini Developer API, the console will enable the required APIs and create a Gemini API key in your project.
    Do not add this Gemini API key to your app's codebase. Learn more.

  5. If prompted in the console's workflow, follow the on-screen instructions to register your app and connect it to Firebase.

  6. Continue to the next step in this guide to add the SDK to your app.

Step 3: Add the SDK

The Firebase library provides access to the APIs for interacting with generative models. The library is included as part of the Firebase JavaScript SDK for Web.

  1. Install the Firebase JS SDK for Web using npm:

    npm install firebase
    
  2. Initialize Firebase in your app:

    import { initializeApp } from "firebase/app";
    
    // TODO(developer) Replace the following with your app's Firebase configuration
    // See: https://firebase.google.com/docs/web/learn-more#config-object
    const firebaseConfig = {
      // ...
    };
    
    // Initialize FirebaseApp
    const firebaseApp = initializeApp(firebaseConfig);
    

Step 4: Initialize the service and create a model instance


Set up the following before you send a prompt request to the model.

  1. Initialize the service for your chosen API provider.

  2. Create a GenerativeModel instance. Make sure to do the following:

    1. Call getGenerativeModel on or after an end-user interaction (like a button click). This is a prerequisite for using inferenceMode.

    2. Set the mode to one of:

      • PREFER_ON_DEVICE: Use the on-device model if it's available; otherwise, fall back to the cloud-hosted model.

      • ONLY_ON_DEVICE: Use the on-device model if it's available; otherwise, throw an exception.

      • PREFER_IN_CLOUD: Use the cloud-hosted model if it's available; otherwise, fall back to the on-device model.

      • ONLY_IN_CLOUD: Use the cloud-hosted model if it's available; otherwise, throw an exception.

import { initializeApp } from "firebase/app";
import { getAI, getGenerativeModel, GoogleAIBackend, InferenceMode } from "firebase/ai";

// TODO(developer) Replace the following with your app's Firebase configuration
// See: https://firebase.google.com/docs/web/learn-more#config-object
const firebaseConfig = {
  // ...
};

// Initialize FirebaseApp
const firebaseApp = initializeApp(firebaseConfig);

// Initialize the Gemini Developer API backend service
const ai = getAI(firebaseApp, { backend: new GoogleAIBackend() });

// Create a `GenerativeModel` instance
// Call `getGenerativeModel` after or on an end-user interaction
// Set the mode (for example, use the on-device model if it's available)
const model = getGenerativeModel(ai, { mode: InferenceMode.PREFER_ON_DEVICE });
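With the ONLY_ON_DEVICE and ONLY_IN_CLOUD modes, a request throws when the chosen model isn't available, so you may want an explicit fallback of your own. Here's a minimal sketch of that pattern; `generateWithFallback` is a hypothetical helper, and the two stand-in functions take the place of real SDK calls:

```javascript
// Hypothetical wrapper: try a primary inference call and fall back to a
// secondary one if the first throws (for example, when ONLY_ON_DEVICE
// fails because the on-device model is unavailable).
async function generateWithFallback(primary, fallback) {
  try {
    return await primary();
  } catch (e) {
    console.warn("Primary inference failed, falling back:", e.message);
    return await fallback();
  }
}

// Usage with stand-ins for an on-device call and a cloud call:
const onDevice = async () => {
  throw new Error("On-device model unavailable");
};
const inCloud = async () => "cloud response";

generateWithFallback(onDevice, inCloud).then((text) => console.log(text));
```

Note that PREFER_ON_DEVICE and PREFER_IN_CLOUD already fall back automatically; this pattern is only useful when you want to control the fallback yourself.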

Step 5: Send a prompt request to a model

This section shows you how to send various types of input to generate different types of output.

If you want to generate structured output (like JSON or enums), then use one of the following "generate text" examples and additionally configure the model to respond according to a provided schema.
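When structured output is configured with a response schema, the model's reply comes back as JSON text that you parse like any other JSON string. The sketch below uses a plain-object description of the schema and a hard-coded sample reply; the actual schema-builder API and `generationConfig` options are covered in the structured output documentation:

```javascript
// Illustrative only: a plain-object description of the JSON shape you want.
// (The SDK provides its own schema-builder helpers; this object just
// documents the expected fields for the sketch.)
const recipeSchema = {
  type: "object",
  properties: {
    name: { type: "string" },
    servings: { type: "integer" },
  },
};

// With structured output configured, the response text is JSON that
// conforms to the schema. A hard-coded sample stands in for a real
// model response here.
const sampleResponseText = '{"name": "Pancakes", "servings": 4}';
const recipe = JSON.parse(sampleResponseText);

console.log(recipe.name, recipe.servings);
```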

Generate text from text-only input

Before trying this sample, make sure that you've completed the Get started section of this guide.

You can use generateContent() to generate text from a prompt that contains text:

// Imports + initialization of FirebaseApp and backend service + creation of model instance

// Wrap in an async function so you can use await
async function run() {
  // Provide a prompt that contains text
  const prompt = "Write a story about a magic backpack.";

  // To generate text output, call `generateContent` with the text input
  const result = await model.generateContent(prompt);

  const response = result.response;
  const text = response.text();
  console.log(text);
}

run();

Note that Firebase AI Logic also supports streaming of text responses using generateContentStream (instead of generateContent).
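The streaming variant returns a result whose `stream` property is an async iterable of response chunks, each exposing a `text()` method. Here's a minimal sketch of the consumption pattern, with a mock async generator standing in for the real `model.generateContentStream(...)` result:

```javascript
// Mock chunk stream standing in for the result of `generateContentStream`.
// Each chunk exposes a `text()` method, like the real response chunks.
async function* mockStream() {
  for (const piece of ["Once upon ", "a time, ", "there was a backpack."]) {
    yield { text: () => piece };
  }
}

// Consume chunks as they arrive and assemble the full text.
async function readStream(stream) {
  let fullText = "";
  for await (const chunk of stream) {
    const chunkText = chunk.text();
    console.log("chunk:", chunkText);
    fullText += chunkText;
  }
  return fullText;
}

readStream(mockStream()).then((text) => console.log("full:", text));
```

Rendering each chunk as it arrives lets you show partial output to the user instead of waiting for the complete response.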

Generate text from text-and-image (multimodal) input

Before trying this sample, make sure that you've completed the Get started section of this guide.

You can use generateContent() to generate text from a prompt that contains both text and image files, providing each input file's mimeType and the file itself.

The supported input image types for on-device inference are PNG and JPEG.

// Imports + initialization of FirebaseApp and backend service + creation of model instance

// Converts a File object to a Part object.
async function fileToGenerativePart(file) {
  const base64EncodedDataPromise = new Promise((resolve) => {
    const reader = new FileReader();
    reader.onloadend = () => resolve(reader.result.split(',')[1]);
    reader.readAsDataURL(file);
  });
  return {
    inlineData: { data: await base64EncodedDataPromise, mimeType: file.type },
  };
}

async function run() {
  // Provide a text prompt to include with the image
  const prompt = "Write a poem about this picture:";

  const fileInputEl = document.querySelector("input[type=file]");
  const imagePart = await fileToGenerativePart(fileInputEl.files[0]);

  // To generate text output, call `generateContent` with the text and image
  const result = await model.generateContent([prompt, imagePart]);

  const response = result.response;
  const text = response.text();
  console.log(text);
}

run();

Note that Firebase AI Logic also supports streaming of text responses using generateContentStream (instead of generateContent).

Enable end-users to try your feature

For end-users to try your feature in your app, you must enroll in the Chrome Origin Trials. Note that these trials are limited in duration and usage.

  1. Register for the Prompt API Chrome Origin Trial. You'll be given a token.

  2. Provide the token on every web page for which you want the trial feature to be enabled. Use one of the following options:

    • Provide the token as a meta tag in the <head> tag: <meta http-equiv="origin-trial" content="TOKEN">

    • Provide the token as an HTTP header: Origin-Trial: TOKEN

    • Provide the token programmatically.
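The programmatic option can be as simple as appending the meta tag from a script. Below is a minimal sketch; both helper names are hypothetical, and the tag attributes match the meta-tag option above:

```javascript
// Hypothetical helper: builds the origin-trial meta tag markup.
function originTrialMetaHtml(token) {
  return `<meta http-equiv="origin-trial" content="${token}">`;
}

// Hypothetical helper: injects the token into the current page.
// Guarded so the sketch is a no-op outside a browser.
function addOriginTrialToken(token) {
  if (typeof document === "undefined") return;
  const meta = document.createElement("meta");
  meta.httpEquiv = "origin-trial";
  meta.content = token;
  document.head.appendChild(meta);
}

console.log(originTrialMetaHtml("TOKEN"));
```

Injecting the token from a script is handy when your pages are rendered by a framework and you can't easily edit the static `<head>` markup.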

What else can you do?

You can use various additional configuration options and capabilities for your hybrid experiences:

Features not yet available for on-device inference

Because this is a preview release, not all capabilities of the Web SDK are available for on-device inference. The following features are not yet supported for on-device inference (but they are usually available for cloud-based inference).

  • Generating text from image file input types other than JPEG and PNG

    • Can fall back to the cloud-hosted model; however, ONLY_ON_DEVICE mode will throw an error.
  • Generating text from audio, video, and document (like PDF) inputs

    • Can fall back to the cloud-hosted model; however, ONLY_ON_DEVICE mode will throw an error.
  • Generating images using Gemini or Imagen models

    • Can fall back to the cloud-hosted model; however, ONLY_ON_DEVICE mode will throw an error.
  • Providing files using URLs in multimodal requests. You must provide files as inline data to on-device models.

  • Multi-turn chat

    • Can fall back to the cloud-hosted model; however, ONLY_ON_DEVICE mode will throw an error.
  • Bi-directional streaming with the Gemini Live API

  • Providing the model with tools to help it generate its response (like function calling, code execution, URL context, and grounding with Google Search)

  • Counting tokens

    • Always throws an error. The count will differ between cloud-hosted and on-device models, so there is no intuitive fallback.
  • AI monitoring in the Firebase console for on-device inference.

    • Note that any inference using the cloud-hosted models can be monitored just like other inference using the Firebase AI Logic client SDK for Web.


Give feedback about your experience with Firebase AI Logic