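Features available when accessing the Gemini API through Apple's Foundation Models framework

The examples on this page assume that you have completed Get started: Access the Gemini API through Apple's Foundation Models framework.

This guide shows how to use the Firebase AI Logic SDK for Apple platforms to send various types of requests to the Gemini API through Apple's Foundation Models framework.

This page shows examples of how to send the following types of requests:

Generate text from text-only input
Generate text during a multi-turn conversation (chat)
Generate text from multimodal input (such as images)
Generate images from text-only input
Generate structured JSON output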

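Generate text

Gemini models support the following text generation capabilities:

Generate text from text-only input
Generate text during a multi-turn conversation (chat)
Generate text from multimodal input (such as images)

Models that support this capability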

gemini-3.1-pro-preview
gemini-3.5-flash
gemini-3.1-flash-lite

Generate text from text-only input

You can prompt a Gemini model to generate text from text-only input.

import FoundationModels
import FirebaseCore
import FirebaseAILogic

// Initialize the Gemini Developer API backend service.
let ai = FirebaseAI.firebaseAI(backend: .googleAI())
// Initialize a `geminiLanguageModel` with a Gemini model that supports your use case.
let model = ai.geminiLanguageModel(name: "gemini-3.5-flash")

// Provide a prompt that contains text.
let prompt = "Write a story about a magic backpack."

// Create a session by injecting the model into Apple's `LanguageModelSession`.
// For a single-turn interaction, create a new session each time you call the model.
let session = LanguageModelSession(model: model)

// Generate a text response to the prompt.
let response = try await session.respond(to: prompt)
print(response.content)

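Stream the response

Instead of waiting for the model to generate the complete result, you can use streaming to handle partial results, which makes interactions feel faster. To stream the response, use streamResponse(to:) instead of respond(to:).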

// imports
// initialization of Gemini API backend service and a `geminiLanguageModel`

// Provide a prompt that contains text.
let prompt = "Write a story about a magic backpack."

// Create a session by injecting the model into Apple's `LanguageModelSession`.
// For a single-turn interaction, create a new session each time you call the model.
let session = LanguageModelSession(model: model)

// Generate a text response to the prompt.
// To stream the response, use `streamResponse(to:)` instead of `respond(to:)`
let stream = session.streamResponse(to: prompt)
var response = ""
for try await snapshot in stream {
  // The snapshot contains *all* content generated so far.
  response = snapshot.content
}
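
Because each snapshot carries all of the text generated so far, a UI that wants to append only the newly arrived characters has to compare consecutive snapshots. The following is a minimal sketch in plain Swift that does not depend on either SDK. `newText(previous:current:)` is a hypothetical helper, and the hard-coded `snapshots` array is an assumption that stands in for the `snapshot.content` values a real stream would yield.

```swift
/// Hypothetical helper: returns the part of `current` that was not in `previous`.
/// Falls back to the whole of `current` if it does not extend `previous`.
func newText(previous: String, current: String) -> String {
  guard current.hasPrefix(previous) else { return current }
  return String(current.dropFirst(previous.count))
}

// Stand-in for the cumulative `snapshot.content` values of a real stream.
let snapshots = ["Once", "Once upon", "Once upon a time"]

var previous = ""
var deltas: [String] = []
for current in snapshots {
  // Keep only the text that arrived since the last snapshot.
  deltas.append(newText(previous: previous, current: current))
  previous = current
}
print(deltas)  // ["Once", " upon", " a time"]
```

In a real loop, the same helper could be called with the previous and current `snapshot.content` inside `for try await snapshot in stream`.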

Generate text during a multi-turn conversation (chat)

import FoundationModels
import FirebaseCore
import FirebaseAILogic

// Initialize the Gemini Developer API backend service.
let ai = FirebaseAI.firebaseAI(backend: .googleAI())
// Initialize a `geminiLanguageModel` with a Gemini model that supports your use case.
let model = ai.geminiLanguageModel(name: "gemini-3.5-flash")

// Create a session by injecting the model into Apple's `LanguageModelSession`.
// The session maintains state between each request.
let session = LanguageModelSession(model: model)

// Generate a text response to an initial prompt.
let response = try await session.respond(to: "Hello! I'd like to learn more about Albert Einstein.")
print(response.content)  // Example response from model: "What would you like to know?"

// Continue using the existing session. Each prompt and response is added to the transcript.
let response2 = try await session.respond(to: "When was he born?")
print(response2.content)  // Example response from model: "March 14, 1879"
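
The follow-up prompt "When was he born?" only makes sense because the session keeps the earlier turns. The toy model below sketches that idea in plain Swift. `ToySession`, `Turn`, and the `reply` closure are hypothetical names invented for illustration; this is not how `LanguageModelSession` is implemented, only a picture of a transcript that grows with every prompt and response.

```swift
// Hypothetical transcript entry: either a user prompt or a model response.
enum Turn: Equatable {
  case prompt(String)
  case response(String)
}

// Hypothetical stand-in for a session that keeps state between requests.
struct ToySession {
  private(set) var transcript: [Turn] = []
  // Stand-in for the model call; it receives the whole transcript so far.
  let reply: ([Turn]) -> String

  init(reply: @escaping ([Turn]) -> String) {
    self.reply = reply
  }

  mutating func respond(to prompt: String) -> String {
    transcript.append(.prompt(prompt))
    let answer = reply(transcript)
    transcript.append(.response(answer))
    return answer
  }
}

var toySession = ToySession { transcript in
  "Entries visible to the model: \(transcript.count)"
}
let first = toySession.respond(to: "Hello! I'd like to learn more about Albert Einstein.")
let second = toySession.respond(to: "When was he born?")
print(first)   // Entries visible to the model: 1
print(second)  // Entries visible to the model: 3
```

The second call sees three entries (first prompt, first response, second prompt), which is why a pronoun such as "he" can be resolved.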

Generate text from multimodal input (such as images)

You can prompt a Gemini model to generate text by providing both text and a file, such as an image or a PDF.

import FoundationModels
import FirebaseCore
import FirebaseAILogic

// Initialize the Gemini Developer API backend service.
let ai = FirebaseAI.firebaseAI(backend: .googleAI())
// Initialize a `geminiLanguageModel` with a Gemini model that supports your use case.
let model = ai.geminiLanguageModel(name: "gemini-3.5-flash")

// Create a session by injecting the model into Apple's `LanguageModelSession`.
// For a single-turn interaction, create a new session each time you call the model.
let session = LanguageModelSession(model: model)

let cgImage: CGImage = // ... fetch CGImage from your datasource.
let response = try await session.respond {
  "What are the dominant colors of this image, in order?"
  Attachment(cgImage)
}
print(response.content)
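
The example leaves the source of the `CGImage` open. One possible way to obtain it, sketched here under the assumption that the image lives in a local file, is Apple's Image I/O framework. `loadCGImage(from:)` and the file path are hypothetical names for illustration; they are not part of the Firebase AI Logic SDK or the Foundation Models framework.

```swift
import Foundation
import CoreGraphics
import ImageIO

/// Hypothetical helper: decodes the first image in the file at `url`.
/// Returns nil if the file is missing or is not a readable image.
func loadCGImage(from url: URL) -> CGImage? {
  guard let source = CGImageSourceCreateWithURL(url as CFURL, nil) else {
    return nil
  }
  return CGImageSourceCreateImageAtIndex(source, 0, nil)
}

// Hypothetical path; replace it with a real image location in your app.
let imageURL = URL(fileURLWithPath: "/path/to/photo.jpg")
if let loadedImage = loadCGImage(from: imageURL) {
  print("Loaded image: \(loadedImage.width) x \(loadedImage.height)")
} else {
  print("Could not load image")
}
```

Returning an optional keeps the failure case explicit, so the prompt is only built once an image is actually available.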

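Stream the response

Instead of waiting for the model to generate the complete result, you can use streaming to handle partial results, which makes interactions feel faster. To stream the response, use streamResponse instead of respond.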

// imports
// initialization of Gemini API backend service and a `geminiLanguageModel`

// Create a session by injecting the model into Apple's `LanguageModelSession`.
// For a single-turn interaction, create a new session each time you call the model.
let session = LanguageModelSession(model: model)

let cgImage: CGImage = // ... fetch CGImage from your datasource.
let stream = session.streamResponse {
  "What are the dominant colors of this image, in order?"
  Attachment(cgImage)
}

var response = ""
for try await snapshot in stream {
  // The snapshot contains *all* content generated so far.
  response = snapshot.content
}
print(response)

Generate images (using "Nano Banana" models)

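Models that support this capability

gemini-3-pro-image (also known as "Nano Banana Pro")
gemini-3.1-flash-image (also known as "Nano Banana 2")

You can prompt a Gemini image-generating model (such as a "Nano Banana" model) to generate images from text-only input.

The following example generates only an image, but Gemini image-generating models can return both images and text.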

import FoundationModels
import FirebaseCore
import FirebaseAILogic

// Initialize the Gemini Developer API backend service.
let ai = FirebaseAI.firebaseAI(backend: .googleAI())
// Initialize a `geminiLanguageModel` with a Gemini image-generating model that supports your use case.
let model = ai.geminiLanguageModel(
  name: "gemini-3.1-flash-image",
  options: GeminiGenerationOptions(responseModalities: .image)
)

let session = LanguageModelSession(model: model)
let response = try await session.respond(
  to: "Generate an image of the Eiffel tower with fireworks in the background."
)

var generatedImage: CIImage?
// Find the image in the transcriptEntries.
for entry in response.transcriptEntries {
  if case let .response(response) = entry {
    for segment in response.segments {
      if case let .attachment(attachment) = segment,
          case let .image(image) = attachment.content {
        generatedImage = image.ciImage
      }
    }
  }
}
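
The example stores its result as a `CIImage`, while many display and export APIs expect a `CGImage`. The sketch below shows one way to bridge the two with Core Image's `CIContext`. `makeCGImage(from:context:)` is a hypothetical helper, and the solid-color `sample` image is an assumption that stands in for `generatedImage`.

```swift
import CoreImage

/// Hypothetical helper: renders a `CIImage` into a `CGImage`.
/// Returns nil if the image has an empty or infinite extent, or if rendering fails.
func makeCGImage(from ciImage: CIImage, context: CIContext = CIContext()) -> CGImage? {
  let extent = ciImage.extent
  guard !extent.isInfinite, !extent.isEmpty else { return nil }
  return context.createCGImage(ciImage, from: extent)
}

// Stand-in for `generatedImage`: a 4 x 4 solid red image.
let sample = CIImage(color: .red).cropped(to: CGRect(x: 0, y: 0, width: 4, height: 4))
if let rendered = makeCGImage(from: sample) {
  print("Rendered image: \(rendered.width) x \(rendered.height)")
}
```

Creating a `CIContext` is relatively expensive, so an app that converts many images would likely create one context and pass it in through the `context` parameter.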

Generate structured JSON output

Models that support this capability

gemini-3.1-pro-preview
gemini-3.5-flash
gemini-3.1-flash-lite
gemini-3-pro-image

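By default, Gemini models return responses as unstructured text. However, some use cases require structured text, such as JSON. For example, you might pass the response to a downstream task that requires an established data schema.

You can configure the model to format its response according to a JSON schema that you provide. With the Foundation Models framework, that schema is expressed as a Swift type annotated with @Generable, such as CatProfile in the following example. To learn more about generating structured JSON output, including best practices and use cases, see the Generate structured output guide.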

import FoundationModels
import FirebaseCore
import FirebaseAILogic

@Generable(description: "Basic profile information about a cat")
struct CatProfile {
  var name: String
  @Guide(description: "The age of the cat", .range(0 ... 20))
  var age: Int
  @Guide(description: "A one sentence profile about the cat's personality")
  var profile: String
}

// Initialize the Gemini Developer API backend service.
let ai = FirebaseAI.firebaseAI(backend: .googleAI())
// Initialize a `geminiLanguageModel` with a Gemini model that supports your use case.
let model = ai.geminiLanguageModel(name: "gemini-3.5-flash")
let session = LanguageModelSession(model: model)

let response = try await session.respond(
  to: "Generate a cute rescue cat profile with an Elvish theme",
  generating: CatProfile.self
)
let cat = response.content
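
To make the "established data schema" idea concrete, the sketch below decodes a JSON object with the same three fields into a plain Swift type and re-checks the age range before passing the value on. `CatProfileRecord` and `decodeCatProfile(_:)` are hypothetical names, the type deliberately avoids the `@Generable` macro so that it runs with Foundation alone, and the JSON literal is made-up sample data, not real model output.

```swift
import Foundation

// Hypothetical plain-Swift mirror of the `CatProfile` fields (no macros).
struct CatProfileRecord: Codable, Equatable {
  var name: String
  var age: Int
  var profile: String
}

/// Hypothetical helper: decodes a JSON string into a `CatProfileRecord`.
/// Throws if a field is missing or has the wrong type.
func decodeCatProfile(_ json: String) throws -> CatProfileRecord {
  try JSONDecoder().decode(CatProfileRecord.self, from: Data(json.utf8))
}

// Made-up sample data in the shape the fields imply; not real model output.
let sampleJSON = #"{"name": "Miriel", "age": 3, "profile": "A gentle cat who naps in moonlight."}"#

do {
  let record = try decodeCatProfile(sampleJSON)
  // Re-check the documented age range before handing the value downstream.
  let ageIsValid = (0...20).contains(record.age)
  print(record.name, record.age, ageIsValid)  // Miriel 3 true
} catch {
  print("Data did not match the expected schema: \(error)")
}
```

Downstream code that receives a typed value like this can rely on every field being present, which is the main benefit of structured output over free-form text.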
