Gemini LLM Examples
This directory contains examples demonstrating how to use the Gemini API integration in the Agent SDK.
Prerequisites
Before running these examples, you need one of the following:
Option 1: Gemini API (Recommended for most users)
- Google Cloud API Key: Get your API key from Google AI Studio
- Environment Variable: Set your API key as an environment variable:
export GEMINI_API_KEY="your-api-key-here"
Option 2: Vertex AI (For Google Cloud Platform users)
- GCP Project ID: Your Google Cloud Platform project ID
- Environment Variables: Set your project configuration:
  export GEMINI_VERTEX_PROJECT_ID="your-gcp-project-id"
  export GEMINI_VERTEX_REGION="us-central1"  # Optional, defaults to us-central1 if not set
  Supported Regions: Common Vertex AI regions include:
  - us-central1 (Iowa) - Default
  - us-east4 (Virginia)
  - us-west1 (Oregon)
  - europe-west1 (Belgium)
  - europe-west4 (Netherlands)
  - asia-northeast1 (Tokyo)
  - asia-southeast1 (Singapore)
- Authentication Options (choose one, in order of precedence):
  - Service Account JSON (Raw or Base64) - Highest precedence:
    # Option A: Base64-encoded service account JSON
    export GOOGLE_APPLICATION_CREDENTIALS_JSON="$(cat service-account.json | base64)"
    # Option B: Raw JSON string (will be auto-detected)
    export GOOGLE_APPLICATION_CREDENTIALS_JSON='{"type":"service_account",...}'
  - Service Account JSON File:
    export GOOGLE_APPLICATION_CREDENTIALS="/path/to/service-account.json"
  - Application Default Credentials (ADC): Run:
    gcloud auth application-default login
Credential Priority:
1. GOOGLE_APPLICATION_CREDENTIALS_JSON (base64 or raw JSON) - takes highest precedence
2. GOOGLE_APPLICATION_CREDENTIALS (file path)
3. Application Default Credentials (ADC)
The example will automatically detect whether the JSON is base64-encoded or raw JSON and handle it accordingly.
Available Examples
Basic Usage (main.go)
Demonstrates basic text generation with the Gemini API, including:
- Simple text generation
- Structured output with JSON schemas
- Function calling with tools
- Streaming responses
- Different reasoning modes
Agent Integration (agent_integration/main.go)
Shows how to integrate Gemini with the Agent SDK framework:
- Creating agents with Gemini LLM
- Using memory and tools
- Multi-modal capabilities
Structured Output (structured_output/main.go)
Advanced example of structured output generation:
- Complex JSON schemas
- Data extraction and formatting
- Response validation
Streaming Example (streaming/main.go)
Demonstrates streaming capabilities:
- Real-time response streaming
- Tool execution with streaming
- Event handling
Multi-modal Example (multimodal/main.go)
Shows vision and audio capabilities:
- Image analysis and description
- Document processing
- Video understanding
Supported Models
The Gemini integration supports multiple models:
- gemini-2.5-pro-latest - Most capable model with vision, audio, and tool calling
- gemini-2.5-flash-latest - Fast model with vision, audio, and tool calling
- gemini-2.5-flash-lite-latest - Fastest model, text-only
- gemini-1.5-pro - Previous generation with vision and tool calling
- gemini-1.5-flash - Previous generation fast model with vision
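As a rough guide to the trade-offs above, a small helper can map a use case to a model name. The function and its case labels are illustrative, not part of the SDK:

```go
package main

import "fmt"

// pickModel maps a rough use case to one of the model names listed above.
// Adjust the mapping to your own latency and quality requirements.
func pickModel(useCase string) string {
	switch useCase {
	case "complex-reasoning":
		return "gemini-2.5-pro-latest" // most capable
	case "simple-text":
		return "gemini-2.5-flash-lite-latest" // fastest, text-only
	default:
		return "gemini-2.5-flash-latest" // good speed/quality balance
	}
}

func main() {
	fmt.Println(pickModel("complex-reasoning")) // gemini-2.5-pro-latest
	fmt.Println(pickModel("chat"))              // gemini-2.5-flash-latest
}
```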
Features
Text Generation
client, err := gemini.NewClient(ctx, gemini.WithAPIKey(apiKey), gemini.WithModel(gemini.ModelGemini25Flash))
response, err := client.Generate(ctx, "Write a haiku about AI")
Function Calling
tools := []interfaces.Tool{calculator.New(), websearch.New()}
response, err := client.GenerateWithTools(ctx, "What's 15 * 7?", tools)
Structured Output
schema := interfaces.JSONSchema{
    "type": "object",
    "properties": map[string]interface{}{
        "summary":    map[string]interface{}{"type": "string"},
        "confidence": map[string]interface{}{"type": "number"},
    },
}
response, err := client.Generate(ctx, prompt, gemini.WithResponseFormat(interfaces.ResponseFormat{
Type: interfaces.ResponseFormatJSON,
Schema: schema,
}))
Streaming
stream, err := client.GenerateStream(ctx, "Tell me a story")
for event := range stream {
switch event.Type {
case interfaces.StreamEventContentDelta:
fmt.Print(event.Content)
case interfaces.StreamEventError:
fmt.Printf("Error: %v\n", event.Error)
}
}
Reasoning Modes
// Comprehensive reasoning - detailed step-by-step explanations
response, err := client.Generate(ctx, "Solve this math problem: 2x + 5 = 13",
gemini.WithReasoning("comprehensive"))
// Minimal reasoning - brief explanations
response, err := client.Generate(ctx, prompt, gemini.WithReasoning("minimal"))
// No reasoning - direct answers only
response, err := client.Generate(ctx, prompt, gemini.WithReasoning("none"))
Running Examples
- Set your API key:
  export GEMINI_API_KEY="your-api-key-here"
- Run the basic example:
  cd examples/llm/gemini
  go run main.go
- Run specific examples:
  go run agent_integration/main.go
  go run structured_output/main.go
  go run streaming/main.go
Configuration Options
The Gemini client supports various configuration options:
Gemini API Configuration
client, err := gemini.NewClient(ctx,
gemini.WithAPIKey("your-api-key"),
gemini.WithBackend(genai.BackendGeminiAPI),
gemini.WithModel(gemini.ModelGemini25Flash),
gemini.WithLogger(logger),
gemini.WithRetry(retry.WithMaxRetries(3)),
)
Vertex AI Configuration
// Using Vertex AI with explicit region configuration
client, err := gemini.NewClient(ctx,
gemini.WithBackend(genai.BackendVertexAI),
gemini.WithProjectID("your-gcp-project"),
gemini.WithLocation("us-central1"), // Specify the GCP region
gemini.WithCredentialsJSON(credentialsBytes), // Takes precedence over file
gemini.WithCredentialsFile("/path/to/service-account.json"),
gemini.WithModel(gemini.ModelGemini25Flash),
)
Region Configuration: The WithLocation() option sets the GCP region for Vertex AI. If not specified, it defaults to us-central1. You can set this via the GEMINI_VERTEX_REGION environment variable in the examples.
Credential Precedence (Vertex AI only)
When using Vertex AI, you can provide credentials in multiple ways. If both are provided:
- JSON credentials take precedence over file credentials
- A warning will be logged when both are provided
- The file path will be ignored in favor of JSON
// This configuration will use JSON credentials and log a warning
client, err := gemini.NewClient(ctx,
gemini.WithBackend(genai.BackendVertexAI),
gemini.WithProjectID("project-id"),
gemini.WithCredentialsFile("/ignored/file.json"), // Ignored
gemini.WithCredentialsJSON(jsonCredentials), // Used
)
Error Handling
The client includes comprehensive error handling:
- Network timeouts and retries
- API rate limiting
- Invalid request formatting
- Content filtering and safety
Safety and Content Filtering
Gemini includes built-in safety filtering. You can configure safety settings:
// Default safety settings are applied automatically
// Blocks medium and high risk content for:
// - Harassment
// - Hate speech
// - Sexually explicit content
// - Dangerous content
Best Practices
- API Key Security: Never hardcode API keys. Use environment variables or secure key management.
- Model Selection: Choose the right model for your use case:
  - Use Flash models for speed
  - Use Pro models for complex reasoning
  - Use Lite models for simple text tasks
- Error Handling: Always handle errors appropriately and implement retry logic for production use.
- Rate Limiting: Be aware of API rate limits and implement appropriate backoff strategies.
- Content Filtering: Understand Gemini's content filtering and safety features.
- Token Management: Monitor token usage, especially with large context models.
Troubleshooting
Common Issues
- Authentication Error: Verify your API key is correct and has proper permissions.
- Model Not Found: Ensure you're using a supported model name.
- Rate Limiting: Implement exponential backoff for rate limit errors.
- Content Filtered: Check if your content triggers safety filters.
- Network Timeouts: Configure appropriate timeout values for your use case.
Debug Logging
Enable debug logging to troubleshoot issues:
logger := logging.New()
client, err := gemini.NewClient(ctx, gemini.WithAPIKey(apiKey), gemini.WithLogger(logger))
Support
For issues specific to the Agent SDK Gemini integration, please check the Agent SDK repository and its issue tracker. For Gemini API-specific questions, refer to Google's official Gemini API documentation.