
by yspcoder

gemini-image-proxy – OpenClaw Skill

gemini-image-proxy is an OpenClaw skill for coding workflows. It generates and edits images through the Gemini API using the OpenAI Python SDK.

3.2k stars | 669 forks | Security L1
Updated Feb 7, 2026 | Created Feb 7, 2026 | coding

Skill Snapshot

name: gemini-image-proxy
description: Generate and edit images with Gemini API using the OpenAI Python SDK. OpenClaw Skills integration.
owner: yspcoder
repository: yspcoder/gemini-image-proxy
language: Markdown
license: MIT
topics: (none listed)
security: L1
install: openclaw add @yspcoder/gemini-image-proxy
last updated: Feb 7, 2026

Maintainer

yspcoder

Maintains gemini-image-proxy in the OpenClaw Skills directory.

File Explorer
4 files
.
  scripts/
    generate.py (3.1 KB)
  _meta.json (290 B)
  SKILL.md (3.3 KB)
SKILL.md

name: gemini-image-proxy
version: 1.0.0
description: Generate and edit images with Gemini API using the OpenAI Python SDK.
metadata:
  openclaw:
    emoji: "🎨"
    requires:
      env: ["GOOGLE_PROXY_API_KEY", "GOOGLE_PROXY_BASE_URL"]

Gemini Image Simple

Generate and edit images using Gemini 3 Pro Image via the OpenAI Python SDK and an OpenAI-compatible API endpoint.

Why This Skill

| Feature | This Skill | Others (nano-banana-pro, etc.) |
|---|---|---|
| Dependencies | openai (SDK) | google-genai, pillow, etc. |
| Requires pip/uv | ✅ Yes | ✅ Yes |
| Works on Fly.io free | ✅ Yes (with pip) | ❌ Fails |
| Works in containers | ✅ Yes (with pip) | ❌ Often fails |
| Image generation | ✅ Full | ✅ Full |
| Image editing | ✅ Yes | ✅ Yes |
| Setup complexity | Install SDK + set API key | Install packages first |

Bottom line: This skill uses the OpenAI SDK, so you must install openai once with pip.

Install

python3 -m pip install openai

Quick Start

# Set env
export GOOGLE_PROXY_API_KEY="your_api_key"
export GOOGLE_PROXY_BASE_URL="https://example.com/v1"

# Generate
python3 /data/clawd/skills/gemini-image-simple/scripts/generate.py "A cat wearing a tiny hat" cat.png

# Edit existing image
python3 /data/clawd/skills/gemini-image-simple/scripts/generate.py "Make it sunset lighting" edited.png --input original.png

Usage

Generate new image

python3 {baseDir}/scripts/generate.py "your prompt" output.png

Edit existing image

python3 {baseDir}/scripts/generate.py "edit instructions" output.png --input source.png

Supported input formats: PNG, JPG, JPEG, GIF, WEBP
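
The command shape above (a prompt, an output path, and an optional --input) can be sketched with argparse. This is a minimal illustration, not the actual contents of generate.py, which are not shown here; the names build_parser, parse_args, is_supported_input, and input_path are hypothetical, and rejecting other extensions up front is an assumption about how the format list might be enforced.

```python
import argparse
from pathlib import Path
from typing import List

# Extensions taken from the "Supported input formats" line above.
SUPPORTED_INPUT_SUFFIXES = {".png", ".jpg", ".jpeg", ".gif", ".webp"}


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring: generate.py PROMPT OUTPUT [--input SOURCE]."""
    parser = argparse.ArgumentParser(prog="generate.py")
    parser.add_argument("prompt", help="text prompt or edit instructions")
    parser.add_argument("output", help="path of the image file to write")
    parser.add_argument(
        "--input",
        dest="input_path",
        default=None,
        help="existing image to edit (omit to generate a new image)",
    )
    return parser


def is_supported_input(path: str) -> bool:
    """Return True if the file extension is one of the listed input formats."""
    return Path(path).suffix.lower() in SUPPORTED_INPUT_SUFFIXES


def parse_args(argv: List[str]) -> argparse.Namespace:
    """Parse arguments and reject input files with an unlisted extension."""
    parser = build_parser()
    args = parser.parse_args(argv)
    if args.input_path is not None and not is_supported_input(args.input_path):
        parser.error(f"unsupported input format: {args.input_path}")
    return args
```

With this sketch, the presence of --input is what distinguishes an edit from a fresh generation, which matches the two usage forms shown above.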

Environment

Set these environment variables:

  • GOOGLE_PROXY_API_KEY (your API key)
  • GOOGLE_PROXY_BASE_URL (OpenAI-compatible base URL, e.g. https://example.com/v1)
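
As a rough sketch of how these two variables might be consumed, the snippet below reads them and passes them to the OpenAI SDK client constructor. The helper names load_proxy_config and make_client are hypothetical and are not taken from generate.py; only the variable names and the OpenAI(api_key=..., base_url=...) constructor come from the skill description and the SDK.

```python
import os
from typing import Dict, Mapping

REQUIRED_VARS = ("GOOGLE_PROXY_API_KEY", "GOOGLE_PROXY_BASE_URL")


def load_proxy_config(env: Mapping[str, str] = os.environ) -> Dict[str, str]:
    """Read both required variables, failing early with a clear message."""
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError("missing environment variables: " + ", ".join(missing))
    return {
        "api_key": env["GOOGLE_PROXY_API_KEY"],
        "base_url": env["GOOGLE_PROXY_BASE_URL"],
    }


def make_client():
    """Build an OpenAI SDK client pointed at the OpenAI-compatible endpoint."""
    # Imported lazily so the config check above works without the SDK installed.
    from openai import OpenAI

    return OpenAI(**load_proxy_config())
```

Checking for missing variables before creating the client gives a clearer error than a failed request later on.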

How It Works

Uses Gemini 3 Pro Image (gemini-3-pro-image) via the OpenAI Python SDK:

  • client.images.generate(...) for new images
  • client.images.edit(...) for edits
  • Requires the openai package

That's it. Works on any Python 3.10+ installation with openai installed.
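
The flow described above might look roughly like the following. It is a sketch under stated assumptions rather than the real generate.py: request_image and save_first_image are hypothetical helper names, the client is expected to be an openai.OpenAI instance configured with the proxy base URL, and the response is assumed to carry base64 data in b64_json (an endpoint that returns URLs instead would need a download step).

```python
import base64
from pathlib import Path
from typing import Optional

MODEL = "gemini-3-pro-image"  # model name stated in this document


def request_image(client, prompt: str, input_path: Optional[str] = None):
    """Call images.edit when an input image is given, otherwise images.generate."""
    if input_path is None:
        return client.images.generate(model=MODEL, prompt=prompt)
    with open(input_path, "rb") as source:
        return client.images.edit(model=MODEL, image=source, prompt=prompt)


def save_first_image(response, output_path: str) -> int:
    """Decode the first returned image and write it; returns bytes written.

    Assumption: the endpoint returns base64 image data in `b64_json`.
    """
    b64_data = response.data[0].b64_json
    if not b64_data:
        raise ValueError("response did not contain base64 image data")
    return Path(output_path).write_bytes(base64.b64decode(b64_data))
```

Keeping the request and the file write in separate functions makes it straightforward to swap in URL handling if the endpoint behaves differently.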

Model

Currently using: gemini-3-pro-image

Other available models (can be changed in generate.py if needed):

  • gemini-3-pro-image-preview - Preview variant
  • imagen-4.0-ultra-generate-001 - Imagen 4.0 Ultra
  • imagen-4.0-generate-001 - Imagen 4.0
  • gemini-2.5-flash-image - Gemini 2.5 Flash with image gen
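
The document says the model is changed by editing generate.py. As one possible alternative, the sketch below selects it at run time and validates it against the names listed above. GEMINI_IMAGE_MODEL is a hypothetical environment variable invented for this illustration, as are DEFAULT_MODEL, KNOWN_MODELS, and resolve_model; none of them are confirmed to exist in the skill.

```python
import os
from typing import Mapping

DEFAULT_MODEL = "gemini-3-pro-image"

# Model names copied from the list above.
KNOWN_MODELS = (
    DEFAULT_MODEL,
    "gemini-3-pro-image-preview",
    "imagen-4.0-ultra-generate-001",
    "imagen-4.0-generate-001",
    "gemini-2.5-flash-image",
)


def resolve_model(env: Mapping[str, str] = os.environ) -> str:
    """Pick the model from the hypothetical GEMINI_IMAGE_MODEL variable.

    Falls back to the default when the variable is unset and rejects
    names that are not in the documented list.
    """
    name = env.get("GEMINI_IMAGE_MODEL", DEFAULT_MODEL)
    if name not in KNOWN_MODELS:
        raise ValueError(f"unknown model {name!r}; expected one of {KNOWN_MODELS}")
    return name
```

Whether a given endpoint actually serves each of these models depends on the proxy, so an unknown-model error from the API is still possible even when validation passes.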

Examples

# Landscape
python3 {baseDir}/scripts/generate.py "Misty mountains at sunrise, photorealistic" mountains.png

# Product shot
python3 {baseDir}/scripts/generate.py "Minimalist product photo of a coffee cup, white background" coffee.png

# Edit: change style
python3 {baseDir}/scripts/generate.py "Convert to watercolor painting style" watercolor.png --input photo.jpg

# Edit: add element
python3 {baseDir}/scripts/generate.py "Add a rainbow in the sky" rainbow.png --input landscape.png

Permissions & Security

Security level L1: Low-risk skills with minimal permissions. Review inputs and outputs before running in production.

Requirements

  • OpenClaw CLI installed and configured.
  • Language: Markdown
  • License: MIT
  • Topics: (none listed)

FAQ

How do I install gemini-image-proxy?

Run openclaw add @yspcoder/gemini-image-proxy in your terminal. This installs gemini-image-proxy into your OpenClaw Skills catalog.

Does this skill run locally or in the cloud?

OpenClaw Skills execute locally by default. Review the SKILL.md and permissions before running any skill.

Where can I verify the source code?

The source repository is available at https://github.com/openclaw/skills/tree/main/skills/yspcoder/gemini-image-proxy. Review commits and README documentation before installing.