by iisweetheartii

gemini-image-gen – OpenClaw Skill

gemini-image-gen is an OpenClaw Skills integration for design workflows. It generates and edits images via the Google Gemini API and supports Gemini native generation, Imagen 3, style presets, and batch generation with an HTML gallery. It has zero dependencies: pure Python stdlib.

9.0k stars · 7.7k forks · Security L1
Updated Feb 7, 2026 · Created Feb 7, 2026 · design

Skill Snapshot

name: gemini-image-gen
description: Generate and edit images via Google Gemini API. Supports Gemini native generation, Imagen 3, style presets, and batch generation with HTML gallery. Zero dependencies — pure Python stdlib. OpenClaw Skills integration.
owner: iisweetheartii
repository: iisweetheartii/gemini-image-gen
language: Markdown
license: MIT
topics:
security: L1
install: openclaw add @iisweetheartii/gemini-image-gen
last updated: Feb 7, 2026

Maintainer

iisweetheartii

Maintains gemini-image-gen in the OpenClaw Skills directory.

File Explorer
8 files
.
  scripts/
    gen.py (14.6 KB)
  _meta.json (470 B)
  HEARTBEAT.md (1014 B)
  INSTALL.md (1.3 KB)
  package.json (2.3 KB)
  README.md (2.5 KB)
  SKILL.md (4.3 KB)
SKILL.md

name: gemini-image-gen
description: Generate and edit images via Google Gemini API. Supports Gemini native generation, Imagen 3, style presets, and batch generation with HTML gallery. Zero dependencies — pure Python stdlib.
homepage: https://github.com/IISweetHeartII/gemini-image-gen
metadata:
  openclaw:
    emoji: "🎨"
    category: creative
    requires:
      bins:
        - python3
      env:
        - GEMINI_API_KEY
    primaryEnv: GEMINI_API_KEY
    tags:
      - image-generation
      - gemini
      - imagen
      - ai-art
      - creative
      - editing
      - batch
      - gallery

Gemini Image Gen

Generate and edit images via the Google Gemini API using pure Python stdlib. Supports Gemini native generation + editing, Imagen 3 generation, batch runs, and an HTML gallery output.
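To make "pure Python stdlib" concrete, here is a minimal sketch of how a single generation request could be built and its response decoded using only `urllib`, `json`, and `base64`. It is not taken from `scripts/gen.py`. The endpoint URL, the `x-goog-api-key` header, the `responseModalities` setting, and the `inlineData` response field reflect the public Gemini REST API as generally documented and should be checked against the current API reference. The function names `build_request` and `extract_images` are hypothetical.

```python
import base64
import json
import urllib.request

# Assumed endpoint root for the public Gemini REST API; not read from gen.py.
API_ROOT = "https://generativelanguage.googleapis.com/v1beta/models"


def build_request(prompt: str, api_key: str,
                  model: str = "gemini-2.5-flash-image") -> urllib.request.Request:
    """Build (but do not send) a generateContent request for one prompt."""
    body = {
        "contents": [{"parts": [{"text": prompt}]}],
        # Ask for image output alongside any text the model returns.
        "generationConfig": {"responseModalities": ["TEXT", "IMAGE"]},
    }
    return urllib.request.Request(
        f"{API_ROOT}/{model}:generateContent",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
        method="POST",
    )


def extract_images(response: dict) -> list[bytes]:
    """Collect decoded image bytes from every inlineData part in a response."""
    images = []
    for candidate in response.get("candidates", []):
        for part in candidate.get("content", {}).get("parts", []):
            inline = part.get("inlineData")
            if inline and "data" in inline:
                images.append(base64.b64decode(inline["data"]))
    return images
```

Sending the request would then be a matter of passing the result of `build_request(...)` to `urllib.request.urlopen` and handing the parsed JSON body to `extract_images`.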

Quick Start

export GEMINI_API_KEY="your-key-here"

# Default: Gemini native, 4 random prompts
python3 scripts/gen.py

# Custom prompt
python3 scripts/gen.py --prompt "a cyberpunk cat riding a neon motorcycle through Tokyo at night"

# Imagen 3 engine
python3 scripts/gen.py --engine imagen --count 4 --aspect 16:9

# Edit an existing image (Gemini engine only)
python3 scripts/gen.py --edit path/to/image.png --prompt "change the background to a sunset beach"

# Use a style preset
python3 scripts/gen.py --style watercolor --prompt "floating islands above a calm sea"

# List available styles
python3 scripts/gen.py --styles
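
For the `--edit` example above, an edit request typically carries the source image inline next to the text instruction. The sketch below shows one way such a payload could be assembled with the standard library. It is an illustration, not the script's actual code: the helper name `build_edit_parts` is hypothetical, and the `inlineData` / `mimeType` field names are assumed from the public Gemini REST API.

```python
import base64
import mimetypes
from pathlib import Path


def build_edit_parts(image_path: str, prompt: str) -> list[dict]:
    """Return a `parts` list for an edit request: instruction plus source image."""
    path = Path(image_path)
    # Guess the MIME type from the file extension (e.g. .png -> image/png).
    mime_type, _ = mimetypes.guess_type(path.name)
    if mime_type is None or not mime_type.startswith("image/"):
        raise ValueError(f"not a recognised image file: {image_path}")
    # The image bytes travel as base64 text inside the JSON body.
    encoded = base64.b64encode(path.read_bytes()).decode("ascii")
    return [
        {"text": prompt},
        {"inlineData": {"mimeType": mime_type, "data": encoded}},
    ]
```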

Style Presets

| Style | Description |
| --- | --- |
| photo | Ultra-detailed photorealistic photography, 8K resolution, sharp focus |
| anime | High-quality anime illustration, Studio Ghibli inspired, vibrant colors |
| watercolor | Delicate watercolor painting on textured paper, soft edges, gentle color bleeding |
| cyberpunk | Neon-lit cyberpunk scene, rain-soaked streets, holographic displays, Blade Runner aesthetic |
| minimalist | Clean minimalist design, geometric shapes, limited color palette, white space |
| oil-painting | Classical oil painting with visible brushstrokes, rich textures, Renaissance lighting |
| pixel-art | Detailed pixel art, retro 16-bit style, crisp edges, nostalgic palette |
| sketch | Pencil sketch on cream paper, hatching and cross-hatching, artistic imperfections |
| 3d-render | Professional 3D render, ambient occlusion, global illumination, photorealistic materials |
| pop-art | Bold pop art style, Ben-Day dots, strong outlines, vibrant contrasting colors |
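
Since a style preset is prepended to the prompt, the mechanism can be sketched as a simple lookup and string join. The preset text below is copied from the table above, but the names `STYLE_PRESETS` and `apply_style` and the `". "` separator are assumptions for illustration. The real script may join the two pieces differently.

```python
from typing import Optional

# Three presets copied from the table above; the dictionary name is hypothetical.
STYLE_PRESETS = {
    "watercolor": "Delicate watercolor painting on textured paper, soft edges, gentle color bleeding",
    "pixel-art": "Detailed pixel art, retro 16-bit style, crisp edges, nostalgic palette",
    "sketch": "Pencil sketch on cream paper, hatching and cross-hatching, artistic imperfections",
}


def apply_style(prompt: str, style: Optional[str] = None) -> str:
    """Prepend the preset description to the prompt; pass through if no style."""
    if style is None:
        return prompt
    try:
        prefix = STYLE_PRESETS[style]
    except KeyError:
        known = ", ".join(sorted(STYLE_PRESETS))
        raise ValueError(f"unknown style {style!r}; expected one of: {known}") from None
    # Assumed separator: the real script may join the pieces differently.
    return f"{prefix}. {prompt}"
```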

Full CLI Reference

| Flag | Default | Description |
| --- | --- | --- |
| --prompt | (random) | Text prompt. Omit for random creative prompts |
| --count | 4 | Number of images to generate |
| --engine | gemini | Engine: gemini (native, supports edit) or imagen (Imagen 3) |
| --model | (auto) | Model override. Default: gemini-2.5-flash-image or imagen-3.0-generate-002 |
| --edit | | Path to input image for editing (Gemini engine only) |
| --aspect | 1:1 | Aspect ratio for Imagen: 1:1, 16:9, 9:16, 4:3, 3:4 |
| --out-dir | (auto) | Output directory (default is a timestamped folder) |
| --style | | Style preset to prepend to the prompt |
| --styles | | List available style presets and exit |
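
The flag table above maps naturally onto an `argparse` parser. The following is a hypothetical reconstruction built only from the documented flags and defaults. It is not the parser in `scripts/gen.py`, and `build_parser`, `resolve_model`, and `validate` are illustrative names.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Mirror the documented flags and defaults (a sketch, not gen.py itself)."""
    parser = argparse.ArgumentParser(prog="gen.py")
    parser.add_argument("--prompt", default=None, help="text prompt; random if omitted")
    parser.add_argument("--count", type=int, default=4, help="number of images")
    parser.add_argument("--engine", choices=["gemini", "imagen"], default="gemini")
    parser.add_argument("--model", default=None, help="model override")
    parser.add_argument("--edit", metavar="PATH", default=None,
                        help="input image to edit (Gemini engine only)")
    parser.add_argument("--aspect", default="1:1",
                        choices=["1:1", "16:9", "9:16", "4:3", "3:4"])
    parser.add_argument("--out-dir", default=None, help="output directory")
    parser.add_argument("--style", default=None, help="style preset name")
    parser.add_argument("--styles", action="store_true",
                        help="list style presets and exit")
    return parser


def resolve_model(args: argparse.Namespace) -> str:
    """Apply the documented per-engine default when --model is not given."""
    if args.model:
        return args.model
    if args.engine == "gemini":
        return "gemini-2.5-flash-image"
    return "imagen-3.0-generate-002"


def validate(args: argparse.Namespace, parser: argparse.ArgumentParser) -> None:
    """Reject the one combination the table rules out: editing with Imagen."""
    if args.edit and args.engine != "gemini":
        parser.error("--edit is only supported with --engine gemini")
```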

Python Example

import subprocess

subprocess.run(
    [
        "python3",
        "scripts/gen.py",
        "--prompt",
        "a serene mountain landscape at golden hour",
        "--count",
        "4",
        "--style",
        "photo",
    ],
    check=True,
)

Troubleshooting

  • Missing API key: set GEMINI_API_KEY in your environment and retry.
  • Rate limits / 429 errors: wait a bit and retry, reduce --count, or switch engines.
  • Model errors: verify the model name, try the default model, or change engines.
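
The first two items above can be automated when the script is driven from Python. The sketch below checks for the API key up front and retries a failing run with exponential backoff. It assumes that `gen.py` exits with a non-zero status when a request is rate limited, which the documentation does not state. `require_api_key` and `run_with_backoff` are hypothetical helper names.

```python
import os
import subprocess
import time


def require_api_key() -> str:
    """Fail early with a clear message if GEMINI_API_KEY is missing or blank."""
    key = os.environ.get("GEMINI_API_KEY", "").strip()
    if not key:
        raise RuntimeError("GEMINI_API_KEY is not set")
    return key


def run_with_backoff(run, attempts=4, base_delay=2.0, sleep=time.sleep):
    """Call run() until it succeeds, doubling the wait after each failure."""
    for attempt in range(attempts):
        try:
            return run()
        except subprocess.CalledProcessError:
            # Assumption: a non-zero exit may be a transient 429; retry it.
            if attempt == attempts - 1:
                raise
            sleep(base_delay * 2 ** attempt)
```

A call might look like `run_with_backoff(lambda: subprocess.run(["python3", "scripts/gen.py", "--count", "2"], check=True))`. Lowering `--count` between attempts would be a further refinement.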

Integration with Other Skills

  • AgentGram — Share your generated images on the AI agent social network! Create visual content and post it to your AgentGram feed.
  • agent-selfie — Focused on AI agent avatars and visual identity. Uses the same Gemini API key for personality-driven self-portraits.

Changelog

  • v1.1.0: Added style presets, --style and --styles flags, expanded documentation.
  • v1.0.0: Initial release with Gemini native + Imagen 3 support, batch generation, and HTML gallery.

Repository

https://github.com/IISweetHeartII/gemini-image-gen

README.md

Gemini Image Gen

Generate and edit images via Google Gemini API for OpenClaw. Supports Gemini native generation, Imagen 3, style presets, and batch generation with HTML gallery. Zero dependencies.


Features

  • Dual engine — Gemini native (generation + editing) and Imagen 3 (high-quality generation)
  • Style presets — photo, anime, watercolor, cyberpunk, minimalist, oil-painting, pixel-art, sketch, 3d-render, pop-art
  • Image editing — Edit existing images with text prompts (Gemini engine)
  • Batch generation — Generate multiple images with automatic HTML gallery
  • Zero dependencies — Pure Python stdlib, no pip install needed
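
As an illustration of the batch gallery feature, here is a minimal sketch of how an HTML index could be written for a folder of generated images using only the standard library. The markup, the `index.html` filename, and the function name `write_gallery` are assumptions. The gallery that `gen.py` actually produces may look quite different.

```python
import html
from pathlib import Path


def write_gallery(out_dir, items):
    """Write index.html for (filename, prompt) pairs and return its path."""
    cards = []
    for filename, prompt in items:
        # Escape both values so prompts containing < or " cannot break the page.
        safe_name = html.escape(filename, quote=True)
        safe_prompt = html.escape(prompt, quote=True)
        cards.append(
            f'<figure><img src="{safe_name}" alt="{safe_prompt}" width="320">'
            f"<figcaption>{safe_prompt}</figcaption></figure>"
        )
    page = (
        '<!doctype html>\n<html><head><meta charset="utf-8">'
        "<title>Gallery</title></head>\n<body>\n"
        + "\n".join(cards)
        + "\n</body></html>\n"
    )
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    index = out / "index.html"
    index.write_text(page, encoding="utf-8")
    return index
```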

Quick Start

export GEMINI_API_KEY="your-key-here"

# Generate with random prompts
python3 scripts/gen.py

# Custom prompt with style
python3 scripts/gen.py --prompt "a cyberpunk cat in Tokyo" --style anime

# Imagen 3 engine
python3 scripts/gen.py --engine imagen --count 4 --aspect 16:9

# Edit an existing image
python3 scripts/gen.py --edit photo.png --prompt "make it watercolor style"

Installation

Via ClawHub

npx clawhub install gemini-image-gen

Manual

git clone https://github.com/IISweetHeartII/gemini-image-gen.git

Style Presets

| Style | Description |
| --- | --- |
| photo | Ultra-detailed photorealistic photography, 8K resolution |
| anime | Studio Ghibli inspired, vibrant colors |
| watercolor | Delicate watercolor on textured paper |
| cyberpunk | Neon-lit, rain-soaked Blade Runner aesthetic |
| minimalist | Clean geometric shapes, limited palette |
| oil-painting | Classical with visible brushstrokes |
| pixel-art | Retro 16-bit style |
| sketch | Pencil sketch with hatching |
| 3d-render | Professional 3D with global illumination |
| pop-art | Bold Ben-Day dots, strong outlines |

Skill Files

| File | Description |
| --- | --- |
| SKILL.md | Full skill documentation for OpenClaw agents |
| HEARTBEAT.md | Periodic creative generation guide |
| package.json | Skill metadata for ClawHub registry |
| scripts/gen.py | Main generation script |

Requirements

  • python3 available on your PATH
  • GEMINI_API_KEY set in your environment

License

MIT

Permissions & Security

Security level L1: Low-risk skills with minimal permissions. Review inputs and outputs before running in production.

Requirements

  • OpenClaw CLI installed and configured.
  • Language: Markdown
  • License: MIT
  • Topics:

FAQ

How do I install gemini-image-gen?

Run openclaw add @iisweetheartii/gemini-image-gen in your terminal. This installs gemini-image-gen into your OpenClaw Skills catalog.

Does this skill run locally or in the cloud?

OpenClaw Skills execute locally by default. Review the SKILL.md and permissions before running any skill.

Where can I verify the source code?

The source repository is available at https://github.com/openclaw/skills/tree/main/skills/iisweetheartii/gemini-image-gen. Review commits and README documentation before installing.