skills$openclaw/voice-agent
ricardotrevisan6.7k

by ricardotrevisan

voice-agent – OpenClaw Skill

voice-agent is an OpenClaw Skills integration for coding workflows. Local Voice Input/Output for Agents using the AI Voice Agent API.

6.7k stars6.1k forksSecurity L1
Updated Feb 7, 2026Created Feb 7, 2026coding

Skill Snapshot

namevoice-agent
descriptionLocal Voice Input/Output for Agents using the AI Voice Agent API. OpenClaw Skills integration.
ownerricardotrevisan
repositoryricardotrevisan/voice-agent
languageMarkdown
licenseMIT
topics
securityL1
installopenclaw add @ricardotrevisan/voice-agent
last updatedFeb 7, 2026

Maintainer

ricardotrevisan

ricardotrevisan

Maintains voice-agent in the OpenClaw Skills directory.

View GitHub profile
File Explorer
6 files
.
scripts
client.py
5.0 KB
start.sh
2.5 KB
_meta.json
461 B
CHANGELOG.md
502 B
SKILL.md
1.7 KB
SKILL.md

name: voice-agent display-name: AI Voice Agent Backend version: 1.0.0 description: Local Voice Input/Output for Agents using the AI Voice Agent API. author: trevisanricardo homepage: https://github.com/ricardotrevisan/ai-conversational-skill user-invocable: true disable-model-invocation: false

Voice Agent

This skill allows you to speak and listen to the user using the local Voice Agent API.

Behavior Guidelines

  • Audio First: When the user communicates via audio (files), your PRIMARY mode of response is Audio File.
  • Silent Delivery: When sending an audio response, DO NOT send a text explanation like "I sent an audio". Just send the audio file.
  • Workflow:
    1. User sends audio.
    2. You usage transcribe to read it.
    3. You think of a response.
    4. You usage synthesize to generate the audio file.
    5. You send the file.
    6. STOP. Do not add text commentary.

Tools

Transcribe File

To transcribe an audio file (Speech-to-Text), run the client script with the transcribe command.

python3 {baseDir}/scripts/client.py transcribe "/path/to/audio/file.ogg"

Synthesize to File

To generate audio from text and save it to a file (Text-to-Speech), run the client script with the synthesize command.

python3 {baseDir}/scripts/client.py synthesize "Text to speak" --output "/path/to/output.mp3"

Health Check

To check if the voice agent API is running and healthy:

python3 {baseDir}/scripts/client.py health

Service Management

If the Health Check fails or you receive a connection error, the service may be stopped. You can attempt to start it by running:

{baseDir}/scripts/start.sh
README.md

No README available.

Permissions & Security

Security level L1: Low-risk skills with minimal permissions. Review inputs and outputs before running in production.

Requirements

  • OpenClaw CLI installed and configured.
  • Language: Markdown
  • License: MIT
  • Topics:

FAQ

How do I install voice-agent?

Run openclaw add @ricardotrevisan/voice-agent in your terminal. This installs voice-agent into your OpenClaw Skills catalog.

Does this skill run locally or in the cloud?

OpenClaw Skills execute locally by default. Review the SKILL.md and permissions before running any skill.

Where can I verify the source code?

The source repository is available at https://github.com/openclaw/skills/tree/main/skills/ricardotrevisan/voice-agent. Review commits and README documentation before installing.