skills$openclaw/voice-agent

6.7k★

voice-agent – OpenClaw Skill

Name: voice-agent
Author: ricardotrevisan

voice-agent is an OpenClaw Skills integration for coding workflows. Local Voice Input/Output for Agents using the AI Voice Agent API.

6.7k stars6.1k forksSecurity L1

Updated Feb 7, 2026Created Feb 7, 2026coding

Skill Snapshot

name	voice-agent
description	Local Voice Input/Output for Agents using the AI Voice Agent API. OpenClaw Skills integration.
owner	ricardotrevisan
repository	ricardotrevisan/voice-agent
language	Markdown
license	MIT
topics
security	L1
install	openclaw add @ricardotrevisan/voice-agent
last updated	Feb 7, 2026

Maintainer

ricardotrevisan

Maintains voice-agent in the OpenClaw Skills directory.

View GitHub profile

File Explorer

6 files

scripts

client.py

5.0 KB

start.sh

2.5 KB

_meta.json

461 B

CHANGELOG.md

502 B

SKILL.md

1.7 KB

SKILL.md

name: voice-agent display-name: AI Voice Agent Backend version: 1.0.0 description: Local Voice Input/Output for Agents using the AI Voice Agent API. author: trevisanricardo homepage: https://github.com/ricardotrevisan/ai-conversational-skill user-invocable: true disable-model-invocation: false

Voice Agent

This skill allows you to speak and listen to the user using the local Voice Agent API.

Behavior Guidelines

Audio First: When the user communicates via audio (files), your PRIMARY mode of response is Audio File.
Silent Delivery: When sending an audio response, DO NOT send a text explanation like "I sent an audio". Just send the audio file.
Workflow:
1. User sends audio.
2. You usage transcribe to read it.
3. You think of a response.
4. You usage synthesize to generate the audio file.
5. You send the file.
6. STOP. Do not add text commentary.

Tools

Transcribe File

To transcribe an audio file (Speech-to-Text), run the client script with the transcribe command.

python3 {baseDir}/scripts/client.py transcribe "/path/to/audio/file.ogg"

Synthesize to File

To generate audio from text and save it to a file (Text-to-Speech), run the client script with the synthesize command.

python3 {baseDir}/scripts/client.py synthesize "Text to speak" --output "/path/to/output.mp3"

Health Check

To check if the voice agent API is running and healthy:

python3 {baseDir}/scripts/client.py health

Service Management

If the Health Check fails or you receive a connection error, the service may be stopped. You can attempt to start it by running:

{baseDir}/scripts/start.sh

README.md

No README available.

Permissions & Security

Security level L1: Low-risk skills with minimal permissions. Review inputs and outputs before running in production.

Requirements

OpenClaw CLI installed and configured.
Language: Markdown
License: MIT
Topics:

FAQ

How do I install voice-agent?

Run openclaw add @ricardotrevisan/voice-agent in your terminal. This installs voice-agent into your OpenClaw Skills catalog.

Does this skill run locally or in the cloud?

OpenClaw Skills execute locally by default. Review the SKILL.md and permissions before running any skill.

Where can I verify the source code?

The source repository is available at https://github.com/openclaw/skills/tree/main/skills/ricardotrevisan/voice-agent. Review commits and README documentation before installing.