
by itsfabioroma

transcribee – OpenClaw Skill

transcribee is an OpenClaw Skills integration for coding workflows. Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.

9.7k stars · 9.1k forks · Security L1
Updated Feb 7, 2026 · Created Feb 7, 2026 · coding

Skill Snapshot

name: transcribee
description: Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis. OpenClaw Skills integration.
owner: itsfabioroma
repository: itsfabioroma/transcribee
language: Markdown
license: MIT
topics: (none listed)
security: L1
install: openclaw add @itsfabioroma/transcribee
last updated: Feb 7, 2026

Maintainer

itsfabioroma

Maintains transcribee in the OpenClaw Skills directory.

File Explorer

8 files

| File | Size |
| --- | --- |
| _meta.json | 463 B |
| CLAUDE.md | 4.2 KB |
| index.ts | 25.7 KB |
| package.json | 544 B |
| pnpm-lock.yaml | 25.4 KB |
| README.md | 3.8 KB |
| SKILL.md | 1.3 KB |
| transcribe.sh | 877 B |

SKILL.md

name: transcribee
description: Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.

Transcribee

Transcribe YouTube videos and local media files with speaker diarization via ElevenLabs.

Usage

# YouTube video
transcribee "https://www.youtube.com/watch?v=..."

# Local video
transcribee ~/path/to/video.mp4

# Local audio
transcribee ~/path/to/podcast.mp3

Always quote URLs containing & or special characters.

Output

Transcripts save to: ~/Documents/transcripts/{category}/{title}-{date}/

| File | Use |
| --- | --- |
| transcription.txt | Speaker-labeled transcript |
| transcription-raw.txt | Plain text, no speakers |
| transcription-raw.json | Word-level timings |
| metadata.json | Video info, language, category |
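
As a rough illustration of the `{category}/{title}-{date}` layout above, the sketch below builds an output directory path. The `slugify` and `transcriptDir` helpers are hypothetical, and the slug rules and `YYYY-MM-DD` date format are assumptions; the real logic in index.ts may differ.

```typescript
// Hypothetical helper: lowercase the title and replace anything that is not
// a letter or digit with a single hyphen. The actual slug rules are not documented.
function slugify(text: string): string {
  return text
    .toLowerCase()
    .replace(/[^a-z0-9]+/g, "-") // collapse non-alphanumeric runs into one hyphen
    .replace(/^-+|-+$/g, ""); // trim leading and trailing hyphens
}

// Build {root}/{category}/{title}-{date}, assuming a YYYY-MM-DD date in UTC.
function transcriptDir(root: string, category: string, title: string, date: Date): string {
  const day = date.toISOString().slice(0, 10);
  return `${root}/${category}/${slugify(title)}-${day}`;
}

const exampleDir = transcriptDir(
  "~/Documents/transcripts",
  "AI-Research",
  "Ilya Sutskever: AGI!",
  new Date("2024-05-01T12:00:00Z"),
);
console.log(exampleDir);
```

The category segment is passed through unchanged here because the page shows capitalized category folders such as AI-Research.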

Supported Formats

  • Audio: mp3, m4a, wav, ogg, flac
  • Video: mp4, mkv, webm, mov, avi
  • URLs: youtube.com, youtu.be

Dependencies

brew install yt-dlp ffmpeg

Troubleshooting

| Error | Fix |
| --- | --- |
| yt-dlp not found | brew install yt-dlp |
| ffmpeg not found | brew install ffmpeg |
| API errors | Check .env file in transcribee directory |

README.md

transcribee 🐝

Open source macOS transcriber for YouTube, Instagram Reels, TikTok, and local media that grows into a self-organizing knowledge base.

transcribee "https://youtube.com/watch?v=..."
transcribee "https://instagram.com/reel/..."
transcribee "https://vt.tiktok.com/..."
transcribee ~/Downloads/podcast.mp3

Over time, your ~/Documents/transcripts/ folder naturally evolves into a personal library:

transcripts/
├── AI-Research/
│   ├── ilya-sutskever-agi-2024/
│   └── anthropic-constitutional-ai/
├── Startups/
│   ├── ycombinator-how-to-get-users/
│   └── pmarca-founder-mode/
└── Health/
    └── huberman-sleep-optimization/

Each transcript is speaker-labeled and ready to paste into ChatGPT, Claude, or any LLM.

Why 🍯

I consume a lot of video content — YouTube, Instagram, TikTok, podcasts, interviews. I wanted to:

  • Ask questions about videos in LLMs
  • Have all that knowledge searchable and organized
  • Not do any manual work to maintain it

transcribee does exactly that. Transcribe once, knowledge stays forever.

Features 🪻

  • Transcribes YouTube, Instagram Reels, TikTok, and local audio/video files
  • Speaker diarization — identifies different speakers
  • Auto-categorizes transcripts using Claude based on content
  • Builds a knowledge library that organizes itself over time

Use with Clawdbot 🤖

transcribee is available as a Clawdbot skill. Just ask your agent to transcribe any YouTube video:

"Transcribe this video: https://youtube.com/watch?v=..."

Install the skill

# Install from ClawdHub (recommended)
clawdhub install transcribee

# Or clone manually
git clone https://github.com/itsfabioroma/transcribee.git ~/.clawdbot/skills/transcribee

Make sure you have the dependencies installed (brew install yt-dlp ffmpeg) and API keys configured.

Quick Start 🪺

# Install dependencies (macOS)
brew install yt-dlp ffmpeg
pnpm install

# Configure API keys
cp .env.example .env
# Add your ElevenLabs + Anthropic API keys to .env

# Transcribe anything
transcribee "https://youtube.com/watch?v=..."
transcribee "https://instagram.com/reel/..."
transcribee "https://vt.tiktok.com/..."
transcribee ~/Downloads/podcast.mp3
transcribee ~/Videos/interview.mp4

Add to ~/.zshrc:

alias transcribee="noglob /path/to/transcribee/transcribe.sh"

Output 🍯

Each transcript saves to ~/Documents/transcripts/{category}/{title}/:

| File | What it's for |
| --- | --- |
| transcript.txt | Speaker-labeled transcript; paste this into your LLM |
| metadata.json | Video info, language, auto-detected theme |

Raw JSON (optional)

For power users who need word-level timestamps and confidence scores:

transcribee --raw "https://youtube.com/watch?v=..."

This adds transcript-raw.json with the full ElevenLabs response.
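
To show what word-level data makes possible, here is a minimal sketch that merges consecutive words from the same speaker into turns and prints a speaker-labeled transcript. The `Word` shape (`text`, `start`, `end`, `speaker_id`) is an assumption modeled on ElevenLabs speech-to-text output and is not confirmed by this page; `groupBySpeaker` and `formatTranscript` are hypothetical names.

```typescript
// Assumed shape of one word entry in transcript-raw.json (not confirmed here).
interface Word {
  text: string;
  start: number; // seconds
  end: number; // seconds
  speaker_id: string; // e.g. "speaker_0"
}

interface Turn {
  speaker: string;
  start: number;
  end: number;
  text: string;
}

// Merge consecutive words spoken by the same speaker into a single turn.
function groupBySpeaker(words: Word[]): Turn[] {
  const turns: Turn[] = [];
  for (const w of words) {
    const last = turns[turns.length - 1];
    if (last && last.speaker === w.speaker_id) {
      last.text += " " + w.text;
      last.end = w.end;
    } else {
      turns.push({ speaker: w.speaker_id, start: w.start, end: w.end, text: w.text });
    }
  }
  return turns;
}

// Render turns as "speaker: text" paragraphs separated by blank lines.
function formatTranscript(turns: Turn[]): string {
  return turns.map((t) => `${t.speaker}: ${t.text}`).join("\n\n");
}

const sampleWords: Word[] = [
  { text: "Hello", start: 0.0, end: 0.4, speaker_id: "speaker_0" },
  { text: "there.", start: 0.5, end: 0.9, speaker_id: "speaker_0" },
  { text: "Hi!", start: 1.2, end: 1.5, speaker_id: "speaker_1" },
];
console.log(formatTranscript(groupBySpeaker(sampleWords)));
```

Keeping `start` and `end` on each turn makes it easy to add timestamps to the rendered transcript later.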

How it works 🐝

  1. Downloads audio from YouTube (yt-dlp) or extracts from local video (ffmpeg)
  2. Transcribes with ElevenLabs (scribe_v1_experimental with speaker diarization)
  3. Claude analyzes content and existing library structure
  4. Auto-categorizes into the right folder
  5. Saves transcript files with metadata
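
Step 1 above could be sketched as a function that picks the tool and arguments for a given input. This is an illustration under stated assumptions, not the command transcribee actually runs: `audioCommand` is a hypothetical helper, mp3 as the target format is assumed, and the only flags used are standard ones (`yt-dlp -x --audio-format -o`, `ffmpeg -i -vn`).

```typescript
interface Command {
  program: string;
  args: string[];
}

// Choose yt-dlp for URLs and ffmpeg for local files.
// outBase is the output path without a file extension.
function audioCommand(input: string, outBase: string): Command {
  if (/^https?:\/\//i.test(input)) {
    // -x extracts audio, --audio-format converts it, -o sets the output template
    return {
      program: "yt-dlp",
      args: ["-x", "--audio-format", "mp3", "-o", `${outBase}.%(ext)s`, input],
    };
  }
  // -i names the input file, -vn drops the video stream
  return { program: "ffmpeg", args: ["-i", input, "-vn", `${outBase}.mp3`] };
}

// VIDEO_ID is a placeholder, not a real video.
console.log(audioCommand("https://youtu.be/VIDEO_ID", "/tmp/audio"));
console.log(audioCommand("/Users/me/Videos/interview.mp4", "/tmp/audio"));
```

Returning the program and argument list separately, rather than one shell string, avoids the quoting problems that URLs containing & can cause.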

Requirements

Supported formats

| Type | Formats |
| --- | --- |
| Audio | mp3, m4a, wav, ogg, flac |
| Video | mp4, mkv, webm, mov, avi |
| URLs | youtube.com, youtu.be, instagram.com/reel, tiktok.com |
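
The supported formats listed above can be expressed as a small input classifier. This is a sketch only: `classifyInput` is a hypothetical name, and it checks just the host of a URL, so it is looser than the instagram.com/reel entry implies.

```typescript
type InputKind = "audio" | "video" | "url" | "unsupported";

const AUDIO_EXT = ["mp3", "m4a", "wav", "ogg", "flac"];
const VIDEO_EXT = ["mp4", "mkv", "webm", "mov", "avi"];
const URL_HOSTS = ["youtube.com", "youtu.be", "instagram.com", "tiktok.com"];

// True for the bare domain or any subdomain of it (www., vt., m., ...).
function hostMatches(host: string, domain: string): boolean {
  return host === domain || host.slice(-(domain.length + 1)) === "." + domain;
}

function classifyInput(input: string): InputKind {
  const m = /^https?:\/\/([^\/:?#]+)/i.exec(input);
  if (m) {
    const host = (m[1] ?? "").toLowerCase();
    return URL_HOSTS.some((d) => hostMatches(host, d)) ? "url" : "unsupported";
  }
  // Local path: decide by file extension, case-insensitively.
  const dot = input.lastIndexOf(".");
  const ext = dot >= 0 ? input.slice(dot + 1).toLowerCase() : "";
  if (AUDIO_EXT.indexOf(ext) >= 0) return "audio";
  if (VIDEO_EXT.indexOf(ext) >= 0) return "video";
  return "unsupported";
}

console.log(classifyInput("~/Downloads/podcast.mp3"));
console.log(classifyInput("https://vt.tiktok.com/..."));
```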

bzz bzz 🐝

Permissions & Security

Security level L1: Low-risk skills with minimal permissions. Review inputs and outputs before running in production.

Requirements

```bash
brew install yt-dlp ffmpeg
```

FAQ

How do I install transcribee?

Run openclaw add @itsfabioroma/transcribee in your terminal. This installs transcribee into your OpenClaw Skills catalog.

Does this skill run locally or in the cloud?

OpenClaw Skills execute locally by default. Review the SKILL.md and permissions before running any skill.

Where can I verify the source code?

The source repository is available at https://github.com/openclaw/skills/tree/main/skills/itsfabioroma/transcribee. Review commits and README documentation before installing.