5.2kβ
by kylelol
clawdbites β OpenClaw Skill
clawdbites is an OpenClaw Skills integration for coding workflows. Extract recipes from Instagram reels. Use when a user sends an Instagram reel link and wants to get the recipe from the caption. Parses ingredients, instructions, and macros into a clean format.
Skill Snapshot
| name | clawdbites |
| description | Extract recipes from Instagram reels. Use when a user sends an Instagram reel link and wants to get the recipe from the caption. Parses ingredients, instructions, and macros into a clean format. OpenClaw Skills integration. |
| owner | kylelol |
| repository | kylelol/clawdbites |
| language | Markdown |
| license | MIT |
| topics | |
| security | L1 |
| install | openclaw add @kylelol/clawdbites |
| last updated | Feb 7, 2026 |
Maintainer

name: clawdbites description: Extract recipes from Instagram reels. Use when a user sends an Instagram reel link and wants to get the recipe from the caption. Parses ingredients, instructions, and macros into a clean format. homepage: https://github.com/kylelol/ClawdBites metadata: {"clawdbot":{"emoji":"π¦","os":["darwin","linux"],"requires":{"bins":["yt-dlp","ffmpeg","whisper"]},"install":[{"id":"yt-dlp","kind":"brew","formula":"yt-dlp","bins":["yt-dlp"],"label":"Install yt-dlp via Homebrew"},{"id":"ffmpeg","kind":"brew","formula":"ffmpeg","bins":["ffmpeg"],"label":"Install ffmpeg via Homebrew"},{"id":"whisper","kind":"shell","command":"pip3 install --user openai-whisper","label":"Install Whisper (local, no API key)"}]}}
Instagram Recipe Extractor
Extract recipes from Instagram reels using a multi-layered approach:
- Caption parsing β Instant, check description first
- Audio transcription β Whisper (local, no API key)
- Frame analysis β Vision model for on-screen text
No Instagram login required. Works on public reels.
When to Use
- User sends an Instagram reel link
- User mentions "recipe from Instagram" or "save this reel"
- User wants to extract recipe details from a video post
How It Works (MANDATORY FLOW)
ALWAYS follow this complete flow β do not stop after caption if instructions are missing:
- User sends Instagram reel URL
- Extract metadata using yt-dlp (
--dump-json) - Parse the caption for recipe details
- Check completeness: Does caption have BOTH ingredients AND instructions?
- β YES: Present the recipe
- β NO (missing instructions or incomplete): Automatically proceed to audio transcription β do NOT stop or ask the user
- If audio transcription needed:
- Download video:
yt-dlp -o "/tmp/reel.mp4" "URL" - Extract audio:
ffmpeg -y -i /tmp/reel.mp4 -vn -acodec pcm_s16le -ar 16000 -ac 1 /tmp/reel.wav - Transcribe:
whisper /tmp/reel.wav --model base --output_format txt --output_dir /tmp - Merge caption ingredients with audio instructions
- Download video:
- Present clean, formatted recipe (combining caption + audio as needed)
- User decides what to do (save to notes, add to wishlist, etc.)
Completeness check heuristics:
- Has ingredients = contains 3+ quantity+item patterns (e.g., "1 cup flour", "2 lbs chicken")
- Has instructions = contains action verbs (blend, cook, bake, mix, pour, add) + sequence OR numbered steps
Extraction Command
yt-dlp --dump-json "https://www.instagram.com/reel/SHORTCODE/" 2>/dev/null
Key fields from JSON output:
descriptionβ The caption containing the recipeuploaderβ Creator's namechannelβ Creator's handlewebpage_urlβ Original URLlike_countβ Popularity indicator
Recipe Parsing
Look for these patterns in the caption:
Macros:
- "X Calories | Xg P | Xg C | Xg F"
- "Macros per serving"
- "Cal/Protein/Carbs/Fat"
Ingredients:
- Lines starting with quantities (1 cup, 2 tbsp, 24oz)
- Lines with measurement units
- Emoji bullet points (π₯© π½ π§ etc.)
Sections:
- "For the [component]:"
- "Ingredients:"
- "Instructions:"
- "Directions:"
Output Format
Present extracted recipe cleanly:
## [Recipe Name]
*From @[handle]*
**Macros (per serving):** X cal | Xg P | Xg C | Xg F
### Ingredients
- [ingredient 1]
- [ingredient 2]
...
### Instructions
1. [step 1]
2. [step 2]
...
---
Source: [original URL]
User Actions After Extraction
Let the user decide what to do:
- "Save to my recipes" β Save to Apple Notes (if meal-planner skill available)
- "Add to wishlist" β Save to
memory/recipe-wishlist.json - "Just show me" β Display only, no save
- "Plan this for next week" β Hand off to meal-planner skill
Wishlist Storage
Optional storage for recipes user wants to try later:
memory/recipe-wishlist.json:
{
"recipes": [
{
"name": "Recipe Name",
"source": "instagram",
"sourceUrl": "https://instagram.com/reel/...",
"handle": "@creator",
"addedDate": "2026-01-26",
"tried": false,
"macros": {
"calories": 585,
"protein": 56,
"carbs": 25,
"fat": 28,
"servings": 3
},
"ingredients": [...],
"instructions": [...]
}
]
}
Error Handling
If yt-dlp fails:
- Check if URL is valid Instagram reel format
- May be a private account β inform user
- Suggest user paste caption text manually as fallback
If no recipe found in caption (IMPORTANT):
After extracting, scan the caption for recipe indicators:
- Ingredient quantities (numbers + units like oz, cups, tbsp, lbs)
- Recipe sections ("For the...", "Ingredients:", "Instructions:")
- Cooking verbs (bake, cook, sautΓ©, mix, combine)
- Macro information (calories, protein, carbs, fat)
If none found, tell the user clearly:
"I pulled the caption but it doesn't look like the recipe is there β it might just be a teaser or the recipe is only shown in the video itself. Here's what the caption says:
[show caption]
A few options:
- Check the comments β sometimes creators post recipes there
- Check their bio link β might lead to the full recipe
- Describe what you saw in the video and I can help find a similar recipe"
Recipe detection heuristics:
HAS_RECIPE if caption contains:
- 3+ ingredient-like patterns (quantity + food item)
- OR "recipe" + ingredient list
- OR macro breakdown + ingredients
- OR numbered/bulleted instructions
NO_RECIPE if caption is:
- Mostly hashtags
- Just a description/teaser
- Under 100 characters
- No quantities or measurements
Integration with meal-planner
The meal-planner skill can reference this skill:
- When planning meals, check wishlist for untried recipes
- Suggest wishlist recipes that match pantry items
- Mark recipes as "tried" after they're used in a meal plan
Audio Transcription (V2) β MANDATORY FALLBACK
When caption is missing instructions, ALWAYS transcribe the audio automatically. Do not stop and ask the user β just do it. This is the most common case since creators often put ingredients in captions but speak the instructions.
Step 1: Download video
yt-dlp -o "/tmp/reel.mp4" "https://instagram.com/reel/XXX"
Step 2: Extract audio
ffmpeg -i /tmp/reel.mp4 -vn -acodec pcm_s16le -ar 16000 -ac 1 /tmp/reel.wav
Step 3: Transcribe with Whisper
/Users/kylekirkland/Library/Python/3.14/bin/whisper /tmp/reel.wav --model base --output_format txt --output_dir /tmp
Step 4: Parse transcript for recipe Look for cooking instructions, ingredients mentioned verbally.
Inference for Missing Measurements
ALWAYS infer quantities when not provided. Never present a recipe without amounts β estimate based on context and standard package sizes.
Vague Language β Specific Amounts
| What they say | Infer |
|---|---|
| "some chicken" | ~1 lb |
| "a bit of garlic" | 2-3 cloves |
| "handful of spinach" | ~2 cups |
| "drizzle of oil" | 1-2 tbsp |
| "season to taste" | Β½ tsp salt, ΒΌ tsp pepper |
| "splash of soy sauce" | 1-2 tbsp |
| "a few tablespoons" | 2-3 tbsp |
| "some rice" | 1 cup dry |
| "cheese on top" | Β½ - 1 cup shredded |
| "diced onion" | 1 medium onion |
| "bell peppers" | 2 peppers |
Standard Package Sizes (when item mentioned without amount)
| Ingredient | Standard Package | Infer |
|---|---|---|
| Puff pastry | 17oz sheet | 1 sheet |
| Ground beef/turkey | 1 lb pack | 1 lb |
| Chicken breast | ~1.5 lb pack | 1.5 lbs |
| Sausage links | 14oz / 4-5 links | 1 package |
| Bacon | 12oz / 12 slices | Β½ package (6 slices) |
| Shredded cheese | 8oz bag | 1-2 cups |
| Tortillas | 8-10 count | 1 package |
| Canned beans | 15oz can | 1 can |
| Broth/stock | 32oz carton | 1-2 cups |
| Pasta | 16oz box | 8oz (half box) |
| Rice | 2 lb bag | 1-2 cups dry |
Context-Aware Scaling
By recipe type:
- Stir fry for 2 β 1 lb protein, 4 cups veggies
- Soup/stew β 1.5-2 lbs protein, 4 cups broth
- Sheet pan meal β 1.5 lbs protein, 3-4 cups veggies
- Appetizers β smaller portions, estimate ~12-15 pieces per batch
By servings mentioned:
- "Serves 4" β Scale standard amounts for 4
- "Meal prep for the week" β Assume 5-8 servings
- No servings mentioned β Default to 4 servings
By protein target (if user has macro goals):
- 40-50g protein per serving β ~6-8oz cooked meat per portion
- Scale recipe protein accordingly
Output Format
Always present inferred amounts clearly:
### Ingredients
- 1 lb ground turkey *(estimated)*
- 1 medium onion, diced *(estimated)*
- 2 cups broth *(estimated based on typical soup)*
Mark inferred quantities with (estimated) so user knows what came from the source vs inference.
Combined Extraction Flow
1. TRY CAPTION (instant)
βββ yt-dlp --dump-json β parse description
βββ Recipe found? β DONE β
βββ Check for "pinned" / "in comments" / "check comments" β FLAG
2. IF FLAGGED: CHECK FOR CREATOR COMMENT
βββ Look through comments for creator's username
βββ If creator comment found with recipe β DONE β
βββ If not found β continue + notify user
3. TRY AUDIO (30-60 sec)
βββ Download video
βββ Extract audio with ffmpeg
βββ Transcribe with Whisper (base model)
βββ Parse transcript for recipe
βββ Infer missing measurements
βββ Recipe found? β DONE β
4. PRESENT RESULTS + PROMPT IF NEEDED
βββ Show what was extracted from audio
βββ If "pinned" was flagged, tell user:
"The creator mentioned the full recipe is pinned in the comments.
I extracted what I could from the audio, but if you want the
exact measurements, paste the pinned comment here and I'll
merge it with what I found."
5. TRY FRAME ANALYSIS (if audio incomplete)
βββ Extract 5-8 key frames with ffmpeg
βββ Send to Claude vision
βββ Ask: "Extract any recipe text, ingredients, or measurements shown"
βββ Merge findings with audio transcript
6. FALLBACK (nothing found)
βββ Inform user: "Recipe wasn't in caption or audio/video"
βββ Offer: search for similar recipe based on video title/description
Frame Analysis
Extract key frames and analyze with vision model.
Extract frames:
# Extract 1 frame every 5 seconds
ffmpeg -i /tmp/reel.mp4 -vf "fps=1/5" /tmp/frame_%02d.jpg
# Or extract specific number of frames evenly distributed
ffmpeg -i /tmp/reel.mp4 -vf "select='not(mod(n,30))'" -vsync vfr /tmp/frame_%02d.jpg
Send to vision model: Use Claude's image analysis to read each frame:
- Recipe cards / title screens
- Ingredient lists shown on screen
- Measurements in text overlays
- Step-by-step instructions displayed
Vision prompt:
Analyze this frame from a cooking video. Extract any:
- Recipe name or title
- Ingredients with quantities
- Cooking instructions
- Nutritional information / macros
- Any other recipe-related text shown
If no recipe text is visible, respond with "No recipe text found."
Merge strategy:
- Audio transcript = primary source (spoken instructions)
- Frame analysis = supplement (exact measurements, recipe cards)
- Combine both, prefer specific measurements from visual over inferred from audio
Pinned Comment Detection
Scan caption for these phrases (case-insensitive):
- "recipe pinned"
- "pinned in comments"
- "check comments"
- "in the comments"
- "comment below"
- "recipe below"
- "full recipe in comments"
If detected, flag and notify user after extraction:
"Heads up β the creator said the recipe is pinned in the comments. I got what I could from the audio, but yt-dlp can't access pinned comments without login. If you want the exact recipe, copy the pinned comment and send it to me β I'll format it properly."
Requirements
yt-dlpβbrew install yt-dlpffmpegβbrew install ffmpegwhisperβpip3 install openai-whisper(runs locally, no API key)- No Instagram login required for public reels
ClawdBites π¦π΄
Extract recipes from Instagram reels using a multi-layered approach β no login required.
A Clawdbot skill.
Features
- Caption parsing β Instant extraction from post description
- Audio transcription β Whisper (local, no API key) for spoken instructions
- Frame analysis β Vision model for on-screen recipe cards
- Smart inference β Estimates missing measurements based on context
- Wishlist β Save recipes to try later, integrates with meal planning
Requirements
| Tool | Install |
|---|---|
| yt-dlp | brew install yt-dlp |
| ffmpeg | brew install ffmpeg |
| whisper | pip3 install openai-whisper |
No Instagram login required. Works on public reels.
Usage
Send an Instagram reel link to your Clawdbot:
https://www.instagram.com/reel/ABC123/
The skill automatically:
- Checks the caption for recipe details
- If incomplete, transcribes the audio
- If still missing info, analyzes video frames
- Presents a clean, formatted recipe with macros
Example Output
## Sheet Pan Philly Cheesesteak
*From @salaarfit*
**Macros (per serving):** 331 cal | 46g P | 12g C | 10.5g F
### Ingredients
- 24oz shaved ribeye steak
- 1 green bell pepper
- 1 red bell pepper
- 1 yellow onion
- 2 cups baby mushrooms
- 2-3 tbsp Worcestershire sauce
- 6 slices thin provolone
### Instructions
1. Dice veggies into thin strips
2. Top with shaved ribeye
3. Season with Worcestershire, steak seasoning, salt
4. Cook at 425Β°F for 15 min
5. Layer provolone on top
6. Broil on high 2-3 min until cheese melts
---
Source: instagram.com/reel/...
Integration
Works great with the meal-planner skill:
- Save recipes to a wishlist
- Meal planner can suggest wishlist recipes that match your pantry
- Track which recipes you've tried
License
MIT
Permissions & Security
Security level L1: Low-risk skills with minimal permissions. Review inputs and outputs before running in production.
Requirements
- `yt-dlp` β `brew install yt-dlp` - `ffmpeg` β `brew install ffmpeg` - `whisper` β `pip3 install openai-whisper` (runs locally, no API key) - No Instagram login required for public reels
FAQ
How do I install clawdbites?
Run openclaw add @kylelol/clawdbites in your terminal. This installs clawdbites into your OpenClaw Skills catalog.
Does this skill run locally or in the cloud?
OpenClaw Skills execute locally by default. Review the SKILL.md and permissions before running any skill.
Where can I verify the source code?
The source repository is available at https://github.com/openclaw/skills/tree/main/skills/kylelol/clawdbites. Review commits and README documentation before installing.
