1.9k★by kesslerio
mineru-pdf – OpenClaw Skill
mineru-pdf is an OpenClaw Skills integration for coding workflows. Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.
Skill Snapshot
| name | mineru-pdf |
| description | Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction. OpenClaw Skills integration. |
| owner | kesslerio |
| repository | kesslerio/mineru-pdf-parser-clawdbot-skill |
| language | Markdown |
| license | MIT |
| topics | |
| security | L1 |
| install | openclaw add @kesslerio/mineru-pdf-parser-clawdbot-skill |
| last updated | Feb 7, 2026 |
Maintainer

name: mineru-pdf description: Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.
MinerU PDF
Overview
Parse a PDF locally with MinerU (CPU). Default output is Markdown + JSON. Use tables/images only when requested.
Quick start (single PDF)
# Run from the skill directory
./scripts/mineru_parse.sh /path/to/file.pdf
Optional examples:
./scripts/mineru_parse.sh /path/to/file.pdf --format json
./scripts/mineru_parse.sh /path/to/file.pdf --tables --images
When to read references
If flags differ from your wrapper or you need advanced defaults (backend/method/device/threads/format mapping), read:
references/mineru-cli.md
Output conventions
- Output root defaults to
./mineru-output/. - MinerU creates the per-document subfolder under the output root (e.g.,
./mineru-output/<basename>/...).
Batching
Default is single-PDF parsing. Only implement batch folder parsing if explicitly requested.
MinerU PDF Parser - Clawdbot Skill
A Clawdbot skill for parsing PDFs locally using MinerU (CPU). Produces rich structured output including Markdown, JSON with layout data, and extracted images.
Features
- Local CPU processing — No GPU required; runs entirely on your machine
- Rich structured output — Markdown + detailed JSON with layout information
- Image extraction — Automatically extracts embedded images
- Table support — Optional table extraction (if supported by your MinerU version)
- Configurable — Flexible env overrides for different MinerU wrappers
Installation
Prerequisites
- MinerU CLI installed and accessible (see MinerU installation)
- Clawdbot installed
Install the skill
# Clone the repo
git clone https://github.com/kesslerio/MinerU-PDF-Parser-Clawdbot-Skill.git
# Or copy the mineru-pdf/ folder to your Clawdbot skills directory
cp -r MinerU-PDF-Parser-Clawdbot-Skill/mineru-pdf ~/.clawdbot/skills/
Usage
Quick start
# Run from the skill directory
./scripts/mineru_parse.sh /path/to/document.pdf
Options
./scripts/mineru_parse.sh /path/to/document.pdf --format json
./scripts/mineru_parse.sh /path/to/document.pdf --tables --images
./scripts/mineru_parse.sh /path/to/document.pdf --outroot ./my-output
| Option | Default | Description |
|---|---|---|
--format | both | Output format: md, json, or both |
--outroot | ./mineru-output | Output root directory |
--tables | off | Extract tables (if supported) |
--images | off | Extract images (if supported) |
--threads | 4 | Thread count (OMP_NUM_THREADS) |
--lang | en | Language |
--backend | pipeline | MinerU backend |
--method | auto | Processing method |
--device | cpu | Device (cpu/gpu) |
Configuration
If your MinerU wrapper uses different flags, set env overrides. See mineru-pdf/references/mineru-cli.md for full documentation.
export MINERU_CMD=~/.local/bin/mineru
export MINERU_INPUT_FLAG=-p
export MINERU_OUTPUT_FLAG=-o
Output
MinerU creates a per-document subfolder under the output root:
./mineru-output/
└── document-name/
└── auto/
├── document-name.md # Markdown output
├── document-name_middle.json # Rich structured JSON (~50KB+)
├── document-name_layout.pdf # Layout visualization
└── images/ # Extracted images
Output quality
MinerU produces rich structured output including:
- Layout-aware text extraction
- Detailed JSON with position/structure metadata
- Extracted images and layout PDFs
Best for: Documents requiring accurate layout preservation, image extraction, or structured data output.
Comparison with PyMuPDF
| Aspect | MinerU | PyMuPDF |
|---|---|---|
| Speed | Slower (~15-30s/page) | Fast (~1s/page) |
| JSON output | Rich (~50KB+, layout data) | Minimal (~1KB, text only) |
| Image extraction | Yes (automatic) | Yes (optional) |
| Layout preservation | Excellent | Basic |
| Dependencies | Heavy (~20GB models) | Light (pip install) |
Use MinerU when: Quality and structure matter more than speed.
Use PyMuPDF when: Speed matters or for simple text extraction.
License
Apache 2.0
Contributing
Issues and PRs welcome. Please test with a variety of PDFs before submitting changes.
Related
- PyMuPDF PDF Parser Skill — Fast, lightweight alternative
- MinerU — The underlying PDF parser
- Clawdbot — The AI agent framework
Permissions & Security
Security level L1: Low-risk skills with minimal permissions. Review inputs and outputs before running in production.
Requirements
- OpenClaw CLI installed and configured.
- Language: Markdown
- License: MIT
- Topics:
FAQ
How do I install mineru-pdf?
Run openclaw add @kesslerio/mineru-pdf-parser-clawdbot-skill in your terminal. This installs mineru-pdf into your OpenClaw Skills catalog.
Does this skill run locally or in the cloud?
OpenClaw Skills execute locally by default. Review the SKILL.md and permissions before running any skill.
Where can I verify the source code?
The source repository is available at https://github.com/openclaw/skills/tree/main/skills/kesslerio/mineru-pdf-parser-clawdbot-skill. Review commits and README documentation before installing.
