skills$openclaw/mineru-pdf
kesslerio1.9k

by kesslerio

mineru-pdf – OpenClaw Skill

mineru-pdf is an OpenClaw Skills integration for coding workflows. Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.

1.9k stars6.2k forksSecurity L1
Updated Feb 7, 2026Created Feb 7, 2026coding

Skill Snapshot

namemineru-pdf
descriptionParse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction. OpenClaw Skills integration.
ownerkesslerio
repositorykesslerio/mineru-pdf-parser-clawdbot-skill
languageMarkdown
licenseMIT
topics
securityL1
installopenclaw add @kesslerio/mineru-pdf-parser-clawdbot-skill
last updatedFeb 7, 2026

Maintainer

kesslerio

kesslerio

Maintains mineru-pdf in the OpenClaw Skills directory.

View GitHub profile
File Explorer
7 files
.
references
mineru-cli.md
1.5 KB
scripts
mineru_parse.sh
4.1 KB
_meta.json
319 B
README.md
3.8 KB
SKILL.md
1.0 KB
SKILL.md

name: mineru-pdf description: Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.

MinerU PDF

Overview

Parse a PDF locally with MinerU (CPU). Default output is Markdown + JSON. Use tables/images only when requested.

Quick start (single PDF)

# Run from the skill directory
./scripts/mineru_parse.sh /path/to/file.pdf

Optional examples:

./scripts/mineru_parse.sh /path/to/file.pdf --format json
./scripts/mineru_parse.sh /path/to/file.pdf --tables --images

When to read references

If flags differ from your wrapper or you need advanced defaults (backend/method/device/threads/format mapping), read:

  • references/mineru-cli.md

Output conventions

  • Output root defaults to ./mineru-output/.
  • MinerU creates the per-document subfolder under the output root (e.g., ./mineru-output/<basename>/...).

Batching

Default is single-PDF parsing. Only implement batch folder parsing if explicitly requested.

README.md

MinerU PDF Parser - Clawdbot Skill

A Clawdbot skill for parsing PDFs locally using MinerU (CPU). Produces rich structured output including Markdown, JSON with layout data, and extracted images.

Features

  • Local CPU processing — No GPU required; runs entirely on your machine
  • Rich structured output — Markdown + detailed JSON with layout information
  • Image extraction — Automatically extracts embedded images
  • Table support — Optional table extraction (if supported by your MinerU version)
  • Configurable — Flexible env overrides for different MinerU wrappers

Installation

Prerequisites

  1. MinerU CLI installed and accessible (see MinerU installation)
  2. Clawdbot installed

Install the skill

# Clone the repo
git clone https://github.com/kesslerio/MinerU-PDF-Parser-Clawdbot-Skill.git

# Or copy the mineru-pdf/ folder to your Clawdbot skills directory
cp -r MinerU-PDF-Parser-Clawdbot-Skill/mineru-pdf ~/.clawdbot/skills/

Usage

Quick start

# Run from the skill directory
./scripts/mineru_parse.sh /path/to/document.pdf

Options

./scripts/mineru_parse.sh /path/to/document.pdf --format json
./scripts/mineru_parse.sh /path/to/document.pdf --tables --images
./scripts/mineru_parse.sh /path/to/document.pdf --outroot ./my-output
OptionDefaultDescription
--formatbothOutput format: md, json, or both
--outroot./mineru-outputOutput root directory
--tablesoffExtract tables (if supported)
--imagesoffExtract images (if supported)
--threads4Thread count (OMP_NUM_THREADS)
--langenLanguage
--backendpipelineMinerU backend
--methodautoProcessing method
--devicecpuDevice (cpu/gpu)

Configuration

If your MinerU wrapper uses different flags, set env overrides. See mineru-pdf/references/mineru-cli.md for full documentation.

export MINERU_CMD=~/.local/bin/mineru
export MINERU_INPUT_FLAG=-p
export MINERU_OUTPUT_FLAG=-o

Output

MinerU creates a per-document subfolder under the output root:

./mineru-output/
└── document-name/
    └── auto/
        ├── document-name.md          # Markdown output
        ├── document-name_middle.json # Rich structured JSON (~50KB+)
        ├── document-name_layout.pdf  # Layout visualization
        └── images/                   # Extracted images

Output quality

MinerU produces rich structured output including:

  • Layout-aware text extraction
  • Detailed JSON with position/structure metadata
  • Extracted images and layout PDFs

Best for: Documents requiring accurate layout preservation, image extraction, or structured data output.

Comparison with PyMuPDF

AspectMinerUPyMuPDF
SpeedSlower (~15-30s/page)Fast (~1s/page)
JSON outputRich (~50KB+, layout data)Minimal (~1KB, text only)
Image extractionYes (automatic)Yes (optional)
Layout preservationExcellentBasic
DependenciesHeavy (~20GB models)Light (pip install)

Use MinerU when: Quality and structure matter more than speed.
Use PyMuPDF when: Speed matters or for simple text extraction.

License

Apache 2.0

Contributing

Issues and PRs welcome. Please test with a variety of PDFs before submitting changes.

Permissions & Security

Security level L1: Low-risk skills with minimal permissions. Review inputs and outputs before running in production.

Requirements

  • OpenClaw CLI installed and configured.
  • Language: Markdown
  • License: MIT
  • Topics:

FAQ

How do I install mineru-pdf?

Run openclaw add @kesslerio/mineru-pdf-parser-clawdbot-skill in your terminal. This installs mineru-pdf into your OpenClaw Skills catalog.

Does this skill run locally or in the cloud?

OpenClaw Skills execute locally by default. Review the SKILL.md and permissions before running any skill.

Where can I verify the source code?

The source repository is available at https://github.com/openclaw/skills/tree/main/skills/kesslerio/mineru-pdf-parser-clawdbot-skill. Review commits and README documentation before installing.