skills$openclaw/screen-monitor
emasoudy4.8k

by emasoudy

screen-monitor – OpenClaw Skill

screen-monitor is an OpenClaw Skills integration for ai ml workflows. Dual-mode screen sharing and analysis. Model-agnostic (Gemini/Claude/Qwen3-VL).

4.8k stars9.2k forksSecurity L1
Updated Feb 7, 2026Created Feb 7, 2026ai ml

Skill Snapshot

namescreen-monitor
descriptionDual-mode screen sharing and analysis. Model-agnostic (Gemini/Claude/Qwen3-VL). OpenClaw Skills integration.
owneremasoudy
repositoryemasoudy/screen-monitor
languageMarkdown
licenseMIT
topics
securityL1
installopenclaw add @emasoudy/screen-monitor
last updatedFeb 7, 2026

Maintainer

emasoudy

emasoudy

Maintains screen-monitor in the OpenClaw Skills directory.

View GitHub profile
File Explorer
11 files
.
references
backend-endpoint.js
2.4 KB
env-check.sh
460 B
get-share-url.sh
228 B
screen-analyze.sh
737 B
web
screen-share.html
6.2 KB
_meta.json
282 B
manifest.json
1.1 KB
README.md
664 B
SKILL.md
1.4 KB
SKILL.md

name: screen-monitor description: Dual-mode screen sharing and analysis. Model-agnostic (Gemini/Claude/Qwen3-VL). metadata: {"clawdbot":{"emoji":"🖥️","requires":{"model_features":["vision"]}}}

Screen Monitor

This skill provides two ways for the agent to see and interact with your screen.

🟢 Path A: Fast Share (WebRTC)

Best for: Quick visual checks, restricted browsers, or non-technical environments.

Tools

  • screen_share_link: Generates a local WebRTC portal URL.
  • screen_analyze: Captures the current frame from the portal and analyzes it with vision.

Usage:

# Get the link
bash command:"{baseDir}/references/get-share-url.sh"

# Analyze
bash command:"{baseDir}/references/screen-analyze.sh"

🔵 Path B: Full Control (Browser Relay)

Best for: Deep debugging, UI automation, and clicking/typing in tabs.

Setup

  1. Run clawdbot browser extension install.
  2. Load the unpacked extension from clawdbot browser extension path.
  3. Click the Clawdbot icon in your Chrome toolbar to Attach.

Tools

  • browser action:snapshot: Take a precise screenshot of the attached tab.
  • browser action:click: Interact with elements (requires profile="chrome").

Technical Details

  • Port: 18795 (WebRTC Backend)
  • Files:
    • web/screen-share.html: The sharing portal.
    • references/backend-endpoint.js: Frame storage server.
README.md

Screen-Monitor Skill

Browser-based screen sharing with vision analysis. Works with any multimodal LLM.

Installation

clawdhub install screen-monitor

Or manual:

git clone https://github.com/emasoudy/clawdbot-skills.git
cp -r clawdbot-skills/screen-monitor ~/.clawdbot/skills/

Usage

  1. Get share link: screen_share_link
  2. Open link in browser (e.g., http://192.168.1.100:18795/screen-share)
  3. Click "Start Sharing"
  4. Ask questions about your screen

Requirements

  • Multimodal LLM (Gemini, Claude, Qwen3-VL, etc.)
  • Modern browser with WebRTC
  • Optional: ImageMagick for OS fallback

License

MIT - See LICENSE file

Permissions & Security

Security level L1: Low-risk skills with minimal permissions. Review inputs and outputs before running in production.

Requirements

  • OpenClaw CLI installed and configured.
  • Language: Markdown
  • License: MIT
  • Topics:

FAQ

How do I install screen-monitor?

Run openclaw add @emasoudy/screen-monitor in your terminal. This installs screen-monitor into your OpenClaw Skills catalog.

Does this skill run locally or in the cloud?

OpenClaw Skills execute locally by default. Review the SKILL.md and permissions before running any skill.

Where can I verify the source code?

The source repository is available at https://github.com/openclaw/skills/tree/main/skills/emasoudy/screen-monitor. Review commits and README documentation before installing.