4.8k★by emasoudy
screen-monitor – OpenClaw Skill
screen-monitor is an OpenClaw Skills integration for ai ml workflows. Dual-mode screen sharing and analysis. Model-agnostic (Gemini/Claude/Qwen3-VL).
Skill Snapshot
| name | screen-monitor |
| description | Dual-mode screen sharing and analysis. Model-agnostic (Gemini/Claude/Qwen3-VL). OpenClaw Skills integration. |
| owner | emasoudy |
| repository | emasoudy/screen-monitor |
| language | Markdown |
| license | MIT |
| topics | |
| security | L1 |
| install | openclaw add @emasoudy/screen-monitor |
| last updated | Feb 7, 2026 |
Maintainer

name: screen-monitor description: Dual-mode screen sharing and analysis. Model-agnostic (Gemini/Claude/Qwen3-VL). metadata: {"clawdbot":{"emoji":"🖥️","requires":{"model_features":["vision"]}}}
Screen Monitor
This skill provides two ways for the agent to see and interact with your screen.
🟢 Path A: Fast Share (WebRTC)
Best for: Quick visual checks, restricted browsers, or non-technical environments.
Tools
screen_share_link: Generates a local WebRTC portal URL.screen_analyze: Captures the current frame from the portal and analyzes it with vision.
Usage:
# Get the link
bash command:"{baseDir}/references/get-share-url.sh"
# Analyze
bash command:"{baseDir}/references/screen-analyze.sh"
🔵 Path B: Full Control (Browser Relay)
Best for: Deep debugging, UI automation, and clicking/typing in tabs.
Setup
- Run
clawdbot browser extension install. - Load the unpacked extension from
clawdbot browser extension path. - Click the Clawdbot icon in your Chrome toolbar to Attach.
Tools
browser action:snapshot: Take a precise screenshot of the attached tab.browser action:click: Interact with elements (requiresprofile="chrome").
Technical Details
- Port: 18795 (WebRTC Backend)
- Files:
web/screen-share.html: The sharing portal.references/backend-endpoint.js: Frame storage server.
Screen-Monitor Skill
Browser-based screen sharing with vision analysis. Works with any multimodal LLM.
Installation
clawdhub install screen-monitor
Or manual:
git clone https://github.com/emasoudy/clawdbot-skills.git
cp -r clawdbot-skills/screen-monitor ~/.clawdbot/skills/
Usage
- Get share link:
screen_share_link - Open link in browser (e.g.,
http://192.168.1.100:18795/screen-share) - Click "Start Sharing"
- Ask questions about your screen
Requirements
- Multimodal LLM (Gemini, Claude, Qwen3-VL, etc.)
- Modern browser with WebRTC
- Optional: ImageMagick for OS fallback
License
MIT - See LICENSE file
Permissions & Security
Security level L1: Low-risk skills with minimal permissions. Review inputs and outputs before running in production.
Requirements
- OpenClaw CLI installed and configured.
- Language: Markdown
- License: MIT
- Topics:
FAQ
How do I install screen-monitor?
Run openclaw add @emasoudy/screen-monitor in your terminal. This installs screen-monitor into your OpenClaw Skills catalog.
Does this skill run locally or in the cloud?
OpenClaw Skills execute locally by default. Review the SKILL.md and permissions before running any skill.
Where can I verify the source code?
The source repository is available at https://github.com/openclaw/skills/tree/main/skills/emasoudy/screen-monitor. Review commits and README documentation before installing.
