Best Vision AI Skills & MCP Servers
34 curated Vision skills and MCP servers — install any of them into Claude, Cursor, ChatGPT, n8n, or any AI stack with one command.
Pi Code Reasoning
Code Reasoning tools for pi and MCP — reflective sequential thinking with branching and revision support
Server
MCP server for EastRouter — vision tools that route through your EastRouter API key.
Screenshot Website Fast
Fast screenshot capture tool for web pages - optimized for Claude Vision API
Sdk
Official TypeScript/JavaScript SDK for Licentric — license keys, machine activation, offline validation, AI agent monetization, and Stripe-driven license provisioning. Alternative to Keygen, Cryptlex, LemonSqueezy License Keys.
Apple App Store Connect
Complete Model Context Protocol (MCP) server for Apple's App Store Connect API — 1221 tools, 100% coverage. TestFlight, Xcode Cloud, Game Center, App Clips, in-app purchases, subscriptions, analytics, review submissions, provisioning. Works with Claude, C
Server Agentpay
MCP server for AgentPay — the payment gateway for autonomous AI agents. Discover, provision, and pay for MCP tool APIs. Includes reliability monitoring with circuit breakers and health metrics.
Awaithumans
HITL infrastructure for AI agents. Your agent calls awaitHuman(), a human reviews via Slack/email/dashboard, agent resumes with a typed response.
Gdelt
Search and analyze global news coverage and US television transcripts via the GDELT Project's real-time APIs via MCP. STDIO or Streamable HTTP.
Kastell
CLI toolkit for provisioning, securing, and managing self-hosted servers
Claude Video Vision
MCP server that gives Claude Code the ability to watch and understand videos — extracts frames via ffmpeg and processes audio via multiple backends
Mistral
MCP server exposing Mistral AI capabilities over MCP: chat, embeddings, FIM, vision, OCR, audio, agents, moderation, classification, files, batch, workflows, sampling, prompts, resources, and Streamable HTTP.
Scrcpy
MCP server for Android device control via ADB and scrcpy — gives AI agents vision and control over Android devices
Vision Link
Universal MCP server that gives AI assistants the ability to watch and understand videos — extracts frames via ffmpeg and processes audio via multiple backends
Mcp
Persistent memory for visual intelligence. MCP server for the Phototology registry: analyze photos with composable lenses, enrich files with EXIF/IPTC/XMP metadata, and look up cached results for free. Works with local files. Analyze once. Remember foreve
Pdf Toolkit
MCP server for PDF manipulation — 22 tools: render pages to images for vision, Markdown/template→PDF, AES-256 encrypt, form-preserving merge/split, fill forms, search, compare, QR codes, watermark. Zero-config, TypeScript-native.
Zerodb Sequential Thinking
Persistent sequential thinking MCP — chain-of-thought reasoning that survives sessions, resumes across agents, and saves conclusions as plan artifacts. Powered by ZeroDB. Zero-config: agents self-provision a ZeroDB account on first run, no signup required
Server
Perceptron MCP server for high-accuracy visual perception powered by fast, efficient vision-language models
Pixelfix
MCP tool for reading and analyzing images - giving AI the power of vision
Cody Cli
Cody CLI - AINative Studio's AI-powered coding assistant. Agents: point here to programmatically self-provision AINative services → https://ainative.studio/agent.md
Mcp
MCP server for Zindex - agent-native diagram state infrastructure. A thin HTTP client exposing tools (create, patch, validate, normalize, diff, render, list-revisions, get-scene, delete-scene, undelete-scene, list-recently-deleted, submit-to-support, publ
Testforge
TestForge MCP Server — AI-powered testing in your IDE. Analyzes code for security, unit tests, load, accessibility, vision alignment, scope coverage, and stack quality.
Simple Dynamsoft
MCP server for Dynamsoft SDKs - Capture Vision, Barcode Reader (Mobile/Python/Web), Dynamic Web TWAIN, and Document Viewer. Provides documentation, code snippets, and API guidance.
Emit
MCP server for atrib. The producer-side cognitive primitive: lets agents sign explicit observations, annotations, and revisions beyond what middleware auto-signs.
Vessel Browser
AI-native web browser runtime for autonomous agents with human supervision
About Vision skills on iClaude
iClaude is the universal install layer for AI skills. Every Vision skill on this page can be installed into Claude Code, Claude Desktop, Cursor, ChatGPT, n8n, Codex, and more — using a single copy-paste command. No config drift, no per-stack adapters, no manual MCP wiring.