Best Audio AI Skills & MCP Servers
17 curated Audio skills and MCP servers — install any of them into Claude, Cursor, ChatGPT, n8n, or any AI stack with one command.
Claude Video Vision
MCP server that gives Claude Code the ability to watch and understand videos — extracts frames via ffmpeg and processes audio via multiple backends
Mistral
MCP server exposing Mistral AI capabilities: chat, embeddings, FIM, vision, OCR, audio, agents, moderation, classification, files, batch, workflows, sampling, prompts, resources, and Streamable HTTP.
Mcpollinations
Model Context Protocol (MCP) server for the Pollinations APIs with image saving functionality.
Vision Link
Universal MCP server that gives AI assistants the ability to watch and understand videos — extracts frames via ffmpeg and processes audio via multiple backends
Server
Echosaw MCP Server - Media intelligence for AI assistants. Connect your LLM to Echosaw and analyze media directly within your workflow.
Stemsplit
MCP server for AI stem separation — remove vocals, isolate instrumentals, build karaoke tracks, or split audio into vocals, drums, bass, piano, guitar, and other stems from local files or YouTube URLs. Works in Claude Desktop, Cursor, Cline, Windsurf, Zed
Levea
MCP server for the Levea autonomous AI video editor — natural-language video editing (viral clips, captions, vertical reframe, chroma key, audio cleanup, motion tracking, B-roll, voiceover, music, MP4 export) for Claude Desktop, Claude Code, Cursor, Cline
Audio File App
An MCP app for inspecting audio files in audio workflows: playback, metadata, and statistics.
Screenpipe
MCP server for screenpipe: search your screen recordings and audio transcriptions.
Pagebolt
MCP server for PageBolt — take screenshots, generate PDFs, create OG images, inspect pages, record demo videos with Audio Guide narration, from AI coding assistants like Claude, Cursor, and Windsurf.
Windy Word
MCP server exposing Windy Word's agent-control surface as 95 typed tools (paste / hotkeys / transcription / recording verbs / audio devices / voice clones / install / Doctor / archive / translation / documents / soul-file).
Server
MCP server for the H-ear World audio classification API — connect Claude, ChatGPT, and other AI agents to 521+ sound classes
Whisper Windows
Windows-native MCP server for local audio transcription using whisper.cpp with Vulkan GPU acceleration
Notebooklm
MCP server for Google NotebookLM — chat, source ingestion, audio overviews, citations, stdio + Streamable-HTTP transports.
Sdk
Tencent Cloud MCP Server for SDK
Fish Audio
MCP server for Fish Audio Text-to-Speech integration
audioknihy-catalog
First Czech audiobook catalog MCP server with cross-partner price comparison. Aggregates Knihy Dobrovský, Martinus, Kosmas, Dobré-knihy, and Radiotéka. Read-only discovery for AI agents — search, browse genres, compare prices, find cheapest offers, lookup author/narrator p
About Audio skills on iClaude
iClaude is the universal install layer for AI skills. Every Audio skill on this page can be installed into Claude Code, Claude Desktop, Cursor, ChatGPT, n8n, Codex, and more — using a single copy-paste command. No config drift, no per-stack adapters, no manual MCP wiring.