About
An MCP server that wraps yt‑dlp to provide video and audio download, metadata extraction, subtitle handling, and YouTube search for large language models. It supports privacy‑focused direct downloads and integration with MCP‑compatible tools like Dive.
Capabilities
Overview
The yt‑dlp‑mcp server bridges the gap between large language models and real‑world multimedia content by exposing a suite of tools that let an AI assistant interact with video platforms such as YouTube, Facebook, and TikTok. It solves a common pain point for developers: retrieving rich media data—metadata, subtitles, audio, and video itself—without having to write custom scrapers or deal with the complexities of each platform’s API. By wrapping yt‑dlp behind the Model Context Protocol, developers can embed media retrieval directly into conversational flows, enabling assistants to answer questions about video content, provide downloadable resources, or analyze subtitles for sentiment and topic modeling.
At its core, the server offers a collection of high‑level operations. search_videos lets an assistant surface relevant clips based on user queries, while get_video_metadata and get_video_metadata_summary return structured or human‑readable descriptions of any video URL. Subtitles are handled through list_subtitle_languages, download_video_subtitles, and download_transcript, giving LLMs access to clean, timestamp‑free text that can be fed into downstream NLP tasks. For users who need the actual media, download_video and download_audio provide resolution‑controlled downloads directly to a local Downloads folder, supporting optional trimming via start and end times. The server also respects privacy by performing direct downloads without external tracking.
Developers find these capabilities invaluable when building AI‑powered media assistants. For instance, a learning platform can let students search for tutorial videos, retrieve transcripts for note‑taking, and download audio clips for offline study. Content creators can automate the collection of competitor videos’ metadata or subtitles to inform strategy. A customer support bot could fetch and summarize product demo videos on demand, improving response times.
Integration is straightforward: any MCP‑compatible LLM (such as Dive or Claude) can invoke the server’s tools by name, passing JSON‑formatted arguments. The server handles all communication with yt‑dlp and returns results in a consistent structure, freeing developers from handling platform quirks or parsing raw HTML. The privacy‑focused design further appeals to applications that must avoid third‑party tracking.
Unique advantages of yt‑dlp‑mcp include its comprehensive subtitle support—auto‑generated captions and multiple language options—and the ability to trim videos on download, saving bandwidth. The server’s modular toolset means developers can cherry‑pick only the operations they need, keeping models lightweight while still offering rich media functionality. Overall, this MCP server empowers AI assistants to seamlessly fetch, analyze, and deliver video content in real‑time, unlocking a wide range of interactive media applications.
Related Servers
n8n
Self‑hosted, code‑first workflow automation platform
FastMCP
TypeScript framework for rapid MCP server development
Activepieces
Open-source AI automation platform for building and deploying extensible workflows
MaxKB
Enterprise‑grade AI agent platform with RAG and workflow orchestration.
Filestash
Web‑based file manager for any storage backend
MCP for Beginners
Learn Model Context Protocol with hands‑on examples
Weekly Views
Server Health
Information
Explore More Servers
WorkOS MCP Server
Connect agents to WorkOS API via Cloudflare Workers
YouTube MCP Server
AI‑powered YouTube discovery without the official API
BrowserBee MCP Demo Server
Demo MCP server for BrowserBee integration
Seta MCP
Local Docs, Offline AI Context
ScrAPI MCP Server
Web scraping made simple with SCP-powered MCP
GitHub Chat MCP
Analyze GitHub repos with Claude using the GitHub Chat API