About
A Docker-based MCP server that lets LLMs fetch, analyze, and download website assets (HTML, CSS, images, fonts) and generate sitemaps for full site cloning.
Capabilities
Site Cloner MCP Server
The Site Cloner MCP server lets large language models act as web scrapers and site duplicators. In many AI-driven development workflows, a user may ask an assistant to “clone this website” or “create a local copy of example.com.” An LLM has no direct access to the web, so it needs an external tool that can fetch pages, resolve relative paths, and persist assets. Site Cloner fills this gap by exposing a set of high-level tools that cover each step of the cloning process, from initial HTML retrieval to final asset download and sitemap generation.
At its core, the server offers six complementary tools. fetch_page pulls raw HTML from any reachable URL. extract_assets parses that HTML and returns the links to CSS, JavaScript, images, fonts, and other resources. download_asset saves each referenced file into a structured local directory, preserving the original relative paths. parse_css_for_assets goes one level deeper by inspecting CSS files for embedded references, so that fonts and background images are not missed. create_site_map crawls a site to an adjustable depth and yields a navigable map of pages that can be used for further analysis or incremental cloning. Finally, analyze_page_structure provides a quick structural overview of any fetched page, which is useful for UI testing or content extraction.
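To make the extraction steps concrete, here is a minimal sketch of what extract_assets and parse_css_for_assets might do, using only the Python standard library. The function names, the returned dictionary shape, and the regular expression are assumptions made for illustration; the server's actual implementation is not shown in this description and may differ.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin


class AssetCollector(HTMLParser):
    """Collect absolute URLs of stylesheets, scripts, and images (hypothetical helper)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.assets = {"css": [], "js": [], "images": []}

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        rel = (attr.get("rel") or "").lower()
        if tag == "link" and "stylesheet" in rel and attr.get("href"):
            self.assets["css"].append(urljoin(self.base_url, attr["href"]))
        elif tag == "script" and attr.get("src"):
            self.assets["js"].append(urljoin(self.base_url, attr["src"]))
        elif tag == "img" and attr.get("src"):
            self.assets["images"].append(urljoin(self.base_url, attr["src"]))


def extract_assets_sketch(html, base_url):
    """Return asset URLs found in `html`, resolved against the page URL."""
    collector = AssetCollector(base_url)
    collector.feed(html)
    return collector.assets


# Matches url(...) with optional quotes; an assumed, simplified pattern.
CSS_URL = re.compile(r"url\(\s*['\"]?([^'\")]+?)['\"]?\s*\)")


def css_asset_urls(css_text, css_url):
    """Resolve url(...) references relative to the stylesheet's own URL."""
    return [
        urljoin(css_url, ref)
        for ref in CSS_URL.findall(css_text)
        if not ref.startswith("data:")  # inline data URIs need no download
    ]
```

Note that references inside a stylesheet are resolved against the stylesheet's URL, not the page's, which is why the CSS step takes its own base URL.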
For developers building AI-powered tools, this server delivers several practical advantages. First, it removes the need to write custom web-scraping code for each new project; the LLM can simply invoke the pre-defined tools, keeping the developer’s focus on higher-level logic. Second, because the server runs in Docker and exposes a simple command interface, it can be launched on any machine that supports containers, giving consistent behavior across environments. Third, the asset-resolution logic automatically handles relative URLs and CSS-embedded resources, two areas that are a common source of bugs in hand-written scrapers.
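As an illustration of the relative-URL handling mentioned above, the following sketch resolves a reference against its page and maps the result to a local file path that mirrors the remote layout. The function name local_path_for and the site_clone output directory are hypothetical; the description does not specify how the server lays out its files.

```python
from pathlib import PurePosixPath
from urllib.parse import urljoin, urlparse


def local_path_for(url, output_dir="site_clone"):
    """Map an absolute URL to a relative local path (assumed layout: <dir>/<host>/<path>)."""
    parsed = urlparse(url)
    path = parsed.path
    # Directory-style URLs get an index.html file name.
    if not path or path.endswith("/"):
        path += "index.html"
    # Drop the root marker and any ".." so the result stays inside output_dir.
    parts = [p for p in PurePosixPath(path).parts if p not in ("/", "..", ".")]
    return str(PurePosixPath(output_dir, parsed.netloc, *parts))


# A reference such as "../img/a.png" is first resolved against its page URL.
absolute = urljoin("https://example.com/blog/post.html", "../img/a.png")
print(local_path_for(absolute))
```

This sketch ignores query strings and fragments, so two URLs that differ only in their query would map to the same file; a fuller implementation would need a rule for that case.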
Typical use cases include automated documentation generation for static sites, migration of legacy web pages to new hosting platforms, and creating offline copies for compliance audits. In a Cursor workflow, a user can configure the MCP server once and then ask Claude to clone a site; the assistant orchestrates the sequence of tool calls and returns a fully structured local copy ready for inspection or deployment. The server’s modular design also allows developers to extend it with custom tools, such as image optimization or HTML minification, without touching the core logic.
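The kind of orchestration described above, fetching a page, collecting its links, and following them to a fixed depth, can be sketched as a breadth-first crawl. This is an assumption about how a tool like create_site_map could work, not the server's confirmed algorithm. The names build_site_map and LinkCollector are hypothetical, and fetch is any caller-supplied function that returns HTML for a URL, so the sketch runs without network access.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect raw href values from anchor tags."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def build_site_map(start_url, fetch, max_depth=2):
    """Breadth-first crawl of same-host pages; returns {page_url: [linked_urls]}."""
    host = urlparse(start_url).netloc
    site_map = {}
    seen = {start_url}
    queue = deque([(start_url, 0)])
    while queue:
        url, depth = queue.popleft()
        collector = LinkCollector()
        collector.feed(fetch(url))
        links = []
        for href in collector.hrefs:
            # Resolve relative links and strip #fragments before comparing.
            link = urldefrag(urljoin(url, href)).url
            if urlparse(link).netloc == host and link not in links:
                links.append(link)
        site_map[url] = links
        # Only pages above the depth limit contribute new pages to visit.
        if depth < max_depth:
            for link in links:
                if link not in seen:
                    seen.add(link)
                    queue.append((link, depth + 1))
    return site_map
```

Passing fetch as a parameter keeps the crawl logic separate from network access, which makes it easy to test against a dictionary of canned pages.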
In summary, Site Cloner is a turnkey MCP solution that transforms an LLM into a web‑cloning agent. By handling the intricacies of page fetching, asset resolution, and site mapping, it lets developers leverage AI assistants for end‑to‑end website duplication tasks with minimal overhead.
Related Servers
MarkItDown MCP Server
Convert documents to Markdown for LLMs quickly and accurately
Context7 MCP
Real‑time, version‑specific code docs for LLMs
Playwright MCP
Browser automation via structured accessibility trees
BlenderMCP
Claude AI meets Blender for instant 3D creation
Pydantic AI
Build GenAI agents with Pydantic validation and observability
Chrome DevTools MCP
AI-powered Chrome automation and debugging