loading…
Search for a command to run...
loading…
Web extraction MCP server for AI agents. Extract structured data from any URL with built-in Cloudflare bypass, JavaScript rendering, and intelligent parsing. Re
Web extraction MCP server for AI agents. Extract structured data from any URL with built-in Cloudflare bypass, JavaScript rendering, and intelligent parsing. Returns clean markdown or JSON.
This is the canonical public MCP package working tree for @hauntapi/mcp-server and GitHub Darko893/mcp-server. The live Haunt API app/site lives separately in /root/haunt. Do not confuse this with /root/haunt/mcp-server, which is a legacy/local copy inside the app repo.
Give Claude, Cursor, Windsurf, and other MCP-compatible agents a clean web extraction tool.
Haunt turns permitted public web pages into structured JSON using natural-language prompts. It can also return clean Markdown for agents, RAG pipelines, notes, and .md files when you need page text instead of typed fields. It is built for agent workflows that need product data, competitor pricing, article content, metadata, lead lists, research snippets, Markdown page text, or other visible web data without maintaining brittle selectors.
For the clean CLI path, run:
npx -y --package @hauntapi/cli@latest haunt-cli init
If you already have a Haunt API key:
HAUNT_API_KEY=PASTE_YOUR_KEY_HERE npx -y --package @hauntapi/cli@latest haunt-cli init
The CLI prints the MCP config for Claude, Cursor, Windsurf, and other MCP-compatible clients. It does not edit config files for you.
Install the MCP server and call try_demo_extract first. It returns a fixed sample extraction JSON/trace plus docs, signup, pricing, and free-tier links without using credits.
Use Haunt's try_demo_extract tool and show me the signup and docs links.
Get a free API key: https://hauntapi.com/#signup
Add this to your MCP client config:
{
"mcpServers": {
"haunt": {
"command": "npx",
"args": ["-y", "@hauntapi/mcp-server"],
"env": {
"HAUNT_API_KEY": "your-api-key"
}
}
}
}
Then ask your agent:
Use Haunt to extract the product name, price, availability, and review count from https://example.com/product
For Markdown output:
Use Haunt extract_markdown to turn https://example.com/docs into clean Markdown.
Haunt does not promise universal extraction, CAPTCHA solving, login-wall access, paywall access, or anti-bot circumvention. It works best on permitted public pages and supported rendered pages. When a page is blocked, login-required, CAPTCHA-gated, or too thin to verify, Haunt returns a clear failure signal instead of fabricated data.
try_demo_extractNo-key activation check. Returns a fixed sample extraction JSON/trace, Haunt's demo endpoint, docs, signup, pricing, MCP info route, and free-tier details. Use this first when a user has installed the MCP server but has not added HAUNT_API_KEY yet.
extract / extract_urlGeneral-purpose extraction from permitted public web pages. extract and extract_url are aliases so hosted MCP docs and local stdio package users can follow the same wording.
Use it for:
It accepts optional response_format values: json (default), markdown / md, and raw_html / html.
extract_markdownReturn clean Markdown from a permitted public page. Use this when the agent or workflow wants readable page text for RAG, notes, docs ingestion, or saving as a .md file instead of structured fields.
extract_articleExtract article fields from news, blog, and editorial pages.
Returns title, body text, author, and publish date when available.
extract_metadataExtract page metadata including title, description, Open Graph tags, Twitter Card tags, canonical URL, and related metadata.
get_usageCheck current plan, monthly credit limit, used credits, reserved credits, and remaining credits. Use this after a live extraction to see what was charged.
error_code, captcha_provider, requires_human_verification).md files| Plan | Credits | Price |
|---|---|---|
| Free | 1,000/mo | £0 |
| Starter | 10,000/mo | £19/mo |
| Pro | 30,000/mo | £49/mo |
| Scale | 80,000/mo | £99/mo |
Credits are not one-to-one requests. Simple public/non-LLM output usually uses 1 credit, normal structured extraction 2, browser-rendered or authenticated extraction 4, and heavy/screenshot extraction 8. Failed, blocked, login/CAPTCHA, provider, and server failures do not burn credits.
Upgrade: https://hauntapi.com/#pricing
Run in your terminal:
claude mcp add hauntapi-mcp-server -- npx Yes, @Hauntapi/ Server MCP is free — one-click install via Unyly at no cost.
No, @Hauntapi/ Server runs without API keys or environment variables.
A hosted option is available: Unyly runs the server in the cloud, no local setup required.
Open @Hauntapi/ Server on unyly.org, pick your client tab (Claude Desktop, Claude Code, Cursor) and press Install — the config is generated automatically, no JSON editing.
CSA PROJECT - FZCO © 2026 IFZA Business Park, DDP, Premises Number 31174 - 001
Security
Low riskAutomated heuristic from public metadata — not a security guarantee.