from Hacker News

Show HN: Puremd-mcp – MCP server for reliably accessing web content

by andrethegiant on 4/1/25, 9:30 PM with 2 comments

  • by andrethegiant on 4/1/25, 9:35 PM

    This is the MCP version of the pure.md[1] REST API. You can integrate it with MCP clients like Cursor, Windsurf, and Claude Desktop to help them reliably get context from web content.

    If you'd like to use without configuring MCP, simply prefix any URL with `pure.md/` and you get the unblocked markdown content of that webpage. It avoids bot detection and renders JavaScript-heavy websites, and can convert HTML, PDFs, images, and more into pure markdown.

    pure.md acts as a global caching layer between LLMs and web content. I like to think of it like a CDN for LLMs, similar to how Cloudinary is a CDN for images.

    [1] https://pure.md/

  • by andrewfromx on 4/1/25, 9:35 PM

    nice, i've been using https://github.com/go-rod/rod or python playwright for this and doing the logic to clean the html into text. But this is nice 1 step solution.