
Security News
Meet Socket at Black Hat Europe and BSides London 2025
Socket is heading to London! Stop by our booth or schedule a meeting to see what we've been working on.
@mdream/crawl
Advanced tools
Mdream Crawl generates comprehensive llms.txt artifacts from a single URL, using mdream to convert HTML to Markdown.
Multi-page website crawler that generates comprehensive llms.txt files by following internal links and processing entire websites using mdream HTML-to-Markdown conversion.
Note: For single-page HTML-to-Markdown conversion, use the
mdreambinary instead.@mdream/crawlis specifically designed for crawling entire websites with multiple pages.
npm install @mdream/crawl
Simply run the command to start the interactive multi-page website crawler:
npx @mdream/crawl
The crawler will automatically discover and follow internal links to crawl entire websites. The interactive interface provides:
You can also use @mdream/crawl programmatically in your Node.js applications:
import { crawlAndGenerate, generateLlmsTxt } from '@mdream/crawl'
// Crawl entire websites programmatically
const results = await crawlAndGenerate({
urls: ['https://docs.example.com'], // Starting URLs for website crawling
outputDir: './output',
maxRequestsPerCrawl: 100, // Maximum pages per website
generateLlmsTxt: true,
followLinks: true, // Follow internal links to crawl entire site
maxDepth: 3, // How deep to follow links
driver: 'http', // or 'playwright' for JS-heavy sites
verbose: true
})
// Generate llms.txt manually from existing results
await generateLlmsTxt({
siteName: 'Example Site',
description: 'Documentation for Example Site',
results: crawlResults,
outputPath: './output/llms.txt'
})
The crawler generates comprehensive output from entire websites:
.md file per crawled page with clean markdown content# example.com
## Pages
- [Example Domain](https---example-com-.md): https://example.com/
- [About Us](https---example-com-about.md): https://example.com/about
Perfect for:
Not suitable for: Single-page conversions (use mdream binary instead)
MIT
FAQs
Mdream Crawl generates comprehensive llms.txt artifacts from a single URL, using mdream to convert HTML to Markdown.
The npm package @mdream/crawl receives a total of 32 weekly downloads. As such, @mdream/crawl popularity was classified as not popular.
We found that @mdream/crawl demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Security News
Socket is heading to London! Stop by our booth or schedule a meeting to see what we've been working on.

Security News
OWASP’s 2025 Top 10 introduces Software Supply Chain Failures as a new category, reflecting rising concern over dependency and build system risks.

Research
/Security News
Socket researchers discovered nine malicious NuGet packages that use time-delayed payloads to crash applications and corrupt industrial control systems.