Create xml sitemaps from the command line.
A crawler module designed for Blocklets. It supports batch crawling of HTML, webpage screenshots, title, description, and more, based on URL or Sitemap.
Crawls web urls from a list
Crawls information from public netatmo stations
An implementation of a simple web crawler capable of producing streams of page objects
A list of common crawler agents used on Internet..
http request for web scraping
[](https://www.npmjs.com/package/recrawl) [](https://github.com/aleclarson/recrawl/actions/workflows/release.yml) [![codeco
Tool to crawl events, leagues and statistics from WBSC based websites.
A web scraper for NodeJs
## Preconditions
A Twitter crawler helper with auth
This express middleware provides pre-rendered HTML generated by SnapKit for Blocklets, enabling them to return complete HTML content to web spider. This is essential for SEO and ensuring that search engines can properly index dynamically generated content
Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on demand.
A tiny node module to detect spiders/crawlers quickly and comes with optional middleware for ExpressJS
A robust GitHub API crawler that walks a queue of GitHub entities retrieving and storing their contents.
## 下载htlm文档中间件
Phâm tích html
爬虫调度程序
Crawler4nodejs is an open source web crawler for Node.js which provides a simple interface for crawling the Web.
Easy to write build preload script in nightmare
Vercel integration for SnapCrawl. Serve pre-rendered HTML to crawlers in Next.js middleware or Edge Functions for static SPAs and Express apps.
Provides support for calling our backend REST APIs in the polar cloud.
Common resources for web crawler events
JavaScript SDK for Firecrawl API
Ravencoin node crawler
Script to monitor & download Twitter Spaces 24/7
Web crawler and API for aggregating and serving digital rights organizations' publications.
Clado Model Context Protocol Server
RINGOC爬虫工具包