npm

Crawler

crawler4nodejs

Crawler4nodejs is an open-source web crawler for Node.js that provides a simple interface for crawling the Web.

published 1.1.5 • 6 years ago

web-auto-extractor

Automatically extracts structured information from webpages

vasanthgopal

published 1.0.17 • 8 years ago

jopi-crawler

A crawler for downloading websites

johan.piquet

published 1.0.16 • 5 days ago

thredds-catalog-crawler

A module for crawling thredds catalogs

published 0.0.7 • last year

sitemap-generator

Easily create XML sitemaps for your website.

published 8.5.1 • 5 years ago

crawler-ninja-logger

Common log method for all crawler.ninja plugins

christophebe

published 1.0.7 • 9 years ago

crawlerlibrary

Web crawler library

amrutakapure

published 1.0.17 • 5 years ago

pdfdataextract

Extract data from a pdf with pure javascript

published 4.0.0 • 5 months ago

crawler-result-store

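`queueItem` (Object): the URL object currently being crawled. `queueItem.analysisUrlResult` (Array<Object>): the array of URLs obtained by analyzing the HTML page. `queueItem.analysisResult` (Object): the data obtained by analyzing the HTML page. `queueIte`…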

published 1.0.25 • 9 years ago

@vendasta/web-crawler

SDK to interact with the web-crawler service

published 3.16.0 • 6 months ago

pdf-parse-new

Pure javascript cross-platform module to extract text from PDFs.

simone.gosetto

published 1.4.1 • 3 months ago

robots-txt-parser

A lightweight robots.txt parser for Node.js with support for wildcards, caching and promises.

published 2.0.3 • 3 years ago

@qualweb/crawler

Webpage crawler for qualweb

carlosapaduarte

published 0.4.2 • 8 months ago

crawler-js

Opensource Framework Crawler in Node.js

rodrigorizando

published 2.0.10 • 7 years ago

crawler-ts-fetch

Lightweight crawler written in TypeScript using ES6 generators.

crawling-framework

supergillis

published 1.1.1 • 5 years ago

puppeteer-afp

Stop website fingerprinting techniques

detection-evasion

published 1.1.6 • last year

pdf-parse-fork

Pure javascript cross-platform module to extract text from PDFs.

published 1.2.0 • 2 years ago

get-all-files

A blazing fast recursive directory crawler with lazy sync and async iterator support.

tomeraberbach

published 6.0.0 • last week

seenreq

A library to test whether a URL (request) has already been crawled, typically used in a web crawler. Compatible with `request` and `node-crawler`.

remove duplicate url

request normalize

published 3.0.0 • 6 years ago

test-crawler

**[★ Online documentation ★](https://apiel.github.io/test-crawler/)**

published 3.5.7 • 5 years ago

headless-chrome-crawler

Distributed web crawler powered by Headless Chrome

published 1.8.0 • 7 years ago

@types/npm-license-crawler

TypeScript definitions for npm-license-crawler

published 0.2.3 • 2 years ago

n8n-nodes-zca-crawler

n8n node integration zca

n8n-community-node-package

hoanganh-nguyen

published 0.1.8 • 4 days ago

node-scrapy

Simple, lightweight and expressive web scraping with Node.js

stefanmaric

published 0.5.0 • 5 years ago

crawler-hangzhou

A web spider of hangzhou

published 1.2.2 • 8 years ago

crawler-detect

published 1.0.1 • 7 years ago

crawler_teach

Crawler for big-tech company interview experience posts

published 1.0.2 • 4 years ago

html-metadata-parser

Html Metadata scraper and parser for Node.js

html metadata parser

html metadata crawler

html metadata scraper

metadata parser

published 2.0.4 • 4 years ago

@mertsolak/sitemap-crawler

Developed to create sitemap easily.

published 1.0.2 • 9 months ago

crawler.plugins.common

Common code for crawlers

published 0.1.23 • 8 years ago

crawler-lian

A configuration-based crawler framework

published 1.0.7 • 5 years ago

crawler-ts

Lightweight crawler written in TypeScript using ES6 generators.

crawling-framework

supergillis

published 1.1.1 • 5 years ago

crawler-ts-htmlparser2

Lightweight crawler written in TypeScript using ES6 generators.

crawling-framework

supergillis

published 1.1.1 • 5 years ago

@apify/n8n-nodes-apify-content-crawler

n8n nodes for Apify

n8n-community-node-package

apify-service-account

published 0.0.6 • 5 days ago

better-sitemap-crawler

To install:

published 0.0.1 • 5 months ago

pdf-parse-debugging-disabled

Pure javascript cross-platform module to extract text from PDFs.

ymansurozer

published 1.1.1 • 3 years ago

license-crawler

Crawls an npm package and its dependencies for their licenses

published 0.0.5 • 7 years ago

js-crawler

Web crawler for Node.js

website-crawler

published 0.3.21 • 7 years ago

pdf-page-counter

Pure javascript cross-platform module to extract page count from PDFs, based on pdf-parser.

pdf-page-counter

published 1.0.3 • 5 years ago

firecrawl

JavaScript SDK for Firecrawl API

hello_sideguide

published 4.3.4 • 3 days ago

node-html-crawler

Crawler (spider) of site web pages by domain name

published 1.2.3 • 4 years ago

@attestate/crawler

@attestate/crawler is a tool chain to retrieve on-chain data from Ethereum.

published 0.6.4 • 4 weeks ago

@the-convocation/twitter-scraper

A port of n0madic/twitter-scraper to Node.js.

published 0.18.2 • 2 weeks ago

crypto-crawler

Crawl orderbook and trade messages from crypto exchanges.

soulmachine

published 3.1.9 • 5 years ago

pdf-extraction

Pure javascript cross-platform module to extract text from PDFs.

fellow-workers

published 1.0.2 • 5 years ago

jarviscrawlercore

jarvis crawler core

published 0.6.48 • 4 years ago

crawler-links

Node.js web crawler to get all internal links from a website.

published 1.0.1 • last month

@attestate/crawler-call-block-logs

An attestate crawler strategy to download and transform Ethereum block event logs

published 0.5.2 • 4 weeks ago

@ptrumpis/snap-lens-web-crawler

Crawl and download Snap Lenses from *lens.snapchat.com* with ease.

published 1.2.4 • 2 months ago

mcp-web-crawler

crawler

published 0.0.2 • 2 days ago

Made with ⚡️ by Socket Inc

U.S. Patent No. 12,346,443 & 12,314,394. Other pending.