Huge News!Announcing our $40M Series B led by Abstract Ventures.Learn More
Socket
Sign inDemoInstall
Socket

@xapp/arachne-cli

Package Overview
Dependencies
Maintainers
0
Versions
66
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

@xapp/arachne-cli

  • 1.8.13
  • latest
  • npm
  • Socket score

Version published
Maintainers
0
Created
Source

@xapp/arachne-cli

A command line crawler based on puppeteer

Example Usage

To crawl a site and save the pages to a local ./temp directory

$ arachne crawl http://www.thecoffeefaq.com/ -d ./temp

To also save markdown and schema.org FAQs

$ arachne  crawl http://www.thecoffeefaq.com/ -a -t markdown -d ./temp

With a whitelisted patterns file

$ arachne  crawl http://www.thecoffeefaq.com/ -a -t markdown -d ./temp -w ./temp/whitelist.md

With a settling period

$ arachne crawl http://www.thecoffeefaq.com/ -d ./temp -b 5000 -o 9000

Windows & WSL2 Notes

Follow the instructions here to setup: https://github.com/puppeteer/puppeteer/issues/1837#issuecomment-689006806

You will need to start XLaunch before running the CLI, select multiple windows, no client, turn off access control.

Another option is to add -h flag to run headless (no browser application launched).

If the normal commands don't work, you might need to pass in the executablePath (-e) and run headless (-h).

$ arachne crawl http://www.thecoffeefaq.com/ -e  /usr/bin/google-chrome -h

Licenses

FAQs

Package last updated on 25 Sep 2024

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts

SocketSocket SOC 2 Logo

Product

  • Package Alerts
  • Integrations
  • Docs
  • Pricing
  • FAQ
  • Roadmap
  • Changelog

Packages

npm

Stay in touch

Get open source security insights delivered straight into your inbox.


  • Terms
  • Privacy
  • Security

Made with ⚡️ by Socket Inc