Introducing Socket Firewall: Free, Proactive Protection for Your Software Supply Chain.Learn More
Socket
Book a DemoInstallSign in
Socket

site-pages-sampler

Package Overview
Dependencies
Maintainers
1
Versions
6
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

site-pages-sampler

A simple crawler to get page URL samples in a website.

latest
npmnpm
Version
0.3.3
Version published
Maintainers
1
Created
Source

To get 10 URLs of webpage in the website from a starting URL.

site-pages-sampler -l 10 -s 'https://www.ideamans.com/'

Usage

site-pages-sampler <url>

Starts to crawl samples pages from the URL.

Options:
  --help                 Show help                                     [boolean]
  --version              Show version number                           [boolean]
  --user-agent-type, -u  User agent type. (mobile|desktop)   [default: "mobile"]
  --limit, -l            Limit of total sample pages.             [default: 100]
  --limit-each           Sample page links from each page.        [default: 100]
  --concurrency, -c      Concurrency of requests.                   [default: 8]
  --timeout-each         Timeout for each page request. (seconds)
  --timeout              Total timeout. (seconds)                  [default: 30]
  --url-hash             Recognizes url with hash as unique.
                                                      [boolean] [default: false]
  --verify, -v           Verifies each url can be got.[boolean] [default: false]
  --shuffle              Shuffles links order.        [boolean] [default: false]
  --debug, -d            Outputs debug logs to stderr.[boolean] [default: false]
  --page-extnames        Comma separated extension names of pages.
                              [default: ",.html,.htm,.php,.asp,.aspx,.jsp,.cgi"]
  --directory-index      Comma separated directory index minimatch patterns.
                                                  [default: "index.*,Default.*"]
  --ignore-param         Comma separated search param minimatch patterns to
                         ignore.                  [default: "index.*,Default.*"]
  --format, -f           Output format. (text|json)            [default: "text"]

FAQs

Package last updated on 23 Apr 2020

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts