
Product
Introducing Rust Support in Socket
Socket now supports Rust and Cargo, offering package search for all users and experimental SBOM generation for enterprise projects.
Sitemappy (or sitemap-py 😉) is a crawler that produces a sitemap for a given website.
Sitemappy is a command-line application, and also provides Python interfaces for use as a library.
asyncio
, multithreading
, multiprocessing
)robots.txt
multiprocessing
Generate a sitemap (./results.json
):
sitemappy-cli https://monzo.com/
$ sitemappy-cli --help
usage: sitemappy-cli [-h] BASE_URL
Sitemappy is a CLI tool to crawl a website and create a sitemap.
For more information about the tool go to https://github.com/dan-wilton/sitemappy/
Arguments:
BASE_URL a valid website URL to sitemap [required]
Options:
--workers INTEGER Number of workers to asynchronously
make web requests [default: 10]
--crawl-depth INTEGER Depth of links from base URL to follow
[default: 0 - unlimited]
--politeness-delay INTEGER Delay between each request to the website
[default: 0 - none]
--enable-cmd-out Print output to cmd
--help show this help message and exit
Python 3.12+
To use the sitemappy CLI:
pip install --user -U sitemappy-cli
pdm install
Run the tests with:
pytest -v
Use sitemappy in your project with one of the following:
with pip:
pip install -U sitemappy-cli
with PDM:
pdm add sitemappy-cli
with Poetry >= 1.2.0:
poetry add sitemappy-cli
NOTE: This is not yet enabled 😢
via homebrew:
brew install sitemappy-cli
FAQs
A Python library for crawling a website and returning a sitemap.
We found that sitemappy-cli demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Product
Socket now supports Rust and Cargo, offering package search for all users and experimental SBOM generation for enterprise projects.
Product
Socket’s precomputed reachability slashes false positives by flagging up to 80% of vulnerabilities as irrelevant, with no setup and instant results.
Product
Socket is launching experimental protection for Chrome extensions, scanning for malware and risky permissions to prevent silent supply chain attacks.