
Security News
npm Adopts OIDC for Trusted Publishing in CI/CD Workflows
npm now supports Trusted Publishing with OIDC, enabling secure package publishing directly from CI/CD workflows without relying on long-lived tokens.
sitemapcrawler is a simple, blocking Python Crawler that is the backbone of a few other projects.
You're welcome to use it, but it's only as modular as we've needed it to be, which is to say, probably not fit for projects that aren't built with this in mind.
It works pretty simply.
pip install sitemapcrawler
from sitemapcrawler import Crawler
crawler = Crawler(domain="https://yourdomain.com", sitemap="https://yourdomain.com/sitemap.xml", fetch=True)
crawler.run()
If you just want to fetch a given page, create an instance of the crawler and call it like this:
crawler.fetch_page(url="https://yourdomain.com/blog/title")
The init
will create a nanoid crawl_id
so that when results are persisted, they'll be associated to a given crawl, to make it easy for reports to be built against crawls and such.
python3 -m build
python3 -m twine upload dist/* --skip-existing
FAQs
A simple sitemap crawler that acts as the backbone for other operations
We found that sitemapcrawler demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
npm now supports Trusted Publishing with OIDC, enabling secure package publishing directly from CI/CD workflows without relying on long-lived tokens.
Research
/Security News
A RubyGems malware campaign used 60 malicious packages posing as automation tools to steal credentials from social media and marketing tool users.
Security News
The CNA Scorecard ranks CVE issuers by data completeness, revealing major gaps in patch info and software identifiers across thousands of vulnerabilities.