
Soup Stars is a framework for building web parsers with Python. It is designed to make building, deploying, and scheduling web parsers easier by simplifying what you need to get started.
pip install soupstars
Create a new parser by typing soupstars create into a terminal and supplying the name of a Python module.
soupstars create myparser.py
Soup Stars will use a template parser to help you get started. This example creates a parser that extracts headlines from articles on the New York Times website.
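from soupstars import follow, parse

url = "https://www.nytimes.com"

# Only follow links on www.nytimes.com whose URL contains a date such as 2019/05/14.
@follow
def follow(url):
    return (url.domain == "www.nytimes.com") and (url.match(r"\d{4}/\d{2}/\d{2}"))

# Extract the headline from each page.
@parse
def h1(soup):
    return soup.h1.text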
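The follow rule in the template keeps only links on www.nytimes.com whose URL contains a date such as 2019/05/14. As a rough illustration of that check outside Soup Stars, the sketch below reproduces it with the standard library alone. The helper name should_follow is hypothetical and not part of the Soup Stars API. It assumes that url.domain corresponds to the hostname and that url.match behaves like a regex search over the URL path.

```python
import re
from urllib.parse import urlparse

# Same date pattern as the template parser: four digits, two digits, two digits.
DATE_PATTERN = re.compile(r"\d{4}/\d{2}/\d{2}")


def should_follow(link: str) -> bool:
    """Hypothetical stand-in for the follow rule, using only the standard library."""
    parsed = urlparse(link)
    on_site = parsed.hostname == "www.nytimes.com"
    has_date = DATE_PATTERN.search(parsed.path) is not None
    return on_site and has_date


if __name__ == "__main__":
    print(should_follow("https://www.nytimes.com/2019/05/14/world/example.html"))
    print(should_follow("https://www.nytimes.com/section/world"))
    print(should_follow("https://example.com/2019/05/14/post.html"))
```

Section pages and links to other sites are rejected, so a crawl driven by a rule like this would stay on dated article pages.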
You can test that the parser functions correctly.
soupstars run myparser
Use soupstars --help to see a full list of available commands.
More documentation is available here.
Start the docker services.
docker-compose up -d
Set up the containers.
docker-compose exec web flask s3 mb soupstars-archive
docker-compose exec web flask db upgrade
docker-compose exec web flask seed schedules
docker-compose exec web flask seed plans
docker-compose exec web flask seed user
docker-compose exec web flask seed parsers
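The setup commands above are listed in a fixed order, and it seems reasonable to stop as soon as one of them fails. One way to script that is a small Python wrapper, sketched here under the assumption that docker-compose is on the PATH and the services are already running. The names SETUP_STEPS, build_commands, and run_setup are illustrative and not part of the project.

```python
import subprocess

# The flask subcommands from the setup steps, in the same order.
SETUP_STEPS = [
    ["s3", "mb", "soupstars-archive"],
    ["db", "upgrade"],
    ["seed", "schedules"],
    ["seed", "plans"],
    ["seed", "user"],
    ["seed", "parsers"],
]

# Every step runs flask inside the web container.
PREFIX = ["docker-compose", "exec", "web", "flask"]


def build_commands():
    """Return the full argument list for each setup step."""
    return [PREFIX + step for step in SETUP_STEPS]


def run_setup(dry_run=True):
    """Print each command; when dry_run is False, also execute it.

    check=True raises CalledProcessError on the first failing step,
    so later steps are skipped.
    """
    for command in build_commands():
        print(" ".join(command))
        if not dry_run:
            subprocess.run(command, check=True)


if __name__ == "__main__":
    run_setup(dry_run=True)
```

The default dry run only prints the commands. When the script is run without an attached terminal, docker-compose exec may need its -T flag to disable pseudo-TTY allocation.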
Run the tests.
docker-compose run --rm client pytest -vs
New tags that pass on CI will automatically be pushed to Docker Hub.
Deploying to PyPI requires manually running the following commands.
pip3 install twine
python3 setup.py sdist bdist_wheel
twine upload dist/*