HTML parser with an lxml backend. Implements a subset of BeautifulSoup API and is an order of magnitude faster
Python 3 HTML meta tag parser, with emphasis on complex meta tag structures with support for OpenGraph and Twitter Card tags, including array handling
A readability parser which can extract title, content, images from html pages
Template-based HTML-to-JSON parser.
Translate documents and webpages to various markup languages and document formats (html, epub, mobi ..)
Markdown articles downloader and converter
A Django command line tool for importing HTML, XML and JSON data to models via XSLT mapping
HTTP request tool with a little functionality
A tool for extracting Indicators of Compromise from security reports
Parse html content of Mail.ru
easyscrapper is a fast, lightweight Python package and CLI tool that lets developers, data scientists, and AI engineers extract text, HTML, emails, links, canonical, meta and images from any public webpage - perfect for AI, RAG pipelines, SEO, content aggregation, and scalable data workflows with just one command or a few lines of code.
Generate an HTML report for line_parser
Package to parse rutracker.org forum
Frequently used functions for html parsing with beautifulsoup4 https://pypi.org/project/beautifulsoup4/
Package for parsing moodle quiz HTML documents
CriticMarkdup parser with optional pandoc backend
Python bindings for Gumbo HTML parser
parses youtube content
Lightning-fast web scraping Python SDK - 11x faster than traditional scrapers
A basic HTML parser in Python
A web parser for tabular and/or paginated data
this is parser of HTML meta tag
Web Crawler, HTML Parser, and Data Visualization
HTML parser ant text retriever using user defined rule set
HTML Parser of Economic Research Institute Cost of Living HTML.
Simple library for parsing SEC forms
Custom wikitext parser to produce html, plain text fields and relevant links from wikipedia page source code.
An Office Open XML parser that outputs HTML.
HTML form parser for humans
AI HTML Parser
Static documentation extraction tool for python language
Parse your browser's exported HTML bookmark file to Markdown.
Wikipedia parser
Parser UTF8/HTML <-> pure HTML -> UTF8/Markdown
Your package description.
Xcrap Parser is a declarative, model-driven parser for extracting data from HTML and JSON files, with the ability to interleave both to extract even more information.
Easy html parser with Jquery selector
A Python library for comprehensive on-page SEO analysis of HTML content.
Python 3 template parser to generate HTML from a pug/jade like syntax