HTML parser based on the WHATWG HTML specification
Fast and robust extraction of original and updated publication dates from URLs and web pages.
A fast HTML5 parser with CSS selectors, written in Cython, using Modest and Lexbor engines.
fast html to text parser (article readability tool) with python 3 support
High-performance HTML to Markdown converter powered by Rust with a clean Python API
HTML parser based on the WHATWG HTML specification
Port of Readability HTML parser in Python
Python parser for Apache/nginx-style HTML directory listing.
HTML parser used by django-components written in Rust.
A small and simple HTML table parser not requiring any external dependency.
Parse SEC EDGAR HTML documents into a tree of elements that correspond to the visual structure of the document.
HTML to Markdown converter
Fast C based HTML 5 parsing for python
Python ctypes bindings for reliq
The fast, most optimal, and correct HTML & XML parsing library.
A module to parse metadata out of urls and html documents
Extended Python bindings for the Comrak Rust library, a fast CommonMark/GFM parser
The Style of Markdown with the Power of LaTeX.
UNKNOWN
HTML parser based on the WHATWG HTML specification
Professional web content fetching and extraction toolkit with configurable extraction methods and domain caching
A library for converting DOCX documents to HTML and plain text
A Python library for extracting and parsing Next.js hydration data from HTML content
A wrapper for requests for integration with html tree parsers
A simple HTML Parser
Convert html to snippets
Parse html content of Yandex
Python bindings for Gumbo HTML parser
Fast C based HTML 5 parsing for python, fork of Kovid Goyal html5-parser
HTML parser based on the WHATWG HTML specification
Lightning-fast HTML parser and data extractor with WebPage API - BeautifulSoup alternative built in Rust
EditorJS.py
Scrapery: A fast, lightweight library to scrape HTML, XML, and JSON using XPath, CSS selectors, and intuitive DOM navigation.
A Python NextJS data parser from HTML
Pure-Python HTML parser with ElementTree support.
A node parser which can create a hierarchy of all code scopes in a directory.
Web Crawler, HTML Parser, and Data Visualization