HTML parser based on the WHATWG HTML specification
Fast and robust extraction of original and updated publication dates from URLs and web pages.
Fast HTML5 parser with CSS selectors.
fast html to text parser (article readability tool) with python 3 support
Python parser for Apache/nginx-style HTML directory listing.
Port of Readability HTML parser in Python
HTML parser based on the WHATWG HTML specification
A small and simple HTML table parser not requiring any external dependency.
Parse SEC EDGAR HTML documents into a tree of elements that correspond to the visual structure of the document.
Python ctypes bindings for reliq
HTML parser used by django-components written in Rust.
HTML to Markdown converter
Fast C based HTML 5 parsing for python
UNKNOWN
The fast, most optimal, and correct HTML & XML parsing library.
A module to parse metadata out of urls and html documents
The Style of Markdown with the Power of LaTeX.
HTML parser based on the WHATWG HTML specification
HTML parser based on the WHATWG HTML specification
A library for converting DOCX documents to HTML and plain text
easyscrapper is a fast, lightweight Python package and CLI tool that lets developers, data scientists, and AI engineers extract text, HTML, emails, links, canonical, meta and images from any public webpage - perfect for AI, RAG pipelines, SEO, content aggregation, and scalable data workflows with just one command or a few lines of code.
Convert html to snippets
A parser that parses articles from any url or html
A node parser which can create a hierarchy of all code scopes in a directory.
A simple HTML Parser
A Python NextJS data parser from HTML
A wrapper for requests for integration with html tree parsers
Pure-Python HTML parser with ElementTree support.
A parser for HTML templates.
A node parser which can create a hierarchy of all code scopes in a directory.
EditorJS.py
HTML parser with an lxml backend. Implements a subset of BeautifulSoup API and is an order of magnitude faster
Template-based HTML-to-JSON parser.
Translate documents and webpages to various markup languages and document formats (html, epub, mobi ..)
A toolkit for quickly performing crawler functions
HTML table parser that supports rowspan, colspan, links and nested tables. Fast, lightweight with no external dependencies.