HTML parser based on the WHATWG HTML specification
Fast and robust extraction of original and updated publication dates from URLs and web pages.
Fast HTML-to-text parser (article readability tool) with Python 3 support
Fast HTML5 parser with CSS selectors.
Python parser for Apache/nginx-style HTML directory listing.
Port of Readability HTML parser in Python
HTML parser based on the WHATWG HTML specification
HTML parser used by django-components written in Rust.
A small and simple HTML table parser not requiring any external dependency.
Parse SEC EDGAR HTML documents into a tree of elements that correspond to the visual structure of the document.
HTML to Markdown converter
Python ctypes bindings for reliq
The Style of Markdown with the Power of LaTeX.
A fast, optimized, and correct HTML & XML parsing library.
A module to parse metadata out of URLs and HTML documents
UNKNOWN
Fast C-based HTML5 parsing for Python
HTML parser based on the WHATWG HTML specification
HTML parser based on the WHATWG HTML specification
A library for converting DOCX documents to HTML and plain text
A simple HTML Parser
A parser that extracts articles from any URL or HTML
Convert HTML to snippets
EditorJS.py
A node parser which can create a hierarchy of all code scopes in a directory.
Parse HTML content of Yandex
A tool for extracting Indicators of Compromise from security reports
A Python parser for Next.js data embedded in HTML
A wrapper for requests for integration with html tree parsers
A toolkit for quickly performing crawler functions
A node parser which can create a hierarchy of all code scopes in a directory.
HTTP request tool with a little functionality
Pure-Python HTML parser with ElementTree support.
easyscrapper is a fast, lightweight Python package and CLI tool that lets developers, data scientists, and AI engineers extract text, HTML, emails, links, canonical, meta and images from any public webpage - perfect for AI, RAG pipelines, SEO, content aggregation, and scalable data workflows with just one command or a few lines of code.
Translate documents and webpages to various markup languages and document formats (HTML, EPUB, MOBI, ...)
A Django command line tool for importing HTML, XML and JSON data to models via XSLT mapping
HTML table parser that supports rowspan, colspan, links and nested tables. Fast, lightweight with no external dependencies.