HTML scraper with templates
HTML text parser; gets the content from an HTML page
DOM parser for HTML, XML, and other markup.
HTML parser tools and data-crawling framework.
A command-line tool to render HTML and text emails from Markdown content.
Simple, extendable HTML and XML data extraction engine using YAML configurations and, sometimes, Pythonic functions.
CLI tool that turns a CSV file into an HTML table with basic formatting
Miss Match is an HTML parser designed to identify mismatched HTML tags.
Easy HTML parser with jQuery selectors
A package that provides an API to create a DOM of HTML documents and access its elements
An HTML parser
HTMLie is a command-line HTML parser.
Whitelist HTML filter
Adds tweet entities to a tweet's text in HTML
A generic HTML page parser
ElementTree wrapper for BeautifulSoup HTML parser
A powerful HTML parser/scraper/validator/formatter that constructs a modifiable, searchable DOM tree and includes many standard JS DOM functions (getElementsBy*, appendChild, etc.) as well as additional methods
TechGame Framework CSS Parser and Engine
CPython html.parser module ported to Pycopy
CPython html.parser module ported to MicroPython
Read HTML data and convert it into Python classes.
Programming language decoder. You can now decode HyperText Markup Language (HTML) and CSS (style sheets) using this package.
Parses HTML chemistry papers from certain publishers into plain text
A nifty HTML parser written in Python
A simple HTML parser that constructs a DOM tree.
A Python library that parses HTML into a tree structure
A component parser for HTML
Transforms HTML tables from soup objects into usable data structures (e.g. 2D arrays)
Static website templating engine. Generates HTML pages using .nuo template files and data in JSON files.
Optimized parser for a Creole-like markup language
Markdown parser that transforms Markdown into HTML
An HTML parser based on lxml
A simple, fast, pure-Python HTML parser
Full HTML web page downloader, parser, and data extractor. Web crawler, server, and client included.
Common Python library which contains reusable components, developed at Infrae.
Python implementation of John Gruber's Markdown.