pypi

HTML Parser

html-wrapper

HTML parser with an lxml backend. Implements a subset of the BeautifulSoup API and is an order of magnitude faster.

metatron

Python 3 HTML meta tag parser, with emphasis on complex meta tag structures with support for OpenGraph and Twitter Card tags, including array handling

html meta parser opengraph twittercard

jparser

A readability parser which can extract title, content, images from html pages

jsonify-html

Template-based HTML-to-JSON parser.

pyeditorjs

pyEditorJS

pypolyglot

Translate documents and webpages to various markup languages and document formats (html, epub, mobi ..)

biblio-py

Package to manage bibliography files

markdown-tool

Markdown articles downloader and converter

markdown-parser

preparser

A slight preparser to help parse webpage content or get requests from URLs; supports Windows, Mac, and Unix.

django-import-data

A Django command line tool for importing HTML, XML and JSON data to models via XSLT mapping

django import mapping parser xml html json xslt

prop-request

HTTP request tool with a little functionality

crawler parser html

iocparser-tool

A tool for extracting Indicators of Compromise from security reports

threat-intelligence

mailru-parser

Parse html content of Mail.ru

commie

Extracts comments from source code in different programming languages

easyscrapper

easyscrapper is a fast, lightweight Python package and CLI tool that lets developers, data scientists, and AI engineers extract text, HTML, emails, links, canonical, meta and images from any public webpage - perfect for AI, RAG pipelines, SEO, content aggregation, and scalable data workflows with just one command or a few lines of code.

data extraction

html-report-line-profiler

Generate an HTML report for line_profiler

'line_profiler'

pyrutracker

Package to parse rutracker.org forum

beautifulsoup4-helpers

Frequently used functions for html parsing with beautifulsoup4 https://pypi.org/project/beautifulsoup4/

python3 beautifulsoup4_helpers parser html

py-moodle-quiz-parser

Package for parsing moodle quiz HTML documents

pancritic

CriticMarkup parser with optional pandoc backend

pandoc panflute markdown latex html criticmarkup

gumbo

Python bindings for Gumbo HTML parser

gumbo html html5 parser google html5lib beautifulsoup

chainsoup

A fluent, pipeline-based interface for querying HTML/XML with BeautifulSoup.

markup-parser

Parse JS variables from HTML markup

youtube-html-parser

Parses YouTube content

browsernative

Lightning-fast web scraping Python SDK - 11x faster than traditional scrapers

browser automation

data extraction

parser-html

A basic HTML parser in Python

apifier

A web parser for tabular and/or paginated data

api parser table data html

digs

Makes text crawling tasks over websites with depth levels easier.

metaparser

A parser for HTML meta tags

pithy

Pithy is a collection of utility libraries for Python 3.

webby

Web Crawler, HTML Parser, and Data Visualization

web data crawler parse html xml

lieparse

HTML parser and text retriever using a user-defined rule set

eriparse

HTML Parser of Economic Research Institute Cost of Living HTML.

sec-html-parser

Simple library for parsing SEC forms

wikitext-asymptote

Custom wikitext parser to produce html, plain text fields and relevant links from wikipedia page source code.

ooxmilker

An Office Open XML parser that outputs HTML.

zetanize

HTML form parser for humans

ai-html-parse

AI HTML Parser

moduledocs

Static documentation extraction tool for the Python language

generate documentation markdown html static parser

bookmarkdown

Parse your browser's exported HTML bookmark file to Markdown.

wikiparser

Wikipedia parser

wiki parser html to json wikipedia

pyhtmd

A Python HTML to Markdown parser

dktoparserhtml

Parser UTF8/HTML <-> pure HTML -> UTF8/Markdown

crimson-html-parser

Your package description.

xcrap-parser

Xcrap Parser is a declarative, model-driven parser for extracting data from HTML and JSON files, with the ability to interleave both to extract even more information.

py-style-flattener

Manipulate HTML by moving <style> to style=".

htmlpyever

Python bindings to html5ever

html-jparser

Easy HTML parser with jQuery selectors

html parser jquery select easy

seokar

A Python library for comprehensive on-page SEO analysis of HTML content.

spenx

Python 3 template parser to generate HTML from a pug/jade like syntax


Made with ⚡️ by Socket Inc

U.S. Patent No. 12,346,443 & 12,314,394. Other pending.