New Case Study:See how Anthropic automated 95% of dependency reviews with Socket.Learn More →

Sign in Demo Install

What is Socket?

Socket for GitHub

Detect suspicious packages in PRs

Socket CLI

Use Socket from the command line

Socket Web Extension

Use Socket from your browser

Socket Dependency Search

Find any package for your project

Socket Optimize

Optimize your dependencies

Integrations

All Integrations

Ticketing & Messaging

Package Managers

Docs

Want to read all the docs? Start here

Customers

Check out our customer stories

Blog

Keep up to date with all the news

Changelog

Latest updates and enhancements

FAQ

Answers to common questions

Package Alerts

Learn about all Socket alerts

Glossary

Open source and security terms

Blog

Application Security

Customer Stories

About

Why we built Socket

Love

See why developers love Socket

Careers

Join our team

Investors

Learn about our investors

Security

Our security practices

Why Socket?

Socket vs Dependabot

Socket vs Semgrep

Socket vs EndorLabs

Socket for Open Source Security

Socket for Supply Chain Attack Prevention

Achievements

Fortune Cyber 60

Pricing Love Docs

Sign in Demo Install

pypi
Categories
Server
File Formats
HTML Parser

HTML Parser

html5lib

HTML parser based on the WHATWG HTML specification

htmldate

Fast and robust extraction of original and updated publication dates from URLs and web pages.

datetime
date-parser
entity-extraction
html-extraction
html-parsing
metadata-extraction

tinyhtml5

HTML parser based on the WHATWG HTML specification

html
parser

readability-lxml

fast html to text parser (article readability tool) with python 3 support

djc-core-html-parser

HTML parser used by django-components written in Rust.

django
components
html

breadability

Port of Readability HTML parser in Python

bookie
breadability
content
HTML
parsing
readability

pyromark

Blazingly fast Markdown parser

converter
html

html5rdf

HTML parser based on the WHATWG HTML specification

html5-parser

Fast C based HTML 5 parsing for python

htmllistparse

Python parser for Apache/nginx-style HTML directory listing.

apache nginx listing fuse

html-table-parser-python3

A small and simple HTML table parser not requiring any external dependency.

sec-parser

Parse SEC EDGAR HTML documents into a tree of elements that correspond to the visual structure of the document.

reliq

Python ctypes bindings for reliq

ctypes
html
parser
text-processing

lukeparser

The Style of Markdown with the Power of LaTeX.

markdown
html
latex
parser

html-parser

UNKNOWN

html5lib-modern

HTML parser based on the WHATWG HTML specification

metadata-parser

A module to parse metadata out of urls and html documents

opengraph protocol facebook

pykami

A python module that parses KAMI into HTML

markup
kami
parser
html

html5

HTML parser based on the WHATWG HTML specification

blowdrycss

The atomic CSS compiler

blowdry blowdrycss css compiler pre-compiler pre-processor generator dry cascading style sheets html encoded class selector parser optimizer internet

yandex-parser

Parse html content of Yandex

docx-parser-converter

A library for converting DOCX documents to HTML and plain text

google-parser

Convert html to snippets

quick-crawler

A toolkit for quickly performing crawler functions

crawler
quick crawler
web crawler
html parser
data mining

article-parser

A parser that parses articles from any url or html

article news html parser Extract extractor body

bs2json

Convert bs4 Tags into Json

parser
html
bs4
BeautifulSoup
soup
bs4

njsparser

A Python NextJS data parser from HTML

htmldom

HTML parser which can be used for web-scraping applications

htmldom
html parser
html
xhtml
jquery

haruka-parser

A simple HTML Parser

commie

Extracts comments from source code in different programming languages

css
python
c
search
java
go

llama-index-packs-code-hierarchy

A node parser which can create a hierarchy of all code scopes in a directory.

c
code
cpp
hierarchy
html
javascript

whatsapp-chat-exporter

A Whatsapp database parser that will give you the history of your Whatsapp conversations in HTML and JSON. Android, iOS, iPadOS, Crypt12, Crypt14, Crypt15 supported.

android
ios
parsing
history
iphone
message

jsoup

Convert JSON to BeautifulSoup object

parser
html
bs4
BeautifulSoup
soup
jsoup

htmlement

Pure-Python HTML parser with ElementTree support.

html html5 parsehtml htmlparser elementtree dom

prop-request

HTTP request tool with a little functionality

crawler parser html

html-template-parser

A parser for HTML templates.

biblio-py

Package to manage bibliography files

bibliography
bibtex
converter
html
xml
latex

edwh-editorjs

EditorJS.py

bleach
clean
editor
editor.js
html
javascript

pypolyglot

Translate documents and webpages to various markup languages and document formats (html, epub, mobi ..)

pdf
html
parser

preparser

a slight preparser to help parse webpage content or get request from urls,which supports win, mac and unix.

preparser
parser
parse
crawl
webpage
html

html2object

Tools to handle the CRUD of .html files as objects.

html
utils
html_utils
html_element
html_parser
html_writer

jparser

A readability parser which can extract title, content, images from html pages

llama-index-packs-code-hierarchy-blar

A node parser which can create a hierarchy of all code scopes in a directory.

c
code
cpp
hierarchy
html
javascript

django-import-data

A Django command line tool for importing HTML, XML and JSON data to models via XSLT mapping

django import mapping parser xml html json xslt

jsonify-html

Template-based HTML-to-JSON parser.

htmst

HTML to AST with positions

html
parser
ast
position

pyrutracker

Package to parse rutracker.org forum

html
parser
rutracker

metatron

Python 3 HTML meta tag parser, with emphasis on complex meta tag structures with support for OpenGraph and Twitter Card tags, including array handling

html meta parser opengraph twittercard

py-moodle-quiz-parser

Package for parsing moodle quiz HTML documents

pancritic

CriticMarkdup parser with optional pandoc backend

pandoc panflute markdown latex html criticmarkup

Product

Package Alerts
Integrations
Docs
Pricing
FAQ
Roadmap
Changelog

About

About
Love
Blog
Glossary
Discord Community
CareersHiring
Send Feedback
Contact Us
System Status

Packages

npm

Directory
Explore
Random Package
Most Popular
Top Maintainers
Removed Packages

Go

Directory
Explore
Random Package

Maven

Directory
Explore
Random Package

PyPI

Directory
Explore
Random Package

Rubygems

Directory
Explore
Random Package

Stay in touch

Get open source security insights delivered straight into your inbox.

Enter your email

Terms
Privacy
Security

Made with ⚡️ by Socket Inc