Nokogiri (鋸) makes it easy and painless to work with XML and HTML from Ruby. It provides a sensible, easy-to-understand API for reading, writing, modifying, and querying documents. It is fast and standards-compliant by relying on native parsers like libxml2, libgumbo, or xerces.
A fast, safe and extensible Markdown to (X)HTML parser
a swift, liberal HTML parser with a fantastic library
The Correios CEP gem looks up current Brazilian addresses by zip code (CEP), directly from the Correios database. No HTML parsers.
feedparser - web feed parser and normalizer (RSS, Atom, JSON Feed, HTML h-entry, etc.)
Parsing a table can be difficult when its structure contains colspan or rowspan. TableParser parses HTML tables and groups cells by column, respecting colspan and rowspan.
A markup parser that outputs html and text. Syntax is similar to MediaWiki.
A gem packaging of the CyberNeko HTML Parser
Watir-Nokogiri is an HTML parser using Watir's API.
`article_json` is a format definition for news articles and a Ruby gem that offers conversions from and to different formats: a parser for Google Doc HTML exports, and converters to simple HTML, AMP, Apple News Format (ANF), Facebook Instant Article HTML, plain text, and JSON.
a swift, liberal HTML parser with a fantastic library
Ruby Cheerio is a jQuery-style HTML parser that takes selectors as input. It is a Ruby version of the Node.js package 'Cheerio', which is used extensively by crawlers. Please visit the home page for usage details.
Library that extracts information for tracking your Cukes and their overall performance, and generates HTML reports.
A lightweight (X)HTML(5) & XML parser
An (X)HTML-friendly template engine for Rails with the power of layouts and partials. Parser gem.
http://wiki.github.com/iori/wparser/ This package is a wiki format parser. Wiki => HTML
Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser. Among Nokogiri's many features is the ability to search documents via XPath or CSS3 selectors. XML is like violence - if it doesn’t solve your problems, you are not using enough of it.
HTML/XML parser for RubyMotion, based on GDataXML-HTML.
A lightweight (X)HTML & XML parser
Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser. Among Nokogiri's many features is the ability to search documents via XPath or CSS3 selectors. XML is like violence - if it doesn’t solve your problems, you are not using enough of it.
a swift, liberal HTML parser with a fantastic library
ParseHTML is an HTML parser which works with Ruby 1.8 and above. ParseHTML will even try to handle invalid HTML to some degree.
XOXO is a Ruby XOXO parser and generator. It provides a Ruby API similar to Marshal and YAML (though more specific) to load and dump XOXO[http://microformats.org/wiki/xoxo], a simple, open outline format written in standard XHTML and suitable for embedding in (X)HTML, Atom, RSS, and arbitrary XML.
Ruby HTML sanitizer based on a lightweight Oga parser.
A fast, safe and extensible Markdown to (X)HTML parser
Pure Ruby HTML parser. This library parses HTML and builds a REXML document. Nokogiri is very convenient, but its installation is complex because it requires building a native library, which makes it unsuitable for Chef.
Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser. Among Nokogiri's many features is the ability to search documents via XPath or CSS3 selectors. XML is like violence - if it doesn’t solve your problems, you are not using enough of it.
A fast, safe and extensible Markdown to (X)HTML parser, with YouTube embedding, based on Redcarpet
HTML template generation for Spot
A LaTeX to HTML parser.
Standardizes markup parsers behind a single format: an object. Instantiate a specific markup class with text to output formatted HTML. Allows easy code block highlighting using a Proc; defaults to Uv (Ruby Ultraviolet).
Parses APC (Advanced PHP Cache) HTML status output from apc.php, included in recent versions of PHP, into a data structure that is usable by Ruby.
Selma selects and matches HTML nodes using CSS rules. Backed by Rust's lol_html parser.
Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser. Among Nokogiri's many features is the ability to search documents via XPath or CSS3 selectors. XML is like violence - if it doesn’t solve your problems, you are not using enough of it.
a swift, liberal HTML parser with a fantastic library
Hpricot helper that scrapes HTML easily, using CSS selectors defined in a parser class.
The core parser for the Bayeux document markup language, optimised for long-form documents. Generators are also provided for HTML, LaTeX and PanDoc
Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser. Among Nokogiri's many features is the ability to search documents via XPath or CSS3 selectors. XML is like violence - if it doesn’t solve your problems, you are not using enough of it.
Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser. Among Nokogiri's many features is the ability to search documents via XPath or CSS3 selectors. XML is like violence - if it doesn’t solve your problems, you are not using enough of it.
HTML Cruncher is a streaming parser that'll gladly crunch through anything mildly resembling HTML.
Koara to HTML parser written in Ruby
Thin wrapper around a native Java HTML parser
A gem that parses your Markdown into HTML
ParseHTML is an HTML parser which works with Ruby 1.8 and above. ParseHTML will even try to handle invalid HTML to some degree.
Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser. Among Nokogiri's many features is the ability to search documents via XPath or CSS3 selectors. XML is like violence - if it doesn’t solve your problems, you are not using enough of it.
Koara to HTML parser written in Ruby
A fast, safe and extensible Markdown to (X)HTML parser patched to have smaller headings and no quote-escaping
Allows fetching and parsing Yelp business data from HTML content where it is otherwise not available.
Parse HTML using different parsers, then show the differences between the generated trees.
PertinentParser helps you compose HTML tags across existing tag boundaries.