Launch Week Day 1: Socket for Jira Is Now Available.Learn More →

Book a Demo Sign in

osmosis

Package Overview

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

osmosis

Web scraper for NodeJS

latest

Source

npm

Version: 1.1.10

Version published: 7 years ago

Weekly downloads: 135

Maintainers: 1

Weekly downloads

Created: 5 years ago

Source

Osmosis

HTML/XML parser and web scraper for NodeJS.

Downloads

Features

Uses native libxml C bindings
Clean promise-like interface
Supports CSS 3.0 and XPath 1.0 selector hybrids
Sizzle selectors, Slick selectors, and more
No large dependencies like jQuery, cheerio, or jsdom
Compose deep and complex data structures
HTML parser features
- Fast parsing
- Very fast searching
- Small memory footprint
HTML DOM features
- Load and search ajax content
- DOM interaction and events
- Execute embedded and remote scripts
- Execute code in the DOM
HTTP request features
- Logs urls, redirects, and errors
- Cookie jar and custom cookies/headers/user agent
- Login/form submission, session cookies, and basic auth
- Single proxy or multiple proxies and handles proxy failure
- Retries and redirect limits

Example

var osmosis = require('osmosis');

osmosis
.get('www.craigslist.org/about/sites')
.find('h1 + div a')
.set('location')
.follow('@href')
.find('header + div + div li > a')
.set('category')
.follow('@href')
.paginate('.totallink + a.button.next:first')
.find('p > a')
.follow('@href')
.set({
    'title':        'section > h2',
    'description':  '#postingbody',
    'subcategory':  'div.breadbox > span[4]',
    'date':         'time@datetime',
    'latitude':     '#map@data-latitude',
    'longitude':    '#map@data-longitude',
    'images':       ['img@src']
})
.data(function(listing) {
    // do something with listing data
})
.log(console.log)
.error(console.log)
.debug(console.log)

Documentation

For documentation and examples check out https://rchipka.github.io/node-osmosis/global.html

Dependencies

libxmljs-dom - DOM wrapper for libxmljs C bindings
needle - Lightweight HTTP wrapper

Donate

Please consider a donation if you depend on web scraping and Osmosis makes your job a bit easier. Your contribution allows me to spend more time making this the best web scraper for Node.

Keywords

FAQs

What is osmosis?

Is osmosis popular?

Is osmosis well maintained?

Package last updated on 01 Mar 2019

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

osmosis

Osmosis

Features

Example

Documentation

Dependencies

Donate

Keywords

Related posts

Socket Named Top Sales Organization by RepVue

NIST Officially Stops Enriching Most CVEs as Vulnerability Volume Skyrockets