Socket
Book a DemoInstallSign in
Socket

amazon-reviews-crawler

Package Overview
Dependencies
Maintainers
1
Versions
12
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

amazon-reviews-crawler

Crawls product reviews from Amazon.

1.0.4
latest
Source
npmnpm
Version published
Weekly downloads
2
100%
Maintainers
1
Weekly downloads
 
Created
Source

Amazon Reviews Crawler

Crawls product reviews from Amazon.

Installation

Via npm:

npm install amazon-reviews-crawler

Or Yarn:

yarn add amazon-reviews-crawler

Usage

Load the module

var reviewsCrawler = require('amazon-reviews-crawler')

Get reviews by a product ASIN

reviewsCrawler('0062472100')
	.then(function(results){
		console.log(results)
	})
	.catch(function(err){
		console.error(err)
	})

This will return an object containing the title of the product and an array of review data.

Example of a return:

{
	title: "Product Name",
	reviews: [
		{
			id: "R16DIYH5RRPEWK",
			title: "Review Title",
			rating: 5,
			text: "The product review body text.",
			author: "Reviewer Name"
			date: "October 26, 2016",
			link: "https://www.amazon.com/gp/customer-reviews/R16DIYH5RRPEWK/ref=cm_cr_arp_d_rvw_ttl?ie=UTF8&ASIN=0062472100"
		}
	]
}

Options

Options can also be provided to change the user agent string, review page, or elements being crawled.

Example:

reviewsCrawler('0062472100', {
		page: 'https://www.amazon.com/product-reviews/{{asin}}',
		userAgent: 'Mozilla/5.0 (Windows NT 6.1; WOW64; rv:27.0) Gecko/20100101 Firefox/27.0',
		elements: {
			// Searches whole page
			productTitle: '.product-title',
			reviewBlock: '.review',

			// Searches within elements.reviewBlock
			link: 'a',
			title: '.review-title',
			rating: '.review-rating',
			ratingPattern: 'a-star-',
			text: '.review-text',
			author: '.review-byline a',
			date: '.review-date'
		},

		// Stops crawling when it hits a particular review ID
		// Useful for only crawling new reviews
		stopAtReviewId: false
	})
	.then(console.log)
	.catch(console.error)

Keywords

amazon

FAQs

Package last updated on 25 Aug 2017

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts

SocketSocket SOC 2 Logo

Product

About

Packages

Stay in touch

Get open source security insights delivered straight into your inbox.

  • Terms
  • Privacy
  • Security

Made with ⚡️ by Socket Inc

U.S. Patent No. 12,346,443 & 12,314,394. Other pending.