scrapingai

Build web scraping agents using AI to auto-extract the data from websites

  • Version: 1.0.1 (latest)
  • Maintainers: 1

scrapingai


Extract data from websites automatically with AI or build web scraping agents for bulk URL scraping.

Installation

Install it via npm:

npm i scrapingai

Highlights

  • Built-in residential proxies and captcha handling
  • Smart ad blocker and popup blocker for better performance
  • Automatic acceptance of cookie consent to close cookie banners
  • Compatible with Puppeteer and Playwright for browser automation and testing
  • Background jobs for bulk URL scraping with automatic retry and error handling
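The background jobs feature is not shown in this README, so its API is not assumed here. For a small list of URLs, a client-side loop with retries can be sketched around the `browser.extract` call shown under Usage. The helper names `withRetry` and `extractAll`, the attempt count, and the delay are all hypothetical and not part of the package.

```javascript
// Hypothetical helper, not part of the scrapingai package.
// Retries an async function with a fixed delay between attempts.
async function withRetry(fn, attempts = 3, delayMs = 500) {
  let lastError;
  for (let attempt = 1; attempt <= attempts; attempt++) {
    try {
      return await fn();
    } catch (error) {
      lastError = error;
      if (attempt < attempts) {
        await new Promise((resolve) => setTimeout(resolve, delayMs));
      }
    }
  }
  throw lastError;
}

// Processes URLs one at a time and records failures instead of aborting the batch.
// `client` is any object exposing browser.extract(url), such as the agenty
// instance from the Usage section.
async function extractAll(client, urls, attempts = 3, delayMs = 500) {
  const results = [];
  for (const url of urls) {
    try {
      const data = await withRetry(() => client.browser.extract(url), attempts, delayMs);
      results.push({ url, ok: true, data });
    } catch (error) {
      results.push({ url, ok: false, error: String(error) });
    }
  }
  return results;
}
```

Inside an async function this could be called as `await extractAll(agenty, ["https://example.com"])`. Running the URLs sequentially keeps the sketch simple; for large batches the package's own background jobs are likely the better fit.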

Usage

Get your API key from Agenty, then create a client and call it:

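// Assumes the Agenty class is already imported from the package and API_KEY holds your key.
const agenty = new Agenty(API_KEY);
// Auto-extract structured data from the page.
const data = await agenty.browser.extract("https://example.com");
console.log(data);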
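To keep the key out of source code, it can be read from an environment variable before the client is created. This is a minimal sketch: the helper `readApiKey` and the variable name `AGENTY_API_KEY` are assumptions for illustration, not something the package documents.

```javascript
// Hypothetical helper: the variable name AGENTY_API_KEY is an assumption,
// not a name defined by the scrapingai package.
function readApiKey(env = process.env) {
  const key = (env.AGENTY_API_KEY || "").trim();
  if (!key) {
    throw new Error("Set AGENTY_API_KEY before creating the client");
  }
  return key;
}

// Usage, with the Agenty class from the example above:
// const agenty = new Agenty(readApiKey());
```

Failing early with a clear message avoids sending requests with an empty key.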

Extract

Auto-extract products, job listings, SEO metadata, schema JSON, and similar data from a given URL:

const data = await agenty.browser.extract("https://example.com");
console.log(data);

Scrape

Extract data using a given CSS selector or a custom jQuery function:

const data = await agenty.browser.scrape("https://example.com");
console.log(data);

Screenshot

Capture a screenshot of a given URL:

const data = await agenty.browser.screenshot("https://example.com");
console.log(data);

PDF

Convert a webpage into a PDF:

const data = await agenty.browser.pdf("https://example.com");
console.log(data);

Content

Get the HTML content of a URL:

const data = await agenty.browser.content("https://example.com");
console.log(data);
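When several of these calls are needed for the same page, they can be run concurrently and their outcomes collected separately, so that one failure does not hide the other results. The sketch below assumes only the `browser.content`, `browser.screenshot`, and `browser.pdf` methods shown above; the helper name `captureAll` and the shape of its return value are hypothetical.

```javascript
// Hypothetical helper, not part of the scrapingai package.
// `client` is any object exposing the browser.* methods shown above.
async function captureAll(client, url) {
  const names = ["content", "screenshot", "pdf"];
  // The async wrapper turns synchronous throws into rejections as well.
  const settled = await Promise.allSettled(
    names.map(async (name) => client.browser[name](url))
  );
  const report = {};
  settled.forEach((outcome, i) => {
    report[names[i]] =
      outcome.status === "fulfilled"
        ? { ok: true, data: outcome.value }
        : { ok: false, error: String(outcome.reason) };
  });
  return report;
}
```

Inside an async function, `const report = await captureAll(agenty, "https://example.com");` would yield one entry per operation, each marked `ok` or carrying an error message.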

License

scrapingai is a project by Agenty, released under the MIT License.


Package last updated on 06 Nov 2023
