A .NET Standard web crawling library similar to WebMagic and Scrapy. It is a lightweight, efficient, and fast high-level web crawling & scraping framework for .NET.
Deprecated, as there is a new maintainer for the original HAP project. Please check the new repo at https://github.com/zzzprojects/html-agility-pack. This is a port of the HtmlAgilityPack library, created by Simon Mourrier and Jeff Klawiter, to the .NET Core platform. This NuGet package can be used with the Universal Windows Platform, ASP.NET 5 (using .NET Core), and the full .NET Framework 4.6. Original description: This is an agile HTML parser that builds a read/write DOM and supports plain XPATH or XSLT (you actually don't HAVE to understand XPATH nor XSLT to use it, don't worry...). It is a .NET code library that allows you to parse "out of the web" HTML files. The parser is very tolerant with "real world" malformed HTML. The object model is very similar to what System.Xml proposes, but for HTML documents (or streams).
The Crawler-Lib Engine is a general-purpose, workflow-enabled task processor. It has evolved from a web crawler through data mining and information retrieval. It is throughput-optimized and can perform thousands of tasks per second on standard hardware. Due to its workflow capabilities, it allows you to structure and parallelize even complex kinds of work. Please visit the project page for the complete view of the Crawler-Lib Engine. A license for the Anonymous Edition is included in the package. A license for the more powerful free Community Edition can be generated on the project page. An unrestricted license is available too.
HTTP, HTTPS, FTP, S3, Azure, Kvpbase, and filesystem crawlers for Komodo. Please either install Komodo.Daemon to integrate search within your application, or Komodo.Server to run a standalone server. Komodo is an information search, metadata, storage, and retrieval platform.
Official package. DotnetSpider is a high-performance, lightweight crawler developed in C#.
A crawler and scraping framework written in C#.
dcsoup is a .NET library for working with real-world HTML. It provides a very convenient API for extracting and manipulating data, using the best of DOM, CSS, and jQuery-like methods. This library is essentially a port of jsoup, a Java HTML parser library. See also: http://jsoup.org/. An API reference is available at: https://raw.githubusercontent.com/matarillo/dcsoup/master/sandcastle/Help/dcsoup.chm
Crawler-Lib Concurrency Testing allows you to write unit tests with multiple threads to test the concurrency behavior of components. It provides synchronization mechanisms to control the workflow of the threads and to record the execution steps. It can also be used for client/server tests, and it works in conjunction with any unit test framework or with handwritten tests.
Crawler resolver components for ASP.NET Core Detection.
A simple, easy-to-use, and efficient open-source .NET HTTP request framework with attitude! It can be used to build crawlers, make API requests, and more. It makes HTTP programming extremely simple: easier coding, cleaner code. For usage, see: https://github.com/stulzq/HttpCode.Core
DotnetSpider, a .NET Standard web crawling library. It is a lightweight, efficient, and fast high-level web crawling & scraping framework.
A filter that prerenders a JavaScript-rendered page using an external service and returns the HTML to the search engine crawler, for SEO.
A WebClient which is optimized for crawling.
Web scraper / crawler / spider. Supports robots protocol and user agent.
Crawler-Lib NHunspell is a spell check, hyphenation, word stemming and thesaurus library based on the OpenOffice spell check library Hunspell. NHunspell can use the vast amount of OpenOffice dictionaries. It is an alternative to NetSpell, GNU Aspell, ISpell, PSpell and Enchant. It wraps the native libraries for Hunspell and Hyphen and contains a fully managed version of MyThes. This version of the NuGet package automatically copies the native binaries to the output directory. NHunspell is licensed under GPL/LGPL/MPL. Free use in commercial applications is permitted according to the LGPL and MPL licenses; your commercial application can link against the NHunspell DLLs.
Crawler
An HttpClient extension class, and the .NET Core version of the HttpHelper class. A simple and flexible crawler base class library: the JsHttpClient class is a simple and flexible crawler base class library for .NET Core.
The Crawler-Lib Engine Test Helper simplifies the testing of tasks. It can be used to develop unit tests and integration tests for tasks.
This is an agile HTML parser that builds a read/write DOM and supports plain XPATH or XSLT (you actually don't HAVE to understand XPATH nor XSLT to use it, don't worry...). It is a .NET code library that allows you to parse "out of the web" HTML files. The parser is very tolerant with "real world" malformed HTML. The object model is very similar to what System.Xml proposes, but for HTML documents (or streams).
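A minimal sketch of the read/write DOM and XPath querying the description above refers to, assuming the HtmlAgilityPack NuGet package is referenced (the sample HTML, element names, and attribute values are illustrative):

```csharp
using System;
using HtmlAgilityPack;

class HapSketch
{
    static void Main()
    {
        // The parser tolerates "real world" malformed HTML
        // (note the unclosed <div> and missing </body>).
        var doc = new HtmlDocument();
        doc.LoadHtml("<html><body><div id='main'><a href='https://example.com'>Example</a>");

        // Query with plain XPath, much as you would with System.Xml.
        // SelectNodes returns null when nothing matches, so guard for that.
        var links = doc.DocumentNode.SelectNodes("//a[@href]");
        if (links != null)
        {
            foreach (HtmlNode link in links)
                Console.WriteLine(link.GetAttributeValue("href", ""));
        }

        // The DOM is writable: modify a node in place and re-serialize.
        doc.GetElementbyId("main")?.SetAttributeValue("class", "content");
        Console.WriteLine(doc.DocumentNode.OuterHtml);
    }
}
```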
A .NET Standard web crawling library similar to WebMagic. It is a lightweight, modular, efficient, and fast high-level web crawling & scraping framework for .NET.
The Crawler-Lib Service Base is a foundation for the development of Windows Services, Cloud Services and Linux Daemons.
A .NET Standard port of JayBizzle's CrawlerDetect project (https://github.com/JayBizzle/Crawler-Detect).
A tool to operate on the NuGet catalog.
A crawler framework and distributed crawler extractor. Try Ruiji Scraper, a Chrome web crawler: https://chrome.google.com/webstore/detail/ruiji-scraper/klhahkhllngppofpkjdlbmnglnmnbbol?hl=zh-CN&authuser=0
Simplifies interaction with Selenium through simpler commands. Besides making unit testing easier, it can also be used for web crawling or web scraping.
HtmlMonkey is a lightweight HTML/XML parser written in C#. It allows you to parse an HTML or XML string into a hierarchy of node objects, which can then be traversed or queried using jQuery-like syntax. In addition, the node objects can be modified or even built from scratch using code. Finally, the classes can generate the HTML or XML from the data.
HtmlMonkey is a lightweight HTML/XML parser written in C#. It allows you to parse an HTML or XML string into a hierarchy of node objects, which can then be traversed or queried using jQuery-like selectors. In addition, the node objects can be modified or even built from scratch using code. Finally, the classes can generate the HTML or XML from the data.
ShapeCrawler (formerly SlideDotNet) is a .NET library for manipulating PowerPoint presentations. It provides fluent APIs to process slides without having Microsoft Office installed. This library provides a simplified object model on top of the Open XML SDK for manipulating PowerPoint documents without any COM+ or COM interop layers.