Security News
tea.xyz Spam Plagues npm and RubyGems Package Registries
Tea.xyz, a crypto project aimed at rewarding open source contributions, is once again facing backlash due to an influx of spam packages flooding public package registries.
crawler-find-word
Advanced tools
Readme
Deep crawl to find word in the body of web pages by base url.
Simple but powerful, popular and production crawling/scraping package for Node.
We strict about our code , so, we choose to use 'travis-ci' and 'npm audit'.
Review bugs you found or feature you want in our Slack click this Slack invitation
Configurable level of maximum pages to visit.
Configurable root URL and Word to search.
Use event driven API, Raise 'Done' event when process ends.
Return usefull statistical data.
Enable Docker hosting
Use Cheerio to find word or phrase in the DOM.
Tested with Mocha and Chai.
Add 'Error' event handling.
Priority queue of requests.
Control rate limit.
Charset detection and conversion.
'use strict';
var srv = require('crawler-find-word');
var print = function(){
var count = srv.pages.length;
for(var i=0; i < count; ){
var u = srv.pages.pop();
console.log(JSON.stringify(u));
i++;
};
}
srv.eventHandler.on('done', print);
srv.crawl('https://cnn.com/', 'trump', 2);
Running nodemon ./crawler-find-word.js localhost 3000
will run the unit tests with debug mode.
Run the command npm test
for Mocha & Chai testing unit.
Run command to build Docker image from the project directory
docker build -t node-crawler-dev-env .
Run command to create Docker instance
docker container run -p 9999:80 --name node-crawler-app --rm -v local-drive:/usr/src/app node-crawler-dev-env:latest
FAQs
crawler service
The npm package crawler-find-word receives a total of 4 weekly downloads. As such, crawler-find-word popularity was classified as not popular.
We found that crawler-find-word demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
Tea.xyz, a crypto project aimed at rewarding open source contributions, is once again facing backlash due to an influx of spam packages flooding public package registries.
Security News
As cyber threats become more autonomous, AI-powered defenses are crucial for businesses to stay ahead of attackers who can exploit software vulnerabilities at scale.
Security News
UnitedHealth Group disclosed that the ransomware attack on Change Healthcare compromised protected health information for millions in the U.S., with estimated costs to the company expected to reach $1 billion.