Security News
tea.xyz Spam Plagues npm and RubyGems Package Registries
Tea.xyz, a crypto project aimed at rewarding open source contributions, is once again facing backlash due to an influx of spam packages flooding public package registries.
jarviscrawlercore
Advanced tools
Readme
JarvisCrawlerCore 是一套分布式 爬虫服务框架 / 页面编程机器人 。
它可以用于构建一套数据爬取集群,也可以用于Web项目的自动化测试,或用于其它机器人操作web项目的子服务。
它用于多个机器人项目,包括群内自动翻译机器人、新闻推送频道、动漫影视资源推送频道、后台数据监控系统、页面分析、行业数据抓取等。
我们仅在前期测试使用命令行,主要维护grpc服务模式。
建议用docker部署,多节点并行获取数据,目前仅有golang客户端(jccclient)可以提供基本的任务分派。
如果需要多节点的统一运维,可以使用Jarvis。
机器配置要求,建议使用linux,能装docker。
内存2g及以上(1g内存也可以用,不要一次请求太多任务,chrome内存占用较严重,每隔一段时间重启服务会有好处,我们也有个翻译服务数月不重启的)。
这里 是一个可以直接用于部署的脚本项目。
下面的命令可以直接使用DockerHub源部署。
docker push zerrozhao/jarviscrawlercore:latest
需要修改配置文件,service.yaml
,建议放在 cfg
目录下。
servAddr: 0.0.0.0:7051
headless: true
slowMo: 10
clientToken:
- wzDkh9h2fhfUVuS9jZ8uVbhV3vC5AWX3
其中,clientToken,是用来校验权限的,可以配置多个,每次响应请求都会校验token,一个token可以提供给多个客户端使用。
nodejs
调用的例子,见src/service/client2.js
。
可以通过 npm 安装依赖,即可使用。
npm i jarviscrawlercore --save
这里还有一个直接用 jarviscrawlercore
项目来打包漫画的项目,这里 。
使用 jccclient
即可。
protos
结构Charles
线上部署jccclient
实现更高效的抓取工作FAQs
jarvis crawler core
The npm package jarviscrawlercore receives a total of 3 weekly downloads. As such, jarviscrawlercore popularity was classified as not popular.
We found that jarviscrawlercore demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
Tea.xyz, a crypto project aimed at rewarding open source contributions, is once again facing backlash due to an influx of spam packages flooding public package registries.
Security News
As cyber threats become more autonomous, AI-powered defenses are crucial for businesses to stay ahead of attackers who can exploit software vulnerabilities at scale.
Security News
UnitedHealth Group disclosed that the ransomware attack on Change Healthcare compromised protected health information for millions in the U.S., with estimated costs to the company expected to reach $1 billion.