inspire-crawler

Crawler integration with INSPIRE-HEP.

3.0.4

..
    This file is part of Invenio.
    Copyright (C) 2016 CERN.

    Invenio is free software; you can redistribute it
    and/or modify it under the terms of the GNU General Public License as
    published by the Free Software Foundation; either version 2 of the
    License, or (at your option) any later version.

    Invenio is distributed in the hope that it will be
    useful, but WITHOUT ANY WARRANTY; without even the implied warranty of
    MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
    General Public License for more details.

    You should have received a copy of the GNU General Public License
    along with Invenio; if not, write to the
    Free Software Foundation, Inc., 59 Temple Place, Suite 330, Boston,
    MA 02111-1307, USA.

    In applying this license, CERN does not
    waive the privileges and immunities granted to it by virtue of its status
    as an Intergovernmental Organization or submit itself to any jurisdiction.

=================
 inspire-crawler
=================

.. image:: https://img.shields.io/travis/inspirehep/inspire-crawler.svg
        :target: https://travis-ci.org/inspirehep/inspire-crawler

.. image:: https://img.shields.io/coveralls/inspirehep/inspire-crawler.svg
        :target: https://coveralls.io/r/inspirehep/inspire-crawler

.. image:: https://img.shields.io/github/tag/inspirehep/inspire-crawler.svg
        :target: https://github.com/inspirehep/inspire-crawler/releases

.. image:: https://img.shields.io/pypi/dm/inspire-crawler.svg
        :target: https://pypi.python.org/pypi/inspire-crawler

.. image:: https://img.shields.io/github/license/inspirehep/inspire-crawler.svg
        :target: https://github.com/inspirehep/inspire-crawler/blob/master/LICENSE

Crawler integration with INSPIRE-HEP using the Scrapy project HEPCrawl_.

This module schedules crawler jobs on a Scrapyd_ instance serving a Scrapy_ project. The default Scrapy project is HEPCrawl_.
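
To illustrate what scheduling a job on a Scrapyd instance involves at the HTTP level, here is a minimal sketch that targets Scrapyd's ``schedule.json`` endpoint using only the Python standard library. It is not the inspire-crawler API. The helper names, the host URL, the project name ``hepcrawl``, and the spider name ``arXiv`` are assumptions chosen for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def build_schedule_request(base_url, project, spider, **spider_args):
    """Build (but do not send) a POST request for Scrapyd's schedule.json.

    Hypothetical helper: extra keyword arguments are sent as form fields,
    which Scrapyd passes on to the spider as arguments.
    """
    fields = {"project": project, "spider": spider}
    fields.update(spider_args)
    data = urlencode(fields).encode("utf-8")
    url = base_url.rstrip("/") + "/schedule.json"
    return Request(url, data=data, method="POST")


def schedule_job(base_url, project, spider, **spider_args):
    """Send the request and return Scrapyd's decoded JSON response."""
    request = build_schedule_request(base_url, project, spider, **spider_args)
    with urlopen(request, timeout=10) as response:
        return json.loads(response.read().decode("utf-8"))
```

With a Scrapyd instance running locally, a call such as ``schedule_job("http://localhost:6800", "hepcrawl", "arXiv")`` would be expected to return a JSON object containing a ``status`` field and, on success, a ``jobid``.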

It integrates directly with the invenio-workflows_ module to create a workflow for every record harvested by the crawler.

This module is meant to be used only with the INSPIRE-HEP_ overlay. Use at your own risk.

Full documentation is hosted here: http://pythonhosted.org/inspire-crawler/

See also documentation of HEPCrawl: http://pythonhosted.org/hepcrawl/

.. _HEPCrawl: http://pythonhosted.org/hepcrawl/
.. _Scrapyd: http://scrapyd.readthedocs.io/
.. _Scrapy: http://doc.scrapy.org/
.. _invenio-workflows: http://pythonhosted.org/invenio-workflows/
.. _INSPIRE-HEP: http://inspirehep.readthedocs.io

Keywords

invenio inspire scrapy crawler
