nix-sitemap-generator


Web crawler and sitemap generator.



A sitemap generator library for Python, forked from https://github.com/Haikson/sitemap-generator.

Installing
----------
::

    pip install nix-sitemap-generator

Usage
-----

1. Import `crawler` from `pysitemap`
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
::

    from pysitemap import crawler

2. Call `crawler()`
~~~~~~~~~~~~~~~~~~~
::

    # Parser is imported from pysitemap.parsers.lxml_parser, as in the Example section.
    crawler(
        'https://site.com',
        out_file='debug/sitemap.xml',
        exclude_urls=[".pdf", ".jpg", ".zip"],
        http_request_options={"ssl": False},
        parser=Parser,
    )
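The README does not say how `exclude_urls` entries are matched against discovered links. As a rough sketch, assuming plain substring matching (the library may well use regular expressions or another rule), a filter could look like this. `is_excluded` is a hypothetical helper, not part of pysitemap:

```python
from typing import Iterable


def is_excluded(url: str, patterns: Iterable[str]) -> bool:
    """Return True if any pattern occurs in the URL (assumed substring rule)."""
    return any(pattern in url for pattern in patterns)


# Same patterns as in the crawler() call above.
patterns = [".pdf", ".jpg", ".zip"]
links = [
    "https://site.com/about",
    "https://site.com/files/report.pdf",
    "https://site.com/img/logo.jpg",
]
# Keep only the links that match none of the exclusion patterns.
kept = [link for link in links if not is_excluded(link, patterns)]
```

Under this assumption, only links containing none of the listed fragments would end up in the sitemap.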

Example
-------
::

    import sys
    import logging
    from pysitemap import crawler
    from pysitemap.parsers.lxml_parser import Parser

    if __name__ == '__main__':
        if '--iocp' in sys.argv:
            from asyncio import events, windows_events
            sys.argv.remove('--iocp')
            logging.info('using iocp')
            el = windows_events.ProactorEventLoop()
            events.set_event_loop(el)

        # root_url = sys.argv[1]
        root_url = 'https://www.haikson.com'
        crawler(
            root_url, out_file='debug/sitemap.xml', exclude_urls=[".pdf", ".jpg", ".zip"],
            http_request_options={"ssl": False}, parser=Parser
        )
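The example hard-codes `root_url` and leaves the `sys.argv[1]` variant commented out. A minimal sketch of reading the root URL from the command line, with a fallback, follows. `parse_args` and `DEFAULT_ROOT_URL` are hypothetical names introduced here; only the `--iocp` flag and the fallback URL come from the example above:

```python
from typing import List, Tuple

DEFAULT_ROOT_URL = 'https://www.haikson.com'  # fallback taken from the example


def parse_args(argv: List[str]) -> Tuple[str, bool]:
    """Split argv into (root_url, use_iocp) without mutating the input list."""
    args = list(argv[1:])  # drop the script name
    use_iocp = '--iocp' in args
    if use_iocp:
        args.remove('--iocp')
    # The first remaining argument, if any, is treated as the root URL.
    root_url = args[0] if args else DEFAULT_ROOT_URL
    return root_url, use_iocp
```

Calling `parse_args(sys.argv)` at the top of the `__main__` block would give the URL to pass to `crawler()` and a flag saying whether to install the proactor event loop.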


Changes
-------
v. 0.10.1
~~~~~~~~~

- Refactored the code to make it more readable.
- Removed `print()` calls from the code.
- Added a verbose mode to `crawler()`.
- Added type hints to the `crawler()` arguments.
- Added `ValueError` handling around the `add_signal_handler()` call.
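The changelog does not say where the `ValueError` arises. As an illustration only, assuming the goal is to keep running when a signal handler cannot be registered, a guard around asyncio's `loop.add_signal_handler()` might look like this. `try_add_signal_handler` is a hypothetical helper, and catching `RuntimeError` and `NotImplementedError` as well is an addition of this sketch, not something the changelog mentions:

```python
import asyncio
from typing import Callable


def try_add_signal_handler(
    loop: asyncio.AbstractEventLoop, sig: int, callback: Callable[[], None]
) -> bool:
    """Register callback for sig; return False instead of raising on failure."""
    try:
        loop.add_signal_handler(sig, callback)
    except (ValueError, RuntimeError, NotImplementedError):
        # ValueError: invalid or uncatchable signal number.
        # RuntimeError: the handler could not be set up.
        # NotImplementedError: the event loop does not support signal handlers.
        return False
    return True


loop = asyncio.new_event_loop()
try:
    # 0 is not a valid signal number, so registration is expected to fail.
    registered = try_add_signal_handler(loop, 0, lambda: None)
finally:
    loop.close()
```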
