Scrapeer-py

A tiny Python library that lets you scrape HTTP(S) and UDP trackers for torrent information.

Scrapeer-py is a Python port of the original PHP Scrapeer library by TorrentPier.

Overview

Scrapeer-py allows you to retrieve peer information from BitTorrent trackers using both HTTP(S) and UDP protocols. It can fetch seeders, leechers, and completed download counts for multiple torrents from multiple trackers simultaneously.

Features

  • Support for both HTTP(S) and UDP tracker protocols
  • Batch scraping of multiple infohashes at once (up to 64)
  • Support for trackers with passkeys
  • Optional announce mode for trackers that don't support scrape
  • Configurable timeout settings
  • Detailed error reporting
  • Well-organized modular codebase

Installation

pip install scrapeer

Usage

from scrapeer import Scraper

# Initialize the scraper
scraper = Scraper()

# Define your infohashes and trackers
infohashes = [
    "0123456789abcdef0123456789abcdef01234567",
    "fedcba9876543210fedcba9876543210fedcba98"
]

trackers = [
    "udp://tracker.example.com:80",
    "http://tracker.example.org:6969/announce",
    "https://private-tracker.example.net:443/YOUR_PASSKEY/announce"
]

# Get the results (timeout of 3 seconds per tracker)
results = scraper.scrape(
    hashes=infohashes,
    trackers=trackers,
    timeout=3
)

# Print the results
for infohash, data in results.items():
    print(f"Results for {infohash}:")
    print(f"  Seeders: {data['seeders']}")
    print(f"  Leechers: {data['leechers']}")
    print(f"  Completed: {data['completed']}")

# Check if there were any errors
if scraper.has_errors():
    print("\nErrors:")
    for error in scraper.get_errors():
        print(f"  {error}")

Package Structure

Scrapeer-py is organized into the following modules:

  • scrapeer/ - Main package directory
    • __init__.py - Package initialization that exports the Scraper class
    • scraper.py - Main Scraper class implementation
    • http.py - HTTP(S) protocol scraping functionality
    • udp.py - UDP protocol scraping functionality
    • utils.py - Utility functions used across the package

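Given the layout above, the Scraper class is re-exported by __init__.py, so the package-level import used throughout this README is the intended entry point. Importing from the submodule directly should also work, but treat that path as an implementation detail. A small sketch:

# Preferred: the class re-exported by scrapeer/__init__.py
from scrapeer import Scraper

# Also possible given the layout above, but an implementation detail that may change:
# from scrapeer.scraper import Scraper

scraper = Scraper()
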
API Reference

Scraper class

scrape(hashes, trackers, max_trackers=None, timeout=2, announce=False)

Scrape trackers for torrent information.

  • Parameters:

    • hashes: List of infohashes (for more than one) or a single infohash string
    • trackers: List of tracker URLs (for more than one) or a single tracker URL string
    • max_trackers: (Optional) Maximum number of trackers to scrape; defaults to all
    • timeout: (Optional) Maximum time, in seconds, allowed for each tracker scrape; defaults to 2
    • announce: (Optional) Use announce instead of scrape; defaults to False
  • Returns:

    • Dictionary of results with infohashes as keys and stats as values
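
Putting the parameters above together, here is a hedged sketch that limits the scrape to at most two of the listed trackers, raises the per-tracker timeout, and uses announce requests for trackers without scrape support (the tracker URLs and passkey are placeholders):

from scrapeer import Scraper

scraper = Scraper()

results = scraper.scrape(
    hashes="0123456789abcdef0123456789abcdef01234567",  # a single infohash may be passed as a string
    trackers=[
        "udp://tracker.example.com:80",
        "http://tracker.example.org:6969/announce",
        "https://private-tracker.example.net:443/YOUR_PASSKEY/announce",
    ],
    max_trackers=2,   # scrape at most two of the listed trackers
    timeout=5,        # allow up to 5 seconds per tracker instead of the 2-second default
    announce=True,    # use announce instead of scrape
)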

has_errors()

Checks if there are any errors.

  • Returns:
    • bool: True if errors are present, False otherwise

get_errors()

Returns all the errors that were logged.

  • Returns:
    • list: All the logged errors

Limitations

  • Maximum of 64 infohashes per request
  • Minimum of 1 infohash per request
  • Only supports BitTorrent trackers (HTTP(S) and UDP)
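
To stay within the 64-infohash limit when you have more hashes than that, one option is to split the list into batches and merge the per-batch results. The helper below is a hypothetical sketch, not part of the library:

from scrapeer import Scraper

def scrape_in_batches(hashes, trackers, batch_size=64, timeout=2):
    # Hypothetical helper (not part of scrapeer): split the hashes into chunks of
    # at most batch_size so each scrape() call stays under the 64-infohash limit.
    scraper = Scraper()
    results = {}
    for start in range(0, len(hashes), batch_size):
        batch = hashes[start:start + batch_size]
        results.update(scraper.scrape(hashes=batch, trackers=trackers, timeout=timeout))
    return results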

License

This project is licensed under the MIT License - see the LICENSE.txt file for details.
