You're Invited:Meet the Socket Team at BlackHat and DEF CON in Las Vegas, Aug 4-6.RSVP →

Book a Demo Install Sign in

LightYtSearch

Package Overview

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

LightYtSearch

Lightweight YouTube search scraper without using the official API

0.3.0

PyPI

Maintainers: 1

🔍 LightYtSearch

✨ A lightweight Python package to search YouTube without using the official API ✨

📋 Table of Contents

✨ Features

🔍 Search for videos, playlists, and movies on YouTube
📊 Extract detailed information including titles, channels, view counts, and more
🔑 No API key required
🌈 Colorful command-line interface
📄 JSON output support
🌐 Configurable search parameters (language, region, etc.)
🛡️ Fault-tolerant with retry capabilities
🗂️ Save raw YouTube data for further analysis
🖼️ Proxy support
🔄 User-agent rotation

🚀 Installation

Install from PyPI using pip:

pip install LightYtSearch

Or install the development version from GitHub:

pip install git+https://github.com/Arrowar/LightYtSearch.git

🏁 Quick Start

from LightYtSearch import search_youtube

# Basic search
results = search_youtube("python tutorial", max_results=5)
print(f"Found {len(results)} results")

# Print titles and URLs
for item in results:
    print(f"{item['type']}: {item['title']}")
    print(f"URL: {item['url']}")
    print("---")

📖 Usage

🐍 As a Python Module

from LightYtSearch import search_youtube

# Search for "python tutorial" and get up to 5 results
results = search_youtube("python tutorial", max_results=5)

# Process the results
for item in results:
    print(f"{item['type']}: {item['title']}")
    print(f"URL: {item['url']}")
    print("---")

⚙️ Function Parameters

The search_youtube() function accepts the following parameters:

Parameter	Type	Description	Default
`query`	str	The search term to look for on YouTube	(Required)
`max_results`	int	Maximum number of results to return	5
`filter_type`	str	Filter results by type ('video', 'playlist', 'movie')	None
`timeout`	int	Request timeout in seconds	10
`language`	str	Language code for search results	'en'
`region`	str	Region code for search results	'US'
`save_json`	bool	Whether to save results to a JSON file	False
`output_file`	str	Path to save results JSON	'results.json'
`verbose`	bool	Whether to print progress and results to console	True
`showResults`	bool	Whether to display search results in terminal	True
`retry_count`	int	Number of retries if request fails	3
`retry_delay`	int	Delay between retries in seconds	2
`showTimeExecution`	bool	Display execution time for each major process	False
`save_raw_data`	bool	Save raw YouTube data to JSON file	False
`raw_data_dir`	str	Directory to save raw YouTube data	'./raw_data'

Note: The maximum possible value for max_results is 20 due to YouTube's page limitations.

💻 Command-line Interface

# Basic search
LightYtSearch "python tutorial"

# Get 10 results (maximum is 20)
LightYtSearch "python tutorial" -n 10

# Export to JSON
LightYtSearch "python tutorial" -j > results.json

# Save to a file
LightYtSearch "python tutorial" -s -o my_results.json

# Save raw YouTube data
LightYtSearch "python tutorial" --save-raw-data --raw-data-dir ./my_raw_data

# Filter by type
LightYtSearch "python tutorial" --filter video

# Set custom timeout, language, and region
LightYtSearch "python tutorial" --timeout 15 --language fr --region FR

# Configure retry behavior
LightYtSearch "python tutorial" --retry-count 5 --retry-delay 3

# Show execution time
LightYtSearch "python tutorial" --time

🧰 CLI Arguments

Argument	Short	Description	Default
`query`		Search term (required)
`--max-results`	`-n`	Maximum number of results	5
`--json`	`-j`	Output results as JSON
`--save`	`-s`	Save results to a file
`--output`	`-o`	Output filename	results.json
`--quiet`	`-q`	Quiet mode (minimal output)
`--version`	`-v`	Show version information
`--filter`		Filter results by type
`--timeout`		Request timeout in seconds	10
`--language`		Language code	en
`--region`		Region code	US
`--retry-count`		Number of retries	3
`--retry-delay`		Delay between retries in seconds	2
`--time`		Show execution time
`--save-raw-data`		Save raw YouTube data to JSON file	False
`--no-save-raw-data`		Do not save raw YouTube data
`--raw-data-dir`		Directory to save raw YouTube data	./raw_data

🔄 Data Structure

The function returns a list of dictionaries with different structures depending on the item type:

🎬 Video Results

{
    'type': 'video',
    'id': 'video_id',
    'title': 'Video title',
    'channel': {
        'name': 'Channel name',
        'id': 'channel_id',
        'url': 'https://www.youtube.com/channel/channel_id',
        'thumbnail': 'Channel thumbnail URL',
        'badges': ['Verified', 'Official Artist', etc.]
    },
    'views': {
        'full': '1,234,567 views',
        'short': '1.2M views'
    },
    'published': '2 years ago',
    'duration': '12:34',
    'description': 'Video description snippet',
    'thumbnails': [thumbnail objects],
    'rich_thumbnail': 'Moving thumbnail URL',
    'flags': {
        'is_live': False,
        'is_upcoming': False,
        'has_closed_captions': True
    },
    'upcoming_date': None,
    'overlay_icons': ['4K', 'HD', etc.],
    'url': 'https://www.youtube.com/watch?v=video_id'
}

📑 Playlist Results

{
    'type': 'playlist',
    'id': 'playlist_id',
    'title': 'Playlist title',
    'channel': {
        'name': 'Channel name',
        'id': 'channel_id',
        'url': 'https://www.youtube.com/channel/channel_id'
    },
    'video_count': '42 videos',
    'thumbnails': [thumbnail objects],
    'video_previews': [
        {
            'title': 'Video title',
            'id': 'video_id',
            'url': 'https://www.youtube.com/watch?v=video_id'
        }
    ],
    'url': 'https://www.youtube.com/playlist?list=playlist_id'
}

🎥 Movie Results

{
    'type': 'movie',
    'id': 'movie_id',
    'title': 'Movie title',
    'description': 'Movie description',
    'duration': '1:23:45',
    'publisher': 'Studio name',
    'metadata': ['2023', 'PG-13', etc.],
    'bottom_metadata': ['Adventure', 'Action', etc.],
    'thumbnails': [thumbnail objects],
    'is_vertical_poster': False,
    'badges': ['4K', 'HD', etc.],
    'offers': ['Rent $3.99', 'Buy $14.99', etc.],
    'url': 'https://www.youtube.com/watch?v=movie_id'
}

📝 Examples

🔍 Searching for Videos Only

from LightYtSearch import search_youtube

# Search for videos only
videos = search_youtube("python programming", filter_type="video", max_results=10)
for video in videos:
    print(f"Title: {video['title']}")
    print(f"Channel: {video['channel']['name']}")
    print(f"Views: {video['views']['full']}")
    print(f"Duration: {video['duration']}")
    print(f"URL: {video['url']}")
    print("---")

💾 Saving Results to a JSON File

from LightYtSearch import search_youtube

# Search and save to custom file
search_youtube("machine learning", max_results=15, save_json=True, output_file="ml_videos.json")

🌐 Using Different Region and Language

from LightYtSearch import search_youtube

# Search in Italian from Italy
results_it = search_youtube("ricette pasta", language="it", region="IT", max_results=5)

# Search in Spanish from Mexico
results_es = search_youtube("recetas mexicanas", language="es", region="MX", max_results=5)

🗂️ Saving Raw YouTube Data

from LightYtSearch import search_youtube

# Save raw YouTube data to a custom directory
search_youtube("data science", save_raw_data=True, raw_data_dir="./data_science_raw")

⚠️ Limitations

Maximum Results: This library can extract a maximum of 20 results per search query due to YouTube's initial page load limitations.
No Pagination: Currently doesn't support fetching more than the initial results page.
Subject to Changes: YouTube's HTML structure might change, which could affect the scraping functionality.
Rate Limiting: Excessive use might trigger YouTube's rate limiting mechanisms.
No Official Support: This is not using the official YouTube API and is therefore not officially supported by YouTube.

👏 Acknowledgments

Inspired by the need for a lightweight YouTube search solution without API key requirements
Thanks to all contributors who have helped shape this project
This library is for educational purposes and personal use

FAQs

What is LightYtSearch?

Is LightYtSearch well maintained?

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

LightYtSearch

🔍 LightYtSearch

✨ A lightweight Python package to search YouTube without using the official API ✨

📋 Table of Contents

✨ Features

🚀 Installation

🏁 Quick Start

📖 Usage

🐍 As a Python Module

⚙️ Function Parameters

💻 Command-line Interface

🧰 CLI Arguments

🔄 Data Structure

🎬 Video Results

📑 Playlist Results

🎥 Movie Results

📝 Examples

🔍 Searching for Videos Only

💾 Saving Results to a JSON File

🌐 Using Different Region and Language

🗂️ Saving Raw YouTube Data

⚠️ Limitations

👏 Acknowledgments

Related posts

Introducing License Overlays: Smarter License Management for Real-World Code

Introducing Rust Support in Socket