Huge News!Announcing our $40M Series B led by Abstract Ventures.Learn More →

lighthousedataextract

Package Overview

Dependencies

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

lighthousedataextract

Google LightHouse Data Extractor

1.0.9
Source
PyPI

Maintainers: 1

LightHouse Data Extract

This tool parses the google lighthouse json data, accepts a csv file for categories of the URLs and returns 4 pandas DataFrames for metrics, opportunities, diagnostics and resources.

Install

pip install lighthousedataextract

Import

from lighthousedataextract import LightHouseDataExtract

Create a report variable

If json files are in directory ./repprt/lighthouse/ and you don't want to give an input file for categories of URLs

report = LightHouseDataExtract()

If your json files are in another directory

report = LightHouseDataExtract(
    path_to_json="./data/lighthouse/report/lighthouse/"
)

If you want to seperate URLs in categories

Your CSV of URLs should have two columns, without headers. Below you can see an example:


https://www.example.com/	Home Page
https://www.example.com/categories/category-1	Middle Tail
https://www.example.com/products/product-1234	Long Tail

report = LightHouseDataExtract(url_category_file="./data/lighthouse/category.csv")

Create a lighthouse metrics DataFrame

from lighthousedataextract import LightHouseDataExtract

report = LightHouseDataExtract(
    path_to_json="./data/lighthouse/report/lighthouse/",
    url_category_file="./data/lighthouse/category.csv",
)
df_report = report.df_report()
df_report.set_index("url").T

Create other DataFrames

df_opportunities = report.df_opportunities()
display(df_opportunities)
df_diagnostics = report.df_diagnostics()
display(df_diagnostics)
df_resources = report.df_resources()
display(df_resources)

If json files are obtained by gooogle pagespeed insights api then

api_report = LightHouseDataExtract(from_api=True)

Keywords

FAQs

What is lighthousedataextract?

Is lighthousedataextract well maintained?

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

lighthousedataextract

LightHouse Data Extract

Install

Import

Create a report variable

Create a lighthouse metrics DataFrame

Create other DataFrames

If json files are obtained by gooogle pagespeed insights api then

Keywords

Related posts

Input Validation Vulnerabilities Dominate MITRE's 2024 CWE Top 25 List

Risky Business Podcast: Why Open Source Software Needs Better Malware Tracking