luisy

Framework to build data pipelines

Version 1.4.6 on PyPI


This tool is an extension for the Python framework luigi that helps to build reproducible and complex data pipelines for batch jobs. Visit our docs to learn more!

How to use?

This is what an end-to-end luisy pipeline may look like:

    import luisy
    import pandas as pd
    
    @luisy.raw
    @luisy.csv_output(delimiter=',')
    class InputFile(luisy.ExternalTask):
        label = luisy.Parameter()
    
        def get_file_name(self): 
            return f"file_{self.label}"
    
    @luisy.interim
    @luisy.requires(InputFile)
    class ProcessedFile(luisy.Task):
        def run(self):
            df = self.input().read()
            # Further preprocessing steps
            # ...
            # Write to disk
            self.write(df)
    
    @luisy.final
    class MergedFile(luisy.ConcatenationTask):
        def requires(self):
            for label in ['a', 'b', 'c', 'd']:
                yield ProcessedFile(label=label)
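
To make the data flow of the pipeline above easier to follow, here is a minimal sketch in plain pandas. It does not use luisy and is not how luisy is implemented. It assumes that `ConcatenationTask` stacks the outputs of its required tasks row-wise, which the name suggests but this README does not state. The names `RAW_FILES`, `read_input`, `process`, and `merge`, as well as the sample data, are hypothetical.

```python
import io

import pandas as pd

# Hypothetical in-memory stand-ins for the raw CSV files file_a ... file_d.
RAW_FILES = {
    label: f"id,value\n1,{label}1\n2,{label}2\n"
    for label in ["a", "b", "c", "d"]
}


def read_input(label: str) -> pd.DataFrame:
    """Plays the role of InputFile: read one comma-delimited raw file."""
    return pd.read_csv(io.StringIO(RAW_FILES[label]), delimiter=",")


def process(df: pd.DataFrame, label: str) -> pd.DataFrame:
    """Plays the role of ProcessedFile.run: a placeholder preprocessing step."""
    return df.assign(label=label)


def merge(labels) -> pd.DataFrame:
    """Plays the role of MergedFile: stack the processed frames row-wise
    (an assumption about what ConcatenationTask does)."""
    frames = [process(read_input(label), label) for label in labels]
    return pd.concat(frames, ignore_index=True)


merged = merge(["a", "b", "c", "d"])
print(merged)
```

In the luisy version, each of these steps is a task whose output is persisted, so the framework can decide which steps need to be rerun. The sketch simply recomputes everything on each call.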

How to install?

Stable branch: main

Minimum Python version: 3.8

Install luisy with:

    pip install luisy

How to test?

To run all unit tests in the tests directory, use the following command:

    pytest

How to contribute?

Please have a look at our contribution guide.

Third-Party Licenses

Runtime dependencies

| Name | License | Type |
| --- | --- | --- |
| numpy | BSD-3-Clause License | Dependency |
| pandas | BSD 3-Clause License | Dependency |
| networkx | BSD-3-Clause License | Dependency |
| luigi | Apache License 2.0 | Dependency |
| distlib | Python license | Dependency |
| matplotlib | Other | Dependency |
| azure-storage-blob | MIT License | Dependency |
| tables | BSD license | Dependency |
| pipdeptree | MIT License | Dependency |
| requirements-parser | Apache License 2.0 | Dependency |
| pyarrow | Apache License 2.0 | Dependency |
| spark | Apache License 2.0 | Dependency |

Development dependencies

| Name | License | Type |
| --- | --- | --- |
| sphinx | BSD-2-Clause | Dependency |
| sphinx_rtd_theme | MIT License | Dependency |
| flake8 | MIT License | Dependency |
| pytest | MIT License | Dependency |
| pytest-flake8 | BSD License | Dependency |
| pytest-cov | MIT License | Dependency |
| pip-tools | BSD 3-Clause License | Dependency |
