🚀 Big News: Socket Acquires Coana to Bring Reachability Analysis to Every Appsec Team.Learn more
Socket
Book a DemoInstallSign in
Socket

luisy

Package Overview
Dependencies
Maintainers
1
Alerts
File Explorer

Advanced tools

Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

luisy

Framework to build data pipelines

1.4.7
PyPI
Maintainers
1

Project Status: Archived

This repository is no longer actively maintained. Due to shifting priorities and limited resources, we have decided to archive the repository and discontinue further development and maintenance.

What this means:

  • No new features or updates will be added.
  • Issues and pull requests will no longer be reviewed or responded to.
  • You are welcome to fork the project and continue development under your own maintenance.

luisy

Test
Package Test
docs PyPI

This tool is an extension for the Python Framework luigi which helps to build reproducable and complex data pipelines for batch jobs. Visit our docs to learn more!

How to use?

This is how an end-to-end luisy pipeline may look like:

    import luisy
    import pandas as pd
    
    @luisy.raw
    @luisy.csv_output(delimiter=',')
    class InputFile(luisy.ExternalTask):
        label = luisy.Parameter()
    
        def get_file_name(self): 
            return f"file_{self.label}"
    
    @luisy.interim
    @luisy.requires(InputFile)
    class ProcessedFile(luisy.Task):
        def run(self):
            df = self.input().read()
            # Some more preprocessings
            # ...
            # Write to disk
            self.write(df)
    
    @luisy.final
    class MergedFile(luisy.ConcatenationTask):
        def requires(self):
            for label in ['a', 'b', 'c', 'd']:
                yield ProcessedFile(label=label)

How to install?

Stable Branch: main

Minimum python version: 3.8

Install luisy with

pip install luisy

How to test?

To run all unittests that are inside the tests directory use the following command:

pytest

How to contribute?

Please have a look at our contribution guide.

Third-Party Licenses

Runtime dependencies

NameLicenseType
numpyBSD-3-Clause LicenseDependency
pandasBSD 3-Clause LicenseDependency
networkxBSD-3-Clause LicenseDependency
luigiApache License 2.0Dependency
distlibPython licenseDependency
matplotlibOtherDependency
azure-storage-blobMIT LicenseDependency
tablesBSD licenseDependency
pipdeptreeMIT LicenseDependency
requirements-parserApache License 2.0Dependency
pyarrowApache License 2.0Dependency
sparkApache License 2.0Dependency

Development dependency

NameLicenseType
sphinxBSD-2-ClauseDependency
sphinx_rtd_themeMIT LicenseDependency
flake8MIT LicenseDependency
pytestMIT LicenseDependency
pytest-flake8BSD LicenseDependency
pytest-covMIT LicenseDependency
pip-toolsBSD 3-Clause LicenseDependency

FAQs

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts