Stratosphere
A lightweight experimentation toolkit for data scientists.
Designed to be accessibile, fast and robust. stratosphere
lets you:
- Define as code your experiments:
s.define(name="experiment", funcs=[step1, step2, step3])
- Execute them with the best-suited backend:
s.execute()
- Track what you need (predictions, metrics, ...)
- Store them as plain tables and/or pickled objects
- Query them with
SQL
, Pandas
or Python
How is it possible?
Installation
On Python >= 3.8.0
:
- With PyPI:
pip install "stratosphere[complete]" --upgrade
# Install everything - With Poetry:
poetry add stratosphere@latest --extras complete
# Install everything
On Python 3.7.13-15
(Google Colab, Binder):
!pip install joblib pandas tqdm cloudpickle colorama sqlalchemy sqlalchemy-utils ulid-py --upgrade --quiet
!pip install tabulate scikit-learn dask[complete] --upgrade --quiet # to install extras
!pip install stratosphere --ignore-requires-python --no-dependencies --quiet
You can also install only the stratosphere
library (dropping the [complete]
extras).
Modules like stratosphere.utils.dask
, stratosphere.utils.metrics
, and stratosphere.utils.widgets
won't work until you also install dask[complete]
, scikit-learn
, and ipywidgets
, respectively.
These extras are required to run some of the tutorial notebooks.
Documentation
- In the intro notebook, you'll be exposed to the key concepts. It runs entirely in the browser.
- In the tutorial notebooks, you'll cover the advanced topics. Working locally, on Colab and on Binder.
Project pages
License
This project is licensed under the terms of the BSD 3-Clause License.
Development
See the development page.
Contributing
Work in progress!