DATATOOLBOX
This is a python package for handling global datasets. It contains the following features:
- Augumented pandas DataFrames adding meta data
- Automatic unit conversion and dataframe based computations
- ID based data structure
This package is under development and serves as a collection of tools around various data analysis packages.
The package is developed in-house and supported by Climate Analytics gGmbH and therefore mainly used in the context of climate change mitigation.
The included csv-based git database structure allows multi-user access to unified and version-controlled data sets.
Data access is locally controlled by define data IDs and globally via dataset-IDs.
Authors:
- Andreas Geiges
- Jonas Hörsch
- Gaurav Ganti
Dependencies
see environment.yml or setup.py
Installation via pip
Using pip:
pip install datatoolbox
Using conda/mamba
conda/mamba install datatoolbox
Read the docs
https://datatoolbox.readthedocs.io/en/latest/
Testing
From the root directory, run:
pytest