Dataget is an easy-to-use, framework-agnostic dataset library that gives you quick access to a collection of Machine Learning datasets through a simple API.
Main features:
Loads data as numpy arrays or pandas dataframes, which can be easily used with the majority of Machine Learning frameworks.

Check out the documentation for the list of available datasets.
In dataget you just have to do two things:
1. Choose a Dataset from our collection.
2. Call its get method to download the data to disk and load it into memory.

Both are usually done in one line:
import dataget
X_train, y_train, X_test, y_test = dataget.image.mnist().get()
This example downloads the MNIST dataset to ./data/image_mnist and loads it as numpy arrays.
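Since the data is written to disk, it can be handy to check whether a previous run already left files in that folder. The helper below is a minimal sketch using only the standard library; `is_downloaded` is a hypothetical name and not part of dataget, and whether dataget itself skips the download when the folder exists is not stated here.

```python
from pathlib import Path


def is_downloaded(dataset_dir="./data/image_mnist"):
    """Return True if dataset_dir exists and contains at least one entry.

    Hypothetical helper, not part of dataget. The default path is the one
    mentioned in the MNIST example above.
    """
    path = Path(dataset_dir)
    # An existing but empty directory is treated as "not downloaded".
    return path.is_dir() and any(path.iterdir())


if __name__ == "__main__":
    print("MNIST already on disk:", is_downloaded())
```

This only inspects the folder; it does not validate that the files inside are complete.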
Kaggle promotes the use of csv files and dataget loves it! With dataget you can quickly download any dataset from the platform and have immediate access to the data:
import dataget
df_train, df_test = dataget.kaggle(dataset="cristiangarcia/pointcloudmnist2d").get(
    files=["train.csv", "test.csv"]
)
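When more than a couple of files are requested, unpacking by position gets error-prone. The sketch below pairs each requested file name with the object loaded from it. It assumes, as the example above suggests, that get returns one object per requested file in the same order; `name_splits` is a hypothetical helper, not a dataget function.

```python
from pathlib import Path


def name_splits(files, frames):
    """Map each file's stem (e.g. "train") to the object loaded from it.

    Assumes `frames` holds one loaded object per entry of `files`,
    in the same order. Hypothetical helper, not part of dataget.
    """
    frames = tuple(frames)
    if len(files) != len(frames):
        raise ValueError("expected one loaded object per requested file")
    return {Path(name).stem: frame for name, frame in zip(files, frames)}
```

With the call above, `name_splits(["train.csv", "test.csv"], (df_train, df_test))` would give a dictionary with the keys "train" and "test".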
To start using Kaggle datasets, just make sure you have properly installed and configured the Kaggle API. In the future we want to expand Kaggle support in the following ways:
Load any file that numpy or pandas can read.
Load other kinds of data through a type argument, e.g. dataget.data.kaggle(..., type="image").get(...)
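Before requesting a Kaggle dataset, it may help to confirm that Kaggle API credentials are in place. The sketch below checks two commonly documented locations: the KAGGLE_USERNAME and KAGGLE_KEY environment variables, and a kaggle.json file in a .kaggle folder under the home directory. `kaggle_credentials_found` is a hypothetical helper, not part of dataget or the Kaggle client, and it does not cover other configuration options such as a custom config directory.

```python
import os
from pathlib import Path


def kaggle_credentials_found(home=None, env=None):
    """Return True if Kaggle credentials appear to be configured.

    Hypothetical helper. Checks only the two environment variables and
    <home>/.kaggle/kaggle.json; it does not validate the credentials.
    """
    home = Path(home) if home is not None else Path.home()
    env = os.environ if env is None else env
    # Both variables must be set and non-empty to count.
    has_env = bool(env.get("KAGGLE_USERNAME")) and bool(env.get("KAGGLE_KEY"))
    has_file = (home / ".kaggle" / "kaggle.json").is_file()
    return has_env or has_file


if __name__ == "__main__":
    print("Kaggle credentials found:", kaggle_credentials_found())
```

The `home` and `env` parameters exist only so the check can be exercised without touching the real home directory or environment.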
pip install dataget
Adding a new dataset is easy! Read our guide on Creating a Dataset if you are interested in contributing a dataset.
MIT License