data_science_toolbox
=====================

Various code to aid in data science projects for tasks involving data cleaning, ETL, EDA, NLP, viz, feature engineering, feature selection, model training and validation, etc.

Installation

Using pip

You can install the package with the pip package manager by running:

pip install data-science-toolbox
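
To confirm that the install succeeded, one option is to look up the installed version with the standard library. This is a minimal sketch, not part of the package: the helper name `installed_version` is hypothetical, and the distribution name is simply the one used in the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name: str):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # "data-science-toolbox" is the name passed to pip in the command above.
    found = installed_version("data-science-toolbox")
    print(found if found is not None else "data-science-toolbox is not installed")
```

Returning `None` instead of raising keeps the check usable in setup scripts that only need a yes/no answer.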

Project Organization


├── README.md
├── data_science_toolbox   <- Project source code
│   │
│   ├── gists              <- Code gists with commonly used code (change to root
│   │                         directory, connect to database, profile data, etc)
│   ├── data_checks        <- Code for data checks and assertions
│   ├── io                 <- Code for input/output utilities
│   ├── etl                <- For building reproducible ETL pipelines, including data
│   │                         checks and transformers
│   ├── ml                 <- Machine Learning utility code (feature engineering, etc)
│   ├── pandas             <- Pandas related utility code
│   │   ├── analysis
│   │   ├── cleaning
│   │   ├── engineering
│   │   ├── text
│   │   ├── datetime
│   │   ├── optimization
│   │   └── profiling
│   ├── project_utils.py   <- For project specific utilities
│   │
│   ├── text               <- Code for dealing with text. Includes distributed loading of text corpus,
│   │                         entity statement extraction, sentiment analysis, pii removal etc.
│   └── __init__.py        <- Makes data_science_toolbox a Python module
├── tests                  <- Pytest unit tests
├── dist                   <- tars and whls of version builds
├── LICENSE
├── poetry.lock
└── pyproject.toml
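
The layout above suggests that each directory under `data_science_toolbox` is importable as a subpackage. The sketch below checks which of them Python can locate in the current environment. It is an illustration only: the helper `available_subpackages` is hypothetical, the subpackage names are copied from the tree, and whether each one imports cleanly in a given release is an assumption.

```python
import importlib.util

# Subpackage names copied from the project tree; the top-level import name
# data_science_toolbox is also taken from the tree.
SUBPACKAGES = ["gists", "data_checks", "io", "etl", "ml", "pandas", "text"]


def available_subpackages(package, names):
    """Map each dotted subpackage name to True if Python can locate it."""
    result = {}
    for name in names:
        dotted = f"{package}.{name}"
        try:
            # find_spec imports the parent package in order to resolve a dotted name.
            result[dotted] = importlib.util.find_spec(dotted) is not None
        except ImportError:
            # Raised when the parent package is missing or fails to import.
            result[dotted] = False
    return result


if __name__ == "__main__":
    for dotted, found in available_subpackages("data_science_toolbox", SUBPACKAGES).items():
        print(f"{dotted}: {'found' if found else 'missing'}")
```

Because `find_spec` only locates a subpackage without executing it, a "found" result does not guarantee that the subpackage's own dependencies are installed.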
