You're Invited:Meet the Socket Team at BlackHat and DEF CON in Las Vegas, Aug 4-6.RSVP →

Book a Demo Install Sign in

mlbench-core

Package Overview

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

mlbench-core

A public and reproducible collection of reference implementations and benchmark suite for distributed machine learning systems.

3.0.0

PyPI

Maintainers: 4

mlbench-core: Distributed Machine Learning Benchmark Core Library

MLBench is a Benchmarking Framework for Distributed Machine Learning algorithms.

This repository contains the core Python library for MLBench which is used to share code between Benchmark implementations as well as for communication with the dashboard.

For more information refer to the MLBench Core Documentation or the Main Documentation

Changelog

v3.0.0 (2020-12-07)

Full Changelog

Implemented enhancements:

Support multiple clusters in CLI #91
Add notebook/code to visualize results #72
Support AWS in CLI #33
Fix rnn language model #303 (ehoelzl)
Transformer language translation #99 (ehoelzl)

Fixed bugs:

Training code keeps running for PyTorch after training is done #26

Closed issues:

Remove loss argument for metric computation #295
Update PyTorch to 1.7 #286
Refactor optimizer and chose more appropriate names #284
fails to create kind cluster #277
Refactor CLI #253
Dependabot couldn't authenticate with https://pypi.python.org/simple/ #252
Unify requirements/setup.py versions #244
isort failing on all PRs #227
torch.div is not supported in PyTorch 1.6 #223
Refactor common functionality for tiller and helm #108
Add GPU support for AWS in CLI #104
Change CPU limit to #CPUs - 1 #101
Add --version flag #97
Cluster creation/deletion errors with non-default zone #94
Add command to list runs #86
RefreshError from gcloud #83
Run new benchmarks and document costs #82
Make nvidia k80 default GPU #80
Fix random seeds #79
benchmark against torch.nn.parallel.DistributedDataParallel MPSG #75
upgrade to pytorch 1.5 #74
Provide comparison to competitors #66
Add some integration tests #64
Remove stale branches #62
Add PowerSGD optimizer #59
Add RNN Language Model #54
Use torch.nn.DataParallel for intra-node computation #46
Add CLI support for DIND #42
Port over functionality from Language Model benchmark to the core library #34
make results reproducible from command-line #24
Contribution and docs section on README.md #17
test new torch.distributed #15

Merged pull requests:

Bugfix KIND cli #307 (ehoelzl)
Update README.md to show new badge #306 (ehoelzl)
Create manual.yml #305 (ehoelzl)
Switch to github actions #304 (ehoelzl)
Bump sphinx from 3.3.0 to 3.3.1 #301 (dependabot[bot])
Remove loss from metric argument #297 (ehoelzl)
Fix translators #294 (ehoelzl)
Update pytorch #292 (ehoelzl)
Bump sphinx from 3.2.1 to 3.3.0 in /docs #288 (dependabot[bot])
Refactor optimizers #285 (ehoelzl)
Bump isort from 5.5.4 to 5.6.4 #283 (dependabot[bot])
Bump sphinx-autoapi from 1.5.0 to 1.5.1 #280 (dependabot[bot])
Add gpu functionality on AWS #278 (mmilenkoski)
Catch exceptions when creating/deleting clusters #276 (ehoelzl)
Fix doc #275 (ehoelzl)
Fix AWS deployment #274 (mmilenkoski)
Create dependabot.yml #260 (ehoelzl)
Merge requirements & Update doc #259 (ehoelzl)
Bump google-api-python-client from 1.9.3 to 1.12.1 #246 (dependabot-preview[bot])
Bump numpy from 1.19.0 to 1.19.2 #245 (dependabot-preview[bot])
Bump boto3 from 1.14.6 to 1.14.50 #234 (dependabot-preview[bot])
Fix isort errors #233 (mmilenkoski)
Bump pytest-mock from 3.1.1 to 3.3.1 #231 (dependabot-preview[bot])
Bump isort from 4.3.21 to 5.4.2 #221 (dependabot-preview[bot])
Bump sphinx from 3.0.4 to 3.2.1 #220 (dependabot-preview[bot])
Bump grpcio from 1.29.0 to 1.31.0 #207 (dependabot-preview[bot])
Bump spacy from 2.3.0 to 2.3.2 #182 (dependabot-preview[bot])
Downgrade Sphinx #162 (ehoelzl)
Add developer docs #161 (Panaetius)
Fp optimizer changes #160 (ehoelzl)
Bump wcwidth from 0.1.9 to 0.2.5 #156 (dependabot-preview[bot])
Bump all versions and add doc test #152 (Panaetius)
Bump torchvision from 0.6.0 to 0.6.1 #151 (dependabot-preview[bot])
Bump numpy from 1.18.5 to 1.19.0 #150 (dependabot-preview[bot])
Bump torch from 1.5.0 to 1.5.1 #148 (dependabot-preview[bot])
Bump google-auth from 1.17.2 to 1.18.0 #147 (dependabot-preview[bot])
Bump sphinx-rtd-theme from 0.4.3 to 0.5.0 #144 (dependabot-preview[bot])
Bump spacy from 2.2.4 to 2.3.0 #142 (dependabot-preview[bot])
Bump sphinx from 3.1.0 to 3.1.1 #140 (dependabot-preview[bot])
Bump dill from 0.3.1.1 to 0.3.2 #138 (dependabot-preview[bot])
Update dependencies #137 (Panaetius)
Bump spacy from 2.2.3 to 2.2.4 #135 (dependabot-preview[bot])
Bump numpy from 1.16.6 to 1.18.5 #133 (dependabot-preview[bot])
Bump freezegun from 0.3.12 to 0.3.15 #129 (dependabot-preview[bot])
Bump tabulate from 0.8.6 to 0.8.7 #128 (dependabot-preview[bot])
Bump deprecation from 2.0.6 to 2.1.0 #125 (dependabot-preview[bot])
Bump pytest-black from 0.3.8 to 0.3.9 #124 (dependabot-preview[bot])
Bump sphinx-rtd-theme from 0.4.2 to 0.4.3 #123 (dependabot-preview[bot])
Bump sphinx from 1.8.1 to 3.1.0 #121 (dependabot-preview[bot])
Bump pytest-mock from 1.10.0 to 3.1.1 #120 (dependabot-preview[bot])
Bump torchtext from 0.5.0 to 0.6.0 #118 (dependabot-preview[bot])
Bump torchvision from 0.5.0 to 0.6.0 #117 (dependabot-preview[bot])
Adds support for multiple clusters #115 (Panaetius)
Bump click from 7.0 to 7.1.2 #114 (dependabot-preview[bot])
Bump google-cloud-container from 0.3.0 to 0.5.0 #113 (dependabot-preview[bot])
Bump appdirs from 1.4.3 to 1.4.4 #112 (dependabot-preview[bot])
Bump sphinxcontrib-bibtex from 0.4.0 to 1.0.0 #111 (dependabot-preview[bot])
Bump sphinx-autoapi from 1.3.0 to 1.4.0 #110 (dependabot-preview[bot])
Remove unused arguments in create_aws #109 (mmilenkoski)
Fix Random seeds, Add new tracker stats #107 (ehoelzl)
Add return_code check in test_cli #106 (mmilenkoski)
Add AWS support in CLI #103 (mmilenkoski)
Update test_cli.py #100 (giorgiosav)
Adds a chart command to cli #95 (Panaetius)
Add support for kind cluster creation in the CLI #93 (mmilenkoski)

Changelog

v2.4.0 (2020-04-20)

Full Changelog

Implemented enhancements:

Switch to black for code formatting #35

Closed issues:

Travis tests run only for Python 3.6 #65
Downloading results fails if --output option is not provided #57
Remember user input in mlbench run #56
Aggregate the gradients by model, instead of by layers. #45
Update docker images to CUDA10, mlbench-core module to newest #43
Upgrade PyTorch to 1.4 #40

Merged pull requests:

Pytorch v1.4.0 #68 (ehoelzl)
Fix ci #67 (ehoelzl)
Add aggregation by model #61 (ehoelzl)
Remember user input in mlbench run #60 (mmilenkoski)
Add default name of output file in CLI #58 (mmilenkoski)
Cli adaptation #55 (ehoelzl)
Update tags and patch version to 2.3.2 #52 (ehoelzl)
Add get_optimizer to create optimizer object #48 (mmilenkoski)

Changelog

v2.3.2 (2020-04-07)

Full Changelog

Implemented enhancements:

Add NCCL & GLOO Backend support #49
Add NCCL & GLOO Backend support #47 (giorgiosav)

Fixed bugs:

math ValueError with 1-node cluster #38

Merged pull requests:

num_workers fix #51 (giorgiosav)
Adds centralized Adam implementation #41 (mmilenkoski)

Change Log

2.3.1 (2020-03-09)

Full Changelog

Implemented enhancements:

Customize Communication Scheme For Sparsified/Quantizatized/Decentralized scenarios #12

v2.3.0 (2019-12-23)

Full Changelog

v2.2.1 (2019-12-16)

Full Changelog

Change Log

v2.2.0 (2019-11-11)

Full Changelog

Implemented enhancements:

initialize_backends can now be called as context manager
Improved CLI to run multiple runs in parallel

v2.1.1 (2019-11-11)

Full Changelog

v2.1.0 (2019-11-4)

Full Changelog

Implemented enhancements:

Added CLI for MLBench runs

v1.4.4 (2019-05-28)

Full Changelog

v1.4.3 (2019-05-23)

Full Changelog

v1.4.2 (2019-05-21)

Full Changelog

v1.4.1 (2019-05-16)

Full Changelog

v1.4.0 (2019-05-02)

Full Changelog

Implemented enhancements:

Split Train and Validation in Tensorflow #22

v1.3.4 (2019-03-20)

Full Changelog

Implemented enhancements:

in controlflow, don't mix train and validation #20

Fixed bugs:

Add metrics logging for Tensorflow #19

v1.3.3 (2019-02-26)

Full Changelog

v1.3.2 (2019-02-13)

Full Changelog

v1.3.1 (2019-02-13)

Full Changelog

v1.3.0 (2019-02-12)

Full Changelog

v1.2.1 (2019-01-31)

Full Changelog

v1.2.0 (2019-01-30)

Full Changelog

v1.1.1 (2019-01-09)

Full Changelog

v1.1.0 (2018-12-06)

Full Changelog

Fixed bugs:

Bug when saving checkpoints #13

v1.0.0 (2018-11-20)

Full Changelog

Implemented enhancements:

Add API Client to mlbench-core #6
Move to google-style docs #4
Add Imagenet Dataset for pytorch #3
Move worker code to mlbench-core repo #1

Change Log

1.4.2 (2019-05-21)

Full Changelog

Implemented enhancements:

Split Train and Validation in Tensorflow #22
in controlflow, don't mix train and validation #20

Fixed bugs:

Add metrics logging for Tensorflow #19
Bug when saving checkpoints #13

Change Log

v1.4.1 (2019-05-16)

Full Changelog

1.4.0 (2019-05-02)

Full Changelog

Implemented enhancements:

Split Train and Validation in Tensorflow #22
in controlflow, don't mix train and validation #20

Fixed bugs:

Add metrics logging for Tensorflow #19
Bug when saving checkpoints #13

Change Log

v1.3.4 (2019-03-20)

Full Changelog

Implemented enhancements:

in controlflow, don't mix train and validation #20

Fixed bugs:

Add metrics logging for Tensorflow #19

v1.3.3 (2019-02-26)

Full Changelog

v1.3.2 (2019-02-13)

Full Changelog

v1.3.1 (2019-02-13)

Full Changelog

v1.3.0 (2019-02-12)

Full Changelog

v1.2.1 (2019-01-31)

Full Changelog

v1.2.0 (2019-01-30)

Full Changelog

v1.1.1 (2019-01-09)

Full Changelog

Change Log

v1.1.0 (2018-12-06)

Full Changelog

Fixed bugs:

Bug when saving checkpoints #13
Adds Tensorflow Controlflow, Dataset and Model code
Adds Pytorch linear models
Adds sparsified and decentralized optimizers

v1.0.0 (2018-11-15)

Implemented enhancements:

Add API Client to mlbench-core #6
Move to google-style docs #4
Add Imagenet Dataset for pytorch #3
Move worker code to mlbench-core repo #1

0.1.0 (2018-09-14)

Implemented enhancements:

Add documentation in reference implementation to docs #46
Replace cAdvisor with Kubernetes stats for Resource usage #38
Rename folders #31
Change docker image names #30
Add continuous output for mpirun #27
Replace SQlite with Postgres #25
Fix unittest #23
Add/Fix CI/Automated build #22
Cleanup unneeded project files #21
Remove hardcoded values #20
Improves Notes.txt #19
Rename components #15

Fixed bugs:

504 Error when downloading metrics for long runs #61

Closed issues:

small doc improvements for first release #54
Check mlbench works on Google Cloud #51
learning rate scheduler #50
Add Nvidia k8s-device-plugin to charts #48
Add Weave to Helm Chart #41
Allow limiting of resources for experiments #39
Allow downloading of Run measurements #35
Worker Details page #33
Run Visualizations #32
Show experiment history in Dashboard #18
Show model progress in Dashboard #13
Report cluster status in Dashboard #12
Send metrics from SGD example to metrics api #11
Add metrics endpoint for experiments #10
Let Coordinator Dashboard start a distributed Experiment #9
Add mini-batch SGD model experiment #8
add benchmark code for MPI #7
add benchmark code for tensorflow #6
add benchmark code for apache reef #5
add benchmark code for apache flink #4
get initial benchmark numbers (spark reference implementation and mllib/ml) #3
evaluate script (framework-independent) and algorithm output format #2
bench-spark: remove prepare-data for now, comment on solver prequisites #1

* This Change Log was automatically generated by github_changelog_generator

* This Changelog was automatically generated by github_changelog_generator

Keywords

mlbench

FAQs

What is mlbench-core?

Is mlbench-core well maintained?

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

mlbench-core

mlbench-core: Distributed Machine Learning Benchmark Core Library

Changelog

v3.0.0 (2020-12-07)

Changelog

v2.4.0 (2020-04-20)

Changelog

v2.3.2 (2020-04-07)

Change Log

2.3.1 (2020-03-09)

v2.3.0 (2019-12-23)

v2.2.1 (2019-12-16)

Change Log

v2.2.0 (2019-11-11)

v2.1.1 (2019-11-11)

v2.1.0 (2019-11-4)

v1.4.4 (2019-05-28)

v1.4.3 (2019-05-23)

v1.4.2 (2019-05-21)

v1.4.1 (2019-05-16)

v1.4.0 (2019-05-02)

v1.3.4 (2019-03-20)

v1.3.3 (2019-02-26)

v1.3.2 (2019-02-13)

v1.3.1 (2019-02-13)

v1.3.0 (2019-02-12)

v1.2.1 (2019-01-31)

v1.2.0 (2019-01-30)

v1.1.1 (2019-01-09)

v1.1.0 (2018-12-06)

v1.0.0 (2018-11-20)

Change Log

1.4.2 (2019-05-21)

Change Log

v1.4.1 (2019-05-16)

1.4.0 (2019-05-02)

Change Log

v1.3.4 (2019-03-20)

v1.3.3 (2019-02-26)

v1.3.2 (2019-02-13)

v1.3.1 (2019-02-13)

v1.3.0 (2019-02-12)

v1.2.1 (2019-01-31)

v1.2.0 (2019-01-30)

v1.1.1 (2019-01-09)

Change Log

v1.1.0 (2018-12-06)

v1.0.0 (2018-11-15)

0.1.0 (2018-09-14)

Keywords

Related posts

Introducing License Overlays: Smarter License Management for Real-World Code

Introducing Rust Support in Socket