TorchX
TorchX is a universal job launcher for PyTorch applications.
TorchX is designed to have fast iteration time for training/research and support
for E2E production ML pipelines when you're ready.
TorchX currently supports:
- Kubernetes (EKS, GKE, AKS, etc)
- Slurm
- AWS Batch
- Docker
- Local
- Ray (prototype)
- GCP Batch (prototype)
Need a scheduler not listed? Let us know!
Quickstart
See the quickstart guide.
Documentation
Requirements
torchx:
- python3 (3.8+)
- PyTorch
- optional: Docker (needed for docker based schedulers)
Certain schedulers may require scheduler specific requirements. See installation
for info.
Installation
Stable
pip install torchx
pip install "torchx[dev]"
pip install "torchx[kfp]"
pip install "torchx[kubernetes]"
pip install "torchx[ray]"
pip install "torchx[gcp_batch]"
Nightly
pip install torchx-nightly[dev]
Source
$ pip install -e git+https://github.com/pytorch/torchx.git
$ pip install -e git+https://github.com/pytorch/torchx.git
Docker
TorchX provides a docker container for using as as part of a TorchX role.
See: https://github.com/pytorch/torchx/pkgs/container/torchx
Contributing
We welcome PRs! See the CONTRIBUTING file.
License
TorchX is BSD licensed, as found in the LICENSE file.