Fast, flexible, and advanced augmentation library for deep learning, computer vision, and medical imaging. Albumentations offers a wide range of transformations for both 2D (images, masks, bboxes, keypoints) and 3D (volumes, volumetric masks, keypoints) data, with optimized performance and seamless integration into ML workflows.
Ultralytics YOLO 🚀 for SOTA object detection, multi-object tracking, instance segmentation, pose estimation and image classification.
Open Source Differentiable Computer Vision Library for PyTorch
Low level implementations for computer vision in Rust
A set of easy-to-use utils that will come in handy in any Computer Vision project
High-performance image processing functions for deep learning and computer vision.
Collection of common code shared among different research projects in FAIR computer vision team
QUick and DIrty Domain Adaptation
A series of convenience functions to make basic image processing functions such as translation, rotation, resizing, skeletonization, displaying Matplotlib images, sorting contours, detecting edges, and much more easier with OpenCV and both Python 2.7 and Python 3.
Image augmentation library for deep neural networks
Video scene cut/shot detection program and Python library.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration
Open Source Image and Video Super-Resolution Toolbox
The Rerun Logging SDK
FiftyOne: the open-source tool for building high-quality datasets and computer vision models
Microsoft Azure Cognitive Services Computer Vision Client Library for Python
OpenMMLab Detection Toolbox and Benchmark
A toolkit for making real world machine learning and data analysis applications
Real-ESRGAN aims at developing Practical Algorithms for General Image Restoration
Light Weight Toolkit for Bounding Boxes
With no prior knowledge of machine learning or device-specific deployment, you can deploy a computer vision model to a range of devices and environments using Roboflow Inference.
With no prior knowledge of machine learning or device-specific deployment, you can deploy a computer vision model to a range of devices and environments using Roboflow Inference.
Document Text Recognition (docTR): deep Learning for high-performance OCR on documents.
OpenMMLab Computer Vision Foundation
Open MMLab Semantic Segmentation Toolbox and Benchmark
With no prior knowledge of machine learning or device-specific deployment, you can deploy a computer vision model to a range of devices and environments using Roboflow Inference.
With no prior knowledge of machine learning or device-specific deployment, you can deploy a computer vision model to a range of devices and environments using Roboflow Inference CLI.
OpenMMLab Computer Vision Foundation
SuperGradients
Industry-strength computer Vision extensions for Keras.
Ultralytics HUB Client SDK.
OpenMMLab Model Pretraining Toolbox and Benchmark
Provides spatial maths capability for Python.
Savant Rust core functions library
OpenMMLab Image Classification Toolbox and Benchmark
Pymba is a Python wrapper for Allied Vision's Vimba C API.
A toolkit for making real world machine learning and data analysis applications
Catalyst. Accelerated deep learning R&D with PyTorch.
Mahotas: Computer Vision Library
Different ways of visualizing objects given bounding box data
Transform, analyze, and visualize computer vision annotations.
The realtime communication library for Python
RF100-VL Dataset Interface
Automation with Computer Vision for Python
OpenMMLab Pose Estimation Toolbox and Benchmark.
OpenMMLab's next-generation platformfor general 3D object detection.
TorchSeg: Semantic Segmentation models for PyTorch
AI2-THOR: A Near Photo-Realistic Interactable Framework for Embodied AI Agents