Fast, flexible, and advanced augmentation library for deep learning, computer vision, and medical imaging. Albumentations offers a wide range of transformations for both 2D (images, masks, bboxes, keypoints) and 3D (volumes, volumetric masks, keypoints) data, with optimized performance and seamless integration into ML workflows.
Ultralytics YOLO 🚀 for SOTA object detection, multi-object tracking, instance segmentation, pose estimation and image classification.
Open Source Differentiable Computer Vision Library for PyTorch
High-performance image processing functions for deep learning and computer vision.
A set of easy-to-use utils that will come in handy in any Computer Vision project
Low level implementations for computer vision in Rust
Video scene cut/shot detection program and Python library.
Collection of common code shared among different research projects in FAIR computer vision team
Image augmentation library for deep neural networks
QUick and DIrty Domain Adaptation
A series of convenience functions to make basic image processing functions such as translation, rotation, resizing, skeletonization, displaying Matplotlib images, sorting contours, detecting edges, and much more easier with OpenCV and both Python 2.7 and Python 3.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration
Open Source Image and Video Super-Resolution Toolbox
The Rerun Logging SDK
Microsoft Azure Cognitive Services Computer Vision Client Library for Python
OpenMMLab Detection Toolbox and Benchmark
A toolkit for making real world machine learning and data analysis applications
Real-ESRGAN aims at developing Practical Algorithms for General Image Restoration
Light Weight Toolkit for Bounding Boxes
With no prior knowledge of machine learning or device-specific deployment, you can deploy a computer vision model to a range of devices and environments using Roboflow Inference.
OpenMMLab Computer Vision Foundation
With no prior knowledge of machine learning or device-specific deployment, you can deploy a computer vision model to a range of devices and environments using Roboflow Inference.
Document Text Recognition (docTR): deep Learning for high-performance OCR on documents.
FiftyOne: the open-source tool for building high-quality datasets and computer vision models
SuperGradients
Industry-strength computer Vision extensions for Keras.
With no prior knowledge of machine learning or device-specific deployment, you can deploy a computer vision model to a range of devices and environments using Roboflow Inference CLI.
With no prior knowledge of machine learning or device-specific deployment, you can deploy a computer vision model to a range of devices and environments using Roboflow Inference.
OpenMMLab Computer Vision Foundation
Open MMLab Semantic Segmentation Toolbox and Benchmark
Pymba is a Python wrapper for Allied Vision's Vimba C API.
Savant Rust core functions library
Mahotas: Computer Vision Library
Provides spatial maths capability for Python.
OpenMMLab Image Classification Toolbox and Benchmark
Ultralytics HUB Client SDK.
A toolkit for making real world machine learning and data analysis applications
OpenMMLab Model Pretraining Toolbox and Benchmark
The realtime communication library for Python
OpenMMLab Pose Estimation Toolbox and Benchmark.
Catalyst. Accelerated deep learning R&D with PyTorch.
TorchSeg: Semantic Segmentation models for PyTorch
Savant Rust core functions library
OpenMMLab's next-generation platformfor general 3D object detection.
Automation with Computer Vision for Python
This is the module for detecting and classifying text on rama pictures
Industry-strength computer Vision extensions for Keras.