A Python package for generating diverse and enriched image datasets using traditional, neural style transfer, and patch mixing augmentations.
A synthetic data generator for training OCR models
Extremely Fast and Robust Screen Capture on Windows with the Desktop Duplication API
PyTorch datasets with pre-computed model logits for efficient research
Remo, a webapp to manage datasets and annotations for Computer Vision
Extracts the Machine Readable Zone (MRZ) data from document images
Next-Generation Computer Vision for Robots.
A Python package for utilities and classes related to the file I/O, dataset record keeping and visualization for image processing and computer vision.
Toolkit for Video Understanding tasks
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox
GuruLearn is a comprehensive Python library that seamlessly integrates machine learning, computer vision, audio processing, and conversational AI capabilities in one package. Through six specialized modules (MLModelAnalysis, image classification, CTScanProcessor, AudioRecognition, FlowBot, and QAAgent), it empowers developers to build sophisticated AI solutions with minimal setup, accelerating the journey from prototype to production for data-driven applications across multiple domains.
McQuic, a.k.a. Multi-codebook Quantizers for neural image compression
Active learning for computer vision.
A package designed to predict static pose and detect falls with 2D RGB Camera in well lit indoor environments.
Ikomia Python API for Computer Vision workflow and plugin integration in Ikomia Studio
A comprehensive toolkit for computer vision and segmentation tasks
CLIP-powered multimodal image search engine with web interface and CLI tools
A Python package for converting 3D files to images.
Computer Vision and OCR library for detecting and analyzing UI elements
A deep learning approach for mapping and dating burned areas using temporal sequences of satellite images
Savant Rust core functions library
Python3 implementation of Computer Vision and Pattern Recognition library Balu
very nb computer vision
Fast, memory-efficient image tiling and reconstruction for deep learning and scientific computing
Image augmentation library for deep neural networks
A phenotyping pipeline for python
DataBouncer Python bindings
Tooling for Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans
Hệ thống trích xuất bảng, hàng, cột hoàn chỉnh với AI và GPU support
Native aimage library wrapper for internal use.
A deep learning experimentation toolbox
Optimize RandAugment with differentiable operations
Driver for Intel Realsense cameras.
Python tools for machine vision - education and research
ML/DL tools function library
Visual testing framework. Combining selenium and computer vision.
A computer vision library for 360-degree cameras
[DEPRECATED] Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
Image augmentation library for deep neural networks
Helpful functions for Computer Vision tasks
Industry-strength computer Vision extensions for Keras.
physics-inspired computer vision algorithms
High-level deep learning package for Object Detection API
MCP服务器:火山引擎图像编辑工具,提供显著性分割、背景移除等图像处理功能
SDK for the Reality Defender deepfake detection API
Automation of the creation of the architecture of the neural network based on the input
A Python Library for Computer Vision tasks like Object Detection, Segmentation, Pose Estimation etc