Open Source Architecture Code Analyzer
Benchmarking imputation methods for microdata
Benchmarking framework for machine learning with fNIRS
A comprehensive benchmark and code base for Image manipulation and localization.
Scandinavian Embedding Benchmark
pygmtools provides graph matching solvers through a Python API and supports NumPy and PyTorch backends. pygmtools also provides a dataset API for standard graph matching benchmarks.
CLI extension for AEA framework benchmarking.
LLM Benchmark
Guacamol: benchmarks for de novo molecular design
Count floating-point operations in Python code & benchmark relative flop costs.
Benchmark functions for Bayesian optimization
RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark
A library for training and benchmarking RL agents on graphs.
PyAnaDroid: A replicable, fully-customizable execution pipeline for analyzing and benchmarking Android Applications
Python package of ADBench
Video generation benchmark
Benchmark tools for robotframework
A simple tool for benchmarking and tracking machine learning models and experiments.
Fix Inventory Compliance Benchmarks and Checks
Reproducible and Reusable Data Analysis Workflow Server (Core Infrastructure)
Benchmark toolkit for optimization
Advanced benchmarking for machine learning models.
I/O profiler for deep learning Python apps, specifically for dlio_benchmark.
Models and utilities for event-based depth / segmentation (Surreal benchmark).
rliable: Reliable evaluation on reinforcement learning and machine learning benchmarks.
A framework for developing and benchmarking AI agents using Model Context Protocol (MCP)
AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents
CatBench: Benchmark Framework of Machine Learning Interatomic Potentials for Adsorption Energy Predictions in Heterogeneous Catalysis
Simple probabilistic time series benchmark models
Unified Semi-Supervised Learning Benchmark
Adapters for Running and Tracking Benchmarks
OmniGenBench: A comprehensive toolkit for genome analysis benchmarking.
A tool for automated scientific benchmarking
A test bench to benchmark learning algorithms for graphical models
Benchmark suite tool
Data compression methods and benchmarks for Cli/Met datasets
ycecream - sweeter debugging and benchmarking
ManiSkill2: A Unified Benchmark for Generalizable Manipulation Skills
This tool safely and securely analyzes applications for benchmarking.
Causal AI Benchmarking Framework
A package for submitting benchmarking scripts on OSCAR.
Open source remote sensing dataset with benchmarks
Benchmark tabular synthetic data generators using a variety of datasets
Python Cache Hierarchy Simulator
A multi-modal Python library for benchmarking Azure lakehouse engines and ELT scenarios, supporting both industry-standard and novel benchmarks.
Easily benchmark Machine Learning models on selected tasks and datasets
Fibber is a benchmarking suite for adversarial attacks on text classification.
A package for benchmarking time series machine learning tools.