You're Invited:Meet the Socket Team at BlackHat and DEF CON in Las Vegas, Aug 4-6.RSVP →

Book a Demo Install Sign in

Book a Demo Install Sign in

pypi

Categories
Utilities
Client & Server Utilities
Benchmarking

Benchmarking

benchmarking

Simple benchmark framework (in active development)

pytest-benchmark

A ``pytest`` fixture for benchmarking code. It will group the tests into rounds that are calibrated to the chosen timer.

pytest-codspeed

Pytest plugin to create CodSpeed benchmarks

ir-datasets

provides a common interface to many IR ad-hoc ranking benchmarks, training datasets, etc.

mmdet

OpenMMLab Detection Toolbox and Benchmark

computer vision

object detection

asv

Airspeed Velocity: A simple Python history benchmarking tool

benchmarking-asv

pytest-harvest

Store data created during your pytest tests execution, and retrieve it at the end of the session, e.g. for applicative benchmarking purposes.

pytest test result store fixture collect benchmark artifact session data dataframe

rdt

Reversible Data Transforms

machine learning

synthetic data generation

generative models

pyperf

Python module to run and analyze benchmarks

motmetrics

Metrics for multiple object tracker benchmarking.

tracker MOT evaluation metrics compare

benchmarking-qrc

Benchmarking QRC measures the ability to store information of

mteb

Massive Text Embedding Benchmark

text embeddings

swebench

The official SWE-bench package - a benchmark for evaluating LMs on software engineering

benchpots

A Python Toolbox for Benchmarking Machine Learning on Partially-Observed Time Series

neural networks

machine learning

artificial intelligence

collie-bench

Official Implementation of "COLLIE: Systematic Construction of Constrained Text Generation Tasks"

large language model

constrained generation

pyaf

Python Automatic Forecasting

arx automatic-forecasting autoregressive benchmark cycle decomposition exogenous forecasting heroku hierarchical-forecasting horizon jupyter pandas python scikit-learn seasonal time-series transformation trend web-service

apebench

Benchmark suite for Autoregressive Neural Emulators of PDEs in JAX.

neural operator

mmsegmentation

Open MMLab Semantic Segmentation Toolbox and Benchmark

computer vision

semantic segmentation

benchmark-runner

Benchmark Runner Tool

mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

computer vision

pose estimation

resp-benchmark

resp-benchmark is a benchmark tool for testing databases that support the RESP protocol, such as Redis, Valkey, and Tair.

pmlb

A Python wrapper for the Penn Machine Learning Benchmark data repository.

machine learning

salesforce-merlion

Merlion: A Machine Learning Framework for Time Series Intelligence

anomaly detection

machine learning

ensemble learning

beir

A Heterogeneous Benchmark for Information Retrieval

Evaluation Framework

Information Retrieval

Transformer Networks

Large Language Models

ogb

Open Graph Benchmark

graph machine learning

graph representation learning

graph neural networks

browsergym-webarena

WebArena benchmark for BrowserGym

mmcls

OpenMMLab Image Classification Toolbox and Benchmark

computer vision

image classification

m24842-ml

Collection of ML models and benchmarking tools

browsergym-miniwob

MiniWoB++ benchmark for BrowserGym

prediction-market-agent-tooling

Tools to benchmark, deploy and monitor prediction market agents.

bench-it

Quick and easy python benchmarking.

fuzzydata

Fuzzy Data Benchmark

silero

Silero Models: pre-trained enterprise-grade STT / TTS models and benchmarks.

pytabkit

ML models + benchmark for tabular data classification and regression

gradient boosting

mmpretrain

OpenMMLab Model Pretraining Toolbox and Benchmark

computer vision

image classification

unsupervised learning

self-supervised learning

benchmarkfcns

A high-performant C++ implementation of benchmark functions for mathematical optimization algorithms.

browsergym-workarena

WorkArena benchmark for BrowserGym

browsergym-assistantbench

AssistantBench benchmark for BrowserGym

libwebarena

This is an unofficial, use-at-your-own risks port of the webarena benchmark, for use as a standalone library package.

browsergym-visualwebarena

VisualWebArena benchmark for BrowserGym

pytest-speed

Modern benchmarking library for python with pytest integration.

lab

Benchmark your code

benchmarks cluster grid

weblinx-browsergym

BrowserGym integration for the WebLINX benchmark

google-benchmark

A library to benchmark code snippets.

libvisualwebarena

This is an unofficial, use-at-your-own risks port of the visualwebarena benchmark, for use as a standalone library package.

mlbench-core

A public and reproducible collection of reference implementations and benchmark suite for distributed machine learning systems.

clip-benchmark

CLIP-like models benchmarks on various datasets

robosuite

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

scikit-datasets

Scikit-learn-compatible datasets

Product

Package Alerts
Integrations
Docs
Pricing
FAQ
Roadmap
Changelog

About

About
Love
Blog
Glossary
CareersHiring
Send Feedback
Contact Us
System Status

Packages

Explore Rubygems

Stay in touch

Get open source security insights delivered straight into your inbox.

Enter your email

Terms
Privacy
Security

Made with ⚡️ by Socket Inc

U.S. Patent No. 12,346,443 & 12,314,394. Other pending.