New Case Study:See how Anthropic automated 95% of dependency reviews with Socket.Learn More →

Sign in Demo Install

Sign in Demo Install

pypi

Categories
Utilities
Client & Server Utilities
Benchmarking

Benchmarking

torchbench

Easily benchmark Machine Learning models on selected tasks and datasets - with PyTorch

benchpress

Benchmark suite tool

pgbenchmark

A Python package to benchmark query performance and comparison on PostgreSQL Database

ogbench

OGBench: Benchmarking Offline Goal-Conditioned RL

enfobench

Energy forecast benchmarking toolkit.

evalsync

evalsync is a library used to synchronize applications under benchmark with an external manager

ycecream

ycecream - sweeter debugging and benchmarking

annb

A simple ANN benchmark tools

py-tgb

Temporal Graph Benchmark project repo

dlio-profiler-py

I/O profiler for deep learning python apps. Specifically for dlio_benchmark.

pytorch benchmark

imt-benchmark

High-level Interface to Inertial Motion Tracking

mmrotate

Rotation Detection Toolbox and Benchmark

computer vision

object detection

rotation detection

bmds

A Python interface for the USEPA Benchmark dose modeling software (BMDS)

perfbench

perfbench measures execution time of code snippets with Timeit and uses Plotly to visualize the results.

opensr-test

A comprehensive benchmark for real-world Sentinel-2 imagery super-resolution

nxbench

A centralized benchmarking suite to facilitate comparative profiling of tools across graph analytic libraries and datasets

bayeso-benchmarks

Benchmark functions for Bayesian optimization

sotabencheval

Easily benchmark Machine Learning models on selected tasks and datasets

cli-pybench

A pytest-like framework for benchmarking

gxabm

Opinionated Benchmarking Automatation in Galaxy

powerful-benchmarker

A PyTorch library for benchmarking deep metric learning. It's powerful.

pytorch-benchmark

Easily benchmark PyTorch model FLOPs, latency, throughput, max allocated memory and energy consumption in one go.

vidore-benchmark

Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.

evogym

Evolution Gym: A benchmark for developing and evaluating soft robot co-design algorithms.

cocopp

Benchmarking framework for all types of black-box optimization algorithms, postprocessing.

smallbench

Small Benchmarks for LM Agents

qtip

Platform Performance Benchmarking

relbench

RelBench: Relational Deep Learning Benchmark

pycachesim

Python Cache Hierarchy Simulator

hpc performance benchmark analysis

jsp-benchmarks

ジョブショップスケジューリング問題のベンチマーク問題をpythonのクラスで表現し，load出来るようにする

sdgym

Benchmark tabular synthetic data generators using a variety of datasets

machine learning

synthetic data generation

generative models

pyllms

Minimal Python library to connect to LLMs (OpenAI, Anthropic, Google, Mistral, OpenRouter, Reka, Groq, Together, Ollama, AI21, Cohere, Aleph-Alpha, HuggingfaceHub), with a built-in model performance benchmark.

large language model

natural language processing

clarifai-evals

Clarifai Evals is an SDK for evaluating AI models, providing a structured framework to benchmark model performance using predefined and custom evaluation templates.

flambe

Pytorch based library for robust prototyping, standardized benchmarking, and effortless experiment management

dynabench

Benchmark dataset for learning dynamical systems from data

welqrate

The official implementation of the WelQrate dataset and benchmark

evobench

Optimization benchmarks, both synthetic and practical.

vectordb-bench

VectorDBBench is not just an offering of benchmark results for mainstream vector databases and cloud services, it's your go-to tool for the ultimate performance and cost-effectiveness comparison. Designed with ease-of-use in mind, VectorDBBench is devised to help users, even non-professionals, reproduce results or test new systems, making the hunt for the optimal choice amongst a plethora of cloud services and open-source vector databases a breeze.

zntrack

Create, Run and Benchmark DVC Pipelines in Python

data-version-control

machine-learning

reproducibility

agent-harness

AI Maintainer Agent Harness for our benchmarking and Marketplace API and platform

pquisby

Quisby is a data processing and visualization tool for benchmark testing.

dragon-prep

Preprocessing scripts for the DRAGON benchmark

llm-bench

LLM Benchmarking tool for OLLAMA

teneva-bm

Benchmarks library, based on the package teneva, for testing multivariate approximation and optimization methods

benchmarks approximation optimization multidimensional array multivariate function low-rank representation tensor train format TT-decomposition

rl4co

RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark

combinatorial optimization

reinforcement learning

semilearn

Unfied Semi-Supervised Learning Benchmark

pytorch semi-supervised-learning

automlbench

A Python package for automated ML model benchmarking and comparison

clinicalomicsdb

Clinical Trial Omics Database for machine learning benchmarking

cmdbench

Quick and easy benchmarking for any command's CPU, memory, disk usage and runtime.

vbench

Video generation benchmark

Product

Package Alerts
Integrations
Docs
Pricing
FAQ
Roadmap
Changelog

About

About
Love
Blog
Glossary
Discord Community
CareersHiring
Send Feedback
Contact Us
System Status

Packages

npm

Directory
Explore
Random Package
Most Popular
Top Maintainers
Removed Packages

Go

Directory
Explore
Random Package

Maven

Directory
Explore
Random Package

PyPI

Directory
Explore
Random Package

Rubygems

Directory
Explore
Random Package

Stay in touch

Get open source security insights delivered straight into your inbox.

Enter your email

Terms
Privacy
Security

Made with ⚡️ by Socket Inc