New Case Study:See how Anthropic automated 95% of dependency reviews with Socket.Learn More →

Sign in Demo Install

What is Socket?

Socket for GitHub

Detect suspicious packages in PRs

Socket CLI

Use Socket from the command line

Socket Web Extension

Use Socket from your browser

Socket Dependency Search

Find any package for your project

Socket Optimize

Optimize your dependencies

Integrations

All Integrations

Ticketing & Messaging

Package Managers

Languages

Socket for Java

Socket for JavaScript

Socket for Python

Socket for Ruby

Docs

Want to read all the docs? Start here

Customers

Check out our customer stories

Blog

Keep up to date with all the news

Changelog

Latest updates and enhancements

FAQ

Answers to common questions

Package Alerts

Learn about all Socket alerts

Glossary

Open source and security terms

Blog

Application Security

Customer Stories

About

Why we built Socket

Love

See why developers love Socket

Careers

Join our team

Investors

Learn about our investors

Security

Our security practices

Why Socket?

Socket vs Dependabot

Socket vs Semgrep

Socket vs EndorLabs

Socket for Open Source Security

Socket for Supply Chain Attack Prevention

Achievements

Fortune Cyber 60

Pricing Love Docs

Sign in Demo Install

pypi
Categories
Utilities
Client & Server Utilities
Benchmarking

Benchmarking

opensearch-benchmark

Macrobenchmarking framework for OpenSearch

pytest-speed

Modern benchmarking library for python with pytest integration.

bark-simulator

A tool for Behavior benchmARKing

simulator autonomous driving machine learning

resp-benchmark

resp-benchmark is a benchmark tool for testing databases that support the RESP protocol, such as Redis, Valkey, and Tair.

silero

Silero Models: pre-trained enterprise-grade STT / TTS models and benchmarks.

mlbench-core

A public and reproducible collection of reference implementations and benchmark suite for distributed machine learning systems.

mlbench

gitlabcis

An automated tool that assesses the GitLab CIS benchmarks against a project.

GitLab Benchmarks
CIS Benchmarks
GitLab Hardening
GitLab Recommendations
GitLabcis
GitLab CIS Benchmarks

collie-bench

Official Implementation of "COLLIE: Systematic Construction of Constrained Text Generation Tasks"

large language model
llm
constrained generation
benchmark

jhub-client

Library and Client for managing, benchmarking, and interacting with jupyterhub

jupyterhub
jupyter
benchmark

folktables

New machine learning benchmarks from tabular datasets.

mani-skill

ManiSkill3: A Unified Benchmark for Generalizable Manipulation Skills

example-robot-data

Set of robot URDFs for benchmarking and developed examples.

moabb

Mother of All BCI Benchmarks

eeg
datasets
reproducibility
bci
benchmark

holobench

A package for benchmarking the performance of arbitrary functions

pycompliance

Library for working with compliance benchmarks and data.

evalsync

evalsync is a library used to synchronize applications under benchmark with an external manager

wilds

WILDS distribution shift benchmark

alma-torch

A package for benchmarking the speed of different PyTorch conversion options

fmbench

Benchmark performance of **any Foundation Model (FM)** deployed on **any AWS Generative AI service**, be it **Amazon SageMaker**, **Amazon Bedrock**, **Amazon EKS**, or **Amazon EC2**. The FMs could be deployed on these platforms either directly through `FMbench`, or, if they are already deployed then also they could be benchmarked through the **Bring your own endpoint** mode supported by `FMBench`.

benchmarking
sagemaker
bedrock
bring your own endpoint
generative-ai
foundation-models

anadroid

PyAnaDroid: A replicable, fully-customizable execution pipeline foranalyzing and benchmarking Android Applications

actbench

A framework for evaluating web automation agents and LAM systems.

AI
LAM systems
agent evaluation
benchmarking
web automation

tinytimer

Tiny Python benchmarking library

benchadapt

Adapters for Running and Tracking Benchmarks

kerncraft

Loop Kernel Analysis and Performance Modeling Toolkit

hpc performance benchmark analysis

weblinx-browsergym

BrowserGym integration for the WebLINX benchmark

conbench

Continuous Benchmarking (CB) Framework

appworld

AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents

ai-agents
ai-assistants
ai-planning
autonomous-agents
ai-environment
tool-usage

mmaction2

OpenMMLab Video Understanding Toolbox and Benchmark

computer vision
video understanding

nlp4bia

Download NLP4BIA benchmarks and load datasets in their format

feed
reader
tutorial

sm-serverless-benchmarking

Benchmark sagemaker serverless endpoints for cost and performance

sagemaker
inference
hosting

bench-it

Quick and easy python benchmarking.

iqm-benchmarks

A package for implementation of Quantum Characterization, Verification and Validation (QCVV) techniques on IQM's hardware at gate level abstraction

feelpp-benchmarking

The Feel++ Benchmarking Project

ms-opencompass

A lightweight toolkit for evaluating LLMs based on OpenCompass.

AI
NLP
in-context learning
large language model
evaluation
benchmark

agbenchmark

Benchmarking the performance of agents far and wide, regardless of how they are set up and how they work

adbench

Python package of ADBench

anomaly detection
outlier detection
tabular data
benchmark

redis-benchmarks-specification

The Redis benchmarks specification describes the cross-language/tools requirements and expectations to foster performance and observability standards around redis related technologies. Members from both industry and academia, including organizations and individuals are encouraged to contribute.

catbench

CatBench: Benchmark of Machine Learning Potentials for Adsorption Energy Predictions in Heterogeneous Catalysis

MLP benchmarking for catalysis

coderdata

A package to download, load, and process multiple benchmark multi-omic drug response datasets

global-benchmark-database-tool

Superseded by: gbd-tools

trulens-benchmark

Library to systematically track and evaluate LLM based applications.

tape-proteins

Repostory of Protein Benchmarking and Modeling

benchmark-adv-ml

Advanced benchmarking for machine learning models.

peek-python

peek - debugging and benchmarking made easy

llm-benchmark

LLM Benchmark

banana-hep

Benchmark QCD physics

benchmark-4dn

Benchmark functions that returns total space, mem, cpu given input size and parameters for the CWL workflows

benchmark
cwl
common workflow language
docker
tibanna
bioinformatics

cai-benchmarking

Causal AI Benchmarking Framework

efaar-benchmarking

efaar_benchmarking

benchnirs

Benchmarking framework for machine learning with fNIRS

Product

Package Alerts
Integrations
Docs
Pricing
FAQ
Roadmap
Changelog

About

About
Love
Blog
Glossary
Discord Community
CareersHiring
Send Feedback
Contact Us
System Status

Packages

npm

Directory
Explore
Random Package
Most Popular
Top Maintainers
Removed Packages

Go

Directory
Explore
Random Package

Maven

Directory
Explore
Random Package

PyPI

Directory
Explore
Random Package

Rubygems

Directory
Explore
Random Package

Stay in touch

Get open source security insights delivered straight into your inbox.

Enter your email

Terms
Privacy
Security

Made with ⚡️ by Socket Inc