Guidance platform for deploying and managing large language models.
🦜💪 Flex those feathers!
Pyinfer is a model agnostic Python utility tool for ML developers and researchers to benchmark model inference statistics.
Create charts from FIO storage benchmark tool output
Unfied Semi-Supervised Learning Benchmark
I/O profiler for deep learning python apps. Specifically for dlio_benchmark.
RAG Evaluation Benchmark and Evaluation Library
A library for training and benchmarking RL agents on graphs.
Benchmarking Likelihood Ratio systems
A benchmarking suite for robust audio watermarking.
SIBERIA (SIgned BEnchmarks foR tIme series Analysis) provides maximum-entropy null models and validation methods for signed networks derived from time series.
Benchmarks library, based on the package teneva, for testing multivariate approximation and optimization methods
A GEMSEO-based package to benchmark optimization algorithms.
OpenMMLab Model Pretraining Toolbox and Benchmark
Benchmark orchestrator for the xemu original Microsoft Xbox emulator
Benchmarking library for generative algorithms
Open source remote sensing dataset with benchmarks
llama-index packs rag_evaluator integration
Optimum-Benchmark is a unified multi-backend utility for benchmarking Transformers, Timm, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
Easily benchmark PyTorch model FLOPs, latency, throughput, max allocated memory and energy consumption in one go.
Python Interface to lfk-mp-benchmark
A Python interface for the USEPA Benchmark dose modeling software (BMDS)
Temporal Graph Benchmark project repo
A simple ANN benchmark tools
PyAnaDroid: A replicable, fully-customizable execution pipeline foranalyzing and benchmarking Android Applications
Benchmark functions for Bayesian optimization
Benchmark Python Scripts
A comprehensive, open-source LLM evaluation framework for testing and benchmarking AI models
A simple benchmarking library
Evolution Gym: A benchmark for developing and evaluating soft robot co-design algorithms.
Fibber is a benchmarking suite for adversarial attacks on text classification.
Packaged data modules for multiview learning benchmarks
Pytorch based library for robust prototyping, standardized benchmarking, and effortless experiment management
Advanced benchmarking for machine learning models.
Easily benchmark Machine Learning models on selected tasks and datasets - with PyTorch
Python Optimization Benchmarking Functions
a python package to benchmarks algorithms against various datasets
Use LLMs to get classification risk scores on tabular tasks.
Benchmark Tool for CaddoBenchmark Project
Python Cache Hierarchy Simulator
coNstructiOn pRoject Manager