User-friendly evaluation framework with an eval suite and benchmarks: UHGEval, HaluEval, HalluQA, etc.
PMML evaluator library for Python
Python package for evaluating neuron segmentations in terms of the number of splits and merges
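For context, split/merge counting of this kind can be sketched with a label-overlap table in plain NumPy; this is a minimal illustration of the idea, not this package's API, and the overlap-table approach is an assumption:

    import numpy as np

    def count_splits_merges(gt, pred):
        """gt, pred: integer label arrays of the same shape; 0 = background."""
        mask = (gt > 0) & (pred > 0)
        pairs = np.stack([gt[mask], pred[mask]], axis=1)
        pairs = np.unique(pairs, axis=0)       # one row per overlapping (gt, pred) pair
        # A ground-truth segment overlapped by >1 prediction has been split;
        # a predicted segment overlapping >1 ground-truth segment merges them.
        splits = sum(n - 1 for n in np.unique(pairs[:, 0], return_counts=True)[1])
        merges = sum(n - 1 for n in np.unique(pairs[:, 1], return_counts=True)[1])
        return splits, merges

    gt   = np.array([[1, 1, 2, 2]])
    pred = np.array([[1, 3, 3, 3]])            # segment 1 is split; 1 and 2 are merged
    print(count_splits_merges(gt, pred))       # -> (1, 1)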
Easily computing clip embeddings and building a clip retrieval system with them
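Computing CLIP embeddings generally looks like the sketch below, here using the separate open_clip package rather than this tool's own interface; the model name, pretrained tag, and file path are illustrative:

    import torch
    import open_clip
    from PIL import Image

    model, _, preprocess = open_clip.create_model_and_transforms("ViT-B-32", pretrained="openai")
    tokenizer = open_clip.get_tokenizer("ViT-B-32")

    with torch.no_grad():
        image = preprocess(Image.open("photo.jpg")).unsqueeze(0)   # illustrative path
        image_emb = model.encode_image(image)
        text_emb = model.encode_text(tokenizer(["a photo of a dog"]))
        # Normalize so the dot product is a cosine similarity, the usual
        # ranking score in a CLIP retrieval index.
        image_emb /= image_emb.norm(dim=-1, keepdim=True)
        text_emb /= text_emb.norm(dim=-1, keepdim=True)
        print((image_emb @ text_emb.T).item())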
Bundle of Perceval backends for the Mozilla ecosystem.
Discover and retrieve water data from U.S. federal hydrologic web services.
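A hedged sketch of what such a retrieval can look like when hitting the USGS NWIS daily-values service directly over HTTP; the site and parameter codes are illustrative, and this bypasses the package's own helpers:

    import requests

    resp = requests.get(
        "https://waterservices.usgs.gov/nwis/dv/",
        params={"format": "json", "sites": "01646500",   # illustrative gauge site
                "parameterCd": "00060",                  # discharge, cubic feet/sec
                "period": "P7D"},
        timeout=30,
    )
    series = resp.json()["value"]["timeSeries"]
    print(len(series), "time series returned")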
Generate eval datasets from arbitrary sources
In-loop evaluation tasks for language modeling
Bundle of Perceval backends for the OPNFV ecosystem.
Bundle of Perceval backends for Weblate.
A lightweight and configurable evaluation package
Python code evaluation system and submissions server capable of unit tests, tracing, and AST inspection. The server can run on Python 2.7, but evaluation requires Python 3.7+.
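AST inspection of a submission can be sketched with the standard library alone; the banned-name check below is a hypothetical policy, not this system's actual rule set:

    import ast

    source = "result = eval(input())"
    tree = ast.parse(source)
    # Collect every bare-name call to eval() or exec() in the submission.
    banned = {
        node.func.id
        for node in ast.walk(tree)
        if isinstance(node, ast.Call)
        and isinstance(node.func, ast.Name)
        and node.func.id in {"eval", "exec"}
    }
    print(banned)  # {'eval'}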
Perceval backend for Topicbox
Perceval backend for public-inbox.
Bundle of Perceval backends for the Puppet, Inc. ecosystem.
A flexible, generalized tree-based data structure.
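A generalized tree usually reduces to a node holding a value and an arbitrary list of children; a minimal sketch where the class and method names are illustrative, not this package's API:

    class Node:
        def __init__(self, value, children=None):
            self.value = value
            self.children = list(children or [])

        def walk(self):
            """Yield values in depth-first preorder."""
            yield self.value
            for child in self.children:
                yield from child.walk()

    tree = Node("root", [Node("a", [Node("a1")]), Node("b")])
    print(list(tree.walk()))  # ['root', 'a', 'a1', 'b']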
Open-Source Evaluation for GenAI Applications.
A pytest plugin for running and analyzing LLM evaluation tests
Windows-compatible fork of OpenAI's human-eval
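Usage presumably follows upstream human-eval; a sketch under that assumption, with a trivial stand-in completion string:

    from human_eval.data import read_problems, write_jsonl

    problems = read_problems()
    samples = [
        {"task_id": task_id, "completion": "    return 1\n"}  # stand-in completion
        for task_id in list(problems)[:5]
    ]
    write_jsonl("samples.jsonl", samples)
    # Then score with the provided CLI: evaluate_functional_correctness samples.jsonl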
A module for evaluating language models during training
Perceval backend for Pontoon
Well-tested evaluation framework for text summarization
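Summarization evaluation typically centers on metrics like ROUGE; a sketch using the separate rouge_score package, not this framework's own API:

    from rouge_score import rouge_scorer

    scorer = rouge_scorer.RougeScorer(["rouge1", "rougeL"], use_stemmer=True)
    scores = scorer.score(
        "the cat sat on the mat",        # reference
        "a cat was sitting on the mat",  # candidate summary
    )
    print(scores["rougeL"].fmeasure)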
A framework for evaluating large multi-modality language models
scikit-learn model evaluation made easy: plots, tables and markdown reports.
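The underlying numbers come from scikit-learn itself; a minimal sketch of the kind of table such a package wraps into plots and reports:

    from sklearn.datasets import load_iris
    from sklearn.linear_model import LogisticRegression
    from sklearn.metrics import classification_report
    from sklearn.model_selection import train_test_split

    X_train, X_test, y_train, y_test = train_test_split(
        *load_iris(return_X_y=True), random_state=0
    )
    model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
    # Per-class precision/recall/F1 as a plain-text table.
    print(classification_report(y_test, model.predict(X_test)))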
DataEval provides a simple interface to characterize image data and its impact on model performance across classification and object-detection tasks
clusteval is a Python package for unsupervised cluster validation.
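Unsupervised cluster validation typically scores candidate cluster counts with an internal index; a sketch using scikit-learn's silhouette score rather than clusteval's own API:

    from sklearn.cluster import KMeans
    from sklearn.datasets import make_blobs
    from sklearn.metrics import silhouette_score

    X, _ = make_blobs(n_samples=300, centers=4, random_state=0)
    # Higher silhouette = better-separated clusters; k=4 should win here.
    for k in range(2, 7):
        labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X)
        print(k, round(silhouette_score(X, labels), 3))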
A Python module to aid automatic evaluation
Your go-to, no-fuss tool for zapping through RAG and LLM evaluations, built for the busy person too swamped to wrestle with heavyweight professional setups
SimulEval: A Flexible Toolkit for Automatic Evaluation of Simultaneous Translation
Model evaluation for Machine Learning pipelines.
Bucketed Scene Flow Evaluation
Evaluation framework for DataBench
Wrapper around ast.literal_eval with new {foo='bar', key=None} dict syntax.
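One way such a wrapper can work is to rewrite the keyword-style braces into a dict(...) call before parsing; a hypothetical sketch where literal_eval_kw and its rewriting rule are assumptions, not this package's implementation:

    import ast

    def literal_eval_kw(text):
        """Accept {foo='bar', key=None} by rewriting it to a dict(...) call
        and evaluating each keyword value as a plain literal."""
        text = text.strip()
        if text.startswith("{") and "=" in text:
            call = ast.parse("dict(" + text[1:-1] + ")", mode="eval").body
            return {kw.arg: ast.literal_eval(kw.value) for kw in call.keywords}
        return ast.literal_eval(text)  # ordinary literals pass through unchanged

    print(literal_eval_kw("{foo='bar', key=None}"))  # {'foo': 'bar', 'key': None}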
A package to evaluate how close a synthetic data set is to real data.
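A common building block for such comparisons is a per-column distance between marginal distributions; a sketch using SciPy's two-sample Kolmogorov-Smirnov test, one possible metric rather than necessarily this package's:

    import numpy as np
    from scipy.stats import ks_2samp

    rng = np.random.default_rng(0)
    real = rng.normal(0, 1, 1000)
    synthetic = rng.normal(0.1, 1.1, 1000)   # slightly off-distribution
    stat, p = ks_2samp(real, synthetic)
    print(f"KS statistic {stat:.3f} (0 = identical marginals)")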
The University of Saskatchewan Retrieval Framework
Query metadata from sdists / bdists / installed packages. A safer fork of pkginfo that avoids arbitrary imports and eval()
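For installed packages, the standard library can already do this safely; a sketch with importlib.metadata, which reads the distribution's metadata files without importing the package or calling eval():

    from importlib import metadata

    dist = metadata.distribution("pip")  # any installed distribution name works here
    print(dist.metadata["Name"], dist.version)
    print(dist.metadata["Summary"])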
OpenCompass VLM Evaluation Kit for Eval-Scope
Evaluation module for Ragbits components
classeval: a Python package for evaluating classification model performance
Core ARL data model library
A framework for the evaluation and development of temporally aware models.
Document search and indexing based on summaries and embeddings, using pgvector
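A nearest-neighbor lookup over stored embeddings with pgvector can be sketched as below; the table layout, connection string, and 384-dimension embedding are assumptions:

    import numpy as np
    import psycopg
    from pgvector.psycopg import register_vector

    # Hypothetical table: documents(id, summary, embedding vector(384)).
    query_emb = np.random.rand(384).astype(np.float32)  # stand-in for a real embedding
    with psycopg.connect("dbname=docs") as conn:
        register_vector(conn)  # lets psycopg send numpy arrays as pgvector values
        rows = conn.execute(
            "SELECT id, summary FROM documents ORDER BY embedding <-> %s LIMIT 5",
            (query_emb,),
        ).fetchall()           # five closest documents by L2 distance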
A simple, safe single expression evaluator library.
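The usual approach is to parse the expression with ast and interpret only a whitelist of node types; a minimal arithmetic-only sketch, noting this library's supported grammar may differ:

    import ast
    import operator

    OPS = {ast.Add: operator.add, ast.Sub: operator.sub,
           ast.Mult: operator.mul, ast.Div: operator.truediv}

    def safe_eval(expr):
        """Evaluate a single arithmetic expression without eval()."""
        def walk(node):
            if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
                return node.value
            if isinstance(node, ast.BinOp) and type(node.op) in OPS:
                return OPS[type(node.op)](walk(node.left), walk(node.right))
            raise ValueError(f"disallowed syntax: {ast.dump(node)}")
        return walk(ast.parse(expr, mode="eval").body)

    print(safe_eval("2 * (3 + 4.5)"))  # 15.0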
HydroEval: An Evaluator for Streamflow Time Series in Python
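A representative streamflow metric is the Nash-Sutcliffe efficiency; a plain NumPy sketch of the formula, without showing hydroeval's own call signature:

    import numpy as np

    def nse(simulated, observed):
        """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the mean flow."""
        observed = np.asarray(observed, dtype=float)
        simulated = np.asarray(simulated, dtype=float)
        return 1 - np.sum((observed - simulated) ** 2) / np.sum(
            (observed - observed.mean()) ** 2
        )

    print(nse([4.9, 5.2, 6.1], [5.0, 5.0, 6.0]))  # 0.91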
PICAI Evaluation