Evaluate Digitalization Data
The mir_eval package ported to PyTorch
rank_eval: A Blazing Fast Python Library for Ranking Evaluation and Comparison
This package is written for the evaluation of audio generation models.
An easy-to-use collection of speech enhancement measures
Python SDK to configure and run evaluations for your LLM-based application
ADC Evaluation Library
Time Series analysis and evaluation tools
TTS Evaluation
This package provides measurement tools for Generative Adversarial Networks (GANs), including Inception Score (IS), Fréchet Inception Distance (FID), Kernel Inception Distance (KID), and Precision and Recall (PR). These metrics are used to evaluate the quality and diversity of generated images in GANs. The package streamlines the use of these metrics, making it easier to apply them to your work.
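For orientation, here is a minimal sketch of the FID computation such metrics packages implement, assuming `real_feats` and `fake_feats` are pre-extracted (N, D) matrices of Inception activations; the function name and inputs are illustrative, not this package's API:

```python
import numpy as np
from scipy import linalg

def frechet_distance(real_feats: np.ndarray, fake_feats: np.ndarray) -> float:
    # Fit a Gaussian to each feature set: mean vector and covariance matrix.
    mu_r, mu_g = real_feats.mean(axis=0), fake_feats.mean(axis=0)
    cov_r = np.cov(real_feats, rowvar=False)
    cov_g = np.cov(fake_feats, rowvar=False)
    # Matrix square root of the covariance product; drop tiny imaginary
    # parts introduced by numerical error.
    covmean = linalg.sqrtm(cov_r @ cov_g)
    if np.iscomplexobj(covmean):
        covmean = covmean.real
    diff = mu_r - mu_g
    # FID = ||mu_r - mu_g||^2 + Tr(cov_r + cov_g - 2 * sqrt(cov_r cov_g))
    return float(diff @ diff + np.trace(cov_r + cov_g - 2.0 * covmean))

rng = np.random.default_rng(0)
real = rng.normal(size=(512, 64))           # stand-in for real-image features
fake = rng.normal(loc=0.1, size=(512, 64))  # stand-in for generated features
print(frechet_distance(real, fake))
```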
A user-friendly feature evaluation and selection package.
Evaluation method for the DRAGON benchmark
A plug-and-play evaluator for self-supervised image classification.
Library for evaluating SafeGraph data
A PEP 523 compatible frame evaluator
A RAG evaluation framework
Embodied agent interface evaluation for VirtualHome
Evaluation of the Epic Cardio biosensor, integrated into napari
Interpretable Evaluation for Natural Language Processing
Vital Agent Eval Env
Prints entries from JSONL files; developed with LLM evals in mind.
Tool for safe (or less safe) evaluation of strings as math expressions
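A minimal sketch of the usual "safe" approach behind such tools: parse the string into an AST and walk only a whitelist of node types. This is illustrative of the technique, not this package's API:

```python
import ast
import operator

# Whitelisted operators; anything else in the expression is rejected.
_OPS = {
    ast.Add: operator.add, ast.Sub: operator.sub,
    ast.Mult: operator.mul, ast.Div: operator.truediv,
    ast.Pow: operator.pow, ast.USub: operator.neg,
}

def safe_eval(expr: str) -> float:
    def _eval(node):
        if isinstance(node, ast.Expression):
            return _eval(node.body)
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](_eval(node.left), _eval(node.right))
        if isinstance(node, ast.UnaryOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](_eval(node.operand))
        raise ValueError(f"disallowed syntax: {ast.dump(node)}")
    return _eval(ast.parse(expr, mode="eval"))

print(safe_eval("2 ** 8 + (3 - 1) * 4"))  # 264
```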
Evaluate arbitrary JavaScript from Python using a NodeJS sidecar
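The sidecar idea can be sketched in a few lines: hand the expression to a `node` child process and read the result back as JSON. This assumes a `node` binary on PATH; the `eval_js` helper is hypothetical, not this package's protocol:

```python
import json
import subprocess

def eval_js(expression: str) -> object:
    # Ask node to evaluate the expression and print the result as JSON;
    # json.dumps() turns the Python string into a valid JS string literal.
    script = f"console.log(JSON.stringify(eval({json.dumps(expression)})))"
    out = subprocess.run(
        ["node", "-e", script],
        capture_output=True, text=True, check=True,
    )
    return json.loads(out.stdout)

print(eval_js("[1, 2, 3].map(x => x * 2)"))  # [2, 4, 6]
```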
LLM Application Debug/Eval UI on top of AIConfig
A visualization package for model score evaluation.
Agiflow (EVAL) for Python
TestScript FHIR resource evaluator
AI Maintainer Agent Harness for our benchmarking and Marketplace API and platform
The evaluation component of the sci-annot framework
Topic Modeling Evaluation
Supplementary code and materials for paper "On Model Evaluation under Non-constant Class Imbalance" by Brabec et al.
A library for evaluating image generation models
Safely execute expression code.
A Jinja2 extension for getting the result of eval().
An Extendable Evaluation Pipeline for Named Entity Drill-Down Analysis
A placeholder package to reserve the name llms.
An ASR evaluation tool targeting Brazilian Portuguese sentences.
Quickly evaluate multi-label classifiers with various metrics
Calculations with physical quantities
Library for evaluating RAG using Nuclia's models
A Comprehensive Platform for Automated Testing and Analysis of Supervised Machine Learning Tasks.
aigc_evals
Dashboard for Quality-driven NER.
A package with utility functions for evaluating conformal predictors
This repository provides a small Python wrapper for the MATLAB tool SNR Eval provided by LabROSA: https://labrosa.ee.columbia.edu/projects/snreval
A Python library for Multi-Agent Reinforcement Learning evaluation.
Eval