Huge News!Announcing our $40M Series B led by Abstract Ventures.Learn More
Socket
Sign inDemoInstall
Socket

github.com/semi-technologies/weaviate-benchmarking

Package Overview
Dependencies
Alerts
File Explorer
Socket logo

Install Socket

Detect and block malicious and high-risk dependencies

Install

github.com/semi-technologies/weaviate-benchmarking

  • v2.0.3+incompatible
  • Source
  • Go
  • Socket score

Version published
Created
Source

Weaviate Benchmarking

This repo contains a tool for benchmarking Weaviate performance.

Documentation for benchmarker

  • 📊 results and context can be found in the Weaviate documentation
  • 💬 discuss the results on our Slack channel or Twitter

ANN benchmark

There are two components you will need to run for the benchmarks:

  1. weaviate the standard Weaviate image
  2. benchmarker a go based benchmarking tool

You can run both as containers on the same machine via Docker compose.

For replicating our benchmarks we recommend setting up two separate machines.

Machine descriptionCPU typeCPUsMemoryDisk sizeDisk typeMisc.
Machine to run Weaviatec230120GB500GBSSDUbuntu 22.04 with Docker-compose
Machine to run benchmark scriptN2864GB500GBSSDUbuntu 22.04 with Docker-compose
Prepare the Weaviate machine

Clone this repo and cd into it $ git clone https://github.com/semi-technologies/weaviate-benchmarking && cd weaviate-benchmarking

Run the following command to spin up Weaviate: $ docker-compose up weaviate -d

Copy the IP address and amount of CPU cores this machine has.

Prepare the benchmark machine

Check if the Weaviate machine is available: curl http://{IP OF WEAVIATE INSTANCE}/v1/meta. Note that the instance runs on port 8080, e.g., http://10.128.15.12:8080/v1/meta. You will also need to allow port 50051 for gRPC and 21121 for metrics. You can verify this via nc -zv localhost 50051.

Clone this repo and cd into it $ git clone https://github.com/semi-technologies/weaviate-benchmarking && cd weaviate-benchmarking

Download the files into a datasets folder as outlined below.

mkdir datasets && \
    curl -o ./datasets/dbpedia-100k-openai-ada002-angular.hdf5 https://storage.googleapis.com/ann-datasets/custom/dbpedia-100k-openai-ada002-angular.hdf5 \
    curl -o ./datasets/deep-image-96-angular.hdf5 https://ann-benchmarks.com/deep-image-96-angular.hdf5 && \
    curl -o ./datasets/mnist-784-euclidean.hdf5 https://ann-benchmarks.com/mnist-784-euclidean.hdf5 && \
    curl -o ./datasets/gist-960-euclidean.hdf5 https://ann-benchmarks.com/gist-960-euclidean.hdf5

Run a single performance test on an ann-benchmarks hdf5 dataset.

DATASET=./datasets/dbpedia-100k-openai-ada002-angular.hdf5 DISTANCE=cosine docker compose run benchmarker

For more details on additional configuration options see the help options.

docker compose run benchmarker /app/benchmarker ann-benchmark -h

FAQs

Package last updated on 28 Aug 2024

Did you know?

Socket

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

Related posts

SocketSocket SOC 2 Logo

Product

  • Package Alerts
  • Integrations
  • Docs
  • Pricing
  • FAQ
  • Roadmap
  • Changelog

Packages

npm

Stay in touch

Get open source security insights delivered straight into your inbox.


  • Terms
  • Privacy
  • Security

Made with ⚡️ by Socket Inc