Recursive subset comparison for complex nested Python data structures
A logical, reasonably standardized but flexible project structure for doing and sharing data science work.
The PySubnetTree package provides a Python data structure SubnetTree
Package for reading the Solvency 2 Risk-Free Interest Rate Term Structures from the zip-files on the EIOPA website and deriving the term structures for alternative extrapolations
A picky dataclass runtime utility collection, enforcing strict types and validations.
Simple data structures for representing source code locations
Best open-source document to markdown converter for LLM training data. Convert PDF, Word, PowerPoint, Excel, images, URLs to clean markdown, JSON, HTML locally. Alternative to Unstructured, Docling, Marker, MarkItDown, MinerU, PaddleOCR, Tesseract
Tool for creating nanowire tools with the flask structure.
Process data structures with the jq expression language.
A collection of robust, efficient and small classic algorithms and data structures.
functional data structures and type classes
A set of python libraries used to generate ASCII art from high level data structures.
SNPio is a Python API for population genetic file processing, filtering, and analysis. It is designed to be a user-friendly tool for the manipulation of population genetic data in a variety of formats. SNPio can be used to filter data based on missingness, MAF and MAC, singletons, biallelic, and monomorphic sites. It can also generate summary statistics for population genetic analyses.
A Manim-based animation library for visualizing data structures and algorithms.
mmap.ninja: Memory mapped data structures
A framework that enables efficient extraction of structured data from unstructured text using large language models (LLMs).
Query language for blending SQL and LLMs across structured + unstructured data, with type constraints.
yolo-pyutils provides utility functions for IO, data-processing, scheduling, common structures etc.
PyBloom: A Probabilistic data structure
A tool for categorizing text data and images using LLMs and vision models
Double-ended priority queue
python bindings for the daa library
Pythonic Data Structures and Algorithms
Read inventory cards, extract images, and create structured data.
Store raw and structured FollowTheMoney data from different datasets in a data lake
Location based social network (LBSN) data structure format & transfer tool
Python library to parse Circuit Maintenance notifications and return a structured data back
Python immutable data structures library
Data-structure definition/validation/traversal, mapping and serialisation toolkit for Python
High abstract python library for functional programming. Contains algebraic data structures known from Haskell or Scala.
Utility for ingesting large survey data into HATS structure
Tech Support Buddy is a versatile Python module built to empower developers and IT professionals in resolving technical issues. It provides a suite of Python functions designed to efficiently diagnose and resolve technical issues by parsing raw text into structured data, enabling automation and data-driven decision-making.
A modern, extensible language for manipulation with structured data
python library providing utilities, data structures, constants, parsers, and tools for working with USB data
Data structures for crystallography
Traverse nested data structures.
A data caching implemention based on Redis and redis_structures.
Construct file definitions for the Retro Studios game engine files
Reorganising NIfTI files from dcm2niix into the Brain Imaging Data Structure
More Collections! Some useful data structures for dealing with Data
Human-readable versatile data format
Beautiful and user friendly data structures for quantum chemistry.
A lightweight Python software package for accessing the data in the various AAIndex databases, which represent the physiochemical, biochemical and structural properties of amino acids as numerical indices.
A Python module for preserving structural isomorphisms across data transformations, ensuring reversible and type-stable conversions between formats like DataFrame, JSON, and dict.
A lightweight and extensible Python package for managing data, tailored for researchers working with structured data.
A library for computing topological data structures
A small package of basic data structures and algorithms
A pure-python module to read and write binary data