🚀 Big News: Socket Acquires Coana to Bring Reachability Analysis to Every Appsec Team.Learn more →

Book a Demo Install Sign in

docling-ibm-models

Package Overview

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

docling-ibm-models

This package contains the AI models used by the Docling PDF conversion package

3.6.0

PyPI

Maintainers: 1

Docling IBM models

AI modules to support the Docling PDF document conversion project.

TableFormer is an AI module that recognizes the structure of a table and the bounding boxes of the table content.
Layout model is an AI model that provides among other things ability to detect tables on the page. This package contains inference code for Layout model.

Pipeline Overview

Architecture

Datasets

Below we list datasets used with their description, source, and "TableFormer Format". The TableFormer Format is our processed version of the version of the original format to work with the dataloader out of the box, and to augment the dataset when necassary to add missing groundtruth (bounding boxes for empty cells).

Name	Description	URL
PubTabNet	PubTabNet contains heterogeneous tables in both image and HTML format, 516k+ tables in the PubMed Central Open Access Subset	PubTabNet
FinTabNet	A dataset for Financial Report Tables with corresponding ground truth location and structure. 112k+ tables included.	FinTabNet
TableBank	TableBank is a new image-based table detection and recognition dataset built with novel weak supervision from Word and Latex documents on the internet, contains 417K high-quality labeled tables.	TableBank

Models

TableModel04:

TableModel04rs (OTSL) is our SOTA method that using transformers in order to predict table structure and bounding box.

Configuration file

Example configuration can be found inside test tests/test_tf_predictor.py These are the main sections of the configuration file:

dataset: The directory for prepared data and the parameters used during the data loading.
model: The type, name and hyperparameters of the model. Also the directory to save/load the trained checkpoint files.
train: Parameters for the training of the model.
predict: Parameters for the evaluation of the model.
dataset_wordmap: Very important part that contains token maps.

Model weights

You can download the model weights and config files from the links:

Inference Tests

You can run the inference tests for the models with:

python -m pytest tests/

This will also generate prediction and matching visualizations that can be found here: tests\test_data\viz\

Visualization outlines:

Light Pink: border of recognized table
Grey: OCR cells
Green: prediction bboxes
Red: OCR cells matched with prediction
Blue: Post processed, match
Bold Blue: column header
Bold Magenta: row header
Bold Brown: section row (if table have one)

Demo

A demo application allows to apply the LayoutPredictor on a directory <input_dir> that contains png images and visualize the predictions inside another directory <viz_dir>.

First download the model weights (see above), then run:

python -m demo.demo_layout_predictor -i <input_dir> -v <viz_dir>

e.g.

python -m demo.demo_layout_predictor -i tests/test_data/samples -v viz/

Keywords

FAQs

What is docling-ibm-models?

Is docling-ibm-models well maintained?

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

docling-ibm-models

Docling IBM models

Pipeline Overview

Datasets

Models

TableModel04:

Configuration file

Model weights

Inference Tests

Demo

Keywords

Related posts

ECMAScript 2025 Finalized with Iterator Helpers, Set Methods, RegExp.escape, and More

Node.js Homepage Adds Paid Support Link, Prompting Contributor Pushback