
Security News
Meet Socket at Black Hat and DEF CON 2025 in Las Vegas
Meet Socket at Black Hat & DEF CON 2025 for 1:1s, insider security talks at Allegiant Stadium, and a private dinner with top minds in software supply chain security.
Original repository: https://github.com/microsoft/table-transformer
This is the Table Transformer Model developed by Brandon Smock et al. of Microsoft AI. This repository consists of Table Structure Recognition (TATR) for detecting and extracting table infomation into popular formats such as CSV or HTML table, plus text recognition using EasyOCR.
pip install table-transformer
The full model usage can be found here:
from table_transformer import TableExtractionPipeline
pipe = TableExtractionPipeline(det_device="cpu", str_device="cpu",
det_model_path=".\path\to\pubtables1m_detection_detr_r18.pth",
str_model_path=".\path\to\TATR-v1.1-Pub-msft.pth")
img = "\path\to\image.jpg"
table_objects, table_cells_coordinates, table_cells_text = pipe(img)
print(table_cells_text[0]) # Should be DataFrame
With structure recognition, the original author has evaluated the v1.0 model on PubTables-1M with great results. With other datasets such as PubTabNet, the score is quite good.
You can check out the score and run the evaluation with your own dataset in this link.
FAQs
Table Transformer
We found that table-transformer demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
Meet Socket at Black Hat & DEF CON 2025 for 1:1s, insider security talks at Allegiant Stadium, and a private dinner with top minds in software supply chain security.
Security News
CAI is a new open source AI framework that automates penetration testing tasks like scanning and exploitation up to 3,600× faster than humans.
Security News
Deno 2.4 brings back bundling, improves dependency updates and telemetry, and makes the runtime more practical for real-world JavaScript projects.