
Table Diff is a Python package that provides a text-based interface for comparing two tables. It is designed to be used by data analysts and data scientists to compare two tables and identify differences between them, especially as data is modified in an ETL pipeline.
The diff between two tables is printed to stdout as Markdown, and can be saved to a Markdown file and/or PDF file.
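To illustrate the idea of a keyed table comparison, here is a minimal sketch using only the standard library. It is not Table Diff's implementation: the `diff_tables` and `read_csv_text` helpers and the sample data are hypothetical, and the sketch assumes rows are matched on one or more primary-key columns, as the `-u` option in the usage section suggests.

```python
import csv
import io


def diff_tables(old_rows, new_rows, key_cols):
    """Compare two lists of dict rows, matching rows on the key columns."""

    def index(rows):
        # Map each row's primary-key tuple to the row itself.
        return {tuple(row[k] for k in key_cols): row for row in rows}

    old, new = index(old_rows), index(new_rows)
    added = sorted(new.keys() - old.keys())
    removed = sorted(old.keys() - new.keys())
    changed = {}
    for key in sorted(old.keys() & new.keys()):
        # Record (old_value, new_value) for every column that differs.
        cols = {
            col: (old[key].get(col), new[key].get(col))
            for col in old[key].keys() | new[key].keys()
            if old[key].get(col) != new[key].get(col)
        }
        if cols:
            changed[key] = cols
    return {"added": added, "removed": removed, "changed": changed}


def read_csv_text(text):
    """Parse CSV text into a list of dict rows (all values stay strings)."""
    return list(csv.DictReader(io.StringIO(text)))


# Hypothetical sample data, keyed by location_id.
old_rows = read_csv_text("location_id,population\n1,100\n2,200\n3,300\n")
new_rows = read_csv_text("location_id,population\n1,100\n2,250\n4,400\n")

result = diff_tables(old_rows, new_rows, ["location_id"])
print(result)
```

In this sample, row `4` is reported as added, row `3` as removed, and row `2` as changed in the `population` column.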
Install Python 3.10 or later.
Install pipx, a tool to create isolated Python environments for individual packages:
pip install pipx
pipx install "table-diff[pdf]"
# Optionally, install without PDF export support:
pipx install table-diff
table_diff <old_csv_path> <new_csv_path> -u PrimaryKeyCol1 PrimaryKeyColN
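Because the diff is printed to stdout as Markdown, one way to keep a copy is to capture that output from a script. The sketch below assumes `table_diff` is on your `PATH` and uses only the `-u` option shown above. The `build_command` and `save_diff` helpers are hypothetical names, and the tool may offer its own options for saving output that are not shown here.

```python
import subprocess
from pathlib import Path


def build_command(old_path, new_path, key_cols):
    """Build the table_diff argument list from the usage line above."""
    return ["table_diff", str(old_path), str(new_path), "-u", *key_cols]


def save_diff(old_path, new_path, key_cols, out_path):
    """Run table_diff and write whatever it prints to stdout into out_path."""
    completed = subprocess.run(
        build_command(old_path, new_path, key_cols),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if table_diff exits non-zero
    )
    out_file = Path(out_path)
    out_file.write_text(completed.stdout, encoding="utf-8")
    return out_file
```

For example, `save_diff("old.csv", "new.csv", ["location_id"], "diff.md")` would write the captured output to `diff.md`.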
For development environment setup, please refer to the CONTRIBUTING.md guide.
Running this tool with Docker is not recommended.
docker build -t table-diff .
docker run -it -v <local_folder_path>:/files table-diff
# Inside the container:
table_diff /files/<your_file_name_left.csv/pq> /files/<your_file_name_right.csv/pq> -u PrimaryKeyCol
To run the demo with the sample dataset bundled in this repository, run:
docker build -t table-diff .
docker run -it table-diff
# Inside the container:
table_diff tests/demo_datasets/populations/city-populations_2010.csv tests/demo_datasets/populations/city-populations_2015.csv -u location_id
Please submit Bug Reports and Merge Requests to the GitLab project.
Please refer to the CONTRIBUTING.md file for more details about the contribution policy.
This project is licensed using the MIT License. For more information, see the LICENSE file.
Note that this project has been created and modified with the help of Large Language Model (LLM)-based tools like GitHub Copilot and ChatGPT.