
transtractor
The Transaction Extractor, or 'Transtractor', aspires to be a universal library for extracting transaction data from PDF bank statements.
Install Rust: Download and install Rust from rustup.rs
Install Maturin: Install the Python build tool for Rust extensions
pip install maturin
Build and install Transtractor: Clone the repository and build
git clone https://github.com/gravytoast/transtractor.git
cd transtractor
maturin develop --release
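After the build finishes, it can help to confirm that the package landed in the Python environment you intend to use. The snippet below is a minimal sketch that uses only the standard library; the helper name `is_installed` is made up for this example and is not part of Transtractor.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be located on the import path."""
    return importlib.util.find_spec(module_name) is not None


if is_installed("transtractor"):
    print("transtractor is importable in this environment")
else:
    # A common cause is running maturin in a different virtual environment.
    print("transtractor not found; re-run `maturin develop --release` "
          "with the target environment active")
```

The check only locates the module; it does not import it or verify that the compiled extension loads correctly.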
Import and initialise the parser
from transtractor import Parser
parser = Parser()
Convert PDF to CSV: All CSV files are written in a standard format
parser.parse('statement.pdf').to_csv('statement.csv')
Convert PDF to DataFrame: Load into a DataFrame for analysis
import pandas as pd
data = parser.parse('statement.pdf').to_pandas_dict()
df = pd.DataFrame(data)
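The single-file calls above extend naturally to a folder of statements. The following is a rough sketch, not part of the library: `convert_folder` is a hypothetical helper that relies only on the `parse(...).to_csv(...)` calls shown above, and it catches a broad `Exception` because the exception types raised for unsupported statements are not described here.

```python
from pathlib import Path


def convert_folder(parser, pdf_dir, csv_dir):
    """Convert every *.pdf in pdf_dir to a CSV of the same name in csv_dir.

    `parser` is any object exposing parse(path).to_csv(path).
    Returns (converted, failed): a list of file names and a list of
    (file name, error message) tuples.
    """
    pdf_dir, csv_dir = Path(pdf_dir), Path(csv_dir)
    csv_dir.mkdir(parents=True, exist_ok=True)

    converted, failed = [], []
    for pdf_path in sorted(pdf_dir.glob("*.pdf")):
        csv_path = csv_dir / (pdf_path.stem + ".csv")
        try:
            parser.parse(str(pdf_path)).to_csv(str(csv_path))
            converted.append(pdf_path.name)
        except Exception as exc:  # assumption: error types are unknown
            failed.append((pdf_path.name, str(exc)))
    return converted, failed
```

With the parser created earlier, a call might look like `convert_folder(parser, 'statements', 'csv_out')`, where both folder names are placeholders. Collecting failures instead of stopping at the first one lets a single unsupported statement be reported without blocking the rest of the batch.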
See the documentation for a current list of supported statements. You can also create your own parsing configuration file by following these instructions and load it with:
from transtractor import Parser
parser = Parser()
parser.load('my_config.json')
parser.parse('statement.pdf').to_csv('statement.csv')
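When working on a hand-written configuration, a quick syntax check before loading can make mistakes easier to spot. The sketch below assumes the configuration is a JSON file, as the `my_config.json` name suggests; `load_checked_config` is a hypothetical helper, and it checks only that the file exists and parses as JSON, not that it matches the structure Transtractor expects.

```python
import json
from pathlib import Path


def load_checked_config(parser, config_path):
    """Check that config_path is readable JSON, then pass it to parser.load()."""
    path = Path(config_path)
    if not path.is_file():
        raise FileNotFoundError(f"Config file not found: {path}")

    try:
        with path.open(encoding="utf-8") as fh:
            json.load(fh)  # syntax check only; the parsed value is discarded
    except json.JSONDecodeError as exc:
        raise ValueError(f"{path} is not valid JSON: {exc}") from exc

    parser.load(str(path))
    return parser
```

It could be used in place of the direct `parser.load('my_config.json')` call, for example `load_checked_config(parser, 'my_config.json')`.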
New and well-tested configuration files are especially welcome. Please submit a pull request adding them to the python/transtractor/configs directory, or email them to develop@transtractor.net.
FAQs
Universal PDF bank statement parsing library
We found that transtractor demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.