Security News
Research
Data Theft Repackaged: A Case Study in Malicious Wrapper Packages on npm
The Socket Research Team breaks down a malicious wrapper package that uses obfuscation to harvest credentials and exfiltrate sensitive data.
ingestr is a command-line application that ingests data from various sources and stores them in any database.
Copy data from any source to any destination without any code
ingestr is a command-line app that allows you to ingest data from any source into any destination using simple command-line flags, no code necessary.
append
, merge
or delete+insert
ingestr takes away the complexity of managing any backend or writing any code for ingesting data, simply run the command and watch the data land on its destination.
We recommend using uv to run ingestr
.
pip install uv
uvx ingestr
Alternatively, if you'd like to install it globally:
uv pip install --system ingestr
While installation with vanilla pip
is possible, it's an order of magnitude slower.
ingestr ingest \
--source-uri 'postgresql://admin:admin@localhost:8837/web?sslmode=disable' \
--source-table 'public.some_data' \
--dest-uri 'bigquery://<your-project-name>?credentials_path=/path/to/service/account.json' \
--dest-table 'ingestr.some_data'
That's it.
This command:
public.some_data
from the Postgres instance.ingestr
and table some_data
.You can see the full documentation here.
Join our Slack community here.
Pull requests are welcome. However, please open an issue first to discuss what you would like to change. We maybe able to offer you help and feedback regarding any changes you would like to make.
[!NOTE] After cloning
ingestr
make sure to runmake setup
to install githooks.
Source | Destination | |
---|---|---|
Databases | ||
Postgres | ✅ | ✅ |
BigQuery | ✅ | ✅ |
Snowflake | ✅ | ✅ |
Redshift | ✅ | ✅ |
Databricks | ✅ | ✅ |
DuckDB | ✅ | ✅ |
Microsoft SQL Server | ✅ | ✅ |
Local CSV file | ✅ | ✅ |
MongoDB | ✅ | ❌ |
Oracle | ✅ | ❌ |
SAP Hana | ✅ | ❌ |
SQLite | ✅ | ❌ |
MySQL | ✅ | ❌ |
Platforms | ||
Adjust | ✅ | - |
Airtable | ✅ | - |
AppsFlyer | ✅ | - |
Chess.com | ✅ | - |
Facebook Ads | ✅ | - |
Gorgias | ✅ | - |
Google Sheets | ✅ | - |
HubSpot | ✅ | - |
Klaviyo | ✅ | - |
Notion | ✅ | - |
S3 | ✅ | - |
Shopify | ✅ | - |
Slack | ✅ | - |
Stripe | ✅ | - |
Zendesk | ✅ | - |
Feel free to create an issue if you'd like to see support for another source or destination.
This project would not have been possible without the amazing work done by the SQLAlchemy and dlt teams. We relied on their work to connect to various sources and destinations, and built ingestr
as a simple, opinionated wrapper around their work.
FAQs
ingestr is a command-line application that ingests data from various sources and stores them in any database.
We found that ingestr demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
Research
The Socket Research Team breaks down a malicious wrapper package that uses obfuscation to harvest credentials and exfiltrate sensitive data.
Research
Security News
Attackers used a malicious npm package typosquatting a popular ESLint plugin to steal sensitive data, execute commands, and exploit developer systems.
Security News
The Ultralytics' PyPI Package was compromised four times in one weekend through GitHub Actions cache poisoning and failure to rotate previously compromised API tokens.