Ethereum Redshift ETL
Fork change
3.0.0
- remove sha3_uncles, logs_bloom, transactions_root, state_root, receipts_root, extra_data from block.
- remove trace_id from trace.
- remove 0x from the value of all address field.
3.0.1
- add prefix option to extract_csv_column command.
Ethereum ETL lets you convert blockchain data into convenient formats like CSVs and relational databases.
Do you just want to query Ethereum data right away? Use
the public dataset in BigQuery
.
Full documentation available here.
Quickstart
Install Ethereum ETL:
pip3 install ethereum-etl
Export blocks and transactions (Schema
, Reference):
> ethereumetl export_blocks_and_transactions --start-block 0 --end-block 500000 \
--blocks-output blocks.csv --transactions-output transactions.csv \
--provider-uri https://mainnet.infura.io/v3/7aef3f0cd1f64408b163814b22cc643c
Export ERC20 and ERC721 transfers (Schema
, Reference):
> ethereumetl export_token_transfers --start-block 0 --end-block 500000 \
--provider-uri file://$HOME/Library/Ethereum/geth.ipc --output token_transfers.csv
Export traces (Schema, Reference):
> ethereumetl export_traces --start-block 0 --end-block 500000 \
--provider-uri file://$HOME/Library/Ethereum/parity.ipc --output traces.csv
Stream blocks, transactions, logs, token_transfers continually to console (Reference):
> pip3 install ethereum-etl[streaming]
> ethereumetl stream --start-block 500000 -e block,transaction,log,token_transfer --log-file log.txt \
--provider-uri https://mainnet.infura.io/v3/7aef3f0cd1f64408b163814b22cc643c
Find other commands here.
For the latest version, check out the repo and call
> pip3 install -e .
> python3 ethereumetl.py
Useful Links
Running Tests
> pip3 install -e .[dev,streaming]
> export ETHEREUM_ETL_RUN_SLOW_TESTS=True
> export PROVIDER_URL=<your_porvider_uri>
> pytest -vv
Running Tox Tests
> pip3 install tox
> tox
Running in Docker
-
Install Docker: https://docs.docker.com/install/
-
Build a docker image
> docker build -t ethereum-etl:latest .
> docker image ls
-
Run a container out of the image
> docker run -v $HOME/output:/ethereum-etl/output ethereum-etl:latest export_all -s 0 -e 5499999 -b 100000 -p https://mainnet.infura.io
> docker run -v $HOME/output:/ethereum-etl/output ethereum-etl:latest export_all -s 2018-01-01 -e 2018-01-01 -p https://mainnet.infura.io
-
Run streaming to console or Pub/Sub
> docker build -t ethereum-etl:latest -f Dockerfile .
> echo "Stream to console"
> docker run ethereum-etl:latest stream --start-block 500000 --log-file log.txt
> echo "Stream to Pub/Sub"
> docker run -v /path_to_credentials_file/:/ethereum-etl/ --env GOOGLE_APPLICATION_CREDENTIALS=/ethereum-etl/credentials_file.json ethereum-etl:latest stream --start-block 500000 --output projects/<your-project>/topics/crypto_ethereum
Projects using Ethereum ETL
- Google - Public BigQuery Ethereum datasets
- Nansen - Analytics platform for Ethereum