Search code examples
pythongoogle-bigqueryethereumblockchaincryptocurrency

Labelling exchange addresses in ethereum-etl datasets and BigQuery


I want to query all of known CEX addresses (ideally from the beginning of ETH til now) with google bigquery. So far, I'm getting data from the bigquery-public-data & crypto-ethereum using SQL. If possible, I also want the list for DEX addresses. Is there any dataset that I can extract data from?

Thank you in advance!


Solution

  • As commented by @Mikko Ohtamaa, You cannot get the CEX address from BigQuery UI as they are not public information. Instead, you can buy them privately. DEX addresses are available in Trading Strategy Exchanges dataset.

    You can find the data description here. This covers only Uniswap-like DEXes, as there are dozens of different DEX types available.

    Also, as mentioned by @Mikko Ohtamaa, there is no single "deposit" address or even a single "swap address." Uniswap has multiple "router" smart contracts, but one does not need to use them. In fact, aggregators like 1inch, which is most of the volume, do not use them. Thus, for DEX, the only way to analyze trades, you need to analyze each liquidity pool / pair contract individually.

    Posting the answer as community wiki for the benefit of the community that might encounter this use case in the future. Feel free to edit this answer for additional information.