Google BigQuery has made the Ethereum dataset publicly available. BigQuery is a Google Cloud service that requires a subscription. It allows companies to generate interactive analysis of big data in conjunction with Google Storage, making it a high-performance and scalable query engine on the cloud.
BigQuery has been used for multiple different types of use cases, including the collection, exploration and processing of historical weather data in order to improve weather predictions to querying metadata collected from 1.1 billion taxi trips.
Applied to Ethereum, BigQuery can facilitate the operations of decentralized autonomous organizations and help them scale by quickly retrieving and analyzing terabytes of data from smart contracts that create agreements and trigger automatic execution.
Etherscan.io, a popular Ethereum explorer, displays wallet addresses and their transactions. Users of BigQuery can delve even more deeply into the blockchain and also connect directly with the Parity client.
According to the announcement, developers of BigQuery have built a software system on Google Cloud that:
- Synchronizes the Ethereum blockchain to computers running Parity in Google Cloud.
- Performs a daily extraction of data from the Ethereum blockchain ledger, including the results of smart contract transactions, such as token transfers.
- De-normalizes and stores date-partitioned data to BigQuery for easy and cost-effective exploration.
BigQuery can, for example, find the most popular smart contracts, as measured by transaction count.
Google BigQuery made the Bitcoin dataset publicly available for analysis earlier this year.