We drew on decades of experience in TradFi and capital-markets big data to build state-of-the-art data lakes and warehouses for blockchain data across more than 120 chains and all protocols. SonarX data infrastructure removes the cost and burden of collecting, storing, processing, maintaining, and accessing these XXX-L datasets at scale and speed. SonarX also delivers audit- and forensic-grade data quality that customers can rely on for their most demanding and critical workloads, including trading and investing, regulatory compliance, tax accounting, and fraud detection and prevention.

SonarX handles bulk data delivery and the full lifecycle of data movement, including the often-overlooked complexities:
  • Bulk delivery: Move hundreds of terabytes to your destinations (Snowflake, Databricks, S3, BigQuery, GCS, ABS) in days, with no custom scripts. Parallelized loads, checksum verification, and restartable jobs ensure integrity at scale.
  • Seamless schema updates & migrations: Leverage Iceberg/Delta table formats with clearly defined SOPs for backward compatibility. Versioned schemas, automatic schema evolution, and reorg-aware backfills keep pipelines stable without breaking downstream consumers.
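
To illustrate how checksum verification and restartable jobs fit together, here is a minimal sketch of what a consumer-side check could look like. The manifest format (file name mapped to an expected SHA-256 digest) and the function names are assumptions made for illustration; they are not a documented SonarX interface.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Stream a file through SHA-256 so large files never sit fully in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def files_to_retry(manifest: dict[str, str], root: Path) -> list[str]:
    """Return manifest entries that are missing or fail their checksum.

    `manifest` maps a relative file name to its expected SHA-256 hex digest
    (a hypothetical format). Re-running after a partial load only needs to
    re-fetch the names returned here, which is what makes the job restartable.
    """
    pending = []
    for name, expected in manifest.items():
        target = root / name
        if not target.is_file() or sha256_of(target) != expected:
            pending.append(name)
    return pending
```

Files that already arrived intact are skipped on the next run, so an interrupted transfer resumes from where it stopped instead of starting over.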

Benefits of SonarX Datasets

Wide and deep like no others

We currently support more than 120 blockchains across all protocols, all fully curated and optimized for analytics. You’ll find the core datasets you’d expect in a standardized, homogeneous format across all chains: blocks, transactions, logs, receipts, traces, regular and internal transfers, and more. Each dataset comes with enrichments that make complex analytics seamless, including token metadata, converted values, and USD pricing. For example, SonarX datasets track native and token balances with a proprietary technique that we believe is one-of-a-kind in the market: without netting transfers or calling RPCs, it gives customers a complete and accurate historical balance for any address at any point in time. We support this capability for both native and non-native token balances.
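
The SonarX balance technique is proprietary and not described here, so the following is only a sketch of what a point-in-time ("as of") balance query looks like from the consumer side. It assumes a hypothetical table with one row per balance change, shaped as (address, block_number, balance_after_block); the table shape, the sample rows, and the function names are all illustrative.

```python
from bisect import bisect_right

# Hypothetical snapshot rows: (address, block_number, balance_after_block).
SNAPSHOTS = [
    ("0xabc", 100, 5_000),
    ("0xabc", 250, 3_200),
    ("0xabc", 900, 7_750),
    ("0xdef", 120, 42),
]


def build_index(rows):
    """Group snapshots by address, ordered by block number."""
    index = {}
    for address, block, balance in sorted(rows):
        blocks, balances = index.setdefault(address, ([], []))
        blocks.append(block)
        balances.append(balance)
    return index


def balance_at(index, address, block):
    """Return the balance as of `block` (inclusive), or 0 before first activity."""
    if address not in index:
        return 0
    blocks, balances = index[address]
    pos = bisect_right(blocks, block)  # number of snapshots at or before `block`
    return balances[pos - 1] if pos else 0


index = build_index(SNAPSHOTS)
print(balance_at(index, "0xabc", 300))  # latest snapshot at or before block 300
```

Because each row already stores the resulting balance, the lookup reads a single row rather than summing every transfer up to the requested block.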

The power of simplicity in action: decoded data

Decoding at scale is complex, compute-heavy, and expensive, but it lets customers query events and smart contract interactions in plain language without managing or maintaining their own ABI definitions. For this reason, SonarX provides decoded logs and traces, processed across full history using over 100 million contract ABIs across all supported chains. For example, in traces we decode both the input values passed to a smart contract function and the output values it returns, which makes every smart contract interaction ready for immediate analysis. We support the same functionality for logs, decoding them into a human-readable format ready for use.
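
To show what decoding involves, here is a minimal sketch that decodes the calldata of a single well-known function, the ERC-20 `transfer(address,uint256)` call, using the standard Ethereum ABI layout (a 4-byte selector followed by 32-byte argument words). It is a hand-written illustration for one function signature, not the SonarX decoding pipeline; the function name and the sample calldata are made up for the example.

```python
# Selector: first 4 bytes of keccak256("transfer(address,uint256)").
TRANSFER_SELECTOR = "a9059cbb"


def decode_erc20_transfer(calldata_hex: str) -> dict:
    """Decode raw calldata for an ERC-20 transfer(address,uint256) call."""
    data = bytes.fromhex(calldata_hex.removeprefix("0x"))
    if data[:4].hex() != TRANSFER_SELECTOR:
        raise ValueError("not an ERC-20 transfer call")
    if len(data) != 4 + 32 * 2:
        raise ValueError("unexpected calldata length")
    to_word, amount_word = data[4:36], data[36:68]
    return {
        "function": "transfer",
        # An address is the last 20 bytes of its left-padded 32-byte word.
        "to": "0x" + to_word[12:].hex(),
        # Raw integer in the token's base units (decimals not applied here).
        "amount": int.from_bytes(amount_word, "big"),
    }


# Illustrative calldata: send 1000 base units to a placeholder address.
example = "0xa9059cbb" + "00" * 12 + "11" * 20 + f"{1000:064x}"
print(decode_erc20_transfer(example))
```

Doing this for arbitrary contracts requires the matching ABI for every function and event, which is why maintaining the definitions at scale is the expensive part.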

Optimized for searching and querying at speed and scale

SonarX offers custom search optimization strategies for common query patterns. These optimizations can cut search and query times from several minutes to a few seconds. For example, we benchmarked a historical wallet activity query against a standard table and an optimized table, each containing the same 8 billion records. Against the standard table, the query returned approximately 1.1 million records in 8 minutes. Against the table optimized for address-based lookups, the same query returned the same 1.1 million records in about 4 seconds, roughly a 120x speedup.
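
The specific optimization strategy is not described here. As one hedged illustration of why physical layout matters for address-based lookups, the sketch below compares a full scan with a lookup over rows that have been sorted (clustered) by address, so all rows for one address sit in a single contiguous run. Clustering by address is an assumption for this example, and the row shape and function names are hypothetical.

```python
from bisect import bisect_left, bisect_right


def scan_lookup(rows, address):
    """Baseline: inspect every row, so cost grows with table size."""
    return [row for row in rows if row[0] == address]


def build_clustered(rows):
    """Sort rows by address once; each address then occupies one contiguous run."""
    ordered = sorted(rows, key=lambda row: row[0])
    keys = [row[0] for row in ordered]
    return keys, ordered


def clustered_lookup(keys, ordered, address):
    """Binary-search the run boundaries, then read only the matching slice."""
    lo = bisect_left(keys, address)
    hi = bisect_right(keys, address)
    return ordered[lo:hi]


# Hypothetical rows: (address, tx_hash).
rows = [("0xb", "t1"), ("0xa", "t2"), ("0xc", "t3"), ("0xa", "t4")]
keys, ordered = build_clustered(rows)
print(clustered_lookup(keys, ordered, "0xa"))
```

Both functions return the same rows; the clustered version finds them with two binary searches instead of touching every record, which is the same idea that lets a warehouse skip most of a table when data is organized around the lookup key.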

Advanced Analytics and Focused Datasets

SonarX also curates Staking, DEX (including Hyperliquid full L2, L3, and L4 data), and other specialized datasets, enabling analysis of decentralized activity and liquidity across multiple dimensions and verticals. This curation lets our customers analyze blockchain activity with speed, precision, and confidence, without having to deal with complex data engineering challenges. We continually expand these specialized datasets and, based on customers’ specific requirements, also offer focused datasets for Tokens & Payments, RWAs & Tokenized Assets, and Wallets. Soon (2026), we will add Proof-of-Reserve & Attestation, DeFi Reference Data, Risk, Credit & Underwriting, and more.

Supported Integrations

SonarX supports a wide range of data connectors and data lake destinations, enabling seamless integration of blockchain data with your infrastructure.

Data Export Methods

| Integration | Mechanism | Supported Destinations |
| --- | --- | --- |
| Instant Data Shares | Query blockchain data directly via your data lake | ❄️ Snowflake Shares; 🧱 Databricks Delta Sharing; 🔍 BigQuery Analytics Hub |
| Data Dumps | Get data delivered to your cloud storage. Multiple formats supported (CSV, Parquet, Iceberg) | Amazon S3, Google Cloud Storage, and Azure Blob Storage |

Regional Availability

SonarX’s Instant Data Shares are natively hosted in AWS’s US-West-2 region and can be made available in every other region worldwide upon customer request.

| Platform | Regions & Coverage |
| --- | --- |
| Snowflake & Databricks | US-West-2 region; worldwide delivery available with 30-minute or less freshness; includes Solana and Hyperliquid |
| Amazon Public Blockchain Dataset | US-East-2 region |
| BigQuery | Coming in 2026 |

Need a different region, platform, or format? Contact us to discuss your requirements.