webscraping.app
Features
Pricing
About Us
Sign In
Get early access
Features
Pricing
About Us
/
Categories
/
Databases
/
Analytics Databases
Analytics Databases
A curated collection of the best columnar and analytical databases optimized for querying and aggregating large scraped datasets.
Order by
Delta Lake
Open-Source Lakehouse Storage Framework
Analytics Databases
Data Lakehouse
Delta Lake is an open-source storage framework that brings ACID transactions, schema evolution, and time travel to data lakes at petabyte scale.
DuckDB
In-process SQL analytics, no server needed
Analytics Databases
DuckDB is an in-process analytical database that lets you query scraped CSV, Parquet, and JSON files with fast SQL, no server required.
ClickHouse
Real-time analytics database for petabyte-scale data
Analytics Databases
ClickHouse is a fast, open-source columnar database that runs analytical queries on billions of scraped records in milliseconds.
Apache Hudi
Lakehouse Platform for Incremental Processing
Analytics Databases
Data Lakehouse
Apache Hudi is an open data lakehouse platform that enables efficient record-level upserts, incremental processing, and ACID transactions on data lakes.
Apache Iceberg
Open Table Format for Huge Analytic Datasets
Analytics Databases
Data Lakehouse
Apache Iceberg is a high-performance open table format enabling multi-engine analytics with schema evolution, time travel, and hidden partitioning.
StarRocks
Sub-second analytics for enterprise-scale data
Analytics Databases
StarRocks is an open-source analytical database that delivers sub-second query latency for complex joins and aggregations at enterprise scale.
Apache Druid
Real-time OLAP database for event-driven data
Analytics Databases
Apache Druid is a real-time OLAP database that delivers sub-second queries on streaming and batch data for analytics at scale.
Apache Doris
Real-time analytical database with MySQL syntax
Analytics Databases
Apache Doris is a real-time analytical database with MySQL-compatible syntax for high-concurrency queries over large scraped datasets.
QuestDB
Time-series database with ultra-low latency
Analytics Databases
QuestDB is an open-source time-series database delivering ultra-low latency ingestion and queries for scraping metrics and monitoring data.