webscraping.app
Browse
Pricing
About Us
Submit
Sign In
Latest tools
Categories
Tags
Pricing
Submit
About Us
/
Categories
/
Databases
/
Analytics Databases
Analytics Databases
A curated collection of the best columnar and analytical databases optimized for querying and aggregating large scraped datasets.
Popular Categories:
Browser Automation
14
Scraping Frameworks
11
Analytics Databases
9
SERP APIs
9
ETL Tools
9
Workflow Orchestration
9
AI Web Scraping
8
Scraping APIs
8
Distributed Crawling
6
Cloud Compute
6
Proxy Services
6
Search Engines
6
Order by
Apache Hudi
Lakehouse Platform for Incremental Processing
Analytics Databases
Data Lakehouse
Apache Hudi is an open data lakehouse platform that enables efficient record-level upserts, incremental processing, and ACID transactions on data lakes.
Apache Iceberg
Open Table Format for Huge Analytic Datasets
Analytics Databases
Data Lakehouse
Apache Iceberg is a high-performance open table format enabling multi-engine analytics with schema evolution, time travel, and hidden partitioning.
Delta Lake
Open-Source Lakehouse Storage Framework
Analytics Databases
Data Lakehouse
Delta Lake is an open-source storage framework that brings ACID transactions, schema evolution, and time travel to data lakes at petabyte scale.
StarRocks
Sub-second analytics for enterprise-scale data
Analytics Databases
StarRocks is an open-source analytical database that delivers sub-second query latency for complex joins and aggregations at enterprise scale.
Apache Druid
Real-time OLAP database for event-driven data
Analytics Databases
Apache Druid is a real-time OLAP database that delivers sub-second queries on streaming and batch data for analytics at scale.
Apache Doris
Real-time analytical database with MySQL syntax
Analytics Databases
Apache Doris is a real-time analytical database with MySQL-compatible syntax for high-concurrency queries over large scraped datasets.
QuestDB
Time-series database with ultra-low latency
Analytics Databases
QuestDB is an open-source time-series database delivering ultra-low latency ingestion and queries for scraping metrics and monitoring data.
DuckDB
In-process SQL analytics, no server needed
Analytics Databases
DuckDB is an in-process analytical database that lets you query scraped CSV, Parquet, and JSON files with fast SQL, no server required.
ClickHouse
Real-time analytics database for petabyte-scale data
Analytics Databases
ClickHouse is a fast, open-source columnar database that runs analytical queries on billions of scraped records in milliseconds.