webscraping.app
Browse
About Us
Submit
Sign In
Latest tools
Categories
Tags
Submit
About Us
/
Categories
/
Databases
/
Data Lakehouse
Data Lakehouse
A curated collection of the best data lakehouse platforms combining data lake storage with warehouse query capabilities for scraped data.
Popular Categories:
Browser Automation
14
Scraping Frameworks
11
ETL Tools
9
Analytics Databases
9
SERP APIs
9
Workflow Orchestration
9
AI Web Scraping
8
Scraping APIs
8
Proxy Services
6
Distributed Crawling
6
Search Engines
6
Cloud Compute
6
Data Lakehouse – webscraping.app
Order by
Apache Hudi
Lakehouse platform for upserts and incremental processing. Efficient record-level updates.
Analytics Databases
Data Lakehouse
Apache Iceberg
Open table format for huge analytics tables. Multi-engine (Spark, Trino, Flink). Used by Netflix, Airbnb.
Analytics Databases
Data Lakehouse
Delta Lake
Lakehouse storage framework. ACID transactions, schema evolution, time travel. Databricks standard.
Analytics Databases
Data Lakehouse