webscraping.app
Browse
Pricing
About Us
Submit
Sign In
Latest tools
Categories
Tags
Pricing
Submit
About Us
/
Categories
/
Databases
Databases
Browse 5 subcategories of Databases tools and find the perfect solution for your needs.
Caching Databases
3 tools
NoSQL Databases
4 tools
Analytics Databases
9 tools
Vector Databases
5 tools
Data Lakehouse
3 tools
Popular Categories:
Browser Automation
14
Scraping Frameworks
11
Analytics Databases
9
SERP APIs
9
ETL Tools
9
Workflow Orchestration
9
AI Web Scraping
8
Scraping APIs
8
Distributed Crawling
6
Cloud Compute
6
Proxy Services
6
Search Engines
6
Order by
Apache Hudi
Lakehouse Platform for Incremental Processing
Analytics Databases
Data Lakehouse
Apache Hudi is an open data lakehouse platform that enables efficient record-level upserts, incremental processing, and ACID transactions on data lakes.
DynamoDB
AWS managed serverless NoSQL database
NoSQL Databases
Amazon DynamoDB is a fully managed, serverless NoSQL database with single-digit millisecond performance for high-throughput scraping workloads.
Apache Iceberg
Open Table Format for Huge Analytic Datasets
Analytics Databases
Data Lakehouse
Apache Iceberg is a high-performance open table format enabling multi-engine analytics with schema evolution, time travel, and hidden partitioning.
Delta Lake
Open-Source Lakehouse Storage Framework
Analytics Databases
Data Lakehouse
Delta Lake is an open-source storage framework that brings ACID transactions, schema evolution, and time travel to data lakes at petabyte scale.
LanceDB
Serverless vector database, no server management
Vector Databases
LanceDB is a serverless vector database built in Rust that runs embedded or in the cloud with native DataFrame integration for AI workloads.
Cassandra
Distributed NoSQL with high availability at scale
NoSQL Databases
Apache Cassandra is a distributed NoSQL database that handles massive amounts of scraped data with high availability and linear scalability.
StarRocks
Sub-second analytics for enterprise-scale data
Analytics Databases
StarRocks is an open-source analytical database that delivers sub-second query latency for complex joins and aggregations at enterprise scale.
KeyDB
Multithreaded Redis fork with higher throughput
Caching Databases
KeyDB is a multithreaded Redis-compatible database delivering over 1M ops/sec per node for high-throughput caching and queuing.
Apache Druid
Real-time OLAP database for event-driven data
Analytics Databases
Apache Druid is a real-time OLAP database that delivers sub-second queries on streaming and batch data for analytics at scale.
Memcached
Simple, fast distributed memory caching system
Caching Databases
Memcached is a free, high-performance distributed memory caching system that speeds up dynamic applications by caching scraped data in RAM.
Apache Doris
Real-time analytical database with MySQL syntax
Analytics Databases
Apache Doris is a real-time analytical database with MySQL-compatible syntax for high-concurrency queries over large scraped datasets.
Weaviate
AI-native vector database with built-in ML models
Vector Databases
Weaviate is an AI-native vector database with built-in ML models for hybrid search, RAG, and semantic retrieval over scraped content.
QuestDB
Time-series database with ultra-low latency
Analytics Databases
QuestDB is an open-source time-series database delivering ultra-low latency ingestion and queries for scraping metrics and monitoring data.
PostgreSQL + JSONB
SQL database with native JSON document support
NoSQL Databases
PostgreSQL with JSONB combines relational SQL power with flexible JSON document storage for scraped data that needs both structure and flexibility.
Chroma
AI-native embedding database with simple API
Vector Databases
Chroma is an open-source embedding database with a simple API for storing and querying vectors, ideal for AI search over scraped content.
MongoDB
Flexible document database for any data shape
NoSQL Databases
MongoDB is a flexible document database that stores scraped data as JSON-like documents, making it easy to ingest varied web content.
Qdrant
High-performance vector search engine in Rust
Vector Databases
Qdrant is a high-performance vector search engine built in Rust for production-grade semantic search and RAG applications.
DuckDB
In-process SQL analytics, no server needed
Analytics Databases
DuckDB is an in-process analytical database that lets you query scraped CSV, Parquet, and JSON files with fast SQL, no server required.
Milvus
Open-source vector database for billion-scale AI
Vector Databases
Milvus is an open-source vector database that handles billion-scale embeddings with GPU acceleration for AI-powered search and retrieval.
ClickHouse
Real-time analytics database for petabyte-scale data
Analytics Databases
ClickHouse is a fast, open-source columnar database that runs analytical queries on billions of scraped records in milliseconds.
Redis
In-memory data store for caching and queues
Caching Databases
Redis is a blazing-fast in-memory data store used to cache scraped results, manage job queues, and power real-time data pipelines.