webscraping.app
Features
Pricing
About Us
Sign In
Get early access
Features
Pricing
About Us
/
Tags
/
Java Script
Best Java Script Web Scraping Tools
Discover the best web scraping tools and libraries for Java Script. Compare features, community support, and documentation.
Order by
Scrapoxy
Open-source super proxy aggregator
Proxy Services
Scrapoxy is an open-source proxy aggregator that unifies multiple proxy providers with built-in fingerprinting and IP rotation for web scraping.
BeeAI Framework
Production-Ready AI Agent Framework
AI Web Scraping
Scraping Frameworks
BeeAI Framework is an open-source platform for building production-ready multi-agent systems in Python and TypeScript with MCP and workflow support.
parse5
HTML5 spec-compliant parser for Node.js
HTML Parsers
parse5 is the fastest spec-compliant HTML5 parser for Node.js, parsing HTML exactly as modern browsers do.
Algolia
Managed search and AI retrieval platform
Search Engines
Algolia is a managed search platform that delivers fast, relevant search results over scraped and structured content for web applications.
Inngest
Durable Functions for Workflows and Queues
Task Queues
Inngest replaces traditional queues with durable step functions, enabling reliable multi-step workflows on serverless, servers, or the edge.
htmlparser2
The fast and forgiving HTML/XML parser
HTML Parsers
htmlparser2 is the fastest Node.js HTML and XML parser, offering a streaming, event-driven API with a forgiving approach to malformed markup.
LLM-Scraper
Turn Any Webpage into Structured Data with LLMs
AI Web Scraping
Scraping Frameworks
LLM Scraper is a TypeScript library that extracts structured data from any webpage using LLMs with Zod schemas, Playwright, and streaming support.
LanceDB
Serverless vector database, no server management
Vector Databases
LanceDB is a serverless vector database built in Rust that runs embedded or in the cloud with native DataFrame integration for AI workloads.
Cassandra
Distributed NoSQL with high availability at scale
NoSQL Databases
Apache Cassandra is a distributed NoSQL database that handles massive amounts of scraped data with high availability and linear scalability.
WebdriverIO
Next-Gen Browser and Mobile Automation for Node.js
Browser Automation
Headless Browsers
WebdriverIO is an all-in-one Node.js automation framework for web and mobile testing with smart selectors, component testing, and extensive plugins.
TestCafe
WebDriver-Free End-to-End Browser Testing
Browser Automation
TestCafe is a free, open-source end-to-end testing framework that works across browsers without WebDriver, with intuitive syntax and built-in waiting.
Manticore Search
Fast open-source search database, 2.8x faster
Search Engines
Manticore Search is a fast, open-source search database that delivers 2.8x faster performance than Elasticsearch for full-text and analytical queries.
OpenSearch
Open-source search and analytics suite
Search Engines
OpenSearch is an Apache 2.0-licensed search and analytics suite that lets you ingest, search, visualize, and analyze scraped data at scale.
KeyDB
Multithreaded Redis fork with higher throughput
Caching Databases
KeyDB is a multithreaded Redis-compatible database delivering over 1M ops/sec per node for high-throughput caching and queuing.
Memcached
Simple, fast distributed memory caching system
Caching Databases
Memcached is a free, high-performance distributed memory caching system that speeds up dynamic applications by caching scraped data in RAM.
Weaviate
AI-native vector database with built-in ML models
Vector Databases
Weaviate is an AI-native vector database with built-in ML models for hybrid search, RAG, and semantic retrieval over scraped content.
Windmill
Build and Deploy Internal Tools at Scale
Workflow Orchestration
Windmill is an open-source platform for building workflows, internal tools, and data pipelines with full code flexibility and a visual flow builder.
QuestDB
Time-series database with ultra-low latency
Analytics Databases
QuestDB is an open-source time-series database delivering ultra-low latency ingestion and queries for scraping metrics and monitoring data.
Temporal
Durable Execution for Invincible Applications
Workflow Orchestration
Temporal is a durable execution platform that makes distributed applications fault-tolerant by automatically capturing state and recovering from failures.
Mastra
TypeScript AI Agent Framework with Workflows
AI Web Scraping
Mastra is an all-in-one TypeScript framework for building AI agents and workflows with RAG, memory, MCP support, evals, and an interactive playground.
Typesense
Typo-tolerant search with sub-50ms responses
Search Engines
Typesense is a fast, typo-tolerant search engine that delivers instant search-as-you-type results over your scraped and structured data.
Chroma
AI-native embedding database with simple API
Vector Databases
Chroma is an open-source embedding database with a simple API for storing and querying vectors, ideal for AI search over scraped content.
Kestra
Declarative Data Orchestration Platform
Workflow Orchestration
Kestra is an open-source orchestration platform that uses declarative YAML to build, schedule, and monitor event-driven workflows at scale.
Polars
Lightning-Fast DataFrame Library in Rust
Data Transformation
Polars is an open-source, Rust-powered DataFrame library delivering 10-100x faster data processing than pandas with lazy evaluation and parallelism.
Prev
Page 2 of 2
Page:
1
2
Next