webscraping.app
Data Transformation
A curated collection of the best data transformation and cleaning tools for normalizing, deduplicating, and enriching scraped datasets.
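As a rough illustration of what normalizing, deduplicating, and enriching a scraped dataset can look like, here is a minimal standard-library sketch. The record fields (`title`, `url`, `price`), the helper names, and the sample data are all hypothetical and not taken from any tool listed here. It also assumes that lowercasing the whole URL is acceptable and that prices use a dot as the decimal separator.

```python
import re
from urllib.parse import urlsplit


def normalize(record: dict) -> dict:
    """Collapse whitespace in the title, canonicalize the URL, parse the price."""
    title = re.sub(r"\s+", " ", record.get("title", "")).strip()
    # Simplifying assumption: lowercase the whole URL and drop a trailing slash.
    url = record.get("url", "").strip().lower().rstrip("/")
    # Assumes "." is the decimal separator; "," is treated as a thousands mark.
    match = re.search(r"\d+(?:\.\d+)?", str(record.get("price", "")).replace(",", ""))
    price = float(match.group()) if match else None
    return {"title": title, "url": url, "price": price}


def deduplicate(records, key="url"):
    """Keep the first record seen for each value of `key`."""
    seen = set()
    unique = []
    for rec in records:
        if rec[key] not in seen:
            seen.add(rec[key])
            unique.append(rec)
    return unique


def enrich(record: dict) -> dict:
    """Add a derived `domain` field taken from the URL."""
    return {**record, "domain": urlsplit(record["url"]).netloc}


# Hypothetical scraped rows: the first two describe the same page.
raw = [
    {"title": "  Blue   Widget ", "url": "https://example.com/widget/", "price": "$19.99"},
    {"title": "Blue Widget", "url": "HTTPS://EXAMPLE.COM/widget", "price": "19.99"},
    {"title": "Red Widget", "url": "https://example.com/red", "price": ""},
]

clean = [enrich(r) for r in deduplicate(normalize(r) for r in raw)]
```

Normalizing before deduplicating matters here: the two "Blue Widget" rows only collide once their URLs have been canonicalized. The tools listed below cover the same steps at larger scale, with schema handling, validation, and testing built in.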
dlt (data load tool)
Lightweight Python library for data loading. Auto schema inference, 5000+ sources supported.
Data Transformation
ETL Tools
dbt
Transform data in your warehouse using SQL. Version control, testing, documentation for data models.
Data Transformation
ETL Tools
Great Expectations
Data quality testing with 'Expectations'. Validate scraped data, auto-generate docs, CI/CD integration.
Data Transformation
Pydantic
Data validation using Python type hints. Rust-powered core for speed. Define schemas for scraped data.
Data Transformation
Polars
Fast DataFrame library written in Rust. Advertised as 10-100x faster than pandas on some workloads. Lazy evaluation, out-of-core processing.
Data Transformation