webscraping.app
Browse
Pricing
About Us
Submit
Sign In
Latest tools
Categories
Tags
Pricing
Submit
About Us
/
Categories
/
Data Processing
/
Data Transformation
Data Transformation
A curated collection of the best data transformation and cleaning tools for normalizing, deduplicating, and enriching scraped datasets.
Popular Categories:
Browser Automation
14
Scraping Frameworks
11
Analytics Databases
9
SERP APIs
9
ETL Tools
9
Workflow Orchestration
9
AI Web Scraping
8
Scraping APIs
8
Distributed Crawling
6
Cloud Compute
6
Proxy Services
6
Search Engines
6
Order by
dlt (data load tool)
Python Library for Data Loading
Data Transformation
ETL Tools
dlt is a lightweight, open-source Python library for loading data from any source into well-structured datasets with automatic schema inference.
dbt
Transform Data in Your Warehouse with SQL
Data Transformation
ETL Tools
dbt empowers data teams to transform, test, and document data directly in the warehouse using SQL, with version control and governance built in.
Great Expectations
Data Quality Testing and Validation Platform
Data Transformation
Great Expectations helps data teams validate, document, and monitor data quality across pipelines with automated testing and observability.
Pydantic
Data Validation Using Python Type Hints
Data Transformation
Pydantic is the most widely used Python data validation library, offering fast schema enforcement with type hints and a Rust-powered core engine.
Polars
Lightning-Fast DataFrame Library in Rust
Data Transformation
Polars is an open-source, Rust-powered DataFrame library delivering 10-100x faster data processing than pandas with lazy evaluation and parallelism.