webscraping.app
Browse
Pricing
About Us
Submit
Sign In
Latest tools
Categories
Tags
Pricing
Submit
About Us
/
Categories
/
Web Scraping
/
HTML Parsers
HTML Parsers
A curated collection of the best libraries and tools for parsing HTML and extracting structured data from web page markup.
Popular Categories:
Browser Automation
14
Scraping Frameworks
11
Analytics Databases
9
SERP APIs
9
ETL Tools
9
Workflow Orchestration
9
AI Web Scraping
8
Scraping APIs
8
Distributed Crawling
6
Cloud Compute
6
Proxy Services
6
Search Engines
6
Order by
BeautifulSoup
Python library for parsing HTML and XML
HTML Parsers
Beautiful Soup is a Python library for pulling data out of HTML and XML files, providing Pythonic idioms for navigating and searching parse trees.
parse5
HTML5 spec-compliant parser for Node.js
HTML Parsers
parse5 is the fastest spec-compliant HTML5 parser for Node.js, parsing HTML exactly as modern browsers do.
htmlparser2
The fast and forgiving HTML/XML parser
HTML Parsers
htmlparser2 is the fastest Node.js HTML and XML parser, offering a streaming, event-driven API with a forgiving approach to malformed markup.
Goquery
jQuery-like HTML parsing for Go
HTML Parsers
GoQuery brings jQuery-like syntax to Go for parsing and manipulating HTML documents, making web scraping in Go intuitive and efficient.
Cheerio
Fast, flexible HTML parsing and manipulation for JS
HTML Parsers
Cheerio is the industry-standard JavaScript library for parsing and manipulating HTML and XML using a familiar jQuery-like syntax.