
Frontera is a scalable, modular crawl frontier framework that manages crawl scheduling, prioritization, and goal tracking for web crawlers of any size.
Key Features:
Whether you're orchestrating large-scale crawls, managing URL frontiers for distributed scrapers, or building custom crawling infrastructure, Frontera provides the scheduling framework to crawl smarter at any scale.
Stars
1.3KForks
217Last commit
10 months agoLicense
BSD-3-ClauseLanguage
Python