Web Crawlers
2.Crawljax Core8 usages
com.crawljax » crawljax-core Apache
Crawljax Core
Last Release on Jun 1, 2023
Norconex HTTP Collector is a web spider, or crawler that aims to be very flexible, easy to extend, and portable
Last Release on May 25, 2025
4.WebMagic Parent
us.codecraft » webmagic-parent Apache
A crawler framework. It covers the whole lifecycle of crawler: downloading, url management, content
extraction and persistent.
Last Release on Apr 23, 2024
5.Crawler4j17 usages
edu.uci.ics » crawler4j Apache
Open Source Web Crawler for Java
Last Release on Mar 26, 2018
crawler-commons is a set of reusable Java components that implement
functionality common to any web crawler.
Last Release on Oct 10, 2014
8.Gecco4 usages
com.geccocrawler » gecco MIT
Easy to use lightweight web crawler
Last Release on Jul 4, 2020
10.Crawler1 usages
com.soulgalore » crawler Apache
Simple java (1.6) crawler to crawl web pages on one and same domain.
Last Release on Feb 8, 2014
