1
0
mirror of https://github.com/crawler-commons/crawler-commons synced 2024-09-29 18:51:14 +02:00
A set of reusable Java components that implement functionality common to any web crawler
Go to file
kkrugler_lists@transpac.com bf8ba66115 Rolled in Ian's patches to pom.xml and build.xml
Rolled in Ian's EffectiveTldFinder code & test cases.

Fixed "dist" target for build.
2009-12-12 00:22:44 +00:00
doc Change name of format from "Bixo" to "Crawler-commons" 2009-12-04 04:19:21 +00:00
lib Initial commit of build system, plus some paid-level domain extraction code from Bixo. 2009-12-04 04:13:38 +00:00
src Rolled in Ian's patches to pom.xml and build.xml 2009-12-12 00:22:44 +00:00
build.properties Rolled in Ian's patches to pom.xml and build.xml 2009-12-12 00:22:44 +00:00
build.xml Rolled in Ian's patches to pom.xml and build.xml 2009-12-12 00:22:44 +00:00
pom.xml Rolled in Ian's patches to pom.xml and build.xml 2009-12-12 00:22:44 +00:00