mirror of
https://github.com/crawler-commons/crawler-commons
synced 2024-09-23 17:33:23 +02:00
b5704684ff
Generally better to call parseSiteMap w/o passing an explicit contentType, as web servers lie all the time - so let Tika figure it out. |
||
---|---|---|
.. | ||
main | ||
test |