1
0
mirror of https://github.com/crawler-commons/crawler-commons synced 2024-09-25 01:50:39 +02:00
crawler-commons/src
Sebastian Nagel 01d675fc37 Incorrect domains returned by EffectiveTldFinder (#172)
- correctly recognize exceptions to wildcard rules as domains
- do not disallow TLDs with last element not being a TLD (e.g., .ac.za)
- partially fix IDNs: punycoded IDNs are now recognized
- add unit test for uppercase / mixed case host names
2017-10-17 13:23:15 +02:00
..
main Incorrect domains returned by EffectiveTldFinder (#172) 2017-10-17 13:23:15 +02:00
test Incorrect domains returned by EffectiveTldFinder (#172) 2017-10-17 13:23:15 +02:00