mirror of
https://github.com/crawler-commons/crawler-commons
synced 2024-09-25 01:50:39 +02:00
01d675fc37
- correctly recognize exceptions to wildcard rules as domains - do not disallow TLDs with last element not being a TLD (e.g., .ac.za) - partially fix IDNs: punycoded IDNs are now recognized - add unit test for uppercase / mixed case host names |
||
---|---|---|
.. | ||
main | ||
test |