Package | Description |
---|---|
crawlercommons.sitemaps |
The sitemaps package provides all classes relevant to focused sitemap parsing,
URL definition, and processing.
|
crawlercommons.sitemaps.sax |
Modifier and Type | Method and Description |
---|---|
static void |
SiteMapTester.main(String[] args) |
AbstractSiteMap |
SiteMapParser.parseSiteMap(byte[] content,
URL url)
Parse a sitemap, given the content bytes and the URL.
|
AbstractSiteMap |
SiteMapParserSAX.parseSiteMap(byte[] content,
URL url)
Parse a sitemap, given the content bytes and the URL.
|
AbstractSiteMap |
SiteMapParser.parseSiteMap(String contentType,
byte[] content,
AbstractSiteMap sitemap)
Returns a processed copy of an unprocessed sitemap object.
|
AbstractSiteMap |
SiteMapParserSAX.parseSiteMap(String contentType,
byte[] content,
AbstractSiteMap sitemap)
Returns a processed copy of an unprocessed sitemap object.
|
AbstractSiteMap |
SiteMapParser.parseSiteMap(String contentType,
byte[] content,
URL url)
Parse a sitemap, given the MIME type, the content bytes, and the URL.
|
AbstractSiteMap |
SiteMapParserSAX.parseSiteMap(String contentType,
byte[] content,
URL url)
Parse a sitemap, given the MIME type, the content bytes, and the URL.
|
AbstractSiteMap |
SiteMapParser.parseSiteMap(URL onlineSitemapUrl)
Returns a SiteMap or SiteMapIndex given an online sitemap URL.
Note that this method goes online, fetches the sitemap, and then parses it.
It is a convenience method for a user who has a sitemap URL and
wants a "keep it simple" way to parse it.
|
AbstractSiteMap |
SiteMapParserSAX.parseSiteMap(URL onlineSitemapUrl)
Returns a SiteMap or SiteMapIndex given an online sitemap URL.
Note that this method goes online, fetches the sitemap, and then parses it.
It is a convenience method for a user who has a sitemap URL and
wants a "keep it simple" way to parse it.
|
protected SiteMap |
SiteMapParser.parseSyndicationFormat(URL sitemapUrl,
Document doc)
Parse the XML document, looking for a feed element to determine if
it's an Atom doc, or an rss element to determine if it's an RSS
doc.
|
protected AbstractSiteMap |
SiteMapParser.processGzippedXML(URL url,
byte[] response)
Decompress the gzipped content and process the resulting XML Sitemap.
|
protected AbstractSiteMap |
SiteMapParserSAX.processGzippedXML(URL url,
byte[] response)
Decompress the gzipped content and process the resulting XML Sitemap.
|
protected AbstractSiteMap |
SiteMapParser.processXml(URL sitemapUrl,
byte[] xmlContent)
Parse the given XML content.
|
protected AbstractSiteMap |
SiteMapParserSAX.processXml(URL sitemapUrl,
byte[] xmlContent)
Parse the given XML content.
|
protected AbstractSiteMap |
SiteMapParser.processXml(URL sitemapUrl,
InputSource is)
Parse the given XML content.
|
protected AbstractSiteMap |
SiteMapParserSAX.processXml(URL sitemapUrl,
InputSource is)
Parse the given XML content.
|
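The gzip and XML processing steps listed above can be illustrated with a minimal sketch that uses only the Java standard library. It is not the crawler-commons implementation: the class `GzipSitemapSketch` and its methods `gunzip`, `gzip`, and `extractLocs` are hypothetical names introduced here, and the sketch assumes a plain XML sitemap whose URLs appear in `loc` elements.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper for illustration; not part of crawler-commons.
final class GzipSitemapSketch {

    /** Decompress gzipped bytes into the raw XML bytes (requires Java 9+). */
    static byte[] gunzip(byte[] gzipped) throws IOException {
        try (GZIPInputStream in = new GZIPInputStream(new ByteArrayInputStream(gzipped))) {
            return in.readAllBytes();
        }
    }

    /** Compress bytes with gzip; only used here to build demo input. */
    static byte[] gzip(byte[] raw) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (GZIPOutputStream out = new GZIPOutputStream(buffer)) {
            out.write(raw);
        }
        return buffer.toByteArray();
    }

    /** Collect the trimmed text of every loc element with a SAX handler. */
    static List<String> extractLocs(InputSource source)
            throws IOException, SAXException, ParserConfigurationException {
        List<String> locs = new ArrayList<>();
        DefaultHandler handler = new DefaultHandler() {
            private StringBuilder text; // non-null only while inside a loc element

            @Override
            public void startElement(String uri, String localName, String qName, Attributes attrs) {
                if ("loc".equals(qName)) {
                    text = new StringBuilder();
                }
            }

            @Override
            public void characters(char[] ch, int start, int length) {
                if (text != null) {
                    text.append(ch, start, length);
                }
            }

            @Override
            public void endElement(String uri, String localName, String qName) {
                if ("loc".equals(qName) && text != null) {
                    locs.add(text.toString().trim());
                    text = null;
                }
            }
        };
        // The default factory is not namespace-aware, so qName is the plain tag name.
        SAXParserFactory.newInstance().newSAXParser().parse(source, handler);
        return locs;
    }

    public static void main(String[] args) throws Exception {
        byte[] xml = ("<urlset xmlns=\"http://www.sitemaps.org/schemas/sitemap/0.9\">"
                + "<url><loc>http://example.com/a</loc></url></urlset>")
                .getBytes(StandardCharsets.UTF_8);
        byte[] unpacked = gunzip(gzip(xml));
        System.out.println(extractLocs(new InputSource(new ByteArrayInputStream(unpacked))));
    }
}
```

The sketch takes an `InputSource` so that the same parsing code can serve both byte-array input and an already opened stream, mirroring the two `processXml` overloads in the table.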
Modifier and Type | Method and Description |
---|---|
protected UnknownFormatException |
DelegatorHandler.getException() |
Modifier and Type | Method and Description |
---|---|
protected void |
DelegatorHandler.setException(UnknownFormatException exception) |
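The getter and setter pair above suggests a pattern in which a SAX handler records a format problem during parsing and the caller inspects it afterwards. The following is a minimal sketch of that pattern under stated assumptions, not the actual `DelegatorHandler` code: `SketchFormatException` is a hypothetical stand-in for `UnknownFormatException`, `RootCheckingHandler` and `HandlerExceptionSketch` are invented names, and the set of accepted root elements is an assumption made for illustration.

```java
import java.io.ByteArrayInputStream;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical stand-in for UnknownFormatException.
class SketchFormatException extends Exception {
    SketchFormatException(String message) {
        super(message);
    }
}

// Hypothetical handler that stores a format error instead of throwing it.
class RootCheckingHandler extends DefaultHandler {
    private SketchFormatException exception;
    private boolean rootSeen;

    protected SketchFormatException getException() {
        return exception;
    }

    protected void setException(SketchFormatException exception) {
        this.exception = exception;
    }

    @Override
    public void startElement(String uri, String localName, String qName, Attributes attrs) {
        if (rootSeen) {
            return; // only the first (root) element decides the format
        }
        rootSeen = true;
        switch (qName) {
            case "urlset":
            case "sitemapindex":
            case "feed":
            case "rss":
                break; // assumed set of known root elements
            default:
                setException(new SketchFormatException("Unknown root element: " + qName));
        }
    }
}

final class HandlerExceptionSketch {
    /** Parse the XML and rethrow any format error the handler recorded. */
    static void check(byte[] xml) throws Exception {
        RootCheckingHandler handler = new RootCheckingHandler();
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new ByteArrayInputStream(xml), handler);
        if (handler.getException() != null) {
            throw handler.getException();
        }
    }
}
```

Recording the error in the handler keeps the SAX callbacks free of checked exceptions other than `SAXException`, while the caller can still surface a format-specific exception once parsing has finished.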
Copyright © 2009–2017 Crawler-Commons. All rights reserved.