Modifier and Type | Method and Description |
---|---|
void |
AbstractExtractor.handle(CrawlerAccess crawler,
java.io.InputStream in,
int depth,
java.net.URL url) |
void |
AbstractExtractor.handleInterrupt(CrawlerAccess crawler,
int interrupt,
java.net.URL url) |
Modifier and Type | Method and Description |
---|---|
void |
FindSubjectLocator.handle(CrawlerAccess crawler,
java.io.InputStream in,
int depth,
java.net.URL url) |
void |
FindSubjectLocator.handleInterrupt(CrawlerAccess crawler,
int interrupt,
java.net.URL url) |
Modifier and Type | Class and Description |
---|---|
class |
AbstractCrawler |
class |
LuceneCrawler |
class |
WebCrawler
A generic class for crawling web pages (or possibly other objects/files too).
|
Modifier and Type | Field and Description |
---|---|
private CrawlerAccess |
AbstractCrawler.callback |
(package private) CrawlerAccess |
LuceneCrawler.crawler |
Modifier and Type | Method and Description |
---|---|
CrawlerAccess |
AbstractCrawler.getCallBack() |
Modifier and Type | Method and Description |
---|---|
void |
AbstractCrawler.setCallBack(CrawlerAccess cb)
Sets the callback object.
|
Constructor and Description |
---|
LuceneCrawler(CrawlerAccess crawler,
org.apache.lucene.index.IndexWriter writer)
Creates new LuceneCrawler
|
Modifier and Type | Method and Description |
---|---|
void |
DownloadDummyHandler.handle(CrawlerAccess crawler,
java.io.InputStream in,
int depth,
java.net.URL page) |
void |
HTMLHandler.handle(CrawlerAccess crawler,
java.io.InputStream in,
int depth,
java.net.URL page) |
void |
HTMLSaveHandler.handle(CrawlerAccess crawler,
java.io.InputStream in,
int depth,
java.net.URL page) |
void |
HTMLSurfer.handle(CrawlerAccess crawler,
java.io.InputStream in,
int depth,
java.net.URL page) |
void |
Handler.handle(CrawlerAccess crawler,
java.io.InputStream in,
int depth,
java.net.URL page)
Processes the given page.
|
void |
PDFHandler.handle(CrawlerAccess crawler,
java.io.InputStream in,
int depth,
java.net.URL page) |
void |
RTFHandler.handle(CrawlerAccess crawler,
java.io.InputStream in,
int depth,
java.net.URL page) |
void |
SaveHandler.handle(CrawlerAccess crawler,
java.io.InputStream in,
int depth,
java.net.URL page) |
void |
XMLHandler.handle(CrawlerAccess crawler,
java.io.InputStream in,
int depth,
java.net.URL page) |
Modifier and Type | Method and Description |
---|---|
void |
InterruptHandler.handleInterrupt(CrawlerAccess crawler,
int interrupt,
java.net.URL page) |
Copyright 2004-2015 Wandora Team