| Modifier and Type | Method and Description |
|---|---|
| void | AbstractExtractor.handle(CrawlerAccess crawler, java.io.InputStream in, int depth, java.net.URL url) |
| void | AbstractExtractor.handleInterrupt(CrawlerAccess crawler, int interrupt, java.net.URL url) |

| Modifier and Type | Method and Description |
|---|---|
| void | FindSubjectLocator.handle(CrawlerAccess crawler, java.io.InputStream in, int depth, java.net.URL url) |
| void | FindSubjectLocator.handleInterrupt(CrawlerAccess crawler, int interrupt, java.net.URL url) |

| Modifier and Type | Class and Description |
|---|---|
| class | AbstractCrawler |
| class | LuceneCrawler |
| class | WebCrawler<br>A generic class for crawling web pages (or possibly other objects/files too). |

| Modifier and Type | Field and Description |
|---|---|
| private CrawlerAccess | AbstractCrawler.callback |
| (package private) CrawlerAccess | LuceneCrawler.crawler |

| Modifier and Type | Method and Description |
|---|---|
| CrawlerAccess | AbstractCrawler.getCallBack() |

| Modifier and Type | Method and Description |
|---|---|
| void | AbstractCrawler.setCallBack(CrawlerAccess cb)<br>Sets the callback object. |
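
The accessor pair listed above (`getCallBack()` and `setCallBack(CrawlerAccess cb)` around the private `callback` field) can be sketched roughly as follows. This is only an illustration under stated assumptions: `CrawlerAccess` here is an empty stand-in, `AbstractCrawlerSketch` and `CallbackSketch` are hypothetical names, and the `public` visibility of the accessors is assumed because this page does not show it.

```java
// Hypothetical, empty stand-in for the real CrawlerAccess interface;
// its actual methods are not listed on this page.
interface CrawlerAccess {
}

// Sketch of the field and accessor pair listed above. The real
// AbstractCrawler has further members that are not reproduced here,
// and the public visibility below is an assumption.
abstract class AbstractCrawlerSketch {
    private CrawlerAccess callback;

    public CrawlerAccess getCallBack() {
        return callback;
    }

    public void setCallBack(CrawlerAccess cb) {
        this.callback = cb;
    }
}

class CallbackSketch {
    // Returns true when the getter hands back the same object that was set.
    static boolean roundTrip() {
        CrawlerAccess access = new CrawlerAccess() { };
        AbstractCrawlerSketch crawler = new AbstractCrawlerSketch() { };
        crawler.setCallBack(access);
        return crawler.getCallBack() == access;
    }

    public static void main(String[] args) {
        System.out.println(roundTrip());
    }
}
```
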

| Constructor and Description |
|---|
| LuceneCrawler(CrawlerAccess crawler, org.apache.lucene.index.IndexWriter writer)<br>Creates new LuceneCrawler |

| Modifier and Type | Method and Description |
|---|---|
| void | DownloadDummyHandler.handle(CrawlerAccess crawler, java.io.InputStream in, int depth, java.net.URL page) |
| void | HTMLHandler.handle(CrawlerAccess crawler, java.io.InputStream in, int depth, java.net.URL page) |
| void | HTMLSaveHandler.handle(CrawlerAccess crawler, java.io.InputStream in, int depth, java.net.URL page) |
| void | HTMLSurfer.handle(CrawlerAccess crawler, java.io.InputStream in, int depth, java.net.URL page) |
| void | Handler.handle(CrawlerAccess crawler, java.io.InputStream in, int depth, java.net.URL page)<br>Processes the given page. |
| void | PDFHandler.handle(CrawlerAccess crawler, java.io.InputStream in, int depth, java.net.URL page) |
| void | RTFHandler.handle(CrawlerAccess crawler, java.io.InputStream in, int depth, java.net.URL page) |
| void | SaveHandler.handle(CrawlerAccess crawler, java.io.InputStream in, int depth, java.net.URL page) |
| void | XMLHandler.handle(CrawlerAccess crawler, java.io.InputStream in, int depth, java.net.URL page) |
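
As a rough illustration of the `handle(CrawlerAccess crawler, java.io.InputStream in, int depth, java.net.URL page)` signature shared by the handlers listed above, the sketch below implements a handler that only counts the bytes of the page it receives. It is not taken from the Wandora sources: `CrawlerAccess` and `Handler` are redeclared as minimal stand-ins so the example compiles on its own, and `ByteCountingHandler`, `HandlerSketch`, and the example URL are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.net.MalformedURLException;
import java.net.URI;
import java.net.URL;
import java.nio.charset.StandardCharsets;

// Hypothetical, empty stand-in for the real CrawlerAccess interface;
// its actual methods are not listed on this page.
interface CrawlerAccess {
}

// Stand-in that mirrors only the handle signature shown in the table.
interface Handler {
    void handle(CrawlerAccess crawler, InputStream in, int depth, URL page);
}

// Hypothetical handler: reads the stream to the end and records what it saw.
class ByteCountingHandler implements Handler {
    long lastByteCount = -1;
    int lastDepth = -1;
    URL lastPage;

    @Override
    public void handle(CrawlerAccess crawler, InputStream in, int depth, URL page) {
        long count = 0;
        byte[] buffer = new byte[4096];
        try {
            int n;
            while ((n = in.read(buffer)) != -1) {
                count += n;
            }
        } catch (IOException e) {
            // This stand-in signature declares no checked exceptions,
            // so a read failure is recorded as -1 instead of rethrown.
            count = -1;
        }
        lastByteCount = count;
        lastDepth = depth;
        lastPage = page;
    }
}

class HandlerSketch {
    // Feeds an in-memory page to the handler and returns it for inspection.
    static ByteCountingHandler run(String content, int depth) {
        try {
            ByteCountingHandler handler = new ByteCountingHandler();
            InputStream in = new ByteArrayInputStream(
                    content.getBytes(StandardCharsets.UTF_8));
            URL page = URI.create("http://example.org/page.html").toURL();
            handler.handle(new CrawlerAccess() { }, in, depth, page);
            return handler;
        } catch (MalformedURLException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        ByteCountingHandler handler = run("<html></html>", 0);
        System.out.println(handler.lastPage + " depth=" + handler.lastDepth
                + " bytes=" + handler.lastByteCount);
    }
}
```

In a real crawl, the crawler would presumably supply the stream, depth, and URL itself; the in-memory stream here only stands in for a fetched page.
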

| Modifier and Type | Method and Description |
|---|---|
| void | InterruptHandler.handleInterrupt(CrawlerAccess crawler, int interrupt, java.net.URL page) |

Copyright 2004-2015 Wandora Team