public class Page extends Object
getUrl() get url of current page getHtml() get content of current page putField(String, Object) save extracted result getResultItems() get extract results to be used in PipelineaddTargetRequests(java.util.List) addTargetRequest(String) add urls to fetch Downloader,
PageProcessor| 构造器和说明 |
|---|
Page() |
| 限定符和类型 | 方法和说明 |
|---|---|
void |
addTargetRequest(Request request)
add requests to fetch
|
void |
addTargetRequest(String requestString)
add url to fetch
|
void |
addTargetRequests(List<String> requests)
add urls to fetch
|
void |
addTargetRequests(List<String> requests,
long priority)
add urls to fetch
|
Html |
getHtml()
get html content of page
|
Json |
getJson()
get json content of page
|
String |
getRawText() |
Request |
getRequest()
get request of current page
|
ResultItems |
getResultItems() |
int |
getStatusCode() |
List<Request> |
getTargetRequests() |
Selectable |
getUrl()
get url of current page
|
boolean |
isNeedCycleRetry() |
void |
putField(String key,
Object field)
store extract results
|
void |
setHtml(Html html)
已过时。
since 0.4.0
The html is parse just when first time of calling
getHtml(), so use setRawText(String) instead. |
void |
setNeedCycleRetry(boolean needCycleRetry) |
Page |
setRawText(String rawText) |
void |
setRequest(Request request) |
Page |
setSkip(boolean skip) |
void |
setStatusCode(int statusCode) |
void |
setUrl(Selectable url) |
String |
toString() |
public Page setSkip(boolean skip)
public void putField(String key, Object field)
key - keyfield - fieldpublic Html getHtml()
public Json getJson()
public void setHtml(Html html)
getHtml(), so use setRawText(String) instead.html - htmlpublic void addTargetRequests(List<String> requests)
requests - requestspublic void addTargetRequests(List<String> requests, long priority)
requests - requestspriority - prioritypublic void addTargetRequest(String requestString)
requestString - requestStringpublic void addTargetRequest(Request request)
request - requestpublic Selectable getUrl()
public void setUrl(Selectable url)
public Request getRequest()
public boolean isNeedCycleRetry()
public void setNeedCycleRetry(boolean needCycleRetry)
public void setRequest(Request request)
public ResultItems getResultItems()
public int getStatusCode()
public void setStatusCode(int statusCode)
public String getRawText()
Copyright © 2016. All rights reserved.