| 程序包 | 说明 |
|---|---|
| us.codecraft.webmagic.selector |
Selectors for page extraction.
|
| 限定符和类型 | 类和说明 |
|---|---|
class |
AndSelector
All selectors will be arranged as a pipeline.
|
class |
BaseElementSelector |
class |
CssSelector
CSS selector.
|
class |
JsonPathSelector
JsonPath selector.
|
class |
OrSelector
All extractors will do extracting separately,
and the results of extractors will combined as the final result. |
class |
RegexSelector
Selector in regex.
|
class |
ReplaceSelector
Replace selector.
|
class |
SmartContentSelector
Borrowed from https://code.google.com/p/cx-extractor/
|
class |
XpathSelector
XPath selector based on Xsoup.
|
| 限定符和类型 | 方法和说明 |
|---|---|
static AndSelector |
Selectors.and(Selector... selectors) |
static OrSelector |
Selectors.or(Selector... selectors) |
Selectable |
Selectable.select(Selector selector)
extract by custom selector
|
Selectable |
HtmlNode.select(Selector selector) |
Selectable |
AbstractSelectable.select(Selector selector) |
protected Selectable |
AbstractSelectable.select(Selector selector,
List<String> strings) |
String |
Html.selectDocument(Selector selector) |
List<String> |
Html.selectDocumentForList(Selector selector) |
Selectable |
Selectable.selectList(Selector selector)
extract by custom selector
|
Selectable |
HtmlNode.selectList(Selector selector) |
Selectable |
AbstractSelectable.selectList(Selector selector) |
protected Selectable |
AbstractSelectable.selectList(Selector selector,
List<String> strings) |
| 构造器和说明 |
|---|
AndSelector(Selector... selectors) |
OrSelector(Selector... selectors) |
| 构造器和说明 |
|---|
AndSelector(List<Selector> selectors) |
OrSelector(List<Selector> selectors) |
Copyright © 2016. All rights reserved.