public interface Parser extends FieldPluggable, Configurable
Protocol
implementation. This interface is
implemented by extensions. Nutch's core contains no page parsing code.Modifier and Type | Field and Description |
---|---|
static String |
X_POINT_ID
The name of the extension point.
|
Modifier and Type | Method and Description |
---|---|
Parse |
getParse(String url,
WebPage page)
This method parses content in WebPage instance
|
getFields
getConf, setConf
Copyright © 2015 The Apache Software Foundation