public class JSParseFilter extends Object implements ParseFilter, Parser
Modifier and Type | Field and Description |
---|---|
static org.slf4j.Logger |
LOG |
X_POINT_ID
X_POINT_ID
Constructor and Description |
---|
JSParseFilter() |
Modifier and Type | Method and Description |
---|---|
Parse |
filter(String url,
WebPage page,
Parse parse,
HTMLMetaTags metaTags,
DocumentFragment doc)
Scan the JavaScript looking for possible
Outlink 's |
Configuration |
getConf()
Get the
Configuration object |
Collection<WebPage.Field> |
getFields()
Gets all the fields for a given
WebPage Many datastores need to
setup the mapreduce job by specifying the fields needed. |
Parse |
getParse(String url,
WebPage page)
Set the
Configuration object |
static void |
main(String[] args)
Main method which can be run from command line with the plugin option.
|
void |
setConf(Configuration conf)
Set the
Configuration object |
public Parse filter(String url, WebPage page, Parse parse, HTMLMetaTags metaTags, DocumentFragment doc)
Outlink
'spublic Parse getParse(String url, WebPage page)
Configuration
objectpublic static void main(String[] args) throws Exception
args
- Exception
public void setConf(Configuration conf)
Configuration
objectsetConf
in interface Configurable
public Configuration getConf()
Configuration
objectgetConf
in interface Configurable
public Collection<WebPage.Field> getFields()
WebPage
Many datastores need to
setup the mapreduce job by specifying the fields needed. All extensions
that work on WebPage are able to specify what fields they need.getFields
in interface FieldPluggable
Copyright © 2015 The Apache Software Foundation