Package | Description |
---|---|
org.apache.nutch.protocol |
Classes related to the
Protocol interface,
see also org.apache.nutch.net.protocols . |
org.apache.nutch.protocol.file |
Protocol plugin which supports retrieving local file resources.
|
org.apache.nutch.protocol.ftp |
Protocol plugin which supports retrieving documents via the ftp protocol.
|
org.apache.nutch.protocol.http |
Protocol plugin which supports retrieving documents via the http protocol.
|
org.apache.nutch.protocol.http.api |
Common API used by HTTP plugins (
http ,
httpclient ) |
org.apache.nutch.protocol.sftp |
Protocol plugin which supports retrieving documents via the sftp protocol.
|
Class and Description |
---|
Content |
Protocol
A retriever of url content.
|
ProtocolException |
ProtocolNotFound |
ProtocolOutput
Simple aggregate to pass from protocol plugins both content and protocol
status.
|
ProtocolStatusCodes |
Class and Description |
---|
Content |
Protocol
A retriever of url content.
|
ProtocolException |
ProtocolOutput
Simple aggregate to pass from protocol plugins both content and protocol
status.
|
Class and Description |
---|
Content |
Protocol
A retriever of url content.
|
ProtocolException |
ProtocolOutput
Simple aggregate to pass from protocol plugins both content and protocol
status.
|
RobotRulesParser
This class uses crawler-commons for handling the parsing of
robots.txt files. |
Class and Description |
---|
Protocol
A retriever of url content.
|
ProtocolException |
Class and Description |
---|
Protocol
A retriever of url content.
|
ProtocolException |
ProtocolOutput
Simple aggregate to pass from protocol plugins both content and protocol
status.
|
RobotRulesParser
This class uses crawler-commons for handling the parsing of
robots.txt files. |
Class and Description |
---|
Protocol
A retriever of url content.
|
ProtocolOutput
Simple aggregate to pass from protocol plugins both content and protocol
status.
|
Copyright © 2019 The Apache Software Foundation