This plugin adds the http collector type. It can get and parse web pages using the GET method.
Unpack RRFW main distribution and the plugin in some directories. From RRFW distribution directory, run
./configure --with-plugins=../rrfw-plugin-http-0.0.1 make install
After RRFW installation has finished, add the following lines to your rrfw-siteconfig.pl file:
push( @RRFW::Collector::loadModules, 'RRFW::Collector::Http' ); push( @RRFW::Validator::loadLeafValidators, 'RRFW::Collector::Http' );
That is all, now you can use the parameters described below in your XML configuration.
collector-type
http
is supported by this plugin.
http-url
collector-type=http
.
http-timeout
collector-type=http
.
http-object
collector-type=http
. Specifies an object to look for in the
web page.The object is an RPN expression containing search-items.
A search-item takes the following form:
lineNumber:/pattern/
LineNumber my be a number or *, * means treat the whole web page as one single line. The pattern is a regular expression and must contain a (). The value between () will be replaced in the RPN before it is evaluated.
Copyright (c) 2003-2004 Christian Schnidrig <christian.schnidrig@bluewin.ch>