use WebWatcher::Node;
$node=WebWatcher::Node->new('http://www.ukoln.ac.uk'/);
...
Perl5.004, URI::URL, WebWatcher::Type, HTTP::Request, MD5, NDBM_FILE
This class will manage URL nodes.
- $node->new([URL], [REFERRING URL], [ROOTNODE])
-
Creates a new Node object.
- $node->type([TYPE])
-
Allows the definitition or setting of a type for the Node. Best used with
WebWatcher::Type.
- $node->getnode()
-
Performs a HTTP GET on the URL in the Node and returns a list of the form:
($body, $age, $freshness, $headers). Further
getnode()
requests on the same Node will not result in further GETs - if this is
required, make a new Node. The returned information, along with some other
variables are suitable for instantiating a WebWatch::Resource object.
- $node->to_block(TYPE)
-
Will insert a TYPE into a blocklist. See WebWatcher::Type.
- $node->is_blocked(TYPE)
-
Find out if a TYPE is blocked. Return 1 if it is, else 0.
- $node->md5()
-
Returns an MD5 signature of the URL in the node.
- $node->max_expand([0-9]*)
-
Sets (if argument supplied) and returns the maximum expansion limit for
this node.
- $node->community([COMMUNITY])
-
Sets (if argument supplied) and returns the community that the node belongs
to
- $node->touched(URL, [URL])
-
Deals in URLs (not nodes). Returns 1 if the URL has been visited or 0
otherwise. When the second argument exists, the first URL is ignored and
the second URL 'touched'.
- $node->do_redirect()
-
Returns a reference to a new node that is the redirect object
- $node->get_stat()
-
Returns the last status after a
getnode()
- $node->get_info()
-
Returns the last info after a
getnode()
- fix_url([URL])
-
Return or re-specify the URL of the node.
- fix_ua([URL])
-
Return or re-specify the User-Agent of the node.
- fix_user_email([EMAIL_ADDRESS])
-
Return or re-specify the email address associated with the user agent.
- fix_proxy([URL])
-
Return or re-specify the proxy used, in the form: http://myproxy.com:3128.
- fix_root([URL])
-
Return or re-specify the RootNode associated with this Node.
- fix_refer([URL])
-
Return or re-specify the URL that referred to URL in this Node.
I.Peacock, i.peacock@ukoln.ac.uk