SYNOPSIS

        use WebWatcher::Node;

        $node = WebWatcher::Node->new('http://www.ukoln.ac.uk/');
        ...


REQUIRES

        Perl 5.004, URI::URL, WebWatcher::Type, HTTP::Request, MD5, NDBM_File


DESCRIPTION

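        This class manages URL nodes. A Node holds a URL together with its
        referring URL, RootNode and type, can retrieve the URL over HTTP, and
        keeps track of which URLs have been visited and which types are blocked.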


METHODS

WebWatcher::Node->new([URL], [REFERRING URL], [ROOTNODE])
Creates a new Node object for URL, optionally recording the referring URL and the RootNode.

$node->type([TYPE])
Defines or sets the type of the Node. Best used with WebWatcher::Type.

$node->getnode()
Performs an HTTP GET on the URL in the Node and returns a list of the form: ($body, $age, $freshness, $headers). Further getnode() requests on the same Node will not result in further GETs; if a fresh GET is required, make a new Node. The returned information, along with some other variables, is suitable for instantiating a WebWatch::Resource object.
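
The "one GET per Node" behaviour described above can be pictured with a small stand-alone sketch. It is not the WebWatcher::Node implementation: the package name Sketch::CachedNode, the injected fetcher and the placeholder return values are all assumptions made for illustration, and no real HTTP request is made.

```perl
use strict;
use warnings;

package Sketch::CachedNode;   # hypothetical stand-in, not WebWatcher::Node

# $fetch is a code reference standing in for the real HTTP GET.
sub new {
    my ($class, $url, $fetch) = @_;
    return bless { url => $url, fetch => $fetch, result => undef }, $class;
}

sub getnode {
    my $self = shift;
    # Call the fetcher only on the first request; reuse the stored list afterwards.
    $self->{result} ||= [ $self->{fetch}->($self->{url}) ];
    return @{ $self->{result} };
}

package main;

my $gets = 0;
# Fake fetcher returning a list shaped like ($body, $age, $freshness, $headers).
my $fake_get = sub { $gets++; return ('<html></html>', 0, 0, {}) };

my $cached = Sketch::CachedNode->new('http://www.ukoln.ac.uk/', $fake_get);
my ($body) = $cached->getnode();
$cached->getnode();                 # second call is served from the stored result
print "GETs performed: $gets\n";
```

In this sketch, forcing a second retrieval means constructing a second object, which mirrors the advice to make a new Node.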

$node->to_block(TYPE)
Inserts TYPE into the blocklist. See WebWatcher::Type.

$node->is_blocked(TYPE)
Reports whether TYPE is blocked. Returns 1 if it is, 0 otherwise.

$node->md5()
Returns an MD5 signature of the URL in the node.
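
To show roughly what an MD5 signature of a URL looks like, the sketch below hashes a URL string with the Digest::MD5 module. This is an assumption for illustration only: the REQUIRES list names the older MD5 module, and this document does not say whether md5() returns a hex, binary or base64 digest. The helper name url_signature is hypothetical.

```perl
use strict;
use warnings;
use Digest::MD5 qw(md5_hex);

# Hypothetical helper: 32-character hex MD5 digest of a URL string.
sub url_signature {
    my ($url) = @_;
    return md5_hex($url);
}

print url_signature('http://www.ukoln.ac.uk/'), "\n";
```

A fixed-length digest like this is convenient as a lookup key, since URLs themselves can be arbitrarily long.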

$node->max_expand([0-9]*)
Sets (if argument supplied) and returns the maximum expansion limit for this node.

$node->community([COMMUNITY])
Sets (if argument supplied) and returns the community that the node belongs to.
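
The "sets (if argument supplied) and returns" convention used by max_expand() and community() is a common Perl accessor idiom. A minimal sketch follows, assuming a hash-based object; the package Sketch::Accessor and its internal layout are hypothetical and say nothing about how WebWatcher::Node stores its data.

```perl
use strict;
use warnings;

package Sketch::Accessor;   # hypothetical, for illustration only

sub new {
    my ($class) = @_;
    return bless { max_expand => undef, community => undef }, $class;
}

# Combined getter/setter: assign only when an argument was passed.
sub max_expand {
    my $self = shift;
    $self->{max_expand} = shift if @_;
    return $self->{max_expand};
}

sub community {
    my $self = shift;
    $self->{community} = shift if @_;
    return $self->{community};
}

package main;

my $obj = Sketch::Accessor->new;
$obj->max_expand(5);                  # set
print $obj->max_expand, "\n";         # get, value unchanged
print $obj->community('UK-HE'), "\n"; # set and get in one call
```

The community value 'UK-HE' is an invented example, not a value defined by the module.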

$node->touched(URL, [URL])
Deals in URLs (not nodes). Returns 1 if the URL has been visited, 0 otherwise. When the second argument is given, the first URL is ignored and the second URL is 'touched' (marked as visited).
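
The calling convention of touched() can be sketched as a stand-alone function. This is a guess at the behaviour, not the module's code: an in-memory hash stands in for whatever store the class really uses, and the return value of the two-argument form is an assumption, since it is not documented here.

```perl
use strict;
use warnings;

my %visited;   # in-memory stand-in for the real store

# Hypothetical stand-alone version of the touched() calling convention.
sub touched {
    my ($url, $mark) = @_;
    if (defined $mark) {
        # Two-argument form: ignore $url and record $mark as visited.
        $visited{$mark} = 1;
        return 1;   # assumed return value; not stated in the documentation
    }
    return exists $visited{$url} ? 1 : 0;
}

print touched('http://www.ukoln.ac.uk/'), "\n";   # query before marking
touched(undef, 'http://www.ukoln.ac.uk/');        # mark as visited
print touched('http://www.ukoln.ac.uk/'), "\n";   # query after marking
```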

$node->do_redirect()
Returns a reference to a new Node that represents the redirect target.

$node->get_stat()
Returns the last status after a getnode().

$node->get_info()
Returns the last info after a getnode().

$node->fix_url([URL])
Returns or re-specifies the URL of the node.

$node->fix_ua([URL])
Returns or re-specifies the User-Agent of the node.

$node->fix_user_email([EMAIL_ADDRESS])
Returns or re-specifies the email address associated with the user agent.

$node->fix_proxy([URL])
Returns or re-specifies the proxy used, in the form: http://myproxy.com:3128.

$node->fix_root([URL])
Returns or re-specifies the RootNode associated with this Node.

$node->fix_refer([URL])
Returns or re-specifies the URL that referred to the URL in this Node.


AUTHOR

I.Peacock, i.peacock@ukoln.ac.uk