[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

Re: Web Page Watcher



Tim:
>> Have you considered just comparing HTTP headers?

On Mon, 2006-10-09 at 11:59 -0700, Paul Lemmons wrote:
> Ok, I have considered it now and it would be fairly easy to accomplish. 
> I am not sure it would be valuable though. There is a significant amount 
> of data in the headers that is different every time the page is called. 
> Are there particular fields that are only updated when the content of 
> the page has changed? Or were you looking for something else completely?

There's last-modified, expires, and etag headers (basically a checksum)
from the webserver, that spring to mind.  Any one of them would let you
know that the page had changed, with less data to need parsing, and less
of a load on the webserver.  Have a quick look at the Apache manual, or
a website about caching for clues.

-- 
(Currently running FC4, but testing FC5, if that's important.)

Don't send private replies to my address, the mailbox is ignored.
I read messages from the public lists.



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]