OT: wget does not use downloaded stylesheet

Yong Huang yong321 at yahoo.com
Mon Nov 10 21:51:44 UTC 2008


--- On Mon, 11/10/08, Dag Wieers <dag at wieers.com> wrote:

> From: Dag Wieers <dag at wieers.com>
> Subject: Re: OT: wget does not use downloaded stylesheet
> To: yong321 at yahoo.com, "General Red Hat Linux discussion list" <redhat-list at redhat.com>
> Cc: chaim.rieger at gmail.com
> Date: Monday, November 10, 2008, 3:01 PM
> On Mon, 10 Nov 2008, Yong Huang wrote:
> 
> > Thanks, Chaim. But that doesn't work either. I
> created a minimalist test case:
> >
> > http://yong321.freeshell.org/temp/test.html
> >
> > I ran wget -mr
> http://yong321.freeshell.org/temp/test.html and check the
> downloaded test.html file. It still has
> >
> > <link rel="stylesheet"
> type="text/css"
> href="http://yong321.freeshell.org/temp/stylesheettest.css">
> >
> > I don't think wget could be smart enough to modify
> the URL inside the html file.
> 
> You thought wrong. The wget man-page explains it:
> 
>         -k
>         --convert-links
>             After the download is complete, convert the
> links in the document to
>             make them suitable for local viewing.  This
> affects not only the visi-
>             ble hyperlinks, but any part of the document
> that links to external
>             content, such as embedded images, links to
> style sheets, hyperlinks to
>             non-HTML content, etc.
> 
>             Each link will be changed in one of the two
> ways:
> 
>             *   The links to files that have been
> downloaded by Wget will be
>                 changed to refer to the file they point to
> as a relative link.
> 
>                 Example: if the downloaded file
> /foo/doc.html links to
>                 /bar/img.gif, also downloaded, then the
> link in doc.html will be
>                 modified to point to ../bar/img.gif.  This
> kind of transformation
>                 works reliably for arbitrary combinations
> of directories.
> 
>             *   The links to files that have not been
> downloaded by Wget will be
>                 changed to include host name and absolute
> path of the location
>                 they point to.
> 
>                 Example: if the downloaded file
> /foo/doc.html links to
>                 /bar/img.gif (or to ../bar/img.gif), then
> the link in doc.html
>                 will be modified to point to
> http://hostname/bar/img.gif.
> 
>             Because of this, local browsing works reliably:
> if a linked file was
>             downloaded, the link will refer to its local
> name; if it was not down-
>             loaded, the link will refer to its full
> Internet address rather than
>             presenting a broken link.  The fact that the
> former links are con-
>             verted to relative links ensures that you can
> move the downloaded
>             hierarchy to another directory.
> 
>             Note that only at the end of the download can
> Wget know which links
>             have been downloaded.  Because of that, the
> work done by -k will be
>             performed at the end of all the downloads.
> 
> There are some other options that might interest you.
> 
> Kind regards,
> -- 
> --   dag wieers,  dag at wieers.com,  http://dag.wieers.com/  

Thanks Dag. -k is exactly what I need.

Yong


      




More information about the redhat-list mailing list