Anyone know of a tasteful LGPL HTML parser in C?

Joe Orton jorton at redhat.com
Thu Nov 25 12:55:00 UTC 2004


On Thu, Nov 25, 2004 at 09:13:45AM +0800, Yusuf Goolamabbas wrote:
> > I'd like to attempt to support
> >    rpm -qp http://download.fedora.redhat.com/.../*.rpm
> > within rpm by applying fnmatch(3) against parsed HTML hrefs.
> > 
> > So I'm seeking existing HTML parser implementations before hacking up
> > something myself.
> > 
> > The constraints on my rpm problem/implementation space are:
> >   a) must be LGPL
> >   b) must be in C.
> >   c) must be reasonably small and reliable.
> >   d) should work on a significant variety of HTML dialects without problem.
> 
> How about El-Kabong from Jon Travis? I think Covalent wanted to donate
> this to the ASF. There was a thread on new-httpd about this a while ago
> (can't find it now), but check with Joe Orton.

That didn't happen; see http://www.apacheweek.com/issues/02-09-13#dev.  The
code looked good though, and I believe Jon removed the dependency on APR
too so it should be small and self-contained.
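
For reference, the matching step described in the quoted request (applying
fnmatch(3) to hrefs pulled out of an index page) could look roughly like the
sketch below. It is not rpm code and not El-Kabong's API: match_hrefs() is a
hypothetical helper, and the href extraction is a naive scan for quoted
href= values rather than a real HTML parser, so it only illustrates where
fnmatch() would sit once a parser hands back the attribute values.

```c
#include <fnmatch.h>
#include <stdio.h>
#include <string.h>
#include <strings.h>

/*
 * Hypothetical helper (an assumption for illustration, not part of rpm):
 * scan `html` for href="..." or href='...' values and count those that
 * match the fnmatch(3) glob `pattern`. If `print` is nonzero, each
 * matching href is written to stdout. Unquoted attribute values and
 * hrefs longer than the local buffer are skipped.
 */
static int match_hrefs(const char *html, const char *pattern, int print)
{
    int count = 0;

    for (const char *p = html; *p != '\0'; p++) {
        /* Attribute names are case-insensitive in HTML. */
        if (strncasecmp(p, "href=", 5) != 0)
            continue;

        const char *q = p + 5;
        char quote = *q;
        if (quote != '"' && quote != '\'')
            continue;               /* unquoted value: ignored in this sketch */
        q++;

        const char *end = strchr(q, quote);
        if (end == NULL)
            break;                  /* unterminated attribute: stop scanning */

        size_t len = (size_t)(end - q);
        char href[1024];
        if (len < sizeof(href)) {
            memcpy(href, q, len);
            href[len] = '\0';
            /* flags == 0: '*' may also match '/' characters. */
            if (fnmatch(pattern, href, 0) == 0) {
                count++;
                if (print)
                    puts(href);
            }
        }
        p = end;                    /* resume after the closing quote */
    }
    return count;
}
```

Calling match_hrefs(page, "*.rpm", 1) on a fetched directory listing would
print each href ending in .rpm. A real implementation would still need a
proper parser to cope with the variety of HTML dialects mentioned in the
constraints.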

joe



