Re: [xml] UTF8ToHtml() changes

Date view Thread view Subject view Author view

From: Daniel Veillard (Daniel.Veillard@w3.org)
Date: Mon Aug 28 2000 - 05:49:07 EDT


On Sun, Aug 27, 2000 at 05:41:47PM -0700, Wayne Davison wrote:
> It should be:
>
> if (out + 1 >= outend)
>
> Also, you'll want to change the entity version back to use >= again
> (you've got my tweaked version in cvs at the moment).

  Done,

> > This may be worth a separate function if you just want to output
> > a string extrated from the internal representation. I would accept
> > it without problem.
>
> Yes, I was looking for something to use with the sax handler. I wasn't
> quite sure what to name it, but I chose htmlEncodeEntities() and put it in
> the HTMLparser.c file. My function doesn't return an error for entities
> not in the table (it returns a numeric entity). It just returns -2 for
> encoding errors. See if you like it.
>
> I also changed testHTML.c to use it to output entities rather than raw
> UTF-8 (which you might want to make into an option).
>
> Patch attached (only for the new function, not for the off-by-one
> changes you may wish to make).

  Applied, I also corrected a bug in ingnorable white space detection
when using only SAX HTML parsing,

  thanks,

Daniel

-- 
Daniel.Veillard@w3.org | W3C, INRIA Rhone-Alpes  | Today's Bookmarks :
Tel : +33 476 615 257  | 655, avenue de l'Europe | Linux XML libxml WWW
Fax : +33 476 615 207  | 38330 Montbonnot FRANCE | Gnome rpm2html rpmfind
 http://www.w3.org/People/all#veillard%40w3.org  | RPM badminton Kaffe
----
Message from the list xml@xmlsoft.org
Archived at : http://xmlsoft.org/messages/
to unsubscribe: echo "unsubscribe xml" | mail  majordomo@xmlsoft.org


Date view Thread view Subject view Author view

This archive was generated by hypermail 2b29 : Mon Aug 28 2000 - 03:43:23 EDT