From: Daniel Veillard (Daniel.Veillard@w3.org)
Date: Tue Oct 31 2000 - 07:58:26 EST
On Mon, Oct 30, 2000 at 04:52:44PM -0500, Marc Sanfacon wrote:
> Hi there,
> when parsing a html file from the net, libxml entered an infinite
> loop. After tracking down the problem, it seemed that it didn't like the
> 0x0B caracter in a script definition. I managed to reproduce the bug in a
> small test, that is attached.
>
> I also fixed it by modifying parserInternals.h at line: 41
>
> #define IS_CHAR(c) \
> (((c) == 0x09) || ((c) == 0x0A) || ((c) == 0x0B) || ((c) == 0x0D) ||
> \
> (((c) >= 0x20) && ((c) <= 0xD7FF)) || \
> (((c) >= 0xE000) && ((c) <= 0xFFFD)) || \
> (((c) >= 0x10000) && ((c) <= 0x10FFFF)))
>
> I added 0x0B as a CHAR. Still can't figure what is this caracter though.
> May be it should simply be ignored.
yep accepting 0xB would break XML compatibility. The enclosed patch
reports the error and skip the offending char. This bug was fairly
specific to the new script/style handling.
Daniel
-- Daniel.Veillard@w3.org | W3C, INRIA Rhone-Alpes | libxml Gnome XML toolkit Tel : +33 476 615 257 | 655, avenue de l'Europe | http://xmlsoft.org/ Fax : +33 476 615 207 | 38330 Montbonnot FRANCE | Rpmfind search site http://www.w3.org/People/all#veillard%40w3.org | http://rpmfind.net/
---- Message from the list xml@rpmfind.net Archived at : http://xmlsoft.org/messages/ to unsubscribe: echo "unsubscribe xml" | mail majordomo@rpmfind.net
This archive was generated by hypermail 2b29 : Tue Oct 31 2000 - 09:43:49 EST