Re: [xml] libxml 2.2.0 xmlParseMemory bug

From: Daniel Veillard (Daniel.Veillard@w3.org)
Date: Fri Jul 21 2000 - 09:53:35 EDT


On Fri, Jul 21, 2000 at 02:09:25PM +0200, David Kågedal wrote:
>
> davidk@lysator.liu.se (David Kågedal) writes:
>
> > Someone else noticed that xmlParseMemory got character encodings
> > wrong, and I can only confirm this.
> >
> > I have the following small XML document:
> >
> > <?xml version="1.0" encoding="iso-8859-1"?>
> > <test att="räka">ohåj <börje/></test>
> >
> > And the attached program which tries both xmlParseFile and
> > xmlParseMemory, and fails on the latter. The output is also attached.
>
> Oops, I seem to have included the wrong file. Here it is instead:
>
> - From file
> encoding: iso-8859-1
> charset: 1
> - From memory
> Entity: line 1: error: xmlSwitchEncoding : no input
> <?xml version="1.0" encoding="iso-8859-1"?>
> ^
> Entity: line 2: error: Input is not proper UTF-8, indicate encoding !
> <test att="räka">ohåj <börje/></test>
> ^
> Entity: line 2: error: Bytes: 0xE4 0x6B 0x61 0x22
> <test att="räka">ohåj <börje/></test>
> ^
> encoding: iso-8859-1
> charset: 10

  Okay, there is clearly a problem with encoding support when
parsing from memory. I think I understand where the problem
comes from, and I will try to debug it ASAP and provide a patch.
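
  For reference, the bytes in the last error can be checked by hand.
0xE4 is "ä" in ISO-8859-1, but read as UTF-8 it is the lead byte of a
three-byte sequence, so the following 0x6B ("k") is not a valid
continuation byte. That is what one would expect if the declared
encoding is not applied and the buffer is read as UTF-8. The sketch
below is plain C and makes no libxml calls; the helper names
is_valid_utf8 and latin1_to_utf8 are hypothetical and only illustrate
the check and the conversion, not how the parser implements them.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper, not part of libxml: returns 1 if buf[0..len) is
 * structurally valid UTF-8 (each lead byte is followed by the right
 * number of 10xxxxxx continuation bytes), 0 otherwise. Overlong forms
 * and surrogates are not rejected; this only illustrates the error. */
static int is_valid_utf8(const unsigned char *buf, size_t len)
{
    size_t i = 0;

    while (i < len) {
        unsigned char c = buf[i];
        size_t need;

        if (c < 0x80)                need = 0;  /* ASCII */
        else if ((c & 0xE0) == 0xC0) need = 1;  /* 110xxxxx */
        else if ((c & 0xF0) == 0xE0) need = 2;  /* 1110xxxx, e.g. 0xE4 */
        else if ((c & 0xF8) == 0xF0) need = 3;  /* 11110xxx */
        else return 0;                          /* stray continuation byte */

        if (need > len - i - 1)
            return 0;                           /* sequence is truncated */
        for (size_t k = 1; k <= need; k++)
            if ((buf[i + k] & 0xC0) != 0x80)
                return 0;                       /* not a continuation byte */
        i += need + 1;
    }
    return 1;
}

/* Hypothetical helper: transcode ISO-8859-1 to UTF-8. The caller must
 * provide room for 2 * len + 1 bytes in out. Returns the number of
 * bytes written, not counting the terminating NUL. */
static size_t latin1_to_utf8(const unsigned char *in, size_t len,
                             unsigned char *out)
{
    size_t o = 0;

    assert(in != NULL && out != NULL);
    for (size_t i = 0; i < len; i++) {
        if (in[i] < 0x80) {
            out[o++] = in[i];
        } else {
            /* U+0080..U+00FF always becomes a two-byte sequence */
            out[o++] = (unsigned char)(0xC0 | (in[i] >> 6));
            out[o++] = (unsigned char)(0x80 | (in[i] & 0x3F));
        }
    }
    out[o] = '\0';
    return o;
}
```

  Feeding the four reported bytes { 0xE4, 0x6B, 0x61, 0x22 } to
is_valid_utf8 gives 0, while the output of latin1_to_utf8 on the same
bytes (0xC3 0xA4 0x6B 0x61 0x22) passes the check. As far as I recall
the encoding enum, charset 1 is XML_CHAR_ENCODING_UTF8 and charset 10
is XML_CHAR_ENCODING_8859_1, which would match the tree built from
memory not having been converted; treat those values as an assumption
and check encoding.h for the release in use.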

Daniel

-- 
Daniel.Veillard@w3.org | W3C, INRIA Rhone-Alpes  | Today's Bookmarks :
Tel : +33 476 615 257  | 655, avenue de l'Europe | Linux XML libxml WWW
Fax : +33 476 615 207  | 38330 Montbonnot FRANCE | Gnome rpm2html rpmfind
 http://www.w3.org/People/all#veillard%40w3.org  | RPM badminton Kaffe
----
Message from the list xml@xmlsoft.org
Archived at : http://xmlsoft.org/messages/
to unsubscribe: echo "unsubscribe xml" | mail  majordomo@xmlsoft.org


This archive was generated by hypermail 2b29 : Wed Aug 02 2000 - 12:30:24 EDT