From: Daniel Veillard (Daniel.Veillard@w3.org)
Date: Fri Jul 21 2000 - 09:53:35 EDT
On Fri, Jul 21, 2000 at 02:09:25PM +0200, David Kågedal wrote:
>
> davidk@lysator.liu.se (David Kågedal) writes:
>
> > Someone else noticed that xmlParseMemory got character encodings
> > wrong, and I can only confirm this.
> >
> > I have the following small XML document:
> >
> > <?xml version="1.0" encoding="iso-8859-1"?>
> > <test att="räka">ohåj <börje/></test>
> >
> > And the attached program which tries both xmlParseFile and
> > xmlParseMemory, and fails on the latter. The output is also attached.
>
> Oops, I seem to have included the wrong file. Here it is instead:
>
> - From file
> encoding: iso-8859-1
> charset: 1
> - From memory
> Entity: line 1: error: xmlSwitchEncoding : no input
> <?xml version="1.0" encoding="iso-8859-1"?>
> ^
> Entity: line 2: error: Input is not proper UTF-8, indicate encoding !
> <test att="räka">ohåj <börje/></test>
> ^
> Entity: line 2: error: Bytes: 0xE4 0x6B 0x61 0x22
> <test att="räka">ohåj <börje/></test>
> ^
> encoding: iso-8859-1
> charset: 10
Okay, there is clearly a problem with encoding support when
parsing from memory. I think I understand where the problem
comes from; I will try to debug it ASAP and provide a patch.
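
For readers of the archive, a minimal sketch of why the parser complains about
the reported bytes. It is not libxml code and not the eventual patch; the
function names is_valid_utf8 and latin1_to_utf8 are hypothetical helpers made
up for illustration. In ISO-8859-1 the dumped bytes 0xE4 0x6B 0x61 0x22 spell
`äka"`. Read as UTF-8, 0xE4 announces a three-byte sequence, but 0x6B is not a
continuation byte, so the input is rejected unless it is first transcoded from
the declared encoding.

```c
#include <stddef.h>

/* Return 1 if buf[0..len) has well-formed UTF-8 lead/continuation structure,
 * 0 otherwise. Simplified on purpose: overlong forms and surrogates are not
 * rejected here. */
int is_valid_utf8(const unsigned char *buf, size_t len)
{
    size_t i = 0;
    while (i < len) {
        unsigned char c = buf[i];
        size_t need;                          /* continuation bytes expected */
        if (c < 0x80)                need = 0; /* plain ASCII */
        else if ((c & 0xE0) == 0xC0) need = 1;
        else if ((c & 0xF0) == 0xE0) need = 2; /* 0xE4 lands here */
        else if ((c & 0xF8) == 0xF0) need = 3;
        else return 0;                        /* stray continuation or bad lead */
        if (need > len - i - 1)
            return 0;                         /* sequence truncated */
        for (size_t k = 1; k <= need; k++)
            if ((buf[i + k] & 0xC0) != 0x80)
                return 0;                     /* e.g. 0x6B after 0xE4 */
        i += need + 1;
    }
    return 1;
}

/* Transcode ISO-8859-1 to UTF-8. The caller must provide an output buffer of
 * at least 2 * len bytes. Returns the number of bytes written. */
size_t latin1_to_utf8(const unsigned char *in, size_t len, unsigned char *out)
{
    size_t o = 0;
    for (size_t i = 0; i < len; i++) {
        if (in[i] < 0x80) {
            out[o++] = in[i];
        } else {
            /* Code points 0x80..0xFF become two-byte sequences. */
            out[o++] = (unsigned char)(0xC0 | (in[i] >> 6));
            out[o++] = (unsigned char)(0x80 | (in[i] & 0x3F));
        }
    }
    return o;
}
```

Calling is_valid_utf8 on the four reported bytes returns 0, while the output of
latin1_to_utf8 on the same bytes (0xC3 0xA4 0x6B 0x61 0x22) passes the check.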
Daniel
-- 
Daniel.Veillard@w3.org | W3C, INRIA Rhone-Alpes  | Today's Bookmarks :
Tel : +33 476 615 257  | 655, avenue de l'Europe | Linux XML libxml WWW
Fax : +33 476 615 207  | 38330 Montbonnot FRANCE | Gnome rpm2html rpmfind
 http://www.w3.org/People/all#veillard%40w3.org  | RPM badminton Kaffe
----
Message from the list xml@xmlsoft.org
Archived at : http://xmlsoft.org/messages/
to unsubscribe: echo "unsubscribe xml" | mail majordomo@xmlsoft.org