From: Daniel Veillard (Daniel.Veillard@w3.org)
Date: Fri Jul 21 2000 - 15:52:28 EDT
On Fri, Jul 21, 2000 at 03:53:35PM +0200, Daniel Veillard wrote:
>
> On Fri, Jul 21, 2000 at 02:09:25PM +0200, David Kågedal wrote:
> >
> > davidk@lysator.liu.se (David Kågedal) writes:
> >
> > > Someone else noticed that xmlParseMemory got character encodings
> > > wrong, and I can only confirm this.
> > >
> > > I have the following small XML document:
> > >
> > > <?xml version="1.0" encoding="iso-8859-1"?>
> > > <test att="räka">ohåj <börje/></test>
> > >
> > > And the attached program which tries both xmlParseFile and
> > > xmlParseMemory, and fails on the latter. The output is also attached.
> >
> > Oops, I seem to have included the wrong file. Here it is instead:
> >
> > - From file
> > encoding: iso-8859-1
> > charset: 1
> > - From memory
> > Entity: line 1: error: xmlSwitchEncoding : no input
> > <?xml version="1.0" encoding="iso-8859-1"?>
> > ^
> > Entity: line 2: error: Input is not proper UTF-8, indicate encoding !
> > <test att="räka">ohåj <börje/></test>
> > ^
> > Entity: line 2: error: Bytes: 0xE4 0x6B 0x61 0x22
> > <test att="räka">ohåj <börje/></test>
> > ^
> > encoding: iso-8859-1
> > charset: 10
>
> Okay there is clearly a problem with encoding support when
> parsing from memory. I guess I understand where the problem
> comes from and I will try to debug it ASAP and provide a patch
Done, the fix is propagated in W3C CVS base, I will commit it
to gnome base soon. The patch being a bit larger than expected
I will probably issue a 2.2.1 including it soon too.
Daniel
-- Daniel.Veillard@w3.org | W3C, INRIA Rhone-Alpes | Today's Bookmarks : Tel : +33 476 615 257 | 655, avenue de l'Europe | Linux XML libxml WWW Fax : +33 476 615 207 | 38330 Montbonnot FRANCE | Gnome rpm2html rpmfind http://www.w3.org/People/all#veillard%40w3.org | RPM badminton Kaffe ---- Message from the list xml@xmlsoft.org Archived at : http://xmlsoft.org/messages/ to unsubscribe: echo "unsubscribe xml" | mail majordomo@xmlsoft.org
This archive was generated by hypermail 2b29 : Wed Aug 02 2000 - 12:30:24 EDT