Re: [xml] Encoding question

Date view Thread view Subject view Author view

From: Daniel Veillard (Daniel.Veillard@w3.org)
Date: Sat Mar 04 2000 - 06:57:40 EST


On Fri, Mar 03, 2000 at 10:57:10AM -0800, Bill Randle wrote:
>
>
> I noticed in the current cvs (as of yesterday afternoon) that the
> IS_XXX macros in parserInternals.h assume 16 bit characters in
> the macro arguments [i.e. Unicode]. I realize this is left over
> from when xmlChar was a short if UNICODE was defined, but now
> that xmlChar is always a "char", it causes lots of compile time
> warnings in HTMLparser.c, entities.c, parser.c, etc.

  Well the macros assume a type handling at least up to 0x11000,
so it's rather an int really. xmlChar is rightly only 8 bits now
but the parser has been modified to lookup for UTF8 handling

> Is this still code not to the point where I should be using it,
> or I am I missing something?

  the warnings should only occur in xpath.c and HTMLparser.c
since I didn't yet fixed those. I don't have warning in parser.c
not entities.c . The #ifdef UNICODE is dead yes, maintaining both
would have been a big pain, and UTF16 is now supported by switching
it on the fly to UTF-8.

  this code evolves fast, with the goal of being 100% conformant
to the XML spec (or as much as this can be tested).

Daniel

-- 
Daniel.Veillard@w3.org | W3C, INRIA Rhone-Alpes  | Today's Bookmarks :
Tel : +33 476 615 257  | 655, avenue de l'Europe | Linux XML libxml WWW
Fax : +33 476 615 207  | 38330 Montbonnot FRANCE | Gnome rpm2html rpmfind
 http://www.w3.org/People/all#veillard%40w3.org  | RPM badminton Kaffe
----
Message from the list xml@xmlsoft.org
Archived at : http://xmlsoft.org/messages/
to unsubscribe: echo "unsubscribe xml" | mail  majordomo@xmlsoft.org


Date view Thread view Subject view Author view

This archive was generated by hypermail 2b29 : Wed Aug 02 2000 - 12:30:08 EDT