Re: [xml] RE: HTML Parser problem with big files.

From: Daniel Veillard (Daniel.Veillard@w3.org)
Date: Thu Jan 04 2001 - 05:58:29 EST


On Mon, Dec 11, 2000 at 02:45:16PM -0500, Marc Sanfacon wrote:
> I found the problem. It is in encoding.c
>
> line 1810:
>
> toconv = in->use;
> if (toconv == 0)
>     return(0);
> written = out->size - out->use;
> if (toconv * 2 >= written) {
>     /* Old line: xmlBufferGrow(out, toconv * 2); */
>     xmlBufferGrow(out, out->size + toconv * 2);
>     written = out->size - out->use - 1;
> }

  Well, I reopened the issue and it was quite a bit more
complex than that. The problem came from the iconv support
for the charset=windows-1252 generated by FrontPage. iconv refused
to process large chunks in one pass, so I modified the encoding and
I/O front-end to iterate over multiple chunks.
  It's committed in CVS and will be in the next release.
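
  For illustration only, here is a rough sketch of the general idea of
feeding iconv a bounded amount of input per pass instead of the whole
buffer at once. This is not the code committed to CVS: the function name
convert_in_chunks, its parameters and its error handling are all
hypothetical, and it assumes a caller-supplied output buffer that is
already large enough.

```c
#include <errno.h>
#include <iconv.h>
#include <stddef.h>

/* Hypothetical helper, NOT libxml code: converts in_len bytes of `in`
 * from encoding `from` to encoding `to`, handing iconv() at most
 * `chunk` input bytes per pass.
 * Returns the number of bytes written to `out`, or (size_t)-1 on error
 * (including an output buffer that is too small). */
size_t convert_in_chunks(const char *from, const char *to,
                         const char *in, size_t in_len,
                         char *out, size_t out_cap, size_t chunk)
{
    iconv_t cd;
    char *src = (char *)in;   /* iconv() wants a non-const char ** */
    char *dst = out;
    size_t src_left = in_len;
    size_t dst_left = out_cap;
    int failed = 0;

    if (chunk == 0)
        return (size_t)-1;

    cd = iconv_open(to, from);
    if (cd == (iconv_t)-1)
        return (size_t)-1;

    while (src_left > 0) {
        size_t pass = src_left < chunk ? src_left : chunk;
        size_t pass_left = pass;
        size_t rc = iconv(cd, &src, &pass_left, &dst, &dst_left);
        size_t consumed = pass - pass_left;

        src_left -= consumed;

        if (rc == (size_t)-1) {
            /* EINVAL means the chunk ended inside a multibyte sequence;
             * the leftover bytes are retried at the start of the next
             * pass. Anything else, or no progress at all, is fatal. */
            if (errno != EINVAL || consumed == 0) {
                failed = 1;
                break;
            }
        }
    }

    /* Flush any pending shift state for stateful encodings. */
    if (!failed && iconv(cd, NULL, NULL, &dst, &dst_left) == (size_t)-1)
        failed = 1;

    iconv_close(cd);
    return failed ? (size_t)-1 : out_cap - dst_left;
}
```

  A caller would pass something like "WINDOWS-1252" and "UTF-8" as the
encoding names (which names are accepted depends on the iconv
implementation) together with a modest chunk size. The real front-end
also has to grow its output buffer, which this sketch leaves out.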

Daniel

-- 
Daniel Veillard      | Red Hat Network http://redhat.com/products/network/
daniel@veillard.com  | libxml Gnome XML toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/
----
Message from the list xml@rpmfind.net
Archived at : http://xmlsoft.org/messages/
to unsubscribe: echo "unsubscribe xml" | mail  majordomo@rpmfind.net


This archive was generated by hypermail 2b29 : Thu Jan 04 2001 - 09:43:33 EST