[xml] Weirdness in XML parser

Date view	Thread view	Subject view	Author view

From: Kristian Hogsberg Kristensen (hogsberg@daimi.au.dk)
Date: Thu Nov 11 1999 - 15:37:40 EST

Next message: Daniel Veillard: "Re: [xml] Weirdness in XML parser"
Previous message: Stefan Bambach: "Re: [xml] parser xml"
Next in thread: Daniel Veillard: "Re: [xml] Weirdness in XML parser"
Reply: Daniel Veillard: "Re: [xml] Weirdness in XML parser"
Reply: Stefan Bambach: "Re: [xml] Weirdness in XML parser"
Maybe reply: Kristian Hogsberg Kristensen: "Re: [xml] Weirdness in XML parser"
Reply: Daniel Veillard: "Re: [xml] Weirdness in XML parser"

[ I sent this earlier, but apparently it didn't show up on the
  list... It's not on the list archive page either, which by the way
  seems to have stopped updating around October 11. Anyway, here
  goes... ]

Hi,

When parsing this document:

The HTML parser makes "foo" a child of <HR>... I tracked the problem
to this piece of code in htmlParseElement (lines 2310-2317):

    if (((depth == ctxt->nameNr) && (oldname == ctxt->name)) ||
        (name == NULL)) {
        if (CUR == '>')
            NEXT;
        return;
    }

which look a bit weird to me... I dont see what it's supposed to do.
What happens is that <HR> autocloses <P>, and when control reaches
these lines, oldname points to freed memory. Accidently, this memory
is used for the name of the new name, so oldname == ctxt->name and
thus htmlParseElement returns prematurely (it doesn't reach the test
for info->empty).

I see you've made <DD> autoclose <DT> and <DT> autoclose <DD>, but
what about also making <DD> autoclose <DD> and likewise for <DT>?
This would make the parser a bit more robust; suppose someone were to
do something like:

it would get parsed as

which I believe is a bit more useful than

regards,
Kristian

----
Message from the list xml@rufus.w3.org
Archived at : http://rufus.w3.org/veillard/XML/messages
to unsubscribe: echo "unsubscribe xml" | mail  majordomo@rufus.w3.org

Next message: Daniel Veillard: "Re: [xml] Weirdness in XML parser"
Previous message: Stefan Bambach: "Re: [xml] parser xml"
Next in thread: Daniel Veillard: "Re: [xml] Weirdness in XML parser"
Reply: Daniel Veillard: "Re: [xml] Weirdness in XML parser"
Reply: Stefan Bambach: "Re: [xml] Weirdness in XML parser"
Maybe reply: Kristian Hogsberg Kristensen: "Re: [xml] Weirdness in XML parser"
Reply: Daniel Veillard: "Re: [xml] Weirdness in XML parser"

Date view	Thread view	Subject view	Author view

This archive was generated by hypermail 2b29 : Wed Aug 02 2000 - 12:29:51 EDT