[xml] bug in HTML parser and request...

Date view Thread view Subject view Author view

From: Marc Sanfacon (sanm@copernic.com)
Date: Fri Oct 20 2000 - 08:45:30 EDT


Hi there,
        one of the part we are using from libxml is the HTML parser. We
just found a problem with the following document:

Document:

<!DOCTYPE html PUBLIC "-//IETF//DTD HTML 2.0//EN">
<html>
  <head>
    <meta http-equiv="Pragma" content="no-cache">
    <title>
      This is a title
    </title>
  </head>
  <body>
  </body>
</html>

Output from libxml:

<!DOCTYPE html PUBLIC "-//IETF//DTD HTML 2.0//EN">
<html>
<head>
<meta http-equiv="Pragma" content="no-cache">
<title>
      This is a title
    </title>
</head>
<body>
</html>

By putting something between <body> & </body>, the output is:

<body><p>
whatever
</p></body>

Thank you.

---------------------------------------------------------------------
 "Better the pride that resides, in a citizen of the world.
  Than the pride that divides, when a colorful rag is
  unfurled." Neil Peart
---------------------------------------------------------------------
Marc Sanfacon, Software developer Copernic.com
e-mail: sanm@copernic.com R&D Group
Tel : (418) 527-0528 ext 1212


----
Message from the list xml@rpmfind.net
Archived at : http://xmlsoft.org/messages/
to unsubscribe: echo "unsubscribe xml" | mail  majordomo@rpmfind.net


Date view Thread view Subject view Author view

This archive was generated by hypermail 2b29 : Fri Oct 20 2000 - 09:43:40 EDT