[xml] Question about libxml...

Date view Thread view Subject view Author view

From: Marc Sanfacon (sanm@copernic.com)
Date: Wed Nov 15 2000 - 14:51:18 EST


Hi there,
        we have found a problem in the HTML parser. Here is my HTML code:

<SCRIPT LANGUAGE="JavaScript">
<!--
var cobrand_directory = "";
//-->
</SCRIPT>

 <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML>
<HEAD>
<TITLE>Title</TITLE>
</HEAD>

<BODY>
This is a test
</BODY>
</HTML>

libxml (2.2.7) outputs the following:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<html><head>
<script language="JavaScript">
<!--
var cobrand_directory = "";
//-->
</script>
<html>
<head><title>Title</title></head>
<body><p>
This is a test
</p></body>
</html>
</html>

As you can see, the results contain 2 html tags, 2 head tags, 2 ending html
and only 1 head tag.
I have pinpointed where this comes from (htmlcheckImplied), but haven't
found where to fix it yet.

I think there should be only 1 html and 1 head tag with the proper ending
tag.

I posted this, just in case Daniel, or somebody else, can fix the problem or
can help me fix it.

Regards,
        Marc

---------------------------------------------------------------------
 "Better the pride that resides, in a citizen of the world.
  Than the pride that divides, when a colorful rag is
  unfurled." Neil Peart
---------------------------------------------------------------------
Marc Sanfacon, Software developer Copernic.com
e-mail: msanfacon@copernic.com R&D Group
Tel : (418) 527-0528 ext 1212


----
Message from the list xml@rpmfind.net
Archived at : http://xmlsoft.org/messages/
to unsubscribe: echo "unsubscribe xml" | mail  majordomo@rpmfind.net


Date view Thread view Subject view Author view

This archive was generated by hypermail 2b29 : Wed Nov 15 2000 - 15:43:39 EST