diff options
author | Daniel Veillard <veillard@redhat.com> | 2012-09-07 19:32:12 +0800 |
---|---|---|
committer | Daniel Veillard <veillard@redhat.com> | 2012-09-07 19:32:12 +0800 |
commit | f933c898132f20a50ba39ac6116378b71a01c700 (patch) | |
tree | 309cb5de92c5636021544b06de649eb4b3e4fc1a /result/HTML/html5_enc.html | |
parent | 878ec9db9df09b22322906bc5fc61537391070e4 (diff) | |
download | android_external_libxml2-f933c898132f20a50ba39ac6116378b71a01c700.tar.gz android_external_libxml2-f933c898132f20a50ba39ac6116378b71a01c700.tar.bz2 android_external_libxml2-f933c898132f20a50ba39ac6116378b71a01c700.zip |
Keep non-significant blanks node in HTML parser
For https://bugzilla.gnome.org/show_bug.cgi?id=681822
Regardless if the option HTML_PARSE_NOBLANKS is set or not, blank nodes
are removed from a HTML document, for example:
<html>
<head>
<title>This is a test.</title>
</head>
<body>
<p>This is a test.</p>
</body>
</html>
is read as:
<html><head><title>This is a test.</title></head><body>
<p>This is a test.</p>
</body></html>
This changes the default behaviour but the old behaviour is available
as expected when using the parser flag HTML_PARSE_NOBLANKS
Based on original patch from Igor Ignatyuk <igor_ignatiouk@hotmail.com>
* HTMLparser.c: change various places in the parser where ignorable_space
SAX callback was called without checking for the parser flag preference
* xmllint.c: make sure we use the new flag even for HTML parsing
* result/HTML/*: this modifies the output of a number of tests
Diffstat (limited to 'result/HTML/html5_enc.html')
-rw-r--r-- | result/HTML/html5_enc.html | 4 |
1 files changed, 3 insertions, 1 deletions
diff --git a/result/HTML/html5_enc.html b/result/HTML/html5_enc.html index 596d54d7..44ceebca 100644 --- a/result/HTML/html5_enc.html +++ b/result/HTML/html5_enc.html @@ -1,6 +1,8 @@ <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/REC-html40/loose.dtd"> <html> -<head><meta charset="iso-8859-1"></head> +<head> +<meta charset="iso-8859-1"> +</head> <body> <p>très</p> </body> |