False positive for stray end tags #841

LiLoDavis · 2019-07-18T17:01:36Z

Page: https://www.w3.org/WAI/demos/bad/before/home.html

The Nu HTML Checker reports two stray end tags on this page, but both have corresponding start tags.

Error: Stray end tag noscript.
From line 142, column 70; to line 142, column 80
/FONT></noscript>↩ <
LL comment: The start tag is on the same line as the end tag (142).

Error: Stray end tag head.
From line 144, column 3; to line 144, column 9
script>↩ </head>↩ <bo
LL comment: The start tag is on line 1.

The text was updated successfully, but these errors were encountered:

LiLoDavis · 2019-07-18T22:16:41Z

Perhaps related, I'm also getting false positives for unclosed elements on this page: https://www.uwb.edu/brand/website/accessibility/examples/inaccessible-page

cvrebert · 2019-07-18T23:02:13Z

For https://www.w3.org/WAI/demos/bad/before/home.html , I suspect it's related to the <noscript> containing illegal children due to it being within the <head>. Probably that causes the parser to implicitly close the <noscript> tag early, thus making the explicit close </noscript> tag extraneous.

https://html.spec.whatwg.org/multipage/scripting.html#the-noscript-element

In a head element, if scripting is disabled for the noscript element

The noscript element must contain only link, style, and meta elements.

In a head element, if scripting is enabled for the noscript element

The noscript element must contain only text, except that invoking the HTML fragment parsing algorithm with the noscript element as the context element and the text contents as the input must result in a list of nodes that consists only of link, style, and meta elements that would be conforming if they were children of the noscript element, and no parse errors.

 and  aren't among the permitted children.

LiLoDavis · 2019-07-19T00:21:17Z

Ok, so not exactly a false positive, but rather a false attribution.

LiLoDavis · 2019-07-19T00:33:17Z

Hm. If the problem is that a <noscript> in a <head> can't contain  or  elements, shouldn't removing the  and  elements from the <noscript> prevent the "stray end tag" error? It doesn't.

sideshowbarker · 2019-07-19T02:02:35Z

Hm. If the problem is that a <noscript> in a <head> can't contain  or  elements, shouldn't removing the  and  elements from the <noscript> prevent the "stray end tag" error? It doesn't.

Can you doublecheck that?

Here is minimal test case:

<!doctype html>
<HTML lang="">
<title>test</title>
<noscript><b></b></noscript>

That one has  in <noscript>, which makes the checker report Stray end tag noscript:

https://validator.w3.org/nu/?showsource=yes&doc=data%3Atext%2Fhtml%3Bcharset%3Dutf-8%2C%253C%2521doctype%2520html%253E%250D%250A%253CHTML%2520lang%253D%2522%2522%253E%250D%250A%253Ctitle%253Etest%253C%252Ftitle%253E%250D%250A%253Cnoscript%253E%253Cb%253E%253C%252Fb%253E%253C%252Fnoscript%253E#textarea

Here is another minimal test case:

<!doctype html>
<HTML lang="">
<title>test</title>
<noscript></noscript>

That one has no  in <noscript>, and the checker reports no errors:

https://validator.w3.org/nu/?showsource=yes&doc=data%3Atext%2Fhtml%3Bcharset%3Dutf-8%2C%253C%2521doctype%2520html%253E%250D%250A%253CHTML%2520lang%253D%2522%2522%253E%250D%250A%253Ctitle%253Etest%253C%252Ftitle%253E%250D%250A%253Cnoscript%253E%253C%252Fnoscript%253E#textarea

sideshowbarker · 2019-07-19T02:29:10Z

For https://www.w3.org/WAI/demos/bad/before/home.html , I suspect it's related to the <noscript> containing illegal children due to it being within the <head>. Probably that causes the parser to implicitly close the <noscript> tag early, thus making the explicit close </noscript> tag extraneous.

That is exactly the case. But it’s important to note that you’ll only see that behavior when scripting is disabled (as mentioned in the spec section you cited).

When scripting is not disabled — which of course in the normal case in web browsers — then the parser actually won’t implicitly close that noscript element. More specifically, when scripting is not disabled, the This page uses scripts!!! inside the NOSCRIPT element in the source of https://www.w3.org/WAI/demos/bad/before/home.html just goes into the DOM as a text node.

But the checker is not capable of checking documents with scripting enabled. The checker doesn’t have a JavaScript engine to execute script with. So the checker uses the HTML parser in “scripting disabled” mode. And in “scripting disabled” mode, the parser doesn’t evaluate the NOSCRIPT content as text — instead the parser tries to parse any markup it finds inside the NOSCRIPT.

So exactly what happens here is that when the parser hits that  start tag, the parser inserts an implicit </noscript> end tag before the  start tag. But the parser doesn’t stop there; because the b element cannot appear in head, the parser also inserts both an implicit </head> end tag before the  start tag, and also inserts an implicit <body> start tag.

So with scripting disabled, this is what the parser ends up putting into the DOM:

…and this is what ends up getting rendered:

sideshowbarker · 2019-07-19T02:33:24Z

Ok, so not exactly a false positive, but rather a false attribution.

Yeah, basically. The is just one of the many parts of the HTML parsing algorithm that is almost absurdly arcane and non-intuitive. So for this case, it’s very difficult to have the checker emit a user-friendly error message that clearly and succinctly explains what the root problem actually is.

sideshowbarker · 2019-07-19T02:38:08Z

Perhaps related, I'm also getting false positives for unclosed elements on this page: https://www.uwb.edu/brand/website/accessibility/examples/inaccessible-page

I don’t get any messages about unclosed elements on that page —

https://validator.w3.org/nu/?doc=https%3A%2F%2Fwww.uwb.edu%2Fbrand%2Fwebsite%2Faccessibility%2Fexamples%2Finaccessible-page

Maybe something changed? It got updated in the meantime?

LiLoDavis · 2019-07-19T15:26:06Z

Perhaps related, I'm also getting false positives for unclosed elements on this page: https://www.uwb.edu/brand/website/accessibility/examples/inaccessible-page

I don’t get any messages about unclosed elements on that page —

https://validator.w3.org/nu/?doc=https%3A%2F%2Fwww.uwb.edu%2Fbrand%2Fwebsite%2Faccessibility%2Fexamples%2Finaccessible-page

Maybe something changed? It got updated in the meantime?

I've been using the Check serialized DOM of current page bookmarklet rather than typing the URL into the Nu HTML Checker page. I did it both ways just now and got different results. Wasn't expecting that. :-(
For that page:

Using the Nu HTML Checker directly, I get 12 warnings and 2 errors.
Using the bookmarklet, I get 15 warnings, 14 errors, and 1 fatal error.

sideshowbarker · 2019-08-13T07:11:19Z

I think this is resolved per the comments above. If not, let me know and we can reopen it.

sideshowbarker closed this as completed Aug 13, 2019

StdGit mentioned this issue Feb 8, 2021

subtheme-iservice-en/fr Unmatched end tag wet-boew/cdts-sgdc#76

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

False positive for stray end tags #841

False positive for stray end tags #841

LiLoDavis commented Jul 18, 2019 •

edited

LiLoDavis commented Jul 18, 2019

cvrebert commented Jul 18, 2019

LiLoDavis commented Jul 19, 2019

LiLoDavis commented Jul 19, 2019 •

edited

sideshowbarker commented Jul 19, 2019

sideshowbarker commented Jul 19, 2019

sideshowbarker commented Jul 19, 2019

sideshowbarker commented Jul 19, 2019

LiLoDavis commented Jul 19, 2019

sideshowbarker commented Aug 13, 2019

False positive for stray end tags #841

False positive for stray end tags #841

Comments

LiLoDavis commented Jul 18, 2019 • edited

LiLoDavis commented Jul 18, 2019

cvrebert commented Jul 18, 2019

LiLoDavis commented Jul 19, 2019

LiLoDavis commented Jul 19, 2019 • edited

sideshowbarker commented Jul 19, 2019

sideshowbarker commented Jul 19, 2019

sideshowbarker commented Jul 19, 2019

sideshowbarker commented Jul 19, 2019

LiLoDavis commented Jul 19, 2019

sideshowbarker commented Aug 13, 2019

LiLoDavis commented Jul 18, 2019 •

edited

LiLoDavis commented Jul 19, 2019 •

edited