Structural Semantics The Importance Of HTML5 Sectioning Elements

Advertisement

Whatever you call them — blocks, boxes, areas, regions — we’ve been dividing our Web pages into visible sections for well over a decade. The problem is, we’ve never had the right tools to do so. While our interfaces look all the world like grids, the underlying structure has been cobbled together from numbered headings and unsemantic helper elements; an unbridled stream of content at odds with its own box-like appearance.

Because we can make our <div>s look but not behave like sections, the experience for assistive technology (AT) users and data-mining software is quite different from the experience enjoyed by those gifted with sight.

Now that HTML5 has finally made sectioning elements1 available, many of us greet them with great reluctance. Why? Partly, because we’re a community which is deceptively resistant to change, but also because of some perceived discrepancies regarding advice in the specification. In truth, the advice is sound and the algorithm for sectioning is actually easier to use than previous implementations. Some developers are just very married to their old workflow, and they think you should be too. There’s no good reason why.

Make no mistake: Sectioning elements help you improve document structure, and they’re in the spec’ to stay. Once and for all, I will be exploring the problems these elements solve, the opportunities they offer and their important but misunderstood contribution to the semantic Web. If you’re unfamiliar with the concept of the “semantic Web,” this video2 is a great introduction.

Making Websites

My introduction to Web design was via a university course module called something like “2.1: Dreamweaver,” and I recall my first website well. I remember my deliberately garish choice of Web-safe3 colors. I remember it looking right only in Netscape Navigator4. Most of all, I remember hours of frustration from tugging at the perimeter of a visual layout tool named “table.” I had no idea at the time that this layout tool represented a type of annotation called an HTML tag. Furthermore, no one told me that this annotation invited my patchwork of primary colors and compressed JPEGs to be computed as a sort of demented Excel spreadsheet. In other words, I had no idea I was doing it wrong.

A Dreamweaver table
*Bites tongue*

The fundamental failure of most graphic, product, architectural, and even urban design is its insistence on serving the God of Looking-Good rather than the God of Being-Good.

– Richard Saul Wurman

Macromedia’s Dreamweaver didn’t make the creation of valid documents impossible, but it was one of a number of emerging GUI editors that pandered to our desire for visual expression more than it encouraged informational clarity. Dreamweaver, and other editors classified under the misnomer “WYSIWYG,” helped transform a standardized information system into a home for graphic design and enabled a legion of insufferable Nathan Barleys5 to flypost the World Wide Web with their vapid eye candy. I was one of many.

Web Standards

By the time I made my first website, the Web standards movement, promoting compliance, uniformity and inclusion, was burgeoning. I just wasn’t aware of it until much later. I didn’t have to be: Agency-based Web design was still mainly graphic design with a reluctant programming department clumsily bolted on. If you’re doubtful of the grip that this culture has had on the World Wide Web, look no further than the fact it took until 2010 (2010!) for us to concede that Web browsers are not really made of paper6.

When I finally became familiar with Web standards and the practice of “doing things right,” it was as someone who still worked primarily as a visual designer. Inevitably, my first forays into standards-based design revolved around mastering “CSS layout,” the practice of visually arranging content without relying on the semantically incorrect <table> element. We’ve held up <div>-based layout as a mark of quality for a number of years now. You might even say that it has become a time-honored rite of passage for graphic designers who are moving into “proper” HTML coding.

As I shall demonstrate, the <div> is the ultimate Graphic Design tool. By affecting only appearance, it licenses poor document structure and overengineered interfaces; all without making your document technically invalid. As such, it sanctions the worst kind of hacks.

The Problem With <div>

Every day, thousands of Web developers invoke the almighty <div> to divide, partition and ring-fence their Web pages’ content. We use the <div> to police content, to prevent disparate chunks of information from collapsing into each other. In truth, the <div> has no such power.

Consider the following example:

Two column layout with sidebar encircled with dark border7

In this basic layout, I have included a body of text and an adjacent “sidebar.” To make it absolutely clear to the reader that the sidebar is tangential and does not belong to the main content, I’ve drawn a fat line around it using the border property. For those of you screaming, “That sidebar heading should be an <h3>!”, I’ll get to that shortly. All of my design decisions (the adjacent position, the border and the reduced font size) are facilitated by CSS alone. So, when I take the CSS away, I get this:

The same layout as before is now one column, no borders8

Not only is switching off CSS the quickest way to make a Web page responsive9, but it’s a great way to see how HTML4 documents (which lack sectioning elements) are actually computed. In this case, our so-called “sidebar” is revealed to be just another raft of information in the linear flow10 of the document.

Why Is This So?

The reason for this is that the <div> is, and always has been, a flow content11 element. No matter how thick the <div>’s borders or how dark its background color, it does not stand apart in the structure of the document. Neither, therefore, does its content. With the CSS removed, the faux sidebar’s heading of “Resources” now seems less a distinct component of the page and more a part of the main content. To a parser or screen reader, it would have seemed this way all along.

For reasons of clarity, let’s look at a further example using a snippet of HTML:

<div class="parent">
   <h2>Heading</h2>
   <p>Some content...</p>
      <div class="child">
         <h2>Another heading</h2>
         <p>Some other content...</p>
      </div>
   </div>

I’ve done something slightly different here by entering the two <div>s into a parent-child relationship: The div.child tag belongs to div.parent. We can certainly make it look that way with CSS, anyway. However, <div>s, to quote the specification, “have no special meaning.” Not only do they not mean anything semantically, but they have no impact on the computable structure of the page (sometimes called the “document outline12”). The <div>s we’ve used may as well be invisible; so, to get a meaningful map of the structure we’ve created, we should remove them completely. That leaves just four elements and reveals the parent-child relationship to be an illusion:

<h2>Heading</h2>
   <p>Some content...</p>
   <h2>Another heading</h2>
   <p>Some other content...</p>

As HTML coders interested in sound structure, we should be interested that the above reduction — which omits all meaningless elements — is what we’ve actually made, and it’s not what we set out to do: By not really belonging to “parent,” “child” has a different contextual status in the document than intended.

Heading Levels Don’t Really Help

It’s popular to believe that replacing the second <h2> with an <h3> would solve our problem. If we did so, we’d get the following, more dynamic outline:

  • A Heading (h2)
    • Another Heading (h3)

This solution certainly seems more purposeful, but is it the right decision? Should the second heading be a subheading within the same topic (an <h3>) or be the introduction of an entirely new topic (an <h2>, as we had in the first place)? Headings alone can only show where a piece of content starts, not where it ends, which makes it difficult to tell what belongs to what. We have to simulate belonging by choosing the correct heading level for the context. Just think about that for a second: We’re defining the content’s structural status by labeling it retroactively. It’s just begging to go wrong.

Lets have a look at the homepage13 of accessibility experts The Paciello Group. Naturally, it’s a highly accessible and pretty well organized site, but could the structure be improved with HTML5 sections? You’ll notice their use of a <div> to collectively wrap the three <h2>s, Software Developers, Website Owners and Mike Paciello. Since the <div> doesn’t computably contain these three blocks, the last <h2> and the following <h3> are allowed to pair off in this relationship:

  • Mike Paciello (h2)
    • Contact Us Now (h3)

Wait … so, “Contact Us Now” is a subtopic belonging to the larger theme of “Mike Paciello”. Can that be right? It certainly doesn’t look this way in the visual layout. It’s worth noting at this point that the <div> which fails to thematically group those three <h2> blocks has a class of class="region". Ironically, if this <div> had been a <section>, some screen readers would consider it a “region”. If a <section> had been used in place of the <div>, the observed relationship would not have emerged: The “region” would be self-contained. The class of “region”, however, is not taken into consideration in any meaningful way and does not affect the structure.

Okay, so that’s a weird one, but the situation only gets more confusing when we start to include items for which headings aren’t really even appropriate. Take this further example:

This layout has an h1, an h2 for content and an h3 sidebar with a footer div at the bottom14

In my HTML4 page, I have an <h1> to introduce the document, an <h2> for the main content and an <h3> to mark the start of my “sidebar” (which is just a wishy-washy <div>, as in previous examples). The page follows long-standing convention by having an untitled div#footer resting at the foot of the document for copyright information and other such necessary evils. (It has to be a <div> in HTML4, because the <footer> tag doesn’t exist yet.) The question is, to which heading does the footer belong?

Whose Footer Is This?

Most of us, based on appearances, would agree that the footer must belong to the document. That is what we’ve learned to expect. To the unsighted, it is a different story: Because there is no new introductory heading between the sidebar <h3> and the footer content, it could be extrapolated that these two components are as one (see image below left). By the same token, one could also argue that we’ve included the “sidebar” as a mere “break” from the flow of the main content, before returning to that flow at the advent of the footer (see image below right). This would make the <h2> the footer’s heading.

Red outlines show different interpretations of structure15

The only decent chance we have of understanding the intended structure of the page is by inferring it from a reading of the content. Remembering that the whole point of a “markup language” is to make the structure of information easier to follow, I may as well have chucked the HTML and written my Web page on the back of a napkin.

Some accessibility gurus would suggest that you use a remedial <h2> to head the #footer and bring it back in line, marking up the end of the sidebar like so:

  • h1 (page)
    • h2 (main)
      • h3 (sidebar)
    • h2 (footer)

This kind of works as a hack, but it’s not really sound. Do you really want to make a big announcement of the footer — an announcement as big and bold as the one used to summon the main content, not to mention bolder than the sidebar? No. If our Web page were a film, the footer wouldn’t be the titles — it would be the credits. In HTML5, the <footer> element16 “contains information about its section.” This is semantically superior: We don’t use footers to introduce topics; we use them to conclude them. Accordingly, footers — unlike their parent sections in HTML5 — do not require headings.

Tweet reads: Marking up lots of headings in a page significantly dilutes a screen reader user's ability to navigate between parts of a page efficiently17
Just because the nesting level of headings is correct doesn’t necessarily make a page easy to read.

The closest thing we have to a “system” for structuring documents properly in HTML4 is numbered headings. Not only does this lead to ambiguity, as explained, but in practice we don’t really even use headings to define structure. We use <div>s to define structure and throw in some apologetic headings for accessibility’s sake. To make matters even worse, advice regarding the deployment of numbered headings18 isn’t even clear on whether we should use them in order (h1-h6) or not.

The loose coupling between headings and <div>s is inadequate. Now, with the introduction of sectioning elements, we still use boxes, of sorts, but boxes that actually say something on their own. We are making a move from merely implying sections (by labeling them) to letting them define themselves. Simultaneously, sighted readers and unsighted parsers can experience content that one has effortlessly divided into clear, manageable portions.

The HTML4 spec is very imprecise on what is a section and how its scope is defined. Automatic generation of outlines is important, especially for assistive technology, that are likely to adapt the way they present information to the users according to the structure of the document. HTML5 removes the need for <div> elements from the outlining algorithm by introducing a new element, <section>, the HTML Section Element.

Sectioning

Aware of our desire for legitimate elements to create computable sections, HTML5 offers <section>, <article>, <aside> and <nav>. Like some sort of obnoxious holiday rep’, I’ll introduce the topic of practical sectioning using these elements with a quick quiz. Study the following diagram. How many sections do you count?

An HTML5 page with header, aside and footer20

Multiple-choice answers:

  1. 1
  2. 2
  3. 3
  4. 4

The correct answer is (b), 2. We have included just one of HTML5’s new sectioning elements in the form of an <aside>. Because <footer>s and <header>s are not sectioning elements, what does that leave us with? The <body> tag is the outermost element, making the document itself a kind of section (a supersection, to be precise). So, there you have it: We’ve been using “sectioning” since HTML 1.0, just not with any subsections to speak of.

Some of you may have missed the clue earlier in this article and thought that <header> and <footer> were sectioning elements. Don’t fret; it’s not your fault. Whenever developers like myself try to explain HTML5 page structure, they usually brandish a diagram like the one I used above. In these diagrams, the boxes marked “header,” “aside” and “footer” exist in the same visual paradigm and occupy a similar area. They seem alike, you might say. The other culprit for this endemic confusion is the way the specification is written. Believe it or not, the document structure of some pages in the specification that refer to document structure is structurally unclear! This sort of thing sometimes happens when a standard is constantly evolving. The navigation tree for “4.4 Sections” found in this draft21 is laid out like so:

  • 4.4 Sections
    • 4.4.1 body
    • 4.4.2 nav
    • 4.4.3 article
    • 4.4.4 aside
    • 4.4.5 h1, h2, h3, h4, h5 and h6
    • 4.4.6 hgroup
    • 4.4.7 header
    • 4.4.8 footer
    • 4.4.9 address

You’d be forgiven for thinking that anything in this list qualifies as a sectioning element, absurd as some of them (<address>?) may sound. It’s only when you navigate to 4.4 Sections > 4.4.8 Footer that you’re told that “the footer element is not sectioning content; it doesn’t introduce a new section.” Thanks!

Despite these ambiguities in the spec’ itself, as well as in the surrounding publicity for HTML5, sectioning in practice just works. The following three axioms are probably all you’ll need to understand the algorithm:

  1. <body> is the first section;
  2. <article>, <section>, <nav> and <aside> make subsections;
  3. Subsections may contain more sections (subsections)

Aside from a few trifling details, that’s it. In a little while I’ll cover the completely unnecessary worry that is had over headings combined with sections. For now, let’s take another look at that example from before about footer ownership. This time, I’ll make a few HTML5 substitutions:

The diagram clearly shows the footer in the context of the document22
Note the lack of illustrated headings. Wherever a section is opened, it assumes responsibility for nesting: The heading type is unimportant. More on this soon …

The outline for this example looks like this:

  • Document
    • Article
      • Aside

Now that we’ve implemented sections, the boundaries are clear. Our document contains an article, which, in turn, contains an aside. There are three sections, each belonging to the last, and the depth of each section is reflected in the outline. Importantly, because sectioning elements wrap their contents, we know perfectly well where they end, as well as where they begin. And yes — screen readers like JAWS actually announce the end of sections like these! We know what content belongs to what, which makes deducing the purpose of the footer much easier. Because it exists outside the bounds of both the <article> and its <aside>, it must be the document’s footer. Here’s the same diagram again, with subsections faded out:

The same diagram with subsections faded out23

The power of sectioning lies in its ability to prescribe clearly defined boundaries, resulting in a more modular document hierarchy. The footer unequivocally belongs within the immediate scope of the highest-level section, giving assistive technologies and indexing parsers a good idea of its scope, which helps to make sense of the page’s overall structure.

Headings And Accessibility

When Sir Tim Berners-Lee conceived the <section> element all the way back in 199124, he envisioned the obsolescence of ranked heading levels. The thrust of the idea was that headings should act as mere labels for blocks of content, and the nature (i.e. the importance, scope, etc.) of the content would be calculated automatically based on the content’s standing in the document.

I would in fact prefer, instead of <h1>, <h2> etc for headings [those come from the AAP DTD] to have a nestable <section>..</section> element, and a generic <h>..</h> which at any level within the sections would produce the required level of heading.

Why is this preferable? Determining heading level systemically, based on nesting level, is much more dependable because it removes a layer of decision-making: By “producing” the required heading level automatically, we no longer have to decide separately which numbered heading we should include. It effectively prevents us from choosing the wrong heading level, which would be bad for parsable structure. A subsection must be subject to its parent section. Because this relationship between sections determines “level,” numbered headings are made redundant — hence, the proposed <h>.

A lot of fuss over nothing

Now, this is the supposedly tricky part; the part that causes all the consternation and gnashing of teeth. This is the part that caused Luke Stevens to write this diatribe26, and prompted Roger Johansson into a state of uncharacteristic apoplexy, asking, “are you confused too?27”. Ready?

In the WHATWG specification (in the same place where <footer>s were ostensibly classified as sectioning elements!), we are “strongly encouraged to either use only h1 elements, or to use elements of the appropriate rank for the section’s nesting level.” On first appearance, this seems contrary. Surely only one of these courses of action can possibly be right? What do you do? I’m thinking maybe the first option. Or the second. Who am I?

It certainly confused me, so I spoke with HTML Editor, Ian Hickson. He explained the outline to me in detail and I’m convinced it is perfectly robust. I’m going to do my best to explain it to you here.

Okay. As it turns out, we didn’t get the generic <h> element. This wouldn’t be backwards compatible because older browsers wouldn’t recognise it. However, headings that introduce sections are — regardless of their numbered level — treated as a generic <h>. Quite correctly, it is the section itself that takes responsibility for nesting in these situations — not the heading — and whenever you introduce a new section, you introduce a new nesting level without fail. What does this mean in practice? It means that we can introduce and benefit from the structural clarification offered by sections without abandoning heading levels. Take the following example:

<h4>Page heading</h4>
<p>Introductory paragraph...</p>
<section>
    <h3>Section heading</h3>
    <p>some content...</p>
    <h2>Subheading</h2>
    <p>content following subheading...</p>
    <section>
        <h1>Sub-subheading</h1>
        <p>content two levels deep...</p>
    </section>
</section>
<h5>Another heading</h5>
<p>Continued content...</p>

Our heading levels are all over the place. This is not recommended by the specification, but it helps demonstrate just how robust the HTML5 outlining algorithm really is. If we replace all the headings that open sections with a generic (“wildcard”, if you prefer) <h>, things become clearer:

<h>Page heading</h>
<p>Introductory paragraph...</p>
<section>
    <h>Section heading</h>
    <p>some content...</p>
    <h2>Subheading</h2>
    <p>content following subheading...</p>
    <section>
        <h>Sub-subheading</h>
        <p>content two levels deep...</p>
    </section>
</section>
<h5>Another heading</h5>
<p>Continued content...</p>

It’s important to note that the only errors revealed in the computed outline are ones relating to badly ordered numbered headings within the same section. In the original example, you’ll see that I’ve followed an <h3> with an <h2>. Because they are in the wrong order, the outline interprets them as being on the same level. Had I encapsulated the <h2> in <section>, this error would have been suppressed.

Well, how about that? If you’re not convinced, go ahead and paste my example into the test outliner28 and play around. It works just fine. In fact, it’s really difficult to break.

If you think there is a benefit to screen reader users, you may wish to adhere to the second of the two clauses from the specification and incorporate numbered headings that reflect nesting level. As demonstrated, this will have no effect on the outline, but since heading level (“Heading Level 2 – The Importance Of Sections”) is announced, it gives a clearer impression of structure to those who can’t see boxes inside boxes.

The assertation that heading levels are perpetually indispensable to screen reader users comes under pressure when you consider advancements being made by screen reader vendors. Screen readers like JAWS mark the territory of sections more clearly than headings, by announcing the beginnings and ends of sections and the thematic regions they represent (“Article End!”). From this perspective, using more than one <h1>s in your document might sometimes be applicable. You’ll come up against some accessibility experts who are keen on their “there can only be one [h1]!” mantra, but research shows29 that even in HTML4 or XHTML, this is not necessarily the case.

The approach you choose is yours to make; just employ some common sense and consistency. Bear in mind, though, that not all screen readers are able to announce the bounds of sectioned content. In these cases, there are measures you can take …

ARIA Enhancement

Transition to an HTML5 document structure is made smoother by incorporating some ARIA landmark roles30, which are both relatively well supported31 and somewhat analogous to the section-based navigation we should expect later. ARIA offers many more accessibility-specific features than baseline HTML5 could ever withstand; so, including “bolt-on” ARIA enhancements is certainly polite32. However, regarding ARIA roles as a substitute for semantic HTML would be a grave misconception.

Landmark roles, such as role="contentinfo" and role="banner", address accessibility only — not data mining — and each may be used only once per document. They are essentially shortcuts to parts of the page. HTML elements are more like building blocks, which are used in a repeated and modular fashion. So, while you can assist accessibility by placing role=”banner” into the <header> element closest to the document’s root, this does not preclude you from using <header> to introduce other sections:

The banner landmark role is used just once33

Are Sections The New <div>s?

This is a common misconception.

If it wasn’t clear already, it should be clear to you now that <div>s are semantically inert elements — elements that don’t really do or say anything. If this is clear, then it should also be clear that, when building a structured document, relying heavily on “an element of last resort34” makes for a very poor foundation.

If the new <section> element, for example, was just <div> with a new name, adopting it would be a straightforward matter of search and replace. It wouldn’t exactly be progress, though. The truth is, <div> still has a rightful place in the spec’; we’ve just given its organizational responsibilities to a team of elements that are better qualified. Sorry, <div>, old mate. What do we use <div>s for, then? Precisely what they were good at from the beginning: as a tool for “stylistic applications… when extant meaningful elements have exhausted their purpose35.”

For instance, you shouldn’t employ sections as box-model36 controlling measures like this…

<section class="outer">
      <section class="inner">
         <h1>Section title</h1>
      </section>
   </section>

… because there’s nothing that the outer section does that the inner section doesn’t. We’ve created two sections for one piece of content. A quick run through our outliner37 throws the “Untitled Section” warning:

  • [Untitled Section]
    • Section title

The brilliance of <div> in this context is that it refuses to affect the outline, which is why we can use it without fear of reprisal. This…

<section>
      <div>
         <h1&gtSection title</h1>
      </div>
   </section>

… averts disaster38 and results in this unsullied, if simplistic, outline:

  • Section title

Sections And Semantics

A lot of developers have trouble with the word “semantic.” You might even say that they don’t know what the word means, which (if you are familiar with the term) makes an interesting paradox. For instance, when Jeffrey Zeldman advocates39 for the “semantic” application of the id attribute, he’s kind of missing the point. The main purpose of semantic HTML is for the automated extraction of meaning from content. Applying a private, non-standard id to a <div> would not improve the semantics of the element one iota: Visitors can’t see it and parsers will ignore it. So much for the semantic Web40!

Sections are often characterized as the “semantic” equivalent of <div>. This is a half-truth at best, and I apologize for throwing the term “semantic” around so much — it’s become a bit of a shorthand. Some HTML elements are inherently semantic in that they prescribe specific meaning to their contents. The <address> element is a good example: When a parser reaches <address>, it knows that the contents should probably be interpreted as contact information. What it chooses to do with this knowledge is another matter, but it’s plausible that a screen reader could provide a shortcut to the address or a search engine could use it to refine its results pages.

Definition of syntax from Google search: The arrangement of words and phrases to create well-formed sentences in a language41

Sectioning elements are not so much semantic as syntactic. All <section> tells us is that it is a part of a whole. However, the syntactic contribution of sectioning elements to document structure is not unimportant. Consider the following sentence: If sections you don’t websites your are use obsolete. A lot of recognizable words are in there, but the lack of sensible syntax makes the sentence difficult to unpick. So it is with sectioning: You are not creating meaning so much as assembling it. Meaning isn’t always about the “thing”; it’s sometimes about what that thing’s role is amongst other things.

Microdata

Efficient, syntactically sound data structures are worthless if they are semantically lacking. Fortunately, HTML5 has both angles covered and provides a mechanism for attaching semantic meta data42, called “microdata43,” to our structured content. Using microdata, and by consulting schema.org44, you can define a page’s content as anything from a scholarly article to an exercise regimen. Unlike classes and IDs, this is information that can actually be interpreted usefully.

Google's structured data dashboard45
This microdata was found by Google and displayed in its “Structured Data Dashboard” for the WordPress theme Typical46.

Conclusion

HTML isn’t just an SDK or a Graphic Designer’s palette. It is a metalanguage, a language that tells you special information about information. Sometimes we — or, more precisely, the parsers we employ — benefit from added information about the subject, timing, origin or popularity of content. This is what APIs such as microdata and RDFa are for. Other times, the context, hierarchy, relative importance and codependence of the information are what need to be determined. This is where appropriate syntax, facilitated by sectioning elements, can be employed.

Some people will tell you not to bother with sectioning. They say that it’s hard work or that it doesn’t make sense. This is hokum. Sure, if you’re lazy, don’t bother with sectioning, but don’t pretend you’re doing it on principle. Using sections demonstrably enhances HTML structure without breaking accessibility. We’ve covered this.

Still, there will always be people who will attack this aspect of the specification. Perhaps we’ll enjoy some of these objections in the comments:

  1. They will point to bad implementations by specific vendors:
    These are bugs and bugs get fixed!
  2. They will cite the actions of large websites who don’t use sectioning elements:
    Just because large sites haven’t implemented sections doesn’t mean they wouldn’t like to. Since when does big mean ‘right’ anyway?
  3. They will flood you with examples of developers implementing sections badly:
    Some developers do stupid things and their misuse of HTML doesn’t stop at sections. I include myself here, by the way.
  4. They will present you with anecdotal evidence about user behavior within specific groups:
    It is expensive and impractical to address problems on a case-by-case basis. Fragmentation and complexity would also be inevitable: a loss for the majority of users.

I don’t think anyone would advocate making badly structured Web documents any more than they’d suggest building a house by stuffing a bag full of bricks and throwing it into a ravine. The case has been made and the specification bears it out: Sections aren’t just good for document structure — they finally make proper structure attainable. Some browsers and screen readers have some catching up to do, that’s for sure, but the situation is improving rapidly. Any kind of change is a little turbulent, but this kind is worth it.

Footnotes

  1. 1 http://www.w3.org/WAI/GL/wiki/Using_HTML5_section_elements
  2. 2 http://www.youtube.com/watch?gl=GB&hl=en-GB&v=OGg8A2zfWKg
  3. 3 http://en.wikipedia.org/wiki/Web_colors#Web-safe_colors
  4. 4 http://en.wikipedia.org/wiki/Netscape_Navigator
  5. 5 https://www.youtube.com/watch?v=ONaa4HxD-48&playnext=1&list=PL10DC9ABD97ED9957&feature=results_video
  6. 6 http://www.alistapart.com/articles/responsive-web-design/
  7. 7 http://www.smashingmagazine.com/2013/01/18/the-importance-of-sections/drawing1/
  8. 8 http://www.smashingmagazine.com/2013/01/18/the-importance-of-sections/drawing2/
  9. 9 http://www.smashingmagazine.com/tag/responsive-web-design/
  10. 10 http://webdesign.tutsplus.com/tutorials/htmlcss-tutorials/quick-tip-utilizing-normal-document-flow/
  11. 11 http://www.w3.org/TR/2011/WD-html5-20110525/content-models.html#flow-content-0
  12. 12 http://html5doctor.com/outlines/
  13. 13 http://paciellogroup.com/
  14. 14 http://www.smashingmagazine.com/2013/01/18/the-importance-of-sections/drawing3/
  15. 15 http://www.smashingmagazine.com/2013/01/18/the-importance-of-sections/leftandright-2/
  16. 16 http://dev.w3.org/html5/markup/footer.html
  17. 17 http://www.smashingmagazine.com/2013/01/18/the-importance-of-sections/tweet/
  18. 18 http://www.w3.org/TR/WCAG-TECHS/H42.html
  19. 19 https://developer.mozilla.org/en-US/docs/Sections_and_Outlines_of_an_HTML5_document
  20. 20 http://www.smashingmagazine.com/2013/01/18/the-importance-of-sections/sectioning1/
  21. 21 http://www.w3.org/TR/2011/WD-html5-20110525/sections.html
  22. 22 http://www.smashingmagazine.com/2013/01/18/the-importance-of-sections/sectioning2/
  23. 23 http://www.smashingmagazine.com/2013/01/18/the-importance-of-sections/sectioning3/
  24. 24 http://1997.webhistory.org/www.lists/www-talk.1991/0003.html
  25. 25 http://1997.webhistory.org/www.lists/www-talk.1991/0003.html
  26. 26 http://www.netmagazine.com/features/truth-about-structuring-html5-page
  27. 27 http://www.456bereastreet.com/archive/201103/html5_sectioning_elements_headings_and_document_outlines/
  28. 28 http://gsnedders.html5.org/outliner/
  29. 29 http://webaim.org/projects/screenreadersurvey3/#headings
  30. 30 http://www.paciellogroup.com/blog/2010/10/using-wai-aria-landmark-roles/
  31. 31 http://www.paciellogroup.com/blog/2011/07/html5-accessibility-chops-aria-landmark-support/
  32. 32 http://www.456bereastreet.com/archive/201004/built-in_or_bolt-on_accessibility_in_html5_how_about_a_bit_of_both/
  33. 33 http://www.smashingmagazine.com/2013/01/18/the-importance-of-sections/banner/
  34. 34 http://www.w3.org/wiki/HTML/Elements/div
  35. 35 http://en.wikipedia.org/wiki/Span_and_div
  36. 36 http://www.w3.org/TR/CSS2/box.html
  37. 37 http://gsnedders.html5.org/outliner/
  38. 38 https://twitter.com/jvhellemond/status/257850976478842880
  39. 39 http://www.zeldman.com/2012/11/21/in-defense-of-descendant-selectors-and-id-elements/
  40. 40 http://en.wikipedia.org/wiki/Semantic_Web
  41. 41 http://www.smashingmagazine.com/2013/01/18/the-importance-of-sections/syntax/
  42. 42 http://en.wikipedia.org/wiki/Metadata
  43. 43 http://googlewebmastercentral.blogspot.co.uk/2012/07/introducing-structured-data-dashboard.html
  44. 44 http://schema.org/docs/full.html
  45. 45 http://www.smashingmagazine.com/2013/01/18/the-importance-of-sections/typical/
  46. 46 http://typical-theme.heydonworks.com

↑ Back to top Tweet itShare on Facebook

Heydon has a love/hate relationship with CSS and a lust/indifference relationship with javascript. He writes a lot and makes fonts.

Advertising
  1. 1

    There are people who don’t know the difference between section and aside…. for example, there are some of my friends I work with that find it difficult to know…. although it might be the same , but its different because there is a capacity section can have that aside doesn’t …… nice article! :-)

    0
  2. 52

    Great Post. The point about syntactics and semantics is noted, and something which I will need to explore more. I’m still not persuaded by the concept of significantly expanding the HTML tagset.

    0
  3. 103

    Full Disclosure: I have read Luke Stevens’ “The Truth About HTML5″ (second edition) and I’ve also read countless online articles relating to the HTML5 sectioning elements and the HTML5 Document Outline. I’ve read the W3C spec, relating to this and I’ve got a stack of books that have been read as well.

    There is no doubt that this article explains sectioning in a way that many others don’t and there is a benefit to fully understanding how the sectioning elements should be used. But, I must say that Heydon Pickering seems (to me) to have as much of a bias “for” these elements as Luke Stevens has “against” them, with one important distinction and that is what I hope readers of this comment will focus on.

    I will do my best to explain as objectively and without injecting any “opinion”, but rather stick with facts as best as I can because after reading the back and forth comments between Stevens and Pickering, it is clear that there is *some* level of animosity there and that won’t help the rest of us build better web pages and applications.

    1. Heydon states that *why* and *how* these elements were initially created is not a material issue, but that is his opinion, not a fact. The fact is that understanding the history behind how we got these elements goes to the heart of the *intent* for their usage. Hixie intended to have elements that “paved the cow paths” in HTML5. Like them or hate them, the fact is that they do not do this — they create entirely new pavement that we are being asked to drive on. If a semantic web is the goal, as Heydon says it is, then *intent* for an element is literally the element’s semantics. After all, how many pages use the b element when strong was what Tim Berners-Lee would have intended for us to use? With these new sectioning elements, the intent doesn’t match the spec’s stated semantic meaning and that’s a problem for semantics.

    2. In reality, we must develop for the web we have, not the web we want. This simple paradigm is the basis for every work-around, hack, shim and polyfill that has ever been created. Heydon states that incorrect implementations of the document outline are bugs and “bugs get fixed”, so we should carry on with these elements under the guise that at some future point the web we get will line up with code we have today. This is a patently false assumption that has over 20 years of web history to prove that this is not how it works. We all know perfectly well that IE8 was released in 2009 – – that’s 5 years ago folks. We all also know that there are still enough IE8 users out there for us to worry about how to polyfill for it or gracefully degrade within it. Just ask why so many books, articles and blogs pound Remy Sharp’s IE8 shim into our brains. We cannot simply leave these users behind because they haven’t caught up to the web we wish we had. We must build for the user base we have today and progressively enhance pages for them to stay current with new technologies. Adding these elements in the hopes that technology will catch up to us is the very definition of “Feature Creep”.

    3. Heydon’s article does a fine job detailing how one might implement these elements, but as I read it, the only real case he makes for why they are better than the heading elements (that still contribute to document outlines) alone is that with heading elements, it’s hard to figure out where sections end. Frankly, that’s more opinion. We have 20 years of experience with heading elements and the vast majority of developers/designers I’ve interacted with consider them in the “101” category of complexity. If you can make a Word document that is a report with headings and sub-headings, you understand headings in HTML.

    4. If you agree that the purpose for using these elements is better semantics, better accessibility for AT and better SEO, then you must determine their usefulness by how much they help us in those areas alone. You can’t say we like these elements because of “A” and then offer proof that relates to “B”.

    Is div id=”article” semantically better than article alone?
    The answer must unequivocally be “no”, since they both DO convey that they are each an article. Sure, you can say that article is shorter, but that’s not semantics, that’s “syntactic sugar” and there will never be a clear winner when it comes to debating purely stylistic coding decisions. By the way, if you consider that many HTML5 developers are nesting div elements inside these new sectioning elements for either fall-back purposes or for style hooks, then article isn’t less code, it’s more.

    Do these elements better serve us with AT?
    The answer is again, an unequivocal “no”, since many ATs either don’t use the HTML5 document outline or implement it incorrectly at this time. I freely admit that this *may* change in the future and, if so, we should give these elements another look. But, the possibility is also that this may not change in the future, in which case using these elements for AT is a red-herring.

    ARIA, on the other hand does work with AT and is well-supported. Why reinvent the wheel here?

    Do these elements help with SEO?
    No. There is no evidence to support that the use of these elements helps SEO or that the lack of their use hurts SEO. Google has not changed their ranking algorithm to account for HTML5 – that’s a fact.

    If we want semantics that can help with SEO, we have it today with Schema.org.

    5. Fully half of Heydon’s article is about how div is inadequate to truly section a document and that it is only CSS trickery that makes us believe that they do. And, he’s right. But that’s not news. What took him many paragraphs to say, I’ll say with all that needs to be said: the div element is semantically neutral (as is the span element, by the way). Does this make div useless? Certainly not! We need neutral elements as style hooks AND if we adorn them with ARIA, they magically do take on semantic meaning. With all that Heydon says drilling his point that div’s don’t create *actual* sections, what he glosses over is what I’ve already said – – we don’t need them to create the sections in the document outline, we use headings for that purpose. And, again, Heydon doesn’t prefer this because he finds it difficult to know where a section ends. That’s his opinion and he’s entitled to it, but it is not evidence for why sectioning element are, as a matter of *fact* better than headings.

    6. Heydon does not address, at all, how we are to style headings that are used within sections. This is a glaring omission. If a consistent look for all h1’s is desired, how am I to set up CSS to address the fact that some of my h1’s will be in the main section, some will be in an article or an aside, and still some will be in an aside, nested within an article that is nested within a section that is, itself, nested within another section? Am I to write CSS classes for all these possibilities? You might answer that this is a problem with a simple solution – create 6 CSS classes for the various heading elements and adorn each usage with the desired class. For some situations, this may be just fine. But, what if the content is coming from other content sources and I am just republishing it (such as a blog post or Twitter feed)? In those cases, I can’t modify the HTML to include these CSS adornments. So, now what?

    SUMMARY:

    1. These elements are a solution in search of a problem.

    2. Traditional headings (alone – without the new sectioning elements) still make valid HTML5 document outlines. And, these outlines are correctly implemented, TODAY, in all AT’s.

    3. These elements do not add any semantic value that we cannot already implement through ARIA and Schema.org.

    4. These elements do not affect SEO.

    5. These elements WILL cause untitled sections when headings are not present within them, which CREATES an AT problem that didn’t exist prior to their introduction.

    6. The “one distinction” between Heydon’s affection for these elements and Luke’s loathing of them that I made mention of at the beginning of this post is that Heydon doesn’t present anything other his opinion that sections (the old way) were hard (we’ve already dispelled the other incorrect notions about better semantics and better AT support). Luke provides real, testable and verifiable evidence for why using these elements (today anyway) is not a good idea.

    If you’ve read this far, I thank you for your attention. Again, I’ve tried very hard to stick to the facts here and not include any hyperbole or opinions. Everything I’ve stated is accurate and verifiable. But, I must conclude with this:

    When used correctly, with an understanding of how they work, you CAN use these sectioning elements and cover all your bases. That is a choice that we all must face, just as we do with any/all new technologies that come down the pike. My overall point here is that any argument for using them that includes anything but personal preference is a false-argument. Finally, all of this is true as of 3/11/14. When (not if) the landscape changes, I will adjust my practices accordingly. Just as we all should do with any new options.

    Thank you for your time and attention.

    Scott Marcus
    IT Educator for over 20 years.

    3

Leave a Comment

Yay! You've decided to leave a comment. That's fantastic! Please keep in mind that comments are moderated and rel="nofollow" is in use. So, please do not use a spammy keyword or a domain as your name, or else it will be deleted. Let's have a personal and meaningful conversation instead. Thanks for dropping by!

↑ Back to top