Dealing With Redundant, Out-Of-Date And Trivial (ROT) Content

Publishing content to the web is expensive. I know what you’re thinking: no, it’s not; it costs nothing, especially when compared to print. And you would be right, from a certain point of view. The problem is that publishing is cheap. This seduces you, encouraging you to put more and more content online.

In fact, the cost is so low that many organizations let almost any employee put content online. They install a content management system and give staff free rein. Even those who enforce standards for consistency and accuracy still produce a lot of content. After all, somebody might find that piece of content useful.

But you will soon discover hidden costs. Costs that are crippling larger organizations.

The Hidden Cost Of Content

Although there is a cost to producing this content in the first place, there is a far higher cost in maintaining that content over time. It costs huge amounts of money and time to review content on a regular basis and ensure it is still accurate and relevant. This is especially true when some organizations have millions of pages online. In the end, many companies just give up. We often forget content once we hit “Publish”, unless it is a particularly prominent piece.

The hidden cost is not limited to maintenance. It also impacts the usability of sites. With so much content online it can be hard for users to find the content that is useful. For example, at one point microsoft.com had over 10 million pages online [1], over 3 million of which had never been visited. This clutter only succeeded in damaging findability and lowering customer satisfaction.

Large sites like microsoft.com have millions of pages, many of which are ROT. [2]

But there is a final hidden cost: a cost to an organization’s ability to evolve its site over time. Take, for example, a company that wants to make its site responsive. In theory we can do this with some updates to the CSS or, at most, the templates in the content management system. But when you have millions of pages produced over an extended period of time this is often not the case. Content producers will have marked up content in a variety of ways, making design changes hard.

Many digital teams give up on the idea. Instead, they redesign the core site and leave legacy content alone. This leads to a fragmented experience as users struggle to adapt to the changing user interface across the site.

How can this enormous challenge be overcome? It begins by addressing the ROT on your site.

What Is ROT?

ROT stands for redundant, out-of-date and trivial. Much of the content on our sites falls into one of these three categories. ROT is a huge problem on many larger websites.

The European Commission recently undertook a content audit. It removed a staggering 80% of its online content because it was ROT. This created a better user experience while reducing costs. It also allowed them to evolve their digital offering.

The European Commission removed a staggering 80% of its content. [4]

Much of the content that organizations put online is trivial. It caters to edge cases that most users do not care about. Yet it takes time and effort to maintain and makes finding important content harder.

But even important content can become ROT. As an organization evolves so should its online content. Yet it often doesn’t and that content becomes redundant.

Finally, a lot of the content we put online has a limited shelf life. Think of events that have come and gone, or news stories from years ago that clutter up search results.

Sooner or later this ROT will need addressing. But how do you do that?

Start From A Clean Slate

Often the best solution is to start from scratch. On larger sites even auditing what content you have online is too expensive in both time and money. It is not uncommon for digital teams to be unaware of all the content that exists. In such situations the best they can do is migrate content. But that is like putting lipstick on a pig. It doesn’t address the underlying issue.

Instead, many organizations are starting from scratch. They are beginning with user needs by identifying top tasks and producing content around those. This allows them to migrate only the relevant content, often rewriting it as they go.

This is exactly the approach adopted by the Government Digital Service [6] when working on the beta for GOV.UK. They translated the content on existing government websites into user needs, such as “I need to report a lost passport.” They then passed these needs through a series of criteria to judge whether each need was worth addressing. They tracked this process through a small web app they created called the Needotron [7].

Unfortunately, in many organizations the digital team would be unable to take such radical action. They often do not own the content and so do not have the authority to remove it. I could argue that this shouldn’t be the case but I doubt that would make any difference. Instead, let’s look at some options that might be more feasible.

Removing The Redundant

The first area to target is redundant content: products or services that no longer exist; campaigns that have long since ended. These are easy to spot, appearing in navigation, search results and analytics.

Addressing this content is often easy, too. Nobody much cares for redundant content and so you won’t hear many complaints when you remove it. What is more, there is less of it so the digital team has the capacity to deal with it.

Out-of-date content is a trickier challenge.

Dealing With The Out-Of-Date

Out-of-date content is harder to spot. It is that phone number that no longer works, or a reference to a member of staff who has left; it is that event buried in the events calendar, or a mention of a product that no longer exists. You can find this kind of content deep within pages or subsections on a site.

There is also a lot more out-of-date content than completely redundant content, too much for the digital team to track down. This is going to involve a degree of automation and the cooperation of content producers across the company.

One approach is to archive content such as news and events after an agreed amount of time. This removes the content from site search and navigation, but keeps it available for those who want it. But what about content with a less obvious end date?
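As a rough illustration of the archiving rule, here is a minimal Python sketch. Everything in it is an assumption made for the example: the `Item` record, the `ARCHIVE_AFTER` cut-off of one year and the function names are hypothetical, and a real CMS would expose its own fields for this.

```python
from dataclasses import dataclass
from datetime import date, timedelta

# Hypothetical cut-off; the "agreed amount of time" is for each organization to decide.
ARCHIVE_AFTER = timedelta(days=365)

# Only time-limited content types are archived automatically in this sketch.
TIME_LIMITED_KINDS = {"news", "event"}


@dataclass
class Item:
    title: str
    kind: str      # e.g. "news", "event" or "page"
    ends_on: date  # publication date for news, end date for events


def is_archived(item: Item, today: date) -> bool:
    """Return True once a news or event item is older than the cut-off."""
    if item.kind not in TIME_LIMITED_KINDS:
        return False
    return today - item.ends_on > ARCHIVE_AFTER


def searchable(items: list[Item], today: date) -> list[Item]:
    """Items that should still appear in site search and navigation."""
    return [item for item in items if not is_archived(item, today)]
```

Archived items are only filtered out of search and navigation here; nothing is deleted, so the content stays reachable by its URL.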

The best approach is to establish a policy that enforces content review. This will make sure content producers check that their content is not out of date. For example, it might require people to log in to the content management system once every six months to check their content.

If out-of-date content cannot be removed it should at least be marked as out of date. [8]

Of course, there will need to be consequences if they don’t do that, otherwise they just won’t bother. This will involve removing the offending page from the main navigation and search results, as well as adding a banner to the page. The banner will warn users that the content may be out of date, a technique the BBC has used in the past.

It would be easy enough to use the last modified date in your CMS to trigger an email telling the person who created the page to check it. If that person has left the company and nobody else is supporting the content it needs flagging anyway.
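To make that idea concrete, here is a minimal sketch in Python. It assumes a hypothetical `Page` record holding the last modified date and the owner’s email address; the six-month `REVIEW_INTERVAL` follows the example given earlier, and sending the email itself is left out.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import Optional

# Roughly six months; the exact interval is a policy decision.
REVIEW_INTERVAL = timedelta(days=182)


@dataclass
class Page:
    url: str
    last_modified: date
    owner_email: Optional[str]  # None when the owner has left and nobody took over


def review_actions(pages: list[Page], today: date) -> list[tuple]:
    """Decide what to do with each page that is overdue for review.

    Returns (action, recipient, url) tuples: "email" when there is an owner
    to remind, "flag" when the page has no owner and needs flagging anyway.
    """
    actions = []
    for page in pages:
        if today - page.last_modified < REVIEW_INTERVAL:
            continue  # modified recently enough, nothing to do
        if page.owner_email:
            actions.append(("email", page.owner_email, page.url))
        else:
            actions.append(("flag", None, page.url))
    return actions
```

The function only decides what should happen. Keeping the decision separate from the mail sending makes the policy easy to test and to adjust.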

You could go further and notify content producers if their content fails to reach a traffic threshold or a minimum dwell time. The possibilities are limitless. But be careful you don’t chase a false metric. Traffic and dwell time are not always the best measure for all content.

But What About Trivial Content?

The hardest type of ROT to deal with is trivial content, because you will face disagreement over what is trivial. What you consider an edge case might be business critical to another member of staff.

To address this problem you need a set of criteria to assess the value of content. These should be:

  • Analytics
  • Users’ top tasks
  • Business objectives

First, you should look at the amount of traffic hitting a page. Falling below a certain traffic threshold should flag it for review. This does not mean that the content is trivial; it is just a way to find content that might be trivial.

Next, you should compare that content with a list of top tasks [10] you know users want to complete. You do have a list like that, don’t you? This should be the major criterion for judging if something is trivial. If the content is not on that list then we have a potential problem.

Of course, a task might not be particularly important to the majority of users and yet be business critical to the organization. Only a fraction of users of a site go on to buy, but this is still considered an important action!

This means it is important to ask whether a piece of content supports one of the top two or three objectives of the business. Supporting some minor business goal is not enough.

If the content meets none of these criteria then it is trivial. But that doesn’t mean you should remove it. Some content needs to be online for regulatory reasons or is of crucial importance to a small but valid user group.

The key here is to ensure it does not interfere with the findability of more important content. You could make it accessible only via search, or maybe remove it from the main site completely. It is often much easier to point people at a specific page via social media, email or another communication channel than to expect them to navigate through the hierarchy of a site to find an obscure page.
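The three criteria and the exceptions above can be sketched as a small triage function. This is only one possible reading: it treats content as trivial when it meets none of the criteria, and the `Content` record, the `TRAFFIC_THRESHOLD` value and the outcome labels are all hypothetical.

```python
from dataclasses import dataclass

# Hypothetical threshold; pick one that suits your own analytics.
TRAFFIC_THRESHOLD = 100  # visits per month


@dataclass
class Content:
    url: str
    monthly_visits: int
    supports_top_task: bool        # matches one of the users' top tasks
    supports_key_objective: bool   # supports a top business objective
    must_stay_online: bool = False # regulatory need or a small but valid user group


def assess(content: Content) -> str:
    """Classify content as "keep", "demote" or "candidate for removal"."""
    meets_a_criterion = (
        content.monthly_visits >= TRAFFIC_THRESHOLD
        or content.supports_top_task
        or content.supports_key_objective
    )
    if meets_a_criterion:
        return "keep"
    if content.must_stay_online:
        # Trivial, but it has to exist: take it out of navigation and
        # leave it reachable via search or a direct link.
        return "demote"
    return "candidate for removal"
```

A result of "candidate for removal" should start a conversation with the content owner, not trigger an automatic delete.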

A Difficult But Important Challenge

Dealing with ROT can feel intimidating on a large site. In fact, it can feel impossible. But it isn’t. Often it is just a matter of putting some processes in place to deal with it.

I would encourage you not to dismiss the clean slate approach out of hand. You may think it will be out of the question in your organization but you may well be wrong. If you create a prototype that gives people a sense of how much better the site could be, they are often more amenable than you think. Now is not the time to be timid. Now is the time to confront the ROT.

(ml, og)

Footnotes

  1. http://www.gerrymcgovern.com/new-thinking/removing-poor-quality-content-increases-customer-satisfaction
  2. https://www.smashingmagazine.com/wp-content/uploads/2015/06/01-microsoft-opt.jpg
  3. https://www.smashingmagazine.com/wp-content/uploads/2015/06/01-microsoft-opt.jpg
  4. https://www.smashingmagazine.com/wp-content/uploads/2015/06/02-EC-opt.jpg
  5. https://www.smashingmagazine.com/wp-content/uploads/2015/06/02-EC-opt.jpg
  6. https://gds.blog.gov.uk/2011/09/19/introducing-the-needotron-working-out-the-shape-of-the-product/
  7. https://github.com/gds-attic/need-o-tron
  8. https://www.smashingmagazine.com/wp-content/uploads/2015/06/03-ofd-opt.jpg
  9. https://www.smashingmagazine.com/wp-content/uploads/2015/06/03-ofd-opt.jpg
  10. http://alistapart.com/article/what-really-matters-focusing-on-top-tasks

Paul Boag is the author of Digital Adaptation and a leader in digital strategy with over 20 years’ experience. Through consultancy, speaking, writing, training and mentoring he passionately promotes digital best practice.

  1. 1

    A very bold article. Looks like you’re missing a closing tag in the “Start From A Clean Slate” chapter. Good article.

    1
  2. 2

    Too much bold.

    -2
  3. 3

    I think you left an open **strong** tag somewhere! Most of the article is bolded.

    1
  4. 7

    Nick Hilhorst

    June 11, 2015 8:28 am

    You describe redundant as “products or services that no longer exist”, but wouldn’t that be Out-Of-Date? They existed at one point, but no more. Isn’t redundant more about unnecessary duplication of information?

    Also, from the last paragraph of “Start From A Clean Slate” onwards, everything is in bold. I doubt that’s how you meant it to be.

    0
    • 8

      That is actually a fair comment. Hadn’t thought of that. Your example of redundant content is better! You need to write the next one :)

      0
      • 9

        Nick Hilhorst

        June 14, 2015 8:05 am

        Just knowing this one term might not be sufficient qualification for writing an entire article ;-) I have a great interest in web design, but I’m not a web designer myself. I worked as both a programmer and a sys admin, so that’s where I learned about redundancy. I like the way you apply it to the content of a website, which was new to me, and which made this a worthwhile read. Thanks! :)

        0
  5. 10

    This article comes straight out of a book.

    http://contentstrategy.com/book.html

    0
    • 11

      I am embarrassed to say I have never read Kristina’s book. That said, she is a wise woman who I have been following for years. She has obviously rubbed off on me!

      1
  6. 12

    Thanks for addressing this issue! One more hidden cost is the huge environmental toll taken by millions of servers burning coal energy to store this “information,” requiring incredible amounts of air conditioning and back-up generators, which are using even more energy.

    1
  7. 13

    Great article. I was wondering how you’ve confronted the issue with clients when you’re not the content owner. I find clients tend to be very protective of their legacy content and aren’t typically into hearing about refining it. Have you had any luck converting the content hoarders?

    0
  8. 15

    As an SEO, the premise of this article makes me want to cry. A decision like this is about the simplest way I can imagine to lose traffic. Bold? Maybe, but ultimately a source of major waste and, more than likely, a foolish decision that will give your SEO team nightmares.

    If you can’t commit to managing content, in my humble opinion, you shouldn’t produce it. Stick to writing generalities that will never need to be updated. Like this article :)

    -2
    • 16

      I am not sure I follow, Jon. I can understand how removing content would have an SEO impact, but archiving it or marking it as out of date would not.

      That said, I wholeheartedly agree that companies need to stop biting off more than they can chew in terms of content management.

      0
    • 17

      SEO is definitely an important consideration, and any content audit needs to take it into account.

      Having said that, I think the central premise of the article is correct. A site with great SEO might get lots of visitors, but if the content is out of date or too hard to locate once they get there, the user is likely to go somewhere else to find what they want.

      1
      • 18

        Exactly that. I totally agree with the overall sentiment that removing redundant content is important for a good user experience. While currently working on a content migration project and using it as an opportunity to ‘clean up’, I face arguments to migrate everything as the existing content has more value than a redirect (to relevant content) in terms of ranking. Traffic is often the ‘success metric’ and as a content manager it’s difficult to argue for something which goes against the KPIs, which generally ignore UX in favour of SEO.

        0
  9. 19

    While I have never dealt with a site with a million+ pages, one of my least favorite things in the world is going through and evaluating old and forgotten content. You’re right that it would be so much easier for ALL parties if they just adopted a strategy and set of guidelines from the start because doing an audit never seems to be a priority and often gets put off for way too long!

    -2
  10. 20

    A content audit or review is in many cases a very time-consuming task. In my opinion, starting from scratch, where you consider whether to keep or delete each piece of content, is the only way to go. This will clearly take more time than just copying and pasting everything (which in many cases can already take a lot of time), but the quality assurance is worth all the effort.
    Consider each piece of content individually: give it a quality check and an accuracy check, rewrite it, check spelling and grammar, and check that the markup does not contain any unnecessary code (we all know the ones who copy and paste from Word).

    There should definitely be some guidelines or benchmarks for keeping or deleting content. Analytics data like page visits and bounce rates could be some. If the page didn’t get any visitors in the last year, delete it.
    Another solution could be to merge content from pages that cover the same or similar topics, thereby cutting down on single pages. Instead of deleting all the “bad” content, you give it a new chance in a rewritten version.
    This, of course, always depends on what kind of website you are dealing with. Is it a web shop with campaigns that have expired, or some kind of governmental site with content that is bound by law and has to be accessible at all times?

    As for keeping your content up to date, some sort of notification system with, e.g., emails, as mentioned in the article, could be one way. Another solution could be to strongly reinforce the content owner’s or writer’s feeling of ownership of each piece of content. Give them information about their content on a regular basis. How many page hits does their content receive, and how does it rank? What is the bounce rate of their page? Did the reader perform any action on the content, like following a link or signing up for a newsletter? Are there comments, feedback or broken links?

    This will actually give the content owner some useful feedback; he can now rewrite the content to keep it up to date on a regular basis and make it perform better over time. Often, when many content editors are writing and updating the same website, they tend to lose track of the pages they were editing and forget to come back to check up. Collect all the site data and present it on a login screen or dashboard in the CMS for the content owner, and he will have all the right information about his content at a glance when he logs in to the CMS.

    And then there is the whole story about organisational/leadership backing for spending so much time updating and nursing the website, but that is an entirely different story ;-)

    0
  11. 21

    A timely article, Paul. Many organizations are struggling with this issue. From an SEO perspective, it is crucial to manage what is in the search engine indexes. This is not a blank slate. It should be included in any content audit (in my opinion).

    1
  12. 22

    Joy Moskovic

    July 3, 2015 2:10 pm

    Good article on a topic often overlooked. Wondering if anyone has undertaken a ROT exercise on an internal enterprise social network or wiki.

    0
