I’ve been working on a talk that I’m giving in June (it was originally supposed to be last week, but it’s been postponed) and I needed an example of a soft 404 page. I ended up finding a few on some well-known websites, where you wouldn’t expect to see them, so I thought that I’d point them out.
First things first, though…
What is a soft 404?
When you access a page that doesn’t exist on a website, the server gives an error code (a 404 code, a.k.a. a “Page Not Found” code) saying that there’s nothing there: no file or webpage could be found. When a page does exist, the server gives a 200 code (a.k.a. an OK code).
A soft 404 is when you access a page that doesn’t exist but the server gives a 200 code instead of a 404 code. In other words, the website might show a “Page Not Found”-style page to visitors, but search engines such as Google may see the 200 code and think that it’s an actual live page…
For more info, Google do a good job explaining soft 404s on their Soft 404 Errors page.
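To make the definition a bit more concrete, here’s a minimal Python sketch. The function name `classify_response` and the list of “not found” phrases are my own assumptions for illustration; search engines almost certainly use far more sophisticated signals than a phrase match.

```python
# Hypothetical phrases that suggest an error page (an assumption, not an official list).
NOT_FOUND_PHRASES = ("page not found", "not found", "doesn't exist", "does not exist")


def classify_response(status_code: int, body: str) -> str:
    """Label a response using its status code and a rough look at the page text."""
    if status_code == 404:
        # The server correctly reports that nothing is there.
        return "proper 404"
    if status_code == 200:
        text = body.lower()
        if any(phrase in text for phrase in NOT_FOUND_PHRASES):
            # Looks like an error page to a visitor, but claims to be OK.
            return "soft 404"
        return "ok"
    # Redirects, server errors and so on are outside this sketch.
    return "other"
```

The phrase match is only a heuristic: a genuine page that happens to contain the words “not found” would be mislabelled, so treat the result as a prompt to check by hand.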
The problem with soft 404s from an SEO POV
If Google sees the page and thinks it’s OK (a 200 code), it may index it. If accessing a non-existent URL redirects to a “Page Not Found”-style page (e.g. domain.com/zxcvbnm takes the user to domain.com/404), then only one page, domain.com/404, could be indexed. But if it doesn’t redirect (e.g. domain.com/zxcvbnm shows a “Page Not Found” page at that same URL, and domain.com/asdfghjkl and domain.com/qwertyuiop do the same), then there’s the risk that any and all error pages could be indexed.
Google hates duplicate content, so if it happens to index a few dozen or a few hundred soft 404s, it might think that you’re trying to game the system – even though it’s a genuine and innocent mistake – and your whole site could suffer as a result.
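The difference between the redirecting and non-redirecting cases can be sketched in a few lines of Python. Everything here is illustrative: `ProbeResult` and `indexable_error_urls` are names I’ve made up, and the data is the domain.com example from above rather than anything fetched from a real site.

```python
from typing import Iterable, NamedTuple, Set


class ProbeResult(NamedTuple):
    requested_url: str  # a made-up URL that should not exist
    final_url: str      # the URL that answered after any redirect
    final_status: int   # the status code of that final response


def indexable_error_urls(results: Iterable[ProbeResult]) -> Set[str]:
    """Return the distinct URLs that serve an error page with a 200 code."""
    return {r.final_url for r in results if r.final_status == 200}


# Case 1: every bad URL redirects to a single error page.
redirecting = [
    ProbeResult("domain.com/zxcvbnm", "domain.com/404", 200),
    ProbeResult("domain.com/asdfghjkl", "domain.com/404", 200),
    ProbeResult("domain.com/qwertyuiop", "domain.com/404", 200),
]

# Case 2: every bad URL shows the error page in place, with no redirect.
in_place = [
    ProbeResult("domain.com/zxcvbnm", "domain.com/zxcvbnm", 200),
    ProbeResult("domain.com/asdfghjkl", "domain.com/asdfghjkl", 200),
    ProbeResult("domain.com/qwertyuiop", "domain.com/qwertyuiop", 200),
]

print(len(indexable_error_urls(redirecting)))  # 1
print(len(indexable_error_urls(in_place)))     # 3
```

In the first case there is only ever one URL to index, however many bad links exist. In the second, the count grows with every mistyped or broken link that Google comes across.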
Finding examples of soft 404 pages
When I was researching the talk, I asked people on Twitter if they knew of any examples of soft 404 pages, but unfortunately didn’t have much luck (many people passed on examples of correctly-working 404 pages). Then it hit me: a Google search for “page not found” would do the trick, although you do have to dig down a few pages of results in order to find some.
Checking your 404 page
Curious to know if your 404 page is actually showing a 404 code? Use SEOBook’s status code checker. There are plenty of other similar tools out there, but this one is my favourite. Below each example I’ll include a link to its results in the tool, so that you can see for yourself. You can also use Google Webmaster Tools’ Fetch as Google, if you’d prefer to see what Google itself makes of the page…
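If you’d rather check from your own machine, here’s a rough sketch using only Python’s standard library. It isn’t how SEOBook’s tool or Fetch as Google work (I don’t know their internals); `probe_url` and `fetch_status` are hypothetical helper names, and the user agent string is a placeholder.

```python
import urllib.error
import urllib.request
import uuid
from typing import Tuple


def probe_url(base_url: str) -> str:
    """Build a URL that almost certainly doesn't exist on the site."""
    return base_url.rstrip("/") + "/" + uuid.uuid4().hex


def fetch_status(url: str, timeout: float = 10.0) -> Tuple[int, str]:
    """Return (status code, final URL) after following any redirects."""
    request = urllib.request.Request(
        url, headers={"User-Agent": "soft-404-check"}  # placeholder user agent
    )
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status, response.geturl()
    except urllib.error.HTTPError as error:
        # urllib raises for 4xx/5xx responses, but the code is still what we want.
        return error.code, error.url
```

Calling `fetch_status(probe_url("https://example.com"))` (swap in your own site for the placeholder) gives you the status code and the URL that finally answered. A 404 means your error page is behaving; a 200 for a URL that can’t possibly exist suggests a soft 404.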
Speaking of which… onto the fun bit now. Here are 4 big websites that currently* have soft 404 pages.
* Note: obviously if you’re reading this post weeks, months or even years after its publication date, they might have been fixed by then, so please bear that in mind…
Mozilla’s soft 404 was the first one that I came across, which was the inspiration for this post and the example that I’ll be using for my talk in June. I like their 404 page (especially the fact that the purple guy’s eyes move and blink every so often), but I imagine the fact that it’s a soft 404 is simply an oversight or an accident.