Broken links detection

Just like our mixed content checks, every site that gets added to Oh Dear! will automatically have all its pages crawled to report HTTP errors (like missing pages, server side errors or timeouts).

If we find the same broken links in subsequent crawls we won't drown you in notifications, but we'll send you a reminder that there are broken links after 7 days.

Reporting broken links

Oh Dear! will crawl your site and all its pages and create a summary for you with all pages that meet these criteria;

  • The HTTP status code is not a HTTP/2xx or HTTP/3xx
  • The page took longer than 5 seconds to load

We treat all pages in the HTTP/200 to HTTP/399 range to be OK, as long as they load in less than 5 seconds. Anything that deviates from that pattern will be reported.

Internal vs. external links

As we crawl your site, we can report all broken links on the same domain. We will also report on sites that you link to that are broken.

This protects you from linking to offline pages & broken or invalid URLs. After all, if you guide your visitors to an external URL, you'd rather have it be a working one - right?

Exclude your own URLs

Some URLs will trigger a 404 on purpose or might be more difficult to fix in the short-term. To help your notifications, you can whitelist particular URL patterns per website in your settings screen per site.

You can add URLs using a simple pattern. Here are some examples.

https://your-domain.com/private/*
https://your-domain.com//forum/admin/*
https://your-domain.com//*/edit
https://external-domain.com/*
*forum*

The example above will ignore any crawl errors on the URLs that start with /private/, /forum/admin/ or that have /edit anywhere in the URL.

We also exclude several particular URLs globally, to avoid making these reports too noisy.

Globally Excluded URLs

We automatically exclude the following URLs in our reports. These are often used for social media sharing and will trigger false positives.

https://linkedin.com/*
https://*.linkedin.com/*
http://linkedin.com/*
http://*.linkedin.com/*
https://news.ycombinator.com/*
https://www.facebook.com/sharer/sharer.php*
http://www.reddit.com/submit?*
https://github.com/issues?*
https://github.com/pulls?*
https://itunes.apple.com*
https://docs.spatie.be/join*
https://www.tumblr.com*
https://tumblr.com*
http://www.tumblr.com*
http://tumblr.com*

If you've noticed a set of URLs we should globally exclude, please get in touch.