Broken links detection
Just like our mixed content checks, every site that gets added to Oh Dear will automatically have all its pages crawled to report HTTP errors (like missing pages, server side errors or timeouts).
If we find the same broken links in subsequent crawls we won't drown you in notifications, but we'll send you a reminder that there are broken links after 7 days.
Reporting broken links
Oh Dear will crawl your site and all its pages and create a summary for you with all pages that meet these criteria;
- The HTTP status code is not a
- The page took longer than 5 seconds to load
We treat all pages in the
HTTP/399 range to be OK, as long as they load in less than 5 seconds. Anything that deviates from that pattern will be reported.
Internal vs. external links
By default we will only report on internal pages of your site that are broken. In other words: pages on the same (sub)domain of your main website.
We can also check for the external pages you link to. If you care about sending your visitors to 3rd party websites, you can check the option to Include external links in the Settings screen. From then on, we'll follow links that move away from your own (sub)domain and check they also respond OK.
This protects you from linking to offline pages & broken or invalid URLs. After all, if you guide your visitors to an external URL, you'd rather have it be a working one - right?
Exclude your own URLs
Some URLs will trigger a 404 on purpose or might be more difficult to fix in the short-term. To help your notifications, you can ignore particular URL patterns per website in your settings screen per site.
You can add URLs using a simple pattern. Here are some examples.
https://your-domain.com/private/* https://your-domain.com//forum/admin/* https://your-domain.com//*/edit https://external-domain.com/* *forum*
The example above will ignore any crawl errors on the URLs that start with
/forum/admin/ or that have
/edit anywhere in the URL.
We also exclude several particular URLs globally, to avoid making these reports too noisy.
Globally Excluded URLs
We automatically exclude the following URLs in our reports. These are often used for social media sharing and will trigger false positives.
https://linkedin.com/* https://*.linkedin.com/* http://linkedin.com/* http://*.linkedin.com/* https://news.ycombinator.com/* https://www.facebook.com/sharer/sharer.php* http://www.reddit.com/submit?* https://github.com/issues?* https://github.com/pulls?* https://itunes.apple.com* https://docs.spatie.be/join* https://www.tumblr.com* https://tumblr.com* http://www.tumblr.com* http://tumblr.com* https://twitter.com* https://www.twitter.com* http://twitter.com* http://www.twitter.com* https://t.co*
If you've noticed a set of URLs we should globally exclude, please get in touch.