What is Uptime Monitoring? A Complete Guide for 2026
Everything you need to know about uptime monitoring — how it works, what to measure, how to set alert thresholds, and why every website needs it regardless of size.
Your site just went down. When did it happen? Who was first to know — you, or a frustrated customer who posted on social media?
Uptime monitoring ensures the answer is always you, and that you found out within minutes, not hours. This guide explains how it works, what you should measure beyond simple HTTP checks, and how to configure alerts that actually help rather than just generating noise.
What is uptime monitoring?
Uptime monitoring is the practice of automatically and continuously checking whether a website or service is reachable and returning the expected response. A monitor sends HTTP requests to your URL at regular intervals — typically every 1 to 5 minutes — and records the result: up, down, or degraded.
When a check fails, an alert is sent immediately via email, SMS, Slack, or webhook so the site owner can investigate and respond.
Why uptime matters more than you think
Revenue impact
For e-commerce sites, downtime directly translates to lost sales. Amazon has publicly estimated losses of ~$220,000 per minute of outage. Even for small shops, an hour of downtime during peak traffic can wipe out a day's revenue.
SEO impact
Googlebot crawls your site continuously. If it repeatedly encounters errors (500, timeout), it reduces your crawl rate and may temporarily lower your rankings. Extended outages can trigger manual review. Recovery takes longer than the outage itself.
User trust
Users who hit an error page rarely come back. Studies consistently show that availability is more important than any other quality signal for first-time visitors. A status page that shows a known incident is far better than a silent error.
How uptime monitoring works
The basic HTTP check
The simplest monitor sends an HTTP GET request to a URL and checks:
- Did the server respond? (TCP connection)
- Did it respond within a timeout? (e.g., 30 seconds)
- Did it return an acceptable status code? (typically 200–299)
If any condition fails, the check is recorded as "down."
Beyond status codes
A basic 200 response doesn't guarantee a working site. Common failure modes that return 200:
- A maintenance page standing in for the real app
- A cached or CDN-served stale page while the origin is down
- A blank page due to a JavaScript error
- A database failure that silently renders empty content
More sophisticated checks add keyword matching — verifying that a specific string (e.g., a product name, a navigation element) is present in the response body.
Multi-location checks
A check from a single location can be fooled by:
- Regional network issues (your monitor's datacenter is up but your users in Germany can't connect)
- BGP routing incidents
- CDN misconfiguration in specific PoPs
Checking from 3+ geographic locations distinguishes "the site is down" from "there's a regional network problem."
Key metrics to track
| Metric | What it measures |
|---|---|
| Availability % | Uptime over a rolling period (day, week, month) |
| TTFB (Time to First Byte) | How quickly the server starts responding |
| Response time | Total time to receive the full response |
| Incident count | Number of outages in a period |
| MTTR (Mean Time to Repair) | Average time from detection to resolution |
| MTTD (Mean Time to Detect) | Average time from outage start to alert firing |
MTTD is the one uptime monitoring directly controls. A 1-minute check interval gives a maximum MTTD of 1 minute.
Setting sensible alert thresholds
Bad alerting is almost worse than no alerting — alert fatigue causes teams to ignore notifications, including real incidents.
Avoid single-failure alerts
A single failed check can be a transient network blip, not a real outage. Requiring 2 or 3 consecutive failures before firing an alert dramatically reduces false positives without meaningfully delaying real incident detection.
Set a response-time threshold
A site that takes 10 seconds to respond is functionally broken for most users but won't trigger a standard uptime alert. Set a secondary alert for when response time exceeds a threshold (e.g., 3 seconds).
Use escalation tiers
- 1 failed check: log silently
- 2 consecutive failures: email alert
- 5+ consecutive failures (or 10+ min downtime): page on-call (SMS/PagerDuty)
What to monitor
Don't monitor just your homepage. Consider:
- Critical user journeys: login page, checkout, API health endpoint
- Subdomains:
api.,mail.,app.can go down independently - Third-party dependencies: if your app calls an external API, that API's availability affects your users
- SSL certificate expiry: a certificate that expires silently takes the whole site offline
Understanding uptime SLAs
| Uptime % | Downtime per year | Downtime per month |
|---|---|---|
| 99% | 3.65 days | 7.3 hours |
| 99.9% | 8.76 hours | 43.8 minutes |
| 99.95% | 4.38 hours | 21.9 minutes |
| 99.99% | 52.6 minutes | 4.4 minutes |
| 99.999% | 5.26 minutes | 26.3 seconds |
Most shared hosting providers offer 99.9%. Managed cloud platforms (Vercel, Netlify, Cloudflare Pages) typically achieve 99.99%+ at the CDN layer, with origin server availability being the more common failure point.
Status pages
A public status page serves two purposes: it gives users a place to check when something seems wrong, and it demonstrates transparency and operational maturity.
Best practices:
- Keep it on a separate domain or CDN from your main site (a status page hosted on the same server as the app you're monitoring goes down when the app goes down)
- Show historical incident data, not just current status
- Update it in real-time as incidents progress — a status page that says "investigating" for 4 hours is not useful
Getting started with Sitecheck uptime monitoring
Sitecheck includes uptime monitoring on every plan:
- Free: 1 monitor with alerts
- Basic: up to 3 monitors
- Pro: up to 5 monitors
Each monitor checks your URL at regular intervals, records response time and status history, and sends an email alert the moment a failure is detected. No infrastructure to manage.
Add your first monitor in the Uptime dashboard — it takes under a minute to set up.
Summary
Uptime monitoring is table stakes for any live website. The cost of not having it — missed outages, frustrated customers, SEO damage, and slow incident response — far exceeds the effort of setting it up. Start with your most critical URL, confirm alerts are reaching the right people, then expand coverage to your full stack.
Detecting problems in minutes instead of hours is the difference between a minor incident and a crisis.