The StatusCat blog

What is uptime monitoring?

Uptime monitoring explained in plain terms: how it works, why it matters, the check types you'll use, and how to set it up so you hear about outages before your customers do.

StatusCat3 min read

Uptime monitoring is the practice of automatically checking that your services are online and responding correctly, and alerting you the moment they aren't. Instead of finding out your site is down because a customer emailed you (or tweeted at you), a monitor catches it first.

That's the whole idea. The details are where it gets useful.

How uptime monitoring works

A monitor does three things on a loop:

  1. Check. From a server somewhere on the internet, it makes a request to your target — loads a URL, opens a TCP port, pings a host, resolves a DNS record, or verifies a certificate.
  2. Evaluate. It decides whether the response is healthy: right status code, fast enough, contains the text it should, certificate still valid.
  3. Alert. If the check fails — usually after a re-check to rule out a fluke — it notifies you through the channels you've chosen.

Good monitoring tools re-check a failure from a second attempt before paging you, so one dropped packet doesn't wake you at 3 AM.

Why it matters

  • Downtime is expensive. Lost sales, missed sign-ups, support load, and eroded trust all stack up fast.
  • You want to be first to know. The gap between "it broke" and "you found out" is where reputation damage happens.
  • It shortens incidents. The sooner you're alerted, the sooner you're fixing — and the shorter the outage.
  • It proves reliability. Uptime history and SLA reports turn "we're reliable" into something you can show customers.

The check types you'll actually use

Different failures need different checks:

  • HTTP/HTTPS — the workhorse. Is the page up, fast, and returning the right status code? Add a keyword check to confirm the page contains expected content (catches "up but broken").
  • TCP — for databases, mail servers, and any service on a specific port.
  • ICMP (ping) — basic host reachability.
  • DNS — is your domain resolving to the right record?
  • SSL/TLS certificate — warn weeks before a certificate expires, not after.
  • Heartbeat — for things you can't reach from outside, like cron jobs and background workers. Your job pings the monitor on a schedule; if it goes quiet, you get alerted.

How to set it up well

  1. Monitor what matters, not everything. Start with the endpoints that cost you money or trust when they fail: your marketing site, your API, your checkout, your login.
  2. Add keyword checks to your HTTP monitors. A 200 response doesn't mean the page is actually working. Assert on a word that should be there.
  3. Route alerts to where your team lives. Slack or Discord for awareness, SMS or a phone call for the "wake someone up" tier.
  4. Set up escalation. If the first person doesn't acknowledge within a few minutes, it should move to the next. That's the difference between a monitor and an on-call system.
  5. Tune out the noise. Re-check before alerting, use quiet hours for non-critical monitors, and set sensible intervals so people trust the alerts they get.
  6. Put up a status page. When something does break, a status page keeps customers informed and cuts your support load dramatically.

Where StatusCat fits

StatusCat does all of the above in one place: multi-protocol checks, twelve alert channels, built-in on-call and escalation, and status pages with custom domains and uptime history. The free plan covers 50 monitors with no credit card, so you can put the ideas in this guide into practice in about five minutes.

Frequently asked questions

What is a good uptime percentage?
99.9% ('three nines') is a common baseline and allows about 43 minutes of downtime per month. 99.99% ('four nines') allows roughly 4 minutes a month and is typical for paid SaaS. Above that, cost rises steeply, so match the target to what your users actually need.
How often should I check my site?
For most sites, every minute is plenty. Critical, revenue-generating endpoints benefit from 30-second checks. Checking more often than that rarely changes outcomes and can add noise.
What's the difference between uptime monitoring and a status page?
Uptime monitoring is the machinery that detects outages and alerts you. A status page is the public-facing page that tells your customers what's happening. Good tools do both, and feed one into the other.

Keep reading