I was looking at some statistics and noticed we are sending out a massive number of SMS messages from our monitoring systems.
This left me wondering whether there is a better (and cheaper) alternative, something just as reliable but IP-based. We all have smartphones these days anyway.
So my question is: what are you using to notify admins of incidents?
The advantage of SMS is that it is out of band. Any SMTP or other IP-based solution requires a stable, working network, and that network may be the very thing the alert is trying to tell you is down.
I do not worry so much about that; part of the monitoring solution is out of band for exactly that reason.
Kind regards / Met vriendelijke groet,
Thijs Stuurman
We use OpsGenie for notifications (and on-call scheduling, etc). There are similar options as well, such as PagerDuty.
Notifications can be submitted to the service in a variety of ways (email, web API, etc). It has a variety of integrations with other tools (Nagios, Pingdom, etc) to aggregate all of your alerts, and there is a callback mechanism that lets the user trigger custom actions right from the app. For example, I wrote an interface for it such that when we get an alert, the on-call person can choose to restart the affected service -- or even reboot the entire VM hosting it -- right from within the OpsGenie app.
Each user can choose their method of contact (notification to the smartphone app, SMS, phone call, email, whatever), and on-call schedules (and exceptions) are easily managed.
Which is why you locate a small NMS outside your network (on a VM
somewhere) whose only job is to start alerting when it can't reach the NMS
inside your network. That also helps when your interior NMS gets
gummed up or when a general emergency in your locality damages your
infrastructure at the same time as the SMS provider's infrastructure.
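
A rough sketch of what that outside watchdog could look like, in Python. This is only an illustration under my own assumptions: the TCP probe, the threshold of consecutive failures, and the names probe and Watchdog are all made up for the example, not taken from any particular NMS.

```python
import socket


def probe(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


class Watchdog:
    """Tracks consecutive failed probes of the interior NMS (hypothetical helper)."""

    def __init__(self, threshold: int = 3):
        # Number of consecutive failures required before alerting (assumed value).
        self.threshold = threshold
        self.failures = 0

    def record(self, reachable: bool) -> bool:
        """Record one probe result.

        Returns True exactly once per outage, at the moment the number of
        consecutive failures reaches the threshold.
        """
        if reachable:
            self.failures = 0
            return False
        self.failures += 1
        return self.failures == self.threshold
```

You would call record(probe(...)) on a timer from the outside VM and, when it returns True, send the alert over a path that does not cross your own network. Requiring a few consecutive failures keeps one dropped packet from paging anyone.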
If your monitoring system is structured well to begin with, email is
about as effective as SMS. A smartphone app expecting heartbeats via your
in-band infrastructure is more effective than both.
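
The heartbeat idea is a dead-man's switch: the receiver raises the alarm itself when heartbeats stop arriving. A minimal sketch in Python, where the class name HeartbeatMonitor, the timeout, and the injectable clock are my own assumptions for illustration:

```python
import time


class HeartbeatMonitor:
    """Flags the monitored side as stale when heartbeats stop arriving."""

    def __init__(self, timeout_s: float, clock=time.monotonic):
        # timeout_s: how long we tolerate silence before alarming (assumed value).
        # clock: injectable time source, so the logic can be tested without sleeping.
        self.timeout_s = timeout_s
        self.clock = clock
        self.last_seen = clock()

    def beat(self) -> None:
        """Call whenever a heartbeat is received."""
        self.last_seen = self.clock()

    def is_stale(self) -> bool:
        """True if no heartbeat has arrived for longer than timeout_s."""
        return self.clock() - self.last_seen > self.timeout_s
```

The point of the design is that silence is the alarm. If the network or the monitoring system dies, nothing has to be delivered for the on-call person to find out.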