I´m trying to develop a series of tests to measure the health of several Vicidial systems (kind of a Nagios monitoring, but Vicidial specific). So far, I´m monitoring these:
- Average load
- Disk space
- Amount of Asterisk calls (core show channels)
- SIP responsiveness
- IAX responsiveness
- SSH access
- Status of PRI/R2 trunks
However, I´ve found out that even though these metrics help me qualify a working Asterisk system, they do not necessarily measure Vicidial very well. For example:
- What if the hopper isn´t getting filled?
- What if the dialer is not sending any outbound calls, and the calls measured by core show channels are only the ones from the Meetme conferences?
- What if everyone is on pause, or calls are simply getting dropped because of a database error?
And so on...
So what I´m looking for is key data which I can get either by looking at logs or at the MySQL database and that I can query every minute or so, load them into a monitoring system, and set an alarm in case any of those numbers look wrong.
Does this make sense to anyone?
What would you monitor?