I've configured all of our production-tier servers in ArcGIS Monitor and have System counters monitoring the basics: CPU, memory, and disk activity. This works very well.
The one thing I can't figure out is how to set up a monitor or alert for when a system has gone completely offline. We had a VMware outage over the weekend that took down five or more production-tier servers. If Monitor had detected the outage and sent an alert, we could have resolved the issue before the start of business today.
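To illustrate the kind of check I mean, here is a rough sketch of a reachability test run from a separate machine, outside of ArcGIS Monitor. It is not a Monitor feature, just a plain TCP connection attempt. The function names, host names, and ports are all hypothetical placeholders, and the alerting step (email, etc.) is left out.

```python
import socket


def is_reachable(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection raises OSError on refusal, timeout, or DNS failure.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def find_offline(servers, timeout=3.0):
    """Return the (host, port) pairs that could not be reached."""
    return [(host, port) for host, port in servers
            if not is_reachable(host, port, timeout)]


# Example usage (hypothetical host names and ports, adjust to your environment):
# offline = find_offline([("gis-prod-01.example.com", 6443),
#                         ("gis-prod-02.example.com", 6443)])
# if offline:
#     print("Unreachable:", offline)  # replace with a real alert
```

Something like this could run on a schedule from a host that does not live on the same VMware cluster, so the check itself survives the outage it is meant to detect. A failed connection only shows that the port is unreachable from that host, so a network problem would look the same as a server that is down.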
Does anyone know how to monitor for complete system failures like this in ArcGIS Monitor?