Is the website online? Does the server deliver more than error messages? How fast do pages load? Does the shopping cart process work? All questions can only be answered when the actual monitoring takes place from somewhere outside your servers and even their data center. We use our PRTG Network Monitor on cloud servers from GoGrid (in San Francisco) and Amazon EC2 (US East Coast and Ireland) as well as on servers in our own office datacenter in Nuernberg, Germany, to monitor our production servers - all four locations are quite reliable so it actually makes sense to use these. Here is what we monitor for our site www.paessler.com from a distance:
We had to set up those HTTP sensors for each of our four web servers and for the load balancer, too. Alarms: Whenever one of the sensors finds a problem, alarms are sent to our admins. We stop monitoring of the other sensors, when we can not ping the firewall, because than obviously the connection to the data center itself is down. Here is a screenshot of some of the sensors I have mentioned:
Are the servers healthy? How high is CPU load and memory usage? Is enough diskspace available? These questions can not be answered from the outside. You either must run a monitoring software on the server(s) or must have a monitoring tool that can monitor the server through local probes/agents or gets internal information via scripts. We run Remote Probes of PRTG on our production servers to monitor the following parameters of each server (we have 5 production servers):
Plus we monitor system health of the ASA Firewall (bandwidth usage, memory, cpu) and our SQL server. Alarms: If any of these health variables is outside a desired range (e.g. CPU load above 95% for more than 2/5 minutes) our monitoring solution will send alarms to our admins. Here is a screenshot of most of our sensors on the production systems: