I am just a bit sceptical about the reasons given though. Monday night when the server acted up, the explanation was that they faced a DDOS attack and the sysadmin had to stop some services to counter the attack. Now it is the hard drive that is giving problems...
Advanced detection and notification for both these issues is not rocket science. Detecting any issue and posting an immediate warning to the status thread in my opinion will go a long way to improve the customer perception of service.
|