CAS, RAMSS, eHR outage: Saturday, Feb. 16 2013 5:04 AM to 8:31 AM

A network device failed at 5:04 AM this morning making Ryerson’s Central Authentication System unavailable as well as RAMSS and eHR. The system was restored by 8:31 AM this morning.

Please accept our apologies for the outage.

For those interested in a more detailed technical explanation of this outage, here is a little more information:

The Virtual Router Redundancy Protocol (VRRP) instance on the Alteon load balancer that services the subnet that the SSL accelerators are on,  stopped working this morning at 5:04am. The result was that the SSL accelerators were not able to comunicate with hosts outside their subnet. As a result, the applications that are ssl offloaded by this load balancer, eHR, RAMMS and CAS were not working. All the health checks were good on the load balancer and there were no log entries regarding the problem. This time the SSL accelerators were not the problem, it was the alteon load balancer.

The VRRP instance was disabled to make the load blancer work. This is the first time we have encountered this type of problem.

-Computing and Communications Services

 

This entry was posted in CAS, eHR, RAMSS. Bookmark the permalink.