Why do systems fail? Tandem NonStop system and fault tolerance (erlang-solutions.com)
Why do systems fail? This question should probably be asked more often, considering all the factors it involves. It was central to the NonStop architecture because achieving high availability depends on understanding system failures.