Improved system reliability and fault locating via in-service performance monitoring