Probabilistic Approaches for Fault-Tolerance and Scalability in Extreme-Scale Computing.