Description
Describes a flexible approach to process fault tolerance that lets the application react to failures while keeping the execution path minimal in failure-free runs. The focus is on returning control to the application by avoiding deadlocks inside the MPI library when failures occur.