Browsing Supercomputer Education and Research Centre (SERC) by Subject "Adaptive Fault Tolerance"
Now showing items 1-1 of 1
-
Adaptive Fault Tolerance Strategies for Large Scale Systems
(2018-03-07)Exascale systems of the future are predicted to have mean time between node failures (MTBF) of less than one hour. At such low MTBF, the number of processors available for execution of a long running application can widely ...