Change stats to let it run even if some of the machines are down, and to tolerate temporary outages without treating remote machines as permanently down.
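
A minimal sketch of the intended behavior, not the actual change: the class `StatsCollector`, the `fetch` callable, and the `retry_after` cooldown are all hypothetical names chosen for illustration. It assumes an unreachable machine raises `OSError` (which includes `ConnectionError`), skips it for the current run, and tries it again once the cooldown has passed instead of marking it down for good.

```python
import time


class StatsCollector:
    """Hypothetical collector that skips unreachable machines and retries them later."""

    def __init__(self, fetch, retry_after=30.0, clock=time.monotonic):
        self.fetch = fetch              # assumed: callable(machine) -> stats, raises OSError on failure
        self.retry_after = retry_after  # assumed cooldown in seconds before a failed machine is retried
        self.clock = clock
        self.down_since = {}            # machine -> time of the last failed attempt

    def collect(self, machines):
        """Return stats for every machine that answered; never abort on a failure."""
        results = {}
        now = self.clock()
        for machine in machines:
            failed_at = self.down_since.get(machine)
            if failed_at is not None and now - failed_at < self.retry_after:
                continue  # failed recently: skip this round, but only until the cooldown ends
            try:
                results[machine] = self.fetch(machine)
                self.down_since.pop(machine, None)  # machine is back: forget the outage
            except OSError:
                self.down_since[machine] = now      # remember the failure, keep going
        return results
```

With this shape, a run over a set of machines still returns stats for the reachable ones, and a machine that recovers is picked up again automatically on the first run after its cooldown expires.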