[Paper Review] Building a self-healing operating system
Short Introduction to This Paper
The paper is written in good texts and it introduces several interesting self-healing strategies for a operating system (not for a specific application). However there is a plethora of related work in hardware and software fault-tolerance.
The contribution of this paper is mainly a survey of techniques that can be applied to provide self-healing functionality to an OS. It discussed the concepts, implementation and evaluation on exception handling, code reloading, operating system component isolation, micro-rebooting, automatic system service restarts, watchdog timer based recovery and transactional components.