Short-Liveness of Error Propagation in Kernel Can Improve Operating Systems Availability

Manabu Sugimoto, Takafumi Kubota, Kenji Kono

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The reliability of operating systems is crucial to achieving high availability of computer systems. Unfortunately, Linux, a widely used operating system, is far from bug-free. Some recent studies point out error propagation is very short in the kernel and thus most data in the kernel are not corrupt even when a failure occurs. This paper explores the possibility of exploiting the property of 'short-liveness' of error propagation in the kernel to improve the operating system availability. Our novel design of the memory management scheme allows us to recover the kernel by removing inconsistent data structures corrupted during error propagations.

Original languageEnglish
Title of host publicationProceedings - 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks - Supplemental Volume, DSN-S 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages23-24
Number of pages2
ISBN (Electronic)9781728130286
DOIs
Publication statusPublished - 2019 Jun
Event49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks - Supplemental Volume, DSN-S 2019 - Portland, United States
Duration: 2019 Jun 242019 Jun 27

Publication series

NameProceedings - 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks - Supplemental Volume, DSN-S 2019

Conference

Conference49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks - Supplemental Volume, DSN-S 2019
CountryUnited States
CityPortland
Period19/6/2419/6/27

Keywords

  • Error Propagation
  • Operating System Availability
  • Software Failure

ASJC Scopus subject areas

  • Safety, Risk, Reliability and Quality
  • Information Systems
  • Information Systems and Management
  • Computer Networks and Communications

Fingerprint Dive into the research topics of 'Short-Liveness of Error Propagation in Kernel Can Improve Operating Systems Availability'. Together they form a unique fingerprint.

  • Cite this

    Sugimoto, M., Kubota, T., & Kono, K. (2019). Short-Liveness of Error Propagation in Kernel Can Improve Operating Systems Availability. In Proceedings - 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks - Supplemental Volume, DSN-S 2019 (pp. 23-24). [8805798] (Proceedings - 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks - Supplemental Volume, DSN-S 2019). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/DSN-S.2019.00017