Abstract—Reliability metrics for hardware faults in safety-/mission-critical systems have been historically based solely on hardware failure rates, quantitatively ignoring any effect of the software. Software reliability is usually considered only in terms of bugs/defects, which is a quantity hard to estimate analytically. In this article, we explore the problem of quantifying the impact of software in reliability against Single Event Upsets, highlighting the limits of the current standards that restrict the use of Commercial-Off-The-Shelf components for critical scenarios. We show how to obtain valid software reliability metrics and how this methodology significantly improves reliability estimation compared to hardware-only estimation. The reliability gain is further improved when considering real-time metrics. This analysis is the first step towards a reconciliation between software and hardware reliability and enables the quantification of reliability introduced by Software-Implemented Hardware Fault Tolerance approaches.

Towards Certifiable Software-Implemented Hardware Fault Tolerance

F. Reghenzani;W. Fornaciari
2024-01-01

Abstract

Abstract—Reliability metrics for hardware faults in safety-/mission-critical systems have been historically based solely on hardware failure rates, quantitatively ignoring any effect of the software. Software reliability is usually considered only in terms of bugs/defects, which is a quantity hard to estimate analytically. In this article, we explore the problem of quantifying the impact of software in reliability against Single Event Upsets, highlighting the limits of the current standards that restrict the use of Commercial-Off-The-Shelf components for critical scenarios. We show how to obtain valid software reliability metrics and how this methodology significantly improves reliability estimation compared to hardware-only estimation. The reliability gain is further improved when considering real-time metrics. This analysis is the first step towards a reconciliation between software and hardware reliability and enables the quantification of reliability introduced by Software-Implemented Hardware Fault Tolerance approaches.
2024
Proceedings of IEEE 14th International Symposium on Industrial Embedded Systems (SIES)
File in questo prodotto:
File Dimensione Formato  
2024-SIES.pdf

accesso aperto

Dimensione 413.43 kB
Formato Adobe PDF
413.43 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1276064
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact