The reliability of multi-core systems-on-chip has been the object of several studies in recent years since these devices are heavily utilized in modern digital equipment at any level of complexity. This level of integration has caused a reduced time to failure due to the rapid scaling down of the dimension with consequent increase of the cores temperature and current densities. Past studies have utilized discrete event simulation: a technique very difficult to master in this scenario due to the number of components and the rarity of the failure events. The present study proposes an analytical framework based on Markovian Agent Models (MAM), able to capture systems with the big number of cores possible with the today and tomorrow's technology while at the same time considering the effects caused by the position of the cores (center, border, corner) on the temperature level, and the dynamic redistribution of the workload with the progressive failure of the cores. The paper presents the model, adopting realistic parameters and interaction phenomena taken from the literature.

Scalable analytical model of the reliability of multi-core systems-on-chip by interacting Markovian agents

Bolchini, Cristiana;Gribaudo, Marco;Miele, Antonio
2017-01-01

Abstract

The reliability of multi-core systems-on-chip has been the object of several studies in recent years since these devices are heavily utilized in modern digital equipment at any level of complexity. This level of integration has caused a reduced time to failure due to the rapid scaling down of the dimension with consequent increase of the cores temperature and current densities. Past studies have utilized discrete event simulation: a technique very difficult to master in this scenario due to the number of components and the rarity of the failure events. The present study proposes an analytical framework based on Markovian Agent Models (MAM), able to capture systems with the big number of cores possible with the today and tomorrow's technology while at the same time considering the effects caused by the position of the cores (center, border, corner) on the temperature level, and the dynamic redistribution of the workload with the progressive failure of the cores. The paper presents the model, adopting realistic parameters and interaction phenomena taken from the literature.
2017
ACM International Conference Proceeding Series
9781450363464
Modeling; Multi-core systems; Reliability evaluation; Human-Computer Interaction; Computer Networks and Communications; 1707; Software
File in questo prodotto:
File Dimensione Formato  
p156-bobbio.pdf

Accesso riservato

: Publisher’s version
Dimensione 643.53 kB
Formato Adobe PDF
643.53 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1084800
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact