RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

Severe constraints on memory and computation characterizing the Internet-of-Things (IoT) units may prevent the execution of Deep Learning (DL)-based solutions, which typically demand large memory and high processing load. In order to support a real-time execution of the considered DL model at the IoT unit level, DL solutions must be designed having in mind constraints on memory and processing capability exposed by the chosen IoT technology. In this paper, we introduce a design methodology aiming at allocating the execution of Convolutional Neural Networks (CNNs) on a distributed IoT application. Such a methodology is formalized as an optimization problem where the latency between the data-gathering phase and the subsequent decision-making one is minimized, within the given constraints on memory and processing load at the units level. The methodology supports multiple sources of data as well as multiple CNNs in execution on the same IoT system allowing the design of CNN-based applications demanding autonomy, low decision-latency, and high Quality-of-Service.

Distributed Deep Convolutional Neural Networks for the Internet-of-Things

Disabato S.;Roveri M.;Alippi C.

In corso di stampa

Abstract

Severe constraints on memory and computation characterizing the Internet-of-Things (IoT) units may prevent the execution of Deep Learning (DL)-based solutions, which typically demand large memory and high processing load. In order to support a real-time execution of the considered DL model at the IoT unit level, DL solutions must be designed having in mind constraints on memory and processing capability exposed by the chosen IoT technology. In this paper, we introduce a design methodology aiming at allocating the execution of Convolutional Neural Networks (CNNs) on a distributed IoT application. Such a methodology is formalized as an optimization problem where the latency between the data-gathering phase and the subsequent decision-making one is minimized, within the given constraints on memory and processing load at the units level. The methodology supports multiple sources of data as well as multiple CNNs in execution on the same IoT system allowing the design of CNN-based applications demanding autonomy, low decision-latency, and high Quality-of-Service.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				In corso di stampa
			
	Titolo della rivista
	
				IEEE TRANSACTIONS ON COMPUTERS
			
	Parole chiave
	
				Complexity theory
Convolutional Neural Networks
Deep Learning
Hardware
Internet of Things
Internet-of-Things
Memory management
Optimization
Pipelines
Task analysis
			
	Appare nelle tipologie:
	
				01.1 Articolo in Rivista

File in questo prodotto:

File	Dimensione	Formato
09363550.pdf Accesso riservato Descrizione: Articolo Principale : Publisher’s version Dimensione 855.89 kB Formato Adobe PDF Visualizza/Apri	855.89 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1167401

Citazioni

ND

50

38

social impact