RE.PUBLIC@POLIMI pubblicazioni di ricerca del Politecnico di Milano

Machine Learning models are often composed by sequences of transformations. While this design makes easy to decompose and accelerate single model components at training time, predictions requires low latency and high performance predictability whereby end-to-end runtime optimizations and acceleration is needed to meet such goals. This paper shed some light on the problem by using a production-like model, and showing how by redesigning model pipelines for efficient execution over CPUs and FPGAs performance improvements of several folds can be achieved.

Towards accelerating generic machine learning prediction pipelines

Scolari, Alberto;Lee, Yunseong;Weimer, Markus;INTERLANDI, MATTEO

2017-01-01

Abstract

Machine Learning models are often composed by sequences of transformations. While this design makes easy to decompose and accelerate single model components at training time, predictions requires low latency and high performance predictability whereby end-to-end runtime optimizations and acceleration is needed to meet such goals. This paper shed some light on the problem by using a production-like model, and showing how by redesigning model pipelines for efficient execution over CPUs and FPGAs performance improvements of several folds can be achieved.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2017
			
	Titolo del libro
	
				Proceedings - 35th IEEE International Conference on Computer Design, ICCD 2017
			
	ISBN (International Standard Book Number)
	
				978-1-5386-2254-4
			
	Parole chiave
	
				FPGA; Machine Learning; Model Scoring; Prediction Pipelines; Hardware and Architecture
			
	Appare nelle tipologie:
	
				04.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
accelerating-generic-machine.pdf Accesso riservato Dimensione 163.11 kB Formato Adobe PDF Visualizza/Apri	163.11 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1062745

Citazioni

ND

1

0

social impact