RE.PUBLIC@POLIMI: Research Publications of Politecnico di Milano

Efficient Hardware Design of Iterative Stencil Loops

Rana, Vincenzo; Beretta, Ivan; Bruschi, Francesco; Nacci, Alessandro Antonio; Atienza, David; Sciuto, Donatella

2016-01-01

Abstract

A large number of algorithms for multidimensional signal processing and scientific computation come in the form of iterative stencil loops (ISLs), whose data dependencies span multiple iterations. Because of their complex inner structure, automatic hardware acceleration of such algorithms is traditionally considered a difficult task. In this paper, we introduce an automatic design flow that identifies, in a wide family of two-dimensional data processing algorithms, subportions that exhibit a kind of parallelism close to that of ISLs; these are mapped onto a space of highly optimized ad hoc architectures, which is efficiently explored to identify the best implementations with respect to both area and throughput. Experimental results show that the proposed methodology generates circuits whose performance is comparable to that of manually optimized solutions, and orders of magnitude higher than that of circuits generated by commercial high-level synthesis tools.

Publication year: 2016
Journal title: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Keywords: Dataflow synthesis; embedded systems; field-programmable gate array (FPGA); High-level synthesis; iterative functions; multimedia processing; performance optimization; Software; Computer Graphics and Computer-Aided Design; Electrical and Electronic Engineering
Appears in types: 01.1 Journal Article

Files in this item:

File	Access	Version	Size	Format
TCAD2016.pdf	Restricted access	Publisher's version	4.65 MB	Adobe PDF
11311-1009339 Rana.pdf	Open access	Post-print (draft or Author's Accepted Manuscript, AAM)	6.69 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11311/1009339
