unmbo_VLSI12

Customizing Instruction Set Extensible Reconfigurable Processors using GPUs

Unmesh D. Bordoloi
 
Bharath Suri
Swaroop Nunna
 
Samarjit Chakraborty
Petru Eles Author homepage
 
Zebo Peng Author homepage

25th International Conferennce on VLSI Design, Hyderabad, India, January 07-11, 2012.

ABSTRACT
Many reconfigurable processors allow their instruction sets to be tailored according to the performance requirements of target applications. They have gained immense popularity in recent years because of this flexibility of adding custom instructions. However, most design automation algorithms for instruction set customization (like enumerating and selecting the optimal set of custom instructions) are computationally intractable. As such, existing tools to customize instruction sets of extensible processors rely on approximation methods or heuristics. In contrast to such traditional approaches, we propose to use GPUs (Graphics Processing Units) to efficiently solve computationally expensive algorithms in the design automation tools for extensible processors. To demonstrate our idea, we choose a custom instruction selection problem and accelerate it using CUDA (CUDA is a GPU computing engine). Our CUDA implementation is devised to maximize the achievable speedups by various optimizations like exploiting on-chip shared memory and register usage. Experiments conducted on well known benchmarks show significant speedups over sequential CPU implementations as well as over multi-core implementations.


Related files:
unmbo_VLSI12.pdfAdobe Acrobat portable document


[DSNC12] Unmesh D. Bordoloi, Bharath Suri, Swaroop Nunna, Samarjit Chakraborty, Petru Eles, Zebo Peng, "Customizing Instruction Set Extensible Reconfigurable Processors using GPUs", 25th International Conferennce on VLSI Design, Hyderabad, India, January 07-11, 2012.
( ! ) perl script by Giovanni Squillero with modifications from Gert Jervan   (v3.1, p5.2, September-2002-)
Last modified on Monday December 04, 2006 by Gert Jervan