A novel large-bit-size architecture and microarchitecture for the implementation of Superscalar Pipeline VLIW microprocessors

Microprocessors have grown tremendously in its computing and data crunching capability since the early days of the invention of a microprocessor. Today, most microprocessors in the market are at 32 bits, while the latest microprocessors from IBM, Intel and AMD are at 64 bits. To further grow the com...

Full description

Saved in:
Bibliographic Details
Main Author: Lee, Weng Fook
Format: Thesis
Language:English
Subjects:
Online Access:http://dspace.unimap.edu.my:80/xmlui/bitstream/123456789/1493/1/Page%201-24.pdf
http://dspace.unimap.edu.my:80/xmlui/bitstream/123456789/1493/2/Full%20text.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-unimap-1493
record_format uketd_dc
spelling my-unimap-14932008-10-15T06:44:12Z A novel large-bit-size architecture and microarchitecture for the implementation of Superscalar Pipeline VLIW microprocessors Lee, Weng Fook Microprocessors have grown tremendously in its computing and data crunching capability since the early days of the invention of a microprocessor. Today, most microprocessors in the market are at 32 bits, while the latest microprocessors from IBM, Intel and AMD are at 64 bits. To further grow the computational capability of a microprocessor, there are two possible paths. One method is to increase the bit size of the microprocessor to 128/256/512 bits. The larger the bitsize, the more data can be crunched at any one time. The second method is to implement multiple microprocessor core in a single microprocessor unit. For example, the Intel’s Pentium 4 Dual Core and AMD’s Athlon Dual Core both have two microprocessor core within a single microprocessor unit. Latest from Intel and AMD are quad core microprocessors with either a configuration of pseudo-quad core or full quad core within a single microprocessor unit. In a pseudo-quad core configuration, two silicon each consists dual core microprocessor is packaged within a single microprocessor unit while a full quad core consists of four microprocessor core on one silicon packaged within a single microprocessor unit. Both methods have its advantages and disadvantages. Both methods yields different design issues and have different engineering limitations. This work explores the method of increasing the data bus size of the microprocessor from 32/64 bits to 128/256/512 bits to allow for more data crunching capability. In the course of this work, a superscalar pipeline 64 bits VLIW microprocessor with 4 stages (fetch, decode, execute, writeback) and 3 parallel pipes is implemented on a TSMC 0.35 micron process. The implementation is then expanded to 128/256/512 bits using the same TSMC 0.35 micron process. To prove the concept that such a large bit size VLIW microprocessor can indeed be implemented, the said VLIW microprocessor of bitsize 64/128/256 is programmed on an Altera Stratix 2 EP2S180F1508I4 FPGA and back annotated for verification. In the TSMC 0.35 micron process implementation of the work, the critical path of the VLIW microprocessor of data bus size 128/256/512 is analyzed with its worst path within the adder of the ALU in the execute stage. Different adder architectures are investigated for suitability on synthesis implementation of large data bus size adder for efficient usage within the ALU. An adder algorithm using repetitive constructs in a parallel algorithm that allows for efficient and optimal synthesis for large data bus size is proposed as a suitable implementation for the adder within the ALU. This work has two important findings. One is the proposed adder architecture synthesis of a large bit size adder that provides for improved performance-gatecountproduct compared to conventional adder architecture synthesis. Second is the proof of concept that a large bit size VLIW microprocessor is possible by implementing a 64/128/256 bits data size on an Altera Stratix 2 EP2S180F1508I4 FPGA. Universiti Malaysia Perlis 2008 Thesis en http://dspace.unimap.edu.my/123456789/1493 http://dspace.unimap.edu.my:80/xmlui/bitstream/123456789/1493/1/Page%201-24.pdf 7d185c0f3fd57bae3ff20cd181fafa37 http://dspace.unimap.edu.my:80/xmlui/bitstream/123456789/1493/2/Full%20text.pdf f70557a77f6533aafdfd8e859c459b5a http://dspace.unimap.edu.my:80/xmlui/bitstream/123456789/1493/3/license.txt 8d600f6198781c5e5ba9f9233bd41d80 Microprocessors VLIW microprocessor Microprocessors -- Design and construction Field programmable gate arrays Very Long Instruction Word (VLIW) Integrated circuits School of Computer & Communication Engineering
institution Universiti Malaysia Perlis
collection UniMAP Institutional Repository
language English
topic Microprocessors
VLIW microprocessor
Microprocessors -- Design and construction
Field programmable gate arrays
Very Long Instruction Word (VLIW)
Integrated circuits
spellingShingle Microprocessors
VLIW microprocessor
Microprocessors -- Design and construction
Field programmable gate arrays
Very Long Instruction Word (VLIW)
Integrated circuits
Lee, Weng Fook
A novel large-bit-size architecture and microarchitecture for the implementation of Superscalar Pipeline VLIW microprocessors
description Microprocessors have grown tremendously in its computing and data crunching capability since the early days of the invention of a microprocessor. Today, most microprocessors in the market are at 32 bits, while the latest microprocessors from IBM, Intel and AMD are at 64 bits. To further grow the computational capability of a microprocessor, there are two possible paths. One method is to increase the bit size of the microprocessor to 128/256/512 bits. The larger the bitsize, the more data can be crunched at any one time. The second method is to implement multiple microprocessor core in a single microprocessor unit. For example, the Intel’s Pentium 4 Dual Core and AMD’s Athlon Dual Core both have two microprocessor core within a single microprocessor unit. Latest from Intel and AMD are quad core microprocessors with either a configuration of pseudo-quad core or full quad core within a single microprocessor unit. In a pseudo-quad core configuration, two silicon each consists dual core microprocessor is packaged within a single microprocessor unit while a full quad core consists of four microprocessor core on one silicon packaged within a single microprocessor unit. Both methods have its advantages and disadvantages. Both methods yields different design issues and have different engineering limitations. This work explores the method of increasing the data bus size of the microprocessor from 32/64 bits to 128/256/512 bits to allow for more data crunching capability. In the course of this work, a superscalar pipeline 64 bits VLIW microprocessor with 4 stages (fetch, decode, execute, writeback) and 3 parallel pipes is implemented on a TSMC 0.35 micron process. The implementation is then expanded to 128/256/512 bits using the same TSMC 0.35 micron process. To prove the concept that such a large bit size VLIW microprocessor can indeed be implemented, the said VLIW microprocessor of bitsize 64/128/256 is programmed on an Altera Stratix 2 EP2S180F1508I4 FPGA and back annotated for verification. In the TSMC 0.35 micron process implementation of the work, the critical path of the VLIW microprocessor of data bus size 128/256/512 is analyzed with its worst path within the adder of the ALU in the execute stage. Different adder architectures are investigated for suitability on synthesis implementation of large data bus size adder for efficient usage within the ALU. An adder algorithm using repetitive constructs in a parallel algorithm that allows for efficient and optimal synthesis for large data bus size is proposed as a suitable implementation for the adder within the ALU. This work has two important findings. One is the proposed adder architecture synthesis of a large bit size adder that provides for improved performance-gatecountproduct compared to conventional adder architecture synthesis. Second is the proof of concept that a large bit size VLIW microprocessor is possible by implementing a 64/128/256 bits data size on an Altera Stratix 2 EP2S180F1508I4 FPGA.
format Thesis
author Lee, Weng Fook
author_facet Lee, Weng Fook
author_sort Lee, Weng Fook
title A novel large-bit-size architecture and microarchitecture for the implementation of Superscalar Pipeline VLIW microprocessors
title_short A novel large-bit-size architecture and microarchitecture for the implementation of Superscalar Pipeline VLIW microprocessors
title_full A novel large-bit-size architecture and microarchitecture for the implementation of Superscalar Pipeline VLIW microprocessors
title_fullStr A novel large-bit-size architecture and microarchitecture for the implementation of Superscalar Pipeline VLIW microprocessors
title_full_unstemmed A novel large-bit-size architecture and microarchitecture for the implementation of Superscalar Pipeline VLIW microprocessors
title_sort novel large-bit-size architecture and microarchitecture for the implementation of superscalar pipeline vliw microprocessors
granting_institution Universiti Malaysia Perlis
granting_department School of Computer & Communication Engineering
url http://dspace.unimap.edu.my:80/xmlui/bitstream/123456789/1493/1/Page%201-24.pdf
http://dspace.unimap.edu.my:80/xmlui/bitstream/123456789/1493/2/Full%20text.pdf
_version_ 1747836768832454656