Please use this identifier to cite or link to this item:
http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19659
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Ζέρβα, Βασιλεία | - |
dc.date.accessioned | 2025-07-04T11:38:25Z | - |
dc.date.available | 2025-07-04T11:38:25Z | - |
dc.date.issued | 2025-06-27 | - |
dc.identifier.uri | http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19659 | - |
dc.description.abstract | Growing computational demands of modern workloads and the need for better application performance, has led designers to focus on increasing the number of cores into a single chip. This multi-core design is highly effective in many applications because it increases thread-level parallelism, improving the overall throughput. However, improving single-thread performance remains crucial, especially for applications with limited parallelism. Since in many scenarios, instruction-level parallelism becomes a bottleneck designers, tried to implement wider designs, with wider instructions windows and widths. This of course came with an important trade-off: wider windows and issue widths typically require more complex control logic, more power consumption, and significantly impact the clock frequency due to the increased complexity and size of the structures. Another approach was introduced in order to avoid the hazards mentioned before, called “clustering”. Clustering, means dividing resources into smaller, independent groups, each handling a subset of instructions with its own set of resources. Using this method could significantly reduce wire delays, helping to preserve high clock frequencies even as the overall system scales. In this thesis, we analyze the limitations of clustering and review steer ing techniques that have been proposed in previous work. Then, we design and implement four instruction steering methods —Round-Robin, Dependency, Dependency-Load, and Loadcut— which, carefully adapted to the specific goals and architecture of our system. We then evaluate these methods through sim ulation using the gem5 simulator and analyze their performance. | en_US |
dc.language | en | en_US |
dc.subject | Clustering, instruction steering, Instruction-Level Parallelism, clock frequency, gem5, performance | en_US |
dc.title | Design and Evaluation of Clustered Processor Architectures | en_US |
dc.description.pages | 73 | en_US |
dc.contributor.supervisor | Πνευματικάτος Διονύσιος | en_US |
dc.department | Τομέας Τεχνολογίας Πληροφορικής και Υπολογιστών | en_US |
Appears in Collections: | Διπλωματικές Εργασίες - Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
thesis_zerva.pdf | 1.66 MB | Adobe PDF | View/Open |
Items in Artemis are protected by copyright, with all rights reserved, unless otherwise indicated.