Please use this identifier to cite or link to this item: http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19581
Full metadata record
DC FieldValueLanguage
dc.contributor.authorΖέρβα, Μαρία-
dc.date.accessioned2025-04-01T21:01:31Z-
dc.date.available2025-04-01T21:01:31Z-
dc.date.issued2025-03-17-
dc.identifier.urihttp://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19581-
dc.description.abstractOwing to their exceptional computational performance and cost efficiency, GPUs have solidified their status as the premier platform for accelerating general-purpose workloads. Nonetheless, a subset of these workloads continues to exhibit performance stagnation. The previously proposed Light-weight Out-Of-Order GPU (LOOG) execu- tion scheme addresses this issue by augmenting conventional Thread-Level Parallelism with the exploitation of inherent Instruction-Level Parallelism. Although LOOG has been modeled using GPU simulation tools in previous studies, these implementations have suffered from limited accuracy in power consumption and critical path estima- tions, in addition to slow execution of applications. To overcome these limitations, this thesis proposes integrating LOOG into an RTL GPU framework and specifically Vortex GPU version 2.0, an open-source design that is well-suited for deployment on FPGA platforms. To preserve LOOG’s performance gain in Vortex’s RISC-V–based pipeline, the extension is meticulously designed to com- plement the existing micro-architecture and the operations it supports. Furthermore, a comprehensive investigation of design optimizations and trade-offs is conducted to enhance performance while constraining the overall Area and Power overhead. A detailed characterization of 21 Vortex workloads based on their stalling behav- ior is executed previous to the experimental evaluation, enabling the right-sizing of the micro-architecture across a broad design space that is supported by Vortex’s configura- bility. The results demonstrate an average speedup of up to approximately 23.5%, while maintaining lower Area-Delay and Power-Delay products compared to the in-order Vortex in various configurations.en_US
dc.languageenen_US
dc.subjectHigh Performance Computingen_US
dc.subjectGPU Micro-Architectureen_US
dc.subjectOut-Of-Order Executionen_US
dc.subjectRISC-Ven_US
dc.subjectRTL Designen_US
dc.subjectFPGAen_US
dc.subjectHardware Evaluationen_US
dc.titleFPGA Design and Analysis of a RISC-V Out-Of-Order GPUen_US
dc.description.pages117en_US
dc.contributor.supervisorΞύδης Σωτήριοςen_US
dc.departmentΤομέας Τεχνολογίας Πληροφορικής και Υπολογιστώνen_US
Appears in Collections:Διπλωματικές Εργασίες - Theses

Files in This Item:
File Description SizeFormat 
maria_zerva_diploma_thesis.pdf2.94 MBAdobe PDFView/Open


Items in Artemis are protected by copyright, with all rights reserved, unless otherwise indicated.