Please use this identifier to cite or link to this item: http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19077
Full metadata record
DC FieldValueLanguage
dc.contributor.authorΧάιδος, Παναγιώτης-
dc.date.accessioned2024-04-22T11:30:20Z-
dc.date.available2024-04-22T11:30:20Z-
dc.date.issued2024-04-04-
dc.identifier.urihttp://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19077-
dc.description.abstractIn recent years with the rise of power-hungry ML applications, Printed Computing serves to meet the requirements for cost, conformity, and non-toxicity where standard silicon-based computing seems to be lacking. The aspects of low manufacturing costs and disposability that printed technologies provide, fit well with a variety of ML applications’ needs. On top of that, using printed microprocessors allows for programability and thus flexibility to the workloads that can be run, compared to application specific hardware that tends to be more power hungry. Limitations for printed technologies can be extremely large feature sizes along with limited power support from small printed batteries. Hence there is a rising need for improvements in the domains of area and power, in order to fit complex processors. One approach to this issue is hardware reduction techniques, that have shown to be fruitful and necessary when considering printed processors that need to meet constraints. In this thesis we explore the possibilities for area and power gains of printed microprocessors using the EGFET standard cell library for low voltage printing technology, regarding machine learning workloads and printed workloads. We synthesize and analyse hardware measurements for a set of examined processors, focusing mostly on low gate-count and low power architectures. We compile the benchmarks and simulate the processors with RTL and netlist simulations to extract the execution traces using the Synopsys EDA suite and Modelsim simulator. We analyse the execution traces of the workloads to locate and remove unused whole compo- nents and more specific logic functionalities of the ISA of our processors, with the aim of building bespoke processors with improved hardware specs. We then incorporate MAC units that efficiently improve the performance and consumption specifically for ML workloads like MLPs and SVMs with high MAC usage. Finally, we explore the benefits of introducing precision-scaling in our new MAC units, measuring the speedup and accuracy loss tradeoff. Our proposed units and bespoke modifications achieve from 22.2%, 23.6% and 33.79% improvements in area, power and speedup when imposing no accuracy loss, up to 29.3%, 28.7% and 41.73% gains in area, power and speedup, with just a 0.5% decrease in average accuracy estimated over 3 datasets for the main Zero-Riscy Core.en_US
dc.languageenen_US
dc.subjectPrinted Electronicsen_US
dc.subjectPrinted Computingen_US
dc.subjectMachine Learningen_US
dc.subjectEDAen_US
dc.subjectPrecision Scalingen_US
dc.subjectBespoke Processorsen_US
dc.titleDesign and Evaluation of Bespoke Microprocessor Architectures for Flexible Devicesen_US
dc.description.pages107en_US
dc.contributor.supervisorΣούντρης Δημήτριοςen_US
dc.departmentΤομέας Τεχνολογίας Πληροφορικής και Υπολογιστώνen_US
Appears in Collections:Διπλωματικές Εργασίες - Theses

Files in This Item:
File Description SizeFormat 
pchaidos_Diploma_Thesis.pdf4.1 MBAdobe PDFView/Open


Items in Artemis are protected by copyright, with all rights reserved, unless otherwise indicated.