Please use this identifier to cite or link to this item:
http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19877Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Γαλανόπουλος, Σπυρίδων | - |
| dc.date.accessioned | 2025-11-03T08:03:14Z | - |
| dc.date.available | 2025-11-03T08:03:14Z | - |
| dc.date.issued | 2025-10-29 | - |
| dc.identifier.uri | http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19877 | - |
| dc.description.abstract | Memory allocation has become a critical performance and energy bottleneck in modern systems. Each allocation typically requires handling a page fault in software, which involves expensive context switches, pipeline flushes, and execution on power-hungry out-of-order cores. While acceptable for long-running applications, these overheads dominate short-lived and latency-sensitive workloads such as serverless functions, microservices, and LLM inference, where allocation often accounts for more than 30% of total runtime. At the same time, advanced placement policies (e.g., page coloring, NUMA/NUCA-aware allocation) increasingly demand tight coupling between allocation and address translation, as well as responsiveness to runtime conditions—capabilities that are hard to achieve at software timescales. We present a new hardware–software co-design that accelerates and enriches memory allocation by introducing a programmable hardware allocation engine. It allows the operating system to selectively program this engine to handle allocations directly in hardware, bypassing expensive kernel traps. The engine integrates into the memory hierarchy, executes OS-defined policies on page faults, updates translation structures, and adapts placement decisions dynamically using microarchitectural feedback (e.g., DRAM bandwidth, cache occupancy). This enables fast, translation-aware, and runtime-adaptive memory allocation, while retaining OS control over policy. We prototype our proposed design on an FPGA using a modified RISC-V core that runs Linux. Across a range of short-lived and placement-sensitive workloads, our solution accelerates page allocation by 7-15x, improves end-to-end application performance by 77% on average, with negligible hardware cost (1.5% area). This thesis demonstrates that combining OS programmability with hardware acceleration enables memory systems that are both high-performance and highly adaptable to dynamic runtime conditions. | en_US |
| dc.language | en | en_US |
| dc.subject | virtual memory | en_US |
| dc.subject | εικονική μνήμη | en_US |
| dc.subject | page fault | en_US |
| dc.subject | σφάλμα σελίδας | en_US |
| dc.subject | operating systems | en_US |
| dc.subject | λειτουργικά συστήματα | en_US |
| dc.subject | reconfigurable hardware | en_US |
| dc.subject | επαναπρογραμματιζόμενο υλικό | en_US |
| dc.subject | address translation | en_US |
| dc.subject | μετάφραση διεύθυνσης | en_US |
| dc.title | Ειδικευμένες Αρχιτεκτονικές για ενεργειακά αποδοτική και έξυπνη εκχώρηση φυσικής μνήμης | en_US |
| dc.description.pages | 78 | en_US |
| dc.contributor.supervisor | Σούντρης Δημήτριος | en_US |
| dc.department | Τομέας Τεχνολογίας Πληροφορικής και Υπολογιστών | en_US |
| Appears in Collections: | Διπλωματικές Εργασίες - Theses | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Diploma_Thesis_Spyros_Galanopoulos.pdf | 2.95 MB | Adobe PDF | View/Open |
Items in Artemis are protected by copyright, with all rights reserved, unless otherwise indicated.