Please use this identifier to cite or link to this item:
http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19676
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Σταθοπούλου, Φωτεινή | - |
dc.date.accessioned | 2025-07-07T09:31:19Z | - |
dc.date.available | 2025-07-07T09:31:19Z | - |
dc.date.issued | 2025-07-03 | - |
dc.identifier.uri | http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19676 | - |
dc.description.abstract | The rapid evolution of Artificial Intelligence (AI) and Machine Learning (ML) has significantly heightened computational demands, particularly for inference serving workloads. While traditional cloud-based deployments offer scalability, they face challenges such as network congestion, high energy consumption, and privacy concerns. In contrast, edge computing provides low-latency and sustainable alternatives but is constrained by limited computational resources. In this thesis, we introduce SynergAI, a novel framework designed for performance- and architecture-aware inference serving across heterogeneous edge-to-cloud infrastructures. Built upon a comprehensive performance characterization of modern inference engines, SynergAI integrates a combination of offline and online decision-making policies to deliver intelligent, lightweight, and architecture-aware scheduling. By dynamically allocating workloads across diverse hardware architectures, it effectively minimizes Quality of Service (QoS) violations. We implement SynergAI within a Kubernetes-based ecosystem and evaluate its efficiency. Our results demonstrate that architecture-driven inference serving enables optimized and architecture-aware deployments on emerging hardware platforms, achieving an average reduction of 2.4× in QoS violations compared to a State-of-the-Art (SotA) solution. | en_US |
dc.language | en | en_US |
dc.subject | Cloud,Edge, Inference, Scheduling, Performance-aware, Architecture aware | en_US |
dc.title | Edge-to-Cloud Synergy for Architecture-Driven High-Performance Orchestration for AI Inference | en_US |
dc.description.pages | 85 | en_US |
dc.contributor.supervisor | Σούντρης Δημήτριος | en_US |
dc.department | Τομέας Τεχνολογίας Πληροφορικής και Υπολογιστών | en_US |
Appears in Collections: | Διπλωματικές Εργασίες - Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Thesis Stathopoulou Foteini.pdf | 1.85 MB | Adobe PDF | View/Open |
Items in Artemis are protected by copyright, with all rights reserved, unless otherwise indicated.