Please use this identifier to cite or link to this item: http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19180
Full metadata record
DC FieldValueLanguage
dc.contributor.authorTzachristas, Ioannis-
dc.date.accessioned2024-07-18T08:44:30Z-
dc.date.available2024-07-18T08:44:30Z-
dc.date.issued2024-06-10-
dc.identifier.urihttp://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19180-
dc.description.abstractLarge Language Models (LLMs) have revolutionized various aspects of engineering and science. Their utility is often bottlenecked by the lack of interaction with the external digital environment. In order to overcome this limitation and achieve integration of LLMs and Artificial Intelligence (AI) into real-world applications, customized AI-agents are being constructed every day. Based on the technological trends and the existing techniques, we extract a high-level approach for constructing these AI-agents, focusing on their underlying architecture. This thesis serves as a comprehensive guide that elucidates a multi-faceted approach for empowering LLMs with the capability to leverage Application Programming Interfaces (APIs). We present a 7-step methodology that begins with the selection of suitable LLMs and the task decomposition that is necessary for complex problem-solving. This methodology includes techniques for generating training data for API interactions and heuristics for selecting the appropriate API among a plethora of options. These steps eventually lead to the generation of API calls that are both syntactically and semantically aligned with the LLM's understanding of a given task. Moreover, we review existing frameworks and tools that facilitate these processes and highlight the gaps in current attempts. In this direction, we propose an on-device architecture that aims to exploit the functionality of carry-on devices by using small models from the Hugging Face community. We examine the effectiveness of the aforementioned approaches on real-world applications of various domains, including the generation of a piano sheet. Through an extensive analysis of the literature and available technologies, this thesis aims to set a compass for researchers and practitioners to harness the full potential of LLMs augmented with external tool capabilities, thus paving the way for more autonomous, robust and context-aware AI-agents.en_US
dc.languageenen_US
dc.subjectLarge Language Models (LLMs)en_US
dc.subjectApplication Programming Interfaces (APIs)en_US
dc.subjectArtificial Intelligence (AI)en_US
dc.subjectNatural Language Processing (NLP)en_US
dc.subjectAI-agent Architectureen_US
dc.subjectClassificationen_US
dc.subjectSemantic Vector Spaceen_US
dc.subjectSemantic Alignmenten_US
dc.subjectWord Embeddingen_US
dc.subjectHuman-Computer Interaction (HCI)en_US
dc.subjectMusic Notationen_US
dc.titleCreating an LLM-based AI-agent: A high-level methodology towards enhancing LLMs with APIsen_US
dc.description.pages104en_US
dc.contributor.supervisorΠαπασπύρου Νικόλαοςen_US
dc.departmentΤομέας Τεχνολογίας Πληροφορικής και Υπολογιστώνen_US
dc.description.notesThis Master's Thesis was conducted in Huawei Munich Research Center (Riesstrasse 25, 80992 Munich, Germany) under the supervision of Dr. Aifen Sui and Prof. Nikolaos Papaspyrou.en_US
Appears in Collections:Διπλωματικές Εργασίες - Theses

Files in This Item:
File Description SizeFormat 
Ioannis Tzachristas - Master's Thesis 2024.pdfCreating an LLM-based AI-agent: A high-level methodology towards enhancing LLMs with APIs2.5 MBAdobe PDFView/Open


Items in Artemis are protected by copyright, with all rights reserved, unless otherwise indicated.