Please use this identifier to cite or link to this item: http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19381
Title: Μελέτη Σφαλμάτων σε Μεγάλα Γλωσσικά Μοντέλα
Authors: ΓΡΗΓΟΡΙΑΔΟΥ, ΝΑΤΑΛΙΑ-ΜΑΡΙΑ
Στάμου Γιώργος
Keywords: Μεγάλα γλωσσικά μοντέλα (LLMs), ανίχνευση hallucination, Επεξεργασία Φυσικής Γλώσσας (NLP), Παραγωγή Φυσικής Γλώσσας (NLG), μοντέλα μαύρου κουτιού, ensemble voting classifier, fine-tuning, prompting, μηχανική μετάφραση, παραγωγή ορισμών, παραγωγή παραφράσεων, Natural Language Inference (NLI)
Issue Date: 24-Oct-2024
Abstract: In recent years, the emergence of Large Language Models (LLMs) has revolutionized Natural Language Processing (NLP), but has also raised critical concerns regarding the trustworthiness and accuracy of their outputs. One of the most pressing issues is hallucination, where LLMs generate information that is se- mantically inconsistent or unrelated to the input. Addressing this challenge is paramount to improving the reliability of LLMs, especially in Natural Language Generation (NLG) tasks. This thesis explores efficient methods for hallucination detection, focusing on black-box settings where the model’s internal workings are inaccessible. The research contributes to the detection of hallucinations through participation in the SemEval-2024 Task 6 (SHROOM), which involves binary classification of hallucinations across tasks such as machine translation, definition modeling, and paraphrase generation. By leveraging pre-trained models fine-tuned on hallucination detection and Natural Language Inference (NLI) datasets, this work achieves significant improvements over baseline systems, with accuracies reaching approximately 80%. Key contributions of this thesis include the development of an ensemble voting classifier combining multiple models to enhance hallucination detection, as well as an in-depth analysis of the challenges posed by different NLG tasks. The findings provide valuable insights into the nature of hallucinations in LLMs and offer a robust framework for future efforts to mitigate this inherent issue.
URI: http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19381
Appears in Collections:Διπλωματικές Εργασίες - Theses

Files in This Item:
File Description SizeFormat 
Thesis_Grigoriadou.pdf3.46 MBAdobe PDFView/Open


Items in Artemis are protected by copyright, with all rights reserved, unless otherwise indicated.