Please use this identifier to cite or link to this item: http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/18542
Full metadata record
DC FieldValueLanguage
dc.contributor.authorΓαλάνης, Δημήτρης-
dc.date.accessioned2022-11-14T14:15:44Z-
dc.date.available2022-11-14T14:15:44Z-
dc.date.issued2022-10-31-
dc.identifier.urihttp://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/18542-
dc.description.abstractThe rapid increase of digitized text documents has accentuated the need for reliable automatic methods that discern the important information from the unimportant. In the legal domain of court judgements, this process is done mostly manually by specialized legal editors, which is a time-consuming process. However, court judgement summaries are an essential part of a legal practitioner’s workflow, as they are shorter in length, thus enabling faster and more specific search for relevant case-laws. Furthermore, summarized versions of court judgements allow the legal practitioner to intuitively focus on its main points and thus acquire a better understanding of it. Recent advances in Machine Learning have enabled better performance in Automatic Text Summarization (ATS) systems, in terms of automatic evaluation metrics. Moreover, deep pre-trained Language Models enable the use of ATS without large amounts of training data. However, most methods are trained and evaluated for the news-article domain, which differs from the court-judgements domain as the latter includes longer documents, having significantly different structure and making use of specialized legal terminology. In our work, we attempt to automatically summarize Greek court judgements using machine learning methods. To that end, we first conduct an extended survey of the automatic text summarization literature; the methods, the datasets and evaluation metrics used and the criticism that has been applied to them. Then we proceed by constructing a dataset of Greek court judgement texts and their summaries. We build an extractive summarization system, based on the LexRank algorithm, that extracts the most important sentences from a judgement. We train an Encoder-Decoder Deep Learning model based on the BERT architecture, using open-sourced checkpoints trained on Greek parliamentary corpora and use it to model abstractive summarization as a sequence generation task. We evaluate our methods using the ROUGE-family of automatic evaluation metrics and also conduct a human evaluation study. We show that domain informed preprocessing and including judgement classification information can increase the performance of our abstractive summarization methods. We provide a comparison of different variations of our extractive summarization methods. Legal experts’ evaluation shows our extractive methods perform average, and our abstractive methods, while generating moderately fluent and coherent text, have low scores in the relevance and consistency metrics, indicating the need of methods factually aligned to the judgement text.en_US
dc.languageenen_US
dc.subjectAutomatic Text Summarizationen_US
dc.subjectCourt Judgementsen_US
dc.subjectMachine Learningen_US
dc.subjectNeural Networksen_US
dc.subjectNatural Language Processingen_US
dc.subjectBERTen_US
dc.subjectROUGEen_US
dc.subjectLegal-AIen_US
dc.subjectΑυτόματη Περίληψη Κειμένουen_US
dc.subjectΔικαστικές Αποφάσειςen_US
dc.subjectΜηχανική Μάθησηen_US
dc.subjectΝευρωνικά Δίκτυαen_US
dc.subjectΕπεξεργασία Φυσικής Γλώσσαςen_US
dc.subjectΤεχνητή Νοημοσύνη και Δίκαιοen_US
dc.titleAutomatic Summarization of Court Judgements using Machine Learning, with applications to summarizing Greek Court Judgementsen_US
dc.description.pages199en_US
dc.contributor.supervisorΤσανάκας Παναγιώτηςen_US
dc.departmentΤομέας Τεχνολογίας Πληροφορικής και Υπολογιστώνen_US
Appears in Collections:Διπλωματικές Εργασίες - Theses

Files in This Item:
File Description SizeFormat 
NTUA_ECE_Thesis_Template_EN__Copy_ (24).pdf4.85 MBAdobe PDFView/Open


Items in Artemis are protected by copyright, with all rights reserved, unless otherwise indicated.