Please use this identifier to cite or link to this item:
http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/17685
Title: | Source Code Classification using Neural Networks |
Authors: | Kanavakis, Eleftherios Γκούμας Γεώργιος |
Keywords: | Source code classification Compilers Abstract Syntax Tree (AST) Long Short Term Memory (LSTM) Hierarchical Attention Network (HAN) Deep Averaging Network (DAN) |
Issue Date: | 9-Sep-2020 |
Abstract: | The purpose of this dissertation is to study the problem of source code classification using neural networks. More specifically, in this problem, a piece of code is classified into an algorithmic class based on the function it performs. Pre-existing research has shown that neural networks are an effective way of modeling source code and solving such classification problems. Although literature results are encouraging, there are limitations related not only to datasets and preprocessing techniques but also to machine learning models. To this end, we propose a system that initially builds quality datasets, which are free of biases and noise. It then uses compilers to process these sets and finally uses neural networks to classify them into an algorithmic class. In the context of optimizing the system above, we studied a variety of pre-processing techniques and machine learning models. |
URI: | http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/17685 |
Appears in Collections: | Διπλωματικές Εργασίες - Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Thesis_Kanavakis.pdf | 3.04 MB | Adobe PDF | View/Open |
Items in Artemis are protected by copyright, with all rights reserved, unless otherwise indicated.