Please use this identifier to cite or link to this item:
http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19917

Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Μαρκογιαννάκης, Άρης | - |
| dc.date.accessioned | 2025-11-12T07:29:14Z | - |
| dc.date.available | 2025-11-12T07:29:14Z | - |
| dc.date.issued | 2025-11-05 | - |
| dc.identifier.uri | http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19917 | - |
| dc.description.abstract | Single-cell RNA sequencing has revolutionized biological research by enabling gene expression measurement at cellular resolution, revealing diverse cell types, states, and disease contexts. Recent single-cell foundation models can learn generalizable representations directly from expression data, improving downstream classification and clustering tasks. However, such models typically rely on fixed label spaces that limit their ability to express cellular diversity. This thesis presents Cell2Text, a multimodal generative framework that transforms single-cell transcriptomic profiles into structured natural language descriptions. By integrating pretrained single-cell encoders with large language models through learnable projection modules, Cell2Text generates coherent summaries describing cellular identity, tissue of origin, disease relevance, and biological pathway activity. Experimental results show that Cell2Text achieves higher accuracy than baseline models, maintains strong ontological consistency through PageRank-based similarity metrics, and produces semantically faithful text outputs. Overall, the proposed approach highlights the potential of combining biological and linguistic representations for scalable and informative single-cell characterization. | en_US |
| dc.language | en | en_US |
| dc.subject | Deep Learning | en_US |
| dc.subject | Multimodal Learning | en_US |
| dc.subject | Natural Language Generation | en_US |
| dc.subject | Large Language Models | en_US |
| dc.subject | Foundation Models | en_US |
| dc.subject | Single-cell RNA-seq | en_US |
| dc.title | Cell2Text: Multimodal LLM for Generating Textual Descriptions from Single-Cell RNA-Seq Profiles | en_US |
| dc.description.pages | 109 | en_US |
| dc.contributor.supervisor | Στάμου Γιώργος | en_US |
| dc.department | Division of Information Technology and Computers (Τομέας Τεχνολογίας Πληροφορικής και Υπολογιστών) | en_US |
| Appears in Collections: | Diploma Theses (Διπλωματικές Εργασίες - Theses) | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| thesis_ArisMarkogiannakis.pdf | | 3.74 MB | Adobe PDF | View/Open |
Items in Artemis are protected by copyright, with all rights reserved, unless otherwise indicated.