Please use this identifier to cite or link to this item: http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19731
Full metadata record
DC Field | Value | Language
dc.contributor.author | Στριγγλή, Ελένη | -
dc.date.accessioned | 2025-07-16T07:34:44Z | -
dc.date.available | 2025-07-16T07:34:44Z | -
dc.date.issued | 2025-07-02 | -
dc.identifier.uri | http://artemis.cslab.ece.ntua.gr:8080/jspui/handle/123456789/19731 | -
dc.description.abstract | As Large Language Models (LLMs) continue to grow in scale, they exhibit increasingly sophisticated behaviors, including abilities that resemble logical reasoning. However, the authenticity of such advances remains a subject of debate, with many arguing that they are largely a byproduct of memorization and advanced statistical pattern recognition rather than genuine understanding. To shed light on these limitations, researchers have developed experimental conditions that challenge LLMs to override entrenched associations, highlighting gaps in reasoning and adaptability when compared to human cognition. Inverse scaling tasks are designed to uncover such weaknesses by revealing a paradoxical decline in performance as model scale increases, thereby exposing critical blind spots in scale-driven improvements. In this thesis, we explore the redefinition task, which challenges LLMs to adopt nonstandard definitions for familiar scientific constants and units of measurement and then respond based on these altered values. We evaluate state-of-the-art models from multiple LLM families and demonstrate that larger LLMs not only perform worse at following redefinitions, anchoring more strongly to their memorized knowledge, but also exhibit increased confidence in generating false responses rather than choosing to abstain. In addition, although factors such as response formatting and prompting techniques can influence these behaviors, no strategy fully counteracts the tendency of larger models to revert to pretraining priors. | en_US
dc.language | en | en_US
dc.subject | Large Language Models | en_US
dc.subject | prompt engineering | en_US
dc.subject | inverse scaling | en_US
dc.subject | reasoning capabilities | en_US
dc.subject | adaptability | en_US
dc.subject | interpretability | en_US
dc.title | Pitfalls of Scale: Investigating the Inverse Task of Redefinition in Large Language Models | en_US
dc.description.pages | 107 | en_US
dc.contributor.supervisor | Βουλόδημος Αθανάσιος | en_US
dc.department | Τομέας Τεχνολογίας Πληροφορικής και Υπολογιστών | en_US
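
A note on the abstract above: the redefinition task it summarizes is described only in prose. As a rough illustration, the minimal Python sketch below shows what a redefinition-style prompt and a simple scoring rule might look like. The specific constants, redefined values, question template, and scoring criterion are hypothetical assumptions for illustration, not details taken from the thesis.

# Minimal sketch of a redefinition-style evaluation, in the spirit of the
# setup described in the abstract. REDEFINITIONS and the prompt template
# are hypothetical illustrations, not the thesis's actual test items.

REDEFINITIONS = [
    # (constant, redefined value, question, correct answer under the redefinition)
    ("pi", "462", "What is the first digit of pi?", "4"),
    ("e", "517", "What is the first digit of e?", "5"),
]

def build_prompt(constant: str, new_value: str, question: str) -> str:
    """Compose a prompt asking the model to adopt a nonstandard value."""
    return (
        f"Redefine {constant} as {new_value}. "
        f"Answer the following question using this redefinition.\n"
        f"Q: {question}\n"
        f"A:"
    )

def follows_redefinition(model_answer: str, expected: str) -> bool:
    """The model 'follows' the redefinition if its answer matches the
    redefined value rather than the memorized one (e.g. '4', not '3')."""
    return model_answer.strip().startswith(expected)

if __name__ == "__main__":
    for constant, value, question, expected in REDEFINITIONS:
        print(build_prompt(constant, value, question))
        print("expected under redefinition:", expected)

Under the abstract's inverse-scaling finding, a larger model would more often answer such a prompt with the memorized first digit (e.g. "3" for pi) than with the one implied by the redefinition.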
Appears in Collections: Διπλωματικές Εργασίες - Theses

Files in this item:
File | Description | Size | Format
Diploma_Thesis_Elena_Stringli.pdf | | 8.85 MB | Adobe PDF


All items on this site are protected by copyright.