arXiv

On the Persistent Effects of Lexicality in Large Language Mod

Title: The Enduring Impact of Lexicality in Large Language Models

Abstract: Extracted representations from large language models (LLMs) are critical for numerous downstream applications. Yet, the architecture of these representations is frequently shaped by lexical overlap instead of semantic substance. Our comprehension of how lexical influence interacts with semantic content, and what this means for practical applications, remains constrained. This study examines LLM representations to measure the magnitude of lexical overlap in comparison to semantic content. We employ various adversarial semantic stress tests and relate our results to an information-theoretic framework. Our analysis reveals that lexical influence permeates the entire depth of the models, a pattern that persists across different architectures, training methods, and objective functions, even among models specifically optimized for semantic similarity. Additionally, we identify a transitional zone in the middle layers where both lexical and semantic signals deteriorate concurrently, suggesting a phase in which representations are ineffective for capturing both surface-level forms and deeper meaning. To illustrate the real-world consequences of this lexical influence, we analyze its impact on downstream LLM utility through case studies involving summarization and model editing.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

TechCrunch

The world’s largest privately owned laser just turned on

Xcimer Energy activated the Phoenix laser, the world’s largest privately owned laser, aiming to commercialize fusion pow...

Uber Targets Doubling Its Fleet of Electric Motorcycles in Kenya
Bloomberg

Uber Targets Doubling Its Fleet of Electric Motorcycles in Kenya

Uber plans to double its electric motorcycle fleet in Kenya. This expansion aims to enhance sustainable transport option...

AI Saves Time But Most Companies Waste the Gain, Study Shows
Bloomberg

AI Saves Time But Most Companies Waste the Gain, Study Shows

A study reveals that while AI saves employee time, most companies fail to capitalize on these gains, squandering potenti...

JPMorgan Lifts S&P Target on Earnings 'Supercycle'
Bloomberg

JPMorgan Lifts S&P Target on Earnings 'Supercycle'

JPMorgan raised its S&P 500 target, citing an earnings “supercycle” that reflects heightened confidence in corporate pro...

Europe Sleepwalking Into Economic Ruin, Serb Leader Says
Bloomberg

Europe Sleepwalking Into Economic Ruin, Serb Leader Says

Serbian leader warns Europe is sleepwalking into economic ruin.

Delta Electronics Flags Power Crunch
Bloomberg

Delta Electronics Flags Power Crunch

Delta Electronics warns of a looming power deficit due to surging demand and constrained production, predicting serious ...