arXiv

Neuron Populations Exhibit Divergent Selectivity with Scale

Title: Scaling Laws Drive Divergent Selectivity in Neuron Populations

Abstract:

This study examines whether the composition of neuron populations within neural networks undergoes predictable changes as model scale increases, thereby extending the application of scaling laws beyond traditional macroscopic metrics like loss. To address this, we focus on "Rosetta Neurons," a distinct class of units identified by Dravid et al. (2023) for their consistent activation patterns across independently trained models. Through separate analyses of language models reaching 30 billion parameters and vision models up to 5 billion parameters, we find that the total count of Rosetta Neurons increases with model size according to a sublinear power law. Consequently, while their absolute numbers grow, they constitute a diminishing proportion of the total neuronal population.

We also identify a "Neuron Polarization Effect," wherein Rosetta Neurons become increasingly monosemantic and selective as models scale, effectively diverging from the expanding non-Rosetta population, which retains lower selectivity. An analytical framework that weighs feature utility against the constraints of limited neuron capacity provides a theoretical explanation for both the sublinear scaling and this polarization phenomenon. Furthermore, our findings indicate that Rosetta Neurons exhibit heightened domain specialization at larger scales. We demonstrate their specific selectivity through a case study involving targeted data filtering for continued pretraining. These results establish a scaling law for interpretable, shared neuron-level structures, connecting model size to systematic shifts in neuron universality, selectivity, and specialization.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

TikTok Billionaire Tops Ambani as Asia’s Second-Richest
Bloomberg

TikTok Billionaire Tops Ambani as Asia’s Second-Richest

TikTok founder surpasses Mukesh Ambani to become Asia’s second-richest person, marking a significant shift in the region...

Publishers in UK can opt out of Google AI search results
BBC News

Publishers in UK can opt out of Google AI search results

UK publishers can now opt out of Google’s AI search summaries, a CMA ruling designed to boost their bargaining power and...

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.
Bloomberg

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.

Kioxia’s market cap nears Toyota’s, signaling a major shift in Japan’s corporate hierarchy. This narrowing gap highlight...

Reuters

Morning Bid: Marvell, a fitting name for the latest AI darling

Reuters highlights Marvell as a top AI stock, noting its name perfectly suits its status as the newest market darling.

Financial Times

Tim Hayward: I built the Jaguar E-Type of computer keyboards

Tim Hayward compares his bespoke keyboard designs to the Jaguar E-Type. He explores high-end customization for personal ...

Financial Times

AI Labs: Zuckerberg’s $100bn gamble

Meta’s $100 billion AI investment aims to secure AI dominance, but questions remain whether sheer spending can outpace c...