arXiv

Light or Full Verb? A Minimal-Pair Dataset for Probing Phraseological Competence in Language Models

Title: Light or Full Verb? A Minimal-Pair Dataset for Probing Phraseological Competence in Language Models

Abstract:

Common English verbs like 'have' and 'make' serve dual roles: they can act as collocates within light-verb constructions or function as full lexical predicates, exemplified by the contrast between "make a decision" and "make a cake." It remains uncertain whether current language models adequately capture this linguistic distinction. To address this, we present a large-scale, controlled dataset comprising minimally varied English sentence pairs where identical contexts feature the same verb used both as a light verb and as a full verb. Our probing experiments demonstrate that language models can distinguish between these two uses even within minimal contexts, revealing distinct behavioral patterns based on object types. We make the dataset, generation code, and associated materials available as a reusable resource. This framework is designed to be extensible, allowing for integration into broader contexts, the inclusion of additional verbs, and application across other languages.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

TechCrunch

A burglar used a Waymo to steal yoga clothes in San Francisco — and got away with it

A thief stole yoga clothes using a Waymo, but police failed to catch them because the car’s video data was deleted and b...

Goldman Sachs CEO David Solomon on the Coming Mega IPOs
Bloomberg

Goldman Sachs CEO David Solomon on the Coming Mega IPOs

Goldman Sachs CEO David Solomon anticipates a surge in major IPOs, signaling renewed market confidence and significant o...

What Are A.I. Agents Actually Doing?
New York Times

What Are A.I. Agents Actually Doing?

Arena research shows tech professionals are most likely to use AI agents at work, highlighting a strong industry trend i...

TechCrunch

Cash App launches a wand for tap-and-pay

Cash App launched a $25 NFC "Magic Wand" for tap-and-pay, blending viral novelty with practical contactless payments. It...

Databricks CEO Plans to Avoid IPO During Year of Huge Offerings
Bloomberg

Databricks CEO Plans to Avoid IPO During Year of Huge Offerings

Databricks CEO plans to avoid an IPO in 2021, despite a surge in public offerings. This contrasts with earlier reports t...

TechCrunch

Waymo’s spent robotaxi batteries will be used as grid storage

Waymo partners with B2U to repurpose retired robotaxi batteries for grid storage in California and Texas, aligning with ...