arXiv

SEA-NLI: Natural Language Inference as a Lens into Southeast Asian Cultural Understanding

Title: SEA-NLI: Using Natural Language Inference to Probe Southeast Asian Cultural Comprehension

Abstract: While large language models (LLMs) demonstrate strong capabilities in Western settings, they are rarely evaluated on underrepresented regions, including Southeast Asia (SEA). Current Natural Language Inference (NLI) benchmarks are predominantly centered on Western cultures, often relying on translations or single-language datasets, which restricts their capacity to assess reasoning that is deeply rooted in specific cultural contexts. To address this gap, we present SEA-NLI, a culturally embedded NLI benchmark that encompasses eight Southeast Asian nations. The dataset is available in both English and native regional languages and has been validated by native speakers. Our evaluation across 17 encoder and decoder models reveals consistently low performance across the board, with particularly poor results in knowledge-heavy domains like Languages and Science and Technology. Further analysis indicates that these shortcomings are primarily due to a lack of SEA-specific cultural knowledge. While adapting models to the region and employing culture-aware prompting strategies yielded improvements, Chain-of-Thought (CoT) prompting provided only marginal benefits.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

TechCrunch

The world’s largest privately owned laser just turned on

Xcimer Energy activated the Phoenix laser, the world’s largest privately owned laser, aiming to commercialize fusion pow...

Uber Targets Doubling Its Fleet of Electric Motorcycles in Kenya
Bloomberg

Uber Targets Doubling Its Fleet of Electric Motorcycles in Kenya

Uber plans to double its electric motorcycle fleet in Kenya. This expansion aims to enhance sustainable transport option...

AI Saves Time But Most Companies Waste the Gain, Study Shows
Bloomberg

AI Saves Time But Most Companies Waste the Gain, Study Shows

A study reveals that while AI saves employee time, most companies fail to capitalize on these gains, squandering potenti...

JPMorgan Lifts S&P Target on Earnings 'Supercycle'
Bloomberg

JPMorgan Lifts S&P Target on Earnings 'Supercycle'

JPMorgan raised its S&P 500 target, citing an earnings “supercycle” that reflects heightened confidence in corporate pro...

Europe Sleepwalking Into Economic Ruin, Serb Leader Says
Bloomberg

Europe Sleepwalking Into Economic Ruin, Serb Leader Says

Serbian leader warns Europe is sleepwalking into economic ruin.

Delta Electronics Flags Power Crunch
Bloomberg

Delta Electronics Flags Power Crunch

Delta Electronics warns of a looming power deficit due to surging demand and constrained production, predicting serious ...