arXiv

IDEAFix: Evaluation Framework for Creative Defixation Prompting in LLMs

Title: IDEAFix: A Framework for Assessing Creative Defixation in Large Language Models

Abstract:

As large language models (LLMs) become more prevalent in creative problem-solving and idea generation, their actual creative potential remains a subject of debate. While some research highlights their superior performance relative to humans, other studies point to structural constraints, such as fixation and the tendency toward output homogenization. Current evaluation methods often fall short; they either utilize narrow, decontextualized tasks that fail to capture goal-oriented generation or employ broader settings where multiple variables of the creative process are confounded. This complexity makes it difficult to isolate the specific impacts of task formulation, prompting techniques, and evaluation design. Furthermore, the influence of structured prompting strategies on idea generation has received limited attention.

To address these gaps, we present IDEAFix, an evaluation framework designed to analyze divergent thinking within open-ended idea generation contexts. Our approach involves prompting models to produce multiple original solutions to controlled variations of short design scenarios, adjusting for task attributes and defixation prompting strategies. This methodology allows for a systematic examination of how structured guidance shapes LLM output.

Our findings indicate that both the formulation of the task and the selection of attributes play significant roles in model performance, with simple prompting strategies capable of enhancing the originality of the generated solutions. Despite these improvements, we observed persistent homogenization of outputs across different models, underscoring inherent limitations in their capacity to produce truly diverse ideas. Ultimately, IDEAFix offers a controlled and extensible platform for investigating the mechanisms that drive creativity in large language models.


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC

Related Articles

Law’s Billable Hour Is Being Shredded by AI
Bloomberg

Law’s Billable Hour Is Being Shredded by AI

AI is dismantling the billable hour by automating routine legal tasks. This technological shift threatens the traditiona...

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026
Bloomberg

Iran War: Trump Tries to Stop Israel’s Lebanon Push | The Opening Trade 6/2/2026

SoftBank in Early Talks to Back $800 Million Agile Robots Round
Bloomberg

SoftBank in Early Talks to Back $800 Million Agile Robots Round

SoftBank is in early talks to back Agile Robots’ $800 million funding round. The Japanese tech giant is currently in pre...

Amundi Is Diversifying Risk Via Commodity Currencies, Gold
Bloomberg

Amundi Is Diversifying Risk Via Commodity Currencies, Gold

Amundi diversifies risk by investing in commodity-linked currencies and gold. This strategy hedges against market volati...

Reuters

Marvell Technology surges after Nvidia's Huang calls it 'next trillion-dollar company'

Marvell Technology shares surged after Nvidia CEO Jensen Huang labeled the firm the “next trillion-dollar company.”

Russia Says It Found Foreign Spyware on Top Officials’ Phones
Bloomberg

Russia Says It Found Foreign Spyware on Top Officials’ Phones

Russia’s FSB claims to have discovered foreign spyware on senior officials’ phones. Moscow attributes the intrusion to h...