arXiv

Characterizing initial human-AI proof formalization workflows

Title: Mapping Early Human-AI Collaborative Patterns in Proof Formalization

Abstract:

While human mathematicians have relied on written proofs to validate their arguments for centuries, the automatic verification of these proofs has historically remained a significant hurdle. Recent breakthroughs in artificial intelligence—particularly in code generation and sophisticated mathematical reasoning—offer the potential to revolutionize how proofs are formalized and subsequently verified. Rather than concentrating on benchmarking the capabilities of current state-of-the-art models, this research investigates the practical application of these tools by users. We employ a mixed-methods approach to examine the initial effects of AI on formalization workflows, exploring users' stated objectives, perceived obstacles, and actual adaptation strategies in practice. Qualitative survey data reveals a diverse range of preferences, though there is a prevailing consensus for AI support that maintains substantial human oversight in the proof discovery phase. To evaluate real-world engagement with AI under these constraints, we executed a controlled user study wherein participants attempted to formalize informal mathematical problems and their corresponding proofs, both with and without AI assistance. This study spanned various domains and difficulty levels. Despite existing limitations in autoformalization technology, participants generally achieved higher accuracy when AI tools were available compared to working alone, and most users demonstrated flexibility by integrating multiple distinct AI tools into their processes. Collectively, our findings illuminate the nascent phase of AI integration into formalization, highlighting the complex and interactive dynamic between human and machine collaboration.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)
Bloomberg

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

SpaceX aims for a record $75 billion valuation through an initial public offering. This historic IPO marks a significant...

Broadcom AI Chip Outlook Disappoints Investors
Bloomberg

Broadcom AI Chip Outlook Disappoints Investors

Broadcom’s AI chip projections disappointed investors, dampening market sentiment. The outlook fell short of expectation...

Hiranandani Group CEO on Powering India's Digital Future
Bloomberg

Hiranandani Group CEO on Powering India's Digital Future

Hiranandani Group CEO discusses driving India's digital transformation.

Cerebras Says It’s Working With All AI Gear Makers Except Nvidia
Bloomberg

Cerebras Says It’s Working With All AI Gear Makers Except Nvidia

Cerebras confirmed partnerships with all major AI hardware vendors except Nvidia. This broad engagement positions Cerebr...

Putin Turns Russia’s AI Future Into a Kremlin Family Business
Bloomberg

Putin Turns Russia’s AI Future Into a Kremlin Family Business

Putin is consolidating Russia’s AI ambitions into a Kremlin family business, effectively turning the sector into a dynas...

Reuters

Meta repeatedly pushes back new AI model release for developers, WSJ says

Meta has repeatedly delayed the release of its new AI model for developers, according to the WSJ. This ongoing postponem...