arXiv

Lean-GAP: A Dataset of Formalized Graduate Algebra Problems

Title: Lean-GAP: A Collection of Formalized Graduate Algebra Exercises

Abstract:

This paper introduces Lean-GAP (Lean-Graduate Algebra Problems), a new dataset comprising 430 graduate-level algebra problems formalized from the textbook Abstract Algebra by Dummit and Foote. We outline a scalable workflow that encompasses PDF-to-LaTeX preprocessing, autoformalization into Lean 4, and the verification of alignment between informal and formal representations. Although the initial preprocessing and autoformalization phases can be largely automated, our findings indicate that verification is the most intricate and labor-intensive stage, necessitating rigorous human oversight.

Our primary contributions are threefold: (i) the creation of a structured repository of formalized exercises; (ii) the establishment of a systematic methodology for formalizing mathematical content from textbooks; and (iii) a detailed examination of persistent challenges encountered during formalization. Additionally, we evaluate the performance of various autoformalization models and identify critical bottlenecks in the translation of informal mathematical statements into formal languages.


Source: arXiv Generated at: 2026-06-03 00:00:00 UTC

Related Articles

TikTok Billionaire Tops Ambani as Asia’s Second-Richest
Bloomberg

TikTok Billionaire Tops Ambani as Asia’s Second-Richest

TikTok founder surpasses Mukesh Ambani to become Asia’s second-richest person, marking a significant shift in the region...

Publishers in UK can opt out of Google AI search results
BBC News

Publishers in UK can opt out of Google AI search results

UK publishers can now opt out of Google’s AI search summaries, a CMA ruling designed to boost their bargaining power and...

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.
Bloomberg

Kioxia Edges Nearer Toyota’s Market Cap in Shakeup to Japan Inc.

Kioxia’s market cap nears Toyota’s, signaling a major shift in Japan’s corporate hierarchy. This narrowing gap highlight...

Reuters

Morning Bid: Marvell, a fitting name for the latest AI darling

Reuters highlights Marvell as a top AI stock, noting its name perfectly suits its status as the newest market darling.

Financial Times

Tim Hayward: I built the Jaguar E-Type of computer keyboards

Tim Hayward compares his bespoke keyboard designs to the Jaguar E-Type. He explores high-end customization for personal ...

Financial Times

AI Labs: Zuckerberg’s $100bn gamble

Meta’s $100 billion AI investment aims to secure AI dominance, but questions remain whether sheer spending can outpace c...