arXiv

Ekka: Automated Diagnosis of Silent Errors in LLM Inference

Title: Ekka: Automated Detection of Silent Errors in LLM Inference

Abstract:

Large Language Model (LLM) serving frameworks are undergoing rapid evolution, characterized by increasingly complex software stacks and a wide array of optimization techniques. This accelerated development cycle often introduces silent errors—issues where output quality degrades without triggering any explicit error signals. Diagnosing these problems is notoriously challenging due to the significant semantic gap that exists between high-level symptoms and their low-level root causes.

We demonstrate that diagnosing silent errors can be effectively addressed as a differential debugging problem by utilizing semantically correct reference implementations. To this end, we introduce Ekka, an automated diagnosis system designed to pinpoint root causes by systematically aligning and comparing intermediate execution states between a target framework and a reference one.

We developed a benchmark comprising real-world silent errors found in popular serving frameworks. On this benchmark, Ekka achieved a pass@1 diagnosis accuracy of 80% and a pass@5 accuracy of 88%, surpassing state-of-the-art systems. Furthermore, Ekka successfully identified four previously unknown silent errors in serving frameworks, all of which were subsequently confirmed by the respective developers.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)
Bloomberg

SpaceX Seeks to Raise $75 Billion in Record IPO (Video)

SpaceX aims for a record $75 billion valuation through an initial public offering. This historic IPO marks a significant...

Broadcom AI Chip Outlook Disappoints Investors
Bloomberg

Broadcom AI Chip Outlook Disappoints Investors

Broadcom’s AI chip projections disappointed investors, dampening market sentiment. The outlook fell short of expectation...

Hiranandani Group CEO on Powering India's Digital Future
Bloomberg

Hiranandani Group CEO on Powering India's Digital Future

Hiranandani Group CEO discusses driving India's digital transformation.

Cerebras Says It’s Working With All AI Gear Makers Except Nvidia
Bloomberg

Cerebras Says It’s Working With All AI Gear Makers Except Nvidia

Cerebras confirmed partnerships with all major AI hardware vendors except Nvidia. This broad engagement positions Cerebr...

Putin Turns Russia’s AI Future Into a Kremlin Family Business
Bloomberg

Putin Turns Russia’s AI Future Into a Kremlin Family Business

Putin is consolidating Russia’s AI ambitions into a Kremlin family business, effectively turning the sector into a dynas...

Reuters

Meta repeatedly pushes back new AI model release for developers, WSJ says

Meta has repeatedly delayed the release of its new AI model for developers, according to the WSJ. This ongoing postponem...