Global News Digest

arXiv

Completion at the Boundary (CaB): Deployable Switching with Completion-Aware Control under Limited Calibration

Title: Completion at the Boundary (CaB): Deployable Switching with Completion-Aware Control under Limited Calibration

Abstract: While Vision-Language-Action (VLA) agents possess the capability to follow natural-language instructions, deployed systems currently struggle with a critical operational gap: determining precisely when an instruction has been fulfilled. This challenge is particularly pronounced in short composite commands (e.g., "do A, then B"), where poorly timed transitions can trigger cascading failures in subsequent tasks. Because switching acts as an intervention that alters the instructional context—and thereby influences future actions and observations—completion management is inherently closed-loop.

This study investigates completion protocols within a deployable, low-calibration framework. Motivated by the vastness of open-ended instruction spaces, we enforce strict constraints: no relearning during testing and the use of a single, globally calibrated switching rule derived from the development set and applied unchanged to the test set. Under these limitations, reducing asymmetric boundary evidence to a single scalar value proves fragile, especially when task polarities shift.

To address this, we introduce Completion at the Boundary (CaB). Rather than collapsing evidence, CaB predicts an event-local completion object structured as Boundary-Phase Tokens (Before, Hit, or After), thereby preserving two-sided boundary evidence. The system utilizes two components: CaB-When translates this completion object into a minimal, auditable decision regarding when to switch, while CaB-How leverages the same object to condition action generation, ensuring boundary-stable control during handoffs. Evaluated using an intervention-aware E1/E2 protocol, our results demonstrate that CaB enhances both composite execution and handoff quality on a first-person Minecraft VLA benchmark, achieving these improvements while maintaining matched capacity and deployability constraints.


Source: arXiv Generated at: 2026-06-02 00:00:00 UTC

Related Articles

Schroders Renewable Unit Targets AI Assets as Power Demand Soars
Bloomberg

Schroders Renewable Unit Targets AI Assets as Power Demand Soars

Schroders’ renewable unit targets AI infrastructure, pivoting to meet soaring energy demand from artificial intelligence...

State Street's Paglia on SBI Group Partnership, ETFs
Bloomberg

State Street's Paglia on SBI Group Partnership, ETFs

State Street's Paglia discusses the SBI Group partnership and ETFs, but the source text is missing. Please provide the a...

Nvidia Boss Says Workers Should Be Paid ‘as Much as Possible’
Bloomberg

Nvidia Boss Says Workers Should Be Paid ‘as Much as Possible’

Nvidia CEO Jensen Huang advocates for paying workers “as much as possible,” emphasizing maximum compensation. This stanc...

TSE Talking With Regulator For Easing ETF Listing Rules
Bloomberg

TSE Talking With Regulator For Easing ETF Listing Rules

The Tokyo Stock Exchange is discussing with regulators to ease ETF listing rules. This aims to simplify market access an...

S&P DJI CEO on Japan Markets, Mega IPOs
Bloomberg

S&P DJI CEO on Japan Markets, Mega IPOs

S&P DJI CEO discusses Japan's financial markets and major IPOs.