arXiv

Are Tools Always Beneficial? Learning to Invoke Tools Adaptively for Dual-Mode Multimodal LLM Reasoning

Title: Reevaluating the Utility of Tools: Adaptive Invocation for Dual-Mode Multimodal LLM Reasoning

Abstract: While tool-augmented reasoning represents a promising avenue for strengthening the inferential powers of multimodal large language models (MLLMs), current research predominantly concentrates on the mechanics of tool invocation, often overlooking the critical question of when tools are actually necessary. We posit that relying on tools is not universally advantageous; superfluous or ill-suited invocations can significantly inflate reasoning costs and potentially distort model predictions. To mitigate these challenges, we present AutoTool, a framework that dynamically determines the necessity of tool usage based on specific query attributes. Operating within a reinforcement learning paradigm, AutoTool employs an explicit dual-mode reasoning strategy, utilizing distinct reward functions for each mode to steer the model toward precise outputs. Furthermore, to avoid early convergence on a single reasoning approach, the system simultaneously explores and balances tool-assisted and text-centric reasoning during training, encouraging broader exploration in subsequent phases. Comprehensive evaluations confirm that AutoTool achieves superior performance and efficiency, securing a 21.8% accuracy increase on the V* benchmark relative to the baseline model, and delivering a 44.9% efficiency boost over current tool-augmented methods on the POPE benchmark. The codebase is accessible at https://github.com/MQinghe/AutoTool.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

Who’s Excited for SpaceX’s I.P.O.? Space Nerds.
New York Times

Who’s Excited for SpaceX’s I.P.O.? Space Nerds.

Space enthusiasts are the most eager for SpaceX’s IPO, driven by their passion for space exploration.

TechCrunch

Apple touts $1.4 trillion in App Store billings and sales, 90% without a commission

Apple reported $1.4 trillion in App Store billings for 2025, noting 90% were commission-free. Digital sales rose to $149...

Dimon and SpaceX Executives to Pitch IPO to Clients
Bloomberg

Dimon and SpaceX Executives to Pitch IPO to Clients

JPMorgan Chase CEO Jamie Dimon and SpaceX executives are pitching IPO details to clients.

Financial Times

Europe is finally flexing its innovation muscles

The EU’s new tech sovereignty package signals a positive shift from defensive regulation to proactive innovation, markin...

Apollo’s Zelter Expects High-Grade Debt Sales to Top US Treasuries
Bloomberg

Apollo’s Zelter Expects High-Grade Debt Sales to Top US Treasuries

Apollo’s Zelter expects high-grade debt sales to surpass US Treasuries. He anticipates investment-grade debt outperformi...

EU Insurance Watchdog Warns on Loan Risks
Bloomberg

EU Insurance Watchdog Warns on Loan Risks

EIOPA warns insurers to closely monitor loan risks, though initial reports lack specific details on the nature or scope ...