arXiv

LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning

June 3, 2026 · Xinwu Ye, Yicheng Mao, Yuxuan Liao, Jia Zhang, Yimeng Liu, Li Hao, Fang Wu, Zhiwei Li, Zehong Wang, Zhiyuan Liu, Zhenfei Yin, Li Yuan, Philip Torr, Huan Sun, xiangxiang Zeng, Mengdi Wang, Le Cong, Shenghua Gao, Xiangru Tang · Original Source

Title: LatentChem: Transitioning from Textual Chain-of-Thought to Latent Thinking in Chemical Reasoning

Abstract:

Current chemical large language models (LLMs) primarily depend on explicit Chain-of-Thought (CoT) methodologies to address complex reasoning challenges. However, compelling nonverbal, tacit chemical logic into discrete natural language creates a fundamental "modality mismatch." This forced translation establishes an artificial bottleneck that hinders effective reasoning. To address this, we present LatentChem, a novel reasoning interface that separates chemical logic from linguistic generation. This approach allows the model to process information through dynamic perception and continuous thought vectors.

Our research highlights a significant emergent behavior: spontaneous internalization. We define this phenomenon as the self-selected adoption of internal processes when optimization is focused solely on outcomes. Under outcome-only optimization, the model discards verbose textual derivations in favor of implicit latent computation. This suggests that the model recognizes the continuous manifold as a more native substrate for chemical logic.

This paradigm shift also demonstrates superior computational efficiency. On the rigorous ChemCoTBench, LatentChem secured a 59.88% non-tie win rate against a strong CoT baseline. Furthermore, it delivered an average reduction of 10.84$\times$ in reasoning step overhead, translating to a 5.96$\times$ improvement in wall-clock speedup, across all evaluated benchmarks. These findings offer empirical evidence that chemical reasoning is more naturally and effectively executed as continuous latent dynamics rather than discretized linguistic trajectories.

Source: arXiv Generated at: 2026-06-03 00:00:00 UTC