Technology News - Global News Digest

arXiv

TechGraphRAG: An Agentic Graph-Augmented RAG Framework for Technical Literature Reasoning

June 2, 2026 · Kanwar Bharat Singh

TechGraphRAG is an agentic RAG framework using a 13-step workflow, Neo4j graphs, and iterative searches to reason through 2,100 vehicle control articles with self-correcting quality audits.

arXiv

FedMTFI: Feature Importance Based Optimized Multi Teacher Knowledge Distillation in Heterogeneous Federated Learning Environment

June 2, 2026 · Nazmus Shakib Shadin, Aaron Cummings, Xinyue Zhang, Bobin Deng

FedMTFI enhances heterogeneous federated learning by clustering clients and using Shapley values for multi-teacher knowledge distillation. This approach improves accuracy and interpretability on non-IID data.

arXiv

Demystifying Multimodal Biomolecular Co-design With Intrinsic Geodesic Coupling

June 2, 2026 · Keyue Qiu, Xintong Wang, Zhilong Zhang, Hao Zhou, Wei-Ying Ma

GeoCoupling optimizes temporal alignment in biomolecular co-design, surpassing synchronous baselines. This yields biomolecules with enhanced physical validity and diversity.

arXiv

EvoPool: Evolutionary Programmatic Annotation for Label-Efficient Specialized Supervision

June 2, 2026 · Tianyi Xu, Yaolun Zhang, Xuan Ouyang, Huazheng Wang

EvoPool is a label-efficient, evolutionary multi-agent framework that generates specialized annotation code at near-zero cost. It outperforms LLM baselines in complex tasks, achieving significant macro-F1 gains while being thousands of times faster.

arXiv

AlphaToken: Decoupling Adaptation and Stability for Path-Aware Response Token Valuation in LLM Post-Training

June 2, 2026 · Liu Qing, Ou Wu, Yi Du

AlphaToken decouples adaptation and stability for path-aware token valuation in LLM post-training. It filters low-value tokens to enhance performance while preventing catastrophic forgetting.

arXiv

A Framework for Graph-Conditioned Hierarchical Shapley Attribution in Patent Valuation

June 2, 2026 · Joy Bose

PatentXAI uses graph-conditioned Shapley values to efficiently value patents within large portfolios. It achieves high accuracy in milliseconds by limiting computations to each patent's Markov Blanket.

arXiv

MINTS: Minimalist Thompson Sampling

June 2, 2026 · Kaizheng Wang

MINTS is a minimalist Bayesian approach restricting priors to the optimum’s location, enabling efficient handling of structural constraints. It achieves near-optimal regret bounds for constrained multi-armed bandits.

arXiv

DOT-MoE: Differentiable Optimal Transport for MoEfication

June 2, 2026 · Udbhav Bamba, Arnav Chavan, Aryamaan Thakur, Steve Teig, Deepak Gupta

DOT-MoE transforms dense LLMs into efficient MoEs using differentiable optimal transport for neuron assignment. It retains 90% performance while halving active parameters, outperforming heuristic baselines.

arXiv

E4GEN: Event-level Explainable Extreme-Enhanced Time-series Generation

June 2, 2026 · Lin Jiang, Dahai Yu, Ximiao Li, Guang Wang

E4GEN is an explainable diffusion framework generating realistic time-series data with high extreme-event fidelity. It outperforms state-of-the-art models across multiple dimensions, including overall fidelity and downstream utility.

arXiv

Time-Aware Diffusion based on Preference Disentanglement for Generative Recommendation

June 2, 2026 · Bangguo Zhu, Peng Huo, Yuanbo Zhao, Zhicheng Du, Jun Yin, Senzhang Wang

TDPM introduces time-aware diffusion to generative recommendation by disentangling stable period preferences from recent point preferences. It significantly outperforms baselines, achieving up to 29.21% HR@20 gains.

arXiv

HAIM: Human-AI Music Datasets for AI Music Production Tracking Benchmark

June 2, 2026 · Seonghyeon Go, Yumin Kim

HAIM introduces a benchmark dataset tracking specific AI integration points in music production, moving beyond binary detection. It enables granular evaluation of hybrid human-AI workflows and exposes weaknesses in current detectors.

arXiv

Off-the-Shelf LLMs as Process Scorers: Training-Free Alternative to PRMs for Mathematical Reasoning

June 2, 2026 · Atoosa Chegini, Soheil Feizi

This training-free method uses pre-trained LLMs as process scorers to guide smaller models' mathematical reasoning. By ranking fixed-length candidate chunks, it prevents error propagation without requiring additional training.

arXiv

Understanding Identity Continuity in Thermal Video through Scene-Level Consistency

June 2, 2026 · Wei-Chieh Sun, Gyungmin Ko, Heejae Kwon, Hsiang-Wei Huang, Jenq-Neng Hwang

This study introduces a lightweight identity-repair backend for thermal MOT, boosting IDF1 to 84.93 via scene-level consistency. It proves that high-precision trajectory relinking outperforms complex re-identification models for robust identity recovery.

arXiv

Easier to Mislead Than to Correct: Harmful and Beneficial Revision in LLM Conformity

June 2, 2026 · Jiaming Qu, Lucheng fu, Yibo Hu

LLMs are easily misled by peer consensus, yet hard to correct when wrong. Authority labels worsen this, and reasoning prompts fail to fix the asymmetry without hindering beneficial updates.

arXiv

RPCASSM: Robust PCA State Space Model For Infrared Small Target Detection

June 2, 2026 · Pingping Liu, Aohua Li, Yubing Lu, Jin Kuang, Tongshun Zhang, Qiuzhan Zhou

RPCASSM uses Robust PCA and specialized state space modules to accurately detect infrared small targets. It outperforms existing models by better capturing target boundaries and background details.

arXiv

JenBridge: Adaptive Long-Form Video Soundtracking across Scene Transitions

June 2, 2026 · Jiashuo Yu, Yao Yao, Boyu Chen, Alex Wang

JenBridge is a Transformer-based framework for adaptive long-form video soundtracking, using an LLM agent to ensure seamless scene transitions. It outperforms existing methods in narrative coherence and audio fidelity on the new LVS Benchmark.

arXiv

Fair Finetuning Mitigates Distribution Inference Attacks

June 2, 2026 · Rakshit Naidu

Fair Fine-tuning mitigates distribution inference attacks by enforcing Equalized Odds on complementary data. It theoretically bounds adversarial advantage and empirically reduces leakage across diverse datasets.

arXiv

Two-Fidelity Best-Action Identification for Stochastic Minimax Tree

June 2, 2026 · Peter Chen, Xi Chen

The paper introduces 2FFS, a dual-fidelity tree-search algorithm for stochastic minimax trees that balances cheap, biased evaluations with costly, accurate ones. It significantly reduces sample and computational costs while ensuring fixed-confidence best-action identification.

arXiv

Shortcut to Nowhere: Demystifying Deep Spurious Regression

June 2, 2026 · Guanrong Xu, Jessica Li, Hao Wang, Yuzhe Yang

This paper introduces Deep Spurious Regression (DSR) to address continuous prediction challenges. It leverages attribute similarities to calibrate distributions, ensuring robust generalization beyond training data.

arXiv

Post-Deterministic Distributed Systems: A New Foundation for Trustworthy Autonomous Infrastructure

June 2, 2026 · Jun He, Deying Yu

The paper introduces Post-Deterministic Distributed Systems (PDDS) to coordinate heterogeneous autonomous agents. It proposes five architectural pillars, including Epistemic State Replication, to ensure trustworthy infrastructure beyond traditional deterministic models.