Technology News - Global News Digest

arXiv

Detect Before You Leap: Mirage Detection in Vision-Language Models

June 2, 2026 · Sayeed Shafayet Chowdhury, Md. Shaown Miah

This study introduces TC-LIA, a pre-release mirage detection method for VLMs that tracks image-text alignment across layers. It reaches about 94.7% accuracy and cuts mirage rates to below 3%, compared with 21% for baselines.

arXiv

CodeCytos: AI-assisted spatial molecular imaging analysis via code-augmented agent action space

June 2, 2026 · Hung Q. Vo, Huy Q. Vo, Son T. Ly, Zhihao Wan, Anh-Vu Nguyen, Hong Zhao, Jianting Sheng, Stephen T. C. Wong, Hien V. Nguyen

CodeCytos uses AI agents to automate and customize spatial molecular imaging analysis. It enables dynamic, code-driven exploration of complex tissue data without extensive manual input.

arXiv

Pre-Deployment Robustness Stress Testing for CT Segmentation Systems Using Clinically Motivated Multi-Corruption Augmentation

June 2, 2026 · CholMin Kang, Jonghyun Chung, Amanpreet Kaur, Nagesh Gulkotwar, Arthi Sivasankaran

RAMP, a multi-corruption augmentation framework, significantly improves CT segmentation robustness against clinical noise. It reduces performance gaps on corrupted images, ensuring safer real-world deployment.

arXiv

TabChange: Precise Attribute Changes in Tabular Data

June 2, 2026 · Arjun Dahal, Yu Lei, Raghu N. Kacker, Richard Kuhn

TabChange precisely edits tabular attributes by adapting to feature correlations, ensuring realistic, instance-specific changes. It outperforms baselines by generating more valid counterfactuals while preserving natural data structures.

arXiv

Skill or Skip? Learning Selective Skill Invocation in Agentic Tasks via Dual-Granularity Preference Learning

June 2, 2026 · Chishui Chen, Jiaye Lin, Te Sun, Junxi Wang, Yi Yang, Cong Qin, Yangen Hu, Lu Pan, Ke Zeng

SelSkill enables selective skill invocation via dual-granularity preference learning, significantly boosting task success and precision on benchmarks like ALFWorld and BFCL.

arXiv

V-LynX: Token Interface Alignment for Video+X LLMs

June 2, 2026 · Jungin Park, Jiyoung Lee, Kwanghoon Sohn

V-LynX integrates new modalities into Video LLMs via a lightweight auxiliary pathway, aligning them with the model’s internal token interface. This approach achieves state-of-the-art performance without heavy encoders or paired supervision.

arXiv

PaCo-VLA: Passivity-Shielded Compliance Prior for Contact-Rich Vision-Language-Action Manipulation

June 2, 2026 · Haofan Cao, Zhaoyang Li, Zhichao You, Liang Guo, Tianrui Li

PaCo-VLA decouples semantic reasoning from control by using a passivity shield to regulate VLA compliance proposals, ensuring safe, high-frequency contact dynamics. This approach prevents unsafe predictions from overriding physics, achieving superior precision in contact-rich manipulation tasks.

arXiv

CAFOSat: A Strongly Annotated Dataset for Infrastructure-Aware CAFO Mapping Using High-Resolution Imagery

June 2, 2026 · Oishee Bintey Hoque, Nibir Chandra Mandal, Mandy L Wilson, Samarth Swarup, Madhav Marathe, Abhijin Adiga

CAFOSat is a strongly annotated dataset of 45,000 high-resolution patches for mapping US CAFOs with infrastructure details. It enhances model robustness through refined annotations and synthetic augmentation.

arXiv

Interpretable Policy Distillation for Power Grid Topology Control

June 2, 2026 · Aleksandra Dmitruka, Karlis Freivalds

This study distills a deep RL power grid controller into interpretable tree models, achieving higher performance and transparency. The lightweight surrogates enable real-time deployment while revealing key operational drivers.

arXiv

A Practical Upper Bound on Selection Bias Effects in Medical Prediction Models

June 2, 2026 · Kara Liu, Maggie Wang, Russ B. Altman

This study introduces a practical upper bound to estimate selection bias effects in medical prediction models using partially observable data. Validated on synthetic and real-world datasets, it offers a robust framework for assessing model generalizability in healthcare.

arXiv

Richer Representations for Neural Algorithmic Reasoning via Auxiliary Reconstruction

June 2, 2026 · Jiafu Huang, Chao Peng, Chenyang Xu, Zhengfeng Yang, Kecheng Cai, Chenhao Zhang, Yi Wang, Yiwei Gong, Wanqin Zhou, Irene Zheng

This study enhances neural algorithmic reasoning by introducing auxiliary reconstruction to improve encoder representations. This approach boosts performance on benchmarks by preserving input details and capturing intra-state feature dependencies.

arXiv

Revisiting Parameter-Based Knowledge Editing in Large Language Models: Theoretical Limits and Empirical Evidence

June 2, 2026 · Wanying Ren, Xin Song, Futing Wang, Guoxiu He, Aixin Sun

This study reveals that parameter-based knowledge editing in LLMs causes reasoning collapse due to interference. Retrieval-based methods consistently outperform editing, highlighting the need to preserve core model capabilities.

arXiv

On the Difficulty of Learning a Meta-network for Training Data Selection

June 2, 2026 · Zilin Du, Junqi Zhao, Boyang Albert Li

This study identifies low gradient signal-to-noise ratio and poor feature correlation as key hurdles in meta-network training data selection. Increasing batch size and using informative features significantly improve performance across benchmarks.

arXiv

Improving Visual Representation Alignment Generation with GRPO

June 2, 2026 · Shentong Mo, Sukmin Yun

VRPO replaces static alignment with reinforcement learning, boosting diffusion transformer training efficiency and image fidelity. It achieves a 1.8x FID improvement and 2.3x faster convergence than REPA, with minimal overhead.

arXiv

Critic-R: Improving Agentic Search using Instruction-tuned Retrievers with Natural Language Introspective Feedback

June 2, 2026 · Md Zarif Ul Alam, Alireza Salemi, Hamed Zamani

Critic-R improves agentic search via a critic model providing natural language feedback. It refines queries at inference and optimizes retrievers using automatic supervision, boosting accuracy without manual annotations.

arXiv

SPADER: Step-wise Peer Advantage with Diversity-Aware Exploration Rewards for Multi-Answer Question Answering

June 2, 2026 · Qiming Shi, Zhaolu Kang, Yunfan Zhou, Di Weng, Yingcai Wu

SPADER enhances multi-answer QA via step-wise peer advantage and diversity-aware rewards. It outperforms existing methods in recall and F1 on benchmarks like QAMPARI.

arXiv

CARE-RL: Capability-Aware Reinforcement Learning for Mitigating Cross-Domain Conflicts

June 2, 2026 · Rui Zhang, Xinle Wu, Yao Lu

CARE-RL mitigates cross-domain conflicts in RL via protocol-aware rewards and capability-aware optimization. It outperforms baselines in experiments with the Qwen2.5-7B and Qwen3-4B models.

arXiv

MemGraphRAG: Memory-based Multi-Agent System for Graph Retrieval-Augmented Generation

June 2, 2026 · Chuanjie Wu, Zhishang Xiang, Yunbo Tang, Zerui Chen, Qinggang Zhang, Jinsong Su

MemGraphRAG uses a memory-based multi-agent system to build coherent knowledge graphs, resolving fragmentation issues in GraphRAG. It outperforms state-of-the-art baselines in retrieval accuracy and efficiency.

arXiv

MemPro: Agentic Memory Systems as Evolvable Programs

June 2, 2026 · Qingshan Liu, Guoqing Wang, Wen Wu, Jingqi Huang, Xinqi Tao, Dejia Song, Jie Zhou, Liang He

MemPro treats agentic memory as evolvable code, using an agent to iteratively refine the entire retrieval pipeline. It outperforms static baselines by adapting to failures across diverse benchmarks.

arXiv

Authenticity Debt and the Synthetic Content Threat Landscape: A Layered Framework for Trust, Provenance, and IP Governance in the Generative AI Era

June 2, 2026 · Shubhashis Sengupta, Benjamin McCarty, Milind Savagaonkar, Rhine Andotra

The paper defines "authenticity debt" arising from unverified AI-generated content and proposes a layered framework that combines cryptographic provenance, human verification, and governance to mitigate risks and support trust.