Technology News - Global News Digest

arXiv

A Primer in Post-Training Reasoning Data: What We Know About How It Works

June 2, 2026 · Yaoming Li, Guangxiang Zhao, Qilong Shi, Lin Sun, Xiangzheng Zhang, Tong Yang

This primer consolidates insights from 150+ studies to structure post-training reasoning data research. It addresses data nature, effectiveness, construction, and scaling to guide future model development.

arXiv

Jailbreaking Multimodal Large Language Models using Multi-Clip Video

June 2, 2026 · Choongwon Kang, Seungjong Sun, Hyunmin Jun, Jang Hyun Kim

This study introduces Multi-Clip Video SafetyBench, revealing that increasing video clip variety significantly boosts jailbreak success in MLLMs. It proposes leveraging image stability as a defensive strategy against these video-specific vulnerabilities.

arXiv

LALE: Lightweight-Transformer Architecture for Land-Cover Estimation

June 2, 2026 · \"Umit Mert \c{C}a\u{g}lar, Alptekin Temizel

LALE is a lightweight transformer for land-cover estimation that balances efficiency and performance. It achieves high accuracy with significantly fewer parameters and computational costs than baselines.

arXiv

How Hard Can It Be? Hardness-Aware Multi-Objective Unlearning

June 2, 2026 · Jiangwei Chen, Xinyuan Niu, Rachael Hwee Ling Sim, Zhengyuan Liu, Nancy F. Chen, Bryan Kian Hsiang Low

HAMU uses hardness-aware multi-objective optimization to guarantee forget quality improvements while minimizing retain utility loss. It identifies unavoidable trade-offs and outperforms baselines on image and text datasets.

arXiv

Variational Learning for Insertion-based Generation

June 2, 2026 · Yangtian Zhang, Zhe Wang, Arthur Gretton, Rex Ying, David van Dijk, Michalis K. Titsias, Jiaxin Shi

The Insertion Process (IP) model learns variable-length generation and insertion order via permutation-based variational inference. It outperforms fixed-grid methods in molecular and planning tasks by adapting to non-monotonic structures.

arXiv

Understanding-Enhanced Model Collaboration for Long-Tailed Egocentric Mistake Detection

June 2, 2026 · Boyu Han, Qianqian Xu, Shilong Bao, Zhiyong Yang, Ruochen Cui, Qingming Huang

UE-MCM combines lightweight and large models to detect rare egocentric errors, using dynamic collaboration and specialized loss functions to handle long-tailed distributions efficiently.

arXiv

Rethinking Evaluation Paradigms in IBP-based Certified Training

June 2, 2026 · Konstantin Kaulen, Hadar Shavit, Holger H. Hoos

This paper proposes Pareto front comparisons to fairly evaluate IBP-based certified training, revealing that prior methods were often undertuned. This approach establishes new state-of-the-art results and exposes significant performance complementarities among existing techniques.

arXiv

VLBM: Variational Latent Basis Modeling for OOD Robust Multivariate Time Series Forecasting

June 2, 2026 · Xudong Zhang, Jierui Lei, Jiacheng Li, Lingdong Shen, Jian Cui, Haina Tang

VLBM is a variational latent basis model that enhances OOD robustness in multivariate time series forecasting by decomposing stable dynamics from OOD deviations. It achieves state-of-the-art performance, improving MAE by 15.08% and MSE by 7.74% across diverse benchmarks.

arXiv

Multimodal Approaches for Visually-Rich Document Type Classification: A Comparative Analysis

June 2, 2026 · Catyana Heyne, J\"urgen Frikel, Filippo Riccio

This study compares multimodal models on the RVL-CDIP benchmark, finding specialized transformers outperform LLMs for complex documents. Visual data proves more critical than OCR for accurate classification.

arXiv

Predicting the risk of colorectal anastomotic leak based on preoperative mapping of the blood supply of the bowel

June 2, 2026 · Zahra Tabatabaei, Jon Sporring, Mark Bremholm Elleb{\ae}k, Alaa El-Hussuna

This protocol outlines an AI system using preoperative CT scans to predict colorectal anastomotic leak risk. It integrates vascular analysis with historical case retrieval to enhance surgical decision-making.

arXiv

Multilingual Idioms in Sentences and Conversations Across High-, Medium-, and Low-Resource Languages

June 2, 2026 · Saeed Almheiri, Bilal Elbouardi, Salsabila Zahirah Pranida, Irina Nikishina, Ashwath Rao B, Parameswari Krishnamurthy, Muhammad Cendekia Airlangga, Rifo Ahmad Genadi, Nguyen Phan Gia Bao, Amir Hossein Yari, Hawau Olamide Toyin, Nurdaulet Mukhituly, Mena A

The study introduces MIDI, a multilingual idiom dataset across varying resource levels, revealing that models struggle with literal idioms and low-resource languages. While conversational context helps, it cannot fully bridge performance gaps or overcome current model limitations.

arXiv

Order within Chaos: Capturing Intrinsic Energy Anomalies for AI-Manipulated Image Forgery Localization

June 2, 2026 · Yiming Wang, Baiqi Wu, Qingming Li, Jiahao Chen, Tong Zhang, Shouling Ji

FLAME localizes AI image forgeries by detecting intrinsic energy anomalies from diffusion processes, outperforming existing methods. It also introduces EditStream, an automated pipeline for continuous, instruction-based training data synthesis.

arXiv

On the Generalization in Topology Optimization via Sensitivity-Conditioned Bernoulli Flow Matching

June 2, 2026 · Mohammad Rashed, Duarte F. Valoroso Madeira, Babak Gholami, Caglar Guerbuez, Yunjia Yang, Nils Thuerey

This study proves adjoint sensitivity is the optimal conditioning signal for topology optimization generalization. It introduces pseudo-sensitivities and validates their efficacy via Bernoulli flow matching across structural and CFD benchmarks.

arXiv

Consistency Training while Mitigating Obfuscation via Rate Matching

June 2, 2026 · Sohaib Imran, Prakhar Gupta, Jannes Elstner, David Demitri Africa

Rate Matching Consistency Training (RMCT) mitigates obfuscation by stabilizing behavior rates rather than forcing identical outputs. This preserves monitorability while effectively reducing biases like sycophancy in language models.

arXiv

Faster Synchronous On-Policy RL via Straggler-Aware Group Sizing

June 2, 2026 · Azal Ahmad Khan, Ammar Ahmed, Zeshan Fayyaz, Sheng Di, Mingyi Hong, Ali Anwar

SAGC dynamically adjusts RL group sizes to mitigate stragglers, boosting wall-clock efficiency and model performance. It outperforms static baselines in training speed and reasoning benchmarks without explicit length penalties.

arXiv

FW-NKF: Frequency-Weighted Neural Kalman Filters

June 2, 2026 · Adnan Harun Dogan, Berken Utku Demirel, Christian Holz

FW-NKF integrates spectral shaping into neural Kalman filters to suppress band-limited noise. It reduces localization error by 10% and improves orientation accuracy across diverse benchmarks.

arXiv

AgentRedBench: Dynamic Redteaming and Integration-Aware Defense for LLM Agents over SaaS Integrations

June 2, 2026 · Hiskias Dingeto, Will Leeney

AgentRedBench introduces a dynamic redteaming benchmark for LLM agents, revealing high vulnerability to indirect prompt injections. Its companion defense, AgentRedGuard, drastically reduces attack success rates while maintaining low false positives.

arXiv

Towards Resolving Optimization Conflicts Between Image- and Text-Based Person Re-Identification

June 2, 2026 · Karina Kvanchiani, Timur Mamedov

This study proposes a decoupled, two-stage training framework to resolve optimization conflicts between image- and text-based person ReID. Results show pre-training with I2I and integrating textual supervision significantly boost unified representation performance.

arXiv

CityTrajBench: A Unified Benchmark for City-Scale Vehicle Trajectory Generation

June 2, 2026 · Shibo Zhu, Xiaodan Shi, Dayin Chen, Yuntian Chen, Haoran Zhang, Tianhao Wu, Jinyue Yan

CityTrajBench standardizes city-scale trajectory generation via a unified framework and protocol. It evaluates diverse models across five dimensions, revealing distinct trade-offs in realism, fidelity, and efficiency.

arXiv

Quantitative Movement Testing: Measuring Patient Movements from a Single Smartphone Video

June 2, 2026 · Pranav Mahajan, Amanda Wall, Eleonora Maria Camerone, Julie Stebbins, Eoin Kelleher, Shuangyi Tong, Annina Schmid, Katja Wiech, Anushka Irani, Ben Seymour

QMT extracts 3D kinematic biomarkers from smartphone videos, validating against motion capture. It reliably monitors chronic pain patients’ movements in home settings.