Paper Insights - AI Arxiv Paper Analysis

math.ST 2606.06332

Bentkus-type asymptotic e-values

Introducing Bentkus-type asymptotic e-values that eliminate the missing factor, improving inference sharpness in multiple testing and post-hoc analysis.

Diego Martinez-Taboada, Ben Chugg, Aaditya Ramdas

2026-06-05 59

cs.LG 2606.06329

Efficient Mean Curvature Computation on High-Dimensional Data Manifolds

Proposes an algebraic identity and low-rank SVD approximation to compute mean curvature efficiently on high-dimensional data manifolds, reducing complexity from O(m^4) to near O(k^2 m).

Alexandre L. M. Levada

2026-06-05 65

cs.RO 2606.06323

VOLT: Vision and Language Trajectory Segmentation for Faster-than-Demonstration Policies

VOLT leverages vision-language models for trajectory segmentation, enabling robots to execute tasks up to 2.57× faster while maintaining success rates.

Robert Ramirez Sanchez, Daniel J. Evans, Dylan P. Losey et al.

2026-06-04 68

cs.CL 2606.06242

Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents

Introduced a benchmark for data snapshot detection, evaluated open-source models, revealing significant gaps in real-world institutional document understanding.

AJ Carl P. Dy, Aivin V. Solatorio

2026-06-04 73

cs.IR 2606.06225

Bridging the Semantic-Collaborative Gap: An Asymmetric Graph Architecture for Cold-Start Item Recommendation

Proposes Shallow-RHS, an asymmetric graph architecture for cold-start content recommendation, mapping intrinsic features into a collaborative filtering space for immediate deployment.

Anh Truong, John Trenkle, Yuanbo Chen et al.

2026-06-04 84

cs.NE 2606.06198

Hub-Aware Hybrid Search: Accelerating the Locally Aligned Ant Technique

Proposes Hub-Aware hybrid search combining pre-processing and likelihood-pheromone guidance to enhance cosmic web filament detection efficiency.

Simone Vilardi, Reynier Peletier, Felipe Contreras et al.

2026-06-04 57

cs.RO 2606.06041

Sample-efficient Low-level Motion Planning for Robotic Manipulation Tasks via Zero-shot Transfer Learning

Proposes iCEM+TL framework integrating transfer learning to boost low-level robotic motion planning success rate by 23%, enabling zero-shot transfer for complex tasks.

Yuanzhi He, Victor Romero-Cano, José J. Patiño et al.

2026-06-04 65

cs.LG 2606.05693

MolE-RAG: Molecular Structure-Enhanced Retrieval-Augmented Generation for Chemistry

MolE-RAG integrates literature, molecular features, and structural similarity to enhance LLM-based molecular property prediction, boosting ROC-AUC by up to 28% and reducing RMSE by 67%.

Joey Chan, Wonbin Kweon, Ashley Shin et al.

2026-06-04 73

cs.LG 2606.05152

Reinforcement Learning from Rich Feedback with Distributional DAgger

Proposes DistIL, a distributional imitation learning algorithm with monotonic improvement guarantees, leveraging rich feedback for complex reasoning tasks.

Rishabh Agrawal, Jacob Fein-Ashley, Paria Rashidinejad

2026-06-04 75

cs.RO 2606.03985

Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking

Humanoid-GPT employs a 2B-frame large-scale motion dataset and GPT-style causal Transformer to achieve zero-shot high-dynamic motion tracking, surpassing shallow MLP trackers.

Zekun Qi, Xuchuan Chen, Dairu Liu et al.

2026-06-03 51

cs.LG 2606.03980

Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill

Skill-RM unifies heterogeneous evaluation criteria via agent skills, enabling dynamic resource orchestration, outperforming traditional judges with a 3-6% improvement on RewardBench2.

Tao Chen, Gangwei Jiang, Pengyu Cheng et al.

2026-06-03 81

cs.LG 2606.03979

Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories

Introducing the 'Sleep' paradigm with Knowledge Seeding and Dreaming mechanisms enables LLMs to self-modify and consolidate memories for continual learning.

Ali Behrouz, Farnoosh Hashemi, Vahab Mirrokni

2026-06-03 2 citations 49

cs.LG 2606.03962

Using Reward Uncertainty to Induce Diverse Behaviour in Reinforcement Learning

Proposes ROSA, a reward distribution-based framework for inducing diverse behaviors without performance loss, leveraging set functions and unbiased gradient estimators.

Anthony GX-Chen, Ankit Anand, Gheorghe Comanici et al.

2026-06-03 53

cs.RO 2606.03949

Preference-Calibrated Human-in-the-Loop Reinforcement Learning for Robotic Manipulation

Proposed PACT framework uses preference signals and task progress modeling to correct overestimated Q-values, boosting success rate by 24.5% and accelerating convergence 1.3× in real robot tasks.

Zeyi Liu, Guangyao Liu, Yinuo Qu et al.

2026-06-03 56

eess.IV 2606.03940

SEAOTTER: Sensor Embedded Autoencoding with One-Time Transcode for Efficient Reconstruction

SEAOTTER combines a low-complexity learned latent encoder with a learnable JPEG codec and one-time cloud transcode, achieving 200:1 compression with 7× faster encoding, 3.5× decoding, and 8% accuracy boost on ImageNet.

Dan Jacobellis, Neeraja J. Yadwadkar

2026-06-03 46

cs.LG 2606.03584

Training a Predictive Coding Network on ImageNet using Equilibrium Propagation

This paper introduces an equilibrium propagation (EP)-based training method for deep predictive coding networks (PCNs), achieving 13.23% Top-5 error on ImageNet with a 10-layer VGG model, close to the 12.2% baseline of backpropagation.

Tugdual Kerjan, Rasmus Høier, Benjamin Scellier

2026-06-02 42

cs.CV 2606.02580

Thinking in Blender: Staged Executable Inverse Graphics with Vision-Language Models

Proposes SEIG, a staged framework leveraging pretrained vision-language models (VLMs) to reconstruct editable 3D scenes from a single image, achieving high fidelity in geometry, materials, and lighting.

Guangzhao He, Rundong Luo, Wei-Chiu Ma et al.

2026-06-02 116

cs.CV 2606.02569

AdaCodec: A Predictive Visual Code for Video MLLMs

AdaCodec employs predictive visual coding, transmitting full reference frames only when prediction is costly, reducing visual tokens by 84.7% and boosting long-video understanding efficiency.

Haowen Hou, Zhen Huang, Zheming Liang et al.

2026-06-02 144

cs.CV 2606.02564

VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization

This paper introduces VLM as a teacher for video reasoning via test-time online optimization, achieving a 16.7-point performance boost, surpassing traditional methods.

Junhao Cheng, Liang Hou, Tianxiong Zhong et al.

2026-06-02 85

cs.CL 2606.02559

From Layers to Submodules: Rethinking Granularity in Replacement-Based LLM Compression

SubFit introduces non-contiguous submodule replacement in LLMs, achieving superior compression with 84.6% accuracy at 25% sparsity, using residual fitting without retraining.

Elia Cunegatti, Marcus Vukojevic, Erik Nielsen et al.

2026-06-02 78