Paper Insights - AI Arxiv Paper Analysis

cs.RO 2604.19670

Multi-Cycle Spatio-Temporal Adaptation in Human-Robot Teaming

RAPIDDS framework enhances human-robot teaming efficiency through multi-cycle spatio-temporal adaptation, significantly improving plan fluency and user preference.

Alex Cuellar, Michael Hagenow, Julie Shah

2026-04-22 39

cs.IR 2604.19664

ECLASS-Augmented Semantic Product Search for Electronic Components

ECLASS-augmented dense retrieval method achieves 94.3% HitRate@5 in semantic search for electronic components.

Nico Baumgart, Markus Lange-Hegermann, Jan Henze

2026-04-22 31

cs.CL 2604.19645

The signal is the ceiling: Measurement limits of LLM-predicted experience ratings from open-ended survey text

GPT models predict experience ratings from open-ended survey text; prompt optimization improves accuracy by 2%.

Andrew Hong, Jason Potteiger, Luis E. Zapata

2026-04-22 33

cs.RO 2604.19643

A Gesture-Based Visual Learning Model for Acoustophoretic Interactions using a Swarm of AcoustoBots

Gesture recognition using OpenCLIP visual learning model improves AcoustoBot swarm interaction accuracy to 87.8%.

Alex Lin, Lei Gao, Narsimlu Kemsaram et al.

2026-04-22 36

cs.CL 2604.19642

Micro Language Models Enable Instant Responses

Micro Language Models (μLMs) enable instant responses by generating the first 4-8 words on-device, with cloud models completing the response.

Wen Cheng, Tuochao Chen, Karim Helwani et al.

2026-04-22 34

cs.AI 2604.19638

SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models

SafetyALFRED evaluates safety planning in multimodal LLMs in kitchen settings, finding good hazard recognition but low risk mitigation success.

Josue Torres-Fonseca, Naihao Deng, Yinpei Dai et al.

2026-04-22 35

cs.RO 2604.19618

Autonomous UAV Pipeline Near-proximity Inspection via Disturbance-Aware Predictive Visual Servoing

The ESKF-PRE-VMPC framework reduces RMSE by 52.63% and 75.04% in UAV pipeline inspection without wind.

Wen Li, Hui Wang, Jinya Su et al.

2026-04-22 37

cs.CL 2604.19578

Impact of large language models on peer review opinions from a fine-grained perspective: Evidence from top conference proceedings in AI

Study shows large language models impact AI conference peer reviews, especially in linguistic complexity and evaluative focus.

Wenqing Wu, Chengzhi Zhang, Yi Zhao et al.

2026-04-21 50

cs.IR 2604.19566

Diagnosable ColBERT: Debugging Late-Interaction Retrieval Models Using a Learned Latent Space as Reference

Diagnosable ColBERT enhances ColBERT model diagnostics by aligning token embeddings to a clinically-grounded reference latent space.

François Remy

2026-04-21 33

cs.IR 2604.19550

LoopCTR: Unlocking the Loop Scaling Power for Click-Through Rate Prediction

LoopCTR enhances CTR prediction through loop scaling, significantly reducing computational costs.

Jiakai Tang, Runfeng Zhang, Weiqiu Wang et al.

2026-04-21 30

cs.RO 2604.19536

LiveVLN: Breaking the Stop-and-Go Loop in Vision-Language Navigation

LiveVLN breaks the stop-and-go loop in vision-language navigation, reducing waiting time by up to 77.7%.

Xiangchen Wang, Weiye Zhu, Teng Wang et al.

2026-04-21 36

cs.LG 2604.19451

Heterogeneity-Aware Personalized Federated Learning for Industrial Predictive Analytics

Proposes a heterogeneity-aware personalized federated learning model to enhance failure time prediction accuracy in industrial predictive analytics.

Yuhan Hu, Xiaolei Fang

2026-04-21 33

cs.IR 2604.19414

CAST: Modeling Semantic-Level Transitions for Complementary-Aware Sequential Recommendation

CAST framework models semantic-level transitions, achieving 17.6% Recall and 16.0% NDCG gains with 65x training acceleration.

Qian Zhang, Lech Szymanski, Haibo Zhang et al.

2026-04-21 32

cs.NE 2604.19343

Scalable Memristive-Friendly Reservoir Computing for Time Series Classification

MARS model achieves 21x training speedup and significant performance improvement through parallelization and subtractive skip connections.

Coşku Can Horuz, Andrea Ceni, Claudio Gallicchio et al.

2026-04-21 34

cs.AI 2604.19301

Large Language Models Exhibit Normative Conformity

Large language models exhibit normative conformity, revealing underlying mechanisms.

Mikako Bito, Keita Nishimoto, Kimitaka Asatani et al.

2026-04-21 36

cs.IR 2604.19269

CS3: Efficient Online Capability Synergy for Two-Tower Recommendation

CS3 framework enhances two-tower recommendation systems with Cycle-Adaptive Structure, Cross-Tower Synchronization, and Cascade-Model Sharing, achieving an 8.36% revenue increase.

Lixiang Wang, Shaoyun Shi, Peng Wang et al.

2026-04-21 50

stat.ML 2604.19091

Fast estimation of Gaussian mixture components via centering and singular value thresholding

Fast estimation of Gaussian mixture components via centering and singular value thresholding without iteration.

Huan Qing

2026-04-21 28

cs.LG 2604.19072

S2MAM: Semi-supervised Meta Additive Model for Robust Estimation and Variable Selection

S2MAM uses bilevel optimization for robust estimation and variable selection, validated on 16 datasets.

Xuelin Zhang, Hong Chen, Yingjie Wang et al.

2026-04-21 29

cs.AI 2604.18584

MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval

MathNet provides a global multimodal benchmark for mathematical reasoning and retrieval, covering 30,676 Olympiad-level problems from 47 countries.

Shaden Alshammari, Kevin Wen, Abrar Zainal et al.

2026-04-21 1 citations 34

cs.CV 2604.18583

MUA: Mobile Ultra-detailed Animatable Avatars

MUA method achieves up to 2000X lower computational cost using Wavelet-guided Multi-level Spatial Factorized Blendshapes.

Heming Zhu, Guoxing Sun, Marc Habermann

2026-04-21 37