Paper Insights - AI Arxiv Paper Analysis

cs.LG 2604.19698

On two ways to use determinantal point processes for Monte Carlo integration

Utilizing determinantal point processes for Monte Carlo integration to enhance estimator variance convergence speed.

Guillaume Gautier, Rémi Bardenet, Michal Valko

2026-04-22 27 citations 98

cs.AI 2604.19689

A-MAR: Agent-based Multimodal Art Retrieval for Fine-Grained Artwork Understanding

A-MAR framework enhances multimodal art retrieval explanation quality through structured reasoning plans.

Shuai Wang, Hongyi Zhu, Jia-Hong Huang et al.

2026-04-22 136

cs.CL 2604.19685

An Answer is just the Start: Related Insight Generation for Open-Ended Document-Grounded QA

InsightGen generates diverse and relevant insights to enhance open-ended document QA.

Saransh Sharma, Pritika Ramu, Aparna Garimella et al.

2026-04-22 92

cs.RO 2604.19683

Mask World Model: Predicting What Matters for Robust Robot Policy Learning

Mask World Model predicts semantic masks instead of pixels, enhancing robust robot policy learning, excelling in LIBERO and RLBench.

Yunfan Lou, Xiaowei Chi, Xiaojie Zhang et al.

2026-04-22 284

cs.RO 2604.19677

Learning Hybrid-Control Policies for High-Precision In-Contact Manipulation Under Uncertainty

MATCH method improves peg-in-hole task success rate by 35% under high noise, reducing average force by 30%.

Hunter L. Brown, Geoffrey Hollinger, Stefan Lee

2026-04-22 133

cs.RO 2604.19670

Multi-Cycle Spatio-Temporal Adaptation in Human-Robot Teaming

RAPIDDS framework enhances human-robot teaming efficiency through multi-cycle spatio-temporal adaptation, significantly improving plan fluency and user preference.

Alex Cuellar, Michael Hagenow, Julie Shah

2026-04-22 95

cs.IR 2604.19664

ECLASS-Augmented Semantic Product Search for Electronic Components

ECLASS-augmented dense retrieval method achieves 94.3% HitRate@5 in semantic search for electronic components.

Nico Baumgart, Markus Lange-Hegermann, Jan Henze

2026-04-22 111

cs.CL 2604.19645

The signal is the ceiling: Measurement limits of LLM-predicted experience ratings from open-ended survey text

GPT models predict experience ratings from open-ended survey text; prompt optimization improves accuracy by 2%.

Andrew Hong, Jason Potteiger, Luis E. Zapata

2026-04-22 103

cs.RO 2604.19643

A Gesture-Based Visual Learning Model for Acoustophoretic Interactions using a Swarm of AcoustoBots

Gesture recognition using OpenCLIP visual learning model improves AcoustoBot swarm interaction accuracy to 87.8%.

Alex Lin, Lei Gao, Narsimlu Kemsaram et al.

2026-04-22 114

cs.CL 2604.19642

Micro Language Models Enable Instant Responses

Micro Language Models (μLMs) enable instant responses by generating the first 4-8 words on-device, with cloud models completing the response.

Wen Cheng, Tuochao Chen, Karim Helwani et al.

2026-04-22 87

cs.AI 2604.19638

SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models

SafetyALFRED evaluates safety planning in multimodal LLMs in kitchen settings, finding good hazard recognition but low risk mitigation success.

Josue Torres-Fonseca, Naihao Deng, Yinpei Dai et al.

2026-04-22 130

cs.RO 2604.19618

Autonomous UAV Pipeline Near-proximity Inspection via Disturbance-Aware Predictive Visual Servoing

The ESKF-PRE-VMPC framework reduces RMSE by 52.63% and 75.04% in UAV pipeline inspection without wind.

Wen Li, Hui Wang, Jinya Su et al.

2026-04-22 116

cs.CL 2604.19578

Impact of large language models on peer review opinions from a fine-grained perspective: Evidence from top conference proceedings in AI

Study shows large language models impact AI conference peer reviews, especially in linguistic complexity and evaluative focus.

Wenqing Wu, Chengzhi Zhang, Yi Zhao et al.

2026-04-21 157

cs.IR 2604.19566

Diagnosable ColBERT: Debugging Late-Interaction Retrieval Models Using a Learned Latent Space as Reference

Diagnosable ColBERT enhances ColBERT model diagnostics by aligning token embeddings to a clinically-grounded reference latent space.

François Remy

2026-04-21 123

cs.IR 2604.19550

LoopCTR: Unlocking the Loop Scaling Power for Click-Through Rate Prediction

LoopCTR enhances CTR prediction through loop scaling, significantly reducing computational costs.

Jiakai Tang, Runfeng Zhang, Weiqiu Wang et al.

2026-04-21 171

cs.RO 2604.19536

LiveVLN: Breaking the Stop-and-Go Loop in Vision-Language Navigation

LiveVLN breaks the stop-and-go loop in vision-language navigation, reducing waiting time by up to 77.7%.

Xiangchen Wang, Weiye Zhu, Teng Wang et al.

2026-04-21 209

cs.LG 2604.19451

Heterogeneity-Aware Personalized Federated Learning for Industrial Predictive Analytics

Proposes a heterogeneity-aware personalized federated learning model to enhance failure time prediction accuracy in industrial predictive analytics.

Yuhan Hu, Xiaolei Fang

2026-04-21 113

cs.IR 2604.19414

CAST: Modeling Semantic-Level Transitions for Complementary-Aware Sequential Recommendation

CAST framework models semantic-level transitions, achieving 17.6% Recall and 16.0% NDCG gains with 65x training acceleration.

Qian Zhang, Lech Szymanski, Haibo Zhang et al.

2026-04-21 98

cs.NE 2604.19343

Scalable Memristive-Friendly Reservoir Computing for Time Series Classification

MARS model achieves 21x training speedup and significant performance improvement through parallelization and subtractive skip connections.

Coşku Can Horuz, Andrea Ceni, Claudio Gallicchio et al.

2026-04-21 104

cs.AI 2604.19301

Large Language Models Exhibit Normative Conformity

Large language models exhibit normative conformity, revealing underlying mechanisms.

Mikako Bito, Keita Nishimoto, Kimitaka Asatani et al.

2026-04-21 93