Paper Insights - AI Arxiv Paper Analysis

cs.AI 2604.16278

Learning to Reason with Insight for Informal Theorem Proving

The proposed DeepInsightTheorem framework enhances informal theorem proving by identifying core techniques, significantly outperforming baselines.

Yunhe Li, Hao Shi, Bowen Deng et al.

2026-04-18 91

cs.CL 2604.16270

From Benchmarking to Reasoning: A Dual-Aspect, Large-Scale Evaluation of LLMs on Vietnamese Legal Text

A dual-aspect evaluation framework analyzes LLMs on Vietnamese legal text, revealing readability-accuracy trade-offs.

Van-Truong Le

2026-04-18 86

cs.RO 2604.16263

Semantic Area Graph Reasoning for Multi-Robot Language-Guided Search

The proposed SAGR framework coordinates multi-robot language-guided search using semantic area graphs, improving efficiency by 18.8% in large environments.

Ruiyang Wang, Hao-Lun Hsu, Jiwoo Kim et al.

2026-04-18 154

cs.LG 2604.16259

Beyond Distribution Sharpening: The Importance of Task Rewards

Task-reward optimization enhances Llama-3.2-3B-Instruct's performance on math datasets.

Sarthak Mittal, Leo Gagnon, Guillaume Lajoie

2026-04-18 105

cs.AI 2604.16258

Characterising LLM-Generated Competency Questions: a Cross-Domain Empirical Study using Open and Closed Models

Using the CompCQ framework, this study analyzes LLM-generated competency questions across domains, revealing generation characteristics.

Reham Alharbi, Valentina Tamma, Terry R. Payne et al.

2026-04-18 72

cs.CV 2604.16248

Where Do Vision-Language Models Fail? World Scale Analysis for Image Geolocalization

This study systematically evaluates various vision-language models for country-level image geolocalization, revealing their limitations in capturing fine-grained geographic cues.

Siddhant Bharadwaj, Ashish Vashist, Fahimul Aleem et al.

2026-04-18 115

cs.LG 2604.16247

Joint-Centric Dual Contrastive Alignment with Structure-Preserving and Information-Balanced Regularization

The HILBERT framework achieves significant performance improvements in long-sequence audio-text representation learning through dual contrastive learning and information-balanced regularization.

Habibeh Naderi, Behrouz Haji Soleimani, Stan Matwin

2026-04-18 95

cs.LG 2604.16242

Detecting and Suppressing Reward Hacking with Gradient Fingerprints

Gradient Fingerprints detect and suppress reward hacking, achieving superior performance on math, code, and logical reasoning benchmarks.

Songtao Wang, Quang Hieu Pham, Fangcong Yin et al.

2026-04-18 287

cs.CL 2604.16241

BAGEL: Benchmarking Animal Knowledge Expertise in Language Models

BAGEL benchmark evaluates language models' performance on animal knowledge using closed-book questions on taxonomy, morphology, etc.

Jiacheng Shen, Masato Hagiwara, Milad Alizadeh et al.

2026-04-18 100

cs.CV 2604.16240

CollideNet: Hierarchical Multi-scale Video Representation Learning with Disentanglement for Time-To-Collision Forecasting

CollideNet enhances time-to-collision forecasting precision by disentangling temporal patterns in multi-scale video representation learning.

Nishq Poorav Desai, Ali Etemad, Michael Greenspan

2026-04-18 97

stat.ML 2604.16239

Adaptive multi-fidelity optimization with fast learning rates

The Kometo algorithm achieves fast learning rates in multi-fidelity optimization without known smoothness or fidelity assumptions.

Come Fiegel, Victor Gabillon, Michal Valko

2026-04-18 6 citations 160

cs.CV 2604.16234

A Two-Stage, Object-Centric Deep Learning Framework for Robust Exam Cheating Detection

A two-stage deep learning framework using YOLOv8n and RexNet-150 achieves 95% accuracy in cheating detection.

Van-Truong Le, Le-Khanh Nguyen, Trong-Doanh Nguyen

2026-04-18 110

cs.RO 2604.16201

DENALI: A Dataset Enabling Non-Line-of-Sight Spatial Reasoning with Low-Cost LiDARs

DENALI dataset enables non-line-of-sight spatial reasoning with low-cost LiDARs, covering 72,000 scenes.

Nikhil Behari, Diego Rivero, Luke Apostolides et al.

2026-04-18 1 citation 118

cs.LG 2604.16076

Prototype-Grounded Concept Models for Verifiable Concept Alignment

Prototype-Grounded Concept Models (PGCMs) verify concept alignment via visual prototypes, enhancing interpretability.

Stefano Colamonaco, David Debot, Pietro Barbiero et al.

2026-04-17 122

cs.NE 2604.15997

Combining Convolution and Delay Learning in Recurrent Spiking Neural Networks

Combining convolution and delay learning in RSNNs achieves 52x inference speedup and 99% parameter savings on audio tasks.

Lúcio Folly Sanches Zebendo, Eleonora Cicciarella, Michele Rossi

2026-04-17 102

cs.CV 2604.15946

SENSE: Stereo OpEN Vocabulary SEmantic Segmentation

SENSE leverages stereo vision and vision-language models to enhance open-vocabulary semantic segmentation, achieving a 2.9% precision improvement on PhraseStereo.

Thomas Campagnolo, Ezio Malis, Philippe Martinet et al.

2026-04-17 115

cs.RO 2604.15890

Robust Fleet Sizing for Multi-UAV Inspection Missions under Synchronized Replacement Demand

A proposed fleet sizing rule for multi-UAV missions ensures 99.8% success, needing only four extra drones even under the harshest conditions.

Vishal Ramesh, Antony Thomas

2026-04-17 95

cs.RO 2604.15865

DTEA: A Dual-Topology Elastic Actuator Enabling Real-Time Switching Between Series and Parallel Compliance

DTEA enables real-time switching between SEA and PEA topologies with switching time under 33.33 ms.

Vishal Ramesh, Aman Singh, Shishir Kolathaya

2026-04-17 94

cs.RO 2604.15864

Environment-Adaptive Solid-State LiDAR-Inertial Odometry

The proposed environment-adaptive solid-state LiDAR-inertial odometry achieves a 12.8% average RMSE reduction.

Zhi Zhang, Chalermchon Satirapod, Bingtao Ma et al.

2026-04-17 111

cs.RO 2604.15854

Limits of Lamarckian Evolution Under Pressure of Morphological Novelty

In modular robots, Lamarckian evolution outperforms Darwinian in single-task optimization but declines under morphological diversity pressure.

Jed R Muff, Karine Miras, A. E. Eiben

2026-04-17 87