Paper Insights - AI Arxiv Paper Analysis

cs.LG 2606.20537

Execution-State Capsules: Graph-Bound Execution-State Checkpoint and Restore for Low-Latency, Small-Batch, On-Device Physical-AI Serving

Proposes graph-bound execution-state capsules for low-latency, small-batch on-device AI, enabling byte-exact snapshot and restore with sub-millisecond GPU performance.

Liang Su

2026-06-19 32

cs.LG 2606.19878

On the Oracle Complexity of Interpolation-Based Gradient Descent

Proposes Piecewise Polynomial Interpolation-based Gradient Descent (PPI-GD) achieving oracle complexity of O((p/ε)^{d/(2ℓ)}) for data dimension d=O(log^{0.49}(n)), outperforming classical GD/SGD.

Dongmin Lee, William Lu, Anuran Makur

2026-06-18 13

cs.LG 2606.19236

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

Proposes STARE, a surprisal-guided advantage reweighting method, stabilizing policy entropy and improving accuracy by 4%-8% on models from 1.5B to 32B.

Haipeng Luo, Qingfeng Sun, Songli Wu et al.

2026-06-18 38

cs.LG 2606.18933

Zero-Shot Active Feature Acquisition via LLM-Elicitation

Proposes a zero-shot active feature acquisition framework using LLM-derived discriminative statistics and MaxEnt closure, significantly improving IBD diagnosis accuracy.

Binyamin Perets, Natalie Mendelson, Shiran Vainberg et al.

2026-06-17 28

cs.LG 2606.18208

Looped World Models

Proposes LoopWM, a parameter-shared transformer with iterative latent refinement, achieving 100× parameter efficiency and stable long-horizon environment prediction.

Hongyuan Adam Lu, Z. L. Victor Wei, Qun Zhang et al.

2026-06-17 43

cs.LG 2606.18186

Kolmogorov Regression for Robust Diffusion Policies

Introduces Kolmogorov PDE-based diffusion policies with dimension-independent convergence, improving long-horizon control in robotics and manufacturing.

Lekan Molu

2026-06-17 29

cs.LG 2606.13657

Dense Supervision, Sparse Updates: On the Sparsity and Geometry of On-Policy Distillation

This paper analyzes the sparsity and geometric structure of on-policy distillation (OPD), revealing small, coordinate-sparse updates that are spectrally concentrated and deviate from source principal directions.

Guo Yu, Wenlin Liu, Yulan Hu et al.

2026-06-12 96

cs.LG 2606.12370

Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

Proposes Bebop with TV loss and rejection sampling to stabilize MTP acceptance rate, achieving up to 95% and 1.8× RL training acceleration.

Yucheng Li, Huiqiang Jiang, Yang Xu et al.

2026-06-11 181

cs.LG 2606.12364

On Subquadratic Architectures: From Applications to Principles

This study compares xLSTM, Mamba-2, and Gated DeltaNet architectures, demonstrating xLSTM's superior performance in complex sequence tasks due to its robust state tracking and memory accumulation.

Anamaria-Roberta Hartl, Levente Zólyomi, David Stap et al.

2026-06-11 65

cs.LG 2606.12362

Latent World Recovery for Multimodal Learning with Missing Modalities

Proposes Latent World Recovery (LWR), a robust multimodal learning framework that aligns modality-specific embeddings in a shared latent space, handling missing modalities without imputation.

Hui Wang, Tianyu Ren, Joseph Butler et al.

2026-06-11 56

cs.LG 2606.11988

What Uncertainties Do We Need for Dynamical Systems?

This paper offers a machine learning perspective on uncertainty in dynamical systems, distinguishing aleatoric and epistemic uncertainties, and analyzing their propagation across tasks.

Yusuf Sale, Christopher Bülte, Felix Czaja et al.

2026-06-10 60

cs.LG 2606.11182

EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

EEVEE framework employs a router-conditioned prompt set with co-evolution to enhance LLM robustness across heterogeneous task streams, improving scores by 10.38-24.32 points.

Weixian Xu, Shilong Liu, Mengdi Wang

2026-06-10 57

cs.LG 2606.11171

Algorithmic and Minimax Complexities in Kernel Bandits

Unified analysis of GP-UCB and DEC in RKHS bandits via MAIR framework, revealing their fundamental differences and advantages.

Yunbei Xu

2026-06-10 51

cs.LG 2606.11162

COGENT: Continuous Graph Emulators with Neural Ordinary Differential Equations for Long-Term Physical Forecasting

COGENT integrates Graph Neural Networks with Neural ODEs for continuous long-term physical forecasting on irregular meshes, outperforming traditional autoregressive models.

Zesheng Liu, Maryam Rahnemoonfar

2026-06-10 48

cs.LG 2606.11149

Efficiently Learning Drifting Halfspaces with Massart Noise

Proposes an efficient online algorithm for drifting halfspaces under Massart noise, achieving an error bound of η + ˜O(Δ^{1/3}/γ), nearly matching theoretical limits.

Mingchen Ma, Guyang Cao, Jelena Diakonikolas et al.

2026-06-10 46

cs.LG 2606.11057

Flexible Kernels for Protein Property Prediction

This paper introduces flexible sequence kernels based on evolutionary substitution matrices, leveraging Gaussian processes for data-efficient protein property prediction, outperforming embedding-based methods.

Martin Jankowiak, Yerdos Ordabayev, Rudraksh Tuwani et al.

2026-06-10 43

cs.LG 2606.09821

Rethinking the Divergence Regularization in LLM RL

DRPO introduces smooth advantage-weighted quadratic regularization to improve stability and efficiency in LLM RL training, replacing hard masks with continuous gradient weights.

Jiarui Yao, Xiangxin Zhou, Penghui Qi et al.

2026-06-09 60

cs.LG 2606.09806

Topological Neural Operators

Introducing Topological Neural Operators (TNO), a framework leveraging cell complexes and discrete exterior calculus to improve PDE modeling on complex geometries, achieving over 20% accuracy gains.

Lennart Bastian, Samuel Leventhal, Mustafa Hajij et al.

2026-06-09 117

cs.LG 2606.09787

Zero Touch Predictive Orchestration: Automating Time-Series Models for the Cloud-Edge Continuum

Proposes an fully automated time-series forecasting framework combining high-frequency dataset TimeTrack with dynamic local telemetry, using NAS to generate accurate models, effectively addressing cold-start issues.

Abd Elghani Meliani, Arora Sagar, Adlen Ksentini et al.

2026-06-09 63

cs.LG 2606.07488

CoMetaPNS: Continually Meta-learning Personalized Neural Surrogates for Cardiac Electrophysiology Simulations

Proposes CoMetaPNS, integrating continual Bayesian GMM with set-conditioned generative models for personalized cardiac electrophysiology simulation, achieving superior accuracy and anti-forgetting.

Ryan Missel, Xiajun Jiang, Linwei Wang

2026-06-06 58