cs.LG 2606.19878

On the Oracle Complexity of Interpolation-Based Gradient Descent

Proposes Piecewise Polynomial Interpolation-based Gradient Descent (PPI-GD) achieving oracle complexity of O((p/ε)^{d/(2ℓ)}) for data dimension d=O(log^{0.49}(n)), outperforming classical GD/SGD.

Dongmin Lee, William Lu, Anuran Makur

2026-06-18 13
cs.LG 2606.18933

Zero-Shot Active Feature Acquisition via LLM-Elicitation

Proposes a zero-shot active feature acquisition framework using LLM-derived discriminative statistics and MaxEnt closure, significantly improving IBD diagnosis accuracy.

Binyamin Perets, Natalie Mendelson, Shiran Vainberg et al.

2026-06-17 28
cs.LG 2606.18208

Looped World Models

Proposes LoopWM, a parameter-shared transformer with iterative latent refinement, achieving 100× parameter efficiency and stable long-horizon environment prediction.

Hongyuan Adam Lu, Z. L. Victor Wei, Qun Zhang et al.

2026-06-17 43
cs.LG 2606.12364

On Subquadratic Architectures: From Applications to Principles

This study compares xLSTM, Mamba-2, and Gated DeltaNet architectures, demonstrating xLSTM's superior performance in complex sequence tasks due to its robust state tracking and memory accumulation.

Anamaria-Roberta Hartl, Levente Zólyomi, David Stap et al.

2026-06-11 65
cs.LG 2606.11988

What Uncertainties Do We Need for Dynamical Systems?

This paper offers a machine learning perspective on uncertainty in dynamical systems, distinguishing aleatoric and epistemic uncertainties, and analyzing their propagation across tasks.

Yusuf Sale, Christopher Bülte, Felix Czaja et al.

2026-06-10 60
cs.LG 2606.11149

Efficiently Learning Drifting Halfspaces with Massart Noise

Proposes an efficient online algorithm for drifting halfspaces under Massart noise, achieving an error bound of η + ˜O(Δ^{1/3}/γ), nearly matching theoretical limits.

Mingchen Ma, Guyang Cao, Jelena Diakonikolas et al.

2026-06-10 46
cs.LG 2606.11057

Flexible Kernels for Protein Property Prediction

This paper introduces flexible sequence kernels based on evolutionary substitution matrices, leveraging Gaussian processes for data-efficient protein property prediction, outperforming embedding-based methods.

Martin Jankowiak, Yerdos Ordabayev, Rudraksh Tuwani et al.

2026-06-10 43
cs.LG 2606.09821

Rethinking the Divergence Regularization in LLM RL

DRPO introduces smooth advantage-weighted quadratic regularization to improve stability and efficiency in LLM RL training, replacing hard masks with continuous gradient weights.

Jiarui Yao, Xiangxin Zhou, Penghui Qi et al.

2026-06-09 60
cs.LG 2606.09806

Topological Neural Operators

Introducing Topological Neural Operators (TNO), a framework leveraging cell complexes and discrete exterior calculus to improve PDE modeling on complex geometries, achieving over 20% accuracy gains.

Lennart Bastian, Samuel Leventhal, Mustafa Hajij et al.

2026-06-09 117