cs.LG 2604.18578

Bounded Ratio Reinforcement Learning

Introduced Bounded Ratio Reinforcement Learning (BRRL) framework, outperforming PPO in environments like MuJoCo.

Yunke Ao, Le Chen, Bruce D. Lee et al.

2026-04-21 25
math.ST 2604.18441

Conformal Robust Set Estimation

Proposed a robust conformal prediction method using half-mass radius, suitable for heavy-tailed distributions.

Alejandro Cholaquidis, Emilien Joly, Leonardo Moreno

2026-04-20 21
stat.ML 2604.18420

Spectral bandits for smooth graph functions

Introduced spectral bandit algorithms for smooth graph functions, achieving linear and sublinear scaling in effective dimension.

Michal Valko, Rémi Munos, Branislav Kveton et al.

2026-04-20 118 citations 33