cs.LG 2604.19740

Generalization at the Edge of Stability

Introduces 'sharpness dimension' to explain improved generalization at the edge of stability.

Mario Tuci, Caner Korkmaz, Umut Şimşekli et al.

2026-04-22 101
cs.LG 2604.19730

FASTER: Value-Guided Sampling for Fast RL

FASTER method reduces computational cost by early action sample filtering during denoising while maintaining RL performance.

Perry Dong, Alexander Swerdlow, Dorsa Sadigh et al.

2026-04-22 103
cs.LG 2604.18578

Bounded Ratio Reinforcement Learning

Introduced Bounded Ratio Reinforcement Learning (BRRL) framework, outperforming PPO in environments like MuJoCo.

Yunke Ao, Le Chen, Bruce D. Lee et al.

2026-04-21 114