cs.LG 2603.16867

Efficient Reasoning on the Edge

Efficient reasoning in small LLMs using LoRA adapters and RL, significantly reducing response length.

Yelysei Bondarenko, Thomas Hehn, Rob Hesselink et al.

2026-03-18 57
cs.LG 2603.12231

Temporal Straightening for Latent Planning

Temporal Straightening improves latent planning success rates by 20-60% using curvature regularization.

Ying Wang, Oumayma Bounou, Gaoyue Zhou et al.

2026-03-13 50