physics.optics 2603.17049

Attractor-Keyed Memory

Attractor-Keyed Memory merges selection and memory access, reducing latency and energy in sparse routing architectures.

Natalia G. Berloff

2026-03-18 42
cs.CV 2603.16870

Demystifying Video Reasoning

Video models exhibit reasoning via a Chain-of-Steps mechanism that operates during the diffusion denoising steps.

Ruisi Wang, Zhongang Cai, Fanyi Pu et al.

2026-03-18 48
cs.LG 2603.16867

Efficient Reasoning on the Edge

Enables efficient reasoning in small LLMs using LoRA adapters and RL, significantly reducing response length.

Yelysei Bondarenko, Thomas Hehn, Rob Hesselink et al.

2026-03-18 57
cs.CL 2603.15619

Mixture-of-Depths Attention

Mixture-of-Depths Attention (MoDA) improves downstream task performance by 2.11% on a 1.5B-parameter model with only a 3.7% increase in FLOPs.

Lianghui Zhu, Yuxin Fang, Bencheng Liao et al.

2026-03-17 66