cs.CL 2603.15619

Mixture-of-Depths Attention

Mixture-of-Depths Attention (MoDA) improves downstream task performance by 2.11% on a 1.5B-parameter model with only a 3.7% increase in FLOPs.

Lianghui Zhu, Yuxin Fang, Bencheng Liao et al.

2026-03-17 123
cs.AI 2603.15586

Computational Concept of the Psyche

Proposes a cognitive architecture viewing the psyche as an operating system for constructing AGI.

Anton Kolonin, Vladimir Krykov

2026-03-17 109