cs.CV 2605.30347

NeuROK: Generative 4D Neural Object Kinematics

NeuROK employs a transformer-based encoder-decoder to learn a low-dimensional latent space for 4D object dynamics, trained on large-scale geometric trajectories, bypassing predefined physical models.

Chen Geng, Guangzhao He, Yue Gao et al.

2026-05-29 59
cs.CV 2605.28806

Personal Visual Memory from Explicit and Implicit Evidence

VisualMem introduces a structured visual memory module integrated with text memory, achieving 95% accuracy in personal entity recall, surpassing caption-based methods by over 40%.

Viet Nguyen, Thao Nguyen, Vishal M. Patel et al.

2026-05-28 127