Sparse Autoencoders Reveal Interpretable and Steerable Features in VLA Models
Sparse autoencoders applied to vision-language-action (VLA) models reveal interpretable and steerable features, with improved generalization on the LIBERO benchmark.
Aiden Swann, Lachlain McGranahan, Hugo Buurmeijer et al.