Sessa: Selective State Space Attention
Sessa enhances long-range memory by embedding selective attention in feedback paths.
Liubomyr Horbatko
Sessa enhances long-range memory by embedding selective attention in feedback paths.
Liubomyr Horbatko
Introduced Bounded Ratio Reinforcement Learning (BRRL) framework, outperforming PPO in environments like MuJoCo.
Yunke Ao, Le Chen, Bruce D. Lee et al.
BLF system achieves state-of-the-art binary forecasting performance on ForecastBench using sequential Bayesian updating of linguistic beliefs.
Kevin Murphy
Apollo model integrates 28 medical modalities and 12 specialties to predict disease risk up to 5 years in advance.
Andrew Zhang, Tong Ding, Sophia J. Wagner et al.
Revisiting active sequential prediction-powered mean estimation reveals smallest confidence width when constant probability weight is near one.
Maria-Eleni Sfyraki, Jun-Kun Wang
The study reveals dual alignment between language model layers and human sentence processing, with early layers suited for natural reading and later layers better modeling complex syntactic processing.
Tatsuki Kuribayashi, Alex Warstadt, Yohei Oseki et al.
ConforNets controls AF3 latent representations via channel-wise affine transforms, enhancing multi-state prediction success.
Minji Lee, Colin Kalicki, Minkyu Jeon et al.
SynAgent leverages Solo-to-Cooperative Agent Synergy for generalizable humanoid manipulation, significantly enhancing generalization across diverse object geometries.
Wei Yao, Haohan Ma, Hongwen Zhang et al.
GSQ achieves high-accuracy low-bit quantization using Gumbel-Softmax sampling, narrowing the accuracy gap with QTIP methods.
Alireza Dadgarnia, Soroush Tabesh, Mahdi Nikdan et al.
ClawEnvKit automates environment generation for claw-like agents, reducing costs by 13,800x.
Xirui Li, Ming Li, Derry Xu et al.
Transition-matrix regularization improves next dialogue act prediction in counseling conversations, boosting macro-F1 by 9-42%.
Eric Rudolph, Philipp Steigerwald, Jens Albrecht
MetaCloak-JPEG enhances JPEG robustness of adversarial perturbations for DreamBooth deepfake prevention, achieving 32.7 dB PSNR.
Tanjim Rahaman Fardin, S M Zunaid Alam, Mahadi Hasan Fahim et al.
Study of LLM jailbreaks via RLVR, SFT, and refusal-feature abliteration reveals RLVR models closely resemble base models.
Md Rysul Kabir, Zoran Tiganj
Document-as-image representations underperform in scientific retrieval; interleaved text+image representations are more effective.
Ghazal Khalighinejad, Raghuveer Thirukovalluru, Alexander H. Oh et al.
OneVL achieves one-step latent reasoning and planning with vision-language explanations, surpassing explicit CoT at answer-only latency.
Jinghui Lu, Jiayi Guan, Zhijian Huang et al.
XEmbodied model enhances VLA models with 3D geometric and physical cues, improving performance across benchmarks.
Kangan Qian, ChuChu Xie, Yang Zhong et al.
ACoFi method combines learned safety filters with adaptive conformal inference, enhancing control system safety.
Sacha Huriot, Ihab Tabbara, Hussein Sibai
Proposed a robust conformal prediction method using half-mass radius, suitable for heavy-tailed distributions.
Alejandro Cholaquidis, Emilien Joly, Leonardo Moreno
Introduced spectral bandit algorithms for smooth graph functions, achieving linear and sublinear scaling in effective dimension.
Michal Valko, Rémi Munos, Branislav Kveton et al.
Adaptive kernel selection enhances stability and accuracy of kernelized diffusion maps.
Othmane Aboussaad, Adam Miraoui, Boumediene Hamzi et al.