The Latent Color Subspace: Emergent Order in High-Dimensional Chaos
Achieve color control in FLUX's VAE latent space, revealing a structure reflecting Hue, Saturation, and Lightness.
Mateusz Pach, Jessica Bader, Quentin Bouniot et al.
Achieve color control in FLUX's VAE latent space, revealing a structure reflecting Hue, Saturation, and Lightness.
Mateusz Pach, Jessica Bader, Quentin Bouniot et al.
HumDex system uses IMU tracking and learning methods for portable humanoid dexterous manipulation, enhancing data collection efficiency and generalization.
Liang Heng, Yihe Tang, Jiajun Xu et al.
DreamVideo-Omni achieves multi-subject video customization with latent identity reinforcement learning, enhancing identity fidelity and motion control precision.
Yujie Wei, Xinyu Liu, Shiwei Zhang et al.
AutoGaze autoregressively selects multi-scale video patches, reducing redundancy and enhancing efficiency, enabling 1K-frame 4K video processing.
Baifeng Shi, Stephanie Fu, Long Lian et al.
EndoCoT activates MLLMs' reasoning potential, achieving 92.1% accuracy, 8.3% higher than the baseline.
Xuanlang Dai, Yujie Zhou, Long Xing et al.
The study enhances performance in non-verifiable LLM post-training using reasoning LLM judges, with gpt-oss-120b as the gold standard.
Yixin Liu, Yue Yu, DiJia Su et al.
Separable Neural Architectures (SNA) unify predictive and generative intelligence by constraining interaction order and tensor rank.
Reza T. Batley, Apurba Sarker, Rajib Mostakim et al.
BiGain enhances diffusion models by frequency separation, improving classification accuracy by 7.15% and FID by 0.34.
Jiacheng Liu, Shengkun Tang, Jiacheng Cui et al.
STAMP framework uses the Polar mechanism to achieve superior privacy-utility trade-offs in text privacy.
Fengwei Tian, Payel Bhattacharjee, Heidi Hanson et al.
Incremental neural network verification via learned conflicts achieves up to 1.9x speedup in Marabou verifier.
Raya Elsaleh, Liam Davis, Haoze Wu et al.
Temporal Straightening improves latent planning success rates by 20-60% using curvature regularization.
Ying Wang, Oumayma Bounou, Gaoyue Zhou et al.
RandOpt enhances large-scale models via random perturbations and ensemble voting around pretrained weights.
Yulu Gan, Phillip Isola
Idea-Catalyst framework boosts scientific creativity via interdisciplinary insights, improving novelty by 21% and insightfulness by 16%.
Priyanka Kargupta, Shuhaib Mehri, Dilek Hakkani-Tur et al.
Porfolio-CEGAR-SEQ algorithm optimizes object packing and scheduling in 3D printing, reducing the number of printing plates used.
Pavel Surynek
RDNet enhances salient object detection in optical remote sensing images using dynamic adaptive modules.
Bin Wan, Runmin Cong, Xiaofei Zhou et al.
CLASP model detects malicious tokens using XGBoost classifier, achieving 95.9% token-level F1 score.
Alexandre Le Mercier, Thomas Demeester, Chris Develder
IndexCache accelerates sparse attention by reusing cross-layer indices, reducing 75% of computations, achieving 1.82x speedup.
Yushi Bai, Qian Dong, Ting Jiang et al.
Introduced a Polish long-context encoder model handling up to 8192 tokens, significantly improving long-document task performance.
Sławomir Dadas, Rafał Poświata, Marek Kozłowski et al.
ComFree-Sim is a GPU-parallelized contact physics engine achieving near-linear scaling in contact-rich scenarios, with 2-3x throughput improvement.
Chetan Borse, Zhixian Xie, Wei-Cheng Huang et al.
Quantifies forgetting in generative models post-training using forward and reverse KL objectives, avoiding quality degradation.
Krishnakumar Balasubramanian, Shiva Prasad Kasiviswanathan