NEAT-NC: NEAT guided Navigation Cells for Robot Path Planning
NEAT-NC enhances NEAT with navigation cells for dynamic environment path planning.
Hibatallah Meliani, Khadija Slimani, Samira Khoulji
HippoCamp benchmarks multimodal file management agents, revealing limitations in user environments, with a top accuracy of only 48.3%.
Zhe Yang, Shulin Tian, Kairui Hu et al.
Proposes a Markovian framework for auditing agentic AI reliability and oversight cost, improving state-action blind mass by 12.53%.
Biplab Pal, Santanu Bhattacharya
Latent-WAM achieves efficient end-to-end autonomous driving with spatially-aware and dynamics-informed latent world representations, scoring 89.3 on NAVSIM v2.
Linbo Wang, Yupeng Zheng, Qiang Chen et al.
Study finds that retrieval improvements in RAG systems do not guarantee better QA performance in AI policy analysis.
Saahil Mathur, Ryan David Rittner, Vedant Ajit Thakur et al.
MARCH framework significantly reduces LLM hallucination using multi-agent reinforced self-check, enhancing factual consistency in an 8B parameter model.
Zhuo Li, Yupeng Zhang, Pengyu Cheng et al.
EndoVGGT enhances surgical 3D reconstruction with DeGAT, improving PSNR by 24.6% and SSIM by 9.1%.
Falong Fan, Yi Xie, Arnis Lektauers et al.
Chameleon enhances robotic manipulation with geometry-grounded multimodal memory, improving decision reliability in long-horizon tasks.
Xinying Guo, Chenxi Jiang, Hyun Bin Kim et al.
VFIG uses vision-language models for complex figure-to-SVG conversion, achieving a VLM-Judge score of 0.829.
Qijia He, Xunmei Liu, Hammaad Memon et al.
Self-distillation can degrade LLMs' reasoning in math by suppressing uncertainty expression.
Jeonghye Kim, Xufang Luo, Minbeom Kim et al.
MedObvious exposes the Medical Moravec's Paradox in VLMs via a 1,880-task benchmark for clinical triage.
Ufaq Khan, Umair Nawaz, L D M S S Teja et al.
UniGRPO optimizes text and image generation policies using GRPO, enhancing reasoning-driven visual generation quality.
Jie Liu, Zilyu Ye, Linxiao Yuan et al.
DA-Flow combines diffusion and convolutional features to enhance optical flow estimation in degraded videos.
Jaewon Min, Jaeeun Lee, Yeji Choi et al.
WildWorld dataset offers over 450 actions and explicit state annotations for generative ARPG dynamic world modeling.
Zhen Li, Zian Meng, Shuwei Shi et al.
VISOR method enhances LVLM efficiency by sparsely selecting vision-language interactions, reducing inference cost.
Adrian Bulat, Alberto Baldrati, Ioannis Maniadis Metaxas et al.
AgentRVOS combines SAM3 and MLLM for zero-shot video object segmentation, achieving leading performance.
Woojeong Jin, Jaeho Lee, Heeseong Shin et al.
CSTS enhances cross-environment AI detection stability through entity-relational abstraction, addressing schema perturbation collapse.
Abdul Rahman
c-CRAB dataset evaluates code review agents' abilities; current agents solve only 40% of tasks.
Yuntong Zhang, Zhiyuan Pan, Imam Nur Bani Yusuf et al.
3DCity-LLM enhances 3D city-scale perception with a coarse-to-fine feature encoding strategy, leveraging a 1.2M-sample dataset.
Yiping Chen, Jinpeng Li, Wenyu Ke et al.
Study shows LLMs struggle with test generation under software evolution, with pass rates dropping to 66% under semantic changes.
Sabaat Haroon, Mohammad Taha Khan, Muhammad Ali Gulzar