Paper Insights - AI Arxiv Paper Analysis

cs.IR 2606.20280

ELVA: Exploring Ranking-Driven Universal Multimodal Retrieval

ELVA employs a ranking-driven reinforcement learning framework with rule-based rewards, achieving 13.1% improvement on MRBench for multi-grain retrieval.

Yuhan Liu, Pei Fu, Hang Li et al.

2026-06-18 11

cs.IR 2606.20235

ScholarQuest: A Taxonomy-Guided Benchmark for Agentic Academic Paper Search in Open Literature Environments

ScholarQuest introduces a taxonomy-guided benchmark with over 1000 CS topics, four research intents, and automated answer construction, advancing systematic evaluation of academic search agents.

Tingyue Pan, Mingyue Cheng, Daoyu Wang et al.

2026-06-18 14

cs.IR 2606.17707

Do Generative Recommenders Deepen the Information Cocoon? A Closed-Loop Simulation with LLM-powered User Simulators

This study introduces RecLoop, a closed-loop simulation framework, comparing generative and traditional recommenders; findings show generative models better preserve diversity but still face cocoon effects.

Jiyuan Yang, Gengxin Sun, Mengqi Zhang et al.

2026-06-16 38

cs.IR 2606.16970

A Theoretical Framework for Risk Analysis of Stochastic Rankers

Developed a theoretical framework for reranking risk based on DCG variation, validated by experiments on TREC data showing close alignment between predicted and observed deviations.

Debasis Ganguly

2026-06-16 39

cs.IR 2606.13438

CQC-RAG: Robust Retrieval-Augmented Generation via Cross-Query Consistency

CQC-RAG introduces cross-query consistency to enhance robustness in retrieval-augmented generation, outperforming baselines by +4.76 EM on TriviaQA and +9.12 EM on MuSiQue.

Yanjia Sun, Sifan Liu, Jie Shao

2026-06-11 86

cs.IR 2606.10759

miniReranker: Efficient Multimodal Reranking through Visual Cache Reuse and Interaction Sparsity

miniReranker employs visual cache reuse and interaction sparsity to reduce reranking runtime to <1% with >96% performance, based on Qwen3-VL.

Yingqi Fan, Xuan Lu, Anhao Zhao et al.

2026-06-09 72

cs.IR 2606.10388

SkillResolve-Bench: Measuring and Resolving Same-Capability Ambiguity in Agent Skill Retrieval

Introduces SkillResolve-Bench and SkillResolve, achieving Recall@3 0.766, NDCG@3 0.699, and HSR@3=0, effectively reducing same-capability ambiguity risks.

Jiandong Ding

2026-06-09 113

cs.IR 2606.09595

Popcorn: A Configurable Benchmark for Visual Evidence in Multimodal Movie Recommendation

Popcorn benchmark combines title-aligned full-movie/trailer embeddings with VLM-encoded thumbnails to evaluate visual evidence in multimodal movie recommendation.

Ali Tourani, Fatemeh Nazary, Yashar Deldjoo et al.

2026-06-08 48

cs.IR 2606.07252

Constrained Dominant Sets for Multimodal Document Question Answering

Proposes a Constrained Dominant Set (CDS) method for multimodal long-document QA, achieving 66.99 on VisDoMBench, surpassing previous SOTA by 37.1 points.

Ambuj Mehrish, Sebatiano Vascon

2026-06-05 62

cs.IR 2606.06225

Bridging the Semantic-Collaborative Gap: An Asymmetric Graph Architecture for Cold-Start Item Recommendation

Proposes Shallow-RHS, an asymmetric graph architecture for cold-start content recommendation, mapping intrinsic features into a collaborative filtering space for immediate deployment.

Anh Truong, John Trenkle, Yuanbo Chen et al.

2026-06-04 84

cs.IR 2605.22766

Diversed Model Discovery via Structured Table Discovery

StructuredSemanticSearch improves model discovery via structured table retrieval, boosting nugget coverage on 597 queries

Zhengyuan Dong, Renée J. Miller

2026-05-22 56

cs.IR 2604.22722

Aligning Dense Retrievers with LLM Utility via DistillationAligning Dense Retrievers with LLM Utility via Distillation

Aligning Dense Retrievers with LLM Utility via Distillation, UAE improves Recall@1 by 30.59% on QASPER benchmark.

Rajinder Sandhu, Di Mu, Cheng Chang et al.

2026-04-25 178

cs.IR 2604.22661

Can QPP Choose the Right Query Variant? Evaluating Query Variant Selection for RAG Pipelines

Research evaluates QPP for selecting the best query variant in RAG pipelines to enhance generation quality.

Negar Arabzadeh, Andrew Drozdov, Michael Bendersky et al.

2026-04-24 168

cs.IR 2604.22504

Objective Shaping with Hard Negatives: Windowed Partial AUC Optimization for RL-based LLM Recommenders

Introduced TAWin method using WPAUC to optimize RL-based recommenders, enhancing Top-K performance.

Wentao Shi, Qifan Wang, Chen Chen et al.

2026-04-24 108

cs.IR 2604.22195

Rethinking Semantic Collaborative Integration: Why Alignment Is Not Enough

Proposes a complementarity fusion method for semantic and collaborative views, avoiding global alignment limitations to enhance recommender systems.

Maolin Wang, Dongze Wu, Jianing Zhou et al.

2026-04-24 210

cs.IR 2604.22180

ResRank: Unifying Retrieval and Listwise Reranking via End-to-End Joint Training with Residual Passage Compression

ResRank enhances retrieval efficiency and effectiveness via residual passage compression and end-to-end joint training.

Xiaojie Ke, Shuai Zhang, Liansheng Sun et al.

2026-04-24 206

cs.IR 2604.19664

ECLASS-Augmented Semantic Product Search for Electronic Components

ECLASS-augmented dense retrieval method achieves 94.3% HitRate@5 in semantic search for electronic components.

Nico Baumgart, Markus Lange-Hegermann, Jan Henze

2026-04-22 111

cs.IR 2604.19566

Diagnosable ColBERT: Debugging Late-Interaction Retrieval Models Using a Learned Latent Space as Reference

Diagnosable ColBERT enhances ColBERT model diagnostics by aligning token embeddings to a clinically-grounded reference latent space.

François Remy

2026-04-21 123

cs.IR 2604.19550

LoopCTR: Unlocking the Loop Scaling Power for Click-Through Rate Prediction

LoopCTR enhances CTR prediction through loop scaling, significantly reducing computational costs.

Jiakai Tang, Runfeng Zhang, Weiqiu Wang et al.

2026-04-21 171

cs.IR 2604.19414

CAST: Modeling Semantic-Level Transitions for Complementary-Aware Sequential Recommendation

CAST framework models semantic-level transitions, achieving 17.6% Recall and 16.0% NDCG gains with 65x training acceleration.

Qian Zhang, Lech Szymanski, Haibo Zhang et al.

2026-04-21 98