Paper Insights - AI Arxiv Paper Analysis

cs.CL 2603.24472

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Self-distillation can degrade LLMs' reasoning in math by suppressing uncertainty expression.

Jeonghye Kim, Xufang Luo, Minbeom Kim et al.

2026-03-26 281

cs.CL 2603.22267

TiCo: Time-Controllable Training for Spoken Dialogue Models

TiCo method significantly enhances time control in dialogue models using Spoken Time Markers, reducing MAE to 4.54 seconds.

Kai-Wei Chang, Wei-Chih Chen, En-Pei Hu et al.

2026-03-24 136

cs.CL 2603.22241

MemDLM: Memory-Enhanced DLM Training

MemDLM embeds a simulated denoising process into training via bi-level optimization, enhancing DLM training efficiency and long-context understanding.

Zehua Pei, Hui-Ling Zhen, Weizhe Lin et al.

2026-03-24 93

cs.CL 2603.20161

Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

Semantic Token Clustering (STC) method achieves efficient uncertainty quantification in large language models, significantly reducing computational overhead.

Qi Cao, Andrew Gambardella, Takeshi Kojima et al.

2026-03-21 112

cs.CL 2603.20100

An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models

Study of SFT-DPO interaction in small models reveals full fine-tuning outperforms LoRA.

Yuming Feng, Christy Yang

2026-03-21 137

cs.CL 2603.19223

F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World

F2LLM-v2 offers efficient multilingual embeddings using a two-stage training and matryoshka learning, supporting over 200 languages.

Ziyin Zhang, Zihan Liao, Hang Yu et al.

2026-03-20 3 citations 105

cs.CL 2603.19220

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Nemotron-Cascade 2 achieves top-tier reasoning with Cascade RL and multi-domain distillation in a 30B MoE model.

Zhuolin Yang, Zihan Liu, Yang Chen et al.

2026-03-20 10 citations 260

cs.CL 2603.19152

VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models

VEPO enhances translation quality and tokenization efficiency for low-resource languages using reinforcement learning with verifiable rewards.

Chonghan Liu, Yimin Du, Qi An et al.

2026-03-20 92

cs.CL 2603.17942

Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing

Efficient training-free multi-token prediction via embedding-space probing, improving LLaMA3 acceptance length by 12%.

Raghavv Goel, Mukul Gagrani, Mingu Lee et al.

2026-03-19 213

cs.CL 2603.15619

Mixture-of-Depths Attention

Mixture-of-Depths Attention (MoDA) improves downstream task performance by 2.11% on a 1.5B-parameter model with only a 3.7% increase in FLOPs.

Lianghui Zhu, Yuxin Fang, Bencheng Liao et al.

2026-03-17 123

cs.CL 2603.15615

Mechanistic Origin of Moral Indifference in Language Models

Correcting moral indifference in language models using Sparse Autoencoders, achieving a 75% win-rate on adversarial benchmarks.

Lingyu Li, Yan Teng, Yingchun Wang

2026-03-17 125

cs.CL 2603.15611

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

Code-A1 enhances code and test generation through an adversarial co-evolution framework.

Aozhe Wang, Yuchen Yan, Nan Zhou et al.

2026-03-17 2 citations 116

cs.CL 2603.13201

Neuron-Aware Data Selection In Instruction Tuning For Large Language Models

NAIT framework selects efficient instruction tuning data via neuron activation patterns, enhancing LLM performance.

Xin Chen, Junchao Wu, Shu Yang et al.

2026-03-14 231

cs.CL 2603.13154

ESG-Bench: Benchmarking Long-Context ESG Reports for Hallucination Mitigation

ESG-Bench significantly reduces hallucinations in long-context ESG report analysis using task-specific Chain-of-Thought prompting strategies.

Siqi Sun, Ben Peng Wu, Mali Jin et al.

2026-03-14 259

cs.CL 2603.13045

Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation

WALAR method enhances low-resource language translation using monolingual data, surpassing LLaMAX model.

Yifeng Liu, Siqi Ouyang, Yatish Hosmane Revanasiddappa et al.

2026-03-13 139

cs.CL 2603.13038

Interpretable Semantic Gradients in SSD: A PCA Sweep Approach and a Case Study on AI Discourse

Proposed a PCA sweep method to optimize dimension selection in SSD, enhancing interpretability and stability.

Hubert Plisiecki, Maria Leniarska, Jan Piotrowski et al.

2026-03-13 1 citations 105

cs.CL 2603.12963

Long-form RewardBench: Evaluating Reward Models for Long-form Generation

Long-form RewardBench evaluates reward models for long-form generation, revealing current models' deficiencies in long-form reward modeling.

Hui Huang, Yancheng He, Wei Liu et al.

2026-03-13 86

cs.CL 2603.12920

HMS-BERT: Hybrid Multi-Task Self-Training for Multilingual and Multi-Label Cyberbullying Detection

HMS-BERT uses hybrid multi-task self-training for multilingual, multi-label cyberbullying detection, achieving a macro F1-score of 0.9847.

Zixin Feng, Xinying Cui, Yifan Sun et al.

2026-03-13 100

cs.CL 2603.12226

Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration

Idea-Catalyst framework boosts scientific creativity via interdisciplinary insights, improving novelty by 21% and insightfulness by 16%.

Priyanka Kargupta, Shuhaib Mehri, Dilek Hakkani-Tur et al.

2026-03-13 139

cs.CL 2603.12206

CLASP: Defending Hybrid Large Language Models Against Hidden State Poisoning Attacks

CLASP model detects malicious tokens using XGBoost classifier, achieving 95.9% token-level F1 score.

Alexandre Le Mercier, Thomas Demeester, Chris Develder

2026-03-13 200