Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution

TL;DR

Proposes Code2LoRA, a hypernetwork-based method generating repository-specific adapters, achieving 63.8% cross-repo and 66.2% in-repo exact match on static tasks, and 60.3% on evolving codebases.

cs.SE 🔴 Advanced 2026-06-05 72 views

Liliana Hotsko Yinxi Li Yuntian Deng Pengyu Nie

AI Reader Arxiv Page Download PDF

code understanding parameter-efficient tuning hypernetwork software evolution model adaptation

Key Findings

Methodology

The proposed Code2LoRA framework employs a hypernetwork architecture that maps repository-level code embeddings to low-rank LoRA adapters. It consists of a frozen repository encoder that compresses the entire codebase into a dense vector, which then serves as input to a trainable hypernetwork. In the static scenario, the hypernetwork directly projects this embedding into LoRA weights, enabling a one-shot adaptation. In the dynamic setting, a GRU-based recurrent hypernetwork incrementally updates the adapter parameters by processing sequential code diffs, capturing the evolution of the repository over time. The training process optimizes the hypernetwork parameters via cross-entropy loss on assertion completion tasks, ensuring the generated adapters effectively encode repository knowledge without incurring inference token overhead. This design addresses the challenge of large context windows and codebase evolution, providing a scalable solution for code understanding in large-scale, evolving repositories.

Key Results

On the static track of RepoPeftBench, Code2LoRA-Static achieved 63.8% exact match (EM) on cross-repo splits, surpassing retrieval-augmented generation (RAG) with 39.7% and dependency analysis with 48.2%. In in-repo settings, it reached 66.2%, nearly matching the per-repository LoRA upper bound of 64.0%.
In the evolution track, Code2LoRA-Evo, leveraging sequential diff updates via GRU, attained 60.3% cross-repo EM, outperforming a single shared LoRA by 5.2 percentage points. It maintained robust performance on temporal out-of-distribution tests, demonstrating strong generalization and adaptation capabilities.
Overall, the hypernetwork-driven approach significantly improves model performance and adaptability across static and dynamic codebases, outperforming traditional fine-tuning and context-based methods, thus setting new benchmarks for repository-level code understanding.

Significance

This work introduces a paradigm shift in code understanding by embedding repository-specific knowledge directly into model parameters through hypernetworks, bypassing the limitations of context window size and retrieval latency. The ability to dynamically adapt to code evolution enables continuous, real-time model updates, greatly benefiting large-scale software maintenance, automated bug fixing, and intelligent code navigation. The framework's scalability and efficiency open new avenues for deploying sophisticated code models in industrial settings, where software changes are frequent and rapid. By addressing core challenges in software engineering and machine learning integration, this research paves the way for more resilient, adaptable, and intelligent code assistants.

Technical Contribution

The paper's key technical contribution lies in designing a hypernetwork that maps comprehensive repository embeddings to all seven types of LoRA matrices across transformer layers, enabling full-parameter adaptation. The static hypernetwork directly projects the repository embedding, while the dynamic variant incorporates a GRU to process sequential diffs, capturing the temporal evolution of code. The repository encoder employs a two-step embedding process using a frozen pre-trained model, ensuring efficiency and robustness. The training procedure optimizes the hypernetwork end-to-end with cross-entropy loss on assertion tasks, without additional inference overhead. The authors also develop a comprehensive benchmark, RepoPeftBench, to evaluate the approach across diverse repositories and tasks, establishing a new standard for repository-level PEFT methods.

Novelty

This research is the first to utilize hypernetworks for generating full-spectrum LoRA adapters conditioned on entire code repositories, addressing both static and evolving scenarios. Unlike prior methods limited to short task descriptions or single documents, Code2LoRA scales to large codebases, maintaining high performance without retraining. The dynamic variant, which employs a GRU to track code changes over time, introduces a novel mechanism for continual adaptation in software evolution, a significant advancement over static or retriever-based approaches. This integration of hypernetworks with repository-level code understanding represents a pioneering step in scalable, parameter-efficient AI for software engineering.

Limitations

The effectiveness of Code2LoRA heavily depends on the quality of the repository encoder; if the embedding fails to capture essential code semantics, adapter performance diminishes.
Training the hypernetwork requires substantial computational resources, especially for the dynamic variant with recurrent updates, limiting real-time deployment in extremely large repositories.
In cases of drastic code restructuring or refactoring, the incremental updates may not suffice, necessitating further research into more robust adaptation mechanisms.

Future Work

Future research will explore more sophisticated repository encoders, such as graph-based or semantic-aware models, to enhance embedding quality. Additionally, integrating reinforcement learning or meta-learning strategies could optimize the update policies of the dynamic hypernetwork, improving robustness against complex code changes. Extending the framework to multi-language repositories and incorporating test and documentation contexts are promising directions. Finally, efforts to reduce computational overhead and enable real-time deployment will be crucial for industrial adoption.

AI Executive Summary

The rapid growth and continuous evolution of software repositories pose significant challenges for code understanding and automation. Traditional models rely heavily on long input contexts or frequent fine-tuning, both of which are costly and brittle in dynamic environments. This paper introduces Code2LoRA, a novel framework that leverages hypernetworks to generate repository-specific LoRA adapters, effectively embedding repository knowledge directly into model parameters without incurring inference token overhead.

At its core, Code2LoRA employs a frozen repository encoder that compresses the entire codebase into a dense vector representation. This embedding serves as the input to a trainable hypernetwork, which maps it to the low-rank matrices defining the LoRA adapters. In the static scenario, a single snapshot of the repository is encoded once, and the hypernetwork produces a fixed adapter that captures the repository’s knowledge. For evolving codebases, the dynamic scenario introduces a GRU-based recurrent hypernetwork that processes sequential code diffs, updating the adapter parameters incrementally as the repository changes.

The authors constructed RepoPeftBench, a comprehensive benchmark comprising 604 Python repositories, designed to evaluate parameter-efficient fine-tuning methods across static and dynamic settings. The static track involves tasks derived from a single snapshot, while the evolution track leverages commit histories to simulate real-world software development. Results demonstrate that Code2LoRA-Static achieves 63.8% cross-repo exact match, outperforming retrieval-based methods like RAG (39.7%) and dependency analysis (48.2%). In in-repo tests, it reaches 66.2%, nearly matching the upper bound of per-repository LoRA training (64.0%).

In the dynamic setting, Code2LoRA-Evo, which updates adapters via a GRU as code changes, attains 60.3% cross-repo exact match, surpassing static methods and shared LoRA baselines by significant margins. Its robustness is further validated on temporal out-of-distribution tests, maintaining high performance despite ongoing code evolution.

These findings highlight the potential of hypernetwork-driven parameter adaptation for large-scale, evolving software systems. By embedding repository knowledge directly into model parameters, Code2LoRA overcomes the limitations of context window size and retrieval latency, enabling scalable, real-time code understanding. The approach opens new avenues for intelligent code assistants, automated maintenance, and continuous integration workflows. Future work will focus on enhancing repository embeddings, reducing computational costs, and extending the framework to multi-language and multi-modal scenarios. Overall, this research marks a significant step toward resilient, adaptable AI-powered software engineering tools, with broad implications for both academia and industry.

Deep Analysis

Background

随着软件系统规模的不断扩大，代码仓库的复杂性和演化速度也在不断提升。传统的代码理解模型多依赖于长上下文输入或频繁微调，面临着成本高昂、效率低和适应性差的难题。近年来，参数高效微调（PEFT）技术如LoRA（Hu et al., 2022）被提出，用于在保持模型性能的同时减少训练成本，但其在仓库级别的应用仍有限。超网络（Ha et al., 2017）作为一种生成目标网络参数的机制，为实现仓库级知识注入提供了新的可能。相关工作如Text2LoRA和Doc2LoRA，虽能在短文本或单文档中生成LoRA，但难以扩展到长代码仓库的复杂场景。软件工程研究也强调对代码变更的持续跟踪（Kagdi et al., 2007; Hassan, 2008），但缺乏结合深度学习模型的动态适应机制。本文结合这些研究背景，提出了基于超网络的仓库级适配方案，旨在解决大规模软件系统中的知识注入和持续演化问题。

Core Problem

现有的代码理解模型在面对大规模、持续演化的代码仓库时，表现出明显的局限性。长输入上下文限制了模型的理解能力，频繁微调成本高昂且难以实时更新，而依赖上下文检索的方法在检索效率和准确性上存在瓶颈。特别是在软件持续演化的场景中，模型需要不断适应新提交的代码变更，传统的微调或静态适配器难以满足动态需求。这些问题严重制约了代码智能辅助工具的实用性和推广应用。

Innovation

本文的核心创新在于引入超网络机制，动态生成仓库特定的LoRA适配器，实现知识的参数化注入。具体创新点包括：1）设计了仓库编码器，将全局代码信息压缩为固定向量，作为超网络的输入；2）开发了静态超网络（Code2LoRA-Static），实现一次性映射；3）引入递归GRU机制（Code2LoRA-Evo），持续跟踪代码变更，动态更新适配器；4）构建了专门的基准RepoPeftBench，为参数高效微调提供评估平台。这些创新突破了传统上下文注入和微调的局限，使模型在软件演化中表现出更强的适应性和泛化能力。

Methodology

�� 仓库编码器：利用预训练的Qwen3-Embedding-0.6B模型，将每个文件或差异片段划分为4096-token块，嵌入后平均池化，得到文件向量，再通过加权平均和最大池化融合为仓库向量。
�� 超网络设计：静态场景中，超网络将仓库向量直接映射为LoRA参数；动态场景中，加入GRU递归机制，逐步更新适配器参数。
�� 训练流程：采用交叉熵损失，端到端训练超网络参数，输入为仓库嵌入或递归状态，输出为各层的LoRA矩阵。
�� 适配器生成：在每次推理中，超网络根据仓库信息生成LoRA参数，注入到预训练模型中，无需额外的token开销。
�� 动态更新：在演化场景中，利用提交差异信息作为GRU输入，递归更新适配器状态，保持模型对代码变更的敏感性。

Experiments

�� 数据集：构建包含604个Python仓库的RepoPeftBench，分为静态和演化两个轨道，涵盖不同的任务和测试集。
�� 评估指标：采用Exact Match（EM）、编辑相似度（Edit Similarity）和CodeBLEU，全面衡量生成代码的准确性和结构一致性。
�� 实验设置：比较多种基线，包括预训练模型、RAG、依赖分析、微调LoRA、单一LoRA、每仓库微调LoRA，以及Text2LoRA等超网络方法。
�� 超参数：Rank=16的LoRA，超网络参数约720-745M，训练3轮，使用H100 GPU。
�� Ablation研究：验证不同超网络设计和递归机制对性能的影响。

Results

�� 静态场景中，Code2LoRA-Static在CR（跨仓库）测试中达到63.8%的EM，超越RAG（39.7%）和依赖分析（48.2%），在IR（仓库内）测试中达66.2%，几乎达到单仓微调的上限（64.0%）。
�� 演化场景中，Code2LoRA-Evo在提交差异序列中持续更新，取得60.3%的CR和64.5%的IR精确匹配率，比单一共享LoRA提升了5.2个百分点，且在时间外测试中表现出优异的鲁棒性。
�� 这些结果验证了超网络在仓库级知识注入和动态适应中的有效性，显著优于传统微调和上下文注入方法。

Applications

�� 代码理解与生成：可应用于自动补全、错误修复、代码导航等场景，特别适合大型、动态演化的代码库。
�� 软件维护：帮助开发者快速理解和适应代码变更，减少维护成本。
�� 智能开发助手：结合持续集成流程，实现模型的实时更新和智能辅助，提升开发效率。

Limitations & Outlook

�� 当前模型对极端代码重构或大规模结构变化的适应性有限，未来需引入更强的鲁棒机制。
�� 超网络训练成本较高，尤其在动态场景中递归更新带来额外计算负担，限制了在极大规模仓库中的实时应用。
�� 仓库编码器的表达能力依赖于预训练模型的质量，若编码器表现不足，可能影响整体性能。

Plain Language Accessible to non-experts

想象你在一家大工厂工作，工厂里有许多不同的车间，每个车间都在不断变化。有时候，工厂会引入新设备，或者调整生产流程。为了让工厂保持高效运转，你需要一个聪明的助手，能随时了解工厂的最新情况，并根据变化调整自己的工作方式。

传统的方法就像是每次工厂变动后都让助手重新学习所有的流程，这样既费时又不灵活。而本文提出的方案，像是给助手装了一个智能的“超级大脑”，它可以根据工厂的最新变化，实时调整自己的操作策略，而不需要每次都重新学习所有内容。这个“超级大脑”通过一种叫超网络的技术实现，它能根据工厂的整体情况，快速生成最适合当前工厂的操作指南。

更厉害的是，这个助手还能记住工厂的历史变化，随着时间推移，逐步调整自己的策略，就像是它在不断学习工厂的“脾气”。这样，无论工厂怎么变，它都能帮你保持高效率，减少出错。这个方法不仅节省了时间，还让工厂的生产变得更加智能和灵活。

Abstract

Code language models need repository-level context to resolve imports, APIs, and project conventions. Existing methods inject this knowledge as long inputs (retrieved through RAG or dependency analysis) or through per-repository fine-tuning and LoRA -- costly at repository scale and brittle to evolving codebases. We introduce Code2LoRA, a hypernetwork framework that generates repository-specific LoRA adapters, effectively injecting repository knowledge with zero inference-time token overhead. Code2LoRA supports two usage scenarios: Code2LoRA-Static converts a single repository snapshot into an adapter, suitable for comprehension of stable codebases; while Code2LoRA-Evo maintains an adapter backed by a GRU hidden state updated per code diff, suitable for active development of evolving codebases. To evaluate Code2LoRA against parameter-efficient fine-tuning baselines, we build RepoPeftBench, a benchmark of 604 Python repositories with two tracks: a static track with 40K training and 12K test assertion-completion tasks, and an evolution track with 215K commit-derived training and 87K commit-derived test tasks. On the static track, Code2LoRA-Static achieves 63.8% cross-repo and 66.2% in-repo exact match, matching the per-repository LoRA upper bound; on the evolution track, Code2LoRA-Evo achieves 60.3% cross-repo exact match (+5.2 pp over a single shared LoRA). Code2LoRA's code can be found at https://anonymous.4open.science/r/code2lora-6857; the model checkpoints and RepoPeftBench datasets can be found at https://huggingface.co/code2lora.

cs.SE cs.AI cs.CL

Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution

Key Findings

Methodology

Key Results

Significance

Technical Contribution

Novelty

Limitations

Future Work

AI Executive Summary

Deep Analysis

Background

Core Problem

Innovation

Methodology

Experiments

Results

Applications

Limitations & Outlook

Plain Language Accessible to non-experts

Abstract

Related Papers

Probe-and-Refine Tuning of Repository Guidance for Coding Agents

FASE: Fast Adaptive Semantic Entropy for Code Quality

Scaffold, Not Vocabulary? A Controlled, Two-Tier, Pre-Registered Study of a Popperian Code-Generation Skill

From Natural Language to Verified Code: Toward AI Assisted Problem-to-Code Generation with Dafny-Based Formal Verification

Code Review Agent Benchmark

Evaluating LLM-Based Test Generation Under Software Evolution