ARIADNE: A Perception-Reasoning Synergy Framework for Trustworthy Coronary Angiography Analysis

TL;DR

ARIADNE uses DPO and RL for coronary angiography, achieving a centerline Dice of 0.838.

cs.CV πŸ”΄ Advanced 2026-03-20 40 views
Zhan Jin Yu Luo Yizhou Zhang Ziyang Cui Yuqing Wei Xianchao Liu Xueying Zeng Qing Zhang
Coronary Angiography Deep Learning Topological Consistency Preference Learning Reinforcement Learning

Key Findings

Methodology

The ARIADNE framework combines a preference-aligned perception module with an RL-based reasoning module. The perception module employs DPO to fine-tune the Sa2VA vision-language model using Betti number constraints as preference signals, ensuring geometrically complete vessel structures. The reasoning module formulates stenosis localization as a Markov Decision Process with an explicit rejection mechanism, autonomously deferring ambiguous anatomical candidates like bifurcations and vessel crossings.

Key Results

  • On 1,400 clinical angiograms, ARIADNE achieves a state-of-the-art centerline Dice of 0.838, reducing false positives by 41% compared to geometric baselines.
  • External validation on multi-center benchmarks ARCADE and XCAD confirms generalization across acquisition protocols.
  • This represents the first application of DPO for topological alignment in medical imaging, demonstrating that preference-based learning mitigates topological violations while maintaining diagnostic sensitivity.

Significance

The ARIADNE framework addresses the topological consistency issue in coronary angiography analysis by integrating deep learning and reinforcement learning. It not only improves the accuracy of vessel segmentation but also reduces false positive rates in stenosis detection. This successful application provides a more reliable automated diagnostic tool for interventional cardiology workflows, filling the gap in structural continuity and diagnostic reliability in existing methods.

Technical Contribution

ARIADNE's technical contribution lies in the first application of DPO for topological alignment in medical imaging, achieving structural constraint optimization through preference learning. The integration of the perception and reasoning modules offers a novel diagnostic automation approach, maintaining high sensitivity and low false positive rates in complex anatomical structures.

Novelty

ARIADNE is the first framework to apply DPO for topological alignment in medical imaging. Its core innovation lies in guiding the model to learn geometrically complete vessel structures through preference signals, rather than merely optimizing pixel-level overlap.

Limitations

  • The model's topological consistency remains unstable in low-contrast and complex anatomical structures like vessel crossings.
  • The preference-aligned perception module requires a large amount of annotated data for training, which may limit its practical deployment.
  • In extreme cases, the rejection mechanism may defer too many ambiguous candidates, affecting diagnostic efficiency.

Future Work

Future research directions include optimizing the preference alignment learning process to reduce reliance on annotated data and further improving topological consistency in complex anatomical structures. Additionally, exploring the application of this framework to other medical imaging fields, such as cerebral angiography or pulmonary CT scans, is promising.

AI Executive Summary

Coronary artery disease (CAD) remains a leading cause of morbidity and mortality worldwide. Traditional coronary angiography analysis relies heavily on manual interpretation, characterized by significant inter-observer variability and susceptibility to clinician fatigue. While convolutional neural networks (CNNs) and vision transformers (ViTs) have made strides in pixel-level performance, they still face limitations in preserving vascular topology. Traditional loss functions fail to explicitly penalize topological errors, resulting in fragmented vascular trees.

To address this issue, the ARIADNE framework combines a preference-aligned perception module with an RL-based reasoning module. The perception module employs DPO to fine-tune the Sa2VA vision-language model using Betti number constraints as preference signals, ensuring geometrically complete vessel structures. The reasoning module formulates stenosis localization as a Markov Decision Process with an explicit rejection mechanism, autonomously deferring ambiguous anatomical candidates like bifurcations and vessel crossings.

On 1,400 clinical angiograms, ARIADNE achieves a state-of-the-art centerline Dice of 0.838, reducing false positives by 41% compared to geometric baselines. External validation confirms the method's generalization across different acquisition protocols. This successful application provides a more reliable automated diagnostic tool for interventional cardiology workflows.

However, ARIADNE's topological consistency remains unstable in low-contrast and complex anatomical structures. Additionally, the preference-aligned perception module requires a large amount of annotated data for training, which may limit its practical deployment.

Future research directions include optimizing the preference alignment learning process to reduce reliance on annotated data and further improving topological consistency in complex anatomical structures. Additionally, exploring the application of this framework to other medical imaging fields, such as cerebral angiography or pulmonary CT scans, is promising.

Deep Analysis

Background

Coronary artery disease (CAD) is a leading cause of morbidity and mortality worldwide. Invasive X-ray coronary angiography (XCA) serves as the primary tool for CAD diagnosis and guidance of percutaneous coronary interventions (PCI), offering high temporal resolution necessary for visualizing hemodynamic flow. However, current clinical workflows rely heavily on manual interpretation, a process characterized by significant inter-observer variability and susceptibility to clinician fatigue. As healthcare institutions universally adopt Picture Archiving and Communication Systems (PACS), a critical gap persists between passive image storage and active, automated clinical interpretation. While hospitals have implemented digital image storage, they lack automated systems capable of transforming raw imaging data into actionable clinical insights. The growing volume of interventional procedures makes purely manual interpretation increasingly unsustainable, creating demand for computer-aided diagnosis systems that can bridge the gap between data acquisition and clinical decision-making.


Accurate segmentation of the coronary vascular tree represents a fundamental prerequisite for automated coronary analysis. Over the past decade, convolutional neural networks (CNNs), particularly U-Net and its attention-enhanced variants such as CS-Net and SA-UNet, have dominated the field. More recently, vision transformers (ViTs) have been introduced to capture global spatial relationships. Despite achieving high pixel-level performance metrics, these models face a critical limitation in preserving vascular topology. Traditional loss functions, including cross-entropy and Dice loss, optimize pixel-level accuracy independently without explicitly penalizing topological errors. Consequently, these models frequently produce fragmented vessel trees where distal branches appear disconnected, particularly due to signal loss in thin vessels during downsampling operations. In coronary hemodynamics analysis, topological connectivity is essential; a segmentation with a high Dice score remains insufficient for clinical use if discontinuities prevent accurate centerline extraction and subsequent geometric analysis.

Core Problem

The core problem in coronary angiography analysis is ensuring topological consistency in vessel segmentation while maintaining high pixel-level accuracy. Conventional pixel-wise loss functions fail to enforce topological constraints in coronary vessel segmentation, producing fragmented vascular trees despite high pixel-level accuracy. Existing automated frameworks predominantly follow a sequential approach where segmentation and stenosis detection are performed as independent tasks. However, these deterministic algorithms lack the ability to distinguish pathological stenosis from common anatomical artifacts, resulting in elevated false positive rates.

Innovation

The core innovations of the ARIADNE framework include the integration of a preference-aligned perception module and an RL-based reasoning module. β€’ The perception module employs DPO to fine-tune the Sa2VA vision-language model using Betti number constraints as preference signals, ensuring geometrically complete vessel structures. β€’ The reasoning module formulates stenosis localization as a Markov Decision Process with an explicit rejection mechanism, autonomously deferring ambiguous anatomical candidates like bifurcations and vessel crossings. β€’ This represents the first application of DPO for topological alignment in medical imaging, demonstrating that preference-based learning mitigates topological violations while maintaining diagnostic sensitivity.

Methodology

The ARIADNE framework is designed to emulate the hierarchical decision-making process of human experts through two biomimetic stages: a perception module for anatomically consistent vascular reconstruction and a reasoning module for context-aware lesion localization. β€’ The perception module employs the Sa2VA foundation model with a progressive training strategy designed to enforce topological continuity throughout the segmentation process. We integrate DPO into the training pipeline to align model outputs toward geometrically complete vessel structures rather than fragmented pixel-level predictions. β€’ The reasoning module operates as a structure-guided diagnostic agent that navigates the extracted vessel skeleton to identify stenotic lesions. We develop an RL agent that analyzes local geometric features, including radius gradients and curvature patterns, to perform context-aware lesion localization.

Experiments

The experimental design was structured to address two primary objectives: evaluation of topological consistency in vascular segmentation across diverse angiographic conditions, and assessment of stenosis detection accuracy and false positive management in anatomically complex scenarios. β€’ A proprietary dataset was curated from coronary angiography video sequences acquired at Guizhou Aviation Industry Group 302 Hospital using a Siemens angiography system. β€’ The dataset comprises 1,400 high-resolution images at 512Γ—512 resolution from 35 patients, with an average of 40 frames extracted per patient to capture varying vessel angulations and contrast conditions. β€’ To prevent data leakage inherent in video-based acquisitions, the dataset was partitioned at the patient level rather than the image level.

Results

On 1,400 clinical angiograms, ARIADNE achieves a state-of-the-art centerline Dice of 0.838, reducing false positives by 41% compared to geometric baselines. β€’ External validation on multi-center benchmarks ARCADE and XCAD confirms generalization across acquisition protocols. β€’ This represents the first application of DPO for topological alignment in medical imaging, demonstrating that preference-based learning mitigates topological violations while maintaining diagnostic sensitivity.

Applications

The ARIADNE framework can be directly applied to interventional cardiology workflows to enhance the automation of coronary angiography analysis. β€’ The method significantly reduces false positive rates while maintaining high sensitivity, improving diagnostic reliability. β€’ Future exploration could extend the framework's application to other medical imaging fields, such as cerebral angiography or pulmonary CT scans.

Limitations & Outlook

While ARIADNE performs well in coronary angiography analysis, the model's topological consistency remains unstable in low-contrast and complex anatomical structures like vessel crossings. β€’ The preference-aligned perception module requires a large amount of annotated data for training, which may limit its practical deployment. β€’ In extreme cases, the rejection mechanism may defer too many ambiguous candidates, affecting diagnostic efficiency.

Plain Language Accessible to non-experts

Imagine you're cooking in a kitchen. You have a recipe that guides you step-by-step to make a delicious dish. Now, imagine this recipe not only tells you what to do at each step but also adjusts based on your kitchen's actual situation, like suggesting alternatives if you're missing an ingredient. This is what the ARIADNE framework does in coronary angiography analysis. It's like a smart recipe that not only identifies vessels but also ensures these vessels are coherent in the image, just like ensuring each step of your dish is done correctly. In this way, it helps doctors diagnose coronary artery disease more accurately, just like a good chef can make a perfect dish.

ELI14 Explained like you're 14

Hey there, buddy! Did you know doctors use something called coronary angiography to check if our heart's blood vessels are blocked? It's like looking at a super complex map to find where the traffic jams are. But sometimes these maps aren't very clear, like driving in fog. So, scientists invented a super helper called ARIADNE. It's like a smart GPS that helps doctors see every road on the map clearly, making sure they don't miss any important spots. This way, doctors can find problems faster and more accurately, just like finding hidden treasures in a game! Cool, right?

Glossary

Coronary Angiography

A medical imaging technique used to visualize the blood vessels of the heart, aiding in the diagnosis of coronary artery disease.

Used in the paper to evaluate the effectiveness of the ARIADNE framework.

Topological Consistency

Ensuring the structural integrity and coherence of segmentation results in image analysis.

A key goal achieved by the ARIADNE framework through DPO.

Preference Learning

A machine learning approach that guides model learning by comparing the relative merits of different samples.

Used to guide the perception module in the ARIADNE framework.

Reinforcement Learning

A machine learning method that trains agents to make decisions through rewards and penalties.

Used in the reasoning module of the ARIADNE framework.

Betti Number

A concept in topology used to describe the connectivity of a spatial structure.

Used as a preference signal in the perception module.

Centerline Dice

A metric for evaluating the overlap between segmentation results and true structures.

Used to assess the segmentation performance of the ARIADNE framework.

Markov Decision Process

A mathematical framework for modeling decision-making processes, including states, actions, and rewards.

Used for stenosis localization in the reasoning module.

Rejection Mechanism

A strategy to defer decisions in uncertain situations, reducing erroneous judgments.

Used in the reasoning module to handle ambiguous anatomical candidates.

Vision Transformer

A deep learning model for image analysis that captures global spatial relationships.

Mentioned as one of the traditional methods in the background.

Convolutional Neural Network

A deep learning model for image processing, adept at extracting local features.

Mentioned as one of the traditional methods in the background.

Open Questions Unanswered questions from this research

  • 1 How to further improve topological consistency in low-contrast and complex anatomical structures? Current methods are unstable in these scenarios, requiring stronger structural constraints and better visual signal processing.
  • 2 How to reduce reliance on large amounts of annotated data? The preference-aligned perception module requires substantial annotated data for training, which may limit its practical deployment.
  • 3 How to optimize the rejection mechanism to improve diagnostic efficiency? In extreme cases, the rejection mechanism may defer too many ambiguous candidates, affecting diagnostic efficiency.
  • 4 How to apply the ARIADNE framework to other medical imaging fields? While it performs well in coronary angiography, its application in other fields remains to be explored.
  • 5 How to further reduce false positive rates? Although significantly improved, false positives still occur in complex anatomical structures, requiring more refined diagnostic strategies.

Applications

Immediate Applications

Interventional Cardiology

The ARIADNE framework can be used to enhance the automation of coronary angiography analysis, aiding doctors in more accurately diagnosing coronary artery disease.

Medical Imaging Analysis

The method can be applied to other medical imaging fields, such as cerebral angiography or pulmonary CT scans, providing more reliable automated diagnostic tools.

Clinical Workflow Optimization

By reducing false positive rates and improving diagnostic efficiency, the ARIADNE framework can optimize clinical workflows, alleviating the workload of doctors.

Long-term Vision

Fully Automated Diagnostic Systems

The successful application of the ARIADNE framework lays the foundation for developing fully automated diagnostic systems, potentially transforming the way medical imaging analysis is conducted.

Cross-Domain Applications

As the technology matures, the ARIADNE framework is expected to find applications in other fields, such as industrial inspection and image analysis in autonomous vehicles.

Abstract

Conventional pixel-wise loss functions fail to enforce topological constraints in coronary vessel segmentation, producing fragmented vascular trees despite high pixel-level accuracy. We present ARIADNE, a two-stage framework coupling preference-aligned perception with RL-based diagnostic reasoning for topologically coherent stenosis detection. The perception module employs DPO to fine-tune the Sa2VA vision-language foundation model using Betti number constraints as preference signals, aligning the policy toward geometrically complete vessel structures rather than pixel-wise overlap metrics. The reasoning module formulates stenosis localization as a Markov Decision Process with an explicit rejection mechanism that autonomously defers ambiguous anatomical candidates such as bifurcations and vessel crossings, shifting from coverage maximization to reliability optimization. On 1,400 clinical angiograms, ARIADNE achieves state-of-the-art centerline Dice of 0.838, reduces false positives by 41% compared to geometric baselines. External validation on multi-center benchmarks ARCADE and XCAD confirms generalization across acquisition protocols. This represents the first application of DPO for topological alignment in medical imaging, demonstrating that preference-based learning over structural constraints mitigates topological violations while maintaining diagnostic sensitivity in interventional cardiology workflows.

cs.CV cs.AI

References (20)

Automatic Quantitative Coronary Analysis Based on Deep Learning

Xuqing Liu, Xiaofei Wang, Donghao Chen et al.

2023 18 citations ⭐ Influential

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Rafael Rafailov, Archit Sharma, E. Mitchell et al.

2023 7677 citations ⭐ Influential View Analysis β†’

Training and validation of a deep learning architecture for the automatic analysis of coronary angiography

Fu Wai, T. Du, PhD Lihua Xie et al.

24 citations ⭐ Influential

Deep learning to achieve clinically applicable segmentation of head and neck anatomy for radiotherapy

Stanislav Nikolov, Sam Blackwell, R. Mendes et al.

2018 324 citations ⭐ Influential View Analysis β†’

clDice - a Novel Topology-Preserving Loss Function for Tubular Structure Segmentation

Suprosanna Shit, J. Paetzold, A. Sekuboyina et al.

2020 442 citations ⭐ Influential View Analysis β†’

FlowVM-Net: Enhanced Vessel Segmentation in X-Ray Coronary Angiography Using Temporal Information Fusion.

Guangyu Wei, Xueying Zeng, Qing Zhang

2025 2 citations ⭐ Influential

UNet++: A Nested U-Net Architecture for Medical Image Segmentation

Zongwei Zhou, M. R. Siddiquee, Nima Tajbakhsh et al.

2018 8320 citations ⭐ Influential View Analysis β†’

StenUNet: Automatic Stenosis Detection from X-ray Coronary Angiography

Hui Lin, Tom Liu, A. Katsaggelos et al.

2023 11 citations ⭐ Influential View Analysis β†’

U-Net: Convolutional Networks for Biomedical Image Segmentation

O. Ronneberger, P. Fischer, T. Brox

2015 92631 citations ⭐ Influential View Analysis β†’

FR-UNet: A Feature Restoration-Based UNet for Seismic Data Consecutively Missing Trace Interpolation

Yupeng Tian, Lihua Fu, Wenqian Fang et al.

2025 4 citations

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Haobo Yuan, Xiangtai Li, Tao Zhang et al.

2025 110 citations View Analysis β†’

SAM 2: Segment Anything in Images and Videos

Nikhila Ravi, Valentin Gabeur, Yuan-Ting Hu et al.

2024 2737 citations View Analysis β†’

Development of deep learning segmentation models for coronary X-ray angiography: Quality assessment by a new global segmentation score and comparison with human performance.

Miguel Nobre Menezes, JoΓ£o LourenΓ§o-Silva, B. Silva et al.

2022 17 citations

LoRA: Low-Rank Adaptation of Large Language Models

J. Hu, Yelong Shen, Phillip Wallis et al.

2021 17337 citations View Analysis β†’

Vesselness via Multiple Scale Orientation Scores

J. Hannink, R. Duits, E. Bekkers

2014 9 citations View Analysis β†’

Optimizing ensemble U-Net architectures for robust coronary vessel segmentation in angiographic images

Shih-Sheng Chang, Ching-Ting Lin, Wei-Chun Wang et al.

2024 17 citations

A Deep Learning Model for Coronary Artery Segmentation and Quantitative Stenosis Detection in Angiographic Images

Baixiang Huang, Yu Luo, Guangyu Wei et al.

2024 1 citations View Analysis β†’

Lightweight U-Net for Blood Vessels Segmentation in X-Ray Coronary Angiography

Jesus Salvador Ramos-Cortez, D. E. Alvarado-Carrillo, Emmanuel Ovalle-Magallanes et al.

2025 3 citations

Proximal Policy Optimization Algorithms

John Schulman, Filip Wolski, Prafulla Dhariwal et al.

2017 25965 citations View Analysis β†’

Diffusion Model Alignment Using Direct Preference Optimization

Bram Wallace, Meihua Dang, Rafael Rafailov et al.

2023 612 citations View Analysis β†’