BIRDNet: Mining and Encoding Boolean Implication Knowledge Graphs as Interpretable Deep Neural Networks

TL;DR

BIRDNet encodes mined Boolean implication graphs into sparse, interpretable deep neural networks, achieving near state-of-the-art AUROC with 96x fewer active parameters on six biomedical datasets.

cs.LG 🔴 Advanced 2026-05-28 68 views

Tirtharaj Dash

AI Reader Arxiv Page Download PDF

knowledge graphs neurosymbolic AI sparse neural networks interpretability bioinformatics

Key Findings

Methodology

This study introduces a framework that mines Boolean implication relationships (BIRs) from high-dimensional tabular data using a statistical sparse-exception binomial test. The mined implications form a typed directed graph, representing propositional rules with at most two literals. These graphs are encoded as the connectivity of a layered neural network, called BIRDNet, where each hidden unit corresponds to one implication and connects only to its two features, enforced by a fixed binary mask. The network is constructed layer-wise in a greedy manner: at each layer, a new implication graph is mined from the post-activation outputs of the previous layer, and each implication adds a hidden unit. This design guarantees sparsity—each unit has exactly two active incoming weights—and interpretability, as each unit retains a stable symbolic identity. The model does not rely on external rule bases; all structure is data-driven. Training involves fixing the structure and optimizing weights via AdamW, with the network maintaining the mined structure. Post-training, rules are directly read off the network, enabling transparent explanations. The approach is validated on six biomedical datasets, including transcriptomics and proteomics, showing competitive AUROC scores within 0.02 of dense baselines, with up to 96-fold reduction in active parameters.

Key Results

Across six datasets, BIRDNet achieves AUROC scores within 0.02 of the strongest dense models, e.g., 0.998 on TCGA RPPA, demonstrating near-par performance while maintaining extreme sparsity. The model uses up to 96 times fewer active parameters than an architecture-matched dense MLP, significantly reducing complexity. The first-layer rules recover known biological signatures, such as HER2 amplification, proliferation markers, and immune infiltration signatures, validating the biological relevance of the mined rules. The parameter compression ratio exceeds 30-fold in some datasets, with the rules being directly interpretable and biologically meaningful. The model's sparse structure allows for efficient inference and transparent rule extraction, facilitating scientific understanding and validation.
The rules extracted from the trained network align well with established biological knowledge, such as gene co-amplifications and tissue-specific modules. For example, in breast cancer datasets, rules involving ERBB2 and NCAPGCENPA correspond to known biomarkers. The hierarchical rule tracing for individual samples demonstrates how the model’s decisions are grounded in interpretable propositional implications, enabling detailed mechanistic insights. The combination of high predictive accuracy, interpretability, and parameter efficiency positions BIRDNet as a promising tool for biomedical discovery and clinical decision support.
The experimental results highlight that BIRDNet maintains robust performance across diverse datasets with varying sample sizes and feature dimensions. Its ability to identify biologically relevant features and rules underscores its potential for uncovering novel biomarkers and understanding disease mechanisms. The model's design ensures that each rule is stable and readable, providing a transparent decision pathway that can be scrutinized and validated by domain experts. This transparency is crucial for advancing trust and adoption in sensitive fields like healthcare.

Significance

This work addresses a fundamental challenge in applying deep learning to scientific domains: the trade-off between predictive performance and interpretability. By integrating symbolic logic directly into the neural network structure, BIRDNet offers a novel solution that combines the strengths of rule-based reasoning with the flexibility of deep learning. Its data-driven approach to structure learning eliminates reliance on external knowledge bases, making it adaptable to various domains. The resulting models are not only accurate but also inherently interpretable, enabling scientists and clinicians to understand the basis of predictions, verify biological hypotheses, and discover new insights. This paradigm shift paves the way for AI systems that are both powerful and transparent, fostering greater trust and facilitating scientific validation.

Technical Contribution

The core technical innovation lies in formalizing Boolean implication relationships as a structured, mineable knowledge graph, which is then encoded into a neural network architecture with guaranteed sparsity. The use of a sparse-exception binomial test for relationship mining ensures statistical robustness. The layer-wise greedy construction process allows the network to grow adaptively, with each hidden unit corresponding to a specific implication, connected only to two features, thus ensuring interpretability and parameter efficiency. The fixed binary mask enforces the structure throughout training, preventing the need for pruning or regularization. The model's ability to directly read out propositional rules from trained units distinguishes it from existing neurosymbolic models that rely on post-hoc explanations or external rule bases. Theoretical bounds on active weights provide guarantees on sparsity and compression ratios, contributing to the understanding of model complexity.

Novelty

This research is the first to embed mined Boolean implication graphs as structural priors directly into neural networks, enabling the model to learn interpretable, sparse representations solely from data. Unlike prior neurosymbolic approaches that depend on pre-existing rule bases or ontologies, BIRDNet constructs its symbolic structure dynamically, reflecting the data's inherent logical relationships. Its layer-wise, greedy construction strategy ensures scalability and interpretability, setting it apart from traditional dense neural networks and rule-based systems. The approach bridges the gap between symbolic reasoning and deep learning, offering a new paradigm for transparent AI in scientific domains.

Limitations

The current implementation only supports binary (2-arity) implications, which may be insufficient for systems with complex, higher-order relationships, limiting its expressive power in some scientific contexts.
The structure is entirely data-driven, lacking prior domain knowledge integration, which could hinder performance in data-scarce or noisy environments. Incorporating domain expertise could enhance robustness.
Computational cost of mining implications and training sparse networks increases with feature dimensionality, posing challenges for ultra-high-dimensional datasets. Future work should focus on algorithmic optimization.
The model does not explicitly handle temporal or dynamic relationships, limiting its applicability to static datasets. Extending to time-series or multi-modal data remains an open challenge.
While the model excels in interpretability, the reliance on mined rules may overlook subtle, complex interactions that require higher-order logic, suggesting a need for future extensions to capture such relations.

Future Work

Future research will explore extending the framework to higher-arity implications, enabling modeling of more complex relationships. Integrating prior domain knowledge, such as curated ontologies, can further improve the model's robustness and biological relevance. Algorithmic improvements are needed to scale the mining process for ultra-high-dimensional data, possibly through parallelization or approximation techniques. Additionally, incorporating temporal dynamics and multi-modal data will broaden applicability, especially in longitudinal studies and multi-omics integration. Developing user-friendly tools for rule visualization and domain expert interaction will facilitate adoption in biological research and clinical practice. Finally, applying BIRDNet to real-world biomedical problems, such as drug response prediction and biomarker discovery, will validate its practical impact.

AI Executive Summary

In the rapidly evolving field of biomedical data analysis, deep learning models have demonstrated remarkable predictive capabilities, yet their opaque nature severely hampers scientific interpretability. Researchers and clinicians alike face the challenge of understanding why a model makes a particular prediction, especially when decisions impact patient care or biological discovery. Traditional rule-based systems, while transparent, lack the flexibility and scalability needed for complex, high-dimensional data. To bridge this gap, Tirtharaj Dash introduces BIRDNet, a novel neural network architecture that encodes mined Boolean implication relationships directly into its structure, achieving an elegant balance between accuracy, sparsity, and interpretability.

The core innovation of BIRDNet lies in its data-driven construction of a knowledge graph composed of Boolean implications—logical relationships indicating that the high or low state of one feature implies the same for another. These relationships are identified using a statistical sparse-exception binomial test, which filters out spurious correlations and retains only significant implications. The resulting graph is a typed, directed structure that captures the latent logical dependencies among features. Instead of relying on external ontologies or predefined rules, BIRDNet learns this structure solely from the data, making it highly adaptable to different domains.

This knowledge graph is then encoded into a layered neural network, where each hidden unit corresponds to a specific implication, connecting only to the two features involved. The network construction proceeds layer-wise in a greedy manner: at each step, a new set of implications is mined from the previous layer’s outputs, and each adds a new unit with fixed, sparse connections. This design guarantees that each layer remains sparse—at most 2/d of the weights are active, where d is the input dimension—and that each rule retains a clear symbolic identity. During training, the structure is fixed, and weights are optimized via AdamW, ensuring the network remains sparse and interpretable.

Empirical evaluation on six biomedical datasets—including transcriptomics and proteomics—demonstrates that BIRDNet achieves predictive performance within 0.02 AUROC of the strongest dense baseline, while using up to 96 times fewer active parameters. The first-layer rules successfully recover known biological signatures, such as gene amplifications, co-expression modules, and immune markers, validating the biological relevance of the mined rules. The interpretability is further enhanced by the ability to directly read off propositional implications from trained units, enabling transparent explanations at the sample level.

Overall, BIRDNet offers a powerful paradigm for integrating symbolic reasoning into deep learning, providing models that are both accurate and inherently interpretable. Its data-driven structure learning approach aligns well with scientific discovery, allowing researchers to uncover meaningful biological relationships without reliance on external rule bases. This work opens new avenues for explainable AI in life sciences, with potential applications in disease diagnosis, biomarker discovery, and personalized medicine. Future extensions will focus on capturing higher-order relations, incorporating domain knowledge, and scaling to larger datasets, ultimately advancing AI’s role in scientific understanding and clinical decision-making.

Deep Analysis

Background

The application of deep learning in biomedical sciences has revolutionized data-driven discovery, enabling high-accuracy predictions in genomics, transcriptomics, and proteomics. However, these models often function as black boxes, limiting their utility for scientific interpretation and clinical trust. Early efforts incorporated external ontologies, such as Gene Ontology or Reactome pathways, to embed biological knowledge into neural architectures, but these approaches depend heavily on curated databases and lack flexibility. Recent advances in neurosymbolic AI aim to combine symbolic reasoning with neural networks, but many rely on predefined rule bases or complex logical modules, which are not automatically learned from data. Sahoo et al. introduced Boolean implication networks as a way to discover latent logical relationships among genes, providing a promising direction for data-driven symbolic modeling. Nonetheless, integrating these relationships into scalable, interpretable neural architectures remains an open challenge. This paper builds upon these foundations, proposing a novel method to encode mined implication graphs directly into neural network structures, thereby achieving a balance of interpretability, sparsity, and predictive performance.

Core Problem

Despite the success of deep models in biomedical tasks, their lack of transparency hinders scientific validation and clinical adoption. Existing rule-based models depend on external knowledge bases, which may be incomplete or outdated, and often lack flexibility to adapt to new data. Conversely, purely data-driven neural networks are opaque, making it difficult to interpret their decisions or extract meaningful biological insights. The core problem addressed here is how to construct a neural network that is both highly sparse and inherently interpretable, by embedding logical relationships directly into its architecture, without relying on external rule sets. Achieving this requires developing robust methods to mine latent feature relationships, encode them as network connections, and ensure that the resulting model maintains high predictive accuracy while providing transparent rule explanations.

Innovation

The key innovations include: 1) a statistical method based on sparse-exception binomial testing to automatically mine Boolean implication relationships from data, capturing latent logical dependencies among features; 2) a layer-wise greedy construction algorithm that encodes these implications as sparse network connections, with each hidden unit representing a specific propositional rule involving only two features; 3) a fixed binary mask that enforces the structure during training, ensuring sparsity and interpretability without post-hoc pruning; 4) direct rule readout from trained units, enabling transparent explanations aligned with biological knowledge; 5) theoretical bounds on the active weight fraction, guaranteeing model compression and efficiency. These innovations collectively enable the creation of a neural network that is both sparse and symbolic, bridging the gap between black-box models and rule-based systems.

Methodology

�� Data preprocessing: Apply StepMiner thresholding to binarize continuous features, resulting in a binary feature matrix. • Implication mining: For each feature pair, test four types of implications (e.g., high→high, low→low) using a sparse-exception binomial test, which counts exception samples violating the implication. Significance is determined by p-value<10^-10 and exception fraction<0.0516. • Knowledge graph construction: Valid implications form a typed directed graph, with edges labeled by implication type. • Network encoding: Each implication corresponds to a hidden unit with weights initialized to connect only the two involved features, enforced by a binary mask. • Layer-wise construction: Starting from input, mine implications among features; then, for each subsequent layer, mine implications on the previous layer’s outputs, adding units accordingly. • Training: Fix the network structure, optimize weights via AdamW, with BatchNorm, ReLU, and Dropout. • Rule reading: Post-training, extract propositional rules directly from units based on their fixed symbolic identities, and evaluate rule precision, recall, and lift on validation data. • Explanation: Use hierarchical relevance propagation to trace decision paths back to specific rules and features.

Experiments

�� Datasets: Six biomedical datasets—UCI mice protein, TCGA RPPA, GSE39582 transcriptomics, UCI gene expression, METABRIC, TCGA RNA-seq—covering sample sizes from 566 to 10,051, feature counts from 77 to 54,675, and class numbers from 5 to 27. • Baselines: Dense MLP with identical architecture, L1-penalized linear models, and Random Forests. • Hyperparameters: p-value threshold 10^-10, maximum of 5000 implications per layer, maximum depth 2. • Cross-validation: 5-fold stratified, with 15% held-out for early stopping. Features standardized within training folds. • Metrics: AUROC for predictive performance, active parameter count, rule precision, and lift for interpretability. • Rule evaluation: Extract top rules per class, validate on held-out data, and analyze biological relevance through gene and pathway annotations.

Results

�� Predictive accuracy: BIRDNet’s AUROC scores are within 0.02 of the dense baseline across all six datasets, e.g., 0.998 AUROC on TCGA RPPA, demonstrating near-equivalent performance with vastly fewer parameters. • Parameter efficiency: The number of active weights is up to 96 times less than the dense MLP, achieving high compression ratios while maintaining accuracy. • Biological relevance: First-layer rules recover known signatures such as ERBB2 amplification in breast cancer, co-expression modules in tissue-specific datasets, and immune infiltration markers. These rules align with established biological knowledge, validating the interpretability. • Model sparsity: Theoretical bounds guarantee that each layer’s active weights do not exceed 2/d of total weights, ensuring extreme sparsity. • Rule interpretability: Rules can be directly read from the trained network, providing transparent decision pathways that match biological intuition and facilitate validation.

Applications

�� Diagnostic tools: BIRDNet can be employed for cancer subtype classification, biomarker discovery, and disease prognosis, especially when high-dimensional omics data are available. • Scientific discovery: The interpretable rules facilitate hypothesis generation and validation, enabling biologists to verify potential causal relationships. • Clinical decision support: Transparent models can assist clinicians in understanding model predictions, increasing trust and adoption. • Future integration: Combining BIRDNet with multi-omics and temporal data could enable dynamic disease modeling, personalized treatment planning, and real-time monitoring. • Knowledge base construction: The mined rules can serve as a foundation for building automated, evolving biological knowledge repositories.

Limitations & Outlook

�� The current framework supports only pairwise (2-arity) implications, limiting its ability to model complex multi-factor interactions common in biological systems. Extending to higher-arity rules is necessary. • The reliance on data-driven mining without prior domain knowledge may lead to missing biologically relevant relationships, especially in small or noisy datasets. Incorporating expert knowledge could improve robustness. • The computational cost of mining implications increases with feature dimensionality, posing scalability challenges for ultra-high-dimensional data. Algorithmic optimization and parallelization are needed. • The model does not explicitly handle temporal or spatial relationships, restricting its application to static datasets. Future work should integrate temporal modeling. • While highly interpretable, the model may overlook subtle, non-linear interactions that require more complex logical structures, suggesting a need for richer rule representations.

Abstract

Tabular data in knowledge-rich domains often carries a latent prior in the form of Boolean implication relationships (BIRs) between pairs of features. We mine such relationships with a sparse-exception binomial test. The mined implications form a typed directed graph, equivalent to a propositional rule base of 2-literal clauses. We encode this graph as the connectivity of a layered neural network, called BIRDNet, in which each hidden unit corresponds to one mined rule and binds only to its two features. We show two consequences of this design: First, the architecture is sparse by construction: at most $2/d$ of the weights in each BIR layer are active, where $d$ is the input dimension. Second, the model is interpretable: every trained unit keeps a stable symbolic identity, so rules can be read off the network without surrogate models. Unlike most neurosymbolic models, BIRDNet does not consume an external rule base; its structural prior is mined from the data. We evaluate BIRDNet on six transcriptomic and proteomic benchmarks. Our results show that BIRDNet stays within 0.02 AUROC of the strongest dense baseline, at a small accuracy cost, while using up to $96\times$ fewer active parameters than an architecture-matched dense MLP. First-layer rules recover known biological signatures across multiple cancer subtypes and tissue types, including canonical amplicons, lineage-defining co-expression modules, and immune-infiltration markers. Data and code are available at: https://github.com/MAHI-Group/BIRDNet.

cs.LG cs.AI cs.NE q-bio.QM

References (13)

Immunohistochemical and Clinical Characterization of the Basal-Like Subtype of Invasive Breast Carcinoma

T. Nielsen, Forrest D. Hsu, Kristin Jensen et al.

2004 2685 citations

Gene Expression Classification of Colon Cancer into Molecular Subtypes: Characterization, Validation, and Prognostic Value

L. Marisa, A. de Reyniès, A. Duval et al.

2013 1334 citations

On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation

Sebastian Bach, Alexander Binder, G. Montavon et al.

2015 4896 citations

Boolean implication networks derived from large scale, whole genome microarray datasets

D. Sahoo, D. Dill, A. Gentles et al.

2008 130 citations

The Cancer Genome Atlas Pan-Cancer Analysis Project

J. Weinstein, E. Collisson, G. Mills et al.

2013 8155 citations

A pan-cancer proteomic perspective on The Cancer Genome Atlas

Rehan Akbani, P. Ng, H. M. Werner et al.

2014 549 citations

Self-Organizing Feature Maps Identify Proteins Critical to Learning in a Mouse Model of Down Syndrome

C. Higuera, K. Gardiner, K. Cios

2015 173 citations

Structured Pruning Learns Compact and Accurate Models

J. Devlin, Ming-Wei Chang, Kenton Lee et al.

2021 103 citations

The Power of Boolean Implication Networks

D. Sahoo

2012 24 citations

Alveolar Surfactant Homeostasis and the Pathogenesis of Pulmonary Disease

J. Whitsett, S. Wert, T. Weaver

2010 431 citations

The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups

C. Curtis, Sohrab P. Shah, S. Chin et al.

2012 5662 citations

Logical Explanations for Deep Relational Machines Using Relevance Information

A. Srinivasan, L. Vig, Michael Bain

2018 17 citations View Analysis →

Activation of multiple cancer-associated genes at the ERBB2 amplicon in breast cancer.

P. Kauraniemi, A. Kallioniemi

2006 131 citations

BIRDNet: Mining and Encoding Boolean Implication Knowledge Graphs as Interpretable Deep Neural Networks

Key Findings

Methodology

Key Results

Significance

Technical Contribution

Novelty

Limitations

Future Work

AI Executive Summary

Deep Analysis

Background

Core Problem

Innovation

Methodology

Experiments

Results

Applications

Limitations & Outlook

Abstract

References (13)

Related Papers

Execution-State Capsules: Graph-Bound Execution-State Checkpoint and Restore for Low-Latency, Small-Batch, On-Device Physical-AI Serving

On the Oracle Complexity of Interpolation-Based Gradient Descent

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

Zero-Shot Active Feature Acquisition via LLM-Elicitation

Looped World Models

Kolmogorov Regression for Robust Diffusion Policies