Robustness, Cost, and Attack-Surface Concentration in Phishing Detection

TL;DR

A cost-aware evasion framework reveals robustness gaps in phishing detection; median evasion cost is 2, with over 80% attacks on three low-cost features.

cs.LG 🔴 Advanced 2026-03-20 79 views

Julian Allagan Mohamed Elbakary Zohreh Safari Weizheng Gao Gabrielle Morgan Essence Morgan Vladimir Deriglazov

AI Reader Arxiv Page Download PDF

phishing detection robustness evasion attacks feature economics machine learning

Key Findings

Methodology

The paper introduces a cost-aware evasion framework that models discrete, monotone feature edits under explicit attacker budgets. Three diagnostics are introduced: minimal evasion cost (MEC), the evasion survival rate, and the robustness concentration index (RCI). The framework computes evasion paths through shortest-path search, considering attacker cost constraints on feature edits.

Key Results

On the UCI Phishing Websites benchmark, Logistic Regression, Random Forests, Gradient Boosted Trees, and XGBoost achieve AUC≥0.979 under static evaluation. However, under budgeted evasion, all models have a median MEC of 2, with over 80% of successful evasions concentrating on three low-cost surface features.
Feature restriction improves robustness only when it removes all dominant low-cost transitions. Under strict cost schedules, infrastructure-leaning feature sets exhibit 17-19% infeasible mass for ensemble models, while the median MEC among evadable instances remains unchanged.
If a positive fraction of correctly detected phishing instances admit evasion through a single feature transition, no classifier can raise the corresponding MEC quantile above this transition cost without modifying the feature representation or cost model.

Significance

This study reveals that robustness in phishing detection is governed by feature economics rather than model complexity. By providing a clear cost-aware framework, the research bridges the gap between static evaluation and adversarial deployment, highlighting the phenomenon of attack-surface concentration in the context of feature economics. This finding is significant for both academia and industry, as it shifts the focus from model selection to feature selection and cost model design when facing attackers.

Technical Contribution

The technical contribution of this paper lies in proposing a novel cost-aware evasion framework that provides deep insights into the robustness of phishing detection models through discrete cost optimization, attack-surface concentration measurement, and a formal robustness ceiling. Unlike existing continuous perturbation models, this framework considers discrete feature semantics and attacker economic behavior, offering new theoretical guarantees and engineering possibilities.

Novelty

This study is the first to formalize the evasion problem in phishing detection as a shortest-path problem on a cost-weighted discrete transition graph. Compared to previous studies, this paper emphasizes the importance of feature economics in adversarial environments and introduces new diagnostics to evaluate model robustness.

Limitations

The threat model only considers monotone feature edits, excluding anti-feature injection or extractor-level attacks, which may underestimate the actual capabilities of attackers.
The UCI Phishing Websites dataset used in the study is somewhat outdated, lacking modern phishing detection signals such as certificate transparency logs and JavaScript behavioral fingerprints.
The conclusions on feature economics depend on the specific cost model, which may not be applicable to all scenarios.

Future Work

Future research could relax the monotonicity constraint to consider anti-feature injection and extractor-level attacks for a more comprehensive robustness evaluation. Additionally, studies could extend to modern datasets to verify whether low-cost transitions continue to dominate evasion paths and explore new features and cost models.

AI Executive Summary

Phishing attacks pose a significant challenge in cybersecurity, where attackers modify website features to evade detection. Existing phishing detection models perform well under static evaluation but may be vulnerable in real-world deployment. Many studies assume attackers do not adaptively modify features, which is rarely the case.

This paper introduces a novel cost-aware evasion framework that simulates discrete, monotone feature edits by attackers under explicit budgets. It analyzes the robustness of phishing detection models post-deployment using three diagnostics: minimal evasion cost (MEC), evasion survival rate, and robustness concentration index (RCI). The framework computes evasion paths through shortest-path search, considering the cost constraints on feature edits.

On the UCI Phishing Websites benchmark, Logistic Regression, Random Forests, Gradient Boosted Trees, and XGBoost achieve AUC≥0.979 under static evaluation. However, under budgeted evasion, all models have a median MEC of 2, with over 80% of successful evasions concentrating on three low-cost surface features. Feature restriction improves robustness only when it removes all dominant low-cost transitions.

The study shows that robustness in phishing detection is governed by feature economics rather than model complexity. This finding is significant for both academia and industry, as it shifts the focus from model selection to feature selection and cost model design when facing attackers. Future research could relax the monotonicity constraint to consider anti-feature injection and extractor-level attacks for a more comprehensive robustness evaluation.

Deep Analysis

Background

Phishing attacks have long been a challenge in cybersecurity, where attackers disguise themselves as legitimate websites to steal sensitive user information. Traditional phishing detection methods rely on feature engineering and machine learning models, which perform well under static evaluation but may face evasion by attackers in real-world deployment. Recently, researchers have begun to focus on robustness issues in adversarial environments, proposing various methods to enhance model robustness. However, these methods often assume attackers do not adaptively modify features, ignoring the economic behavior of attackers and the cost of feature edits.

Core Problem

The core problem in phishing detection is maintaining model robustness in adversarial environments. Attackers can evade detection by modifying observable website features, and the cost of modifying these features is often asymmetric. Surface features like URL structure and HTML artifacts are inexpensive to modify, while infrastructure signals like domain age and DNS records require higher costs. Existing robustness analyses often rely on continuous perturbation models that fail to fully consider discrete feature semantics and attacker economic behavior.

Innovation

The core innovation of this paper is the introduction of a cost-aware evasion framework that formalizes the evasion problem in phishing detection as a shortest-path problem on a cost-weighted discrete transition graph. • The framework computes evasion paths through shortest-path search, considering attacker cost constraints on feature edits. • It introduces three diagnostics: minimal evasion cost (MEC), evasion survival rate, and robustness concentration index (RCI) to evaluate model robustness. • It emphasizes the importance of feature economics in adversarial environments, highlighting that model selection is less important than feature selection and cost model design.

Methodology

The methodology of this paper includes several key steps:

�� Establishing a cost-aware evasion framework: formalizing the evasion problem in phishing detection as a shortest-path problem on a cost-weighted discrete transition graph.

�� Designing minimal evasion cost (MEC): calculating the smallest cumulative cost required to induce misclassification under a given budget.

�� Calculating evasion survival rate and robustness concentration index (RCI): evaluating whether evasion paths disperse across many features or concentrate on a small subset under budget constraints.

�� Computing evasion paths through shortest-path search, considering attacker cost constraints on feature edits.

�� Using the UCI Phishing Websites benchmark dataset for experiments to evaluate the robustness of different models in adversarial environments.

Experiments

The experimental design uses the UCI Phishing Websites benchmark dataset, which contains 11,055 instances and 30 ternary features. The study evaluates the robustness of Logistic Regression, Random Forests, Gradient Boosted Trees, and XGBoost models. Experiments use a 75/25 train-test split and test evasion attacks under budget constraints. Key hyperparameters include the number of trees, maximum depth, and learning rate. Ablation studies analyze the impact of different feature sets and cost schedules on model robustness.

Results

The experimental results show that under static evaluation, all models achieve AUC≥0.979. However, under budgeted evasion, all models have a median MEC of 2, with over 80% of successful evasions concentrating on three low-cost surface features. Feature restriction improves robustness only when it removes all dominant low-cost transitions. Under strict cost schedules, infrastructure-leaning feature sets exhibit 17-19% infeasible mass for ensemble models, while the median MEC among evadable instances remains unchanged.

Applications

The method proposed in this paper can be directly applied to enhance the robustness of phishing detection systems, especially when facing attackers. By optimizing feature selection and cost model design, systems can improve adversarial robustness without significantly reducing detection accuracy. This is significant for cybersecurity in finance, e-commerce, and social media.

Limitations & Outlook

The limitations of this paper include its threat model, which only considers monotone feature edits and excludes anti-feature injection or extractor-level attacks. Additionally, the UCI Phishing Websites dataset used is somewhat outdated, lacking modern phishing detection signals such as certificate transparency logs and JavaScript behavioral fingerprints. Future research could relax the monotonicity constraint to consider anti-feature injection and extractor-level attacks for a more comprehensive robustness evaluation.

Plain Language Accessible to non-experts

Imagine a factory with many machines, each having a specific function. Now, there are workers trying to produce counterfeit products by changing the machine settings, while the factory manager needs to ensure these machines operate correctly and are not tampered with. The manager can implement various security measures to prevent workers from altering the machine settings, such as locking the machines or setting up alarm systems. However, these measures come at different costs; some are cheap, like simple locks, while others are expensive, like complex alarm systems. The manager needs to find a balance between cost and security to ensure the factory runs smoothly.

In phishing detection, attackers are like those workers trying to tamper with machine settings, and the detection system is like the factory manager. Attackers evade detection by modifying website features, while the detection system needs to optimize feature selection and cost model design to enhance robustness. This study shows that robustness in phishing detection is governed by feature economics rather than model complexity. By providing a clear cost-aware framework, the research bridges the gap between static evaluation and adversarial deployment, highlighting the phenomenon of attack-surface concentration in the context of feature economics.

ELI14 Explained like you're 14

Hey there, friends! Did you know that in the online world, some bad guys pretend to be legit websites to steal our info? It's like in games where some players pretend to be your teammates but actually want to backstab you! To stop this, we need smart detection systems to spot these imposters.

But these bad guys are clever too. They change small details on websites to dodge detection, just like in games where they might change their name or wear different gear to confuse you. Our detection systems need constant upgrades to keep up with them.

This study is like giving our detection systems a new strategy to better spot these imposters. It finds that the robustness of detection systems mainly depends on how they choose and use features, not the complexity of the model itself. Just like in games, choosing the right gear and skills is more important than having complex moves!

In the future, we can further improve detection systems' robustness by optimizing feature selection and cost model design. This way, we can better protect our info from bad guys' attacks!

Glossary

Minimal Evasion Cost (MEC)

Minimal evasion cost is the smallest cumulative cost required for an attacker to induce misclassification under a given budget. It is computed through shortest-path search, considering attacker cost constraints on feature edits.

In this paper, MEC is used to evaluate the robustness of different models in adversarial environments.

Evasion Survival Rate

Evasion survival rate measures whether evasion paths disperse across many features or concentrate on a small subset under budget constraints. It is evaluated by calculating survival curves under different budgets.

The paper uses evasion survival rate to analyze attack-surface concentration.

Robustness Concentration Index (RCI)

Robustness concentration index measures the fraction of adversarial effort concentrated on a small subset of features. It evaluates model robustness by calculating the proportion of most-edited features.

RCI is used to analyze whether evasion paths concentrate on a small subset of features under budget constraints.

Cost-Aware Evasion Framework

A cost-aware evasion framework simulates discrete, monotone feature edits by attackers under explicit budgets. It computes evasion paths through shortest-path search, considering attacker cost constraints on feature edits.

The paper proposes a cost-aware evasion framework to evaluate the robustness of phishing detection models.

UCI Phishing Websites Benchmark

The UCI Phishing Websites benchmark is a dataset containing 11,055 instances and 30 ternary features, used to evaluate the performance of phishing detection models.

The paper uses the UCI Phishing Websites benchmark to evaluate the robustness of different models in adversarial environments.

Feature Economics

Feature economics studies the impact of feature selection and cost model design on model robustness, emphasizing the cost of feature edits and attacker economic behavior.

The paper emphasizes the importance of feature economics in phishing detection, highlighting that model selection is less important than feature selection and cost model design.

Monotone Feature Edits

Monotone feature edits refer to attackers removing phishing indicators and pushing feature values toward legitimate states to evade detection.

The paper's threat model considers monotone feature edits to evaluate model robustness.

Anti-Feature Injection

Anti-feature injection refers to attackers adding benign-looking HTML artifacts to boost legitimacy scores, thereby evading detection.

The paper does not consider anti-feature injection but notes its potential impact on robustness evaluation.

Extractor-Level Attacks

Extractor-level attacks involve attackers exploiting parser ambiguities to alter computed features without semantic change.

The paper does not consider extractor-level attacks but notes their potential impact on robustness evaluation.

Attack-Surface Concentration

Attack-surface concentration refers to evasion paths concentrating on a small subset of low-cost features under budget constraints, rather than dispersing across many features.

The paper analyzes attack-surface concentration through RCI and evasion survival rate.

Open Questions Unanswered questions from this research

1 How can feature selection and cost model design be optimized to enhance adversarial robustness without significantly reducing detection accuracy? Existing studies often focus on model selection, neglecting feature selection and cost model design.
2 How do anti-feature injection and extractor-level attacks affect model robustness when the monotonicity constraint is relaxed? Existing studies fail to fully consider these attack methods.
3 How can low-cost transitions continue to dominate evasion paths on modern datasets? The UCI Phishing Websites dataset used in existing studies is somewhat outdated, lacking modern phishing detection signals.
4 How can new features and cost models be designed to enhance model robustness against more sophisticated attackers? Existing studies often assume attackers do not adaptively modify features.
5 How can model robustness be improved in adversarial environments without increasing computational costs? Existing studies often focus on model accuracy, neglecting computational cost issues.
6 How can feature selection and cost model design be adjusted according to specific needs in different application scenarios to enhance system robustness? Existing studies often focus on general models, neglecting specific scenario needs.
7 How can new theoretical guarantees and engineering possibilities further enhance the robustness of phishing detection models? Existing studies often focus on improving existing methods, neglecting new theoretical and engineering possibilities.

Applications

Immediate Applications

Financial Security

Financial institutions can enhance adversarial robustness by optimizing feature selection and cost model design in phishing detection systems, preventing economic losses from phishing attacks.

E-commerce Security

E-commerce platforms can apply the method proposed in this paper to enhance the robustness of their security systems, protecting users' personal information and transaction security.

Social Media Security

Social media platforms can improve phishing detection systems to prevent users from phishing attacks, protecting users' privacy and account security.

Long-term Vision

Comprehensive Cybersecurity

By optimizing feature selection and cost model design, more robust cybersecurity systems can be built, significantly enhancing cybersecurity across industries.

Intelligent Security Systems

Develop intelligent security systems that can automatically adapt to changes in attacker strategies, improving system adaptability and robustness.

Abstract

Phishing detectors built on engineered website features attain near-perfect accuracy under i.i.d.\ evaluation, yet deployment security depends on robustness to post-deployment feature manipulation. We study this gap through a cost-aware evasion framework that models discrete, monotone feature edits under explicit attacker budgets. Three diagnostics are introduced: minimal evasion cost (MEC), the evasion survival rate $S(B)$, and the robustness concentration index (RCI). On the UCI Phishing Websites benchmark (11\,055 instances, 30 ternary features), Logistic Regression, Random Forests, Gradient Boosted Trees, and XGBoost all achieve $\mathrm{AUC}\ge 0.979$ under static evaluation. Under budgeted sanitization-style evasion, robustness converges across architectures: the median MEC equals 2 with full features, and over 80\% of successful minimal-cost evasions concentrate on three low-cost surface features. Feature restriction improves robustness only when it removes all dominant low-cost transitions. Under strict cost schedules, infrastructure-leaning feature sets exhibit 17-19\% infeasible mass for ensemble models, while the median MEC among evadable instances remains unchanged. We formalize this convergence: if a positive fraction of correctly detected phishing instances admit evasion through a single feature transition of minimal cost $c_{\min}$, no classifier can raise the corresponding MEC quantile above $c_{\min}$ without modifying the feature representation or cost model. Adversarial robustness in phishing detection is governed by feature economics rather than model complexity.

cs.LG

References (19)

Catching Phishers By Their Bait: Investigating the Dutch Phishing Landscape through Phishing Kit Detection

Hugo L. J. Bijmans, Tim M. Booij, Anneke Schwedersky et al.

2021 57 citations ⭐ Influential

Sunrise to Sunset: Analyzing the End-to-end Life Cycle and Effectiveness of Phishing Attacks at Scale

Adam Oest, Penghui Zhang, Brad Wardman et al.

2020 159 citations ⭐ Influential

Machine learning based phishing detection from URLs

O. K. Sahingoz, Ebubekir Buber, Önder Demir et al.

2019 668 citations

Towards Evaluating the Robustness of Neural Networks

Nicholas Carlini, D. Wagner

2016 9647 citations View Analysis →

SoK: A Comprehensive Reexamination of Phishing Research From the Security Perspective

Avisha Das, Shahryar Baki, Ayman El Aassal et al.

2019 130 citations View Analysis →

Phishing Detection: A Literature Survey

Mahmoud Khonji, Y. Iraqi, Andrew Jones

2013 567 citations

Explaining and Harnessing Adversarial Examples

I. Goodfellow, Jonathon Shlens, Christian Szegedy

2014 21600 citations View Analysis →

Deep Learning for Phishing Detection: Taxonomy, Current Challenges and Future Directions

N. Do, A. Selamat, O. Krejcar et al.

2022 153 citations

Automatically Evading Classifiers: A Case Study on PDF Malware Classifiers

Weilin Xu, Yanjun Qi, David Evans

2016 246 citations

A comprehensive survey of AI-enabled phishing attacks detection techniques

A. Basit, Maham Zafar, Xuan Liu et al.

2020 344 citations

Evasion Attacks against Machine Learning at Test Time

B. Biggio, Igino Corona, Davide Maiorca et al.

2013 2286 citations View Analysis →

Intriguing Properties of Adversarial ML Attacks in the Problem Space

Fabio Pierazzi, Feargus Pendlebury, Jacopo Cortellazzi et al.

2019 326 citations

CatchPhish: detection of phishing websites by inspecting URLs

Routhu Srinivasa Rao, T. Vaishnavi, A. R. Pais

2019 120 citations

“Real Attackers Don't Compute Gradients”: Bridging the Gap Between Adversarial ML Research and Practice

Giovanni Apruzzese, H. Anderson, Savino Dambra et al.

2022 114 citations View Analysis →

DeltaPhish: Detecting Phishing Webpages in Compromised Websites

Igino Corona, B. Biggio, M. Contini et al.

2017 84 citations View Analysis →

Adversarial Robustness of Deep Learning: Theory, Algorithms, and Applications

Wenjie Ruan, Xinping Yi, Xiaowei Huang

2021 20 citations View Analysis →

Wild Patterns: Ten Years After the Rise of Adversarial Machine Learning

B. Biggio, F. Roli

2017 1572 citations View Analysis →

Predicting phishing websites based on self-structuring neural network

R. Mohammad, F. Thabtah, L. Mccluskey

2013 341 citations

Phishpedia: A Hybrid Deep Learning Based Approach to Visually Identify Phishing Webpages

Yun Lin, Ruofan Liu, Dinil Mon Divakaran et al.

2021 158 citations

Robustness, Cost, and Attack-Surface Concentration in Phishing Detection

Key Findings

Methodology

Key Results

Significance

Technical Contribution

Novelty

Limitations

Future Work

AI Executive Summary

Deep Analysis

Background

Core Problem

Innovation

Methodology

Experiments

Results

Applications

Limitations & Outlook

Plain Language Accessible to non-experts

ELI14 Explained like you're 14

Glossary

Minimal Evasion Cost (MEC)

Evasion Survival Rate

Robustness Concentration Index (RCI)

Cost-Aware Evasion Framework

UCI Phishing Websites Benchmark

Feature Economics

Monotone Feature Edits

Anti-Feature Injection

Extractor-Level Attacks

Attack-Surface Concentration

Open Questions Unanswered questions from this research

Applications

Immediate Applications

Financial Security

E-commerce Security

Social Media Security

Long-term Vision

Comprehensive Cybersecurity

Intelligent Security Systems

Abstract

References (19)

Related Papers

Scalable Hyperparameter-Divergent Ensemble Training with Automatic Learning Rate Exploration for Large Models

Efficient learning by implicit exploration in bandit problems with side observations

Necessary and sufficient conditions for universality of Kolmogorov-Arnold networks

Collocation-based Robust Physics Informed Neural Networks for time-dependent simulations of pollution propagation under thermal inversion conditions on Spitsbergen

Spend Less, Fit Better: Budget-Efficient Scaling Law Fitting via Active Experiment Selection

Neural Recovery of Historical Lexical Structure in Bantu Languages from Modern Data