Similarity-based Portfolio Construction for Black-box Optimization
Similarity-based portfolio construction enhances black-box optimization via k-nearest neighbor fine-tuning.
Key Findings
Methodology
The paper proposes a similarity-based algorithm portfolio construction method to address the algorithm selection problem in black-box optimization. The method first constructs an initial portfolio over the full training set and then fine-tunes it for unseen instances using a k-nearest neighbor approach. This allows the portfolio to better adapt to the specific characteristics of problem instances, thereby improving optimization performance. Specifically, the methodology includes: 1) using the Discrete Empirical Attainment Function (EAF) as a performance metric; 2) employing a greedy strategy to construct the portfolio; 3) using k-nearest neighbors to select similar training instances for fine-tuning.
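To make the construction step concrete, the following is a minimal Python sketch of greedy sequential-portfolio construction. It assumes performance has already been summarized in a precomputed array scores[a, b, i] (an EAF-style quality value of algorithm a run for (b + 1) budget slices on training instance i, higher is better); the array layout, slice granularity, and tie-breaking are illustrative assumptions rather than the paper's exact implementation.

```python
import numpy as np

def greedy_portfolio(scores, total_budget, slice_size):
    """Greedily assemble a sequential portfolio as a list of (algorithm, budget) pairs.

    scores[a, b, i]: assumed precomputed quality (e.g. an EAF-based value) of
    algorithm `a` run for (b + 1) * slice_size evaluations on training instance `i`.
    """
    n_algos, n_slices, n_instances = scores.shape
    portfolio = []                          # chosen (algorithm, budget) pairs
    best_so_far = np.zeros(n_instances)     # best quality reached per instance so far
    remaining = total_budget
    while remaining >= slice_size:
        best_gain, best_choice = -np.inf, None
        max_b = min(n_slices, remaining // slice_size)
        for a in range(n_algos):
            for b in range(1, max_b + 1):
                # marginal gain of adding this run on top of the current portfolio,
                # averaged over the training instances
                gain = np.mean(np.maximum(best_so_far, scores[a, b - 1]) - best_so_far)
                gain -= 1e-9 * b            # tie-break in favour of cheaper runs
                if gain > best_gain:
                    best_gain, best_choice = gain, (a, b)
        a, b = best_choice
        portfolio.append((a, b * slice_size))
        best_so_far = np.maximum(best_so_far, scores[a, b - 1])
        remaining -= b * slice_size
    return portfolio
```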
Key Results
- Result 1: The initial portfolio constructed over the full training set already outperforms the Virtual Best Solver (VBS), the strongest traditional baseline.
- Result 2: The k-nearest neighbor fine-tuned portfolio further improves performance on unseen instances, with an average performance increase of about 10%.
- Result 3: Locally fine-tuned portfolios outperform the globally constructed portfolio across problem dimensionalities, with the advantage most pronounced in higher dimensions.
Significance
This research is significant in the field of algorithm selection. Traditional algorithm selection methods often rely on choosing a single algorithm, which carries the risk of poor performance if the wrong algorithm is selected. The proposed portfolio method significantly enhances optimization performance by allocating the budget to multiple algorithms. This approach not only reduces the risk of selecting the wrong algorithm but also leverages complementarities between algorithms to achieve performance gains. For fixed-budget black-box optimization problems, this method offers a more refined solution.
Technical Contribution
The technical contribution of this paper lies in transforming the algorithm selection problem into a budget allocation problem and proposing a k-nearest neighbor-based fine-tuning method to optimize the portfolio. Compared to existing methods, this approach better exploits the complementarities between algorithms and demonstrates robustness and adaptability across various problem dimensions. Additionally, the paper introduces the Discrete Empirical Attainment Function (EAF) as a performance metric, providing a new theoretical foundation for portfolio construction.
Novelty
This paper is the first to apply the k-nearest neighbor method for fine-tuning algorithm portfolios, significantly improving performance in black-box optimization. Compared to traditional single algorithm selection methods, this approach shows higher flexibility and adaptability when dealing with diverse problem instances.
Limitations
- Limitation 1: The method has high computational complexity, especially when dealing with large training sets, as k-nearest neighbor search can increase computational overhead.
- Limitation 2: The performance of the method depends on the quality of the feature space; if the features do not effectively represent problem instances, it may affect the portfolio's effectiveness.
- Limitation 3: In some cases, locally optimized portfolios may not be as stable as globally optimized ones.
Future Work
Future research directions include: 1) developing more efficient k-nearest neighbor search algorithms to reduce computational complexity; 2) exploring richer feature spaces to improve the method's adaptability; 3) validating the method's effectiveness on larger and more complex optimization problems.
AI Executive Summary
In the field of black-box optimization, selecting the right algorithm to solve a given problem has always been a challenge. Traditional algorithm selection methods often rely on choosing a single algorithm, which carries the risk of poor performance if the wrong algorithm is selected. Even well-performing algorithms can yield unsatisfactory results in a single run due to high variance. To address this issue, the paper proposes a similarity-based algorithm portfolio construction method.
The method first constructs an initial portfolio over the full training set and then fine-tunes it for unseen instances using a k-nearest neighbor approach. Specifically, the methodology includes: using the Discrete Empirical Attainment Function (EAF) as a performance metric, employing a greedy strategy to construct the portfolio, and using k-nearest neighbors to select similar training instances for fine-tuning.
Experimental results show that the initial portfolio constructed over the full training set already surpasses the Virtual Best Solver (VBS), demonstrating its superiority. The k-nearest neighbor fine-tuned portfolio further improves performance on unseen instances, with an average performance increase of about 10%. In multi-dimensional problems, locally optimized portfolios outperform globally optimized ones, especially in higher dimensions.
This research is significant for algorithm selection. By allocating the budget across multiple algorithms, the proposed portfolio method reduces the risk of committing to a poorly suited solver and exploits complementarities between algorithms to achieve performance gains, offering a more robust approach to fixed-budget black-box optimization.
However, the method has high computational complexity, especially when dealing with large training sets, as k-nearest neighbor search can increase computational overhead. Additionally, the performance of the method depends on the quality of the feature space; if the features do not effectively represent problem instances, it may affect the portfolio's effectiveness. Future research directions include developing more efficient k-nearest neighbor search algorithms to reduce computational complexity, exploring richer feature spaces to improve the method's adaptability, and validating the method's effectiveness on larger and more complex optimization problems.
Deep Analysis
Background
Black-box optimization is a critical research area that involves optimizing unknown or non-analytical objective functions. As optimization algorithms have diversified, selecting the right algorithm to solve specific problems has become a key challenge. Traditional algorithm selection methods often rely on machine learning techniques, using feature extraction and classification models to predict the optimal algorithm. However, these methods have inherent risks, as selecting the wrong algorithm can lead to poor performance. Additionally, many optimization algorithms are stochastic, meaning that even if the optimal algorithm is selected, it may still perform poorly in a single run. To address these challenges, researchers have begun exploring algorithm portfolio methods, which allocate the budget to multiple algorithms to improve overall performance.
Core Problem
The core problem is how to select the appropriate algorithm for a given problem in black-box optimization. Traditional algorithm selection methods often rely on choosing a single algorithm, which carries the risk of poor performance if the wrong algorithm is selected. Even well-performing algorithms can yield unsatisfactory results in a single run due to high variance. To mitigate this risk, the paper proposes a similarity-based algorithm portfolio construction method, which allocates the budget to multiple algorithms to improve overall performance.
Innovation
The core innovation of this paper is transforming the algorithm selection problem into a budget allocation problem and proposing a k-nearest neighbor-based fine-tuning method to optimize the portfolio. Specific innovations include: 1) using the Discrete Empirical Attainment Function (EAF) as a performance metric, providing a new theoretical foundation for portfolio construction; 2) employing a greedy strategy to construct the initial portfolio, ensuring reasonable budget allocation; 3) using k-nearest neighbors to select similar training instances for fine-tuning, enhancing the portfolio's adaptability and robustness.
Methodology
The methodology of this paper includes the following steps:
- Use the Discrete Empirical Attainment Function (EAF) as a performance metric to evaluate algorithm performance under different budgets.
- Construct an initial portfolio over the full training set using a greedy strategy to ensure a reasonable budget allocation.
- Use k-nearest neighbors to select training instances similar to the unseen instance in feature space.
- Fine-tune the portfolio on these similar instances to adapt it to the specific problem instance (see the sketch after this list).
- Validate the method's effectiveness through experiments, comparing the performance of portfolios before and after fine-tuning.
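Below is a minimal sketch of the k-nearest-neighbor fine-tuning step. It assumes ELA feature matrices plus the greedy_portfolio helper and scores layout from the earlier sketch; the neighborhood size k and all variable names are illustrative assumptions.

```python
import numpy as np
from sklearn.metrics.pairwise import cosine_similarity

def finetune_portfolio(ela_train, ela_test_instance, scores, total_budget,
                       slice_size, k=10):
    """Rebuild the greedy portfolio using only the k training instances most
    similar to the unseen instance (cosine similarity in ELA feature space).

    ela_train: (n_instances, n_features) ELA features of the training set.
    ela_test_instance: 1-D ELA feature vector of the unseen instance.
    scores, total_budget, slice_size: as in the greedy_portfolio sketch above.
    """
    sims = cosine_similarity(ela_train, ela_test_instance.reshape(1, -1)).ravel()
    neighbors = np.argsort(-sims)[:k]          # indices of the k most similar instances
    local_scores = scores[:, :, neighbors]     # restrict the training signal to the neighborhood
    return greedy_portfolio(local_scores, total_budget, slice_size)
```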
Experiments
The experiments are based on the MA-BBOB benchmark suite, covering problem instances across multiple dimensionalities. Four optimization algorithms are used: CMA-ES, Diagonal CMA-ES, RCobyla, and Differential Evolution, each evaluated under a fixed budget to ensure a fair comparison. Problem instances are represented by Exploratory Landscape Analysis (ELA) features, and similarity in this feature space is measured with cosine similarity. Effectiveness is assessed by comparing the performance of portfolios constructed on the full training set with that of portfolios fine-tuned for unseen instances.
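A rough sketch of how such an evaluation loop could be organized, reusing greedy_portfolio and finetune_portfolio from the sketches above; the score layout and the portfolio_quality aggregation are assumptions for illustration, not the paper's exact protocol.

```python
import numpy as np

def portfolio_quality(portfolio, inst_scores, slice_size):
    """Quality of one sequential portfolio on one instance: the best value any of
    its runs attains. inst_scores[a, b] follows the layout of scores[:, :, i]."""
    return max(inst_scores[a, budget // slice_size - 1] for a, budget in portfolio)

def evaluate(ela_train, ela_test, train_scores, test_scores,
             total_budget, slice_size, k=10):
    """Compare the globally constructed portfolio against kNN-fine-tuned portfolios
    on unseen instances. Returns one row per test instance:
    [global portfolio quality, fine-tuned portfolio quality]."""
    global_pf = greedy_portfolio(train_scores, total_budget, slice_size)
    rows = []
    for j in range(ela_test.shape[0]):
        local_pf = finetune_portfolio(ela_train, ela_test[j], train_scores,
                                      total_budget, slice_size, k)
        rows.append([portfolio_quality(global_pf, test_scores[:, :, j], slice_size),
                     portfolio_quality(local_pf, test_scores[:, :, j], slice_size)])
    return np.array(rows)
```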
Results
Experimental results show that the initial portfolio constructed over the full training set already outperforms the Virtual Best Solver (VBS), the strongest traditional baseline. The k-nearest-neighbor fine-tuned portfolio further improves performance on unseen instances, with an average improvement of about 10%. Locally fine-tuned portfolios outperform the globally constructed one across problem dimensionalities, with the advantage most pronounced in higher dimensions. The results also indicate that the portfolio approach effectively exploits complementarities between algorithms.
Applications
This method has potential applications in various fields, including hyperparameter optimization of machine learning models, parameter tuning of complex systems, and multi-objective optimization in engineering design. By allocating the budget to multiple algorithms, the method can improve optimization performance in environments with high uncertainty. Additionally, the method can be applied to real-time systems that require rapid response, dynamically adjusting algorithm combinations to adapt to changing environments.
Limitations & Outlook
Despite its outstanding performance in multiple experiments, the method has some limitations. First, it has high computational complexity, especially when dealing with large training sets, as k-nearest neighbor search can increase computational overhead. Second, the performance of the method depends on the quality of the feature space; if the features do not effectively represent problem instances, it may affect the portfolio's effectiveness. Finally, in some cases, locally optimized portfolios may not be as stable as globally optimized ones. Future research directions include developing more efficient k-nearest neighbor search algorithms to reduce computational complexity, exploring richer feature spaces to improve the method's adaptability, and validating the method's effectiveness on larger and more complex optimization problems.
Plain Language (Accessible to non-experts)
Imagine you're shopping in a large supermarket, and you need to buy ingredients to cook a big meal. The supermarket has a wide variety of ingredients, each with its own strengths and weaknesses. You don't know which ingredient is best for your recipe because you've never cooked this dish before. This is like algorithm selection in black-box optimization: you don't know which algorithm is best suited to solve your problem.
To ensure your meal is successful, you decide not to choose just one ingredient but to select a combination of several. You first pick some ingredients that you think might be suitable, just like constructing an initial portfolio on the training set. Then, you ask other customers in the supermarket to see what ingredients they usually choose for similar dishes, which is like using the k-nearest neighbor method to fine-tune your selection.
In this way, you finally choose a set of ingredients whose combination maximizes the likelihood of your dish's success. This method not only reduces the risk of choosing the wrong ingredient but also enhances the overall flavor of the dish through the complementarity of different ingredients.
Similarly, in black-box optimization, the similarity-based portfolio method ensures better optimization results in environments with high uncertainty by allocating the budget to multiple algorithms.
ELI14 (Explained like you're 14)
Hey there, buddy! Imagine you're playing a super complex game with lots of levels, and each level needs different skills to pass. You've got a toolbox with lots of tools, each with its own specialty. The problem is, you don't know which tool is best for the current level!
This is like what scientists face when doing black-box optimization: they don't know which algorithm is best for solving the current problem. To make sure you can pass the level, you decide not to rely on just one tool but to use a combination of several. First, you pick some tools from your toolbox that you think might be useful, just like scientists constructing an initial portfolio on the training set.
Then, you ask other players to see what tools they've used in similar levels, which is like scientists using the k-nearest neighbor method to fine-tune their selection. In this way, you finally choose a set of tools that help you perform better in the game.
So, scientists do the same thing! By allocating the budget to multiple algorithms, they ensure better optimization results in environments with high uncertainty. This way, even when they encounter super tough levels, they can pass them smoothly!
Glossary
Black-box Optimization
An optimization problem where the objective function is either unknown or non-analytical, typically requiring evaluation through experiments or simulations.
Black-box optimization is the problem setting studied in this paper.
Algorithm Selection
The process of selecting the most suitable algorithm for a given problem instance to achieve optimal performance.
The paper explores how portfolio methods can improve algorithm selection.
Portfolio
In optimization, refers to the strategy of using multiple algorithms simultaneously to improve overall performance.
The paper proposes a similarity-based algorithm portfolio construction method.
k-Nearest Neighbors
A distance-based classification and regression method that predicts by selecting the k closest instances in feature space.
The paper uses the k-nearest neighbor method to fine-tune algorithm portfolios.
Discrete Empirical Attainment Function
A metric tool for evaluating algorithm performance, based on the probability of achieving a target under different budgets.
The paper uses EAF as a performance metric for portfolio construction.
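As an illustration of the idea (not the paper's exact discretization), the attainment probability can be estimated from run histories as the fraction of runs that reach a target within a budget; run_histories, targets, and budgets below are assumed inputs.

```python
import numpy as np

def discrete_eaf(run_histories, targets, budgets):
    """Empirical attainment: fraction of runs reaching each target within each budget.

    run_histories[r, t]: best objective value found by run r after t + 1 evaluations
    (minimization assumed). `targets` and `budgets` are 1-D arrays.
    """
    eaf = np.empty((len(budgets), len(targets)))
    for i, b in enumerate(budgets):
        best_by_b = run_histories[:, :b].min(axis=1)   # best value each run reached within budget b
        for j, tgt in enumerate(targets):
            eaf[i, j] = np.mean(best_by_b <= tgt)      # attainment probability
    return eaf  # averaging over targets and budgets yields a scalar quality score
```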
Virtual Best Solver
A theoretical benchmark representing the selection of the best-performing algorithm for each problem instance.
The experimental results show that the initial portfolio surpasses the Virtual Best Solver.
Exploratory Landscape Analysis
A method for characterizing optimization landscapes through sample-based descriptors.
The paper uses ELA features as the feature representation for problem instances.
Cosine Similarity
A method for measuring the similarity between two vectors, based on their angle rather than length.
The paper uses cosine similarity for similarity measurement in the feature space.
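Concretely, for feature vectors u and v, cos(u, v) = (u · v) / (‖u‖ ‖v‖), which equals 1 when the vectors point in the same direction and does not depend on their magnitudes.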
Greedy Strategy
An algorithm design strategy that selects the locally optimal solution at each step with the hope of finding a global optimum.
The paper employs a greedy strategy to construct the initial portfolio.
Multi-dimensional Problem
A complex problem involving multiple variables or parameters, often requiring more sophisticated optimization strategies.
The paper validates the method's effectiveness on multi-dimensional problems.
Open Questions (Unanswered questions from this research)
1. In black-box optimization, effectively selecting features to represent problem instances remains an open question. Current methods rely on predefined feature spaces, but these features may not fully capture the complexity of the problem. Future research needs to explore richer and more dynamic feature representations to enhance algorithm selection and portfolio performance.
2. Although the k-nearest neighbor method performs well in fine-tuning portfolios, its computational complexity is high, especially on large datasets. Developing more efficient k-nearest neighbor search algorithms to reduce computational overhead is a worthwhile research question.
3. In diverse optimization problems, constructing portfolios with higher robustness remains a challenge. Current methods may not be as stable as globally optimized portfolios in some cases. Future research could explore more flexible combination strategies to adapt to different problem variations.
4. Current portfolio methods mainly rely on the complementarities between algorithms, but how to better quantify and utilize this complementarity still requires further research. Particularly in high-dimensional problems, the interactions between algorithms may be more complex.
5. Applying portfolio methods in real-time systems faces many challenges, including how to quickly respond to environmental changes and dynamically adjust algorithm combinations. This requires developing new dynamic optimization strategies to meet the needs of real-time systems.
Applications
Immediate Applications
Hyperparameter Optimization of Machine Learning Models
By allocating the budget to multiple optimization algorithms, the efficiency of hyperparameter optimization in machine learning models can be improved, especially in high-dimensional parameter spaces.
Parameter Tuning of Complex Systems
In engineering design, using portfolio methods can more effectively tune the parameters of complex systems, improving system performance and stability.
Dynamic Optimization in Real-time Systems
In real-time systems that require rapid response, portfolio methods can dynamically adjust algorithm combinations to adapt to changing environments.
Long-term Vision
Multi-objective Optimization
In multi-objective optimization problems, portfolio methods can simultaneously optimize multiple objectives by allocating the budget to different algorithms, providing more comprehensive solutions.
Intelligent Decision Systems
In the future, portfolio methods can be applied to intelligent decision systems, improving decision accuracy and efficiency by combining multiple algorithms.
Abstract
In black-box optimization, a central question is which algorithm to use to solve a given, previously unseen, problem. Selecting a single algorithm, however, entails inherent risks: inaccuracies in the selector may lead to poor choices, and even well-performing algorithms with high variance can yield unsatisfactory results in a single run. A natural remedy is to split the evaluation budget across multiple runs of potentially different algorithms. Such sequential algorithm portfolios benefit from variance reduction and complementarities between algorithms, often outperforming approaches that allocate the entire budget to a single solver. While effective portfolios can be constructed post-hoc, transferring this idea to the algorithm selection setting is non-trivial. We show that a naive portfolio constructed over the full training set already outperforms the strongest traditional baseline, the virtual best solver. We then propose a simple yet effective k-nearest-neighbor-based finetuning approach to construct portfolios tailored to unseen instances, yielding further improvements and highlighting the effectiveness of portfolio selection in fixed-budget black-box optimization.
References (14)
A Direct Search Optimization Method That Models the Objective and Constraint Functions by Linear Interpolation
M. Powell
Efficient Online Automated Algorithm Selection in the Face of Data-Drift in Optimisation Problem Instances
Jeroen G. Rook, Quentin Renau, H. Trautmann et al.
Cross-disciplinary perspectives on meta-learning for algorithm selection
K. Smith‐Miles
Learning the characteristics of engineering optimization problems with applications in automotive crash
Fu Xing Long, Niki van Stein, M. Frenzel et al.
A survey of features used for representing black-box single-objective continuous optimization
Gjorgjina Cenikj, Ana Nikolikj, G. Petelin et al.
Inferential Performance Assessment of Stochastic Optimisers and the Attainment Function
V. G. D. Fonseca, C. Fonseca, Andreia O. Hall
Learning Techniques for Automatic Algorithm Portfolio Selection
A. Guerri, M. Milano
A Simple Modification in CMA-ES Achieving Linear Time and Space Complexity
Raymond Ros, Nikolaus Hansen
An algorithm for the selection problem
R. Dromey
Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces
R. Storn, K. Price
A Portfolio Approach to Algorithm Selection
H. Zender, G. Kruijff, Ivana Kruijff-Korbayová
Automated Algorithm Selection: Survey and Perspectives
P. Kerschke, H. Hoos, F. Neumann et al.
Per-run Algorithm Selection with Warm-starting using Trajectory-based Features
Ana Kostovska, Anja Jankovic, D. Vermetten et al.
Exploratory landscape analysis
Olaf Mersmann, B. Bischl, H. Trautmann et al.