Replication Data for: A Metaheuristic Search Algorithm Based on Sampling and Clustering

Dataset

DOI

This repository contains the experimental data associated with the MCSA (Montecarlo-Clustering Search Algorithm), a stochastic metaheuristic designed for solving optimization problems. The dataset supports the results presented in the associated publication and includes raw outputs, benchmark evaluations, and problem-specific instances such as the Knapsack Problem (KP) and Multi-Objective Knapsack Problem (MOKP). Detailed tabular and file-specific descriptions are provided within each experiment folder (EX01_instance–EX04_Synthetic). For a deep dive into the algorithmic steps, binary encoding, and problem decomposition, please refer to the README_methodology

METHODOLOGICAL OVERVIEW

The dataset was generated using the MCSA (Montecarlo-Clustering Search Algorithm), which combines:

Montecarlo sampling for global exploration Clustering strategies for solution space refinement

Due to its stochastic nature:

Multiple independent executions are required Different random seeds produce different solution trajectories Performance must be evaluated statistically across runs

DATA TYPES INCLUDED

Plain text files (.txt): raw outputs from algorithm executions

CSV files (.csv): structured problem instances and reference solutions, and aggregated experimental data (e.g., MOKP and benchmarks)

Image files (.png): visualizations of results

REPRODUCIBILITY

Each execution in the dataset corresponds to an independent run of the algorithm.

File naming conventions include parameter settings and random seeds.

Raw outputs are provided to allow independent analysis.

Note: Source code is not included due to access restrictions. However, the dataset contains sufficient information to analyze the behavior and results of the algorithm.

USAGE NOTES

Users should begin by exploring the EXPERIMENTS directory (EX01_instance–EX04_Synthetic) for structured and well-documented cases.

The PARALLEL BENCHMARKS directory contains large-scale experimental data and may require additional processing.

The METHODOLOGY folder is intended for interpretability and documentation of the algorithm’s internal behavior.

MORE INFORMATION

Due to the stochastic nature of MCSA, not all executions converge to the optimal solution. Therefore, analysis should consider multiple runs and aggregated performance metrics rather than single execution results. The best solution is obtained by comparing the results across multiple independent executions and selecting the best-performing outcome.

For detailed descriptions of individual experiments, refer to the README files included in each experiment folder.

For a deep dive into the algorithmic steps, binary encoding, and problem decomposition, please refer to the README_methodology

Identifier
DOI	https://doi.org/10.34810/data3157
Related Identifier	IsSupplementTo https://doi.org/10.1109/ACCESS.2024.3354714
Metadata Access	https://dataverse.csuc.cat/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34810/data3157

Provenance
Creator	Harita, Maria ; Wong, Alvaro ; Suppi, Remo ; Rexachs, Dolores (ORCID: 0000-0001-5500-850X); Luque, Emilio
Publisher	CORA.Repositori de Dades de Recerca
Contributor	Harita Rascon, Maria de los Angeles; Universitat Autònoma Barcelona
Publication Year	2026
Funding Reference	https://ror.org/003x0zc53 PID2020-112496GB-I00
Rights	CC BY 4.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/licenses/by/4.0
OpenAccess	true
Contact	Harita Rascon, Maria de los Angeles (Universitat Autònoma de Barcelona)

Representation
Resource Type	Aggregate data; Dataset
Format	text/plain; application/zip
Size	7515; 9312211
Version	1.0
Discipline	Construction Engineering and Architecture; Engineering; Engineering Sciences; Mathematics; Natural Sciences