<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article
  PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.0/JATS-journalpublishing1.dtd">
<article article-type="research-article" dtd-version="1.0" specific-use="sps-1.8" xml:lang="en" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
	<front>
		<journal-meta>
			<journal-id journal-id-type="publisher-id">rfing</journal-id>
			<journal-title-group>
				<journal-title>Revista Facultad de Ingeniería</journal-title>
				<abbrev-journal-title abbrev-type="publisher">Rev. Fac. ing.</abbrev-journal-title>
			</journal-title-group>
			<issn pub-type="ppub">0121-1129</issn>
			<issn pub-type="epub">2357-5328</issn>
			<publisher>
				<publisher-name>Universidad Pedagógica y Tecnológica de Colombia</publisher-name>
			</publisher>
		</journal-meta>
		<article-meta>
			<article-id pub-id-type="doi">10.19053/01211129.v33.n68.2024.18379</article-id>
			<article-id pub-id-type="publisher-id">00002</article-id>
			<article-categories>
				<subj-group subj-group-type="heading">
					<subject>Articles</subject>
				</subj-group>
			</article-categories>
			<title-group>
				<article-title>A systematic review of machine learning-enhanced metaheuristics for solving capacitated vehicle routing problems</article-title>
				<trans-title-group xml:lang="es">
					<trans-title>Revisión sistemática de metaheurísticas mejoradas con aprendizaje automático para resolver problemas de enrutamiento de vehículos con capacidad</trans-title>
				</trans-title-group>
			</title-group>
			<contrib-group>
				<contrib contrib-type="author">
					<contrib-id contrib-id-type="orcid">0000-0002-9872-604X</contrib-id>
					<name>
						<surname>Maca-Chagüendo</surname>
						<given-names>Mauricio</given-names>
					</name>
					<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
				</contrib>
				<contrib contrib-type="author">
					<contrib-id contrib-id-type="orcid">0000-0002-6263-1911</contrib-id>
					<name>
						<surname>Cobos-Lozada</surname>
						<given-names>Carlos-Alberto</given-names>
					</name>
					<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
				</contrib>
			</contrib-group>
			<aff id="aff1">
				<label>1</label>
				<institution content-type="original">Universidad del Cauca (Popayán, Colombia). mmaca@unicauca.edu.co</institution>
				<institution content-type="normalized">Universidad del Cauca</institution>
				<institution content-type="orgname">Universidad del Cauca</institution>
				<addr-line>
					<named-content content-type="city">Popayán</named-content>
				</addr-line>
				<country country="CO">Colombia</country>
				<email>mmaca@unicauca.edu.co</email>
			</aff>
			<aff id="aff2">
				<label>2</label>
				<institution content-type="original"> Universidad del Cauca (Popayán, Colombia). ccobos@unicauca.edu.co</institution>
				<institution content-type="normalized">Universidad del Cauca</institution>
				<institution content-type="orgname">Universidad del Cauca</institution>
				<addr-line>
					<named-content content-type="city">Popayán</named-content>
				</addr-line>
				<country country="CO">Colombia</country>
				<email>ccobos@unicauca.edu.co</email>
			</aff>
			<!--<pub-date date-type="pub" publication-format="electronic">
				<day>02</day>
				<month>06</month>
				<year>2024</year>
			</pub-date>
			<pub-date date-type="collection" publication-format="electronic">
				<season></season>
				<year></year>
			</pub-date>-->
			<pub-date pub-type="epub-ppub">
				<season>Apr-Jun</season>
				<year>2024</year>
			</pub-date>
			<volume>33</volume>
			<issue>68</issue>
			<elocation-id>e18379</elocation-id>
			<history>
				<date date-type="received">
					<day>10</day>
					<month>03</month>
					<year>2024</year>
				</date>
				<date date-type="accepted">
					<day>21</day>
					<month>06</month>
					<year>2024</year>
				</date>
			</history>
			<permissions>
				<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/" xml:lang="en">
					<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution License</license-p>
				</license>
			</permissions>
			<abstract>
				<title>ABSTRACT</title>
				<p>Combinatorial optimization problems are prevalent in various domains, such as logistics, planning, and resource allocation. This study is essential to fields such as applied mathematics and computer science. A prominent example is the Capacitated Vehicle Routing Problem (CVRP), which aims to minimize the route costs for a fleet of vehicles with capacity constraints. Traditionally, this problem has been tackled using exact methods and metaheuristics, but advancements in hybrid algorithms, modern metaheuristics, and machine learning have significantly improved both the solution quality and computational efficiency. This review followed Kitchenham's methodological guidelines, selecting and analyzing articles published between 2019 and 2024 from the Scopus and Web of Science databases. Specific inclusion and quality assessment criteria were applied, synthesizing the information quantitatively and qualitatively and categorizing studies based on their approaches and techniques. The studies were grouped by their focus, addressing research questions on advancements in machine-learning techniques for enhancing metaheuristics, datasets used, and comparison metrics. The findings emphasize the potential of hybrid approaches that combine metaheuristics with machine learning techniques to effectively solve the CVRP. These approaches have demonstrated their ability to improve efficiency and solution quality, which is particularly relevant for large-scale, complex problems. Significant improvements in solution accuracy and considerable reductions in computation times were observed.</p>
			</abstract>
			<trans-abstract xml:lang="es">
				<title>RESUMEN</title>
				<p>Los problemas de optimización combinatoria son muy comunes en diversas áreas como la logística, la planificación y la asignación de recursos. Este estudio es esencial en campos como las matemáticas aplicadas y las ciencias de la computación. Un ejemplo destacado es el problema de enrutamiento de vehículos con capacidad (CVRP), que busca minimizar los costos de las rutas de una flota de vehículos con restricciones de capacidad. Tradicionalmente, este problema se ha abordado mediante métodos exactos y metaheurísticos, pero la evolución de algoritmos híbridos, las metaheurísticas modernas y el aprendizaje automático han mejorado tanto la calidad de las soluciones como la eficiencia computacional. La revisión se realizó siguiendo las directrices metodológicas de Kitchenham. Se seleccionaron y analizaron artículos publicados entre 2019 y 2024 en bases de datos como Scopus y Web of Science. Se aplicaron criterios específicos de inclusión y evaluación de la calidad, sintetizando la información de forma cuantitativa y cualitativa, y categorizando los estudios según sus enfoques y técnicas empleadas. Los estudios se agruparon según su enfoque y se abordaron las preguntas de investigación sobre avances en técnicas de aprendizaje automático para mejorar las metaheurísticas, los conjuntos de datos empleados y las métricas de comparación. Los resultados destacan el potencial de los enfoques híbridos que combinan metaheurísticas con técnicas de aprendizaje automático para resolver el CVRP de manera efectiva. Estos enfoques han demostrado ser capaces de mejorar la eficiencia y la calidad de las soluciones, lo cual es especialmente relevante en problemas de gran escala y complejidad. Se observó una mejora significativa en la precisión de las soluciones y una reducción considerable en los tiempos de cómputo.</p>
			</trans-abstract>
			<kwd-group xml:lang="en">
				<title>Keywords:</title>
				<kwd>Artificial neural networks</kwd>
				<kwd>Capacitated Vehicle Routing Problem (CVRP)</kwd>
				<kwd>machine learning</kwd>
				<kwd>metaheuristics</kwd>
				<kwd>reinforcement learning</kwd>
			</kwd-group>
			<kwd-group xml:lang="es">
				<title>Palabras clave:</title>
				<kwd>Aprendizaje automático</kwd>
				<kwd>aprendizaje por refuerzo</kwd>
				<kwd>metaheurísticas</kwd>
				<kwd>problema de enrutamiento de vehículos con capacidad</kwd>
				<kwd>redes neuronales artificiales</kwd>
			</kwd-group>
			<counts>
				<fig-count count="1"/>
				<table-count count="6"/>
				<equation-count count="0"/>
				<ref-count count="33"/>
				<page-count count="0"/>
			</counts>
		</article-meta>
	</front>
	<body>
		<sec sec-type="intro">
			<title>1. INTRODUCTION</title>
			<p>Combinatorial Optimization Problems (COPs) are foundational across disciplines such as applied mathematics, computer science, operations research, engineering, and economics. They seek to maximize or minimize an objective function over a finite but vast set of solutions, which makes them computationally complex, with many classified as NP-hard [<xref ref-type="bibr" rid="B1">1</xref>]-[<xref ref-type="bibr" rid="B2">2</xref>]. Theoretically, COPs have been studied in complexity theory and areas of mathematics such as graph theory, probability, and optimization, which has enabled the formulation, analysis, and development of methods to find approximate solutions in polynomial time [<xref ref-type="bibr" rid="B3">3</xref>]-[<xref ref-type="bibr" rid="B4">4</xref>].</p>
			<p>In practice, COPs are applied to route optimization, resource allocation, scheduling, and inventory management, which are crucial for improving processes. In this context, the Capacitated Vehicle Routing Problem (CVRP) stands out as one of the most researched and applied problems, as it optimizes distribution routes with capacity constraints and minimizes travel costs [<xref ref-type="bibr" rid="B2">2</xref>]-[<xref ref-type="bibr" rid="B4">4</xref>]. Exact and heuristic methods have traditionally been used to address this problem. However, exact methods, such as integer linear programming and branch-and-bound techniques, are only suitable for small instances. By contrast, heuristic and metaheuristic methods, such as the Clarke-Wright algorithm, tabu search, and variable neighborhood search, are preferred for large-scale problems because they can approximate optimal solutions at a lower computational cost [<xref ref-type="bibr" rid="B5">5</xref>]-[<xref ref-type="bibr" rid="B6">6</xref>].</p>
			<p>The CVRP solutions have recently evolved using advanced artificial intelligence (AI) and machine learning (ML) techniques. Modern metaheuristics, such as Particle Swarm Optimization (PSO) and Ant Colony Optimization (ACO), along with hybrid and memetic techniques, have improved the efficiency of exploring the solution space. Additionally, the integration of neural networks and reinforcement learning (RL) has led to neural solvers capable of learning patterns and strategies, enabling a more efficient resolution of complex CVRP instances in terms of time and solution quality [<xref ref-type="bibr" rid="B7">7</xref>].</p>
			<p>Thirty articles published between 2019 and 2024 were selected from the Web of Science and Scopus scientific databases for this study. The documents were categorized by the type of metaheuristics employed, machine-learning technique applied, and identification of datasets used for evaluation.</p>
			<p>The rest of this paper is organized as follows: Section two summarizes the methodology followed, including the research question, search strategy, selection criteria for primary studies, quality evaluation criteria, data extraction, and synthesis method. Section three presents the results related to metaheuristics and the various machine learning techniques. Finally, section four presents the conclusions of the systematic review.</p>
		</sec>
		<sec sec-type="methods">
			<title>2. METHODOLOGY</title>
			<p>This systematic review was conducted following the methodological guidelines established in Kitchenham and Brereton [<xref ref-type="bibr" rid="B8">8</xref>], given their rigor and widespread use in various fields of knowledge. This methodology consists of three main phases: planning, execution of the review, and dissemination of results.</p>
			<sec>
				<title><italic>A. Research Question</italic></title>
				<p>This work aims to select, analyze, and synthesize documents that address approaches, techniques, and models for solving the CVRP, determining recent advancements that enhance metaheuristics with machine learning techniques. <xref ref-type="table" rid="t1">Table 1</xref> shows the research questions formulated to guide the collection and categorization of information, identifying the most promising and notable proposals to date.</p>
				<p>
					<table-wrap id="t1">
						<label>Table 1</label>
						<caption>
							<title>Research questions</title>
						</caption>
						<table frame="hsides" rules="groups">
							<colgroup>
								<col/>
								<col/>
							</colgroup>
							<thead>
								<tr>
									<th align="left">Research Questions</th>
									<th align="left">Motivation of the Question</th>
								</tr>
							</thead>
							<tbody>
								<tr>
									<td align="left">Q1. What machine learning techniques are used, how do they operate, and what stage do they support in a metaheuristic algorithm to solve the CVRP problems?</td>
									<td align="left">This question aims to identify and understand the specific machine learning techniques used to enhance metaheuristic algorithms for the CVRP and their role within these algorithms.</td>
								</tr>
								<tr>
									<td align="left">Q2. How does the metaheuristic algorithm use the knowledge obtained from the machine learning technique to solve the CVRP problems?</td>
									<td align="left">This question aims to identify how metaheuristic algorithms integrate and apply the knowledge generated by machine learning techniques to improve the CVRP resolution.</td>
								</tr>
								<tr>
									<td align="left">Q3. What CVRP datasets are used to evaluate and compare the performance of the algorithms?</td>
									<td align="left">The aim is to identify the datasets/problems used in the reviewed studies, which is crucial for understanding the validity and comparability of results across different approaches.</td>
								</tr>
								<tr>
									<td align="left">Q4. What metrics are used to evaluate and compare the performance of the algorithms?</td>
									<td align="left">This question seeks to identify the specific metrics used to evaluate and compare the effectiveness and efficiency of algorithms enhanced by machine learning techniques in solving the CVRP.</td>
								</tr>
							</tbody>
						</table>
					</table-wrap>
				</p>
			</sec>
			<sec>
				<title><italic>B. Search Strategy</italic></title>
				<p>The article search was conducted using Scopus and Web of Science. <xref ref-type="table" rid="t2">Table 2</xref> details the search string, which was iteratively refined after analyzing the preliminary results. Based on the PICOC method (Population, Intervention, Comparison, Outcomes, Context) [<xref ref-type="bibr" rid="B9">9</xref>], the search string was structured with terms combined by conjunction and applied to titles, abstracts, and keywords, using advanced search options to optimize the results.</p>
				<p>
					<table-wrap id="t2">
						<label>Table 2</label>
						<caption>
							<title>Advanced Search Strings</title>
						</caption>
						<table frame="hsides" rules="groups">
							<colgroup>
								<col/>
								<col/>
								<col/>
							</colgroup>
							<thead>
								<tr>
									<th align="left">Source</th>
									<th align="center">Search String</th>
									<th align="center">Articles</th>
								</tr>
							</thead>
							<tbody>
								<tr>
									<td align="left">Scopus</td>
									<td align="left">TITLE-ABS-KEY = ( ( metaheuristic OR heuristic OR genetic OR swarm OR “optimization algorithm” OR “Evolutionary computation” OR “memetic algorithm” OR “differential evolution” OR “hybrid metaheuristic” OR “Combinatorial optimization” ) AND ( “capacitated vehicle routing problem” OR CVRP ) AND ( “aprendizaje de máquina” OR “artificial intelligence” OR “deep learning” OR “learnable evolution model” OR LEM OR “data mining” OR “reinforcement learning”)) AND PUBYEAR &gt; 2018 AND ( LANGUAGE ( english ) OR LANGUAGE ( spanish ) ) AND DOCTYPE ( “cp” OR “ar” )</td>
									<td align="center">31</td>
								</tr>
								<tr>
									<td align="left">Web of Science</td>
									<td align="left">TS = (( metaheuristic OR heuristic OR genetic OR swarm OR “optimization algorithm” OR “Evolutionary computation” OR “memetic algorithm” OR “differential evolution” OR “hybrid metaheuristic” OR “Combinatorial optimization” ) AND ( “capacitated vehicle routing problem” OR CVRP ) AND ( “aprendizaje de máquina” OR “artificial intelligence” OR “deep learning” OR “learnable evolution model” OR LEM OR “data mining” OR “reinforcement learning” )) AND PY=(2019-2024)</td>
									<td align="center">22</td>
								</tr>
							</tbody>
						</table>
					</table-wrap>
				</p>
			</sec>
			<sec>
				<title><italic>C. Selection Criteria for Primary Studies</italic></title>
				<p>Specific inclusion criteria were applied, focusing on the titles, abstracts, and keywords. The selected studies had to meet the following inclusion criteria: 1) be journal articles or conference papers, 2) be written in English or Spanish, 3) have been published between 2019 and 2024, 4) address at least one of the formulated research questions, and 5) be freely accessible or available through the University of Cauca's library. The selected studies were evaluated after removing duplicate documents and applying these selection criteria.</p>
			</sec>
			<sec>
				<title><italic>D. Quality Evaluation Criteria</italic></title>
				<p>
					<xref ref-type="table" rid="t3">Table 3</xref> details the six quality criteria established to evaluate the primary studies using a three-point scoring system (0, 0.5, 1). In this system, a score of zero (0) indicates that the article does not meet the quality criterion, a score of zero point five (0.5) indicates partial fulfillment of the criterion, and a score of one (1) indicates full compliance. The studies received a minimum score of 0 and maximum score of 6. The evaluation was conducted using the full texts, and the answers to the research questions allowed for classifying the studies according to their approach and content, thus providing an overview of the state of knowledge in the field.</p>
				<p>
					<table-wrap id="t3">
						<label>Table 3</label>
						<caption>
							<title>Quality Criteria</title>
						</caption>
						<table frame="hsides" rules="groups">
							<colgroup>
								<col/>
								<col/>
								<col/>
								<col/>
								<col/>
							</colgroup>
							<thead>
								<tr>
									<th align="left" rowspan="2">Criterion</th>
									<th align="left" rowspan="2">Description</th>
									<th align="center"> </th>
									<th align="center">Score</th>
									<th align="center"> </th>
								</tr>
								<tr>
									<th align="center">0</th>
									<th align="center">0.5</th>
									<th align="center">1</th>
								</tr>
							</thead>
							<tbody>
								<tr>
									<td align="center">C1</td>
									<td align="left">The study integrates metaheuristic approaches with machine learning techniques.</td>
									<td align="center">No</td>
									<td align="center">Partially</td>
									<td align="center">Yes</td>
								</tr>
								<tr>
									<td align="center">C2</td>
									<td align="left">Other authors have cited the article.</td>
									<td align="center">No</td>
									<td align="center">Partially</td>
									<td align="center">Yes</td>
								</tr>
								<tr>
									<td align="center">C3</td>
									<td align="left">The article describes the algorithms and their interaction to solve the problem in detail.</td>
									<td align="center">No</td>
									<td align="center">Partially</td>
									<td align="center">Yes</td>
								</tr>
								<tr>
									<td align="center">C4</td>
									<td align="left">The experimental design for evaluation and comparison allows the experiment to be replicated.</td>
									<td align="center">No</td>
									<td align="center">Partially</td>
									<td align="center">Yes</td>
								</tr>
								<tr>
									<td align="center">C5</td>
									<td align="left">The article presents in detail the results of the experimentation.</td>
									<td align="center">No</td>
									<td align="center">Partially</td>
									<td align="center">Yes</td>
								</tr>
								<tr>
									<td align="center">C6</td>
									<td align="left">The article presents coherence between the proposal, the experimentation, the discussion, and the conclusions.</td>
									<td align="center">No</td>
									<td align="center">Partially</td>
									<td align="center">Yes</td>
								</tr>
							</tbody>
						</table>
					</table-wrap>
				</p>
			</sec>
			<sec>
				<title><italic>E. Synthesis Methods and Systematic Review Timeline</italic></title>
				<p>The synthesis was conducted using both quantitative and qualitative data. The studies were organized into a table by year, reference, approach, techniques, repositories, and quality criteria. The studies were grouped into three main categories, and within each, a summary of the central primary studies found is presented by year. Data collection for this study was conducted between May and July 2024.</p>
			</sec>
		</sec>
		<sec sec-type="results|discussion">
			<title>3. RESULTS AND DISCUSSION</title>
			<p>A total of 53 documents were found through Web of Science and Scopus. After applying the selection criteria, 32 relevant studies were identified after applying selection criteria. Subsequently, two articles were discarded after the quality evaluation, leaving 30 studies, which were classified into three categories: a) results related to metaheuristics (see <xref ref-type="table" rid="t4">Table 4</xref>), b) results related to reinforcement learning (see <xref ref-type="table" rid="t5">Table 5</xref>), and c) results related to machine learning (see <xref ref-type="table" rid="t6">Table 6</xref>). The following subsections describe each group of studies and highlight the main elements and quality evaluation results.</p>
			<p>
				<table-wrap id="t4">
					<label><bold>Table 4</bold></label>
					<caption>
						<title>Results Related to Metaheuristics</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup>
							<col/>
							<col/>
							<col/>
							<col/>
							<col span="6"/>
							<col/>
						</colgroup>
						<thead>
							<tr>
								<th align="center">Year</th>
								<th align="center">Reference</th>
								<th align="left">Techniques</th>
								<th align="left">Repositories</th>
								<th align="center" colspan="6">Quality Criteria </th>
								<th align="center">Score </th>
							</tr>
							<tr>
								<th align="center"> </th>
								<th align="center"> </th>
								<th align="left"> </th>
								<th align="left"> </th>
								<th align="center">C<sub>1</sub></th>
								<th align="center">C<sub>2</sub></th>
								<th align="center">C<sub>3</sub></th>
								<th align="center">C<sub>4</sub></th>
								<th align="center">C<sub>5</sub></th>
								<th align="center">C<sub>6</sub></th>
								<th align="center"> </th>
							</tr>
						</thead>
						<tbody>
							<tr>
								<td align="center">2019</td>
								<td align="center">[<xref ref-type="bibr" rid="B1">1</xref>]</td>
								<td align="left">FA</td>
								<td align="left">CVRPLIB</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2019</td>
								<td align="center">[<xref ref-type="bibr" rid="B10">10</xref>]</td>
								<td align="left">FWA</td>
								<td align="left">CVRPLIB</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2020</td>
								<td align="center">[<xref ref-type="bibr" rid="B11">11</xref>]</td>
								<td align="left">ACO, VNS</td>
								<td align="left">Generated</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2021</td>
								<td align="center">[<xref ref-type="bibr" rid="B4">4</xref>]</td>
								<td align="left">MAS, SA, PathR</td>
								<td align="left">CVRPLIB</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2021</td>
								<td align="center">[<xref ref-type="bibr" rid="B2">2</xref>]</td>
								<td align="left">FA, Parallel</td>
								<td align="left">CVRPLIB</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2022</td>
								<td align="center">[<xref ref-type="bibr" rid="B12">12</xref>]</td>
								<td align="left">BACO, OS-MMAS, RH</td>
								<td align="left">WCCI2020 (IEEE)</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2022</td>
								<td align="center">[<xref ref-type="bibr" rid="B5">5</xref>]</td>
								<td align="left">ML, ACO, ASGD</td>
								<td align="left">CVRPLIB TSPLIB</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">5.5</td>
							</tr>
							<tr>
								<td align="center">2022</td>
								<td align="center">[<xref ref-type="bibr" rid="B7">7</xref>]</td>
								<td align="left">ML, LNS, DNNs, AM SplitILS</td>
								<td align="left">CVRP SDVRP CTOP</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">5.0</td>
							</tr>
							<tr>
								<td align="center">2022</td>
								<td align="center">[<xref ref-type="bibr" rid="B3">3</xref>]</td>
								<td align="left">ACO, DSR, LNS</td>
								<td align="left">CVRP</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2023</td>
								<td align="center">[<xref ref-type="bibr" rid="B6">6</xref>]</td>
								<td align="left">RL, GVNS, MAB, UHGS</td>
								<td align="left">CVRPLIB</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">5.5</td>
							</tr>
							<tr>
								<td align="center">2023</td>
								<td align="center">[<xref ref-type="bibr" rid="B13">13</xref>]</td>
								<td align="left">ML, K-Mean, K-Medoids, ACO</td>
								<td align="left">CVRPLIB</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">5.5</td>
							</tr>
						</tbody>
					</table>
				</table-wrap>
			</p>
			<p>
				<table-wrap id="t5">
					<label>Table 5</label>
					<caption>
						<title>Results related to Reinforcement Learning</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup>
							<col/>
							<col/>
							<col/>
							<col/>
							<col span="6"/>
							<col/>
						</colgroup>
						<thead>
							<tr>
								<th align="center" rowspan="2">Year</th>
								<th align="center" rowspan="2">Reference</th>
								<th align="center" rowspan="2">Techniques</th>
								<th align="center" rowspan="2">Repositories</th>
								<th align="center" colspan="6">Quality Criteria</th>
								<th align="center" rowspan="2">Score </th>
							</tr>
							<tr>
								<th align="center">C<sub>1</sub></th>
								<th align="center">C<sub>2</sub></th>
								<th align="center">C<sub>3</sub></th>
								<th align="center">C<sub>4</sub></th>
								<th align="center">C<sub>5</sub></th>
								<th align="center">C<sub>6</sub></th>
							</tr>
						</thead>
						<tbody>
							<tr>
								<td align="center">2021</td>
								<td align="center">[<xref ref-type="bibr" rid="B14">14</xref>]</td>
								<td align="center">DRL, ERRL</td>
								<td align="center">Nazari TSP-CVRP MRPFF</td>
								<td align="center">0</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4</td>
							</tr>
							<tr>
								<td align="center">2022</td>
								<td align="center">[<xref ref-type="bibr" rid="B15">15</xref>]</td>
								<td align="center">DRL AM</td>
								<td align="center">Generated CVRPLIB</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">5.0</td>
							</tr>
							<tr>
								<td align="center">2022</td>
								<td align="center">[<xref ref-type="bibr" rid="B16">16</xref>]</td>
								<td align="center">DRL AM</td>
								<td align="center">Generated</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">5.0</td>
							</tr>
							<tr>
								<td align="center">2022</td>
								<td align="center">[<xref ref-type="bibr" rid="B17">17</xref>]</td>
								<td align="center">DRL, MA</td>
								<td align="center">Generated</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2022</td>
								<td align="center">[<xref ref-type="bibr" rid="B18">18</xref>]</td>
								<td align="center">DRL GATGANN</td>
								<td align="center">Generated TSPLIB, CVRPLIB</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2022</td>
								<td align="center">[<xref ref-type="bibr" rid="B19">19</xref>]</td>
								<td align="center">QL</td>
								<td align="center">Real instances SBRP</td>
								<td align="center">0.5</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2023</td>
								<td align="center">[<xref ref-type="bibr" rid="B20">20</xref>]</td>
								<td align="center">GSF, QL, DQL, PPO</td>
								<td align="center">SOLOMON</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2023</td>
								<td align="center">[<xref ref-type="bibr" rid="B21">21</xref>]</td>
								<td align="center">CISA, PPO, GA, MDP</td>
								<td align="center">TSPLIB, CVRPLIB, BPP</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">5.0</td>
							</tr>
							<tr>
								<td align="center">2023</td>
								<td align="center">[<xref ref-type="bibr" rid="B22">22</xref>]</td>
								<td align="center">MA, LSTM</td>
								<td align="center">Generated Amsterdam</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2023</td>
								<td align="center">[<xref ref-type="bibr" rid="B23">23</xref>]</td>
								<td align="center">PPO</td>
								<td align="center">SOLOMON</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">5.0</td>
							</tr>
							<tr>
								<td align="center">2023</td>
								<td align="center">[<xref ref-type="bibr" rid="B24">24</xref>]</td>
								<td align="center">PPO, DQN</td>
								<td align="center">SOLOMON</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">5.0</td>
							</tr>
							<tr>
								<td align="center">2024</td>
								<td align="center">[<xref ref-type="bibr" rid="B25">25</xref>]</td>
								<td align="center">DRL, MSTOP</td>
								<td align="center">CVRLIB TSP</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
						</tbody>
					</table>
				</table-wrap>
			</p>
			<p>
				<table-wrap id="t6">
					<label><bold>Table 6</bold></label>
					<caption>
						<title>Results Related to Machine Learning</title>
					</caption>
					<table frame="hsides" rules="groups">
						<colgroup>
							<col/>
							<col/>
							<col/>
							<col/>
							<col span="6"/>
							<col/>
						</colgroup>
						<thead>
							<tr>
								<th align="center" rowspan="2">Year</th>
								<th align="center" rowspan="2">Reference</th>
								<th align="center" rowspan="2">Techniques</th>
								<th align="center" rowspan="2">Repositories</th>
								<th align="center" colspan="6">Quality Criteria</th>
								<th align="center" rowspan="2">Score</th>
							</tr>
							<tr>
								<th align="center">C<sub>1</sub></th>
								<th align="center">C<sub>2</sub></th>
								<th align="center">C<sub>3</sub></th>
								<th align="center">C<sub>4</sub></th>
								<th align="center">C<sub>5</sub></th>
								<th align="center">C<sub>6</sub></th>
							</tr>
						</thead>
						<tbody>
							<tr>
								<td align="center">2021</td>
								<td align="center">[<xref ref-type="bibr" rid="B27">27</xref>]</td>
								<td align="left">Random forest classifiers, </td>
								<td align="left">Salomon Gehring &amp; Homberger</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2022</td>
								<td align="center">[<xref ref-type="bibr" rid="B28">28</xref>]</td>
								<td align="left">ANN, NCO, AM, TL</td>
								<td align="left">Generated</td>
								<td align="center">0</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">3.5</td>
							</tr>
							<tr>
								<td align="center">2023</td>
								<td align="center">[<xref ref-type="bibr" rid="B29">29</xref>]</td>
								<td align="left">K-Means, NN B&amp;C</td>
								<td align="left">Generated</td>
								<td align="center">0</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">3.5</td>
							</tr>
							<tr>
								<td align="center">2023</td>
								<td align="center">[<xref ref-type="bibr" rid="B30">30</xref>]</td>
								<td align="left">Beam Search, GCN</td>
								<td align="left">Generated SOLOMON</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.5</td>
							</tr>
							<tr>
								<td align="center">2023</td>
								<td align="center">[<xref ref-type="bibr" rid="B31">31</xref>]</td>
								<td align="left">RAC, Gray-Box</td>
								<td align="left">Generated XECVRP</td>
								<td align="center">0</td>
								<td align="center">0</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">3.5</td>
							</tr>
							<tr>
								<td align="center">2024</td>
								<td align="center">[<xref ref-type="bibr" rid="B32">32</xref>]</td>
								<td align="left">RL, ASP, PSA, DE</td>
								<td align="left">TSPLIB, CVRPLIB, SDVRP, Generated. Real world</td>
								<td align="center">0</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.0</td>
							</tr>
							<tr>
								<td align="center">2024</td>
								<td align="center">[<xref ref-type="bibr" rid="B33">33</xref>]</td>
								<td align="left">SLM</td>
								<td align="left">CVRPLIB</td>
								<td align="center">0</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">0.5</td>
								<td align="center">1</td>
								<td align="center">1</td>
								<td align="center">4.0</td>
							</tr>
						</tbody>
					</table>
				</table-wrap>
			</p>
			<sec>
				<title><italic>A. Results Related to Metaheuristics</italic></title>
				<p>The CVRP-FA algorithm was introduced in 2019 as an improved variant of the Firefly Algorithm (FA) for the CVRP. This algorithm enhances the convergence and solution quality by integrating the local search (2-Opt and 2-h-opt) and genetic operators. Evaluated on 82 instances from the CVRPLIB (<ext-link ext-link-type="uri" xlink:href="http://vrp.galgos.inf.puc-rio.br/index.php/en/">http://vrp.galgos.inf.puc-rio.br/index.php/en/</ext-link>), the CVRP-FA demonstrated faster convergence and greater precision than previous FA versions, achieving its effectiveness through parameter tuning using the Taguchi method [<xref ref-type="bibr" rid="B1">1</xref>].</p>
				<p>In 2019, an improved algorithm based on the Fireworks Algorithm (FWA) was introduced to solve the CVRP. The FWA generates new solutions through explosions and mutations, thereby improving route quality. Evaluated on the P and G instances from the CVRPLIB, it was compared with algorithms like Active-Guided Evolution Strategies (AGES), Greedy Randomized Evolutionary Local Search (GRELS), and Hybrid Genetic Particle Swarm Optimization (HGPSO), demonstrating superior solution quality with minimal deviation from benchmark solutions and faster execution times [<xref ref-type="bibr" rid="B10">10</xref>].</p>
				<p>In 2020, researchers introduced a hybrid algorithm that integrates ACO with Variable Neighborhood Search (VNS) to address the CVRP in scenarios focused on equitable distribution route scheduling for emergency aid. The approach generates initial solutions with ACO and refines them with VNS, optimizing the waiting times. Compared to ACO with 2-opt, GA, and CPLEX, it improved equity and solution quality. The test data randomly generated based on previous studies allowed the evaluation of its effectiveness in various scenarios [<xref ref-type="bibr" rid="B11">11</xref>].</p>
				<p>In 2021, the hybrid algorithm MAS-SA-PR was proposed, integrating a Modified Ant System (MAS), Sweep Algorithm (SA), and Path Relinking (PathR) to solve the CVRP. The algorithm converts customer locations to polar coordinates, initializes routes, and uses local search techniques to improve solutions. The MAS-SA-PR demonstrated competitive results on A, B, and P instances from the CVRPLIB compared to the Large Neighborhood Search-Ant Colony Optimization (LNS-ACO), Ranking-Based Monte Carlo-Artificial Bee Colony (RBMC-ABC), and the CVRP-FA algorithms. However, the lack of a formal statistical analysis limits the validity of its conclusions [<xref ref-type="bibr" rid="B4">4</xref>].</p>
				<p>In 2021, the authors continued their previous work [<xref ref-type="bibr" rid="B1">1</xref>] by proposing a collaborative approach to solve the CVRP called the Cooperative Firefly Algorithm (CHFA). First, they designed a new hybrid FA (HFA) by modifying the local search and mutation from the CVRP-FA. Then, they created a parallel cooperative model with multiple HFAs, where firefly populations periodically share solutions to improve convergence and avoid stagnation. Evaluated on 108 instances from the CVRPLIB, the algorithm demonstrated improvements in solution quality and computational speed compared to the CVRP-FA, LNS-ACO, HA, and Discrete Invasive Weed Optimization (DIWO) algorithms, and performed comparably to the Improved Symbiotic Organism Search (ISOS) and Gravitational Emulation Local Search (CVRP_GELS) algorithms [<xref ref-type="bibr" rid="B2">2</xref>].</p>
				<p>In 2022, BACO (Bilevel ACO), an algorithm based on ACO, was introduced to solve the Capacitated Electric Vehicle Routing Problem (CEVRP). The algorithm operates on two levels: the upper level generates solutions for the CVRP subproblem, and the lower level, with routes that meet capacity constraints, solves the Fixed Route Vehicle Charging Problem (FRVCP). Evaluated in the IEEE WCCI2020 competition, BACO updated the best solutions in seven out of ten large instances and showed fast convergence compared to algorithms like Iterative Local Search (ILS) and a single-level Max-Min Ant System (SLMMAS) [<xref ref-type="bibr" rid="B12">12</xref>].</p>
				<p>In the same year (2022), ADACO was introduced, which integrates Adaptive Stochastic Gradient Descent (ASGD) into ACO to improve the CVRP solution quality. ADACO dynamically updates the pheromone values, improving convergence and avoiding premature convergence. Evaluated on instances from the TSPLIB and CVRPLIB, the algorithm outperformed the Ant Colony System (ACS), the Max-Min Ant System (MMAS), and the ILS regarding solution precision and convergence. The Wilcoxon test confirmed that the results were statistically significant in most cases [<xref ref-type="bibr" rid="B5">5</xref>].</p>
				<p>In 2022, a Neural Large Neighborhood Search (NLNS) was introduced, combining a Large Neighborhood Search (LNS) with deep neural networks and Attention Mechanisms (AM) to repair incomplete CVRP solutions. Evaluated on various instances, it performed similarly to LKH3, with lower-cost solutions in several cases but did not surpass UHGS. For the SDVRP, SplitILS outperformed NLNS on smaller instances but outperformed it on larger ones. The results of the Capacitated Team Orienteering Problem (CTOP) were comparable to those achieved by the most advanced approaches. The study was initially presented in [<xref ref-type="bibr" rid="B10">10</xref>] and expanded in [<xref ref-type="bibr" rid="B7">7</xref>].</p>
				<p>Additionally, in 2022, an improved variant of ACO called DSRACO was introduced to solve the CVRP, which incorporates methods like Dynamic Space Reduction (DSR), an elitist mechanism, and large-scale neighborhood search operators. This approach improves the solution quality and prevents the algorithm from becoming stuck in the local optima. The evaluation of 73 CVRPLIB instances showed that DSRACO improves the solution efficiency and quality compared to ACO and competes with LNS-ACO, CVRP-FA, and the Order-aware Hybrid Genetic Algorithm (OHGA) [<xref ref-type="bibr" rid="B3">3</xref>].</p>
				<p>In 2023, the General Variable Neighborhood Search (GVNS) metaheuristic was enhanced by integrating Multi-Armed Bandit (MAB) algorithms and variations of the Upper Confidence Bound (UCB) algorithm to optimize neighborhood selection, creating a scheme called Bandit VNS. Evaluated on CVRPLIB sets, Bandit VNS improved both solution quality and execution speed, demonstrating that combining reinforcement learning and metaheuristics is a promising strategy for solving routing problems [<xref ref-type="bibr" rid="B6">6</xref>].</p>
				<p>In 2023, a hybrid algorithm combining clustering techniques with ACO was introduced to solve the Energy Minimizing Vehicle Routing Problem (EMVRP). K-means and M-medoids were applied to improve ACO efficiency by reducing the complexity of the problem. Four hybrid algorithms were designed: Free Ant + K-Means and Free Ant + K-Medoids, which explored multiple clusters, and Restricted Ant + K-Means and Restricted Ant + K-Medoids, limited to routing within preformed clusters. Evaluated on CVRPLIB instances, Free Ant + K-Means excelled in solution quality. Nevertheless, they required more computation time, while the Restricted Ant algorithms were more efficient and suited for scenarios with strict time limits [<xref ref-type="bibr" rid="B13">13</xref>].</p>
				<p>Among the results related to metaheuristics for solving the CVRP, it was found that some have been improved by integrating machine learning techniques. For example, the LNS metaheuristic was improved by guiding the repair of incomplete solutions by combining a Deep Neural Network (DNN) and AM techniques [7]. The ASGD was integrated into the ACO metaheuristic to improve solution quality [<xref ref-type="bibr" rid="B5">5</xref>]. This metaheuristic was also enhanced by integrating clustering techniques like M-Means and K-Medoids to improve efficiency [<xref ref-type="bibr" rid="B13">13</xref>]. Finally, the GVNS metaheuristic was strengthened using RL, applying the MAB approach and UCB implementation to optimize neighborhood selection [<xref ref-type="bibr" rid="B6">6</xref>].</p>
			</sec>
			<sec>
				<title><italic>B. Results Related to Reinforcement Learning</italic></title>
				<p>In 2021, a method based on Deep Reinforcement Learning (DRL) was introduced to solve TSP, CVRP, and Multiple Routing with Fixed Fleet Problems (MRPFF). The ERRL technique (Entropy Regularised Reinforcement Learning) includes an entropy term in the policy loss function to encourage exploration and avoid convergence to suboptimal policies. Evaluated using Nazari's instances for the TSP and the CVRP and generated instances for MRPFF, ERRL showed improvements in solution quality and accelerated optimization compared to other approaches [<xref ref-type="bibr" rid="B14">14</xref>].</p>
				<p>In 2022, a deep reinforcement learning architecture was combined with an attention mechanism to solve the Heterogeneous CVRP (HCVRP). The network policy includes a vehicle selection decoder, a node selection decoder, and an encoder to process the problem's features. The authors presented the MM-HCVRP and MS-HCVRP formulations for the min-max and min-sum objectives. For the experimentation, two fleets were generated: the first with three vehicles with capacities of 20, 25, and 30, and the second with five vehicles with capacities of 20, 25, 30, 35, and 40. The software is available at <ext-link ext-link-type="uri" xlink:href="https://github.com/jingwenli0312/HCVRP_DRL">https://github.com/jingwenli0312/HCVRP_DRL</ext-link>. The results were competitive compared to traditional methods [<xref ref-type="bibr" rid="B15">15</xref>].</p>
				<p>In 2022, a proposal focused on learning the solution improvement process instead of the solution construction. It uses an RL-based approach to learn policies that guide local search by employing a neural architecture with Self-Attention Mechanisms (Self AM) to parameterize the learning policy. Randomly generated instances of the CVRP and TSP were used, and the evaluation showed competitive results compared to state-of-the-art methods. The advantage of this approach is that it automatically designs a policy, eliminating the need for manual design, which requires substantial domain knowledge [<xref ref-type="bibr" rid="B16">16</xref>].</p>
				<p>In the same year (2022), DRL with a Multiple Relational Attention Mechanism (MRAM) was applied to improve CVRP solutions. This framework uses relational attention mechanisms to capture the state transition dynamics and enhance node selection in route construction. The model manages state representations by adding relational structures used in decision-making to optimize routes. Evaluations on randomly generated instances for the TSP, CVRP, and SDVRP showed improved solution quality and computational efficiency compared with algorithms such as Gurobi, LKH, and the attention model AM [<xref ref-type="bibr" rid="B17">17</xref>].</p>
				<p>Continuing in 2022, a Residual Edge-Graph Attention Neural Network (Residual E-GAT) was applied within a DRL framework to improve CVRP solutions. This approach builds on a Graph Attention Network (GAT), considering edge information and residual connections between layers, and optimizing route construction through a decoder that selects nodes based on model-generated probabilities. Experiments were conducted using up to 100 randomly generated nodes from the TSPLIB and CVRPLIB libraries. E-GAT's results surpassed algorithms like PtrNet, GAT, and the attention model AM in terms of optimality and computational efficiency, excelling in generalization and large-scale problem solving [<xref ref-type="bibr" rid="B18">18</xref>].</p>
				<p>In 2022, a Q-learning-based approach was introduced to solve the School Bus Routing Problem (SBRP), a variant of the CVRP. This approach uses a reinforcement learning model to create the HHQL algorithm, which dynamically selects the best low-level heuristics, such as one-point move, two-point swap, 2-opt, cross, or-opt, two-edge swap, and ruin-and-recreate, based on the accumulated rewards. The algorithm was compared to other heuristic-based approaches, showing that the combination of reinforcement learning and hyper-heuristic selection offers better results in terms of the total distance traveled and the number of routes [<xref ref-type="bibr" rid="B19">19</xref>].</p>
				<p>In 2023, a General Search Framework (GSF) was introduced, automating the design of search algorithms using reinforcement learning techniques such as Q-learning, Deep Q-Network (DQN), and Proximal Policy Optimization (PPO) to solve the CVRP with Time Windows (CVRPTW). The Actor-Critic with Entropy (ACE) method was introduced to balance exploration and exploitation in learning. The research focused on learning evolutionary operators, heuristic evolution, and the simultaneous learning of these approaches. Instances from the Solomon dataset were evaluated using three ACE models: ACE_FS, ACE_NLAS, and ACE_LAS, each with a different entropy coefficient adjustment scheme. The results showed that GSF with maximum entropy is effective in algorithm design and generalizes well to CVRPTW [<xref ref-type="bibr" rid="B20">20</xref>].</p>
				<p>In 2023, combining reinforcement learning with genetic algorithms (GA) showed promise for combinatorial optimization problems. The NeuroCrossover algorithm, based on RL, PPO, and the Cross Information Synergistic Attention (CISA) model, intelligently selects the genetic locus for configuring the crossover operator parameters in GA. The research was used to order two-point crossover operators on instances of the CVRP, the TSP, and the Bin Packing Problem (BPP) from the TSPLIB and CVRPLIB libraries. Experimental results showed that NeuroCrossover improves solution quality, convergence rate, and generalization [<xref ref-type="bibr" rid="B21">21</xref>].</p>
				<p>In addition, in 2023, a hybrid model based on Deep Reinforcement Learning (DRL) was introduced, integrating an attention-based encoder and an LSTM decoder to optimize route coordination in the Traveling Salesman Problem with Drone (TSP-D). It was assumed that the drone is twice as fast as the truck, and the hybrid model's performance was compared with that of the AM model and the TSP-ep-all. The evaluation used datasets with randomly generated and real-world locations from Amsterdam. The results showed improvements in the solution quality and convergence speed, confirming the effectiveness of the hybrid model compared to the reference methods [<xref ref-type="bibr" rid="B22">22</xref>].</p>
				<p>In the same year (2023), RL based on Proximal Policy Optimization (PPO) was used to automate the design of algorithms for solving the CVRPTW. This approach introduces two groups of features: search-dependent and instance-dependent, optimizing relevant search space information for algorithm design. To solve problem instances, the method uses RL to automatically generate search algorithms in five stages, three related to states, actions, and reward schemes. The evaluation with instances from the Solomon library, using features such as vehicle capacity, time window density, and customer distribution, showed improvements in the number of vehicles and total distance traveled compared to traditional methods [<xref ref-type="bibr" rid="B23">23</xref>].</p>
				<p>Another 2023 study focused on the automatic design of metaheuristics using RL to solve the CVRPTW. This paper proposes the first General Search Framework (GSF) with a general search structure that allows the formulation and analysis of metaheuristics, facilitating the automatic composition of algorithms using DQN networks and the PPO policy optimization method. These techniques select effective combinations of evolutionary operators at distinct stages of the optimization process to improve metaheuristic performance. The evaluation conducted on CVRPTW instances from the Solomon library shows that the proposed model outperforms manually designed algorithms, highlighting the effectiveness and generalization of the RL-based approach [<xref ref-type="bibr" rid="B24">24</xref>].</p>
				<p>In 2024, the Deep Dynamic Transformer Model (DDTM) was introduced to solve the Multi-start Team Orienteering Problem (MSTOP) in mission replanning for Unmanned Aerial Systems (UAS). It uses deep reinforcement learning with a modified version of the REINFORCE algorithm to improve learning efficiency. It employs a self-attention policy network for each part of the route, with attention paid between the partial route and the remaining nodes. The experiments demonstrate that DDTM outperforms methods such as the Policy Optimization with Multiple Optima (POMO), offering better results on classic problems like the CVRP and the TSP, and demonstrates its potential in complex mission replanning scenarios [<xref ref-type="bibr" rid="B25">25</xref>].</p>
			</sec>
			<sec>
				<title><italic>C. Results Related to Machine Learning</italic></title>
				<p>In 2021, a branch-and-price framework based on ML was introduced to solve VRP variants, such as SVRPWT (Sampled VRPTW) and SCVRP (Sampled CVRP), where customers are a random subset. Random forest and regression forests were used to predict variables and branching scores, integrating node and variable selection policies into the branch-and-price algorithm. Salomon and Gehring &amp; Homberger data, with customers randomly distributed, clustered, or mixed, were evaluated. The results showed reduced search tree sizes and shorter execution times compared to traditional algorithms [<xref ref-type="bibr" rid="B27">27</xref>].</p>
				<p>In 2022, transfer learning (TL) was applied to neural combinatorial optimization to solve the CVRP. Using a model pre-trained for TSP, TL accelerated learning and improved the results for the CVRP. The model was trained on TSP and then adapted to the CVRP, enhancing the efficiency and quality of the solutions. Experiments showed that models using TL converged faster and achieved shorter routes than those trained from scratch [<xref ref-type="bibr" rid="B28">28</xref>].</p>
				<p>In 2023, ML was combined with optimization algorithms to solve the CVRP problem, using K-means to group customers, and then applying the nearest neighbor heuristic (NN) and branch-and-cut technique (B&amp;C). K-means clusters customers based on location and vehicle capacity, reducing complexity and improving routing efficiency. The NN and B&amp;C then determine the best route within each cluster. The evaluation of artificially generated instances showed that K-means combined with B&amp;C is more efficient than NN regarding total distance and computation time [<xref ref-type="bibr" rid="B29">29</xref>].</p>
				<p>In 2023, an approach was proposed to solve the CVRPTW using a Graph Convolutional Network (GCN) and beam search, formulated as Quadratic Unconstrained Binary Optimization (QUBO) assisted by quantum computing. Randomly generated instances based on Solomon's data were used. The evaluation showed that this approach achieved results similar to LKH3 and Google OR-Tools for small and medium-sized instances, but was outperformed on larger instances [<xref ref-type="bibr" rid="B30">30</xref>].</p>
				<p>Another 2023 proposal introduced a Real-time Algorithm Configuration (RAC) approach using a gray-box model that combines cost-sensitive machine learning with a dynamic process to eliminate low-performing configurations. This improves efficiency by freeing up resources and adjusting the configuration during execution. Experiments with the Hybrid Genetic Search (HGS) algorithm for solving CVRP using XE CVRP data generated from Vidal and Uchoa instances showed that this strategy allows HGS to reach high-quality solutions in less time [<xref ref-type="bibr" rid="B31">31</xref>].</p>
				<p>In 2024, the ASP method was introduced, integrating two main components: Distributional Exploration (DE) and Persistent Scale Adaptation (PSA). DE uses a Policy Space Response Oracle (PSRO) and PSA employs a stepwise adaptive procedure. The ASP was evaluated by comparing solvers with and without ASP. The results for the TSPLIB and CVRPLIB instances showed significant improvements in optimality gaps and computational efficiency. Although focused on TSP and CVRP, ASP showed promising results for other combinatorial optimization problems [<xref ref-type="bibr" rid="B32">32</xref>].</p>
				<p>In 2024, a heuristic approach based on machine learning was introduced to solve instances of the capacitated vehicle routing problem in scenarios with slight changes in customer demands. The algorithm uses a Supervised Learning Model (SLM), specifically, a binary classification model, to predict which edges from a previous solution are likely to remain after demand changes. Compared to the FILO algorithm (Fast Iterated Localized Search Optimization), the machine learning and heuristic approach allowed for high-quality solutions in less time. Evaluations conducted with instances from the X set of the CVRPLIB and modified configurations showed significant reductions in re-optimization times, with optimality gaps ranging from 0% to 1.7% [<xref ref-type="bibr" rid="B33">33</xref>].</p>
				<p>Among the 30 articles reviewed, 40% (12 studies) focused on reinforcement learning, 37% (11 studies) on metaheuristics, and 23% (7 studies) on machine learning. <xref ref-type="fig" rid="f1">Figure 1</xref>a shows the distribution of analyzed publications by year organized into these three categories, where an increase in the use of reinforcement learning can be observed. Additionally, as shown in <xref ref-type="fig" rid="f1">Figure 1</xref>b, metaheuristics achieved, on average, a higher score than reinforcement learning, followed by machine learning, according to the evaluation criteria employed.</p>
				<p>
					<fig id="f1">
						<label>Figure 1</label>
						<caption>
							<title><italic>
 <italic>(a) Distribution of Publications by Year. (b) Mean Score of Publications by Year.</italic>
</italic></title>
						</caption>
						<graphic xlink:href="2357-5328-rfing-33-68-e18379-gf1.png"/>
					</fig>
				</p>
			</sec>
			<sec>
				<title><italic>D. Response to Research Questions</italic></title>
				<p>The research questions are answered below based on the previously presented description of the selected and analyzed primary studies.</p>
				<p>1) What machine learning techniques are being used, how do they operate, and which stage of a metaheuristic algorithm do they support in solving the CVRP problems? The main ML techniques used to enhance metaheuristics in solving CVRP problems include DNNs, MA, ASGD, clustering, and RL with MAB and UCB approaches [<xref ref-type="bibr" rid="B4">4</xref>]-[<xref ref-type="bibr" rid="B7">7</xref>], [<xref ref-type="bibr" rid="B13">13</xref>].</p>
				<p>2) How does the metaheuristic algorithm uses the knowledge obtained from the machine learning technique to solve the CVRP problems? The knowledge obtained from ML techniques is used to guide the search, improve neighborhood selection, repair incomplete solutions, enhance pheromone values to prevent premature convergence and optimize adaptive neighborhood selection to enable more efficient searches focused on promising areas [<xref ref-type="bibr" rid="B5">5</xref>]-[<xref ref-type="bibr" rid="B7">7</xref>].</p>
				<p>3) What CVRP datasets are used to evaluate and compare the performance of the algorithms? The most used datasets to evaluate and compare algorithm performance are those provided in CVRPLIB and Solomon [<xref ref-type="bibr" rid="B30">30</xref>], [<xref ref-type="bibr" rid="B33">33</xref>]. Randomly generated instance sets from each study are also employed [<xref ref-type="bibr" rid="B16">16</xref>], [<xref ref-type="bibr" rid="B18">18</xref>].</p>
				<p>4) What metrics are used for the evaluation and comparison of the algorithm performance? The percentage deviation (Gap) is used, including the Best Gap (for the best solution obtained) and the Average Gap, to measure the average quality of the solutions found. To evaluate computational efficiency, execution time, number of iterations, and number of objective function evaluations are used [<xref ref-type="bibr" rid="B5">5</xref>], [<xref ref-type="bibr" rid="B7">7</xref>].</p>
			</sec>
		</sec>
		<sec sec-type="conclusions">
			<title>4. CONCLUSIONS AND FUTURE WORK</title>
			<p>The techniques, methods, and models used to solve the CVRP problem through metaheuristic algorithms, reinforcement learning, and machine learning were identified and synthesized in this systematic review. A notable trend in the development of hybrid computational models has emerged, specifically those that integrate machine-learning techniques with metaheuristics to improve solution quality and computational efficiency. In the metaheuristic-related studies analyzed, four out of nine employed this hybrid approach, and their results indicate that this integration is promising. These models combine the exploration and adaptation capabilities of metaheuristics with the learning and prediction abilities of machine-learning techniques, allowing the search process to be more efficient and precise in complex problems, such as CVRP.</p>
			<p>Reinforcement learning-based models for solving the CVRP have gained attention in recent years. By integrating deep learning with attention mechanisms and neural networks, processes such as node selection and route construction have been improved, outperforming traditional algorithms. Reinforcement learning has been pivotal in automating the design of metaheuristics, as seen in GSF, which generates optimized algorithms for the CVRP with better generalization and effectiveness. This has enabled more efficient solution space exploration, avoiding premature convergence to suboptimal policies and improving both the quality of solutions and computational efficiency.</p>
			<p>In other machine learning studies, techniques such as Random Forests, TL, GCN, K-Means, and Branch &amp; Cut, combined with optimization algorithms, have effectively reduced search tree size and accelerated learning, highlighting the ML's potential in solving routing problems. Most experiments used standard datasets like TSPLIB, CVRPLIB, and Solomon, with solution quality evaluated using percentage deviation and computational efficiency measured by execution time, iteration count, and objective function evaluations.</p>
			<p>Based on the findings of this systematic review, future research should prioritize the development of an integrated framework that enhances existing metaheuristics through the integration of machine learning techniques. This framework would systematically identify areas within the metaheuristic that could benefit from ML optimization, such as neighborhood selection, solution repair, or algorithm convergence.</p>
		</sec>
	</body>
	<back>
		<ref-list>
			<title>REFERENCES</title>
			<ref id="B1">
				<label>[1]</label>
				<mixed-citation>A. M. Altabeeb, A. M. Mohsen, A. Ghallab, &quot;An improved hybrid firefly algorithm for capacitated vehicle routing problem,&quot; <italic>Applied Software Computing</italic>, vol. 84, e105728, Nov. 2019. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.10167j.asoc.2019.105728">https://doi.org/10.10167j.asoc.2019.105728</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Altabeeb</surname>
							<given-names>A. M.</given-names>
						</name>
						<name>
							<surname>Mohsen</surname>
							<given-names>A. M</given-names>
						</name>
						<name>
							<surname>Ghallab</surname>
							<given-names>A</given-names>
						</name>
					</person-group>
					<article-title>An improved hybrid firefly algorithm for capacitated vehicle routing problem</article-title>
					<source>Applied Software Computing</source>
					<volume>84</volume>
					<elocation-id>e105728</elocation-id>
					<month>11</month>
					<year>2019</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.10167j.asoc.2019.105728">https://doi.org/10.10167j.asoc.2019.105728</ext-link>
				</element-citation>
			</ref>
			<ref id="B2">
				<label>[2]</label>
				<mixed-citation>A. M. Altabeeb, A. M. Mohsen, L. Abualigah, A. Ghallab, &quot;Solving capacitated vehicle routing problem using cooperative firefly algorithm,&quot; <italic>Applied Software Computing</italic>, vol. 108, e107403, Sep. 2021. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.asoc.2021.107403">https://doi.org/10.1016/j.asoc.2021.107403</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Altabeeb</surname>
							<given-names>A. M.</given-names>
						</name>
						<name>
							<surname>Mohsen</surname>
							<given-names>A. M.</given-names>
						</name>
						<name>
							<surname>Abualigah</surname>
							<given-names>L</given-names>
						</name>
						<name>
							<surname>Ghallab</surname>
							<given-names>A</given-names>
						</name>
					</person-group>
					<article-title>Solving capacitated vehicle routing problem using cooperative firefly algorithm</article-title>
					<source>Applied Software Computing</source>
					<volume>108</volume>
					<elocation-id>e107403</elocation-id>
					<month>09</month>
					<year>2021</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.asoc.2021.107403">https://doi.org/10.1016/j.asoc.2021.107403</ext-link>
				</element-citation>
			</ref>
			<ref id="B3">
				<label>[3]</label>
				<mixed-citation>J. Cai, P. Wang, S. Sun, H. Dong, &quot;A dynamic space reduction ant colony optimization for capacitated vehicle routing problem,&quot; <italic>Software computing</italic>, vol. 26, no. 17, pp. 8745-8756, Sep. 2022. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00500-022-07198-2">https://doi.org/10.1007/s00500-022-07198-2</ext-link> .</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Cai</surname>
							<given-names>J</given-names>
						</name>
						<name>
							<surname>Wang</surname>
							<given-names>P</given-names>
						</name>
						<name>
							<surname>Sun</surname>
							<given-names>S</given-names>
						</name>
						<name>
							<surname>Dong</surname>
							<given-names>H</given-names>
						</name>
					</person-group>
					<article-title>A dynamic space reduction ant colony optimization for capacitated vehicle routing problem</article-title>
					<source>Software computing</source>
					<volume>26</volume>
					<issue>17</issue>
					<fpage>8745</fpage>
					<lpage>8756</lpage>
					<month>09</month>
					<year>2022</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00500-022-07198-2">https://doi.org/10.1007/s00500-022-07198-2</ext-link>
				</element-citation>
			</ref>
			<ref id="B4">
				<label>[4]</label>
				<mixed-citation>A. Thammano, P. Rungwachira, &quot;Hybrid modified ant system with sweep algorithm and path relinking for the capacitated vehicle routing problem,&quot; <italic>Heliyon</italic>, vol. 7, no. 9, e8029, Sep. 2021. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.heliyon.2021.e08029">https://doi.org/10.1016/j.heliyon.2021.e08029</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Thammano</surname>
							<given-names>A</given-names>
						</name>
						<name>
							<surname>Rungwachira</surname>
							<given-names>P</given-names>
						</name>
					</person-group>
					<article-title>Hybrid modified ant system with sweep algorithm and path relinking for the capacitated vehicle routing problem</article-title>
					<source>Heliyon</source>
					<volume>7</volume>
					<issue>9</issue>
					<elocation-id>e8029</elocation-id>
					<month>09</month>
					<year>2021</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.heliyon.2021.e08029">https://doi.org/10.1016/j.heliyon.2021.e08029</ext-link>
				</element-citation>
			</ref>
			<ref id="B5">
				<label>[5]</label>
				<mixed-citation>Y. Zhou, W. Li, X. Wang, Y. Qiu, W. Shen, &quot;Adaptive gradient descent enabled ant colony optimization for routing problems,&quot; <italic>Swarm and Evolutionary Computation</italic>, vol. 70, e101046, Apr. 2022. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.swevo.2022.101046">https://doi.org/10.1016/j.swevo.2022.101046</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Zhou</surname>
							<given-names>Y</given-names>
						</name>
						<name>
							<surname>Li</surname>
							<given-names>W</given-names>
						</name>
						<name>
							<surname>Wang</surname>
							<given-names>X</given-names>
						</name>
						<name>
							<surname>Qiu</surname>
							<given-names>Y</given-names>
						</name>
						<name>
							<surname>Shen</surname>
							<given-names>W</given-names>
						</name>
					</person-group>
					<article-title>Adaptive gradient descent enabled ant colony optimization for routing problems</article-title>
					<source>Swarm and Evolutionary Computation</source>
					<volume>70</volume>
					<elocation-id>e101046</elocation-id>
					<month>04</month>
					<year>2022</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.swevo.2022.101046">https://doi.org/10.1016/j.swevo.2022.101046</ext-link>
				</element-citation>
			</ref>
			<ref id="B6">
				<label>[6]</label>
				<mixed-citation>P. Kalatzantonakis, A. Sifaleras, N. Samaras, &quot;A reinforcement learning-Variable neighborhood search method for the capacitated Vehicle Routing Problem,&quot; <italic>Expert Systems with Applications</italic>, vol. 213, e118812, Mar. 2023. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.10167j.eswa.2022.118812">https://doi.org/10.10167j.eswa.2022.118812</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Kalatzantonakis</surname>
							<given-names>P</given-names>
						</name>
						<name>
							<surname>Sifaleras</surname>
							<given-names>A</given-names>
						</name>
						<name>
							<surname>Samaras</surname>
							<given-names>N</given-names>
						</name>
					</person-group>
					<article-title>A reinforcement learning-Variable neighborhood search method for the capacitated Vehicle Routing Problem</article-title>
					<source>Expert Systems with Applications</source>
					<volume>213</volume>
					<elocation-id>e118812</elocation-id>
					<month>03</month>
					<year>2023</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.10167j.eswa.2022.118812">https://doi.org/10.10167j.eswa.2022.118812</ext-link>
				</element-citation>
			</ref>
			<ref id="B7">
				<label>[7]</label>
				<mixed-citation>A. Hottung, K. Tierney, &quot;Neural large neighborhood search for routing problems,&quot; <italic>Artificial Intelligence</italic>, vol. 313, e103786, Dec. 2022. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.artint.2022.103786">https://doi.org/10.1016/j.artint.2022.103786</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Hottung</surname>
							<given-names>A</given-names>
						</name>
						<name>
							<surname>Tierney</surname>
							<given-names>K</given-names>
						</name>
					</person-group>
					<article-title>Neural large neighborhood search for routing problems</article-title>
					<source>Artificial Intelligence</source>
					<volume>313</volume>
					<elocation-id>e103786</elocation-id>
					<month>12</month>
					<year>2022</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.artint.2022.103786">https://doi.org/10.1016/j.artint.2022.103786</ext-link>
				</element-citation>
			</ref>
			<ref id="B8">
				<label>[8]</label>
				<mixed-citation>B. Kitchenham, P. Brereton, &quot;A systematic review of systematic review process research in software engineering,&quot; <italic>Information and Software Technology</italic>, vol. 55, no. 12, pp. 2049-2075, 2013. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.infsof.2013.07.010">https://doi.org/10.1016/j.infsof.2013.07.010</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Kitchenham</surname>
							<given-names>B</given-names>
						</name>
						<name>
							<surname>Brereton</surname>
							<given-names>P</given-names>
						</name>
					</person-group>
					<article-title>A systematic review of systematic review process research in software engineering</article-title>
					<source>Information and Software Technology</source>
					<volume>55</volume>
					<issue>12</issue>
					<fpage>2049</fpage>
					<lpage>2075</lpage>
					<year>2013</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.infsof.2013.07.010">https://doi.org/10.1016/j.infsof.2013.07.010</ext-link>
				</element-citation>
			</ref>
			<ref id="B9">
				<label>[9]</label>
				<mixed-citation>A. García-Holgado, S. Marcos-Pablos, F. J. García-Peñalvo, &quot;Guidelines for performing systematic research projects reviews,&quot; <italic>International Journal of Interactive Multimedia and Artificial Intelligence</italic>, vol.05, e05, 2020. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.9781/ijimai.2020.05.005">https://doi.org/10.9781/ijimai.2020.05.005</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>García-Holgado</surname>
							<given-names>A.</given-names>
						</name>
						<name>
							<surname>Marcos-Pablos</surname>
							<given-names>S.</given-names>
						</name>
						<name>
							<surname>García-Peñalvo</surname>
							<given-names>F. J.</given-names>
						</name>
					</person-group>
					<article-title>Guidelines for performing systematic research projects reviews</article-title>
					<source>International Journal of Interactive Multimedia and Artificial Intelligence</source>
					<volume>05</volume>
					<elocation-id>e05</elocation-id>
					<year>2020</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.9781/ijimai.2020.05.005">https://doi.org/10.9781/ijimai.2020.05.005</ext-link>
				</element-citation>
			</ref>
			<ref id="B10">
				<label>[10]</label>
				<mixed-citation>W. Yang, L. Ke, &quot;An improved fireworks algorithm for the capacitated vehicle routing problem,&quot; <italic>Frontiers in Computer Science</italic>, vol. 13, no. 3, pp. 552-564, Jun. 2019. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s11704-017-6418-9">https://doi.org/10.1007/s11704-017-6418-9</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Yang</surname>
							<given-names>W</given-names>
						</name>
						<name>
							<surname>Ke</surname>
							<given-names>L</given-names>
						</name>
					</person-group>
					<article-title>An improved fireworks algorithm for the capacitated vehicle routing problem</article-title>
					<source>Frontiers in Computer Science</source>
					<volume>13</volume>
					<issue>3</issue>
					<fpage>552</fpage>
					<lpage>564</lpage>
					<month>06</month>
					<year>2019</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s11704-017-6418-9">https://doi.org/10.1007/s11704-017-6418-9</ext-link>
				</element-citation>
			</ref>
			<ref id="B11">
				<label>[11]</label>
				<mixed-citation>Y. Wu, F. Pan, S. Li, Z. Chen, M. Dong, &quot;Peer-induced fairness capacitated vehicle routing scheduling using a hybrid optimization ACO-VNS algorithm,&quot; <italic>Software computing</italic>, vol. 24, no. 3, pp. 2201-2213, Feb. 2020. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00500-019-04053-9">https://doi.org/10.1007/s00500-019-04053-9</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Wu</surname>
							<given-names>Y</given-names>
						</name>
						<name>
							<surname>Pan</surname>
							<given-names>F</given-names>
						</name>
						<name>
							<surname>Li</surname>
							<given-names>S</given-names>
						</name>
						<name>
							<surname>Chen</surname>
							<given-names>Z</given-names>
						</name>
						<name>
							<surname>Dong</surname>
							<given-names>M</given-names>
						</name>
					</person-group>
					<article-title>Peer-induced fairness capacitated vehicle routing scheduling using a hybrid optimization ACO-VNS algorithm</article-title>
					<source>Software computing</source>
					<volume>24</volume>
					<issue>3</issue>
					<fpage>2201</fpage>
					<lpage>2213</lpage>
					<month>02</month>
					<year>2020</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00500-019-04053-9">https://doi.org/10.1007/s00500-019-04053-9</ext-link>
				</element-citation>
			</ref>
			<ref id="B12">
				<label>[12]</label>
				<mixed-citation>Y. H. Jia, Y. Mei, M. Zhang, &quot;A Bilevel Ant Colony Optimization Algorithm for Capacitated Electric Vehicle Routing Problem,&quot; <italic>IEEE Transactions on Cybernetics</italic>, vol. 52, no. 10, pp. 10855-10868, Oct. 2022. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/TCYB.2021.3069942">https://doi.org/10.1109/TCYB.2021.3069942</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Jia</surname>
							<given-names>Y. H.</given-names>
						</name>
						<name>
							<surname>Mei</surname>
							<given-names>Y</given-names>
						</name>
						<name>
							<surname>Zhang</surname>
							<given-names>M</given-names>
						</name>
					</person-group>
					<article-title>A Bilevel Ant Colony Optimization Algorithm for Capacitated Electric Vehicle Routing Problem</article-title>
					<source>IEEE Transactions on Cybernetics</source>
					<volume>52</volume>
					<issue>10</issue>
					<fpage>10855</fpage>
					<lpage>10868</lpage>
					<month>10</month>
					<year>2022</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/TCYB.2021.3069942">https://doi.org/10.1109/TCYB.2021.3069942</ext-link>
				</element-citation>
			</ref>
			<ref id="B13">
				<label>[13]</label>
				<mixed-citation>N. Frias, F. Johnson, C. Valle, &quot;Hybrid Algorithms for Energy Minimizing Vehicle Routing Problem: Integrating Clusterization and Ant Colony Optimization,&quot; <italic>IEEE Access</italic>, vol. 11, pp. 125800-125821, 2023. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/ACCESS.2023.3325787">https://doi.org/10.1109/ACCESS.2023.3325787</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Frias</surname>
							<given-names>N</given-names>
						</name>
						<name>
							<surname>Johnson</surname>
							<given-names>F</given-names>
						</name>
						<name>
							<surname>Valle</surname>
							<given-names>C</given-names>
						</name>
					</person-group>
					<article-title>Hybrid Algorithms for Energy Minimizing Vehicle Routing Problem: Integrating Clusterization and Ant Colony Optimization</article-title>
					<source>IEEE Access</source>
					<volume>11</volume>
					<fpage>125800</fpage>
					<lpage>125821</lpage>
					<year>2023</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/ACCESS.2023.3325787">https://doi.org/10.1109/ACCESS.2023.3325787</ext-link>
				</element-citation>
			</ref>
			<ref id="B14">
				<label>[14]</label>
				<mixed-citation>N. Sultana, J. Chan, T. Sarwar, A. K. Qin, &quot;Learning to Optimise Routing Problems using Policy Optimisation,&quot; in <italic>Proceedings of the International Joint Conference on Neural Networks</italic>, 2021. <ext-link ext-link-type="uri" xlink:href="https://doi,org/10.1109/IJCNN52387.2021.9534010">https://doi,org/10.1109/IJCNN52387.2021.9534010</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Sultana</surname>
							<given-names>N</given-names>
						</name>
						<name>
							<surname>Chan</surname>
							<given-names>J</given-names>
						</name>
						<name>
							<surname>Sarwar</surname>
							<given-names>T</given-names>
						</name>
						<name>
							<surname>Qin</surname>
							<given-names>A. K.</given-names>
						</name>
					</person-group>
					<article-title>Learning to Optimise Routing Problems using Policy Optimisation</article-title>
					<source>Proceedings of the International Joint Conference on Neural Networks</source>
					<year>2021</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi,org/10.1109/IJCNN52387.2021.9534010">https://doi,org/10.1109/IJCNN52387.2021.9534010</ext-link>
				</element-citation>
			</ref>
			<ref id="B15">
				<label>[15]</label>
				<mixed-citation>J. Li <italic>et al</italic>
 <italic>.,</italic> &quot;Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem,&quot; <italic>IEEE Transactions on Cybernetics</italic>, vol. 52, no. 12, pp. 13572-13585, Dec. 2022. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/TCYB.2021.3111082">https://doi.org/10.1109/TCYB.2021.3111082</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Li</surname>
							<given-names>J</given-names>
						</name>
						<etal/>
					</person-group>
					<article-title>Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem</article-title>
					<source>IEEE Transactions on Cybernetics</source>
					<volume>52</volume>
					<issue>12</issue>
					<fpage>13572</fpage>
					<lpage>13585</lpage>
					<month>12</month>
					<year>2022</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/TCYB.2021.3111082">https://doi.org/10.1109/TCYB.2021.3111082</ext-link>
				</element-citation>
			</ref>
			<ref id="B16">
				<label>[16]</label>
				<mixed-citation>Y. Wu, W. Song, Z. Cao, J. Zhang, A. Lim, &quot;Learning Improvement Heuristics for Solving Routing Problems,&quot; <italic>IEEE Transactions on Neural Networks and Learning Systems</italic>, vol. 33, no. 9, pp. 5057-5069, Sep. 2022. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/TNNLS.2021.3068828">https://doi.org/10.1109/TNNLS.2021.3068828</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Wu</surname>
							<given-names>Y</given-names>
						</name>
						<name>
							<surname>Song</surname>
							<given-names>W</given-names>
						</name>
						<name>
							<surname>Cao</surname>
							<given-names>Z</given-names>
						</name>
						<name>
							<surname>Zhang</surname>
							<given-names>J</given-names>
						</name>
						<name>
							<surname>Lim</surname>
							<given-names>A</given-names>
						</name>
					</person-group>
					<article-title>Learning Improvement Heuristics for Solving Routing Problems</article-title>
					<source>IEEE Transactions on Neural Networks and Learning Systems</source>
					<volume>33</volume>
					<issue>9</issue>
					<fpage>5057</fpage>
					<lpage>5069</lpage>
					<month>09</month>
					<year>2022</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/TNNLS.2021.3068828">https://doi.org/10.1109/TNNLS.2021.3068828</ext-link>
				</element-citation>
			</ref>
			<ref id="B17">
				<label>[17]</label>
				<mixed-citation>Y. Xu, M. Fang, L. Chen, G. Xu, Y. Du, C. Zhang, &quot;Reinforcement Learning with Multiple Relational Attention for Solving Vehicle Routing Problems,&quot; <italic>IEEE Transactions on Cybernetics</italic>, vol. 52, no. 10, pp. 11107-11120, Oct. 2022. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/TCYB.2021.3089179">https://doi.org/10.1109/TCYB.2021.3089179</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Xu</surname>
							<given-names>Y</given-names>
						</name>
						<name>
							<surname>Fang</surname>
							<given-names>M</given-names>
						</name>
						<name>
							<surname>Chen</surname>
							<given-names>L</given-names>
						</name>
						<name>
							<surname>Xu</surname>
							<given-names>G</given-names>
						</name>
						<name>
							<surname>Du</surname>
							<given-names>Y</given-names>
						</name>
						<name>
							<surname>Zhang</surname>
							<given-names>C</given-names>
						</name>
					</person-group>
					<article-title>Reinforcement Learning with Multiple Relational Attention for Solving Vehicle Routing Problems</article-title>
					<source>IEEE Transactions on Cybernetics</source>
					<volume>52</volume>
					<issue>10</issue>
					<fpage>11107</fpage>
					<lpage>11120</lpage>
					<month>10</month>
					<year>2022</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/TCYB.2021.3089179">https://doi.org/10.1109/TCYB.2021.3089179</ext-link>
				</element-citation>
			</ref>
			<ref id="B18">
				<label>[18]</label>
				<mixed-citation>K. Lei, P. Guo, Y. Wang, X. Wu, W. Zhao, &quot;Solve routing problems with a residual edge-graph attention neural network,&quot; <italic>Neurocomputing</italic>, vol. 508, pp. 79-98, Oct. 2022. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.neucom.2022.08.005">https://doi.org/10.1016/j.neucom.2022.08.005</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Lei</surname>
							<given-names>K</given-names>
						</name>
						<name>
							<surname>Guo</surname>
							<given-names>P</given-names>
						</name>
						<name>
							<surname>Wang</surname>
							<given-names>Y</given-names>
						</name>
						<name>
							<surname>Wu</surname>
							<given-names>X</given-names>
						</name>
						<name>
							<surname>Zhao</surname>
							<given-names>W</given-names>
						</name>
					</person-group>
					<article-title>Solve routing problems with a residual edge-graph attention neural network</article-title>
					<source>Neurocomputing</source>
					<volume>508</volume>
					<fpage>79</fpage>
					<lpage>98</lpage>
					<month>10</month>
					<year>2022</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.neucom.2022.08.005">https://doi.org/10.1016/j.neucom.2022.08.005</ext-link>
				</element-citation>
			</ref>
			<ref id="B19">
				<label>[19]</label>
				<mixed-citation>Y. Hou, W. Gu, C. Wang, K. Yang, Y. Wang, &quot;A Selection Hyper-heuristic based on Q-learning for School Bus Routing Problem,&quot; <italic>International Journal of Applied Mathematics</italic>, vol. 52, no. 4, pp. 817-825, Dec. 2022.</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Hou</surname>
							<given-names>Y</given-names>
						</name>
						<name>
							<surname>Gu</surname>
							<given-names>W</given-names>
						</name>
						<name>
							<surname>Wang</surname>
							<given-names>C</given-names>
						</name>
						<name>
							<surname>Yang</surname>
							<given-names>K</given-names>
						</name>
						<name>
							<surname>Wang</surname>
							<given-names>Y</given-names>
						</name>
					</person-group>
					<article-title>A Selection Hyper-heuristic based on Q-learning for School Bus Routing Problem</article-title>
					<source>International Journal of Applied Mathematics</source>
					<volume>52</volume>
					<issue>4</issue>
					<fpage>817</fpage>
					<lpage>825</lpage>
					<month>12</month>
					<year>2022</year>
				</element-citation>
			</ref>
			<ref id="B20">
				<label>[20]</label>
				<mixed-citation>W. Yi, R. Qu, &quot;Automated design of search algorithms based on reinforcement learning,&quot; <italic>Information Sciences (NY)</italic>, vol. 649, e119639, Nov. 2023. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.ins.2023.119639">https://doi.org/10.1016/j.ins.2023.119639</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Yi</surname>
							<given-names>W</given-names>
						</name>
						<name>
							<surname>Qu</surname>
							<given-names>R</given-names>
						</name>
					</person-group>
					<article-title>Automated design of search algorithms based on reinforcement learning</article-title>
					<source>Information Sciences (NY)</source>
					<volume>649</volume>
					<elocation-id>e119639</elocation-id>
					<month>11</month>
					<year>2023</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.ins.2023.119639">https://doi.org/10.1016/j.ins.2023.119639</ext-link>
				</element-citation>
			</ref>
			<ref id="B21">
				<label>[21]</label>
				<mixed-citation>H. Liu, Z. Zong, Y. Li, D. Jin, &quot;NeuroCrossover: An intelligent genetic locus selection scheme for genetic algorithm using reinforcement learning,&quot; <italic>Applied Software Computing</italic>, vol. 146, e110680, Oct. 2023. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.asoc.2023.110680">https://doi.org/10.1016/j.asoc.2023.110680</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Liu</surname>
							<given-names>H</given-names>
						</name>
						<name>
							<surname>Zong</surname>
							<given-names>Z</given-names>
						</name>
						<name>
							<surname>Li</surname>
							<given-names>Y</given-names>
						</name>
						<name>
							<surname>Jin</surname>
							<given-names>D</given-names>
						</name>
					</person-group>
					<article-title>NeuroCrossover: An intelligent genetic locus selection scheme for genetic algorithm using reinforcement learning</article-title>
					<source>Applied Software Computing</source>
					<volume>146</volume>
					<fpage>Oct. 2023</fpage>
					<lpage>Oct. 2023</lpage>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.asoc.2023.110680">https://doi.org/10.1016/j.asoc.2023.110680</ext-link>
				</element-citation>
			</ref>
			<ref id="B22">
				<label>[22]</label>
				<mixed-citation>A. Bogyrbayeva, T. Yoon, H. Ko, S. Lim, H. Yun, C. Kwon, &quot;A deep reinforcement learning approach for solving the Traveling Salesman Problem with Drone,&quot; <italic>Transportation Research Part C: Emerging Technologies</italic>, vol. 148, e103981, Mar. 2023. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.trc.2022.103981">https://doi.org/10.1016/j.trc.2022.103981</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Bogyrbayeva</surname>
							<given-names>A</given-names>
						</name>
						<name>
							<surname>Yoon</surname>
							<given-names>T</given-names>
						</name>
						<name>
							<surname>Ko</surname>
							<given-names>H</given-names>
						</name>
						<name>
							<surname>Lim</surname>
							<given-names>S</given-names>
						</name>
						<name>
							<surname>Yun</surname>
							<given-names>H</given-names>
						</name>
						<name>
							<surname>Kwon</surname>
							<given-names>C</given-names>
						</name>
					</person-group>
					<article-title>A deep reinforcement learning approach for solving the Traveling Salesman Problem with Drone</article-title>
					<source>Transportation Research Part C: Emerging Technologies</source>
					<volume>148</volume>
					<elocation-id>e103981</elocation-id>
					<month>03</month>
					<year>2023</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.trc.2022.103981">https://doi.org/10.1016/j.trc.2022.103981</ext-link>
				</element-citation>
			</ref>
			<ref id="B23">
				<label>[23]</label>
				<mixed-citation>W. Yi, R. Qu, L. Jiao, &quot;Automated algorithm design using proximal policy optimisation with identified features,&quot; <italic>Expert Systems with Applications</italic> , vol. 216, e119461, Apr. 2023. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.eswa.2022.119461">https://doi.org/10.1016/j.eswa.2022.119461</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Yi</surname>
							<given-names>W</given-names>
						</name>
						<name>
							<surname>Qu</surname>
							<given-names>R</given-names>
						</name>
						<name>
							<surname>Jiao</surname>
							<given-names>L</given-names>
						</name>
					</person-group>
					<article-title>Automated algorithm design using proximal policy optimisation with identified features</article-title>
					<source>Expert Systems with Applications</source>
					<volume>216</volume>
					<elocation-id>e119461</elocation-id>
					<month>04</month>
					<year>2023</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.eswa.2022.119461">https://doi.org/10.1016/j.eswa.2022.119461</ext-link>
				</element-citation>
			</ref>
			<ref id="B24">
				<label>[24]</label>
				<mixed-citation>W. Yi, R. Qu, L. Jiao, B. Niu, &quot;Automated Design of Metaheuristics Using Reinforcement Learning Within a Novel General Search Framework,&quot; <italic>IEEE Transactions on Evolutionary Computation</italic>, vol. 27, no. 4, pp. 1072-1084, Aug. 2023. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/TEVC.2022.3197298">https://doi.org/10.1109/TEVC.2022.3197298</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Yi</surname>
							<given-names>W</given-names>
						</name>
						<name>
							<surname>Qu</surname>
							<given-names>R</given-names>
						</name>
						<name>
							<surname>Jiao</surname>
							<given-names>L</given-names>
						</name>
						<name>
							<surname>Niu</surname>
							<given-names>B</given-names>
						</name>
					</person-group>
					<article-title>Automated Design of Metaheuristics Using Reinforcement Learning Within a Novel General Search Framework</article-title>
					<source>IEEE Transactions on Evolutionary Computation</source>
					<volume>27</volume>
					<issue>4</issue>
					<fpage>1072</fpage>
					<lpage>1084</lpage>
					<month>08</month>
					<year>2023</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/TEVC.2022.3197298">https://doi.org/10.1109/TEVC.2022.3197298</ext-link>
				</element-citation>
			</ref>
			<ref id="B25">
				<label>[25]</label>
				<mixed-citation>D. H. Lee, J. Ahn, &quot;Multi-start team orienteering problem for UAS mission re-planning with data-efficient deep reinforcement learning,&quot; <italic>Applied Intelligence</italic>, vol. 54, no. 6, pp. 4467-4489, Mar. 2024. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s10489-024-05367-4">https://doi.org/10.1007/s10489-024-05367-4</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Lee</surname>
							<given-names>D. H.</given-names>
						</name>
						<name>
							<surname>Ahn</surname>
							<given-names>J</given-names>
						</name>
					</person-group>
					<article-title>Multi-start team orienteering problem for UAS mission re-planning with data-efficient deep reinforcement learning</article-title>
					<source>Applied Intelligence</source>
					<volume>54</volume>
					<issue>6</issue>
					<fpage>4467</fpage>
					<lpage>4489</lpage>
					<month>03</month>
					<year>2024</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s10489-024-05367-4">https://doi.org/10.1007/s10489-024-05367-4</ext-link>
				</element-citation>
			</ref>
			<ref id="B26">
				<label>[26]</label>
				<mixed-citation>Y.-E. Hou, W. Gu, C. Wang, K. Yang, Y. Wang, &quot;A Selection Hyper-heuristic based on Q-learning for School Bus Routing Problem,&quot; <italic>International Journal of Applied Mathematics</italic>, no. 4, e35, Dec. 2022.</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Hou</surname>
							<given-names>Y.-E.</given-names>
						</name>
						<name>
							<surname>Gu</surname>
							<given-names>W</given-names>
						</name>
						<name>
							<surname>Wang</surname>
							<given-names>C</given-names>
						</name>
						<name>
							<surname>Yang</surname>
							<given-names>K</given-names>
						</name>
						<name>
							<surname>Wang</surname>
							<given-names>Y</given-names>
						</name>
					</person-group>
					<article-title>A Selection Hyper-heuristic based on Q-learning for School Bus Routing Problem</article-title>
					<source>International Journal of Applied Mathematics</source>
					<issue>4</issue>
					<elocation-id>e35</elocation-id>
					<month>12</month>
					<year>2022</year>
				</element-citation>
			</ref>
			<ref id="B27">
				<label>[27]</label>
				<mixed-citation>N. Furian, M. O'Sullivan, C. Walker, E. Çela, &quot;A machine learning-based branch and price algorithm for a sampled vehicle routing problem,&quot; <italic>OR Spectrum</italic>, vol. 43, no. 3, pp. 693-732, Sep. 2021. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00291-020-00615-8">https://doi.org/10.1007/s00291-020-00615-8</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Furian</surname>
							<given-names>N</given-names>
						</name>
						<name>
							<surname>O'Sullivan</surname>
							<given-names>M.</given-names>
						</name>
						<name>
							<surname>Walker</surname>
							<given-names>C</given-names>
						</name>
						<name>
							<surname>Çela</surname>
							<given-names>E</given-names>
						</name>
					</person-group>
					<article-title>A machine learning-based branch and price algorithm for a sampled vehicle routing problem</article-title>
					<source>OR Spectrum</source>
					<volume>43</volume>
					<issue>3</issue>
					<fpage>693</fpage>
					<lpage>732</lpage>
					<month>09</month>
					<year>2021</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00291-020-00615-8">https://doi.org/10.1007/s00291-020-00615-8</ext-link>
				</element-citation>
			</ref>
			<ref id="B28">
				<label>[28]</label>
				<mixed-citation>A. Yaddaden, S. Harispe, M. Vasquez, &quot;Transfer Learning Helpful for Neural Combinatorial Optimization Applied to Vehicle Routing Problems?,&quot; <italic>Computing and Informatics</italic>, vol. 41, pp. 172-190, 2022. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.31577/cai_2022_1_172">https://doi.org/10.31577/cai_2022_1_172</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Yaddaden</surname>
							<given-names>A</given-names>
						</name>
						<name>
							<surname>Harispe</surname>
							<given-names>S</given-names>
						</name>
						<name>
							<surname>Vasquez</surname>
							<given-names>M</given-names>
						</name>
					</person-group>
					<article-title>Transfer Learning Helpful for Neural Combinatorial Optimization Applied to Vehicle Routing Problems?</article-title>
					<source>Computing and Informatics</source>
					<volume>41</volume>
					<fpage>172</fpage>
					<lpage>190</lpage>
					<year>2022</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.31577/cai_2022_1_172">https://doi.org/10.31577/cai_2022_1_172</ext-link>
				</element-citation>
			</ref>
			<ref id="B29">
				<label>[29]</label>
				<mixed-citation>M. Ben Hassine, M. S. Amri Sakhri, M. Tlili, &quot;Machine learning approach to solving the capacitated vehicle routing problem,&quot; in <italic>1st IEEE Afro-Mediterranean Conference on Artificial Intelligence, AMCAI 2023 - Proceedings</italic>, 2023. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/AMCAI59331.2023.10431502">https://doi.org/10.1109/AMCAI59331.2023.10431502</ext-link>
				</mixed-citation>
				<element-citation publication-type="confproc">
					<person-group person-group-type="author">
						<name>
							<surname>Hassine</surname>
							<given-names>M. Ben</given-names>
						</name>
						<name>
							<surname>Sakhri</surname>
							<given-names>M. S. Amri</given-names>
						</name>
						<name>
							<surname>Tlili</surname>
							<given-names>M</given-names>
						</name>
					</person-group>
					<source>Machine learning approach to solving the capacitated vehicle routing problem</source>
					<conf-name>1st IEEE Afro-Mediterranean Conference on Artificial Intelligence, AMCAI 2023 - Proceedings</conf-name>
					<year>2023</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/AMCAI59331.2023.10431502">https://doi.org/10.1109/AMCAI59331.2023.10431502</ext-link>
				</element-citation>
			</ref>
			<ref id="B30">
				<label>[30]</label>
				<mixed-citation>J. Dornemann, &quot;Solving the capacitated vehicle routing problem with time windows via graph convolutional network assisted tree search and quantum-inspired computing,&quot; <italic>Frontiers in Applied Mathematics and Statistics</italic>, vol. 9, e1155356, 2023. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fams.2023.1155356">https://doi.org/10.3389/fams.2023.1155356</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Dornemann</surname>
							<given-names>J</given-names>
						</name>
					</person-group>
					<article-title>Solving the capacitated vehicle routing problem with time windows via graph convolutional network assisted tree search and quantum-inspired computing</article-title>
					<source>Frontiers in Applied Mathematics and Statistics</source>
					<volume>9</volume>
					<elocation-id>e1155356</elocation-id>
					<year>2023</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fams.2023.1155356">https://doi.org/10.3389/fams.2023.1155356</ext-link>
				</element-citation>
			</ref>
			<ref id="B31">
				<label>[31]</label>
				<mixed-citation>D. Weiss, K. Tierney, &quot;Realtime gray-box algorithm configuration using cost-sensitive classification,&quot; <italic>Annals of Mathematics and Artificial Intelligence</italic>, vol. 23, e9890, 2023. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s10472-023-09890-x">https://doi.org/10.1007/s10472-023-09890-x</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Weiss</surname>
							<given-names>D</given-names>
						</name>
						<name>
							<surname>Tierney</surname>
							<given-names>K</given-names>
						</name>
					</person-group>
					<article-title>Realtime gray-box algorithm configuration using cost-sensitive classification</article-title>
					<source>Annals of Mathematics and Artificial Intelligence</source>
					<volume>23</volume>
					<elocation-id>e9890</elocation-id>
					<year>2023</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s10472-023-09890-x">https://doi.org/10.1007/s10472-023-09890-x</ext-link>
				</element-citation>
			</ref>
			<ref id="B32">
				<label>[32]</label>
				<mixed-citation>C. Wang, Z. Yu, S. McAleer, T. Yu, Y. Yang, &quot;ASP: Learn a Universal Neural Solver!,&quot; <italic>IEEE Transactions on Pattern Analysis and Machine Intelligence</italic>, vol. 46, no. 6, pp. 4102-4114, Jun. 2024. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/TPAMI.2024.3352096">https://doi.org/10.1109/TPAMI.2024.3352096</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Wang</surname>
							<given-names>C</given-names>
						</name>
						<name>
							<surname>Yu</surname>
							<given-names>Z</given-names>
						</name>
						<name>
							<surname>McAleer</surname>
							<given-names>S</given-names>
						</name>
						<name>
							<surname>Yu</surname>
							<given-names>T</given-names>
						</name>
						<name>
							<surname>Yang</surname>
							<given-names>Y</given-names>
						</name>
					</person-group>
					<article-title>ASP: Learn a Universal Neural Solver!</article-title>
					<source>IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
					<volume>46</volume>
					<issue>6</issue>
					<fpage>4102</fpage>
					<lpage>4114</lpage>
					<month>06</month>
					<year>2024</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/TPAMI.2024.3352096">https://doi.org/10.1109/TPAMI.2024.3352096</ext-link>
				</element-citation>
			</ref>
			<ref id="B33">
				<label>[33]</label>
				<mixed-citation>M. Morabit, G. Desaulniers, A. Lodi, &quot;Learning to repeatedly solve routing problems,&quot; <italic>Networks</italic>, vol. 83, no. 3, e22200, 2024. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1002/net.22200">https://doi.org/10.1002/net.22200</ext-link>
				</mixed-citation>
				<element-citation publication-type="journal">
					<person-group person-group-type="author">
						<name>
							<surname>Morabit</surname>
							<given-names>M</given-names>
						</name>
						<name>
							<surname>Desaulniers</surname>
							<given-names>G</given-names>
						</name>
						<name>
							<surname>Lodi</surname>
							<given-names>A</given-names>
						</name>
					</person-group>
					<article-title>Learning to repeatedly solve routing problems</article-title>
					<source>Networks</source>
					<volume>83</volume>
					<issue>3</issue>
					<elocation-id>e22200</elocation-id>
					<year>2024</year>
					<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1002/net.22200">https://doi.org/10.1002/net.22200</ext-link>
				</element-citation>
			</ref>
		</ref-list>
		<fn-group>
			<fn fn-type="other" id="fn1">
				<label>How to cite:</label>
				<p> M. Maca-Chagüendo and C. A. Cobos-Lozada, A Systematic Review of Machine-Learning Enhanced Meta-Heuristics for Solving Capacitated Vehicle Routing Problems. <italic>Revista Facultad de Ingeniería,</italic> vol. 33, no. 68, e18379, 2024. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.19053/01211129.v33.n68.2024.18379">https://doi.org/10.19053/01211129.v33.n68.2024.18379</ext-link>
				</p>
			</fn>
			<fn fn-type="other" id="fn2">
				<label>Mauricio Maca-Chagüendo:</label>
				<p> Methodology; Investigation; Formal analysis; Writing - original draft; Writing - review &amp; editing.</p>
			</fn>
			<fn fn-type="other" id="fn3">
				<label>Carlos-Alberto Cobos-Lozada:</label>
				<p> Conceptualization; Formal analysis; Supervision; Writing - review &amp; editing.</p>
			</fn>
			<fn fn-type="other" id="fn4">
				<label>FUNDING</label>
				<p> This scientific research article is derived from the research project &quot;Machine Learning as an Enhancement to Metaheuristic Algorithms for Solving Capacitated Vehicle Routing Problems,&quot; partially funded by the University of Cauca&quot;. The project started in 2023 and will be completed in 2027.</p>
			</fn>
		</fn-group>
	</back>
</article>