A Two-Stage Clustering–Evolutionary Framework with Adaptive Search Control for the Vehicle Routing Problem with Time Windows

liang liu; jiarui wang; yujing liang

Outline

Open Access

Research article

A Two-Stage Clustering–Evolutionary Framework with Adaptive Search Control for the Vehicle Routing Problem with Time Windows

Liang Liu^*

,

Jiarui Wang

,

Yujing Liang

School of Economics and Management, Tianjin Polytechnic University, 300387 Tianjin, China

Journal of Operational and Strategic Analytics

|

Volume 3, Issue 4, 2025

|

Pages 251-263

https://doi.org/10.56578/josa030404

Received: 09-28-2025,

Revised: 12-10-2025,

Accepted: 12-26-2025,

Available online: 12-30-2025

View Full Article|

Download PDF

Abstract:

The vehicle routing problem with time windows (VRPTW) remains a central challenge in urban logistics due to the interplay between spatial dispersion, operational constraints, and service reliability requirements. To address the scalability limitations and premature convergence issues commonly observed in conventional metaheuristics, a two-stage optimization framework is developed that integrates clustering-based spatial decomposition with an adaptive evolutionary search mechanism. In the first stage, k-means clustering is employed to partition large-scale customer nodes into geographically coherent subregions, with the number of clusters determined using the silhouette coefficient to ensure structural consistency. This decomposition reduces problem dimensionality and enables localized route optimization. In the second stage, an adaptive genetic algorithm is designed in which crossover and mutation probabilities are dynamically adjusted according to population fitness distribution, thereby improving global exploration in early iterations and enhancing solution refinement in later stages. A mathematical model is formulated to minimize total operational cost, incorporating vehicle activation, transportation distance, and time window penalties under capacity and service constraints. The proposed framework is evaluated using a real-world logistics dataset involving 60 customer nodes. To assess operational robustness, the optimized routing schemes are further validated within an agent-based simulation environment. Comparative results show that the proposed method achieves consistent improvements over baseline strategies, with cost reductions of 28.12% and 20.62% across two service regions, while significantly increasing vehicle utilization and reducing fleet size. The findings indicate that the integration of spatial decomposition and adaptive evolutionary control provides a practical and scalable solution for complex VRPTW instances in dynamic urban logistics settings.

Keywords: Vehicle routing problem with time windows, Clustering-based decomposition, Adaptive genetic algorithm, Logistics optimization, Simulation-based validation

1. Introduction

As global e-commerce develops rapidly, logistics distribution, the critical link between supply and demand, has become a key factor of enterprise competitiveness. According to recent statistics, the total social logistics volume in China had exceeded 331.2 trillion Chinese Yuan (CNY) by November 2025, with a significant increase in express delivery demand. However, several challenges still hinder the quality development of the logistics industry, such as “last-mile” delivery, mismatched allocation of transportation resources, and the high terminal distribution cost.

The vehicle routing problem, particularly the multi-depot vehicle routing problem, is a classical combinatorial optimization problem in logistics [1]. Its fundamental objective is to identify an optimal set of delivery routes, including vehicle capacity, travel distance, and time window restrictions, minimizing total operational cost and satisfying multiple constraints. The vehicle routing problem with time windows has attracted a lot of attention because it is closely related to real-world logistics. Since 2020, research on the vehicle routing problem with time windows has changed toward multi-strategy integration in complex operation. In terms of algorithmic model innovation, Tamke and Buscher [2] proposed branch-and-cut-based routing optimization in drone-assisted delivery systems. Similarly, Weng et al. [3] investigated multi-depot vehicle routing incorporating drone collaboration for emergency logistics, extending the application scope in complex scenarios. Beyond traditional ground logistics, the integration of autonomous and unmanned technologies has introduced new dimensions to routing optimization. In the context of urban delivery, Morim et al. [4] investigated drone-assisted vehicle routing incorporated with robot stations, and Lupi et al. [5] proposed a novel optimization system based on platoons of automated vehicles for last-mile urban freight. To solve the dynamic vehicle routing problem with time windows in these environments, Li et al. [6] proposed an elastic strategy-based adaptive genetic algorithm. This method effectively enhances the global search capability of the algorithm and significantly improves its solution accuracy and adaptability in dynamic and complex logistics environments. From the perspective of heuristic algorithm enhancement, Niazy et al. [7] proposed hybrid optimization combining the chicken swarm optimization with genetic algorithms. This greatly improved the solution accuracy for large-scale capacitated vehicle routing. In terms of heuristic optimization, Rodrigues [8] designed a novel genetic algorithm to effectively solve capacitated vehicle routing problems (VRPs). Building upon such advanced optimization strategies, Thammasang and Arunyanart [9] addressed the unique constraints of ice distribution, while Moghaddasi et al. [10] and Jiang et al. [11] adapted these heuristic methods for cold chain logistics and fresh fruit harvest systems, respectively. Furthermore, regarding dynamic and multi-objective routing models in specialized fields, Shen et al. [12] developed a two-stage scheduling algorithm for medical waste collection. In the aviation sector, Bao et al. [13] proposed a proactive real-time scheduling method for apron service vehicles, and subsequently Bao et al. [14] optimized the scheduling of electric aircraft tow tractors under operator cooperation.

The concept of spatial dimensionality reduction has been widely adopted in preprocessing and collaborative optimization. Wang et al. [15] proposed a clustering-based extended genetic algorithm, addressing the multi-depot vehicle routing problem with three-dimensional loading constraints. In retail logistics distribution, the modeling and optimization of the multi-depot logistics network conducted by Mogale et al. [16] enhanced distribution efficiency, providing valuable insights into spatial decomposition strategies for large-scale logistics systems. In addition, Yan et al. [17] investigated a split-delivery vehicle routing problem with time window and three-dimensional loading constraints, with tailored optimization addressing the associated complexities.

To address the growing complexity of vehicle routing problems (VRP) in dynamic and diverse logistics environments. Mardešić et al. [18], researchers have heavily focused on advanced clustering techniques and heuristic algorithms. Silva et al. [19] proposed a cluster branching technique to enhance decision-making efficiency. Lau et al. [20] extensively investigated the vehicle routing problem with time windows and a limited number of vehicles. By optimizing the allocation of constrained vehicle resources, their study provides an effective routing solution for complex logistics scenarios with limited fleet sizes.

These contributions have further enriched the research on the vehicle routing problem with multiple constraints.

However, limitations remain in the application of conventional genetic algorithms to the vehicle routing problem with time windows. First, fixed crossover and mutation probabilities often cause premature convergence during the later evolution stages, leading to entrapment in local optima. Second, research on optimized facility location and routing with multi-depot and large-scale demand remains fragmented, and systematic dynamic validation is often lacking. Although adaptive strategies have been utilized, the effective integration of clustering analysis with dynamic simulation for end-to-end optimization (from facility location to route planning) has not been fully explored.

In summary, hybrid optimization integrating k-means clustering and an adaptive genetic algorithm is proposed for logistics distribution routing, with empirical validation using the AnyLogic platform. In the first-stage solution, k-means clustering is utilized for scientific partitioning of customer nodes and distribution center allocation, thereby reducing the computational complexity of route optimization. In the second-stage solution, an improved adaptive genetic algorithm is designed for the vehicle routing problem model with time windows, with adaptive operators dynamically adjusting crossover and mutation probabilities based on individual fitness values. This enhances the global search capability. In the real-world case study based on operational data from Company S, a dynamic simulation model is developed using AnyLogic. Comparative analyses show the changes of key indicators, including total operational cost, vehicle load factor, and customer satisfaction, before and after optimization. The results confirm that the proposed approach is feasible and superior in complex logistics.

2. Problem Analysis and Assumptions

2.1 Problem Description

Logistics distribution routing optimization is a two-stage optimization framework composed of distribution center allocation and vehicle routing optimization. In the first stage, improved k-means clustering addresses the spatial hierarchical structure between sorting centers and large-scale customer nodes. The aim is to minimize the within-cluster sum of squared errors through iterative computation using customer coordinates and demand distributions. Then the optimal locations of secondary distribution centers are determined within a complex geographic network. This stage not only optimizes the hierarchical configuration of the logistics network but also achieves spatial decomposition. This can reduce the computational dimensionality of subsequent routing optimization. In the second stage, the vehicle routing problem with time windows is optimized based on the distribution center locations. Delivery vehicles are dispatched from their respective distribution centers to satisfy operational constraints, including vehicle capacity limits, maximum travel distance per trip, and soft time window requirements concerning customer service deadlines. The optimal routing should balance service efficiency and operational cost. The overall objective of the system is to minimize total operational cost by optimizing location and routing decisions. The cost function incorporates fixed vehicle activation costs, distance-dependent transportation costs, and penalty costs concerning time window violations.

2.2 Model Assumptions

The following assumptions were proposed for easy computation and consistency with practical logistics:

Assumption 1: The locations of distribution centers are fixed, and all delivery vehicles should return to their respective origin distribution centers after completing service tasks.

Assumption 2: The demand at each customer node is known in advance and does not exceed the rated capacity of a single vehicle.

Assumption 3: Each customer node is served exactly once by a single vehicle.

Assumption 4: Vehicles travel at a constant speed, without considering stochastic factors such as traffic congestion and other unforeseen disruptions.

Assumption 5: All delivery vehicles have identical specifications, capacity limits, and cost parameters.

3. Mathematical Model Formulation

3.1 Notation Definition

Table 1 shows the model variables and corresponding definitions.

Table 1. Notation of the mathematical model and related definitions

Variables	Definitions
$N$	Set of customer nodes; $N=\{1,2,\dots,n\}$, node 0 represents the distribution center
$V$	Set of delivery vehicles; $V=\{1,2,\dots,m\}$
$d_{ij}$	Distance between node $i$ and node $j$
$c_1$	Unit transportation cost per distance traveled
$c_2$	Fixed vehicle activation cost per vehicle
$q_i$	Demand of customer node $i$
$Q$	Rated vehicle capacity
$t_{ij}$	Travel time from node $i$ to node $j$
$s_i$	Service time at customer node $i$
$T_{ik}$	Arrival time of vehicle $k$ at customer node $i$
$w_1, w_2$	Penalty costs per unit time for early and late arrival
Decision Variables
$x_{ijk}$	Equal to 1 if vehicle $k$ travels from node $i$ to node $j$, otherwise 0
$y_{ik}$	Equal to 1 if customer $i$ is served by vehicle $k$, otherwise 0

3.2 Objective Function

The optimization objective is to minimize the total distribution cost ($Z$), including fixed vehicle activation cost (Z$_1$), transportation cost concerning travel distance (Z$_2$), and time window penalty cost (Z$_3$) using a linear penalty function.

$ Z=\min \left(Z_1+Z_2+Z_3\right) $

The three kinds of costs are expressed as follows:

$ \begin{aligned} &Z_1=\sum_{k \in V} \sum_{j \in N} x_{0 j k} \cdot c_2\\ &Z_2=\sum_{\mathrm{k} \in V} \sum_{i \in N \cup\{0\}} \sum_{j \in N \cup\{0\}} x_{i j k} \cdot d_{i j} \cdot c_1\\ &Z_3=\sum_{k \in V} \sum_{i \in N}\left[w_1 \cdot \max \left(E T_i-T_{i k}, 0\right)+w_2 \cdot \max \left(T_{i k}-L T_i, 0\right)\right] \end{aligned} $

3.3 Constraints

In terms of the capacity constraint, the total load assigned to each vehicle must not exceed its rated capacity. In terms of the service uniqueness constraint, each customer node must be visited exactly once and served by a single vehicle. The constraints are expressed as follows:

$ \begin{aligned} & \sum_{i \in N} q_i y_{i k} \leq Q, \forall k \in V \\ & \sum_{k \in V} y_{i k}=1, \forall i \in N \end{aligned} $

As for the flow conservation constraint, if a node is entered for each vehicle, it must subsequently be exited. As for the distribution center constraint, each vehicle must depart from its assigned distribution center and return to the same distribution center. The constraints are expressed as follows:

$ \begin{aligned} &\sum_{i \in N \cup\{0\}} x_{i j k}=\sum_{i \in N \cup\{0\}} x_{i j k}=y_{j k}, \forall j \in N, \forall k \in V\\ &\sum_{j \in N} x_{0 j k}=\sum_{i \in N} x_{i 0 k} \leq 1, \forall k \in V \end{aligned} $

The time continuity constraint and variable constraint are expressed as follows:

$ \begin{aligned} &\mathrm{x}_{ijk}=1 \Rightarrow T_{i k}+s_i+t_{i j}=T_{j k}, \forall i, j \in N, \forall k \in V\\ &x_{i j k} \in\{1,0\}, y_{j k} \in\{0,1\} \end{aligned} $

4. Algorithm Design

4.1 Regional Partitioning Based on K-Means Clustering

A two-stage solution is developed to optimize large-scale logistics distribution under time window constraints. k-means clustering is employed in the first stage for scientific distribution region partitioning. An adaptive genetic algorithm is used in the second stage to optimize the routes within each subregion.

Figure 1. Flowchart of the k-means clustering algorithm

To reduce the computational complexity concerning the large-scale vehicle routing problem, customer nodes are initially clustered based on their geographical coordinates. The number of clusters ($K$) is determined using the silhouette coefficient as an evaluation metric. The silhouette coefficient $S$($i$) integrating intra-cluster cohesion and inter-cluster separation ranges from -1 to 1. A value closer to 1 indicates more appropriate clustering. The optimal value of K is identified by traversing candidate values and selecting the one that maximizes the average silhouette coefficient. During the clustering process, a set of K initial cluster centroids is randomly selected. The centroids are iteratively updated until the total Euclidean distance between each customer node and its assigned cluster centroid is minimized:

$ J=\sum_{j=1}^K \sum_{i \in C_j}\left\|x_i-\mu_j\right\|^2 $

where, $x_i$ denotes the customer coordinates, and $\mu_j$ represents the centroid of cluster $j$. Figure 1 shows the flowchart of the k-means clustering algorithm.

4.2 Improved Adaptive Genetic Algorithm

4.2.1 Adaptive genetic algorithm

The procedural framework of the adaptive genetic algorithm is described in Figure 2.

Figure 2. Flowchart of the adaptive genetic algorithm

Step 1: Encoding

To enable processing within the genetic algorithm framework, candidate solutions are encoded into chromosome representations. This transformation between problem solutions and chromosome structures is referred to as encoding and decoding.

Step 2: Population Initialization

The initial population generation starts from the random sampling of a predefined number of individuals from the solution space. A chromosome represents an individual. The chromosomes have various forms, such as binary strings, real-valued vectors, or other suitable structures.

Step 3: Fitness Evaluation

The fitness function constructed aims to evaluate the population quality at the individual level. The fitness value determines the probability of selecting an individual as a parent for reproduction and genetic propagation. Individuals with higher fitness values exhibit superior performance in solving the optimization problem.

Step 4: Selection

Appropriate selection strategies are used to select individuals based on their fitness values. These selections are based on rank or involve a roulette wheel or tournament. Certain individuals are chosen for subsequent crossover and mutation.

Step 5: Crossover and Mutation

After two parent individuals are selected, new offspring are generated via crossover. Then the application of mutation to the offspring introduces new genetic characteristics, enhancing population diversity.

Step 6: Parameter Adaptation

The crossover and mutation probabilities are dynamically adjusted according to the fitness distribution and diversity of the current population.

Step 7: Iteration

Steps 2 through 5 are iteratively repeated for a predefined number of generations. When reaching the maximum number of iterations, achieving a predefined fitness threshold, or obtaining stable fitness values over successive generations, the iterative process is terminated.

Step 8: Output

In the final generation, the individual with the highest fitness value is considered the optimal solution. The results are analyzed in the aspects of solution quality, convergence speed, and computational complexity.

In conventional genetic algorithms, the crossover probability ($P_c$ ) and mutation probability ($P_m$) are typically fixed, which may lead to premature convergence or reduced search efficiency. To address these limitations, an adaptive mechanism is introduced for dynamic genetic operator adjustments. The integer encoding scheme adopted is based on permutation. Specifically, the integers 1, 2, …, $n$ are used to denote customer nodes, with 0 representing the distribution center. With a chromosome as a sequence, multiple vehicle routes in the sequence are encoded by inserting the delimiter 0 to separate different routes. A typical chromosome is expressed as $0 \rightarrow i_1 \rightarrow i_2 \rightarrow \ldots \rightarrow 0 \rightarrow i_j \rightarrow \ldots \rightarrow 0$.

4.2.2 Fitness function

The optimization objective is to minimize the total distribution cost Z. Therefore, the fitness function f is the reciprocal of the total cost:

$ f=\frac{1}{Z} $

The total cost Z consists of transportation cost associated with travel distance, fixed vehicle activation cost, and time window penalty cost.

4.2.3 Algorithm design

The vehicle routing problem with time windows is non-deterministic polynomial-time hard, involving multiple coupled constraints (e.g., time window, vehicle capacity, and travel distance constraints). The adaptive genetic algorithm, through its population-based search mechanism, handles multiple constraints and identifies feasible solutions that approximate the global optimum. Furthermore, premature convergence is mitigated by maintaining population diversity. This is very critical for complex optimization problems because diversity enhances the solution space exploration. In addition, the adaptive genetic algorithm integrates with other optimization techniques, such as local search methods and simulated annealing, due to its flexibility and extensibility. This facilitates the hybrid optimization development, further improving solution quality and computational efficiency. Based on these considerations, the algorithmic design is formulated below.

(a) Encoding Scheme

The encoding scheme is critical in the genetic algorithm, as it directly influences algorithmic performance and solution efficiency. The choice of encoding determines the representation of individuals and the application of genetic operators (e.g., crossover and mutation) to individuals. In the vehicle routing problem, commonly adopted encoding schemes include binary encoding, floating-point encoding, and integer-based encoding. Although binary encoding is straightforward to implement in routing problems, it often leads to a lot of infeasible solutions. Floating-point encoding offers higher numerical precision, but it often has lower efficiency in local search operations. Consequently, permutation-based integer encoding is more suitable for the vehicle routing problem.

The permutation-based encoding scheme enables direct mapping between chromosomes and vehicle routes without an explicit decoding process. As a result, the simplified representation and computational procedures can improve the computational efficiency. Under this scheme, customer nodes are encoded as a sequence (1, 2, 3, …, $N$), and vehicle indices are encoded as a sequence (1, 2, 3, …, $k$) for subsequent crossover and mutation. As for the encoding scheme of a distribution system (one depot, eight customer nodes, and two delivery vehicles), integers 1 and 2 represent the two vehicles, integers 3 to 10 correspond to the eight customers, and the chromosome length is set to 10. The structure is illustrated in Figure 3. This encoding approach completely represents the distribution routes, enhances the efficiency of the genetic algorithm, and reduces computational complexity.

With this encoding scheme, the genetic algorithm operates more efficiently in solving the vehicle routing problem. This reduces computational complexity and improves solution quality. The chromosome corresponds to a routing solution in which vehicle 1 serves customer nodes in the sequence $4 \rightarrow 7 \rightarrow 10 \rightarrow 3 \rightarrow 9$, while vehicle 2 serves customer nodes in the sequence $6 \rightarrow 5 \rightarrow 8$.

Figure 3. Chromosome encoding

(b) Population Initialization

As for the random initialization strategy adopted, an initial population consisting of 100 chromosomes is constructed. The sequence of elements within each chromosome is randomly permuted to produce new individuals or solutions. The fitness value of each chromosome is then evaluated, and the population is structured accordingly for subsequent operations. This ensures that chromosomes corresponding to different distribution nodes are processed rationally. The primary advantage of this approach lies in its simplicity and effectiveness. Random initialization within an appropriate range increases the likelihood of identifying globally optimal or near-optimal solutions. Moreover, it reduces the risk of the algorithm becoming trapped in local optima. This initialization method is broadly applicable and widely covers the solution space, thereby enhancing population diversity.

(c) Fitness Function

In the genetic algorithm, the fitness function is a key criterion for evaluating the individual quality. A higher fitness value indicates superior performance of the corresponding individual. The fitness value is computed as the reciprocal of the objective function. Based on the formulated objective function, the fitness function is expressed as follows:

$${ fitness }(i)=\frac{1}{Z}\left(i=1,2, \ldots, N_i\right)$$

where, $ \mathit{fitness}(i) $ and $Z$ denote the fitness value and the objective function of chromosome $i$; and $N_i$ denotes the population size.

(d) Selection Operator

Tournament selection is adopted in the genetic algorithm. From the generated chromosome population, two individuals are randomly sampled to form a tournament group, and their fitness values are compared. The individual with the highest fitness is selected as the “winner” and carried forward to the next generation. This method is computationally efficient and straightforward to implement. Therefore, it is very suitable for large-scale populations. The stochastic selection preserves individuals with diverse characteristics, thereby reducing the risk of premature convergence. The probability $P$ of an individual being selected as the winner can be expressed as follows:

$ P_{\text {(iwnins })}=\frac{{ fitness }(i)}{\sum_{\mathrm{j}=1}^N { fitness }(i)} $

where, fitness (i) denotes the fitness value of each individual, and N denotes the tournament size, i.e., the number of individuals randomly selected in each tournament. A schematic representation of the minimum tournament selection tree is illustrated in Figure 4.

Figure 4. Minimum tournament selection tree

(e) Crossover Operator

Various crossover strategies have been proposed in genetic algorithms. Uniform crossover, although simple, does not preserve the relative order of genes. Single-point and two-point crossover methods can partially maintain gene sequence continuity; however, they often disrupt gene integrity. The core principle of the order crossover method is to preserve both the relative order and uniqueness of genes, thereby ensuring the feasibility of offspring solutions.

In the order crossover process, two parent chromosomes are first selected, and two crossover points are randomly determined. A subsequence of genes from the first parent is copied directly into the corresponding positions of the offspring chromosome. Subsequently, the remaining genes are filled by scanning the second parent in sequence, excluding genes that have already been inherited, and inserting the remaining genes into the unassigned positions in order. For example, a gene segment {5, 3, 4} may be selected from the first parent and placed into the offspring at the corresponding positions. The remaining genes are then taken sequentially from the second parent. Each gene is checked for duplication; if it is not already present in the offspring, it is inserted into the next available position. Upon reaching the end of the second parent, the process continues from the beginning of the chromosome. Genes already present in the offspring (e.g., gene 5) are skipped, and the procedure proceeds to the next gene (e.g., gene 1). This process is repeated until the results in Figure 5 can be obtained. This strategy effectively avoids the occurrence of duplicate or missing genes, which are common issues in conventional single-point or two-point crossover methods, thereby ensuring the validity of the generated routing solutions. The crossover procedure is illustrated in Figure 5.

Figure 5. Order crossover operation

(f) Mutation Operator

During the swap mutation operation process, each gene within a chromosome is sequentially examined, and mutation is performed based on a predefined mutation probability. When a gene is selected for mutation, another gene is randomly chosen from the same chromosome, and their positions are exchanged. The swap mutation process is illustrated in Figure 6.

Figure 6. Swap mutation operation

(g) Adaptive Crossover and Mutation Probabilities

In the adaptive genetic algorithm, prior to the execution of crossover and mutation operations, the average fitness of the population is computed. This average fitness value is then compared with the fitness of each individual, based on which the crossover probability and mutation probability are assigned to each individual.

The parameters $P_c$ and $P_m$ are dynamically adjusted according to the evolutionary state of the population. When the fitness values of individuals tend to converge or the population approaches a local optimum, the probabilities are increased to enhance population diversity. Conversely, when the population exhibits high diversity, the probabilities are reduced to preserve high-quality genetic information. It can be expressed as follows:

$P_{\mathrm{c}}=\left\{\begin{array}{l}k_1 \frac{f_{\max }-f^{\prime}}{f_{\max }-f_{a v g}}, f^{\prime} \geq f_{a v g} \\ k_2, f^{\prime} \lt f_{a v g}\end{array}\right.$

$P_{\mathrm{m}}=\left\{\begin{array}{l}k_3 \frac{f_{\max }-f}{f_{\max }-f_{a v g}}, f \geq f_{a v g} \\ k_4, f \lt f_{a v g}\end{array}\right.$

where, $f_{\max}$ denotes the maximum fitness value within the population, $f_{\text {avg}}$ represents the average fitness value of the population in each generation, $f^{\prime}$ denotes the larger fitness value between the two individuals selected for crossover, $f$ represents the fitness value of the individual subject to mutation, and $k_1, k_2, k_3$, and $k_4$ are constants within the interval (0, 1). Table 2 shows the main parameters of the genetic algorithm.

Table 2. Main parameters of the genetic algorithm

Parameter	Condition	Conventional Genetic Algorithm	Adaptive Genetic Algorithm
Population size	—	500	500
Number of iterations	—	30	30
Crossover probability	$f^{\prime} \lt f_{a v g}$		$0.7+\frac{(0.7-0.2) f^{\prime}}{f_{\text {avg }}}$
	$f^{\prime} \geq f_{\text {avg }}$	0.6	0.2
Mutation probability	$f^{\prime}>f_{a v g}$		$0.05+\frac{(0.2-0.05)\left(f_{\max }-f^{\prime}\right)}{f_{\max }-f_{a \operatorname{g}}}$
	$f^{\prime} \leq f_{a v g}$	0.05	0.02

5. Case Study Analysis

5.1 Experimental Background and Data Source

A real-world last-mile logistics distribution scenario from Company S in Shanghai was selected as the case study. The distribution system consisted of two candidate distribution centers (DC$_1$ and DC$_2$) and 60 customer nodes randomly distributed across the service area. The geographical coordinates (latitude and longitude), demand quantities ($q_i$), and service time window requirements [$E T_i, L T_j$] were obtained from the company’s operational database. Table 3 and Table 4 show the parameters of the vehicle and the improved genetic algorithm, respectively. Figure 7 shows the geographic distribution and demand of customer nodes.

Table 3. Vehicle parameters

Parameter	Value
Fixed vehicle activation cost	100 CNY
Unit transportation cost	1 CNY/km
Time window penalty cost	1 CNY/s
Maximum vehicle capacity	180 kg
Maximum travel speed	15 km/h
Maximum travel distance per trip	60 km

Note: CNY = Chinese Yuan.

Table 4. Parameters of the improved genetic algorithm

Parameter	Description	Value
MAXGEN	Maximum number of iterations	500
popsize	Population size	200
$P_c$	Adaptive crossover probability	0.3–1.0
$P_m$	Adaptive mutation probability	0.01–0.09

Note: MAXGEN =Maximum Number of Generations; popsize =Population Size; Pc =Adaptive Crossover Probability; Pm =Adaptive Mutation Probability.

Figure 7. Geographic distribution and demand of customer nodes

5.2 Optimization Results and Comparative Analysis

5.2.1 Distribution route optimization results

Based on the solution obtained using the adaptive genetic algorithm, four and eleven delivery vehicles were allocated to the distribution centers DC$_1$ and DC$_2$, respectively. The optimized routing configuration is highly compact, and route intersections are effectively avoided.

The best fitness value within the population is recorded at each generation. By comparing the optimal fitness value of the current generation with that of the previous generation, the algorithm optimization can be evaluated. An increase in the best fitness value indicates continued improvement in solution quality, whereas stagnation suggests the possibility of adjusting parameters (e.g., crossover and mutation probabilities) or terminating the criteria. This feedback mechanism monitors algorithmic performance in real time and ensures effectiveness and convergence stability. The evolving optimal solutions for the two distribution centers across iterations are illustrated in Figure 8.

Figure 8. Evolving fitness values for distribution centers

As illustrated in Figure 8, the fitness values of both distribution centers show a decreasing trend over successive iterations. This indicates that the adaptive genetic algorithm effectively improves solution quality.

5.2.2 Chromosome output results

Based on the constructed function framework, the output results of the model were determined. A dedicated procedure was implemented to extract the optimal chromosome, obtaining key performance indicators, including fitness value, time window penalty cost, transportation cost, fixed vehicle activation cost, and chromosome representation. Five simulation runs were performed because the algorithm is stochastic, and the best-performing solution was selected. The corresponding results are presented in Table 5.

Table 5. Chromosome results

Metric	DC$_1$	DC$_2$
Fitness value	441.79	1168.35
Time window penalty cost	0.055	3.852
Transportation cost	41.74	164.495
Fixed vehicle activation cost	400	1000
Chromosome	0, 1, 12, 2, 13, 8, 4, 11, 9, 7, 6, 10, 3, 5	20, 40, 23, 24, 42, 17, 38, 48, 39, 37, 34, 45, 41, 30, 26, 28, 14, 27, 47, 33, 31, 43, 25, 18, 36, 35, 29, 15, 21, 46, 16, 32, 19, 44, 22

Note: DC = distribution center.

Table 5 shows a significant reduction in total distribution cost. The distribution costs in the two service regions are reduced by 28.12% and 20.62%, respectively, because the adaptive genetic algorithm has a strong global optimization capability in handling time window constraints. In addition, the vehicle utilization efficiency is greatly improved. The load factor of most delivery vehicles exceeds 90% after optimization, with seven vehicles in distribution center DC$_2$ reaching full capacity utilization (100%). This indicates a marked enhancement in utilizing transportation resources.

5.3 AnyLogic Simulation Validation

The AnyLogic simulation model was executed. The vehicle distribution process is presented in Figure 9.

Figure 9. Model execution

Table 6 and Table 7 show the vehicle routing schemes and operational details for the two distribution centers.

Table 6. Vehicle routing information on distribution center DC$_1$

Vehicle No.	Tasks	Route	Distance (m)	Time (s)	Load Factor
0	4	DC$_1$→1→2→13→3→DC$_1$	10120	2429	94%
1	3	DC$_1$→14→9→5→DC$_1$	7295	1751	100%
2	3	DC$_1$→12→10→8→DC$_1$	10004	2401	97%
3	4	DC$_1$→7→11→4→6→DC$_1$	12466	2992	97%

Note: DC = distribution center.

Table 7. Vehicle routing information on distribution center DC$_2$

Vehicle No.	Tasks	Route	Distance (m)	Time (s)	Load Factor
4	4	DC$_2$→21→41→24→25→DC$_2$	12787	3069	100%
5	4	DC$_2$→43→18→39→49→DC$_2$	20033	4808	100%
6	3	DC$_2$→40→38→35→DC$_2$	23895	5735	100%
7	3	DC$_2$→46→42→31→DC$_2$	21054	5053	94%
8	4	DC$_2$→27→29→15→28→DC$_2$	25220	6053	100%
9	3	DC$_2$→48→34→32→DC$_2$	24391	5854	83%
10	3	DC$_2$→44→26→19→DC$_2$	8420	2021	88%
11	4	DC$_2$→37→36→30→16→DC$_2$	16583	3980	100%
12	3	DC$_2$→22→47→17→DC$_2$	12779	3067	100%
13	4	DC$_2$→33→20→45→23→DC$_2$	10216	2452	100%

Note: DC = distribution center.

The tables show that four delivery vehicles are allocated to distribution center DC$_1$, with load factors exceeding 90% for all vehicles. Among them, Vehicle 1 achieves full capacity utilization (100%). For distribution center DC$_2$, seven vehicles achieve full capacity utilization (100%) among all ten delivery vehicles, while only two vehicles (83% and 88%) exhibit load factors below 90%. These results indicate that vehicle load factors are significantly improved after optimization. Furthermore, an analysis of distribution costs shows substantial cost reductions in both centers (28.12% and 20.62%). A detailed comparative analysis is presented in Table 8.

Table 8. Comparison of distribution cost optimization for distribution centers DC$_1$ and DC$_2$

Distribution Center	Best Fitness	Vehicles	Minimum Cost (CNY)	Conventional Cost (CNY)	Cost Savings (CNY)
DC$_1$	441.79	4	441.795	614.68	172.885
DC$_2$	1168.35	11	1168.347	1471.86	303.513

Note: DC = distribution center.

The conventional genetic algorithm exhibits notable limitations in addressing large-scale, multi-customer distribution problems. Specifically, its solutions tend to produce longer routing distances and lower vehicle load factors without effectively considering customer time window constraints. This leads to higher operational costs and reduced customer satisfaction. In contrast, the adaptive genetic algorithm obtains significant improvements. The introduction of adaptive crossover and mutation functions substantially enhances the convergence performance and the quality of the optimal solutions. These findings indicate that the proposed approach effectively overcomes the limitations of the conventional genetic algorithm. The optimized routing solutions exhibit shorter travel distances, higher vehicle load factors, and lower time window penalty costs. Consequently, this improves customer service levels, significantly reduces overall distribution costs, and reduces overall logistics system costs.

6. Conclusion

An integrated optimization framework combining k-means clustering and an adaptive genetic algorithm was proposed to address the vehicle routing problem with time windows. The effectiveness of the proposed approach was further validated through simulation experiments conducted on the AnyLogic platform. Based on the case study of the logistics distribution system of Company S, the following conclusions were drawn:

Algorithmic performance: The proposed adaptive genetic algorithm, incorporating dynamic adjustment mechanisms for crossover and mutation probabilities, effectively mitigates premature convergence and entrapment in local optima commonly observed in the conventional genetic algorithm. Compared with the conventional genetic algorithm, the superior global search capability and faster convergence of the adaptive genetic algorithm can identify high-quality routing solutions more efficiently.

•Solution effectiveness: The two-stage optimization strategy—comprising k-means-based regional partitioning and adaptive genetic algorithm-based routing optimization—successfully reduces the dimensionality for large-scale distribution problems. Simulation results from the AnyLogic platform confirm that the optimized routing solutions are feasible under dynamic delivery conditions. Furthermore, significant improvements in vehicle load factor are observed while maintaining compliance with customer time window constraints.

•Economic benefits: The empirical analysis based on Company S demonstrates the practical applicability and economic value of the proposed model and algorithm. The operational costs of distribution centers DC$_1$ and DC$_2$ are reduced by 28.12% and 20.62%, respectively. These results indicate that the proposed approach not only effectively reduces logistics costs but also provides a scientifically grounded decision-support framework for addressing the “last-mile” delivery challenge in urban logistics systems.

7. Author Contribution

Conceptualization, L.L.; methodology, L.L.; software, J.R.W.; validation, Y.J.L.; formal analysis, J.R.W; investigation, J.R.W.; resources, L.L.; data curation, L.L.; writing—original draft preparation, Y.J.L.; writing—review and editing, J.R.W; visualization, L.L.; supervision, L.L.; project administration, L.L.; funding acquisition, L.L. All authors have read and agreed to the published version of the manuscript

Data Availability

The data used to support the research findings are available from the corresponding author upon request.

Conflicts of Interest

The authors declare no conflict of interest.

References

1.

R. R. Baghbadorani and M. Haeri, “An innovative method to solve classic and dynamic multi-depot vehicle routing problem,” Int. J. Logist. Syst. Manag., vol. 50, no. 1, pp. 28–50, 2025. [Google Scholar] [Crossref]

2.

F. Tamke and U. Buscher, “A branch-and-cut algorithm for the vehicle routing problem with drones,” Transp. Res. Part B Methodol., vol. 144, pp. 174–203, 2021. [Google Scholar] [Crossref]

3.

X. Weng, W. She, H. Fan, J. Zhang, and L. Yun, “Multi-depot vehicle routing problem with drones in emergency logistics,” Cluster Comput., vol. 28, no. 1, p. 64, 2025. [Google Scholar] [Crossref]

4.

A. Morim, G. Campuzano, P. Amorim, M. Mes, and E. Lalla-Ruiz, “The drone-assisted vehicle routing problem with robot stations,” Expert Syst. Appl., vol. 238, p. 121741, 2024. [Google Scholar] [Crossref]

5.

M. Lupi, D. Conte, and A. Farina, “Last mile urban freight delivery: A new system based on platoons of automated vehicles,” Transp. Policy, vol. 164, pp. 42–59, 2025. [Google Scholar] [Crossref]

6.

J. Li, R. Liu, and R. Wang, “Elastic strategy-based adaptive genetic algorithm for solving dynamic vehicle routing problem with time windows,” IEEE Trans. Intell. Transp. Syst., vol. 24, no. 12, pp. 13930–13947, 2023. [Google Scholar] [Crossref]

7.

N. Niazy, A. El-Sawy, and M. Gadallah, “Solving capacitated vehicle routing problem using chicken swarm optimization with genetic algorithm,” Int. J. Intell. Eng. Syst., vol. 13, no. 5, pp. 502–513, 2020. [Google Scholar] [Crossref]

8.

C. Rodrigues, “Solving the capacitated vehicle routing problem using a new genetic algorithm,” mastersthesis, San Jose State University, 2023. [Google Scholar] [Crossref]

9.

S. Thammasang and S. Arunyanart, “Efficient route optimization for ice distribution: Enhanced VRPTW with customer retention strategies,” Eng. Technol. Appl. Sci. Res., vol. 14, no. 5, pp. 16346–16356, 2024. [Google Scholar] [Crossref]

10.

B. Moghaddasi, A. S. G. Majid, Z. Mohammadnazari, A. Aghsami, and M. Rabbani, “A green routing-location problem in a cold chain logistics network design within the Balanced Score Card pillars in fuzzy environment,” J. Comb. Optim., vol. 45, no. 5, p. 129, 2023. [Google Scholar] [Crossref]

11.

Y. Jiang, B. Bian, B. Zheng, and J. Chu, “A time space network optimization model for integrated fresh fruit harvest and distribution considering maturity,” Comput. Ind. Eng., vol. 190, p. 110029, 2024. [Google Scholar] [Crossref]

12.

X. Shen, H. Lou, and Z. Ge, “A two-stage scheduling algorithm for dynamic interval multi-objective vehicle routing problem in medical waste collection,” Comput. Ind. Eng., vol. 205, p. 111136, 2025. [Google Scholar] [Crossref]

13.

D. Bao, Z. Chen, and D. Kang, “Proactive real-time scheduling method for apron service vehicles based on mixed strategies,” Comput. Ind. Eng., vol. 192, p. 110182, 2024. [Google Scholar] [Crossref]

14.

D. W. Bao, J. Y. Zhou, D. Kang, and Z. Chen, “Optimization model for electric aircraft tow tractors scheduling under operator cooperation,” Transp. Res. Part C Emerg. Technol., vol. 172, p. 105032, 2025. [Google Scholar] [Crossref]

15.

Y. Wang, Y. Wei, X. Wang, Z. Wang, and H. Wang, “A clustering-based extended genetic algorithm for the multidepot vehicle routing problem with time windows and three-dimensional loading constraints,” Appl. Soft Comput., vol. 133, p. 109922, 2023. [Google Scholar] [Crossref]

16.

D. G. Mogale, A. Ghadge, and S. K. Jena, “Modelling and optimising a multi-depot vehicle routing problem for freight distribution in a retail logistics network,” Comput. Ind. Eng., vol. 207, p. 111315, 2025. [Google Scholar] [Crossref]

17.

M. Yan, L. Y. Chu, and X. C. Wu, “The split delivery vehicle routing problem with time windows and three-dimensional loading constraints,” J. Ind. Manag. Optim., vol. 20, no. 2, pp. 786–807, 2024. [Google Scholar] [Crossref]

18.

N. Mardešić, T. Erdelić, T. Carić, and M. Durasević, “Review of stochastic dynamic vehicle routing in the evolving urban logistics environment,” Mathematics, vol. 12, no. 1, p. 28, 2023. [Google Scholar] [Crossref]

19.

J. M. P. Silva, E. Uchoa, and A. Subramanian, “Cluster branching for vehicle routing problems,” INFORMS J. Comput., 2025. [Google Scholar] [Crossref]

20.

H. C. Lau, M. Sim, and K. M. Teo, “Vehicle routing problem with time windows and a limited number of vehicles,” Eur. J. Oper. Res., vol. 148, no. 3, pp. 559–569, 2003. [Google Scholar] [Crossref]

Cite this:

APA Style

IEEE Style

BibTex Style

MLA Style

Chicago Style

GB-T-7714-2015

Liu, L., WANG J R, & LIANG Y J (2025). A Two-Stage Clustering–Evolutionary Framework with Adaptive Search Control for the Vehicle Routing Problem with Time Windows. J. Oper. Strateg Anal., 3(4), 251-263. https://doi.org/10.56578/josa030404

cc

©2025 by the author(s). Published by Acadlore Publishing Services Limited, Hong Kong. This article is available for free download and can be reused and cited, provided that the original published version is credited, under the CC BY 4.0 license.

pdf

Figure 1. Flowchart of the k-means clustering algorithm

Table 1. Notation of the mathematical model and related definitions

Citations

Crossref: 0