Optimization of Joint Blocklength and UAV Placement for UAV-Based Relay Ultra-Reliable Low Latency Communication System
Abstract:
Drones have a problem with command transmission under Ultra-Reliable Low Latency Communication (URLLC) requirements. This paper discusses minimizing Packet Error Rate (PER) in an Unmanned Aerial Vehicle (UAV) relay system that transmits commands under Ultra-Reliable Low Latency Communication requirements. The problem is solved through joint optimization of block-length allocation and UAV placement. To tackle these challenges, the optimization problem was split into two sub-problems to analyze the convexity and monotonicity of each. An iterative optimization algorithm for PER minimization was then formulated, combining the Alternating Direction Method of the Multipliers algorithm (ADMM) with the bisection search method through a perturbation-based iterative approach. Simulation results confirm that the proposed algorithm achieves up to 16.42% improvement in computation time and up to 57.14% in convergence speed compared to the algorithm using the bisection method alone for both problems, and it gives the same performance as that of the exhaustive search method.
1. Introduction
URLLC is a new type of communication that was introduced in Fifth Generation /Beyond Fifth Generation (5G/B5G) to meet the needs of the Internet of Things (IoT), Industry 4.0, and other applications that have strict latency and reliability requirements [1], [2]. Unlike traditional communication systems, URLLC operates under strict constraints, often requiring end-to-end latencies of less than 1 ms and PER as low as $10^{-5}$ [3], [4]. To guarantee the low-latency transmission of UAV communication systems in URLLC, it uses short packet transmission within a finite packet length [5], [6]. The Shannon formula based on infinite packet length is no longer applicable since PER at the receiver cannot be ignored. To tackle this, Polyanskiy et al. [7] derived the achievable capacity within a finite packet regime to achieve the complex function of the Signal-to-Noise Ratio (SNR), Blocklength, and PER. On the other hand, UAVs have significantly impacted wireless communications due to their advantages, including Line-of-Sight (LoS) aerial–ground connections, rapid deployment, and precise mobility [8], [9]. These features make UAVs suitable for applications requiring reliable and low-latency communication, such as disaster recovery, remote surveillance, and industrial automation. For instance, in scenarios where direct communication between two ground nodes is obstructed by terrain or infrastructure, UAVs can serve as mobile relays, ensuring seamless connectivity [10], [11].
Therefore, one of the effective solutions in the case of communication between two ground points due to obstacles is to use a UAV as a mobile relay [12]. Due to the importance of these benefits, many studies have appeared to benefit from the integration of URLLC and UAV. Minimization of the overall PER by jointly optimizing blocklength allocation and UAV placement in a relay system was studied by Pan et al. [13]. Moreover, the minimization of the overall PER by optimizing blocklength allocation and distance for the Multi-Hop UAV relay system was investigated in the study of Ranjha and Kaddoum [14]. Resource allocation in a two-way UAV relaying system with URLLC was studied by Cai et al. [15]. The optimization of resource allocation and UAV placement in a UAV-based NOMA-based relay system was studied [16]. Although significant progress has been made, several challenges persist, one of the most critical being the urgent need to design and implement fast and efficient algorithms capable of determining the optimal system parameters. Where it is considered that time is a crucial element in such systems, the lack of speed and precision in identifying these parameters can lead to delays, inefficiencies, or suboptimal performance, which can ultimately compromise the system's effectiveness.
This paper introduces a UAV-based communication system for delivering URLLC services. In the context of military operations, an autonomous robot is assigned the critical role of conducting reconnaissance missions within a designated military zone. It operates remotely, receiving command instructions and relevant information from a central control unit. The aim of the research is to minimize the PER subject to the requirement of reliability and latency by jointly optimizing blocklength allocation for the control unit and the UAV placement. To achieve this, an iterative optimization algorithm was proposed based on the ADMM and the bisection search method for the PER minimization problem.
2. System Model and Problem Formulation
A two-dimensional UAV-based military reconnaissance system was proposed, consisting of a control unit, a UAV, and a robot. The primary function of the UAV is to gather intelligence, assess the environment, and relay vital data back to the command center, as shown in Figure 1. The area between the control unit and the robot has a hill that blocks the line of sight between the sender and the receiver, so the channel gain between them is negligible and weak. That is why using a UAV as a relay as it hovers over the hill to ensure communication and complete the delivery of information.

The area between the control unit and the robot has a hill that blocks the line of sight between the sender and the receiver, so the channel gain between them is negligible and weak. That is why using a UAV as a relay as it hovers over the hill ensures communication and complete information delivery. The positional coordinates of the control unit, UAV, and autonomous robot are denoted as (0, 0), (x, H), and (D, 0), respectively. These coordinates represent their locations within a Cartesian plane, where the control unit is positioned at the origin with H as the fixed height of the UAV.
Each transmission has two stages: the first stage from the control unit to the UAV and the second stage from the UAV to the robot, or autonomous reaction unit. $m_1$ and $m_2$ represent the blocklength allocated for each stage. The system bandwidth is $B$. Then, the overall blocklength is $M=B T_{\text {max }}$, where $T_{\text {max }}$ is the time in seconds that transmission needs to be complete to transmit a command signal whose packet size is $L$ bits. The channel power gain for the two stages is denoted as $h_1$ and $h_2$, respectively. The transmission powers are fixed from the control unit, $P_1$, and the UAV is $P_2$, respectively.
The findings in the study of Hashemi et al. [6] indicate that a robust LoS connection is established between the terrestrial control unit and the UAV when it is operating at a specified height, specifically at 120 meters. And can employ the free space channel model in this system. Channel power gain of the link can be mathematically expressed as:
where $\beta_0$ is channel power gain at a reference distance, $d_0=1$ meter.
As stated by the researches [7], [13], [17], the PER for transition m symbols within a finite packet length of size $L$ is:
where, $Q(x)$ is a semantic equation for the above equation $\left(\varepsilon_i=Q\left(f\left(\gamma_i, m_i, L_i\right)\right)\right)$ and a detailed explanation of the components of the equation.
$Q(x)=\frac{1}{\sqrt{2 \pi}} \int_x^{\infty} e^{-\frac{u^2}{2}} d u$ represents the right-tailed function of the standard normal distribution.
$V_i=1-\frac{1}{\left(1+\gamma_i\right)^2}$, represents channel dispersion.
$\gamma_i=P_i h_i$, represent the Signal-to-Noise Ratio (SNR).
$f\left(\gamma_i, m_i, L_i\right)=(\ln 2) \sqrt{\frac{m_i}{V i}}\left(\log _2\left(1+\gamma_i\right)\right)$.
Based on this, the PER at the relay is $\varepsilon_1=Q\left(f\left(\gamma_1, m_1, L\right)\right)$, and the PER for the robot is$\varepsilon_2=Q\left(f\left(\gamma_2, m_2, L\right)\right)$.
This system assumes that the UAV is a relay that serves as a Decode-and-Forward (DF).
The cumulative PER for the two stages, while $\varepsilon$ is defined as follows:
The objective of this work is the overall PER minimization due to latency and blocklength restriction to enable URLLC in this system by jointly optimizing blocklength for two stages and the location of the UAV. Consequently, the formulation of the problem optimization is given by Eq. (4) [13], [17]:
Provided that $m_i$ must be positive integers $m_{d, m}, m_2 \in z$, with $m_1+m_2=M$, and the workable domain for $x$ that primarily relies on the shape of the obstacle $d_1 \leq x \leq d_2$.
The problem in optimization shown in Eq. (4) presents complexity in all its constraints due to the nonconvex nature of the objective function concerning both the location and the blocklength. This complexity persists even when their values are constant, and additionally, the objective function does not exhibit joint convexity concerning the optimization variables.
3. Optimization Algorithm
To solve the problem in Eq. (4), an iterative algorithm was developed to find the optimal UAV placement and blocklength allocation to minimize the effective PER for two stages of the UAV-based relay communication system with finite packet length subject to the latency constraints and reliability requirement. Achieve this by dividing it into two sub-problems. First, find the optimized blocklength with fixed location x. Then, find the optimal location by fixing the blocklength allocation. Finally, alternatively, solve each subproblem until convergence.
According to the study [1], the channel gains are fixed when the UAV location $x$ is fixed. Therefore, our optimization task focuses solely on the blocklength allocation. It's important to note that the objective function in Eq. (4) is quite complex. Also, to satisfy the reliability requirement, PER should be $\leq 10^{-5}$. The probability of PER at each stage ought to be adequately low. To address this challenge, the overall probability of error $\varepsilon$ can be approximately denoted as $\varepsilon\left(m_1, m_2\right) \approx \varepsilon_1\left(m_1\right)+\varepsilon_2\left(m_2\right) \triangleq \tilde{\varepsilon}(m)$. Consequently, to prove that $\tilde{\varepsilon}(m)$ is convex with respect to $m_1, m_2$, let $f_1\left(m_1\right)=f\left(\gamma_1, m_1, L\right)$. The $1^{\text {st }}$ and $2^{\text {nd }}$-order derivation of $\varepsilon_1\left(m_1\right)$ with respect to $m$ could be derived as:
Since $e^{-\frac{f_1^2\left(m_1\right)}{2}}>0$ and $f_1\left(m_1\right)\left(f_1^{\prime}\left(m_1\right)\right)^2>0$, it is only needed to prove $f_1^{\prime \prime}\left(m_1\right) \leq 0$.
The function $f_{\text {lcould }}$ be simplified by defining $C_1=\log _2(1+ \gamma_{1)}, A_1=\frac{\ln 2}{\sqrt{V_1}}$ then, $f_1\left(m_1\right)$ and could be written as
The $1^{\text {st }}$ and $2^{\text {nd }}$-order derivation of $f_1\left(m_1\right)$ with respect to $m_l$ is
Therefore, it explains the exact concavity of the function $f_1\left(m_1\right)$ of $m_1$. This is proven by the convexity of $\varepsilon_1\left(m_1\right)$ with respect to $m_1$. The convexity of $\varepsilon_2\left(m_2\right)$ with respect to $m_2$ can be proven by using similar derivations. Thus, the optimal solution to problem 4 can be expressed as:
The proposed ADMM algorithm in the studies[17], [18] was utilized to obtain optimal $m_1$ and $m_2$. Therefore, the Augmented Lagrange Function (ALF) is formulated as
where, $\rho$ represents the quadratic penalty factor, and $\lambda$ is the Lagrange multiplier. Eq. (11) combines problem 4 with the constraints by introducing the variable $\lambda$. Following a scale transformation, Eq. (11) can be expressed as:
Algorithm 1. Find the optimal blocklength allocation ( $m_l$, $m_2$ ) with fixed UAV location |
1. Initialize $x, \lambda, \rho, M, m_1{ }^{(k)}, m_2{ }^{(k)}$ define maximum iteration is $k_{\max }$ and algorithm accuracy is $\varpi$. |
The objective in this sub-section is to optimize $x$ with a fixed blocklength. Nevertheless, $\tilde{\varepsilon}(x)$ is not convex with respect to $x$. As in the study [5], we numerically demonstrate the existence of a singular solution that optimally reduces the objective function within the allowable range. Therefore, to be clear, a new function is required to be defined, $g(x) \triangleq \ln \tilde{\varepsilon}(x)$ which is consistent with $\tilde{\varepsilon}(x)$ in monotonicity. Define $f_1(x)= f\left(\gamma_1, m_1, L\right), f_2(x)=f\left(\gamma_2, m_2, L\right), \gamma_1(x)=\frac{P_1 \beta_0}{H^2+x^2}$, and $\gamma_2(x)=\frac{P_1 \beta_0}{H^2+(D-x)^2}$.
The $1^{\text {st }}$ and $2^{\text {nd }}$ order derivative of $g(x)$ with respect to $x$ can be derived as:
where, the $1^{\text {st }}$ and $2^{\text {nd }}$ order derivative of $\varepsilon_1(x)$ is:
where, $Z_1=\frac{\sqrt{m_1}}{\left(\left(1+\gamma_1\right)^2-1\right)^{\frac{5}{2}}}$.
Thus $1^{\text {st }}$ and $2^{\text {nd }}$ order derivative of $\gamma_1(x)$ is:
Also, $1^{\text {st }}$ and $2^{\text {nd }}$ order derivative $\varepsilon_2(x)$ can obtained by using the same derivations. After that, it turns out that $g(x)$ is nonconvex with respect to $x$ on all possible intervals of UAV placement. $g(x), g^{\prime}(x), g^{\prime \prime}(x)$ plotted in Figure 2 it shows that $g(x)$ is a nonconvex function for the period from $x=0$ to $x=142$ because $g^{\prime \prime}(x)$ is negative. Then, $g(x)$ is a convex function for the period from $x=142$ to $x=186$ because $g^{\prime \prime}(x)$ is positive.
Finally, $g(x)$ is a nonconvex function for the period from $x =186$ to $x=200$ because $g^{\prime \prime}(x)$ is negative. Nonetheless, observations indicate that for the period from $x=0$ to $x=173$, the $g^{\prime}(x)$ is negative and for the period from $x=173$ to $x=$ 200 the $g^{\prime}(x)$ is positive. This means the value of it reduces until $x=173$, then rises. From this, it was noted that $g(x)$ is quasiconvex, and there is a distinct $x$ that can optimize the overall effective PER, and Algorithm 2 can be applied to this task.

Algorithm 2. Find the optimal UAV' location (x) with fixed blocklength $\left(m_1, m_2\right)$ |
1. Initialize error tolerance $\zeta=0.1, x^{l b}=d_l$ and $x^{u b}=d_2$ |
Algorithm 3 combines the previous two algorithms within an iterative framework, utilizing a random integer $R_{\max }$. It initializes the variables and then determines the optimal blocklengths $m_1$ and $m_2$ for the default location by using Algorithm 1. Subsequently, it adds a random number to the blocklengths obtained from Algorithm 1 and identifies three locations for the UAV using Algorithm 2. It then selects the location with the least PER and its corresponding blocklength, repeating the process. This ensures that the algorithm converges to a solution as long as the PER decreases with each iteration.
Algorithm 3. Overall iterative optimization algorithm for optimizing $\mathrm{m}_1, \mathrm{~m}_2$, and |
1. Initialize $k_{\text {max }}, k=1, R_{\text {max }}, x^{(k)}, m_1{ }^{(k)}, m_2{ }^{(k)}$ |
The complexity of the ADMM algorithm depends on the specific problem structure and the criteria for convergence that are employed. For each iteration, the complexity is linear concerning the number of variables when the algorithm is applied to separable problems. Nonetheless, the aggregate complexity is variable based on specific problem structures and the convergence criteria implemented.
In specific scenarios, particularly in instances where the resolution of the problem necessitates matrix inversions or factorizations within the optimization procedure, the complexity can escalate to a cubic relationship, denoted as $N^3$ [7], [19], [20]. Therefore, the complexity of Algorithm 1 is given by $Q_1=O\left(N^3\right)$, and for Algorithm 2 is given by $Q_2= O\left(\log _2\left(\left(d_2-d_1\right) / \zeta\right)\right)$, Since Algorithm 3 calls Algorithm 2 three times every time it calls Algorithm 1 in every iteration, the complexity of Algorithm 3 is $Q_3=O\left(n_{\max }\left(Q_1+Q_2\right)\right.$.
Although the complexity of this algorithm is theoretically higher than the algorithm based on a bisection search method, it has shown fast convergence. On the other hand, the complexity of this problem is low due to its separable nature, and there are no matrix operations. And the appropriate selection of $\rho$ has been instrumental in accelerating the convergence.
The parameter values for the system model used in this paper are listed in Table 1. It is worth noting that most parameters were used from [13], [17], and some variables were adjusted based on the results obtained during the simulation, as they were found to be the most suitable. It's important to highlight that all our simulations were carried out using MATLAB® R2023a on a PC with an Intel i7-1165G7 processor and 8G RAM.
| Parameter | Value |
| Robot distance ($D$) | 200 meter |
| Number of bits ($L$) | 100 bits |
| The start point of the obstacle ($d_1$) | 30 meter |
| The endpoint of the obstacle ($d_2$) | 130 meter |
| Initial UAV location ($x$) | 30 meter |
| Transmit power of CU ($P_1$) | 3 Watt |
| Transmit power of UAV ($P_2$) | 1 Watt |
| Time for transmit command signal ($T_{\text {max }}$) | $10^{-4} \mathrm{sec}$ |
| Bandwidth ($B$) | 1 MHz |
| Number of available blocklength ($M=B T_{\text {max }}$) | 100 |
| Initial blocklength for each stage $m_1=m_2=\frac{M}{2}$ | 50 |
| Channel power gain at a reference distance ( $\beta$ ) | 50 db |
| Location accuracy ($\xi$) | 0.1 meter |
| Random integer range ($R_{\text {max }}$) | 3 |
| Regularization parameter for ADMM ($\lambda$) | 0 |
| Penalty factor ($\rho$) | 0.007 |
| ADMM algorithm accuracy ($\boldsymbol{\varpi}$) | $10^{-6}$ |
| Max Number of iterations ($k_{\text {max }}$) | 10 |
4. Results and Discussion
The convergence behavior of Algorithm 3 is plotted in Figure 3 for different values of the penalty factor for ADMM and fixed UAV altitude. It shows the penalty factor has a high effect on the convergence in terms of overall PER and the number of iterations needed to reach the stable value. So, whenever the value is greater or less than 0.007, it makes the behavior of the algorithm unstable, and the best value is 0.007 to ensure the stability of the algorithm's convergence. Approximated decoding error probability

The convergence of Algorithm 3 is plotted in Figure 4 and compared to the exhaustive search method. The results show that the proposed algorithm achieved the same performance as the exhaustive search method despite the nonconvex nature of the problem. This demonstrates the effectiveness of the proposed algorithm in avoiding costly computations while maintaining accuracy. However, optimal performance depends on the choice of initial parameters, such as $\rho$, as inappropriate values may lead to unstable convergence. Therefore, further studies are recommended to explore mechanisms for dynamically adapting these parameters to improve performance under variable channel conditions.

Figure 5 compares Algorithm 3 to the algorithm in the study [13] that used the bisection search method to optimize the block length. The comparison demonstrates an improvement of up to 57.14% in convergence speed to the optimum value compared to the bisection method-based algorithm. This improvement is attributed to the ADMM algorithm's efficiency and ability to achieve rapid convergence by optimizing partial factors such as the penalty factor, $\rho$. Furthermore, using the bisection method to determine the UAV's position effectively reduced PER, reflecting the algorithm's ability to balance reliability and efficiency.

Furthermore, the computation time for the algorithms was calculated and presented in Table 2, showing a notable optimization in execution time with an acceleration of approximately 16.42% across ten iterations. This optimization becomes even more significant when considering that the proposed algorithm converges to the optimal value faster than the bisection-based method.
| Algorithm | Computation Time (second) |
| Proposed Algorithm | 2.773 |
| Bisection method-based | 3.318 |
| Exhaustive search method | 59,337.78 |
5. Conclusions
This paper introduced the blocklength allocation and UAV placement optimization in a UAV-based relay system that achieves the requirement of URLLC in terms of reliability and latency. Firstly, the problem of reducing the overall PER was split into two subproblems. Then, the ADMM algorithm is applied to solve blocklength allocation, and the bisection search method is used to solve UAV placement optimization. Finally, these two problems were combined by an iterative algorithm to reach the values of the optimal UAV placement and blocklength allocation that minimize PER.
The proposed algorithm requires careful tuning of the penalty factor to achieve optimal results. The choice of it depends on the selected scenario, as different channel conditions and system configurations may require adjustments for stability and efficiency. When selecting the appropriate value of the penalty factor, the algorithm demonstrated significant improvements in convergence speed, reducing the PER by up to 57.14% compared to the bisection search method. Additionally, the computation time was reduced by 16.42%, enhancing its efficiency for real-time communication applications and latency-sensitive systems.
Based on these findings, we recommend implementing adaptive blocklength allocation mechanisms that can respond to changing channel conditions in real-time and deploy multiple UAVs in areas with high user density to distribute the communication load better. The promising computational efficiency demonstrated by the proposed algorithm offers a foundation for real-time implementations, paving the way for its integration into advanced adaptive communication protocols.
The data used to support the findings of this study are available from the corresponding author upon request.
The authors declare that they have no conflicts of interest.
B | System bandwidth, Hz |
$x$ | UAV's horizontal coordinate, meter |
$H$ | UAV's fixed altitude, meter |
D | Distance between the control unit and the robot, meters |
$L$ | Packet size, bits |
M | Overall blocklength, Symbol |
$T_{\text {max }}$ | Maximum latency constraint, meter |
$P_1, P_2$ | Transmission power of the control unit and UAV, watts |
Greek symbols | |
$\beta$ | Channel power gain at a reference distance |
$\gamma$ | Signal-to-Noise Ratio (SNR) |
$\varepsilon$ | Packet Error Rate (PER) |
$\lambda$ | Lagrange multiplier |
$\rho$ | Quadratic penalty factor |
Subscripts | |
$i$ | Index for transmission stages ($i=1,2, \ldots$) |
max | Maximum value |
mid | Midpoint value |
$l b, u b$ | Lower bound and upper bound |
