Deep Learning-Based Optimization for Maritime Relay Networks

Guo, Nianci; Wang, Xiaowei

doi:10.3390/app15031160

Open AccessArticle

Deep Learning-Based Optimization for Maritime Relay Networks

by

Nianci Guo

and

Xiaowei Wang

^*

College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(3), 1160; https://doi.org/10.3390/app15031160

Submission received: 1 January 2025 / Revised: 19 January 2025 / Accepted: 22 January 2025 / Published: 24 January 2025

(This article belongs to the Section Marine Science and Engineering)

Download

Browse Figures

Versions Notes

Abstract

:

The complexity and uncertainty of the marine environment pose significant challenges to the stability and coverage of communication links. Due to the limited coverage range of traditional onshore base stations (BSs) in marine environments, relay technology has become an essential approach to extending communication coverage. However, the rapid variations in marine wireless channels and the complexity of hydrological conditions make it extremely difficult to obtain accurate channel state information (CSI). In particular, dynamic environmental factors such as waves, tides and wind speed cause channel parameters to fluctuate significantly over time. To address these challenges, this paper proposes a cooperative communication strategy based on ships and designs a novel channel modeling method to accurately capture the characteristics of marine wireless channels. Furthermore, a deep learning-based optimization scheme is proposed, which formulates the relay selection problem as a spatiotemporal classification task. By integrating the spatial positions of candidate relays and their communication conditions, the proposed scheme enables real-time selection of the optimal relay while evaluating link connectivity probabilities under hydrological influences. Simulation results confirm that the proposed method achieves high accuracy even in rapidly changing marine environments.

Keywords:

maritime communication; relay selection; deep learning; connection probability

1. Introduction

The last two decades have witnessed great progress in terrestrial wireless communications to support various high-rate low-latency applications. However, the development of maritime communications is still far behind. In recent years, providing broadband communications for marine users has attracted attention due to rapidly increasing oceanic activities, including oil exploitation, sea farming, scientific expedition, tourism, etc.

Currently, obtaining access to onshore base stations (BSs) and terrestrial cellular networks is the most appropriate method to provide high-rate and low-cost network service. Hence, techniques to extend the coverage of onshore BSs have been investigated, among which relaying is a mature and effective one. Based on this technology, researchers have continuously explored new applications to enhance long-distance communication capabilities. For example, a multi-hop solution utilizing balloons to achieve broadband communication in remote maritime areas was proposed in [1], providing a novel perspective on addressing maritime communication challenges. Additionally, the work in [2] explored the design of relay networks for multi-domain maritime operations (land, sea, air and space), highlighting the critical need for connectivity and efficient network performance across these diverse environments. Moreover, the flexibility and mobility of unmanned aerial vehicles (UAVs) have garnered significant attention as a means to improve the coverage of maritime communication networks [3]. To address the challenges posed by the lack of stable connections between maritime communication nodes, researchers have proposed integrating UAVs into the topology of sixth-generation maritime communication networks [4]. In scenarios where direct connections to terrestrial base stations are limited, UAV swarms can significantly improve wireless communication performance. However, due to the mobility of UAVs, the state of communication links can easily be affected during operations. To address this issue, Ref. [5] proposed a channel relay selection method based on UAV position prediction to improve the stability of communication links. Additionally, Ref. [6] introduced a Q-learning-based relay selection scheme using the signal-to-noise ratio (SNR) to identify the optimal relay for multi-hop transmissions. Ref. [7] proposed an optimal relay selection scheme based on deep neural networks (DNNs), where the DNN is used to predict the outage probability and perform optimal relay selection, thereby improving network performance. Ref. [8] introduced the optimization of relay selection to improve the performance of Device-to-Device (D2D) communication systems in complex wireless propagation environments.

Broadband reliable wireless communications depend on accurate and instantaneous knowledge of channel information. However, in highly dynamic oceanic environments, channel information changes with hydrological and meteorological parameters, including wind, wave, moisture and temperature, to name a few. The fluctuations of these parameters present significant challenges for estimating channel information in maritime communications, which makes link analysis a challenging task. Ref. [9] proposed a novel opportunistic routing scheme aimed at improving the efficiency of nautical wireless ad hoc networks, particularly in situations where nodes frequently switch between network types. The scheme enhances packet delivery performance by optimizing relay node selection and utilizing prediction mechanisms. Ref. [10] made an initial attempt to leverage the advantages of machine learning to solve the relay selection problem in two-hop networks. In [11], a solution based on Q-learning was proposed to optimize relay selection in scenarios where channel state information (CSI) is unavailable. Furthermore, in reference [12], the relay selection problem was formulated as a multi-class classification problem, and a decision tree-based approach was used to address the challenges of estimating and collecting accurate eavesdropper CSI. Additionally, Ref. [13] proposed a deep learning-based channel estimation algorithm that models the rapid variations in communication channels as two-dimensional images, using known pilot positions to predict unknown channel responses. Deep learning can extract implicit features from large amounts of history data and therefore is regarded as a promising solution for connectivity analysis without full channel information.

Maritime communications are significantly influenced by unique environmental factors, such as seawater evaporation and tidal movements, which render terrestrial communication channel models inadequate for oceanic applications. Current maritime channel models often rely on well-established terrestrial frameworks but fail to capture the complex and dynamic propagation characteristics of marine environments [14]. This highlights the critical need for wireless channel models specifically tailored to maritime conditions. For instance, Ref. [15] analyzed wireless channel models in oceanic environments and identified key challenges, including highly dynamic network topologies and sparse user distributions, which lead to communication instability and high operational costs [16]. In addition, traditional modeling methods demonstrate clear limitations in rapidly changing oceanic conditions. Their lack of real-time adaptability and comprehensiveness prevents them from effectively characterizing the dynamic behavior of wireless channels in maritime environments.

In recent years, deep learning has shown significant potential in various fields. For example, Ref. [17] attempted to combine machine learning techniques with wireless communications. A relay selection scheme in millimeter-wave D2D communications within fifth-generation cellular networks is presented in [18]. The authors in [19] introduced a deep learning-assisted cooperative framework called Predictive Relay Selection (PRS), which enhances the quality of CSI by accurately predicting fading channels. Building upon this progress, the researchers in [20] explored and attempted to apply machine learning in maritime communications. In [21], the authors proposed a deep learning-based approach that analyzes channel and navigation information to predict link availability, providing a foundation for intelligent maritime communication networks.

However, existing strategies often rely on accurate CSI. Measuring and integrating such critical data pose extreme challenges in the dynamic and unpredictable oceanic environment. To overcome these challenges, this paper proposes a deep learning-based optimization strategy for maritime relay networks that eliminates the need for real-time CSI estimation. By enabling multi-hop transmission and prioritizing link connectivity, the proposed approach ensures efficient data transmission and enhances communication reliability. Moreover, link connectivity serves as a critical metric, offering intelligent node selection to optimize overall network performance in maritime scenarios.

In this paper, a channel modeling method is proposed to approximate the oceanic environment, and a deep learning-based scheme is developed to analyze relay networks. The main contributions of this work are as follows:

To capture the highly dynamic characteristics of wireless environments over the ocean, a channel modeling method is established, allowing channel parameters to vary randomly.

A deep learning-based optimization scheme is proposed, transforming the relay selection problem into a spatiotemporal classification and prediction task, effectively addressing the challenge of obtaining accurate channel state information (CSI) in dynamic oceanic conditions.

By incorporating hydrological factors into the evaluation of connectivity probability, this approach systematically analyzes communication systems in complex marine environments. The results demonstrate that optimizing relay selection strategies significantly enhances the system’s adaptability to dynamic conditions and improves data transmission reliability.

2. System Model

In the maritime communication environment, a dual-hop network model is considered, which comprises a source node ship (S), a destination node ship (D) and K relay node ships. Each ship travels at varying speeds (v) and in diverse directions (

θ

), thereby generating a dynamic network environment (depicted in Figure 1). At two distinct time points,

t_{1}

and

t_{2}

, the positions of the ships and their communication links vary, reflecting the instability engendered by the movement of ships within maritime communication networks. In this scenario, data transmission is facilitated via relay nodes that employ the Decode-and-Forward (DF) protocol.

The source node ship must transmit data to the destination node ship; however, the long distance between them makes direct communication impractical. To overcome this limitation, relay node ships serve as intermediaries to facilitate data transmission.

To further expand, a multi-hop network model is considered, where data are required to pass through multiple relay clusters to reach the destination node (D). The network is composed of a source node (S), a destination node (D) and M relay clusters. These relay clusters are labeled

C_{1}

,

C_{2}

, …,

C_{M}

, with each cluster containing multiple relay nodes denoted as

R_{m 1}

,

R_{m 2}

, …,

R_{m K_{m}}

, where

m = 1, . . ., M

. S and D cannot communicate directly, and except for the nodes in

C_{M}

, other relay nodes also cannot establish direct connections with D.

The transmission process begins with S, broadcasting data to the first relay cluster

C_{1}

. Subsequently, a relay node from cluster

C_{1}

is selected to receive the data and forward them to the next cluster,

C_{2}

. This process continues sequentially across the relay clusters until D successfully receives the data. The complete transmission path is formed by the relay nodes selected from each cluster, with the data being forwarded hop-by-hop along the path to ensure their successful delivery to the destination node.

2.1. Marine Communication Environment

In maritime environments, the highly dynamic and variable propagation characteristics of oceanic wireless channels render traditional modeling methods inadequate. To address this issue, a novel channel modeling approach is proposed, which integrates both large-scale and small-scale fading effects through the combination of statistical distribution models with time-series analysis. This approach is designed to provide a more accurate representation of wireless channel characteristics within oceanic environments. For simplicity, it is assumed that the statistical distribution of the channel follows a Rayleigh distribution, and the SNR is modeled as a random variable following an exponential distribution. Additionally, the temporal correlation of channel parameters, including the impacts of wave height and wind speed variations on channel characteristics, is taken into account. This facilitates a more accurate representation of the dynamic propagation characteristics of wireless channels in maritime environments. The channel model is formulated as follows:

H (t) = g (t) \cdot α (t),

(1)

where

g (t) = \frac{1}{L^{n}}

represents large-scale fading, L denotes the distance, n is the path loss exponent and

α (t)

represents small-scale fading.

Furthermore, accurately obtaining instantaneous CSI in maritime environments is challenging. In traditional terrestrial communication systems, where complete CSI is available for each transmission link, the relay selected must satisfy the following condition:

K^{*} = arg max_{1 \leq k \leq K} R_{k} = arg max_{1 \leq k \leq K} \frac{1}{2} log (1 + \frac{P g_{k}}{σ^{2}}),

(2)

where

R_{k}

represents the end-to-end rate when the k-th relay is cooperating.

K^{*}

denotes the selected relay, while K is the total number of relay nodes.

2.2. Data Acquisition

In maritime communication networks, the communication quality among ships is influenced by multiple factors, including ship speed (v), direction (

θ

), distance (d) and wave height (h). At a specific time point t, ships may be elevated to different sea surface heights due to the influence of ocean waves. Let

h_{1}

,

h_{2}

and

h_{3}

denote the wave heights of the ships in their respective sea area. Assuming that wave height is a random variable with a Gaussian distribution, its fluctuations can significantly affect communication quality. Furthermore, particular attention is given to the wave height differences, denoted as

Δ h_{1} = | h_{1 t} - h_{2 t} |

and

Δ h_{2} = | h_{2 t} - h_{3 t} |

, as these differences directly affects the stability of signal transmission. To determine the optimal relay node ship, two critical distance parameters are introduced, which can be expressed as:

L_{1 t} = \sqrt{{(d_{1 t})}^{2} + {| h_{1 t} - h_{2 t} |}^{2}},

(3)

L_{2 t} = \sqrt{{(d_{2 t})}^{2} + {| h_{2 t} - h_{3 t} |}^{2}},

(4)

where, at time t,

L_{1 t}

represents the distance between the source node ship and the relay ship, while

L_{2 t}

represents the distance between the relay ship and the destination node ship. Additionally,

d_{1 t}

and

d_{2 t}

capture the horizontal spatial distances between adjacent ships at this time, t.

v_{1 t}

,

v_{2 t}

and

v_{3 t}

denote the scalar speed of ships at the time point t and are given by:

v_{1 t} = V_{1 t} + Δ_{1 t},

(5)

v_{2 t} = V_{2 t} + Δ_{2 t},

(6)

v_{3 t} = V_{3 t} + Δ_{3 t},

(7)

where

V_{1 t}

,

V_{2 t}

and

V_{3 t}

are the speeds measured by ship logs and

Δ_{1 t}

,

Δ_{2 t}

and

Δ_{3 t}

are the random deviations.

θ_{1 t}

,

θ_{2 t}

and

θ_{3 t}

denote the angles of cruising directions deviating from the nominal direction at the time point t.

In a Rayleigh fading channel, the fading coefficient

α

is a random variable, and the instantaneous SNR (

γ

) of the channel is also a random variable.

γ_{S, R_{k}}

and

γ_{R_{k}, D}

are independent exponential random variables representing the instantaneous SNR of the small-scale fading from S to R and from R to D, respectively. In the considered system, both small-scale fading and large-scale fading effects are taken into account. Consequently, the overall average end-to-end SNR reflects the combined influence of both fading types. The expression for the overall average SNR is presented as follows:

\bar{γ} = \frac{{P | α |}^{2}}{W L^{n}},

(8)

where P represents the transmission power, and W represents the noise power.

In the DF protocol, the parameters

γ_{R_{k}, 1}

and

γ_{R_{k}, 2}

denote the end-to-end SNRs of the first and second hops on the k-th relay’s communication link, respectively. The equivalent SNR for this link is determined as the minimum value between

γ_{R_{k}, 1}

and

γ_{R_{k}, 2}

. According to the maximum function criterion, the relay with the highest equivalent SNR is selected as the optimal relay. The expressions for

γ_{R_{k}}

and

γ^{*}

are given as follows:

\begin{matrix} γ_{R_{k}} & = min {{\bar{γ}}_{R_{k}, 1}, {\bar{γ}}_{R_{k}, 2}} \\ = min \{\frac{P_{t} {| α_{R_{k}, 1} |}^{2}}{W L_{1}^{n}}, \frac{P_{t} {| α_{R_{k}, 2} |}^{2}}{W L_{2}^{n}}\}, \end{matrix}

(9)

γ^{*} = max_{1 \leq k \leq K} γ_{R_{k}} .

(10)

In maritime communications systems, link quality is a critical determinant of communication performance and reliability. Thus, assessing the connectivity probability of the optimal link is essential to ensuring stable and robust communication. In a dual-hop network, the communication process involves the source node transmitting data to the destination node through an intermediate relay node. For a single relay node, the connectivity probability is defined as the probability that the SNR of the relay node exceeds the outage threshold.

The mathematical expression for the connectivity probability of a single relay node is presented in Equation (11) as follows:

\begin{matrix} P_{r} (γ_{R_{k}} > γ_{0}) & = P_{r} (min \{γ_{R_{k}, 1}, γ_{R_{k}, 2}\} > γ_{0}) \\ = P_{r} (γ_{R_{k}, 1} > γ_{0}, γ_{R_{k}, 2} > γ_{0}) \\ = P_{r} (γ_{R_{k}, 1} > γ_{0}) \cdot P_{r} (γ_{R_{k}, 2} > γ_{0}) \\ = \int_{a}^{b} \int_{γ_{0}}^{\infty} f_{1} (x) f_{λ} (y) d x d y + \int_{a}^{b} \int_{γ_{0}}^{\infty} f_{2} (x) f_{λ} (y) d x d y \\ = \int_{a}^{b} {[1 - F_{γ} (γ_{0})]}^{2} f_{λ} (y) d y \\ = \int_{a}^{b} e^{- 2 γ_{0} λ} \cdot \frac{1}{b - a} d y \\ = \frac{1}{2 γ_{0} (b - a)} (e^{- 2 a γ_{0}} - e^{- 2 b γ_{0}}) . \end{matrix}

(11)

For a system with multiple relay nodes, the connectivity probability must consider the scenario where the maximum SNR among all relay nodes exceeds the threshold (

γ_{0}

). Accordingly, the connectivity probability of the entire system is expressed as:

P_{r} (γ^{*} > γ_{0}) = 1 - P_{r} (γ^{*} \leq γ_{0}) .

(12)

Since

γ^{*}

represents the maximum SNR among all relay nodes, the connectivity probability can be further expressed as:

\begin{matrix} P_{r} (γ^{*} > γ_{0}) & = 1 - \prod_{k = 1}^{K} P_{r} (γ_{R_{k}} \leq γ_{0}) \\ = 1 - \prod_{k = 1}^{K} (1 - P_{r} (γ_{R_{k}} > γ_{0})) \\ = 1 - \prod_{k = 1}^{K} (1 - \frac{1}{2 γ_{0} (b - a)} (e^{- 2 a γ_{0}} - e^{- 2 b γ_{0}})), \end{matrix}

(13)

where K denotes the total number of relay nodes.

In a multi-hop network, data transmission is performed through hop-by-hop forwarding across multiple relay clusters until the destination node is reached. The success rate of each hop is determined by the selection of the relay node in the preceding hop and its corresponding communication performance.

The connection probability of the first hop is defined as the probability that the source node (S), through broadcasting, communicates with the K relay nodes in the first cluster

C_{1}

and selects the relay node with the highest SNR. The probability that the SNR of a single relay node exceeds

γ_{0}

is calculated as:

\begin{matrix} P_{r} (γ_{R_{k}, 1} > γ_{0}) & = \int_{a}^{b} [\int_{γ_{0}}^{\infty} λ e^{- λ x} d x] \frac{1}{b - a} d λ \\ = \int_{a}^{b} e^{- λ γ_{0}} \frac{1}{b - a} d λ \\ = \frac{1}{γ_{0} (b - a)} (e^{- a γ_{0}} - e^{- b γ_{0}}) . \end{matrix}

(14)

For the K independent relay nodes in cluster

C_{1}

, the probability that the maximum SNR among these nodes exceeds the threshold is expressed as:

\begin{matrix} P_{r} (γ_{C_{1}}^{*} > γ_{0}) & = 1 - \prod_{k = 1}^{K} P_{r} (γ_{R_{k}, 1} \leq γ_{0}) \\ = 1 - \prod_{k = 1}^{K} (1 - P_{r} (γ_{R_{k}, 1} > γ_{0})) \\ = 1 - \prod_{k = 1}^{K} (1 - \frac{1}{γ_{0} (b - a)} (e^{- a γ_{0}} - e^{- b γ_{0}})) . \end{matrix}

(15)

For the connection probability of the intermediate

M - 1

hops, each hop is influenced by the SNR distribution of the relay node selected in the preceding hop. Let the connection probability of the m-th cluster be denoted as

P_{r} (γ_{C_{m}}^{*} > γ_{0} ∣ γ_{C_{m - 1}}^{*})

. The SNR of the current relay node follows an exponential distribution, which is expressed as:

f_{γ_{R_{k}, m}} (x ∣ γ_{C_{m - 1}}^{*}) = λ_{m, k} (γ_{C_{m - 1}}^{*}) e^{- λ_{m, k} (γ_{C_{m - 1}}^{*}) x},

(16)

where

λ_{m, k}

represents the channel parameter of the k-th relay node in the m-th cluster, which is related to the SNR of the previous hop.

The cumulative distribution function (CDF) of the maximum SNR in the current cluster is given as:

\begin{matrix} P_{r} (γ_{C_{m}}^{*} \leq x ∣ γ_{C_{m - 1}}^{*}) & = \prod_{k = 1}^{K} P_{r} (γ_{R_{k}, m} \leq x ∣ γ_{C_{m - 1}}^{*}) \\ = \prod_{k = 1}^{K} F_{γ_{R_{k}, m}} (x ∣ γ_{C_{m - 1}}^{*}) . \end{matrix}

(17)

Therefore, the connection probability in the current cluster can be expressed as:

\begin{matrix} P_{r} (γ_{C_{m}}^{*} > γ_{0} ∣ γ_{C_{m - 1}}^{*}) & = 1 - P_{r} (γ_{C_{m}}^{*} \leq γ_{0} ∣ γ_{C_{m - 1}}^{*}) \\ = 1 - \prod_{k = 1}^{K} (1 - e^{- λ_{m, k} (γ_{C_{m - 1}}^{*}) γ_{0}}), m = 2, . . ., M . \end{matrix}

(18)

For the M-th cluster, the connection probability of the selected relay node to D is given by:

\begin{matrix} P_{r} (γ_{D} > γ_{0} ∣ λ_{M}) & = \int_{γ_{0}}^{\infty} λ_{M} e^{- λ_{M} x} d x \\ = e^{- λ_{M} γ_{0}} . \end{matrix}

(19)

Consequently, the overall connection probability is expressed as:

\begin{matrix} P_{r} (γ_{D} > γ_{0}) & = \int_{a}^{b} e^{- λ γ_{0}} \frac{1}{b - a} d λ \\ = \frac{1}{b - a} \cdot \frac{e^{- a γ_{0}} - e^{- b γ_{0}}}{γ_{0}} . \end{matrix}

(20)

The connectivity probability of the entire system must account for the probabilities of all individual hops. It can be expressed as:

P_{total} = P_{r} (γ_{C_{1}}^{*} > γ_{0}) \cdot \prod_{m = 2}^{M} P_{r} (γ_{C_{m}}^{*} > γ_{0} ∣ γ_{C_{m - 1}}^{*}) \cdot P_{r} (γ_{D} > γ_{0} ∣ γ_{C_{M}}^{*}) .

(21)

To comprehensively understand the changing trends in the sustainable propagation link states, extensive data were collected while considering multiple factors that influence ship-to-ship communication. By integrating historical data from maritime networks, a thorough analysis of link state variations was performed, resulting in the development of representative training samples.

3. Relay Network Based on Deep Learning

The proposed algorithm utilizes a CNN-GRU architecture to optimize relay link prediction and evaluate link connectivity, thereby enhancing communication quality and link reliability in maritime environments while enabling longer successful communication cycles. The model efficiently extracts features from input data through Convolutional Neural Network (CNN) layers and passes the processed data to Gated Recurrent Unit (GRU) layers for prediction and adaptive adjustments. Similar to Long Short-Term Memory (LSTM) networks, GRUs effectively handle time-series data, extracting useful temporal information and identifying long-term dependencies. The core structure of the algorithm, as shown in Figure 2, consists of two main components: a feature extractor and an estimator.

The dynamic nature of the maritime environment and the motion of ships pose significant challenges to predicting the optimal relay link. The proposed solution addresses these challenges by utilizing features extracted from time-series data. It incorporates several factors that influence communication quality between ships, including their speed and direction of movement, the distance between ships, signal quality (SNR) and the presence of K relays. By collectively analyzing these factors, the optimal relay is selected, and the connectivity probability of the optimal relay link is determined.

The input features are structured as parameters over a series of time points, represented as samples X with multiple attributes. These samples are fed into the convolutional layer (CL) of the CNN model and are defined as:

X = {x_{1}, x_{2}, \dots, x_{t}},

(22)

x_{t} = [v_{1 t}, v_{2 t}, v_{3 t}, θ_{1 t}, θ_{2 t}, θ_{3 t}, L_{1 t}, L_{2 t}, h_{1 t}, h_{2 t}, h_{3 t}, γ_{1 t}, γ_{2 t}] .

(23)

The output data correspond to the prediction label and connectivity probability of training samples, which can be expressed as:

Y = {({\hat{K}}_{1}^{*}, P_{1}), ({\hat{K}}_{2}^{*}, P_{2}), \dots, ({\hat{K}}_{n}^{*}, P_{n})},

(24)

where

{\hat{K}}_{i}^{*}

represents the predicted label of the i-th time slot, and

P_{i}

denotes the corresponding connectivity probability.

In maritime networks, huge amounts of historical data are produced during navigation and can be utilized as training samples. In this work, it is neither economical nor practical to collect these data within a short time. To validate the proposed algorithm and to reduce the cost of experiment, we designed a software-aided data generating method. This method implements the data acquisition presented in Section 2.2 and also generates both navigation and hydrological parameters. Input data and output labels are produced out of these data and packed into training and prediction samples. The Algorithm 1 of method is summarized as follows.

Algorithm 1: Data Generating Method.

Step 1: Generate input data

Set up parameters.

1.1 Generate distances

Generate

d_{1}

and

d_{2}

with uniform distribution

d_{i} \sim U (d_{a}, d_{b})

, which reflects the spatial randomness of node deployment within a defined operational area.

1.2 Generate wave heights

Generate

h_{1}

,

h_{2}

and

h_{3}

with Gaussian distribution

h_{i} \sim N (μ, σ^{2})

, which is widely used in oceanography to approximate the variability of wave heights under diverse sea conditions.

1.3 Generate speed and direction

Generate

v_{1}, v_{2}

, and

v_{3}

,

θ_{1}, θ_{2}

, and

θ_{3}

follow the uniform distribution

v_{i} \sim U (v_{a}, v_{b})

,

θ_{i} \sim U (θ_{a}, θ_{b})

. These distributions capture the random nature of node movement while remaining within environmental and operational constraints.

1.4 Generate SNR

Assuming a Rayleigh channel, the generated channel parameter

λ

follows a uniform distribution

λ \sim U (λ_{a}, λ_{b})

, which models the random yet bounded variability in channel conditions.

The SNR vectors

γ_{1}

and

γ_{2}

follow the exponential distribution

γ_{i} \sim E x p (λ)

, which is consistent with the theoretical Rayleigh fading model.

1.5 Pack data

Repeat steps 1.1–1.4 for

2 K

times.

Re-organize all data and pack them into

X = [X_{t}, X_{p}]

.

Step 2: Generate output data

2.1

{\hat{K}}^{*}

is the selection result of the optimal strategy.

2.2 Monte Carlo method is used to calculate the connectivity probability of the optimal result.

Pack the results into

Y = [Y_{t}, Y_{p}]

.

Step 3: Predict relay selection

3.1 Pack the training samples

[X_{t}, Y_{t}]

and train the network.

3.2 Feed

X_{p}

into the network and obtain the results

Y_{p}^{'}

.

3.3 Compute the mean squared error (MSE) by comparing

Y_{p}

and

Y_{p}^{'}

.

3.1. Data Features Extractor

The feature extractor is specifically designed for highly mobile nodes in dynamic networks, with a focus on efficiently capturing essential patterns from fluctuating input data. Initially, CNN layers are utilized to extract latent features from raw data. These layers transform low-level input variations into high-level representations, which are subsequently passed to fully connected layers for further processing. This hierarchical structure significantly enhances the model’s capacity to recognize complex relationships, such as those between communication links. CNNs, as multilayer neural networks within a deep supervised learning framework, are designed to process diverse data types, including time-series and spatially structured data. Although traditionally applied to two-dimensional image processing tasks, the same principles can be effectively extended to handle one-dimensional sequential data [22]. In the proposed scheme, the CNN architecture incorporates convolutional and pooling layers, where each convolutional layer applies a set of kernels for feature extraction. The resulting features are passed through a Rectified Linear Unit (ReLU) activation function, which introduces non-linearity into the model. Pooling layers, particularly max-pooling, play a critical role in reducing the feature dimensions while retaining essential information, thereby improving computational efficiency and mitigating overfitting risks. The integration of convolutional and pooling layers not only reduces the complexity of the feature space but also ensures robustness to variations in the input data. By preserving the invariance of feature mappings, the model adapts effectively to dynamic input conditions. The max-pooling operation can be represented by (25) as:

y_{t}^{l} = max p o o l i n g (x_{t}^{l - 1}, s_{scale}, s_{stride}),

(25)

the pooling window size

s_{scale}

is set to 2 × 2, and the stride

s_{stride}

is set to 2. These settings are chosen to balance computational efficiency and feature preservation, ensuring effective downsampling of the input feature maps while maintaining critical data characteristics.

The CNN extracts data features through layer-by-layer convolution and pooling operations. The filter’s window size and stride can be flexibly set based on the input data dimensions and the complexity of the features to be captured. The convolutional layers of the CNN are capable of identifying spatial features relevant to relay node selection, such as wave height, wind speed and ocean current direction. The extraction of these features helps enhance the predictive performance of the model. The calculation formula is shown in Equation (26).

l_{t} = tanh (x_{t} * k_{t} + b_{t}),

(26)

where

l_{t}

represents the output value after convolution, tanh is the activation function,

x_{t}

is the input vector,

k_{t}

is the weight of the convolution kernel and

b_{t}

is the bias of the convolution kernel.

The tanh activation function introduces non-linearity, which is essential for enabling the CNN to learn complex feature mappings. Additionally, the tanh function outputs values in the range of [−1, 1], ensuring smooth transitions between negative and positive inputs, which is beneficial for capturing variations in features such as wave height and navigation direction, thereby enhancing the robustness and stability of the network.

3.2. Sequence Learning Estimator

A training model consisting of a GRU layer and a Dropout layer is defined, where the GRU model contains only two gates, the reset gate and the update gate, and the specific structure is illustrated in Figure 3. Compared to LSTM, the GRU has a simpler architecture, fewer parameters and a faster training speed. The output data from the feature extractor are fed into the GRU, which performs temporal modeling and captures temporal dependencies within the sequential data.

The GRU layer processes sequential data using its reset and update gates. At each time step t, the update gate (

z_{t}

) determines how much of the previous hidden state (

h_{t - 1}

) should be retained in the current hidden state, while the reset gate (

r_{t}

) decides how much of the past information to forget. The output of the GRU layer is a sequence of hidden states, which encode the temporal dynamics of the input data. The calculations are as follows:

z_{t} = σ (W_{z} \cdot [h_{t - 1}, x_{t}] + b_{z}),

(27)

r_{t} = σ (W_{r} \cdot [h_{t - 1}, x_{r}] + b_{r}),

(28)

{\tilde{h}}_{t} = \tan h (W \cdot [r_{t} * h_{t - 1}, x_{t}] + b),

(29)

h_{t} = (1 - z_{t}) * h_{t - 1} + z_{t} * {\tilde{h}}_{t},

(30)

where W and b are weights and bias,

σ

is the sigmoid activation function and

\tilde{h}

is the candidate hidden state, while

h_{t}

is the final hidden state, respectively.

The hidden states produced by the GRU, specifically the hidden state of the last time step (

h_{T}

), are passed through a Dropout layer to prevent overfitting and improve generalization. Dropout randomly sets a fraction of the units in

h_{T}

to zero during training, reducing the model’s reliance on specific neurons. This prevents the model from overfitting to particular features and encourages better generalization. The resulting feature vector, which encapsulates temporal dependencies from the entire input sequence, is then fed into a fully connected (FC) layer. Finally, accurate classification predictions are achieved through the Softmax classifier.

The integration of GRU layers into the model architecture not only accelerates the training process but also enhances the ability to process effective link features extracted by CNNs. By leveraging the spatial feature extraction capabilities of CNNs and the sequential modeling strengths of GRUs, the proposed algorithm captures both the spatial correlations present in the input data and the temporal dependencies across time steps. Specifically, spatial features are extracted through CNN layers and then sequentially passed to the GRU layers, where the temporal dynamics of the data are modeled. This combined architecture allows the model to effectively adapt to the dynamic environment of maritime communication networks. As a result, the algorithm achieves more accurate data modeling and prediction, addressing the challenges posed by real-time communication in complex oceanic environments.

4. Simulation Experiment and Analysis

In deep learning-based relay networks, the acquisition of training data plays a critical role in model training, directly impacting the effectiveness and accuracy of the experiments. MATLAB R2022a simulations are utilized to generate raw data by modeling key channel characteristics in maritime communication environments, including Rayleigh fading and large-scale path loss. These simulations are subsequently used to construct channel dataset samples for training. The key communication parameter settings for generating the dataset are summarized in Table 1.

Before training the model, the sample data must be preprocessed. As the deep learning-based maritime communication relay network optimization model follows a supervised learning paradigm, each set of relay link data must be labeled with the optimal relay and its corresponding connectivity probability. During the training process, the network architecture is iteratively adjusted, and parameter values are fine-tuned to maximize predictive performance. Table 2 provides the parameter settings for the CNN-GRU method used in this experiment.

To validate the consistency between the theoretical analysis and the simulation results for connectivity probability, Figure 4 presents the connectivity probability under different system parameters. Figure 4a,b demonstrate that an increase in the number of relay nodes K significantly enhances the connectivity probability, while higher SNR thresholds

γ_{0}

lead to a reduction. Figure 4c illustrates a gradual decline in connectivity probability as the number of clusters M increases, emphasizing the trade-off inherent in multi-cluster systems. The high levels of consistency between the theoretical and simulation results validate the accuracy of the formulas and the reliability of the methods.

The accuracy of the model is evaluated by comparing the predicted labels of the test set with the corresponding optimal choices. To assess the link prediction performance of the CNN-GRU model, the mean squared error (MSE) is employed as the performance metric. The MSE is defined as

M S E = \frac{1}{N} \sum_{i = 1}^{N} {(p_{i} - {\hat{p}}_{i})}^{2},

(31)

where N represents the number of predictions,

p_{i}

denotes the true value and

{\hat{p}}_{i}

represents the predicted value. The smaller the MSE value, the better the prediction performance.

Figure 5 illustrates the variation in the model accuracy with the increase in the number of relays. It can be observed that the accuracy of the CNN-GRU-based relay selection scheme decreases slightly as K grows but remains consistently above 92%, which is within an acceptable range. This trend can be attributed to the fact that when the number of relays is small (e.g., two relays), the model can more effectively learn and memorize the dataset features, resulting in higher accuracy on the training set. However, as the number of relays increases, the model tends to overfit the training data, leading to a decrease in performance on the test data.

Figure 6 illustrates the prediction accuracy of the model’s labels over the entire time-series. It is evident that when comparing the predicted relay choices of the test set with corresponding optimal relay labels, the accuracy of the CNN-GRU-based model stabilizes at approximately 0.90 after 100 iterations, effectively meeting the expected performance. This result can be attributed to the model’s training process, where the CNN-GRU-based relay selection model maps the feedback data from each link to the corresponding optimal label. Through iterative training, the model establishes connections and retains the learned mappings. During the prediction phase, the model accurately classifies and predicts the data based on these established mappings.

To evaluate the effectiveness of the proposed scheme, a series of experiments were conducted to assess its performance. The comprehensive analysis and experimental results are as follows.

The MSE performance of the proposed scheme under different Rayleigh fading channel parameters is depicted in Figure 7. In such channels, the received SNR follows an exponential distribution, where the distribution parameter

λ

was set to follow two different uniform distributions:

U (1, 2)

and

U (1, 10)

. When

λ

follows the distribution

U (1, 2)

, the smaller fluctuation range of

λ

results in a limited set of samples during training. This restricted range introduces a bias into the generated samples, as they fail to represent the diversity of real-world conditions. Consequently, the model trained on these biased samples demonstrates reduced performance, as reflected in the higher MSE values. Conversely, when

λ

follows

U (1, 10)

, the broader fluctuation range provides a more diverse and representative sample set. This increased diversity improves the model’s ability to generalize to different channel conditions, thereby reducing the MSE and enhancing performance.

Figure 8 analyzes the impact of different input sequence lengths on system performance. Longer input sequences provide more historical information, enabling the model to better capture the temporal correlations of signals and thereby enhancing its performance. In contrast, shorter input sequences lead to information loss and insufficient model complexity, which negatively affect system performance. When the input length is set to 56, the system demonstrates the poorest performance due to an imbalance between model complexity and information richness.

Maritime communication systems are significantly more affected by environmental conditions compared to terrestrial systems, with wave height being one of the critical influencing factors. Higher wave heights introduce greater signal fluctuation and instability due to harsher transmission conditions, leading to increased MSE. Conversely, lower wave heights provide a more stable environment, which improves signal quality and reduces MSE.

To analyze this impact, we evaluated the MSE performance of the proposed scheme under two different average wave heights (

μ = 1.5

and

μ = 4.5

), as shown in Figure 9. The results indicate that as the wave height increases (i.e., the average wave height

μ

grows), the MSE also increases accordingly.

Compared to the impact of

λ

parameters on the MSE, wave height primarily affects the physical stability of signals, whereas Rayleigh fading introduces statistical variations in channel conditions. Both factors have a significant impact on system performance, highlighting the necessity of considering environmental variability in the design of robust maritime communication systems.

5. Conclusions

In this paper, we explore the problem of relay-assisted maritime communication networks and propose a channel modeling method that closely represents real maritime environments. To effectively address the uncertainties associated with maritime conditions, we introduce a deep learning-based optimization scheme for maritime relay networks. This scheme predicts relay selection and calculates connectivity probabilities over multiple time points without relying on instantaneous CSI. Through simulation experiments and analysis, the proposed scheme demonstrates high accuracy and low MSE when compared to the optimal strategy. The model effectively captures the impact of system and environmental variations on performance, especially under different conditions of wave height and Rayleigh fading. Although the MSE fluctuates with changes in wave height and fading fluctuation range, the overall accuracy remains stable, validating the effectiveness and robustness of the method and demonstrating its reliability in complex maritime communication environments. However, the current model does not explore in depth the potential impact of extreme weather conditions (such as storms, fog, etc.) on performance. Future research can further investigate the performance under these extreme environmental conditions and enhance the model’s robustness in such scenarios. Furthermore, the real-world variability of the dataset may affect the model’s generalization ability, so future work will focus on optimizing the scheme in scenarios with higher data variability.

Author Contributions

Conceptualization and formal analysis, N.G. and X.W.; methodology, X.W.; software, N.G.; validation, N.G.; investigation, N.G.; writing—original draft preparation, N.G.; writing—review and editing, N.G. and X.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Innovation Program of the Shanghai Municipal Education Commission of China under grant 2021-01-07-373 00-10-E00121 and the Shanghai Sailing Program under grant 20YF1416700.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Campos, R.; Oliveira, T.; Cruz, N.; Matos, A.; Almeida, J.M. BLUECOM+: Cost-effective broadband communications at remote ocean areas. In Proceedings of the OCEANS 2016—Shanghai, Shanghai, China, 10–13 April 2016; pp. 1–6. [Google Scholar]
Wakayama, C.Y.; Plate, R.S.; Morey, D.F.; Zabinsky, Z.B. Designing a Communication Relay Network for Multi-Domain Maritime Operations. In Proceedings of the MILCOM 2024—2024 IEEE Military Communications Conference (MILCOM), Washington, DC, USA, 28 October–1 November 2024; pp. 481–486. [Google Scholar]
Li, X.; Feng, W.; Chen, Y.; Wang, C.-X.; Ge, N. Maritime Coverage Enhancement Using UAVs Coordinated With Hybrid Satellite-Terrestrial Networks. IEEE Trans. Commun. 2020, 68, 2355–2369. [Google Scholar] [CrossRef]
Nomikos, N.; Giannopoulos, A.; Kalafatelis, A.; Özduran, V.; Trakadas, P.; Karagiannidis, G.K. Improving Connectivity in 6G Maritime Communication Networks With UAV Swarms. IEEE Access 2024, 12, 18739–18751. [Google Scholar] [CrossRef]
Yuan, J.; Wang, H.; Chen, Y.; Qin, Z.; Zhang, L.; Sun, W. Relay Selection Based on Location Prediction in Collaborative Communication. In Proceedings of the 2020 IEEE 20th International Conference on Communication Technology (ICCT), Nanning, China, 28–31 October 2020; pp. 785–789. [Google Scholar]
Paek, M.-J.; Na, Y.-J.; Lee, W.-S.; Ro, J.-H.; Song, H.-K. A Novel Relay Selection Scheme Based on Q-Learning in Multi-Hop Wireless Networks. Appl. Sci. 2020, 10, 5252. [Google Scholar] [CrossRef]
Tolebi, G.; Amirgaliyev, Y.; Nauryzbayev, G. Novel Scalable DNN-based Relay Selection Scheme for Wireless Powered Communication Networks. In Proceedings of the 2023 International Balkan Conference on Communications and Networking (BalkanCom), İstanbul, Turkiye, 5–8 June 2023; pp. 1–6. [Google Scholar]
Woo, W.H.; Annur, R.; Ponnusamy, V. Performance Evaluation for Relay Selection on Device-to-Device (D2D) Communications in Rayleigh Fading. In Proceedings of the 2021 3rd International Conference on Advancements in Computing (ICAC), Colombo, Sri Lanka, 9–11 December 2021; pp. 140–145. [Google Scholar]
Ge, L.; Jiang, S. An Efficient Opportunistic Routing Based on Prediction for Nautical Wireless Ad Hoc Networks. J. Mar. Sci. Eng. 2022, 10, 789. [Google Scholar] [CrossRef]
Wang, X. Decision-Tree-Based Relay Selection in Dualhop Wireless Communications. IEEE Trans. Veh. Technol. 2019, 68, 6212–6216. [Google Scholar] [CrossRef]
Yang, K.; Zhu, S.; Dan, Z.; Tang, X.; Wu, X.; Ouyang, J. Relay Selection for Wireless Cooperative Networks using Adaptive Q-learning Approach. In Proceedings of the 2019 Cross Strait Quad-Regional Radio Science and Wireless Technology Conference (CSQRWC), Taiyuan, China, 8–21 July 2019; pp. 1–5. [Google Scholar]
Wang, X.; Liu, F. Data-Driven Relay Selection for Physical-Layer Security: A Decision Tree Approach. IEEE Access 2020, 8, 12105–12116. [Google Scholar] [CrossRef]
Soltani, M.; Pourahmadi, V.; Mirzaei, A.; Sheikhzadeh, H. Deep Learning-Based Channel Estimation. IEEE Commun. Lett. 2019, 23, 652–655. [Google Scholar] [CrossRef]
Alvin Yau, K.-L.; Syed, A.R.; Hashim, W.; Qadir, J.; Wu, C.; Hassan, N. Maritime Networking: Bringing Internet to the Sea. IEEE Access 2019, 7, 48236–48255. [Google Scholar]
Wang, J.; Zhou, H.; Li, Y.; Sun, Q.; Wu, Y.; Jin, S.; Quek, T.Q.S.; Xu, C. Wireless Channel Models for Maritime Communications. IEEE Access 2018, 6, 68070–68088. [Google Scholar] [CrossRef]
Jiang, S. Marine Internet for Internetworking in Oceans: A Tutorial. Future Internet 2019, 11, 146. [Google Scholar] [CrossRef]
Joung, J. Machine Learning-Based Antenna Selection in Wireless Communications. IEEE Commun. Lett. 2016, 20, 2241–2244. [Google Scholar] [CrossRef]
Abdelreheem, A.; Omer, O.A.; Esmaiel, H.; Mohamed, U.S. Deep Learning-Based Relay Selection in D2D Millimeter Wave Communications. In Proceedings of the 2019 International Conference on Computer and Information Sciences (ICCIS), Sakaka, Saudi Arabia, 3–4 April 2019; pp. 1–5. [Google Scholar]
Jiang, W.; Schotten, H.D. A Simple Cooperative Diversity Method Based on Deep-Learning-Aided Relay Selection. IEEE Trans. Veh. Technol. 2021, 70, 4485–4500. [Google Scholar] [CrossRef]
Sinha, S.; Kwon, S.Y.; Park, S.H. A Survey of Deep/Machine Learning in Maritime Communications. In Proceedings of the 2023 Fourteenth International Conference on Ubiquitous and Future Networks (ICUFN), Paris, France, 4–7 July 2023; pp. 856–860. [Google Scholar]
Ge, L.; Jiang, S.; Wang, X.; Xu, Y.; Feng, R.; Zheng, Z. Link Availability Prediction Based on Machine Learning for Opportunistic Networks in Oceans. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 2021, 105, 598–602. [Google Scholar] [CrossRef]
Kuang, D.; Xu, B. Predicting kinetic triplets using a 1d convolutional neural network. Thermochimca Acta 2018, 669, 8–15. [Google Scholar] [CrossRef]

Figure 1. Maritime network foundation model.

Figure 2. CNN-GRU algorithm network structure.

Figure 3. Architecture of the GRU.

Figure 4. Connectivity probability under different system parameters.

Figure 5. Curve of accuracy changes with increasing K.

Figure 6. Relay prediction accuracy curve across the time series.

Figure 7. MSE for different Rayleigh fading channel parameters.

Figure 8. MSE relative to input length in Rayleigh fading.

Figure 9. MSE relative to wave height in Rayleigh fading.

Table 1. Communication parameter.

Parameter	Value
Carrier frequency	5.8 GHz
Transmit antenna height	8 m
Receive antenna height	8 m
Number of relays	4
Transmission power	20∼25 dBm
Noise power	10∼11 dBm
Path loss	90∼200 dB
SNR	10∼30 dB
Speed	$0 \leq v (t) \leq 30 km / h$
Direction	$- π / 6 \leq θ (t) \leq π / 6$
Distance	20 km

Table 2. Model simulation parameters.

Parameter	Value
CNN convolutional filters	64
Convolution kernel size	$2 \times 1$
Activation function	Relu
GRU hidden units	16
Dropout rate	0.2
Batch size	128
Epochs	100
Learning rate	0.01

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Guo, N.; Wang, X. Deep Learning-Based Optimization for Maritime Relay Networks. Appl. Sci. 2025, 15, 1160. https://doi.org/10.3390/app15031160

AMA Style

Guo N, Wang X. Deep Learning-Based Optimization for Maritime Relay Networks. Applied Sciences. 2025; 15(3):1160. https://doi.org/10.3390/app15031160

Chicago/Turabian Style

Guo, Nianci, and Xiaowei Wang. 2025. "Deep Learning-Based Optimization for Maritime Relay Networks" Applied Sciences 15, no. 3: 1160. https://doi.org/10.3390/app15031160

APA Style

Guo, N., & Wang, X. (2025). Deep Learning-Based Optimization for Maritime Relay Networks. Applied Sciences, 15(3), 1160. https://doi.org/10.3390/app15031160

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Deep Learning-Based Optimization for Maritime Relay Networks

Abstract

1. Introduction

2. System Model

2.1. Marine Communication Environment

2.2. Data Acquisition

3. Relay Network Based on Deep Learning

3.1. Data Features Extractor

3.2. Sequence Learning Estimator

4. Simulation Experiment and Analysis

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI