A Secure and Efficient Distributed Semantic Communication System for Heterogeneous Internet of Things Devices

Weihao Zeng, Xinyu Xu, Qianyun Zhang, Jiting Shi Zhijin Qin, Zhenyu Guan W. Zeng, X. Xu, Q. Zhang, J. Shi and Z. Guan are with the School of Cyber Science and Technology, Beihang University, Beijing 100191, China (email: {zengweihao, 20231010, zhangqianyun, shijiting, guanzhenyu}@buaa.edu.cn).Z. Qin is with the Department of Electronic Engineering, Tsinghua University, Beijing 100084, China (e-mail: qinzhijin@tsinghua.edu.cn)

Abstract

Semantic communications have emerged as a promising solution to address the challenge of efficient communication in rapidly evolving and increasingly complex Internet of Things (IoT) networks. However, protecting the security of semantic communication systems within the distributed and heterogeneous IoT networks is critical issues that need to be addressed. We develop a secure and efficient distributed semantic communication system in IoT scenarios, focusing on three aspects: secure system maintenance, efficient system update, and privacy-preserving system usage. Firstly, we propose a blockchain-based interaction framework that ensures the integrity, authentication, and availability of interactions among IoT devices to securely maintain system. This framework includes a novel digital signature verification mechanism designed for semantic communications, enabling secure and efficient interactions with semantic communications. Secondly, to improve the efficiency of interactions, we develop a flexible semantic communication scheme that leverages compressed semantic knowledge bases. This scheme reduces the data exchange required for system update and is adapt to dynamic task requirements and the diversity of device capabilities. Thirdly, we exploit the integration of differential privacy into semantic communications. We analyze the implementation of differential privacy taking into account the lossy nature of semantic communications and wireless channel distortions. An joint model-channel noise mechanism is introduced to achieve differential privacy preservation in semantic communications without compromising the system’s functionality. Experiments show that the system is able to achieve integrity, availability, efficiency and the preservation of privacy.

Index Terms:

Semantic communications, Internet of Things, blockchain, differential privacy.

I Introduction

The proliferation of the Internet of Things (IoT) has led to a significant increase in data volumes and network connectivity. This rapid expansion highlights the necessity for efficient communication systems within IoT networks. Semantic communications[1, 2] are novel communication paradigms that focus on directly conveying intended meanings and sharing only the essential information relevant to the receiver’s needs, i.e. semantics. Semantic communication systems are built on neural network models and shared knowledge bases, which combine to effectively extract semantic features from diverse sources and accurately interpret them to facilitate execution of specific tasks. It has emerged as a promising approach to achieve efficient communication in IoT scenarios, and pave the way for more intelligent IoT tasks[3, 4].

However, the distributed and heterogeneous natures of IoT networks and the presence of malicious attackers pose significant challenges to the security and practical deployment of semantic communication systems. Unlike end-to-end semantic communications[5], semantic communication systems within IoT networks require more complex multi-party interactions. To be specific, a critical concern is to synchronize semantic communication models and shared knowledge bases among multiple participants to prevent inaccurate extraction and interpretation of semantic information. In addition, ever-emerging communication tasks in IoT scenarios necessitate ongoing updates of semantic communication systems. This requires IoT devices to collect evolving data about communication tasks to update neural network models and tune knowledge bases. The data is inevitably distributed across different devices. These devices require collaborative model training, such as federated learning[6], to exploit this distributed data. It is worth noting that the above interactions are inherently communication tasks, which can also be accomplished through semantic communications, thereby enhancing the efficiency of the entire semantic communication system.

In order to establish a secure distributed semantic communication system, several issues need to be addressed. The first challenge is to achieve interaction integrity, authentication and availability to securely maintain semantic communication systems among IoT devices. The integrity and authentication of interactions are threatened by various attacks, such as data tampering, data falsification and man-in-the-middle attacks[7]. Adversaries can maliciously modify or falsify the information exchanged, causing conflicts among models and knowledge bases of each devices. They can also introduce perturbations into the information related to the collaborative system update, impeding the convergence of models and the representation of knowledge bases[8]. Furthermore, the lossy transmission nature of semantic communication raises significant issues for verifying the integrity and authentication of the exchanged information. Traditional verification mechanisms cannot be directly applied to semantic communications, as small distortions from the semantic communication process can make the verification fail. It hinders semantic communications to facilitate efficient interactions.

The availability of interactions is also threatened. The inherent dynamics of IoT network topology, along with the potential for device malfunctions, disconnections, and communication delays, pose difficulties in maintaining availability of interactions among IoT devices. External attacks, such as distributed denial-of-service attack, also present threats that compromise the availability of interactions. The aforementioned problems with the integrity, authentication and availability emphasize the importance of developing a interaction framework that is trustworthy and fault-tolerant while being able to leverage semantic communications for efficient interactions.

Second, the diverse transmission and computation capabilities of IoT devices are obstacles to the practical deployment of semantic communication systems. During interactions of update and synchronization, the direct exchange of entire models and knowledge bases among IoT devices imposes severe burdens on these transmission-limited IoT devices. This is due to the substantial size of the current implementation of models[3] and knowledge base, such as knowledge graph[9, 10], training datasets[11] and feature vector sets[12], which result in overwhelming data transmission requirements. In addition, the immense data size of models and knowledge bases significantly increases the computational overhead of model inference. This challenge is particularly acute for IoT devices with limited computing power, leading to higher latency and a severe degradation of overall system efficiency. Therefore, it is imperative to develop semantic communication system that facilitates efficient updates and synchronizations with minimal data exchange. This system also must possess scalability and elasticity to accommodate a diverse range of devices and tasks.

Third, preserving the privacy of IoT devices throughout the maintenance and utilization of semantic communication systems is also a critical issue that needs to be addressed. In the context of collaborative training for system maintenance, although integrating semantic communications with federated learning[13] limits the exposure of individual training data to other parties by keeping training data localized and only transmitting training result, privacy concerns remain a pressing issue. The gradient leakage attack[14] is one of the most serious privacy attacks in collaborative model training, where adversaries maliciously extract privacy information contained in gradients exchanged among IoT devices. Similar considerations apply to the usage of semantic communication systems as to system maintenance. For tasks that focus on data analysis and do not require precise data recovery, semantic communications deliver only the semantics, while leaving the original data local. The sensitive information in the raw data remains implicit in the semantics and can be inferred by methods such as model inversion attacks[15, 16]. Differential privacy (DP)[17, 18, 19] has emerged as a prominent framework for ensuring privacy in data analysis. It provides a rigorous mathematical defend against model inversion attacks and gradient leakage attack. Therefore, there is a necessity for a differential privacy mechanism in semantic communication systems.

To tackle above challenges presented in semantic communications within IoT networks, we propose a secure and efficient distributed semantic communication system. Our contributions are presented in detail as follows.

1.

We propose a blockchain-based interaction framework for secure updates and synchronization of the distributed semantic communication system, ensuring the integrity, authentication and availability of interactions. Furthermore, an integrity and authentication verification mechanism for semantic communications is designed. It enables the application of semantic communications in secure interactions.
2.

We develop a flexible semantic communication scheme for IoT scenarios based on high-level representational and compressed semantic knowledge bases. Mainly by updating and synchronizing semantic knowledge vectors, semantic communication systems are flexibly adapted to dynamically changing task requirements, and reduce the amount of data exchange required during system maintenance. The scheme offers flexibility for IoT devices to strike a balance between transmission and computation consumption by adjusting the size of knowledge bases utilized in semantic communications.
3.

We explore the differential privacy model in semantic communication, which takes into account both the lossy nature of semantic communication and the distortion caused by wireless channels. Building upon our model, we introduce an joint model-channel noise mechanism that optimally adds noise into signal symbols to achieve differential privacy in semantic communications. The mechanism is able to uniformly and transparently provide differential privacy protection for any data analysis task in semantic communications.

The rest of this article is organized in the following way. In Section II, we present the related work. In Section III, we present system model including scenario description, semantic communication system model with semantic knowledge base and problem definition. Section IV introduces an overview of the proposed system, followed by a detailed description of three important schemes, blockchain based interaction framework, flexible semantic communication scheme and an joint model-channel noise mechanism. The performance of the system are evaluated in Section V. Finally, we conclude our work in Section VI.

II Related Work

There are many studies that discuss the security of semantic communication systems from a holistic perspective. In [4], authors evaluated classical security techniques in the context of wireless semantic communication security, and the paper also included an analysis of attack and defense methods specific to semantic communications. The multi-domain security vulnerabilities of using deep neural networks for semantic communications are discussed in [20]. The paper also explored targeted and non-targeted adversarial attacks on computer vision and wireless channel with small perturbations. The outcomes of these attacks demonstrated the potential to manipulate the semantics of transmitted information. Authors in [21] clarified the requirements for secure semantic communication and presented the multiple potential security threats that exist at each step of semantic communications, along with the possible defenses against these threats.

In addition to the overall perspective, the following section describes works on semantic communication security from two specific perspectives: data integrity and privacy protection. In semantic communication systems, risks of data integrity arising from data tampering and forgery exist at all stages of data collection, model training, model inference and wireless transmission. To ensure the data integrity in semantic communication system, a semantic signature generation method is proposed in [22] based on generative adversarial networks to protect the integrity of semantics against adversarial perturbations over the end-to-end semantic communication system. Moreover, in distributed semantic communication systems, with a focus on efficient and secure information interaction in Web 3.0 and Metaverse, authors in [23, 24] integrate blockchain with semantic communications. Tamper-resistant mechanisms inherent in blockchain and smart contracts is utilized to verify the integrity and authenticity of semantics, and validate the quality of semantics. However, the current studies lack authentication of data sources for lossy semantics, and no proper integrity verification mechanism has been proposed for lossy transmission of semantic communications.

Attacks against privacy generally occur in the model inference phase. A combined attack involving model inversion attack and eavesdropping attack for semantic communication is proposed in [15]. The attacker first intercepts the semantic information transmitted in the wireless channel and then tries to reconstruct the original information by inverting the model, which leads to the leakage of the user’s private information. To resist the model inversion attack, a defense method based on random semantics permutation and substitution[15] is proposed to prevent the attacker from efficiently reconstructing the original information. Authors in [25] proposed an information bottleneck and adversarial learning approach to protect users’ privacy against model inversion attacks, where adversarial learning is used to train encoders to fool adversaries by maximizing reconstruction distortion. To address the privacy risk caused by knowledge discrepancies among communicating nodes, a knowledge discrepancy oriented privacy preserving method for semantic communication is proposed in [26]. Knowledge mapping and disambiguation reduce the knowledge discrepancy between the sender and receiver, and the use of path-cutting module prevent sensitive data from being leaked. A framework is proposed to address the utility-informativeness-security trade-off in the discrete task-oriented semantic communications[27]. It leverage adversarial learning to achieve privacy-preserving. Current privacy-preserving schemes in semantic communications are limited to specific scenarios and tasks, and lack mathematically rigorous proof of privacy-preserving effectiveness.

III System Model

III-A Scenario Description

We investigate the application of semantic communications in distributed IoT networks, as illustrated in Fig. 1. Within IoT networks, IoT devices exhibit a wide range of transmission and computation capabilities. These devices leverage semantic communication system to exchange semantics associated with specific tasks. These tasks, ranging from simple data collection to complex data analysis, are evolving in response to ever-changing environmental conditions. These devices not only simply utilize static semantic communication models and knowledge bases, but also perform interactions to continuously update and synchronize the semantic communication system. The objective of the system update is to keep pace with the evolving demands of IoT tasks. The aim of synchronizing models and knowledge among participants is to ensure accurate extraction and interpretation of semantic information.

There are attackers in IoT scenarios, categorized into internal and external attackers. Internal attackers within IoT networks are “honest and curious”. They comply with network protocols, but out of curiosity or malicious intent, they may conduct passive attacks, carrying out unauthorized information eavesdropping and analysis. For example, such an adversary might attempt to exploit gradient leakage to gain access to sensitive data without disrupting interaction processes within the network. External attackers are from outside the IoT networks, and can launch active attacks in addition to passive attacks. They initiate active attacks, including data tampering, data falsification, and denial-of-service attacks, with the aim of directly corrupting the update and synchronization processes.

III-B Semantic Communication System with Semantic Knowledge Base

Without loss of generality, we concentrate on semantic communications for the task of text transmission following the [5]. The input sentence to the semantic communication system is denoted as $\boldsymbol{s}=[w_{1},w_{2},\dots,w_{L}]$ , where $w_{l}$ is the $l$ -th word in the sentence. The transmitter comprises three essential components: semantic encoder, channel encoder, and semantic knowledge base. The semantic encoder is responsible for transforming the input data into meaningful semantic features. By leveraging the semantic knowledge base, the semantic encoder gains access to fundamental understanding and representations that significantly enhance its effectiveness. The channel encoder, which follows the semantic encoder, converts and compresses the semantic representations into fewer signal symbols suitable for transmission over the communication channel, ensuring reliable and efficient data delivery among IoT devices. The signal sent by the transmitter is denoted as

\boldsymbol{x}=C_{\boldsymbol{\beta}}\left(S_{\boldsymbol{\alpha}}\left(% \boldsymbol{s},\boldsymbol{\kappa}\right)\right)

(1)

where $\boldsymbol{x}\in\mathbb{C}^{K\times 1}$ represents the power-normalized signal that is to be transmitted, $\boldsymbol{\kappa}\in\mathbb{R}^{P\times Q}$ is represented as a semantic knowledge base with $P$ vectors, each of size $Q$ , $S_{\boldsymbol{\alpha}}\left(\cdot\right)$ is the semantic encoder with the parameters $\boldsymbol{\alpha}$ and $C_{\boldsymbol{\beta}}\left(\cdot\right)$ is the channel encoder with the parameters $\boldsymbol{\beta}$ . The signal received at the receiver is

\boldsymbol{y}=\boldsymbol{h}\boldsymbol{x}+\boldsymbol{n}_{channel}

(2)

where $\boldsymbol{y}\in\mathbb{C}^{K\times 1}$ , $\mathbf{n}_{channel}$ is the additive white Gaussian noise (AWGN), following $\mathbf{n}_{channel}\sim\mathcal{CN}\left(0,\sigma_{n}^{2}\mathbf{I}_{L}\right)$ . For the Rayleigh fading channel, the channel coefficient follows $\mathbf{h}\sim\mathcal{CN}\left(0,\mathbf{I}_{L}\right)$ ; and for Rician fading channel, it follows $\mathbf{h}\sim\mathcal{CN}\left(\mu_{h}\mathbf{I}_{L},\sigma_{h}^{2}\mathbf{I}% _{L}\right)$ with $\mu_{h}=\sqrt{r/(r+1)}$ and $\sigma_{h}=\sqrt{1/(r+1)}$ , where $r$ is the Rician coefficient.

The receiver includes semantic decoder, channel decoder and semantic knowledge base. The semantic knowledge base is synchronized to the transmitter’s. The channel decoder processes the received signals to recover semantic features, mitigating errors or distortions caused during the wireless communication process. Subsequently, the semantic decoder leverages the semantic knowledge base to decode these features, recovering the sentence $\mathbf{s}$ . The operation on the received signal $\boldsymbol{y}$ is

\hat{\boldsymbol{s}}=S_{\boldsymbol{\chi}}^{-1}\left(C^{-1}_{\boldsymbol{\psi}% }\left(\boldsymbol{y}\right),\boldsymbol{\kappa}\right)

(3)

where $\hat{\boldsymbol{s}}$ is the recovered sentence, $C^{-1}_{\boldsymbol{\psi}}\left(\cdot\right)$ is the channel decoder with parameters $\boldsymbol{\psi}$ , and $S_{\boldsymbol{\chi}}^{-1}\left(\cdot\right)$ is the semantic decoder with parameters $\boldsymbol{\chi}$ .

III-C Problem Definition

III-C1 Securing Interactions in Synchronization and Update

The timely synchronization and accurate update of $\boldsymbol{\alpha}$ , $\boldsymbol{\chi}$ , $\boldsymbol{\beta}$ , $\boldsymbol{\psi}$ , and $\boldsymbol{\kappa}$ are critical steps for the overall effectiveness of the semantic communication system. The integrity, authentication and availability of interactions need to be achieved. These models and knowledge bases can not be tampered or falsified during interactions. And interactions must be fault-tolerant and available in complex and changing IoT networks. To utilize semantic communications in interactions, it is necessary to verify the integrity and authenticity of $\hat{\boldsymbol{s}}$ with lossy transmissions.

III-C2 Building Efficient and Flexible Semantic Communication System with Semantic Knowledge Base

The challenge of efficiency arises from the substantial volume of data exchange that occurs during the process of updating and synchronizing $\boldsymbol{\alpha}$ , $\boldsymbol{\chi}$ , $\boldsymbol{\beta}$ , $\boldsymbol{\psi}$ and $\boldsymbol{\kappa}$ . To address this challenge, semantic knowledge bases need to be refined to achieve a small number of vectors, $P$ , while maintaining their semantic richness. This refinement is crucial to substantially reducing transmission overheads on IoT devices and efficiently empowering the semantic encoder with the fundamental information with less computational loads.

Furthermore, the wide range of transmission and computational capabilities requires the system to be adaptable and flexible. The transmission capability restricts the maximum value of the transmitted signal length $M$ , and the computation capability limits the number of semantic knowledge vectors $P$ involved in model inference. The objective of system can be represented as

\max\quad\sum_{M\in\boldsymbol{M}}\sum_{P\in\boldsymbol{P}}\zeta_{M,P}\left(% \boldsymbol{s},\hat{\boldsymbol{s}}\right)\\

(4)

where $\boldsymbol{M}$ represents the set of numbers of symbols that devices can transmit, and $\boldsymbol{P}$ represents the set of numbers of semantic knowledge vectors that devices can use, $\zeta_{M,P}\left(\cdot,\cdot\right)$ measure the similarity between $\boldsymbol{s}$ and $\hat{\boldsymbol{s}}$ when device transmits $M$ symbols and utilize $P$ semantic communication vectors.

III-C3 Achieving Differential Privacy

Considering potential data inference attacks[19] during maintenance and utilization of semantic communications, we need to achieve differential privacy in semantic communications. By adding noise to the transmitted message, called differential privacy noise, the differential privacy mechanism can be effective against such attacks. However, in semantic communication, the transmitted information is also affected by model noise and wireless channel noise. It requires a joint analysis of the impact of differential privacy noise, model noise and wireless channel noise on achieving the differential privacy objective. Based on this, it is necessary to propose an optimal noise addition mechanism to achieve target differential privacy with the least amount of added differential privacy noise.

IV Proposed Solution

IV-A Overview

Refer to caption — Figure 1: Overview of proposed system.

The overview of the proposed secure distributed semantic communication system is shown in the Fig. 1. The system consists of three entities, which are elaborated as follows:

1.

IoT devices: Entities are equipped with a range of bandwidth resources and computing capabilities. They can perform conventional reliable communication protocols such as Bluetooth or WiFi, which have been widely integrated within IoT ecosystems. In addition, they are also capable of semantic communications. These entities do not simply run the static semantic communication system. They interact with each other to continuously update and synchronize the semantic communication system.
2.

Key Generation Center: A trusted third party plays a crucial role within the network, facilitating network initiation and public/private key pairs generation and distribution[28]. It is worth noting that it cannot directly organize interactions and perform complicated data processing, due to availability issues caused by complex IoT environments and the limitations of the center’s own capabilities.
3.

Blockchain: A consortium blockchain is a intangible, conceptual entity maintained by IoT devices. This blockchain is crucial for achieving transparent and trustworthy interactions between network participants. It serves as a secure platform, ensuring that all process of synchronization and update are recorded in an immutable and tamper-proof manner. A secure environment that ensures the integrity, authentication and availability of the semantic communication system is supported by this blockchain.

The system deployment process is comprised of three main phases, which are as follows:

1.

Update: IoT devices collect local training data about tasks and train their local models and semantic knowledge bases. Then, they share their local models and semantic knowledge bases to collectively update the semantic communication system, thereby enabling it to adapt to emerging tasks. This approach ensures that the entire IoT network is able to cope with arising requirements.
2.

Synchronization: Since not all devices may participate in the update process because of limited resources, the synchronization phase is important to ensure that all devices are aligned with the most updated and optimized system. Furthermore, due to the inherent dynamic topology of IoT networks, where devices frequently join and leave the network, it is imperative for newly joining IoT devices to promptly retrieve the latest model to maintain consistency and coherence within the network.
3.

Communication: Once synchronization is complete, IoT devices proceed to the communication phase, where they leverage the semantic communication system to exchange information efficiently.

In the proposed system, the signaling used for controlling interactions is carried by conventional reliable communication protocols. Semantic communications are performed for IoT tasks in the communication phase. During the update and synchronization phases, these devices can choose to use either conventional methods or semantic communications to transmit models and knowledge bases, depending on their conditions. Traditional communication protocols do not require model inference, thereby conserving computational resources. However, they require the transmission of a larger number of signal symbols. In contrast, semantic communications reduce the number of symbols transmitted, but require computational processes for model inference.

The proposed system consists of a blockchain-based interaction framework, an efficient and flexible semantic communication scheme, and an joint model-channel differential privacy noise mechanism. The blockchain-based interaction framework provides integrity, availability protection for system maintenance. Based on the secure interactions provided by the framework, the efficient and flexible semantic communication scheme is explored to achieve a more efficient system update solution with less data exchange. In response to privacy breaches arising from the system maintenance process and system usage, the joint model-channel differential privacy noise mechanism is proposed to implement differential privacy in semantic communications.

IV-B Blockchain-based Secure Interaction Framework

IoT devices collectively build a blockchain network for trustworthy interactions with integrity, authentication and availability in the semantic communication system. A blockchain[29, 30] is a distributed immutable ledger, constructed as a list of blocks. Each block records a set of transactions, where a transaction represents an operation to read or write data to the ledger. The set of rules and conditions for querying or modifying the ledger is defined in codes, known as smart contracts. Each peer maintains a copy of the ledger by a collaborative process called consensus, ensuring the proper execution of smart contracts, the validation of blocks, and the consistency of the ledger among peers. Once a new block is generated and validated, it is cryptographically linked to the last block of the current ledger and synchronized among the networks. The blockchain is fault-tolerant and can withstand a single point of failure.

In the blockchain network maintained by IoT devices, model update and synchronization can be seen as transaction in blockchain, because it is actually a modification or reading of the ledger data. The blockchain network consists of multiple channels, each of which is a sub-network responsible for a specific semantic communication task. One device can participate in different channels at the same time.

There are three main transactions in the system, model upload, model aggregation and model retrieval. We select FedAvg[6] to aggregate local models from each devices. For achieve the integrity and authentication of transaction, the interaction workflow is as follows. The device generate a transaction proposal. For data upload task, it contains the models, knowledge bases and other data. This proposal is the signed and broadcasted to the network. Other device receive and validate the transaction proposal. To validate the receive proposal, devices first verify the digital signature to confirms that the proposal originated from a legitimate device within the channel. After signature is verified, for the model upload task, devices check the integrity of the model; for the model aggregation task, devices check that the FedAvg algorithm is executed correctly. Validated transaction are bundle into block. The network employs a consensus mechanism to agree on which block to append to the blockchain. For the model retrieval task, devices can access models and knowledge bases directly from its own copy of the ledger.

Performing the above workflows in conventional reliable communication protocols has been widely studied and discussed. It is notice that the whole process requires digital signatures to ensure the integrity and authenticity of the transaction. The use of semantic communications for transmitting a transaction proposal would inevitably result in the failure of signature verification due to the inherently lossy nature of semantic communications. In order to facilitate the integration of semantic communications into the aforementioned workflows and thereby enhance system performance, with the idea of provable data possession[31, 32], we propose a probabilistic signature verification mechanism. The mechanism ensures the integrity and authentication of transmitted semantics in semantic communications.

We consider that Alice want to transmit semantics to Bob with the integrity and authentication of semantics. The output of semantic encoder can be reconstructed into a one-dimensional data, $\boldsymbol{W}\in\mathbb{R}^{N}$ . This data goes through the channel codec and wireless channel and is received by the bob, denoted as $\widehat{\boldsymbol{W}}$ . Alice randomly samples $\boldsymbol{W}$ based on a random index set, $\boldsymbol{I}$ . The sampling result is denote as $\boldsymbol{W}_{\boldsymbol{I}}\triangleq\left\{\boldsymbol{W}_{i}|i\in% \boldsymbol{I}\right\}$ . Alice signs $\boldsymbol{W}_{\boldsymbol{I}}$ and $\boldsymbol{I}$ with its privacy key $sk$ , denoted as $sign\triangleq\left\{\boldsymbol{W}_{\boldsymbol{I}}||\boldsymbol{I}\right\}_{sk}$ . $\left\{{\boldsymbol{W}}_{\boldsymbol{I}}||\boldsymbol{I}||sign\right\}$ is transmitted to Bob in conventional communication protocols. It has much smaller data than $\boldsymbol{W}$ . Bob validates $sign$ with the public key of Alice, ensuring the integrity, authentication and non-repudiation of $\left\{{\boldsymbol{W}}_{\boldsymbol{I}}||\boldsymbol{I}\right\}$ . After sign is validated, Bob samples $\widehat{\boldsymbol{W}}$ with $\boldsymbol{I}$ , denoted as ${\widehat{\boldsymbol{W}}}_{\boldsymbol{I}}\triangleq\left\{\widehat{% \boldsymbol{W}}_{i}|i\in\boldsymbol{I}\right\}$ . Finally, Bob validates the difference between ${{\boldsymbol{W}}}_{\boldsymbol{I}}$ and ${\widehat{\boldsymbol{W}}}_{\boldsymbol{I}}$ . If the difference less than a specified threshold, the validation will be successful and vice versa.

To comprehensively quantity the discrepancy between ${{\boldsymbol{W}}}_{\boldsymbol{I}}$ and ${\widehat{\boldsymbol{W}}}_{\boldsymbol{I}}$ , we introduce a metric defined as

Diff=||{{\boldsymbol{W}}}_{\boldsymbol{I}}-{\widehat{\boldsymbol{W}}}_{% \boldsymbol{I}}||_{1}+||{{\boldsymbol{W}}}_{\boldsymbol{I}}-{\widehat{% \boldsymbol{W}}}_{\boldsymbol{I}}||_{\infty}.

(5)

This metric captures the two critical aspects of the difference between ${{\boldsymbol{W}}}_{\boldsymbol{I}}$ and ${\widehat{\boldsymbol{W}}}_{\boldsymbol{I}}$ . The $L_{1}$ norm, $||{{\boldsymbol{W}}}_{\boldsymbol{I}}-{\widehat{\boldsymbol{W}}}_{\boldsymbol{% I}}||_{1}$ , measures the average deviation, providing insights into the overall magnitude of the discrepancy across all elements. And the $L_{\infty}$ norm, $||{{\boldsymbol{W}}}_{\boldsymbol{I}}-{\widehat{\boldsymbol{W}}}_{\boldsymbol{% I}}||_{\infty}$ quantifies the maximum deviation, highlighting the most significant discrepancy among individual elements.

The key to the mechanism is that adversaries can not know $\boldsymbol{I}$ until the transmission of $\boldsymbol{I}$ is complete. Once adversaries are aware of $\boldsymbol{I}$ before Bob receives $\boldsymbol{W}$ , they are able to launch attacks without being detected by modifying the data whose index is not in $\boldsymbol{I}$ and maintaining the data whose index is in $\boldsymbol{I}$ . Therefore, it is crucial to maintain the randomness of the index set $\boldsymbol{I}$ . It must be transmitted delayed or encrypted.

We classify attacks on this mechanism into two categories, based on whether the modification of the information is greater than the threshold value. For attacks where modifications to data exceed thresholds, with the size of index set $|\boldsymbol{I}|$ increase, the integrity and authentication of $\boldsymbol{W}$ improves. If $x$ items are modified in $\boldsymbol{W}$ , the probability of detection with $|\boldsymbol{I}|=I$ is

P_{d}=1-\frac{C_{N-x}^{I}}{C_{N}^{I}}

(6)

For attacks where the modification of data is less than a threshold value, such as poisoning attacks achieved by introducing subtly delicate noises. It can also be submerged in channel and model noises, thereby remaining the security.

IV-C Efficient and Flexible Semantic Communication Scheme

The proposed system addresses the challenges posed by varying computational and communicative resources of IoT devices. By leveraging a shared semantic knowledge base, we develop an flexible semantic communication system that enables each IoT device to adapt their communication strategies in response to resource availability. The mechanism enables efficient model updating, by mainly updating only compact knowledge bases.

The proposed system is shown in the Fig. 2. In the proposed scheme, the semantic knowledge base is consists of semantic knowledge vectors. Considering the diverse requirements of different semantic communication tasks, there are semantic knowledge vectors tailored specifically to address these varying needs. We define a list of semantic knowledge vectors for the semantic communication task $t$ as $\boldsymbol{\kappa}^{t}=\left[\boldsymbol{v}_{1}^{t},\boldsymbol{v}_{2}^{t},% \cdots,\boldsymbol{v}_{P^{t}}^{t}\right]$ , where $P^{t}$ is the total number of vectors, and $\boldsymbol{v}_{n}^{t}\in\mathbb{R}^{Q}$ represents the $n$ -th $Q$ -dimensional vector in $\boldsymbol{\kappa}^{t}$ . The detailed process on semantic knowledge vectors is demonstrated in Fig. 3. During the initialization phase, both the transmitter and receiver retrieve the same $S_{\boldsymbol{\alpha}}$ , $S_{\boldsymbol{\chi}}^{-1}$ , $C_{\boldsymbol{\beta}}$ , $C_{\boldsymbol{\psi}}^{-1}$ and $\boldsymbol{\kappa}^{t}$ from the blockchain for the specific task $t$ . Transmitter utilizes the encoder $S_{\boldsymbol{\alpha}}$ to extract features from the input sentences $\boldsymbol{s}^{t}$ , with the help of $\boldsymbol{\kappa}^{t}$ . The input sentences $\boldsymbol{s}^{t}$ is embedded as $\boldsymbol{s}^{t}_{e}\in\mathbb{R}^{L\times Q}$ . These extracted features are $\boldsymbol{f}\in\mathbb{R}^{(L+P^{t})\times Q}\triangleq S_{\boldsymbol{% \alpha}}\left(\boldsymbol{s}^{t}||\boldsymbol{\kappa}^{t}\right)$ , Afterward, $\boldsymbol{f}$ is transmitted to the receiver through wireless channel with the process of channel codec, which is described in (1), (2) and (3). The features recovered by the channel decoder is denoted as $\hat{\boldsymbol{f}}$ . Finally, $\hat{\boldsymbol{f}}$ and $\boldsymbol{\kappa}^{t}$ are fed into $S_{\boldsymbol{\chi}}^{-1}$ as inputs in order to reconstruct the sentence, denoted as $\hat{\boldsymbol{s}^{t}}\triangleq S_{\boldsymbol{\chi}}^{-1}\left(\hat{% \boldsymbol{f}}||\boldsymbol{\kappa}^{t}\right)$ . The semantic knowledge vectors $\boldsymbol{\kappa}^{t}$ is generated by a neural model, called as semantic knowledge network, with fixed inputs. The model is only used during the training process. During semantic communications, the device can directly use its output without model inference.

This scheme leverages the shared semantic knowledge base to reduce data needed to be transmitted. Furthermore, the transmitter and receiver have ability to balance communication performance with computational and communicative demands by pruning $\boldsymbol{\kappa}^{t}$ and $\boldsymbol{f}$ . For devices with limited computing power, the transmitter and receiver can negotiate to truncate the basis $\mathcal{B}^{t}$ for mitigating the computational cost of the semantic encoding and decoding process. Besides, the receiver also can trim $\boldsymbol{f}$ to introduce fewer signal symbols to be transmitted. With the designed training scheme, vectors in $\boldsymbol{\kappa}^{t}$ and $\boldsymbol{f}$ are both ordered according to their importance for performing the semantic communication task $t$ . So that, the IoT device can efficiently truncate them with minimal sacrifice to communication performance. The method for constructing and updating $\boldsymbol{\kappa}^{t}$ and $\boldsymbol{f}$ with order of importance will be thoroughly introduced in the following.

IV-C1 Training with random pruning mechanism

The forward propagation with random pruning mechanism is shown in Algorithm 1. Let $\boldsymbol{\kappa}^{t}_{i}$ represent a subsequence of $\boldsymbol{\kappa}^{t}$ comprising the first $i$ elements, and $\boldsymbol{f}_{j}$ denote a subsequence of $\boldsymbol{f}$ containing the first $j$ elements. For each batch during training, $\boldsymbol{\kappa}^{t}_{i}$ and $\boldsymbol{f}_{j}$ are randomly selected, ranging from the empty set to containing all of elements in $\boldsymbol{\kappa}^{t}$ and $\boldsymbol{f}$ . The mechanism ensures devices to flexibly adjust the size of $\boldsymbol{\kappa}^{t}$ and $\boldsymbol{f}$ according to their own computational and communication capabilities, supporting a elastic semantic communication system.

Input: batch data

\boldsymbol{S}

from

D

;

RandomInteger\left(2,N^{t}\right)\to i

;

RandomInteger\left(0,G\right)\to j

;

4 Transmitter:

S_{\boldsymbol{\alpha}}\left(\boldsymbol{S}||\boldsymbol{\kappa}^{t}_{i}\right% )\to\boldsymbol{f}

;

6 Transmit

\boldsymbol{f}_{j}

over the channel;

8Receiver:

9 Receive

\hat{\boldsymbol{f}_{j}}

;

S_{\boldsymbol{\chi}}^{-1}\left(\hat{\boldsymbol{f}_{j}}||\boldsymbol{\kappa}^% {t}_{i}\right)\to\hat{\boldsymbol{S}}

;

Output:

\boldsymbol{f}

\hat{\boldsymbol{f}}

\hat{\boldsymbol{S}}

Algorithm 1 Forward propagation with random pruning mechanism

IV-C2 Efficient local network update

3 Function Train the Semantic Codec():

Input: batch data

\boldsymbol{S}

from dataset;

5 Freeze

C_{\boldsymbol{\beta}}

C_{\boldsymbol{\psi}}^{-1}

\boldsymbol{\kappa}^{t}

;

6 Forward propagation based on Algorithm 1;

7 Compute loss function

\mathcal{L}_{CE}

by (7);

8 Train

S_{\boldsymbol{\alpha}}

S_{\boldsymbol{\chi}}^{-1}

\to

Gradient descent with

\mathcal{L}_{CE}

;

Output:

S_{\boldsymbol{\alpha}}

S_{\boldsymbol{\chi}}^{-1}

;

11 Function Train the Channel Codec():

Input: batch data

\boldsymbol{S}

from dataset;

13 Freeze

S_{\boldsymbol{\alpha}}

S_{\boldsymbol{\chi}}^{-1}

\boldsymbol{\kappa}^{t}

;

14 Forward propagation based on Algorithm 1;

15 Compute loss function

\mathcal{L}_{MSE}

by (8);

16 Train

C_{\boldsymbol{\beta}}

C_{\boldsymbol{\psi}}^{-1}

\to

Gradient descent with

\mathcal{L}_{MSE}

;

Output:

C_{\boldsymbol{\beta}}

C_{\boldsymbol{\psi}}^{-1}

;

19 Function Train the Semantic Knowledge Base():

Input: batch data

\boldsymbol{S}

from dataset;

21 Freeze

C_{\boldsymbol{\beta}}

C_{\boldsymbol{\psi}}^{-1}

S_{\boldsymbol{\alpha}}

S_{\boldsymbol{\chi}}^{-1}

;

22 Forward propagation based on Algorithm 1;

23 Compute loss function

\mathcal{L}_{MSE}

by (8);

24 Train

\boldsymbol{\kappa}^{t}

\to

Gradient descent with

\mathcal{L}_{CE}

;

Output:

\boldsymbol{\kappa}^{t}

;

26 Function Train the Whole System():

Input: batch data

\boldsymbol{S}

from dataset;

27 Forward propagation based on Algorithm 1;

28 Compute loss function

\mathcal{L}_{CE}

by (7);

29 Train

S_{\boldsymbol{\alpha}}

S_{\boldsymbol{\chi}}^{-1}

C_{\boldsymbol{\beta}}

C_{\boldsymbol{\psi}}^{-1}

\boldsymbol{\kappa}^{t}

\to

Gradient descent with

\mathcal{L}_{CE}

;

Output:

S_{\boldsymbol{\alpha}}

S_{\boldsymbol{\chi}}^{-1}

C_{\boldsymbol{\beta}}

C_{\boldsymbol{\psi}}^{-1}

\boldsymbol{\kappa}^{t}

;

Algorithm 2 Local update

As exhibited in Algorithm 2, the training of the semantic communication system is divided into four steps, for the individual training of the semantic codec, the channel codec, the semantic knowledge base and the overall training of the whole system. In the first steps, $S_{\boldsymbol{\alpha}}$ and $S_{\boldsymbol{\chi}}^{-1}$ are updated with the goal of minimizing the divergence between $\boldsymbol{s}$ and $\hat{\boldsymbol{s}}$ . To quantify this divergence, we employ the cross-entropy (CE) to quantify the divergence, which is given by

		$\displaystyle\mathcal{L}_{CE}\left(\boldsymbol{s},\hat{\boldsymbol{s}}\right)=$		(7)
		$\displaystyle-\sum_{l=1}q\left(w_{l}\right)log\left(p\left(w_{l}\right)\right)% +\left(1-q\left(w_{l}\right)\right)log\left(1-p\left(w_{l}\right)\right),$		(7)

where $q\left(w_{l}\right)$ denotes the real probability of the occurrence of $w_{l}$ in original sentence $\mathbf{s}$ , and $p\left(w_{l}\right)$ is the predicted probability of the same $w_{i}$ appearing in the reconstructed sentence $\hat{\mathbf{s}}$ .

In the second steps, $C_{\boldsymbol{\beta}}$ and $C_{\boldsymbol{\psi}}^{-1}$ are updated with the $\mathcal{L}_{MSE}$ , which is given by

\displaystyle\mathcal{L}_{MSE}\left(\boldsymbol{f},\hat{\boldsymbol{f}}\right)% =\left|\left|\boldsymbol{f}-\hat{\boldsymbol{f}}\right|\right|_{2},

(8)

In the third steps, $\boldsymbol{\kappa}^{t}$ are update with the $\mathcal{L}_{CE}$ . To ensure the broad representational capability of the semantic knowledge base, we introduce the cosine distance into the loss function, aiming to enrich each vector with a more diverse set of information. By incorporating these approach, we strive to achieve a semantic knowledge base that is not only scalable and adaptable but also possesses a wider range of representation, thereby improving the comprehensiveness and accuracy of semantic communication capabilities of the system. the $\mathcal{L}_{\boldsymbol{\kappa}}$ is given by

\mathcal{L}_{\kappa}\left(\boldsymbol{s},\hat{\boldsymbol{s}}\right)=\mathcal{% L}_{CE}\left(\boldsymbol{s},\hat{\boldsymbol{s}}\right)+\left|\left|\left(% \boldsymbol{\kappa}^{t}\right)^{T}\left(\boldsymbol{\kappa}^{t}\right)\ \right% |\right|_{2},

(9)

Finally, the whole network is trained with $\mathcal{L}_{total}$ , which is given by

\mathcal{L}_{total}=\mathcal{L}_{CE}+\mathcal{L}_{MSE}+\mathcal{L}_{% \boldsymbol{\kappa}}

(10)

Oriented towards the need for continuous efficient update of the semantic communication system, IoT devices can only update the semantic knowledge base based on (9) with less data exchange.

IV-D The Joint Model-Channel Differential Privacy Noise Mechanism

In this section, we propose a differential privacy semantic communication scheme for any task that focuses on data analysis and do not require precise data recovery. In our proposed system, the proposed differential privacy scheme prevents attackers from inferring sensitive information contained in the local training data, according to the analysis results, i.e., the transmitted signal symbols. With the help of the mechanism, IoT devices can efficiently share their models and semantic knowledge bases and ensure privacy protection during system maintain and usage. The whole process for transmitter is

\boldsymbol{x}=\Omega(\mathcal{D}),

(11)

where $\mathcal{D}$ is the raw data collected by IoT device, $\Omega(\cdot)$ represent the whole process including data analyzing, semantic encoding and channel encoding. $\boldsymbol{x}$ is also represented as

\boldsymbol{x}=\boldsymbol{si}+\boldsymbol{n}_{model},

(12)

where $\boldsymbol{si}$ is the semantic information extracted from $\mathcal{D}$ , $\boldsymbol{n}_{model}\sim\mathcal{CN}\left(0,\sigma_{m}^{2}\mathbf{I}\right)$ represent the model noise with Gaussian distribution, which is the result of unstable gradients descending, the training data noise and other factors[33]. After being transmitted over wireless channel, based on (2) and (12), the received signal can be represented as

\boldsymbol{y}=\boldsymbol{h}\left(\boldsymbol{si}+\boldsymbol{n}_{model}% \right)+\boldsymbol{n}_{channel}.

(13)

Adversaries can only perform malicious analysis based on $\boldsymbol{y}$ . We define the process from $D$ to $y$ as

\boldsymbol{y}=\mathcal{M}(\mathcal{D}).

(14)

Based on (14), we know that semantic communications achieve differential privacy if $\mathcal{M}(\cdot)$ satisfy differential privacy. Formally, $\mathcal{M}:\boldsymbol{D}\to\boldsymbol{Y}$ satisfies $\left(\epsilon,\delta\right)$ -differential privacy[34, 35] if and only if for any two adjacent datasets $\mathcal{D},\mathcal{D}^{\prime}\subseteq\boldsymbol{D}$ and output $\boldsymbol{\gamma}\subset\boldsymbol{Y}$ , we have

Pr[\mathcal{M}(\mathcal{D})\in\boldsymbol{\gamma}]\leq e^{\epsilon}Pr[\mathcal% {M}(\mathcal{D}^{\prime})\in\boldsymbol{\gamma}]+\delta

(15)

where $\mathcal{D}$ and $\mathcal{D}^{\prime}$ differ in only one sample, $\boldsymbol{D}$ and $\boldsymbol{Y}$ are sets of all $\mathcal{D}$ and $\boldsymbol{y}$ respectively, $\epsilon$ controls the privacy loss, with smaller values indicating stronger privacy protection, $\delta$ allows for a small probability of deviation from the strict privacy guarantee, providing a more flexible approach in scenarios where absolute privacy may be impractical. Hence, a mechanism satisfies $\left(\epsilon,\delta\right)$ -differential privacy if, for any pair of adjacent datasets, and for any outputs, the ratio of the probabilities of observing these outputs under the mechanism is bounded by $\exp(\epsilon)$ with probability at least $1-\delta$ .

To make $\mathcal{M}(\cdot)$ satisfy differential privacy, we utilize analytic Gaussian mechanism[36]. Note that $\triangle$ is sensitivity of $M(\cdot)$ , defined as the maximum of $||M(\mathcal{D})-M(\mathcal{D}^{\prime})||_{2}$ . The mechanism is that for any $\epsilon>0$ , $\delta\in(0,1)$ and $\triangle$ , there is a $\sigma$ . Adding Gaussian noise with mean 0 and standard deviation $\sigma$ into the result of mechanism $M$ provides $\left(\epsilon,\delta\right)$ -differential privacy.

We add Gaussian noise $\mathbf{n}_{dp}\sim\mathcal{CN}\left(0,\sigma_{dp}^{2}\mathbf{I}_{L}\right)$ to achieve differential privacy, therefore the signal received at the adversary is

\boldsymbol{y}=\boldsymbol{h}\left(\boldsymbol{si}+\boldsymbol{n}_{model}+% \boldsymbol{n}_{dp}\right)+\boldsymbol{n}_{channel}.

(16)

Following (16), considering that $\boldsymbol{n}_{dp}$ , $\boldsymbol{n}_{model}$ and $\boldsymbol{n}_{channel}$ are all Gaussian noise, there are multiple differential privacy mechanisms accumulated in $\mathcal{M}(\cdot)$ . $\boldsymbol{n}_{dp}$ , $\boldsymbol{n}_{model}$ and $\boldsymbol{n}_{channel}$ provide $(\epsilon_{dp},\delta_{dp})$ , $(\epsilon_{model},\delta_{model})$ , $(\epsilon_{channel},\delta_{channel})$ -differential privacy, respectively. Because the model noise and channel noise are immutable, we need to adjust the differential privacy noise appropriately to achieve the target differential privacy with minimum noise. In composition theorem for heterogeneous differential privacy mechanisms[37], for any $\epsilon_{i}>0$ , $\delta_{i}\in[0,1]$ for $i\in\{1,...,k\}$ , the class of $(\epsilon_{i},\delta_{i})$ -DP mechanisms satisfy $\left(\hat{\epsilon},1-(1-\hat{\delta})\prod_{i=1}^{k}\left(1-\delta_{i}\right% )\right)$ , where

$\displaystyle\hat{\epsilon}=$	$\displaystyle\min\left\{\sum_{i=1}^{k}\epsilon_{i},\right.$	(17)
	$\displaystyle\sum_{i=1}^{k}\frac{(e^{\epsilon_{i}}-1)\epsilon_{i}}{e^{\epsilon% _{i}}+1}+\sqrt{\sum_{i=1}^{k}2\epsilon_{i}^{2}\log\left(e+\frac{\sqrt{\sum_{i=% 1}^{k}\epsilon_{i}^{2}}}{\hat{\delta}}\right)},$
	$\displaystyle\left.\sum_{i=1}^{k}\frac{(e^{\epsilon_{i}}-1)\epsilon_{i}}{e^{% \epsilon_{i}}+1}+\sqrt{\sum_{i=1}^{k}2\epsilon_{i}^{2}\log\left(\frac{1}{\hat{% \delta}}\right)}\right\}.$

Based on (16) and (17), the proposed scheme first needs to confirm whether the channel noise and model noise are sufficient to achieve the differential privacy objective, and if not, then introduce $\boldsymbol{n}_{dp}$ as appropriate. $\boldsymbol{n}_{dp}$ is adjusted to achieve $\hat{\epsilon}<\epsilon^{t}$ and $\hat{\delta}<\delta^{t}$ , where $\epsilon^{t}$ and $\delta^{t}$ describe the differential privacy of target. The proposed scenarios are generic and can be effectively applied in a variety of situations. If analysis results are transmitted using traditional reliable communication protocols, it can be considered that $\boldsymbol{n}_{model}$ and $\boldsymbol{n}_{channel}$ are zeros ans $\boldsymbol{h}$ is identity matrix in (16). If model noise or wireless channel noise is difficult to estimate, the scheme is able to ignore the poorly estimated noise and permute the available Gaussian mechanisms to achieve differential privacy by adjusting $(\epsilon_{i},\delta_{i})$ in (17). Since the scheme adds $\boldsymbol{n}_{dp}$ to the symbol after power normalization which has natural upper and lower bounds, its sensitivity can be easily estimated. This simplifies the implementation of differential privacy and makes the scheme broadly adaptable to different data analysis tasks without the need to analyze the sensitivity task by task.

V Performance Evaluation

In this section, we evaluation the performance of the proposed system. We first evaluate the effectiveness of the proposed compressed semantic knowledge base. Then the flexibility of the semantic communication scheme is evaluated. Finally, we evaluate the impact of the proposed differential privacy protection mechanism on the performance of semantic communications.

Following the DeepSC[5], we employ four Transformer encoder layers in the semantic encoder, and four Transformer decoder layers in the semantic decoder. The entire network parameter settings are summarized in Table I. The knowledge base is generated by the semantic knowledge network and consists of eight vectors of size $128$ . The dataset used in experiments is the English and French corpora in the proceeding of the European Parliament[38].

TABLE I: The settings of the proposed system

	Layer Name	Unit
Semantic Encoder	4 $\times$ Transformer Encoder	128 (8 heads)
Channel Encoder	Dense	256
Channel Encoder	Dense	16
Channel Decoder	Dense	128
Channel Decoder	Dense	256
Semantic Decoder	4 $\times$ Transformer Decoder	128 (8 heads)
Predictable Layer	Dense	Dictonary size
Semantic Knowledge Net	Dense	128
Semantic Knowledge Net	Dense	128 $\times$ 8

In order to demonstrate that the proposed knowledge base enables efficient semantic knowledge system updating, we show the loss evolution of the proposed system in Fig. 4. The loss is $\mathcal{L}_{CE}$ in (7). “SKB in ‘en’ ” and “SKB in ‘fr’ ” denote the use of English corpus, French corpus and English-French corpus to train the semantic knowledge network to generate the semantic knowledge bases respectively.

The system is trained to perform text transmission in both English and French. In the first $1200$ epochs, the semantic knowledge network is frozen and only DeepSC-related modules are being trained. After $1200$ epochs, the DeepSC-related modules is trained only $5$ rounds per $100$ rounds on average, while the semantic knowledge network starts to be trained for English and French respectively to generate compressed semantic knowledge bases. The output of the semantic knowledge network is reshaped to $\mathbb{R}^{8\times 128}$ , as a semantic knowledge base. At the $1200$ -th epoch, the system begins to converge. The incorporation of the semantic knowledge network allows the system’s loss to converge to a lower loss. Moreover, the decline in Loss is accomplished with most of the network being frozen. This will significantly reduce the amount of data that needs to be shared during the collaborative learning process in IoT networks.

We analyze the performance of the proposed system using the bilingual evaluation understudy (BLEU) score[38]. Fig. 5 and Fig. 6 show the comparison of BLEU versus signal to noise ratio (SNR) in English and French transmission tasks with different knowledge bases over different wireless channels, AWGN Rayleigh and Rician. The DeepSC serve as the baseline for this comparison. From the figures, it can be seen that the BLEU of the proposed scheme is higher compared to DeepSC which does not use semantic knowledge base. Moreover, the closer the training dataset is to the communication task requirements, the more the trained semantic knowledge base improves the BLEU. Based on the above experimental results, we learn that the proposed semantic communication scheme based on compressed semantic knowledge bases is able to achieve efficient system updating and support adjustment for different tasks.

We conduct a thorough evaluation of the flexibility of the proposed system. Fig. 7 presents a comparatively analysis of the performance of the proposed system under different pruning levels. The result indicates that the performance of the system is enhanced as the pruning level decrease. Specifically, the proposed system, when transmitting $90\%$ of semantic features, achieves approximately same the BLEU score of DeepSC. Transmission in only $80\%$ semantic features can achieve a BLEU higher than $0.85$ even at a SNR of $-3db$ . The proposed scheme is able to obtain better performance compared to DeepSC at low SNR due to the fact that semantic knowledge bases has been shared in advance and is not interfered by the current noise.

Fig. 8 shows the communication performance of the proposed differential privacy semantic communication for different $\delta$ and $\epsilon$ settings. The results show that the mechanism is able to guarantee mathematically rigorous proofs of privacy preservation with BLEU of more than $0.8$ .

VI Conclusion

We propose a secure, efficient, and privacy-preserving semantic communication system in IoT networks. The proposed solutions have been validated through extensive experiments, showing that they can achieve the desired goals of efficiency, and privacy preservation.

References

[1] Z. Qin, X. Tao, J. Lu, W. Tong, and G. Y. Li, “Semantic communications: Principles and challenges,” arXiv preprint arXiv:2201.01389, 2021.
[2] D. Gündüz, Z. Qin, I. E. Aguerri, H. S. Dhillon, Z. Yang, A. Yener, K. K. Wong, and C.-B. Chae, “Beyond transmitting bits: Context, semantics, and task-oriented communications,” IEEE Journal on Selected Areas in Communications, vol. 41, no. 1, pp. 5–41, 2022.
[3] H. Xie and Z. Qin, “A lite distributed semantic communication system for internet of things,” IEEE Journal on Selected Areas in Communications, vol. 39, no. 1, pp. 142–153, 2020.
[4] H. Du, J. Wang, D. Niyato, J. Kang, Z. Xiong, M. Guizani, and D. I. Kim, “Rethinking wireless communication security in semantic internet of things,” IEEE Wireless Communications, vol. 30, no. 3, pp. 36–43, 2023.
[5] H. Xie, Z. Qin, G. Y. Li, and B.-H. Juang, “Deep learning enabled semantic communication systems,” IEEE Transactions on Signal Processing, vol. 69, pp. 2663–2675, 2021.
[6] B. McMahan, E. Moore, D. Ramage, S. Hampson, and B. A. y Arcas, “Communication-efficient learning of deep networks from decentralized data,” in Artificial intelligence and statistics. PMLR, 2017, pp. 1273–1282.
[7] J. Deogirikar and A. Vidhate, “Security attacks in iot: A survey,” in 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud)(I-SMAC). IEEE, 2017, pp. 32–37.
[8] L. Feng, Y. Zhao, S. Guo, X. Qiu, W. Li, and P. Yu, “Bafl: A blockchain-based asynchronous federated learning framework,” IEEE Transactions on Computers, vol. 71, no. 5, pp. 1092–1103, 2021.
[9] F. Zhou, Y. Li, M. Xu, L. Yuan, Q. Wu, R. Q. Hu, and N. Al-Dhahir, “Cognitive semantic communication systems driven by knowledge graph: principle, implementation, and performance evaluation,” IEEE Transactions on Communications, 2023.
[10] S. Jiang, Y. Liu, Y. Zhang, P. Luo, K. Cao, J. Xiong, H. Zhao, and J. Wei, “Reliable semantic communication system enabled by knowledge graph,” Entropy, vol. 24, no. 6, p. 846, 2022.
[11] H. Zhang, S. Shao, M. Tao, X. Bi, and K. B. Letaief, “Deep learning-enabled semantic communication systems with task-unaware transmitter and dynamic data,” IEEE Journal on Selected Areas in Communications, vol. 41, no. 1, pp. 170–185, 2022.
[12] Y. Sun, H. Chen, X. Xu, P. Zhang, and S. Cui, “Semantic knowledge base-enabled zero-shot multi-level feature transmission optimization,” IEEE Transactions on Wireless Communications, 2023.
[13] L. X. Nguyen, H. Q. Le, Y. L. Tun, P. S. Aung, Y. K. Tun, Z. Han, and C. S. Hong, “An efficient federated learning framework for training semantic communication systems,” IEEE Transactions on Vehicular Technology, 2024.
[14] W. Wei and L. Liu, “Gradient leakage attack resilient deep learning,” IEEE Transactions on Information Forensics and Security, vol. 17, pp. 303–316, 2021.
[15] Y. Chen, Q. Yang, Z. Shi, and J. Chen, “The model inversion eavesdropping attack in semantic communication systems,” in GLOBECOM 2023-2023 IEEE Global Communications Conference. IEEE, 2023, pp. 5171–5177.
[16] Y. Zhang, R. Jia, H. Pei, W. Wang, B. Li, and D. Song, “The secret revealer: Generative model-inversion attacks against deep neural networks,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 253–261.
[17] C. Dwork, “Differential privacy,” in International colloquium on automata, languages, and programming. Springer, 2006, pp. 1–12.
[18] M. Abadi, A. Chu, I. Goodfellow, H. B. McMahan, I. Mironov, K. Talwar, and L. Zhang, “Deep learning with differential privacy,” in Proceedings of the 2016 ACM SIGSAC conference on computer and communications security, 2016, pp. 308–318.
[19] D. Ye, S. Shen, T. Zhu, B. Liu, and W. Zhou, “One parameter defense—defending against data inference attacks via differential privacy,” IEEE Transactions on Information Forensics and Security, vol. 17, pp. 1466–1480, 2022.
[20] Y. E. Sagduyu, T. Erpek, S. Ulukus, and A. Yener, “Is semantic communication secure? a tale of multi-domain adversarial attacks,” IEEE Communications Magazine, vol. 61, no. 11, pp. 50–55, 2023.
[21] M. Shen, J. Wang, H. Du, D. Niyato, X. Tang, J. Kang, Y. Ding, and L. Zhu, “Secure semantic communications: Challenges, approaches, and opportunities,” IEEE Network, 2023.
[22] X. Liu, G. Nan, Q. Cui, Z. Li, P. Liu, Z. Xing, H. Mu, X. Tao, and T. Q. Quek, “Semprotector: A unified framework for semantic protection in deep learning-based semantic communication systems,” IEEE Communications Magazine, vol. 61, no. 11, pp. 56–62, 2023.
[23] Y. Lin, Z. Gao, H. Du, D. Niyato, J. Kang, Z. Xiong, and Z. Zheng, “Blockchain-based efficient and trustworthy aigc services in metaverse,” IEEE Transactions on Services Computing, 2024.
[24] Y. Lin, Z. Gao, H. Du, D. Niyato, J. Kang, Y. Gao, J. Wang, and A. Jamalipour, “Blockchain-based semantic information sharing and pricing for web 3.0,” IEEE Transactions on Network Science and Engineering, 2023.
[25] Y. Wang, S. Guo, Y. Deng, H. Zhang, and Y. Fang, “Privacy-preserving task-oriented semantic communications against model inversion attacks,” IEEE Transactions on Wireless Communications, 2024.
[26] S. Cheng, X. Zhang, Y. Sun, Q. Cui, and X. Tao, “Knowledge discrepancy oriented privacy preserving for semantic communication,” IEEE Transactions on Vehicular Technology, 2024.
[27] A. Zhang, Y. Wang, and S. Guo, “On the utility-informativeness-security trade-off in discrete task-oriented semantic communication,” IEEE Communications Letters, 2024.
[28] Y. Miao, Z. Liu, H. Li, K.-K. R. Choo, and R. H. Deng, “Privacy-preserving byzantine-robust federated learning via blockchain systems,” IEEE Transactions on Information Forensics and Security, vol. 17, pp. 2848–2861, 2022.
[29] M. Belotti, N. Božić, G. Pujolle, and S. Secci, “A vademecum on blockchain technologies: When, which, and how,” IEEE Communications Surveys & Tutorials, vol. 21, no. 4, pp. 3796–3838, 2019.
[30] E. Androulaki, A. Barger, V. Bortnikov, C. Cachin, K. Christidis, A. De Caro, D. Enyeart, C. Ferris, G. Laventman, Y. Manevich et al., “Hyperledger fabric: a distributed operating system for permissioned blockchains,” in Proceedings of the thirteenth EuroSys conference, 2018, pp. 1–15.
[31] G. Ateniese, R. Burns, R. Curtmola, J. Herring, L. Kissner, Z. Peterson, and D. Song, “Provable data possession at untrusted stores,” in Proceedings of the 14th ACM conference on Computer and communications security, 2007, pp. 598–609.
[32] J. Li, J. Wu, G. Jiang, and T. Srikanthan, “Blockchain-based public auditing for big data in cloud storage,” Information Processing & Management, vol. 57, no. 6, p. 102382, 2020.
[33] H. Xie, Z. Qin, and G. Y. Li, “Semantic communication with memory,” IEEE Journal on Selected Areas in Communications, 2023.
[34] C. Dwork, F. McSherry, K. Nissim, and A. Smith, “Calibrating noise to sensitivity in private data analysis,” in Theory of Cryptography: Third Theory of Cryptography Conference, TCC 2006, New York, NY, USA, March 4-7, 2006. Proceedings 3. Springer, 2006, pp. 265–284.
[35] R. Xue, K. Xue, B. Zhu, X. Luo, T. Zhang, Q. Sun, and J. Lu, “Differentially private federated learning with an adaptive noise mechanism,” IEEE Transactions on Information Forensics and Security, 2023.
[36] B. Balle and Y.-X. Wang, “Improving the gaussian mechanism for differential privacy: Analytical calibration and optimal denoising,” in International Conference on Machine Learning. PMLR, 2018, pp. 394–403.
[37] P. Kairouz, S. Oh, and P. Viswanath, “The composition theorem for differential privacy,” in International conference on machine learning. PMLR, 2015, pp. 1376–1385.
[38] K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu, “Bleu: a method for automatic evaluation of machine translation,” in Proceedings of the 40th annual meeting of the Association for Computational Linguistics, 2002, pp. 311–318.