1. Introduction
With rapid advancement in electronics industry, small inexpensive battery-powered wireless sensors have already started to make an impact on the communication with the physical world. WSN consists of large number of low cost devices to gather information from the diverse kinds of physical phenomenon. The sensors can monitor various entities such as: temperature, pressure, humidity, salinity, metallic objects, and mobility; this monitoring capability can be effectively used in commercial, military, and environmental applications [
1,
2,
3,
4,
5,
6,
7,
8,
9,
10,
11]. For these sensor network applications, most research has discussed problems by the deployment of large number of low-cost homogeneous devices. But in practical applications, in order to meet the demands of various applications for the technologies of sensor networks, increasing attentions have been attracted to the researches on heterogeneous WSNs [
12]. Heterogeneous WSN is composed of different types of sensor nodes, which are in a wide range of applications [
13,
14]. In fact, the heterogeneity is common in the WSNs [
15]. The priority should be given to reduce energy dissipation in network operation, improve network load and stability, and prolong network lifetime.
Compressive sensing (CS) [
16,
17,
18] is a collection of recently proposed sampling methods in information theory. The combination of CS theory with WSNs holds promising improvements to the lifetime of wireless sensors, whereas it reduces global scale communication cost without introducing intensive computation or complicated transmission control. CS states that sparse signal of information in WSNs can be exactly reconstructed from a small number of random linear measurements of information in WSNs [
19]. CS provides a new approach to mathematical complexities especially when sparse information is applied. CS tends to recover data vector
with
N number of information form data vector
with
M number of information such that
[
20]. This will result in extending the lifetime of the sensor network.
Energy consumption in networks can be effectively reduced by organizing clustering sensor nodes, so many energy-efficient routing protocols are designed on the basis of the clustering structure. Currently, a number of distributed clustering protocols are proposed. In accordance with the networks, homogeneous or heterogeneous, to which the protocols are adaptive, clustering protocols can be categorized into homogeneous clustering protocols which assume that all the sensor nodes are identical and heterogeneous clustering protocols which assume that the sensor nodes have different capabilities. Due to the dynamic and complex nature of energy configuration and network evolution, it is very difficult to design a clustering protocol which can save energy and provide reliable data transmission in heterogeneous networks.
To overcome the above problems, we propose a new protocol EDACP-CS for heterogeneous WSNs. EDACP-CS includes two phases: the cluster head arrangement phase and the routing phase. In cluster head arrangement phase, all the sensor nodes organize themselves into clusters with one node elected as a cluster head (CH) according to their weighted election probabilities and their distance to the base station (BS) to determine near nodes and far nodes in order to give more chance to nearest nodes to be CHs by modifying the election probability value for each type of nodes. In the routing phase, after receiving the data from the member nodes, each CH compresses the collected data using CS and transfers it to the BS. Simulation results show that our protocol can aggregate data efficiently and reduce energy consumption greatly, prolonging the stability period and the lifetime of the whole network strongly depend on higher values of extra energy during its heterogeneous settings and different transmission ranges.
The remainder of the paper is organized as follows:
Section 2 presents the related work of the proposed protocol. In
Section 3, we introduce the proposed system model. Simulation results and its discussion are presented in
Section 4. Finally, the conclusion of our work is given in
Section 5.
2. Related Work
Recently there has been a growing interest in WSNs. One of the major issues in WSN is developing an energy-efficient routing protocol. Since the sensor nodes have limited available power, energy conservation is a critical issue in WSN for nodes and network life. The issue of heterogeneity (in terms of energy) of nodes is addressed in [
21]. In [
22], the proposed protocol is based on random selection of CHs weighted according to the remaining energy of the node. This approach addresses the problem of varying energy levels and consumption rates but still assumes that the BS can be reached directly by all the nodes.
In [
23], the authors provided the optimal heterogeneous sensor deployment that minimizes the deployment cost in different communication modes. In their model, the cost of the cluster head device is determined by the amount of initial battery energy, which depends on the number of cluster members and communication mode. They do not consider the sensing coverage and aging process over time.
Low-Energy Adaptive Clustering Hierarchy (LEACH) [
9] is one of the most popular distributed cluster-based routing protocols in WSNs. The operation of LEACH is generally separated into two phases, the set-up phase and the steady-state phase. In the set-up phase, CHs are selected and clusters are organized, each node decides whether to become a CH for the current round. This decision is based on a predetermined fraction of nodes and the threshold
, which is given by:
where
is the predetermined percentage of CHs,
r is the count of current round and
G is the set of sensor nodes that have not been CHs in the last
rounds. Using this threshold, each node will be a CH at a round within
rounds. After
rounds, all nodes are once again eligible to become CHs. In this way, the energy concentration on CHs is distributed. LEACH does not consider the residual energy of each node so the nodes that have relatively smaller energy remaining can be selected as CHs. This makes the network lifetime shortened. In the steady-state phase, the actual data transmissions to the BS take place. After the steady-state phase, the next round begins.
In [
24], the authors proposed a new routing protocol and data aggregation method in Leach-heterogeneous system where the sensor nodes form the cluster and the CH elected based on the residual energy of the individual node calculation with re-clustering scheme is adopted in each cluster of the WSNs. The proposed protocol works with 3 types of nodes (normal, advanced and super), CHs are selected on the basis of their residual energy and their distances to the BS.
CS is a novel sensing/sampling paradigm that goes against the common wisdom in data acquisition, the theory of CS extends traditional sensing and sampling systems to a much broader class of signals. The promise of CS is that a sparse or compressible signal can be recovered from a small salient set of projections. To make this possible, there are two principles [
16]: sparsity, which pertains to the signals of interest, and incoherence, which pertains to the sensing modality. Under CS framework, any compressible signal
can be represented in the form of:
where
is the transform matrix and α is the sparse representation of
X. There are no more than
K nonzero entries in vector α, where
K is much smaller than
N, and the signal
X is called
K sparse signal. In contradiction to sparsity, incoherence means that the measurement matrix Φ has dense representation in basis Ψ, and Φ is independent to Ψ. In practice, most of natural signals are sparse or near sparse, and they can be recovered from their compressible samples. For a signal
, we can obtain its
M linear observations, as in
Figure 1.
Figure 1.
The scheme of compressive sampling.
Figure 1.
The scheme of compressive sampling.
If
satisfies the restricted isometry property (RIP) [
16], condition
such that
c is a small constant with
, the vector α can be accurately recovered from
Y as the unique solution of:
The original networked data
X may be sparse itself or can be sparsified with a suitable transform such as Discrete Cosine Transform or Discrete Wavelet transform [
25,
26,
27]. One example of the self-sparse
X is a linear combination of just
K basis vectors, with
, that is; only
K are nonzero and
are zero [
28]. Usually, the networked data vector
X is sparse with a proper Ψ in Equation (
2). Most of the previous works [
25,
26,
27] consider a regular sensor network. However, sensors are usually deployed on an irregular grid. So it is expected to find some sparse representations to sparsify irregular grid sensor networked data. In WSNs, sampling matrix Φ is usually pre-designed,
i.e., each sensor locally draws
M elements of the random projection vectors by using its network address as the seed of a pseudorandom number generator. Based on CS theory, Jia
et al. [
28] considers a sparse event detection scenario where the channel impulse response (CIR) matrix is used as a natural sampling matrix. In [
29], a basic global superposition model to obtain the measurements of sensor data is proposed, where a sampling matrix is modeled as the channel impulse response (CIR) matrix while the sparsifying matrix is expressed as the distributed wavelet transform. In [
30], compressive distributed sensing using random walk CDS(RW) algorithm is proposed that uses rateless coding. In our proposed protocol we use CS to fuse data efficiently in EDACP-CS and consider three types of nodes and different transmission ranges to achieve a robust self configured WSN that maximizes lifetime.
In [
31], energy efficient clustering and data aggregation protocol for heterogeneous WSNs (EECDA) has been proposed. EECDA combines energy efficient cluster based routing and data aggregation for improving the performance in terms of lifetime and stability. However, in the proposed protocol we discuss effectively the aggregation using CS and consider the heterogeneity of nodes due to energy as well as transmission ranges. We assume that CHs are randomly selected based on their residual energy and the distance form the BS. In [
32], the authors have proposed a novel data collection algorithm using compressive sensing (CS data collection). In CS data collection, each sensor node only communicates with its neighbor node in one hop. However, in the proposed protocol all sensor nodes organize themselves into clusters with one node elected as a CH based on their weighted election probabilities and their distance to the BS.
All the above protocols do not consider efficient data compression and link heterogeneity. In the proposed protocol EDACP-CS, each sensor node independently elects itself as a CH based on its residual energy and its distance to the BS. Therefore, nodes that are closer to the BS and contain more energy than the other nodes have more chance to be selected as a CH for current round. EDACP-CS introduces a new scheme to combine clustering strategy with CS theory for increasing both energy and stable period constrains under heterogeneous environment in terms of node energy and transmission range. Simulation shows that the network lifetime and the stability period are much better in the proposed protocol than CS data collection, EECDA and LEACH-Hetero protocols.
3. System Model
In EDACP-CS protocol, we consider heterogeneous WSN with nodes of different energy levels and transmission ranges. Sensor nodes are divided into three categories (normal, advanced and super) nodes as shown in
Figure 2. The energy consumption of advanced is less than that of normal and the energy consumption of super is less than that of advanced.
Figure 2.
A heterogeneous wireless sensor network.
Figure 2.
A heterogeneous wireless sensor network.
3.1. Network and Energy Model Assumptions
In EDACP-CS, we use the simplified energy model proposed in [
33]. According to the radio energy dissipation model illustrated in
Figure 3, in order to achieve an acceptable Signal-to-Noise Ratio (SNR) in transmitting an
bit message over a distance
d, the energy expended by the radio is given by:
where
is the energy dissipated per bit to run the transmitter or the receiver circuit,
and
depend on the transmitter amplifier model we use, and
d is the distance between the sender and the receiver. By equating the two expressions at
, we have
. To receive an
bit message the radio expends
.
We make some assumptions about the sensor nodes and underlying network model as follows:
All sensor nodes are stationary and uniformly distributed within a square field,
Communication among sensor nodes is based on single-hop approach,
Networked data vector is sparse or highly compressible in Distributed Wavelet Transform (DWT) domain, i.e., it contains K largest coefficients. Setting the rest coefficients zero will not cause much information loss,
A WSN consists of heterogeneous nodes in terms of node energy and transmission range.
Figure 3.
Radio Energy Dissipation Model.
Figure 3.
Radio Energy Dissipation Model.
3.2. Optimal Number of Clusters
We assume that an area
square meters over which
N nodes are uniformly distributed. For simplicity, assume the sink is located in the center of the field, and that the distance between any node to the BS or its CH is
, where
is the maximum transmission range. We consider that after deployment, the BS broadcasts a “hello” message to all the nodes at a certain power level. Each node can compute its approximated distance
to the BS based on the received signal strength. The average distance
between nodes and the BS is given by:
The value of
can be approximated as:
where
is the average distance between a cluster member and its CH and
is the average distance between the CH and the BS. The energy dissipated in the CH node during a round is given by the following formula:
where
C is the number of clusters,
Y is the compressed data. The energy dissipated by a non-CH node is given by:
Using Euclidian metric, the area occupied by each cluster will be
with node distribution
:
Assuming the area is a circle with radius
,
is constant, and the density ρ is uniform where
,
can be simplified as follows:
The energy dissipated in a cluster per round is given by:
The total energy dissipation in the network per round will be the sum of the energy dissipation by all clusters,
i.e.,
By differentiating
with respect to
C and equating to zero, the optimal number of constructed clusters can be found:
where, the average distance from a CH to the BS
is given by [
34] such that
:
If the distance of a significant percentage of nodes to the BS is greater than
then, following the same analysis as in [
33] we will obtain:
By substituting in Equation (
13) and differentiating
with respect to
C and equating to zero, we will find:
The optimal probability of a node to become a CH,
, can be computed as follows:
The optimal construction of clusters (which is equivalent to the setting of the optimal probability for a node to become a CH) is very important. In [
9], the authors showed that if the clusters are not constructed in an optimal way, the total consumed energy of the WSN per round is increased exponentially either when the number of the constructed clusters is greater than the optimal number of clusters or especially when the number of the constructed clusters is less than the optimal number of clusters. If the number of the constructed clusters is less than the optimal number of clusters, some nodes in the network have to transmit their data very far to reach the CH, this will increase the energy of the system. If the number of the constructed clusters is greater than the optimal number of clusters, the total routing traffics within each cluster will be reduced because of fewer members, however, more clusters will result in more than one-hop transmissions from the CHs to the BS also the CHs will receive data from fewer members this will reduce the local data aggregation being performed and increase the communications among the CHs.
3.3. Cluster Head Election Phase
The optimal probability of a node to become a CH is equivalent to the optimal construction of clusters. This clustering is optimal in the sense that energy consumption is well distributed over all sensors and the total energy consumption is minimal. Such optimal clustering highly depends on the energy model that we use.
In EDACP-CS, the nodes with less energy than the others and the nodes with more distance from the BS than the others have the smallest chance to be selected as a CH for current round. Let us assume
is the initial energy of each normal sensor node,
m is the fraction of advanced nodes among normal nodes which are equipped with α times more energy than the normal nodes, and
is the fraction of super nodes among advanced nodes which are equipped with β times more energy than the normal nodes. Note a new heterogeneous setting has no affect on the spatial density of the network so the setting of
does not change. On the other hand, due to heterogeneous nodes the net energy of the network is changed as the initial energy of each super node becomes
and each advanced node becomes
. Therefore, the total (initial) energy of the new heterogeneous setting is given by Equation (
19):
Hence, the total energy of the system is increased by a factor of In order to optimize the stable region of the system, the new epoch must become equal to because the system has times more energy.
Virtually there are nodes with energy equal to the initial energy of a normal node. In order to maintain the minimum energy consumption in each round within an epoch, the average number of CHs per round per epoch must be constant and equal to . In the heterogeneous scenario the average number of CHs per round per epoch is equal to (because each virtual node has the initial energy of a normal node). Therefore, the weighed probabilities for normal, advanced and super nodes according to residual energy and the distance from the BS are, respectively:
If distance
we take:
In Equation (
1), we replace
by the weighted probabilities to obtain the threshold that is used to elect the CH in each round. We define
as the threshold for normal nodes,
the threshold for advanced nodes and
the threshold for super nodes. Thus, for normal nodes, we have:
where
r is the current round,
is the set of normal nodes that have not become CHs within the last
rounds of the epoch, and
is the threshold applied to a population of
normal nodes. This guarantees that each normal node will become a CH exactly once every
rounds per epoch, and that the average number of CHs that are normal nodes per round per epoch is equal to
Similarly, for advanced and super nodes, we have:
3.4. Cluster Head Arrangement Phase
In this phase cluster construction occurs. For every transmission round each node calculates the probability threshold and chooses a random number between 0 and 1. If the number is less than threshold , the node becomes a CH during the current round. The CHs then broadcast the message to the network and declare themselves as CHs. After this message, each regular node chooses its closest CH with the largest received signal strength and then informs the CH by sending a join cluster message. The CH sets up a TDMA schedule and transmits it to the nodes in the cluster then the set up phase is completed and the next phase begins.
3.5. Routing Phase
Once the clusters are formed and the TDMA schedule is fixed, the data transmission phase can begin. The sensor nodes periodically collect the data samples
(networked data) [
35], and transmit it during their allocated transmission time to the CH. We assume that the sensed data is highly correlated in space domain. We use distributed wavelet transform (DWT) to sparsify the networked data
X and DWT is applied to the sampled data. DWT [
36] is successfully applied to sparsify the network data [
35,
36] acquired by the sensors deployed in an irregular grid. Once the BS knows the locations of all sensor nodes, DWT basis can be computed. DWT replaces the 2-D set of measurements with a set of transform coefficients that, for piecewise smooth fields, are sparser than the original data:
where
is the transform coefficient vector which contains
nonzeros, and
is the DWT basis. After receiving the data from the cluster members, the CHs compress the collected data using CS. The received signal vector at CH can be written as:
where Φ is the sampling matrix whose entries are
Gaussian with zero mean and unit variance. Subsequently CHs transmit measurements
Y to the BS independently. Finally, the BS decodes the networked data
X from
Y using the basis pursuit solver in Sparselab toolbox of Matlab.
It is well known that transmission on wireless channels is much more error prone than on wired channels. Physical phenomena like reflection, diffraction, and scattering of waveforms, partially in conjunction with moving nodes or movements in the environment, lead to fast fading and inter-symbol interference. Path loss, attenuation, and the presence of obstacles lead to slow fading. In addition, there are noise and interference from other nodes/systems working in overlapping or neighboring frequency bands. Thus, transmission errors are inherent in wireless communications because of these instability of wireless channels, which is due to many reasons, for example, channel fading, time-frequency coherence, and inter-band interference [
37]. To overcome the problem of recovery of transmitted data packet in case of error, we propose to resend the corrupted portion of the packet as illustrated in
Figure 4.
Figure 4.
Data communication in case of error.
Figure 4.
Data communication in case of error.