Introduction
Tremendous growths of technologies and applications for wireless sensor networks (WSNs) have brought significantly increased demand for radio spectrum, but the traditional static spectrum allocation policies have led to the spectrum scarcity [1]–[3]. The cognitive radio sensor network (CRSN) is an emergent paradigm that efficiently alleviates spectrum scarcity and better utilizes available spectrum [4]. Equipped with the cognitive radio module, the cognitive sensor nodes (CSNs) can opportunistically access the spectrum bands licensed to the primary users (PUs) without interfering with them.
It has known that the CRSN inherits some intrinsic properties of WSNs (e.g., the limitation of hardware and energy resource). Therefore, it is difficult to maintain the connectivity and stability of CRSN when the energy consumption for transmitting a large amount of mutual information and application data leads to the invalidation of some nodes [5]. To reduce the resource overheads of spectrum sensing, routing and data transmission, clustering is widely applied to scale down network and enhance the network performance. We observe that clustering can bring about three main advantages to CRSNs, namely scalability, stability and supporting cooperative tasks [6], [7]. However, the traditional clustering algorithms (e.g., lowest ID [8], maximum node degree [9], and
A. Problem Statement
Recently, many spectrum-aware clustering algorithms have been proposed to tackle the clustering problem in CRSNs. In [16], the nodes were grouped into clusters based on a highly connected subgraph where the value of each edge was defined as the number of the common idle channels between two nodes. In [17], the nodes exchanged information, including IDs and available channel lists, with their one-hop and two-hop neighbours to form local clusters, and then a spectrum-aware routing was constructed by comparing the available channel lists of adjacent nodes. In [18], a novel non-parametric Bayesian-based channel clustering scheme was proposed to identify the supported QoS levels over multiple available licensed channels, and thus SUs could identify an appropriate cluster or set of licensed channels that fulfilled their stringent QoS requirements. In [19], SMART triggered re-clustering by merging or splitting some of clusters due to the change of available channels. In [20], all second users (SUs) sensed the available channels and gathered the information of neighbour nodes using HELLO packets. SUs checked their own maintained information set
To solve this problem, a spatial similarity model was provided to approximate the number of idle channels available to each node. The nodes with high spatial similarity thus formed a cluster [13]. In [22], some physically close nodes could sense more same available channels, so they were grouped together by measuring the clustering metric, i.e., their Euclidean distances. Considering the available channels, geographical positions and database statistics, an improved spectrum-aware clustering algorithm for centralized cognitive radio networks (CRNs) was proposed to improve the network throughput and maintain the stability of clusters [23]. On the other hand, a temporal similarity model was illustrated by a bipartite graph presented the essential property of spectrum-aware clustering [24]. When the idle channels in two sets of vertices were highly temporal correlation, the edges of these vertices could connect to form a cluster. As discussed, utilizing the spatial or temporal similarity model to form clusters is benefit to improve the network performance. Separately utilizing the spatial correlation and temporal correlation, however, these clustering methods do not always provide the most reasonable spectrum-aware clustering. Authors of [25] illuminated that sensor networks utilized clustering to aggregate gathered information at the cluster heads by exploiting correlation. Hence, clustering can exploit spatial and temporal correlation of sensor nodes’ observations on spectrum sensing. In [26], a distributed spectrum-aware clustering (DSAC) protocol formed groupwise constrained clusters, where the optimal number of clusters was derived to minimize the intra-cluster distances under the spectrum-aware constraint. In [27], the temporal-spatial correlation of sensing results was transformed into the intra-cluster and inter-cluster sensing similarity such that an optimization problem with respect to the similarity metric was presented to determine the optimal number of clusters. We see that the temporal-spatial correlation is a very noteworthy characteristic of spectrum-aware clustering, but few existing works perform spectrum-aware clustering based on temporal-spatial correlation. It is required to establish a clustering metric by modelling the temporal-spatial correlation of spectrum sensing.
The above frequently-used clustering algorithms can provide spectrum-aware clustering for CRSNs without unreliable nodes, but the established clustering structure will be broken down using the sensing results of unreliable nodes. Specifically, some malicious nodes send the tampered sensing results to attack the spectrum decisions of normal nodes, and some nodes affected by the multi-path or shadow fading problem sending the biased sensing results will disturb the spectrum decisions of their neighbours. To exclude these unreliable nodes, some researchers have defined a clustering metric with respect to the reliability of nodes as the clustering criterion. In [28], a trust aware model involving sensing reliability and energy reliability was used to clustering trustworthy nodes and elect the most credible nodes as CHs. Thus, the model could reduce the handoff rate and improve the network throughput. In [29], the reliability ratio between the number of correct decisions and the total number of taken decisions was defined as a clustering metric to elect CHs. An update formula for the reliability parameter was introduced in [27]. When the value of reliability parameter exceeded a given threshold, re-clustering was triggered to delete the unreliable nodes. However, the frequent re-clustering resultes in the instability of topological structure and much resource consumption. It is observed that reinforcing the effect of reliable nodes on spectrum-aware clustering can avoid frequent re-clustering. To quickly recognize and exclude unreliable nodes, establishing a clustering metric by modelling the sensing reliability is an important problem.
In general, most of CSNs are battery powered which is difficult to recharge or replace for some environmental reasons, so they should minimize the energy consumption of spectrum sensing operations and normal activities in order to maximize their lifetime. An algorithm “Quadrant Based Energy Efficient Clustering Hierarchy” (QBEECH) was proposed to explain that energy efficient clustering could increase both the stability and the lifetime of the network much further [30]. In [31], an energy-efficient spectrum-aware clustering algorithm was proposed to achieve clustering by allowing each member node to learn the cooperative sensing costs of its neighboring nodes. It then gave an optimal clustering that minimized the network energy consumption. Based on CogLEACH algorithm, a centralized probabilistic clustering algorithm (CogLEACH-C) was introduced, where clusterheads election depended not only on the number of sensed idle channels but also on the level of each node’ residual energy [32]. In [33], a distributed spectrum-aware clustering algorithm determined the optimal number of clusters by establishing a network-wide energy consumption model with respect to the residual energy. In [34], an energy-aware cluster-based routing protocol for CRSNs selected the gateway nodes with more energy and picked CHs due to the residual energy and available channels. Thus, the event data are routed through an energy-efficient and stable path from a source node to sink. In accord with some traditional energy-efficient clustering algorithms, the above spectrum-aware clustering algorithms have offered the same idea that the clustering metric need to involve the residual energy of nodes. In established clusters, the member nodes sensing spectrum quickly consumes their residual energy. According to the characteristic of cooperative spectrum sensing, how to reduce the energy consumption as far as possible is worth more deep exploration.
As aforementioned, to provide the optimal clustering, the clustering criterion needs to consider many influence factors rather than only the channel availability. A weight-based localized clustering algorithm partitioned the network into single-hop clusters on the basis of the channel availability, the SUs’ speed and the PUs’ interference levels [35]. In [36], a weighted sum of capacity rank, stability rank and quality rank values termed as reserved value (RV) was the final value of each link. A cluster-based MAC protocol used RV to form clusters. A network stability-aware clustering (NSAC) defined the weight by considering both spectrum dynamic and residual energy. One node with the largest weight was selected as cluster head, then this CH and its neighbors formed one cluster marked “clustered”. The remaining nodes updated their weights and repeated the above clustering process until all nodes had been marked “clustered” [37]. This sequential clustering, however, increases the delay of clustering. In this paper, we propose a new spectrum-aware clustering algorithm based on weighted clustering metric involving temporal-spatial correlation, confidence level as well as residual energy. In a distributed mode, the node with the larger clustering metric than its neighbors is elected as a qualified CH, and thus the optimal clustering is established by maximizing the intra-cluster similarity and the inter-cluster dissimilarity. After clustering, the CHs sense spectrum instead of their member nodes and share the sensing information with them, while the member nodes do not need to detect spectrum all the time that greatly reduces the energy consumption of spectrum sensing and increases the time of data transmission.
Our main contributions of this paper are summarized as follows:
Different from the traditional spectrum-aware clustering criterion mainly involving the channel availability, we introduce a novel weighted clustering metric taking into account three important influence factors, i.e., temporal-spatial correlation, sensing confidence and residual energy. Due to the weighted clustering metric, we can provide the optimal clustering according to the different application scenarios.
An optimization model measuring the intra-cluster similarity and the inter-cluster dissimilarity is established to solve the optimal number of clusters. Our clustering hierarchy based on the optimal cluster size can provide better network performance than some traditional spectrum-aware clustering methods relying on the channel availability.
After clustering, the CHs periodically sense spectrum instead of their member nodes. Thus, the member nodes can choose provisional dormancy to avoid rapidly running out of the residual energy and becoming invalidation. This cooperative spectrum sensing mode guarantees the stability of the network topology. On the other hand, the member nodes can also choose to transmit data in order to enhance the network throughput.
B. Organization
The rest of this paper is organized as follows. Section 2 introduces four evaluation criteria to propose a new weighted clustering metric for spectrum-aware clustering. Section 3 presents a clusterhead election method to elect high-performance CHs and establishes an optimization model to solve the optimal number of clusters. In Section 4, the experiments are provided to evaluate the performance of our proposed algorithm, showing the available throughout gain.
System Model
A. Sensing Structure for Spectrum-Aware Clustering
In general, a CRSN consists of a Base Station (BS) and some randomly distributed sensor nodes equipped with CR module. The spectrum resource in this area is divided into
B. New Evaluation Criteria for Clustering
To create clusters, the similarity measurement between the observations of all nodes is required. In most clustering methods, this is achieved by use of the clustering metric. Hence, the choice of an appropriate clustering metric has a great impact on the quality of clustering. Focusing on channel availability, geographical location and residual energy, we define a new clustering metric to perform more reasonable clustering. Our clustering metric is evaluated by the following criteria.
1) Spatial Correlation
The channel availabilities of all nodes are subject to their geographical locations, i.e., some geographically close nodes have higher probability in a cluster [39]. To illustrate the relationship of channel availability and geographical location, we define the spatial correlation between node \begin{equation*} S{C_{i,j,t}}= \begin{cases} {1-\dfrac {{{d_{i,j,t}}}}{{\min \{{r_{i}},{r_{j}}\}}},}&{d_{i,j,t}}< \min \{{r_{i}},{r_{j}}\}\\ {0,}&{d_{i,j,t}}\ge \min \{{r_{i}},{r_{j}}\} \end{cases}\tag{1}\end{equation*}
\begin{equation*} E(S{C_{i,t}})=\frac {1}{{n_{i,t}}}\sum \limits _{j=1}^{{n_{i,t}}}{S{C_{i,j,t}}}\tag{2}\end{equation*}
2) Temporal Correlation
Suppose that the behaviour of a PU is a semi-Markov ON-OFF process and all nodes in this PU’s coverage range simultaneously detect the idle channels. It is clear to see that the channel availabilities of these nodes are high correlation at a timestamp, so we define the temporal correlation between node \begin{equation*} T{C_{i,j,t}}=\frac {{|{c_{i,t}}\cap {c_{j,t}}|}}{n_{c}}\tag{3}\end{equation*}
\begin{equation*} E(T{C_{i,t}})=\frac {1}{{{n_{i,t}}}}\sum \limits _{j=1}^{{n_{i,t}}}{T{C_{i,j,t}}}\tag{4}\end{equation*}
3) Confidence Level
As discussed, without shadowing problem or unsuspecting interference, the channel availabilities of the geographically close nodes have high spatial and temporal correlation so that the sensing confidences of these nodes also have high correlation. To evaluate all nodes’ initial confidences before the first round of clustering, the BS broadcasts a Training start message. After receiving the message, each node starts to detect spectrum and sends sensing result to BS in multi-hop manner. Then, the BS makes a global decision by the majority criterion [40] and broadcasts its decision. In the Training stage, a confidence level is defined to measure the similarity of sensing results between node \begin{equation*} C{L_{i,0}}=\frac {{|{c_{i,0}}\cap {c_{g,0}}|}}{{|{c_{g,0}}|}}\tag{5}\end{equation*}
\begin{equation*} C{L_{i,t}}=\frac {{|{c_{i,t}}\cap {c_{k,t}}|}}{{|{c_{k,t}}|}}\tag{6}\end{equation*}
\begin{equation*} E(C{L_{i,t}})=\frac {1}{T}\sum \limits _{t=1}^{T}{C{L_{i,t}}}\tag{7}\end{equation*}
4) Residual Energy Level
One of the main challenges of spectrum-aware clustering is high energy consumption especially when exchanging sensing results and making local spectrum decision. Such issue becomes more challenging for the energy limited CRSNs, so the energy-efficient clustering is necessary to prolong network lifetime [41], [42]. Because the CHs need more energy to make local decision and allocate the sensed idle channels for member nodes, a node with higher residual energy is more likely to become a CH. To evaluate the energy of all nodes, the residual energy level for node \begin{equation*} RE{L_{i,t}}={e_{i,t}}/{e_{\max }}\tag{8}\end{equation*}
Spectrum-Aware Clustering Algorithm Based on Weighted Clustering Metric
A. Clusterhead Election
The proposed evaluation criteria allow to clustering nodes involving various influence factors, which contributes to obtain the optimal spectrum-aware clustering. Now, we apply these evaluation criteria to define the following weighted clustering metric for node \begin{align*} WCM_{i,t}\!=\!{w_{1}}(E(S{C_{i,t}})\cdot E(T{C_{i,t}}))+{w_{2}}E(C{L_{i,t}})\!+\!{w_{3}}RE{L_{i,t}}\!\!\!\!\! \\ {}\tag{9}\end{align*}
Using WCM to elect CHs and form clusters, the clustering process is described as follows. First, each node senses spectrum and exchanges the sensing information with its one-hop neighbours, followed by computing WCM and sending a WCM message to its neighbours. Then, after receiving multiple WCM messages, each node compares its WCM with all received messages. If the values of all received WCM s are less than its own, the node declares itself as a CH and broadcasts a CH Announcement message; otherwise, the node that cannot declare itself as a CH joins a one-hop cluster, whose clusterhead has the largest value of WCM. The node that does not receive any CH Announcement message elects itself as a CH. Finally, the
B. Optimal Number of Clusters
For cluster-based CRSN, the cluster size (i.e., the number of nodes in a cluster) has great impact on cooperative tasks such as channel allocation, channel switch and data routing. The small cluster size (i.e., the large number of clusters) inevitably incurs high routing overhead and transmission delay. On the other hand, the large cluster size (i.e., the small number of clusters) leads to less common idle channels in a cluster. Therefore, it is very important to investigate whether grouping
To solve the optimal number of clusters, we need to define the intra-cluster similarity and the inter-cluster dissimilarity. Suppose that the \begin{equation*} IC{R_{k,t}}=\begin{cases} \displaystyle {\dfrac {1}{m_{k}}\sum \limits _{i=1}^{m_{k}} {S{C_{i,k,t}}\cdot T{C_{i,k,t}},}}&{m_{k}>0}\\ {0,}&{m_{k}=0} \end{cases}\tag{10}\end{equation*}
\begin{equation*} UN{R_{k,t}}=\frac {1}{K-1}\sum \limits _{l=1,l \ne k}^{K}{(1-SC_{l,k,t}\cdot TC_{l,k,t})}\tag{11}\end{equation*}
\begin{equation*} \max \limits _{K} L(K)=\frac {1}{K}\sum \limits _{k=1}^{K} {IR{A_{k,t}}}+\frac {1}{K}\sum \limits _{k=1}^{K} {UN{R_{k,t}}}\tag{12}\end{equation*}
Note that our proposed algorithm has grouped
From the above discussion, our proposed algorithm solving the optimal number of clusters is summarized as follows.
After clustering, the BS constructs a shortest path tree routing for all CHs according to their reported locations, and then the CRSN steps into the Data Transmission stage. In intra-cluster range, each CH broadcasts a Transmission start packet containing the channel assignment message and the transmission time. Each member node transmits data to its CH after it receives the Data Transmission start packet. In inter-cluster range, all CHs transmit the collected data to BS by the established routing. In the interval of data transmission, all CHs periodically sense spectrum instead of all member nodes. Once the channle availability changes in the monitoring area, the CRSN will trigger the local or global re-clustering to establish a new optimal clustering in terms of the CHs’ sensing results. As such, the high-efficiency spectrum monitoring based on the optimal clustering is a key to implement high-performance dynamic spectrum sharing and high-speed data transmission.
Experimental Results and Analyses
In this section, the feasibility and effectiveness of our proposed algorithm are studied and analyzed by the following extended simulation experiments in MATLAB platform.
A. Performance Analysis of Spectrum-Aware Clustering Algorithm
We consider a regular cognitive radio sensor network where the PUs and the CSNs share radio spectrum by CSNs sensing and accessing the PUs’ idle channels. The experimental scenario is set as follows: 3 PUs and 50 sensor nodes are randomly distributed in a square area with size
Comparison of clustering results for different spectrum-aware clustering algorithms. (a) Clustering using typical spectrum-aware clustering algorithm (b) Clustering using our proposed algorithm.
B. Verification of the Optimal Clustering
We next try to verify our spectrum-aware clustering is optimal. Applying Algorithm 1 to the clustering result in Figure 3(b), Figure 4 shows that the objective function
Algorithm 1 The Optimal Number of Clusters for Spectrum-Aware Clustering Algorithm Based on Weighted Clustering Metric
At the
while
for each cluster
calculate
end
calculate
find one pair clusters
merge two clusters and elect a new clusterhead
for each cluster
calculate
end
calculate
if
else if
end
end
the optimal number of clusters
C. Evaluation for Network Performance
After forming the optimal clustering, the member nodes receive the Data Transmission start packet from their CHs and start to transmit data. To test the cluster-based network performance, the following simulation experiments set the system parameters based on IEEE 802.15.4 as follows. We perform a round of the PU
Comparison of the network performance for the periodical spectrum sensing algorithm and our proposed algorithm after clustering. (a) Amount of data transmission for different number of nodes (b) Energy consumption of spectrum sensing and clustering for different number of nodes.
D. Influence of Three Weight Coefficients
From the implement processes of our spectrum-aware clustering algorithm, we observe that the WCM plays an important role in clusterhead election and cluster formation. Furthermore, three weight coefficients
The following experiments test the performance of our proposed algorithm with respect to three weight coefficients according to the different application scenarios. To achieve the most compact hierarchical structure for a CRSN, we set
Comparison of spectrum-aware clustering for different linear combination modes of three weight coefficients. (a)
To evaluate the network performance for the different linear combination modes of three evaluation factors, we define the average amount of data transmission and the average energy consumption by performing Monte Carlo experiment. Following the above experiments, Figure 7(a) shows that the average number of clusters for four linear combination modes greatly increases with the increasing of the number of nodes, which leads to the notably increasing of the average energy consumption in Figure 7(c). The average energy consumption with respect to
Comparison of the network performance for the different values of three weight coefficients. (a) Number of clusters for the different number of nodes (b) Amount of data transmission for the different number of nodes (c) Average energy consumption of data transmission for the different number of nodes.
E. Performance Comparison of Different Algorithms
The next experiment compares the performance of our proposed algorithm with two existing spectrum-aware clustering algorithms in terms of the number of clusters. Figure 8 illustrations the relation between the number of nodes and the number of clusters for three algorithms. It can be seen that the number of clusters for three algorithms increases with the increasing of the number of nodes, where the proposed algorithm using
Performance comparison of the proposed algorithm with RARE and cluster-based protocol. (a) Number of clusters for the different number of nodes (b) Average energy consumption of data transmission for the different number of nodes.
Conclusion
The spectrum-aware clustering algorithm can better group nodes than some typical clustering algorithms by well exploiting the cognitive characteristic of CRSNs. Different from the traditional spectrum-aware clustering algorithms, a novel spectrum-aware clustering algorithm based on weighted clustering metric is proposed to take account into various clustering criteria including temporal-spatial correlation, sensing confidence and residual energy. Comparing the WCM of all nodes, the optimal spectrum-aware clustering is formed to efficiently enhance the network connectivity of a CRSN. After clustering, our proposed algorithm involving CHs sensing spectrum instead of all member nodes provides an energy-efficient spectrum sensing method. The experiments show that the data transmission and energy consumption are greatly improved under the optimal spectrum-aware clustering. In the future work, we will test the proposed algorithm in some application scenarios, e.g., smart community, internet of vehicles and ecological monitoring.