9
Artificial Intelligence Application to Cognitive Radio Networks

Badr BENMAMMAR and Asma AMRAOUI

Abou Bekr Belkaid University, Tlemcen, Algeria

9.1. Introduction

In wireless networks, field resources in terms of frequency and bandwidth availability are increasingly scarce. Consequently, new solutions are required to minimize the energy consumption and optimize the allocation of radio resources.

For flexible access to the spectrum, Mitola and Maguire (1999) introduced cognitive radio (CR) relying on software-defined radio. Software-defined radio is a radio that can realize in software form the typical functions of the radio interface generally realized in a hardware form, such as the carrier frequency, signal bandwidth and modulation. Indeed, Mitola and Maguire (1999) combined their software-defined radio experiences, as well as their passions for machine learning (ML) and artificial intelligence (AI) to set up CR technology. According to Mitola (2000), a CR is able to know, perceive and learn from its environment, and then act in order to simplify the user’s life. In 2005, Haykin (2005) reviewed the concept of CR and dealt with it as brain-empowered wireless communication.

CR is a technology that detects the environment, analyzes its transmission parameters, and then makes the decisions related to resource allocation and management. However, the formulations of optimization for resource allocation offer optimal solutions sometimes at the expense of the computing time and the processing complexity. To reduce this complexity and obtain resource allocation within reasonable, the cognitive radio network (CRN) must be equipped with learning and reasoning capacities. The cognitive engine must coordinate the CR actions using ML techniques.

Consequently, a CR must be intelligent and able to learn from its experience by interacting with its radiofrequency environment. Therefore, learning is an essential element of CR, which can be provided using AI and ML techniques. Indeed, applying AI, and in particular ML to CRN, has recently attracted considerable interest in the literature.

Learning aims to enable machines to carry out similar tasks to those of an expert. The intelligent machine senses its environment and takes measures to maximize its own utility. AI focuses on deduction, reasoning, problem solving, knowledge and learning representation (Woods 1986).

The learning process in the CRN is illustrated in Figure 9.1 and can be presented as follows (Abbas et al. 2015):

  1. – detection of radiofrequency parameters such as the channel quality;
  2. – observation of the environment and analysis of its reactions;
  3. – learning;
  4. – conservation of decisions and observations for model update;
  5. – decision on resource management problems and consequent transmission error adjustment (Bkassiny et al. 2013; Russell 2016).

There are several works in the literature that focus on the application of AI to CRN. In Zhao and Morales-Tirado (2012), the authors introduced the use of AI and ML techniques in CR. They also presented the possible applications and the fundamental ideas of CR. In Bkassiny et al. (2013), the authors presented a survey on the various AI techniques, such as reinforcement learning (RL), game theory (GT), neural networks (NNs), support vector machines (SVM) and Markov models (MM). The survey discusses the strengths and weaknesses of these techniques, as well as the difficulties encountered in their applications in the CR field. In Gavrilovska et al. (2013), the authors studied the GT, RL and reasoning approaches such as Bayesian networks (BNs), fuzzy logic (FL) and case-based reasoning (CBR) in CRN. The survey presented by Abbas et al. (2015) was dedicated to FL, genetic algorithms (GA), NNs, GT, RL, SVM, CBR, decision trees (DT), BNs, MMs, multiagent systems (MASs) and artificial bee colonies algorithms.

Schematic illustration of the learning process in cognitive radio networks.

Figure 9.1. Learning process in cognitive radio networks (Abbas et al. 2015)

It can however be noted that the article by Abbas et al. (2015) did not discuss the application of particle swarm optimization (PSO), a commonly used metaheuristic in CR. Other more recent metaheuristics that are also used in CR were ignored by Abbas et al. (2015), such as the firefly algorithm (FA), cuckoo search (CS) and the gravitational search algorithm (GSA).

This chapter focuses on the AI techniques that were most commonly used during the last three years in CR (the state-of-the-art of work before 2015 was already done in Abbas et al. (2015)).

Our focus is on work that was not discussed in previous surveys, such as the FA, CS, GSA and PSO. We also present recent work related to the application of other AI techniques in CR, namely GA, bee colony algorithms, FL, GT, NNs, MMs, SVM, CBR, DT, BNs, MASs and RL. Altogether, 17 AI techniques will be studied in this chapter.

The essential points dealt with in this chapter relate to:

  • – the presentation of a complete study on AI techniques, their definitions and their applications to CR. It is worth noting that some techniques have never been discussed before in this field;
  • – the presentation of the main tasks of CR and their corresponding challenges;
  • – the categorization of the presented techniques depending on the type of learning (supervised or unsupervised) and their applications depending on the CR tasks.

The rest of this chapter is organized as follows. Section 9.2 presents the cognition cycle, the main CR tasks and their corresponding challenges. Section 9.3 offers a state-of-the-art on the application of AI methods to CR. Section 9.4 proposes a categorization of the techniques presented depending on the type of learning (supervised or unsupervised) and presents their applications depending on the CR tasks. Finally, section 9.5 is dedicated to the conclusion.

9.2. Cognitive radio

9.2.1. Cognition cycle

A CRN follows the cognitive cycle in order to optimize its performances (see Figure 9.1). It starts with sensing the environment, then goes on with analyzing external parameters and ends by making decisions related to the allocation and dynamic management of resources, in order to improve spectral efficiency (Biglieri et al. 2013).

Sensing the environment: in the CR, the secondary network can use the available spectrum, but without causing interference in the primary network. Consequently, the secondary network must first detect the parameters of its environment, such as the availability of spectrum holes in the frequency.

Analysis of parameters of the environment: the detected environment parameters are used as inputs for resource management. The latter may include energy consumption minimization, interference minimization, throughput optimization, improvement of quality of service (QoS) and maximization of spectral efficiency (Wyglinski et al. 2009).

Decision making: in CR, decision making can rely on optimization algorithms. However, in order to reduce complexity and get a resource allocation within reasonable time, CRN use ML and AI (Qiu et al. 2012).

9.2.2. CR tasks and corresponding challenges

CR generally relies on two main tasks:

  • the “cognitive” task, which can be obtained by spectrum detection techniques. The main challenges for these techniques are the accuracy of the decision concerning spectrum availability, the detection duration, the detection frequency and the uncertainty on the power of ambient noise, particularly at low signal-to-noise ratio (SNR) due to multipath attenuation and shadowing. To improve spectrum detection performances, cooperative detection and geolocalization technologies were proposed in the literature (Ghasemi and Sousa 2008; Wang and Liu 2011; Umar and Sheikh 2012);
  • the “reconfigurable” task, which is used to dynamically adjust the transmission parameters in order to improve network performances. It relies on decision making, which is based on optimization algorithms. The main challenge of this task concerns the complexity and the convergence of these techniques within a limited time. This can be solved using AI and ML techniques in order to build learning models for decision making. Consequently, the choice of a learning technique for conducting a specific CR task is in itself considered a challenge.

9.3. Application of AI in CR

9.3.1. Metaheuristics

In the literature, metaheuristics are classified into two subcategories: those with a single solution and those that are population based. Single solution metaheuristics are iterative approaches that start with a single initial solution and improve it with each iteration taking advantage of its neighborhood. Population-based metaheuristics explore the research space using a set of solutions known as “population”. This latter category is also divided into two subcategories: evolutionary algorithms and algorithms based on swarm intelligence.

Evolutionary algorithms draw their inspiration from the natural evolution of species, and more precisely from the natural selection principle stated in the theory of evolution, developed by Darwin (2009) in his work entitled On the Origin of Species: By Means of Natural Selection or the Preservation of Favored Races in the Struggle for Life. The algorithms based on swarm intelligence were introduced for the first time by Gerardo and Wang (1993) in their work entitled “Swarm intelligence in cellular robotics systems”, which describes the behavior of a group of robots cooperating to accomplish a task or solve a problem.

The following section presents the application of six metaheuristics most commonly used in CRN, namely FA, CS, bee colony algorithms, GA, GSA and PSO.

9.3.1.1. Firefly algorithm

The FA is an optimization approach based on swarm intelligence that was proposed by Xin-She Yang in 2008 (Surafel and Ngnotchouye 2017). Its principle is inspired by the lighting behavior of fireflies. Indeed, each potential solution is assimilated with a firefly, whose luminosity is proportionally related to its quality (quality of the solution). The following works dealt with the application of FA to Cognitive Radio Network (CRN).

The FA was adapted by the authors in a CRN based on orthogonal frequency-division multiplexing (OFDM) (Saoucha and Benmammar 2017). Multiobjective optimization was also used to optimize the quality of communication between secondary users (SU). The performances of the FA were validated through a comparison with the PSO algorithm and with the cross entropy in terms of convergence speed, quality of the solution and stability.

The FA was used in Tounsi and Babes (2017) to solve the problem of power control and channel allocation in CRN. A modified version of the FA, using the new attractiveness factor, was proposed to solve this problem. A theoretical analysis was presented in this article to prove the efficiency and the existence of Nash equilibrium concerning the proposed strategy. The results presented in this article show that the proposed method outperforms the approaches in the literature in terms of convergence speed.

To estimate the amplitudes of cancellation subcarriers, the authors in Elahi et al. (2017) proposed two search algorithms: the GA and the FA. According to the simulation results, the proposed algorithms enable a better reduction of the secondary lobes compared to the techniques currently mentioned by the literature.

In Ghanem et al. (2016), the problem of attack based on primary user emulation (PUE) is solved by means of a localization defense model based on the use of the FA. The CR users cooperate to detect and localize the attacker comparing its location with the position of the primary users (PU). The simulation results are compared with existing methods and prove that the FA reduces the localization error and requires fewer SUs to cooperate.

9.3.1.2. Cuckoo search

CS is an optimization approach based on swarm intelligence, which was proposed by Yang and Deb (2017). Its development draws inspiration from the parasitic behavior of certain bird species. In the real world, cuckoos lay their eggs in the nests of other bird species. In most cases, the host bird believes that the laid eggs are hers and, consequently, takes care of them. However, in certain cases the eggs laid are discovered and discarded by the host bird or the host bird leaves the nest.

The following works focus on the application of CS in the CRN.

A new method for the OFDM-based estimation of the channel state in a CRN was proposed in Manjith (2016). The method is hybridization between bacterial foraging optimization (BFO) and a modified CS algorithm.

In Kaur et al. (2018a), the authors proposed a new multiuser CR system, as well as its optimization by means of CS algorithm. The transmission parameters of several SUs are considered according to the IEEE 802.22 WRAN standard. The optimization results were compared to another efficient optimization technique based on biogeography and simulated annealing.

9.3.1.3. Bee colony algorithm

Various groups of researchers independently participated in the development of bee algorithms during the last 10 years. Tovey, at Georgia Tech, in collaboration with Nakrani, at the University of Oxford, proposed the honey bee algorithm for the first time in 2004 (Nakrani and Tovey 2004). The virtual bee algorithm was created in 2005 by Xin-She Yang at the University of Cambridge to solve numerical optimization problems (Yang 2005). Haddad et al. (2006) presented the honey bees mating optimization algorithm.

The artificial bee colony (ABC) algorithm was developed in 2005 by Karaboga (2005) for the optimization of numerical functions.

The following works focused on the application of bee colony algorithms in CRN.

A new spectral handover algorithm based on the ABC algorithm in a CRN was proposed in Bayrakdar and Calhan (2018). In the algorithm proposed by the authors, the spectrum availability characteristic is observed on the basis of bee missions in order to minimize the spectral handover delay and maximize the probability of finding an inactive channel. The main advantage of this algorithm is that the delay of the spectral handover of SUs is considerably reduced for a different number of users, without reducing the probability of finding an available channel.

A hybrid algorithm between artificial bee colonies and GA was proposed in Elghamrawy (2018) to optimize the use of spectrum by detecting the PUE attacks and increasing the detection probability. The proposed algorithm integrates the genetic operators with the ABC algorithm in order to reach equilibrium between use and exploration to find the optimal solution. The simulation results indicate promising performances of the proposed algorithm for spectrum detection optimization, compared to recent detection algorithms.

The main objective of Zaheer et al. (2016) is to minimize the transmission powers and thus reduce the interferences in a CRN by using the ABC algorithm.

9.3.1.4. Genetic algorithms

The principle of GA was introduced by John Holland at the University of Michigan in the United States, in the 1960s (Holland 1992), and highlighted by the reference work by Goldberg (1989). In a genetic algorithm, a population is constituted by a set of individuals, each of which is identified by a set of genes known as “chromosomes”. Reproduction involves the recombination of chromosomes of two primary individuals, thus giving birth to child individuals whose genetic fingerprint is inherited from parents. However, the genetic code of the children may contain genes that their parents do not have, thus modeling the mutation genetic phenomenon. The latter enables changes in the morphology of species, always leading to a better adaptation to the natural environment.

The following works focus on the application of GA to CRN.

The problem formulation, the development and use of a genetic algorithm for channel assignment in a CRN was presented by Elhachmi and Guennoun (2016). This approach offers the PU and SU an efficient means to access the available spectrum. Compared to existing methods, simulation results prove that the proposed algorithm yields satisfactory results in terms of interferences and throughput.

In Jiao and Joe (2016), the authors considered a new model of CRN in which the networks of PU are constituted of heterogeneous PU. The authors consider the problem of energy-efficient resource allocation for the CR user having a coverage area in which heterogeneous PU operate simultaneously via multiradio access technology. The authors proposed a research diagram based on two level crossover GA to obtain an optimal solution in terms of power and throughput. Simulation results show that the algorithm proposed by the authors is stable and its convergence is more rapid.

9.3.1.5. Gravitational search algorithm

In 2009, Rashedi et al. (2009) developed a GSA, which is an optimization metaheuristics inspired by nature. GSA relies on Newton’s law of gravitation, according to which gravitation is an attraction between massive bodies. The masses of the bodies (solutions) are proportional to their values of objective functions (costs). With each iteration, masses are mutually attracted by gravitational forces. The heaviest mass exerts the strongest force of attraction. Consequently, the heaviest masses, which are probably close to the global optimum, attract other masses depending on their distances. Each object is determined by four specifications: position, inertia, active gravitational mass and passive gravitational mass. The position corresponds to a solution to the problem; inertia and gravitational masses are determined using the objective function.

The following works focused on the application of the GSA in CRN.

Guo et al. (2018) present a new method for solving the problem of spectrum waste in CRN. This method relies on graph coloring and GSA. The authors compared the performances of their algorithm with the PSO and GA.

In Kaur et al. (2018b), a hybrid algorithm of PSO and gravitational research is presented for CRN optimization. A new CR environment is proposed, enabling several SUs to access the spectrum while their channels undergo Nakagami-m fading. The transmission factors belonging to several SUs and relying on IEEE 802.22 WRAN standard are optimized to reach several objectives related to the expected QoS using PSO, GSA and the hybrid PSOGSA algorithm. Objective functions that are modified and influenced by fading are established for the optimization task. The optimization results indicate an improved performance of the hybrid algorithm compared to other basic techniques.

9.3.1.6. Particle swarm optimization

PSO is an optimization metaheuristics in the family of evolutionary algorithms and proposed by Russel Eberhart (electrical engineer) and James Kennedy (sociopsychologist) (Kennedy and Eberhart 1995). PSO has its source in the observations made during computer simulations of flocks of birds and fish schools (Craig 1987; Heppner and Grenander 1990). Indeed, the PSO draws heavily on the observation of gregarious relations of migratory birds, which, in order to travel “long distances” (migration, search for food, aerial displays, etc.), must optimize their motions in terms of consumed energy and time (etc.), as for example in the V-shaped flight formation presented in Figure 9.2.

Photo depicts a V-shaped formation of Anser flight.

Figure 9.2. V-shaped formation of Anser flight (Bestaoui 2015)

The following work focuses on the application of PSO in CRN.

A technique based on PSO and on the received signal strength indicator for the detection of the position of PU and the PUE–based attacker was proposed in Fihri et al. (2018). The authors aim to improve the detection accuracy and reduce the risk of false alarms.

The effect of eigenvalues of the covariance matrix on samples received through the SNR estimation method was analyzed in Manesh et al. (2017). The authors proposed the use of PSO in the SNR estimation technique based on eigenvalues in order to optimize these parameters. The results of the proposed method are compared to those of the original SNR estimation method and the results validate the improvement obtained by the proposed technique compared to the original technique.

The simultaneous wireless information and the multiple user power transfer for the CRN relying on PSO and on semidefinite relaxation were studied in Tuan and Koo (2017). A secondary emitter with an antenna array provides information and energy to several secondary receivers with a single antenna. The authors proved through simulations that their algorithm features rapid convergence and better performances compared to other existing systems.

In Zhai and Wang (2017), the authors used PSO to solve the crowdsourcing paradigm, according to which mobile users are assigned the task of spectrum detection. Simulation results show that the proposed algorithm reaches higher performances compared to those of other algorithms.

In Tang and Xin (2016), the authors used PSO to study the compromise between utility and energy consumption in a CRN based on OFDM. Considering the low convergence of the original PSO around local optima, an enhanced version combining chaos theory is proposed in this study in order to help PSO identify solutions around the best global results. Using simulations, the authors proved that their algorithm requires a smaller number of iterations and can reach a higher energy efficiency than the other algorithms.

Tuan and Koo (2016) proposed a hybrid method based on PSO and Brute-Force Search (BFS). This method is used for the maximization of the SU throughput in a full-duplex CRN, when it has two distinct antennas and a capacity to autoeliminate its interferences. The simulations show that for certain values of the parameters, the considered system provides a much higher throughput than the previously proposed systems.

Alhammadi et al. (2016) discussed the three spectral handover mechanisms (proactive, reactive and hybrid) used to reduce the handover delay. The article contains an implementation of the PSO algorithm to minimize the total service time of the spectral handover to optimal value. The numerical results show that PSO significantly reduces the total service time compared to otter spectral handover systems.

9.3.2. Fuzzy logic

In 1965, relying on mathematical theory on fuzzy sets, Lotfi Zadeh developed FL, which is an extension of Boolean logic (Zadeh 1965). FL makes it possible to consider imprecisions and uncertainties, due to which the reasoning involving it is rendered significantly more flexible. The following work focuses on the application of FL in CRN.

Banerjee et al. (2017) proposed a new decision making method based on FL for the relay selection, unlike many existing works in which the signal to interference plus noise ratio (SINR) is considered the only parameter for relay selection. To find the best relays, the authors conducted a broad simulation study. The simulation results reveal the influence of various parameters on the selection of the best relay.

9.3.3. Game theory

The first known discussion on GT was mentioned in a letter written by James Waldegrave in 1713. The GT is used as a decision making technique, where several players must make choices and consequently influence the interests of other players. Each player decides its actions depending on the history of the actions selected by other players during the previous rounds of the game.

In CRN, the nodes are the game players and the actions are the parameters of the radio environment, such as the emission power and the channel selection. These actions are made by the nodes on the basis of observations represented by the parameters of the environment, such as channel availability, channel quality and interferences. Consequently, each node draws lessons from its past actions, observes the actions of other nodes and consequently modifies its actions (Bellhouse 2017). The following works focus on the application of GT in CRN.

An approach based on the GT using the Stackelberg game to secure a network of CR sensors against the attack involving spectrum detection data falsification was proposed in Abdalzaher et al. (2017); the purpose of this attack was to corrupt the spectrum decisions communicated by the sensor nodes to the merging center by imposing an interference power. Simulations indicate the improvement of the performances of the proposed protection model compared to the two basic defense mechanisms, namely random defense mechanisms and those whose protection is equal to static signal to noise ratio.

The problems related to the security of the physical layer and to energy efficiency because of power control and relay cooperation, where decode-and-forward and amplify-and-forward protocols are considered, were studied in Fang et al. (2017). The authors proposed a Stackelberg game model with one leader and one follower in the presence of multiple listeners, in which an optimal strategy of power allocation and pricing can be determined in order to maximize player utility. The simulations conducted by the authors prove that the proposed game model improves the network energy efficiency and offers better performances against eavesdropping attacks, compared to Nash equilibrium systems, rand and direct transmission.

In Roy et al. (2017), the authors used the GT to study the conflict and the cooperation between two levels of SUs (real time and not real time). A model of the auction game is proposed in order to analyzer the decision-making process and efficiently allocate an inactive channel to a pair of SUs (real time and not real time) belonging to a group of users.

A two-level Stackelberg game model, in which the PU and SU act, respectively, as leaders and followers in order to improve the energy efficiency of nodes in a multiple jumps CRN was proposed in Shu et al. (2016). The simulations conducted proved the relevance of the authors’ proposals.

9.3.4. Neural networks

NNs were introduced by Warren McCulloch and Walter Pitts in 1943 and draw their inspiration from the central nervous system. Similarly to biological NNs, an artificial NN is formed of nodes, equally known as “neurons” or “processing elements”, which are connected together to form a network.

The artificial NN receives information from all neighboring neurons and provides an output depending on its weight and on the activation functions. The adaptive weights can represent connection forces between neurons. To accomplish the learning process, the weights must be adjusted until the network output is approximately equal to the desired output.

Artificial NNs were used to enable CR to learn from the environment and make decisions in order to improve the QoS of the communication system (Haykin 2008; Rojas 2013). The following works focus on the application of NNs to CRN.

Supraja and Pitchai (2019) presented a hybrid system constituted of a genetic algorithm, PSO and a back-propagation NN as supervised learning algorithm enabling the prediction of spectrum profiles in CRN.

Zhang et al. (2017) used the convolutional NN in an automatic system for the recognition of CR wave shapes. The proposed system can identify up to eight types of signals. The classification results were proven by the authors through simulations.

Liu et al. (2019) studied the compromise between energy efficiency and spectral efficiency for the PU and SU in a CRN. A feed-forward NN is designed and a back-propagation analog algorithm is developed to learn the optimal parameters of the algorithm proposed by the authors. Simulations are provided to confirm the efficiency of the proposed algorithm.

9.3.5. Markov models

MM is used for modeling random processes passing from one state to another in time. The random process is without memory where the future states depend only on the present state (Norris 1998; Ching and Ng 2006). In MMs, the states are visible to the observer; nevertheless, in the hidden Markov model (HMM), certain states are masked or are not explicitly visible (Fraser 2008). The following works focus on the application of MMs to CRN.

To solve the problems of power distributed control in a network of wireless cognitive sensors, a power control mechanism based on the HMM is proposed in Zhu et al. (2017) depending on the difference and on the independence of channel sensing results among the network users. The simulations indicate that, besides improving the energy efficiency, the power control mechanism based on the HMM model better reaches the SINR target compared to other methods.

An adaptive method for double threshold energy detection based on the MM was proposed in Liu et al. (2017). With the use of this method, the modified Markov method takes into account the time-variable characteristic of the channel occupation to solve the state of the channel “under confusion”. The simulations show that the proposed method yields better performances in terms of detection compared to other existing methods.

A method to build a radio environment map (REM) in an environment with several PU was proposed in Ichikawa and Fujii (2017). REM provides statistical information on the activity of the PU at each location. It also enables the SU to dynamically access the licensed band. Simulations show that the proposed method has better performances than the existing unsupervised classification method.

9.3.6. Support vector machines

SVMs are an ML approach that use a non-probabilistic linear classifier for data classification into two categories. The following works focus on the application of SVMs in CRN.

Four supervised ML techniques were used to study the prediction of PU activity in Agarwal et al. (2016). Two of these originate in NNs and the other two in SVMs. The results highlight the analysis of learning techniques depending on various traffic statistics and suggest the best learning model enabling the accurate prediction of the PU.

A small dimension probability vector for the cooperative spectrum detection based on ML techniques in a CRN was proposed in Lu et al. (2016). The K-means classification algorithm, the SVM and the probability vector were studied by the authors. Considering a CRN with 1 PU and N SUs, the proposed probability may reduce the dimension of the existing energy vector from N dimensions to two dimensions, driving similar or even better detection accuracy, a shorter learning duration and a shorter classification time.

9.3.7. Case-based reasoning

CBR is a problem solving paradigm based on the reuse of past experiences to solve new problems. CBR builds a database with information on past situations, problems, solutions and their advantages. The new problems are then solved by finding the most similar case in the memory and deducing the solution to the current situation (Kolodner 2014). The following work focuses on the application of CBR to CRN.

A Q-learning method, based on the case of dynamic access to the spectrum improving and stabilizing the performances of cognitive cellular systems with dynamic topologies, was proposed in Morozs et al. (2016). The proposed approach is a combination of classical distributed Q-learning and a new implementation of CBR algorithm seeking to facilitate a certain number of learning processes executed in parallel. Simulations show that the proposed case-based Q-learning approach enables constant improvement of the QoS under conditions of dynamic and asymmetric network topology and with traffic load.

9.3.8. Decision trees

DTs are one of the major data structures of ML. Their operation relies on heuristics that, while satisfying intuition, yield outstanding results in practice. Due to their tree structure, DTs are readable by the human being, unlike other approaches in which the classifier builds a “black box”, as in the case of NNs. The following work focuses on the application of DTs to CRN.

A new algorithm combining the random forest (several DTs) to reduce interferences was proposed in Wang and Yang (2016a). This advancement also enables the significant improvement in network throughput.

A new spectrum detection diagram relying on DTs for the classification of the MAC layer protocol was proposed in Wang and Yang (2016b). The simulations confirm that the new proposed method could significantly improve the network throughput.

9.3.9. Bayesian networks

BNs are graphical probabilistic models relying on the interaction between various nodes to learn and starting from each node involved in the process. BNs play a role in the decision making process if they are associated with other tools in order to form influence diagrams (Bolstad 2004).

The following work focuses on the application of BNs in CRN.

A BN-based model for dealing with modularity and uncertainty was proposed in Elderini et al. (2017). The Bayesian model enables the qualitative and quantitative addition of parameters that influence the fault probability and the SINR in a CRN.

In Salahdine et al. (2017a), BNs are considered among the uncertainty management techniques in CRN. A use case involves CRN modeling by a graph in which each node represents a SU and each edge represents the communication link between corresponding nodes.

A hybrid method between a Bayesian model and a trilateration technique, which is used to obtain a good approximation of the position of the PUE attacker, was proposed in Fihri et al. (2017). The Bayesian decision theory based on the loss function and the conditional probability makes it possible to determine the existence of the PUE attacker in the uncertainty area.

An efficient and rapid sensing method, in which the Toeplitz sensing matrix and the Bayesian model are combined to deal with uncertainties and reduce the random character of measures, was proposed in Salahdine et al. (2017b). The proposed method was thoroughly implemented and tested, yielding satisfactory results.

A nonparametric Bayesian approach to the clustering of subchannels in CRN based on OFDMA was proposed in Ahmed et al. (2017). The approach uses the traffic functionalities of each subchannel to obtain statistics on its idle/busy period. Based on the harvested energy, the SU determines the energy detection threshold so that it can maximize its rate of spectrum use while minimizing the interferences with the PU.

9.3.10. MASs and RL

Jacques Ferber presented the MAS as an intelligent entity conscious of its environment, capable of skillfully acting and autonomously communicating. They contain the environment, objects, agents and various relations between these entities (Ferber and Weiss 1999). With MAS, users can interact, negotiate and cooperate to enable more efficient communication between the entities in the network.

Their use in CRN enables users to manage their own spectrum in a dynamic and decentralized manner. The agents sense their environment and consequently react. The association of MAS and CR enables a better use of the unused spectrum and a optimal management of radio resources, while reducing the risk of interferences.

On the other hand, RL is a domain of ML that makes it possible to solve sequential decision problems under uncertainty. It plays an essential role with SM, as they enable agents to discover the situation and take measures by trial and error, in order to maximize the cumulated reward. In RL, an agent must take into account the immediate advantages and consequences of its actions to optimize the long-term system performances (Wiering and Van Otterlo 2012).

Deep reinforcement learning (DRL) uses deep learning and RL principles to create efficient algorithms that can be applied to fields such as robotics, video games, finance and health care (François-Lavet et al. 2018). By implementing a deep learning architecture (deep NNs, etc.) with a RL algorithm (Q-learning, etc.), it is possible to create a powerful DRL model that can adapt previously insoluble problems. The following work focuses on the application of MASs, RL and DRL in CRN.

The optimization of opportunistic accessibility of channels in railway CR environments was studied in Yin et al. (2017). The model proposed by the authors involves a Bayesian inference enabling the calculation of the probability of successful transmission on a single station, as well as a team cooperation aiming to optimize network performances in a group of base stations.

An efficient transmission mode based on the Q-learning algorithm was proposed in cooperative CRN in Rahman et al. (2016). The state, the action and the reward are defined to obtain good performances in terms of delay and energy efficiency during data transmission, as well as for the interference caused to PU during transmissions achieved by the SU. The simulations show that the diagram proposed by the authors can efficiently assume the determination of the transmission mode and surpasses the classical diagrams mentioned in the literature.

The efficient allocation of transmission power between the SU, without generating interference for the PU, is the objective of the authors in Lall et al. (2016). Three mixed strategies (correlated equilibrium) were used to control the transmission power during learning. The experimental results indicated that the proposed algorithm by far surpasses its classical counterparts.

The routing of several flows generated by the SU toward a given destination, considering the PU presence, was approached by Pourpeighambar et al. (2017). Each SU is expected to egoistically minimize the end-to-end delay of its flow, while meeting the QoS demands of the PU. For a rapid adaptation of the SU routing decision to environment changes and to their non-cooperative interaction, the authors formulated the routing problem as a stochastic learning process represented by non-cooperative games. Then they proposed a system based on swarm RL to solve the routing problem and thus avoid the information exchanges between competing SU. The simulations show that the proposed diagram converges in a demonstrable manner and prove its efficiency for reducing the delay while meeting the QoS demands of the PU.

An algorithm based on RL for the management of power assignment for the transmission channel and the control channel in CRN was proposed in Lin et al. (2016). The simulation results show that this new algorithm brings a significant improvement in terms of compromise between the control channel reliability and the transmission channel efficiency.

An algorithm for selecting a channel for data transmission and predicting the duration it will remain idle for in order to minimize the time dedicated to its detection was proposed in Raj et al. (2018). This algorithm involves two stages: an RL approach for the selection of channels and a Bayesian approach for determining the duration for which detection can be ignored.

The spectrum sharing in a CRN composed of a PU and SU was studied in (Xingjian et al. 2018). The PU and the SU work in a non-cooperative manner. The authors aim to develop a power control method based on learning for the SU in order to share the common spectrum with the PU. The authors developed a DRLbased method, which the SU can use to intelligently adjust its emission power, so that after two cycles of interaction with the PU, the two users can send their own data with the required QoS.

9.4. Categorization and use of techniques in CR

Data availability influences the choice of the learning technique to be used. Supervised learning is used when learning data are labeled and the CR has previous information on the environment. Unsupervised algorithms do not require labeled learning data. Unsupervised learning is used when certain RF components are not known by the CRN, which enables it to operate autonomously with no previous knowledge.

There are differences between learning techniques in terms of strengths, limitations, challenges and applications in CRN.

DTs, NNs, SVMs and CBR are considered supervised learning techniques. The GT, RL, MMs, metaheuristics, FL and BNs are considered unsupervised learning techniques.

It is worth noting that all the techniques studied in this chapter are used both for sensing and for decision in CR, except for the metaheuristics and for the CBR, which are used exclusively for decision. DTs, GT and FL should be used for sensing taking into account the capacity of the spectrum detection technique employed.

MAS can be used in combination with all the techniques presented in this chapter.

9.5. Conclusion

This chapter presents a complete study of the AI techniques used in CRN. The definitions of various techniques and their applications in CR are also discussed in this work. Indeed, a complete state-of-the-art on the application of the FA, CS, GSA, PSO, GA, bee colony algorithms, FL, GT, NNs, MMs, SVMs, CBR, DTs, BNs, MASs and RL in CRNs was presented in this chapter. It also presented the main CR tasks and their corresponding challenges, classified the presented techniques depending on the type of learning (supervised or unsupervised) and presented their applications depending on two CR tasks (sensing and decision).

9.6. References

Abbas, N., Nasser, Y., and El Ahmad, K. (2015). Recent advances on artificial intelligence and learning techniques in cognitive radio networks. EURASIP Journal on Wireless Communications and Networking (JWCN), 174 (2015), 1–20.

Abdalzaher, M.S., Seddik, K., and Muta, O. (2017). Using Stackelberg game to enhance cognitive radio sensor networks security. IET Communications, 11(9), 1503–1511.

Agarwal, A., Dubey, S., Khan, M. A., Gangopadhyay, R., and Debnath, S. (2016). Learning based primary user activity prediction in cognitive radio networks for efficient dynamic spectrum access. International Conference on Signal Processing and Communications. 12–15 June 2016, Bangalore, India, 1–5.

Ahmed, M.E., Kim, D.I., Kim, J.Y., and Shin, Y. (2017). Energy-arrival-aware detection threshold in wireless-powered cognitive radio networks. IEEE Transactions on Vehicular Technology, 66(10), 9201– 9213.

Alhammadi, A., Roslee, M., and Yusoff Alias, M. (2016). Analysis of spectrum handoff schemes in cognitive radio network using particle swarm optimization. 3rd International Symposium on Telecommunication Technologies. 28–30 November 2016, Kuala Lumpur, Malaysia, 103–107.

Banerjee, J.S., Chakraborty, A., and Chattopadhyay, A. (2017). Fuzzy based relay selection for secondary transmission in cooperative cognitive radio networks. Advances in Optical Science and Engineering. 279–287.

Bayrakdar, M.E. and Çalhan, A. (2018). Artificial bee colony–based spectrum handoff algorithm in wireless cognitive radio networks. International Journal of Communication Systems (IJCS), 31(5), 1–16.

Bellhouse, D. (2007). The problem of Waldegrave. Electronic Journal for the History of Probability and Statistics, 3(2), 1–12.

Beni, G., Wang, J. (1993). Swarm Intelligence in Cellular Robotic Systems. In Robots and Biological Systems: Towards a New Bionics?, Dario P., Sandini G., Aebischer P. (eds). Springer, Berlin, Heidelberg.

Bestaoui, A.A. (2015). Gestion de spectre dans un réseau de radio cognitive en utilisant l’algorithme d’optimisation par essaim de particules. Master’s thesis, University of Tlemcen, Algeria.

Biglieri, E., Goldsmith, A.J., Greenstein, L.J., Mandayam, N.B., and Poor, H.V. (2013). Principles of Cognitive Radio. Cambridge University Press, Cambridge.

Bkassiny, M., Li, Y., and Jayaweera, S.K. (2013). A survey on machine-learning techniques in cognitive radios. IEEE Communications Surveys & Tutorials, 15(3), 1136–1159.

Bolstad, W.M. (2004). Introduction to Bayesian Statistics. John Wiley & Sons, Hoboken.

Ching, W.-K. and Ng, M.K. (2006). Markov Chains. Models, Algorithms and Applications. Springer, Berlin.

Craig, W.R. (1987). Flocks, herds, and schools: A distributed behavioral model. Computer Graphics, 21(4), 25–34.

Darwin, C. (2009). The Origin of Species: By Means of Natural Selection, or the Preservation of Favoured Races in the Struggle for Life. Cambridge University Press, Cambridge.

Elahi, A., Qureshi, I.M., Atif, M., and Gul, N. (2017). Interference reduction in Cognitive radio networks using genetic and firefly algorithms. International Conference on Communication, Computing and Digital Systems.

Elderini, T., Kaabouch, N., and Reyes, H. (2017). Outage probability estimation technique based on a Bayesian model for cognitive radio networks. IEEE 7th Annual Computing and Communication Workshop and Conference.

Elghamrawy, S.M. (2018). Security in cognitive radio network: Defense against primary user emulation attacks using genetic artificial bee colony (GABC) algorithm. Future Generation Computer Systems (FGCS). 85(8), 1–19.

Elhachmi, J. and Guennoun, Z. (2016). Cognitive radio spectrum allocation using genetic algorithm. EURASIP Journal on Wireless Communications and Networking, 133 (2016), 1–11.

Fang, H., Xu, L., and Raymond Choo, K.-K. (2017). Stackelberg game based relay selection for physical layer security and energy efficiency enhancement in cognitive radio networks. Applied Mathematics and Computation, 296, 153–167.

Ferber, J. and Weiss, G. (1999). Multiagent Systems: An Introduction to Distributed Artificial Intelligence. Addison-Wesley, Boston.

Fihri, W.F., Arjoune, Y., El Ghazi, H., Kaabouch, N., and El Majd, B.A. (2017). Bayesian decision model with trilateration for primary user emulation attack localization in cognitive radio networks. International Symposium on Networks, Computers and Communications. 16–17 May 2017. Marrakech, Morocco, 1–6.

Fihri, W.F., Arjoune, Y., El Ghazi, H., Kaabouch, N., and El Majd, B.A. (2018). A particle swarm optimization based algorithm for primary user emulation attack detection. IEEE 8th Annual Computing and Communication Workshop and Conference. 8–10 January 2018, Las Vegas, USA, 823–827.

François-Lavet, V., Henderson, P., Islam, R., Bellemare, M.G., and Pineau, J. An introduction to deep reinforcement learning. Foundations and Trends® in Machine Learning, 11(3/4), 219–354.

Fraser, A.M. (2008). Hidden Markov Models and Dynamical Systems. Siam, Philadelphia.

Gavrilovska, L., Atanasovski, V., Macaluso, I., and Da Silva, L. (2013). Learning and reasoning in cognitive radio networks. IEEE Communications Surveys Tutor, 15(4), 1761–1777.

Ghanem, W.R., Shokair, M., and Desouky, M.I. (2016). An improved primary user emulation attack detection in cognitive radio networks based on firefly optimization algorithm. 33rd National Radio Science Conference. 22–25 February 2016, Aswan, Egypt, 178–187.

Ghasemi, A. and Sousa, E.S. (2008). Spectrum sensing in cognitive radio networks: Requirements, challenges and design trade-offs. IEEE Communications Magazine, 46(4), 32–39.

Goldberg, D.E. (1989). Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley Longman Publishing, Boston.

Guo, L., Chen, Z., and Huang, L. (2018). A novel cognitive radio spectrum allocation scheme with chaotic gravitational search algorithm. International Journal of Embedded Systems, 10(2), 161–167.

Haddad, O.B., Afshar, A., and Mariño, M.A. (2006). Honey-bees mating optimization (HBMO) algorithm: A new heuristic approach for water resources optimization. Water Resources Management, 20(5), 661–680.

Haykin, S. (2005). Cognitive radio: Brain-empowered wireless communications. IEEE Journal on Selected Areas in Communications, 23(2), 201–220.

Haykin, S. (2008). Neural Networks and Learning Machines. Pearson, London.

Heppner, F. and Grenander, U. (1990). A Stochastic Nonlinear Model for Coordinated Bird Flocks. AAAS Publication, Washington.

Holland, J.H. (1992). Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence. MIT Press, Cambridge.

Ichikawa, K. and Fujii, T. (2017). Radio environment map construction using hidden Markov model in multiple primary user environment. International Conference on Computing, Networking and Communications. 26–29 January 2017. Santa Clara, CA, USA, 272–276.

Jiao, Y. and Joe, I. (2016). Energy-efficient resource allocation for heterogeneous cognitive radio network based on two-tier crossover genetic algorithm. Journal of Communications and Networks, 18(1), 112–122.

Karaboga, D. (2005). An idea based on honey bee swarm for numerical optimization, volume 200. Technical report, Erciyes University.

Kaur, K., Rattan, M., and Singh Patterh, M. (2018a). Cuckoo search based optimization of multiuser cognitive radio system under the effect of shadowing. Wireless Personal Communications, 99(3), 1217–1230.

Kaur, K., Rattan, M., and Singh Patterh, M. (2018b). Cognitive radio design optimization over fading channels using PSO, GSA and hybrid PSOGSA. Second International Conference on Intelligent Computing and Control Systems. 14–15 June 2018, Madurai, India, 1700–1706.

Kennedy, J. and Eberhart, R.C. (1995). Particle swarm optimization. Proceedings of the IEEE International Conference on Neural Networks IV. 27 November–1 December 1995, University of Western Australia, Perth, 1942–1948.

Kolodner, J. (2014). Case-Based Reasoning. Morgan Kaufmann, Burlington.

Lall, S., Sadhu, A.K., Konar, A., Mallik, K.K., and Ghosh, S. (2016). Multiagent reinfocement learning for stochastic power management in cognitive radio network. International Conference on Microelectronics, Computing and Communications. 23 January 2016, Durgapur, India, 1–6.

Lin, Y., Wang, C., Wang, J., and Dou, Z. (2016). A novel dynamic spectrum access framework based on reinforcement learning for cognitive radio sensor networks. Sensors, 16(10), 1675.

Liu, Y., Liang, J., Xiao, N., Yuan, X., Zhang, Z., Hu, M., and Hu, Y. (2017). Adaptive double threshold energy detection based on Markov model for cognitive radio. PLOS ONE, 12(5), 1–18.

Liu, M., Song, T., Hu, J., Yang, J., and Gui, G. (2019). Deep learning-inspired message passing algorithm for efficient resource allocation in cognitive radio networks. IEEE Transactions on Vehicular Technology, 68(1), 641–665.

Lu, Y., Zhu, P., Wang, D., and Fattouche, M. (2016). Machine learning techniques with probability vector for cooperative spectrum sensing in cognitive radio networks. IEEE Wireless Communications and Networking Conference. April 3-6 2016, Doha, Qatar.

Manesh, M.R., Quadri, A., Subramaniam, S., and Kaabouch, N. (2017). An optimized SNR estimation technique using particle swarm optimization algorithm. IEEE 7th Annual Computing and Communication Workshop and Conference. 9–11 January 2017, Las Vegas, USA, 1–6.

Manjith, R. (2016). A hybrid of BFO and MCS algorithms for channel estimation of cognitive radio system. Arabian Journal for Science and Engineering, 41(3), 841–852.

Mitola, J. (2000). Cognitive radio: an integrated agent architecture for software defined radio. PhD Thesis, Royal Institute of Technology, Stockholm.

Mitola, J. and Maguire, G.Q. (1999). Cognitive radio: Making software radios more personal. IEEE Personal Communications, 6(4), 13–18.

Morozs, N., Clarke, T., and Grace, D. (2016). Cognitive spectrum management in dynamic cellular environments: A case-based Q-learning approach. Engineering Applications of Artificial Intelligence, 55, 239–249.

Nakrani, S. and Tovey, C. (2004). On honey bees and dynamic server allocation in internet hosting centers. Adaptive Behavior, 12(3–4), 223–240.

Norris, J.R. (1998). Markov Chains. Cambridge University Press, Cambridge.

Pourpeighambar, B., Dehghan, M., and Sabaei, M. (2017). Non-cooperative reinforcement learning based routing in cognitive radio networks. Computer Communications, 106, 11–23.

Qiu, R. C., Hu, Z., Li, H., and Wicks, M. C. (2012) Cognitive Radio Communication and Networking: Principles and Practice. John Wiley & Sons, Hoboken.

Rahman, M.A., Lee, Y.-D., and Koo, I. (2016). An efficient transmission mode selection based on reinforcement learning for cooperative cognitive radio networks. Human-centric Computing and Information Sciences, 6(1), 2.

Raj, V., Dias, I., Tholeti, T., and Kalyani, S. (2018). Spectrum access in cognitive radio using a two-stage reinforcement learning approach. IEEE Journal of Selected Topics in Signal Processing, 12(1), 20–34.

Rashedi, E., Nezamabadi-Pour, H., and Saryazdi, S. (2009). GSA: A gravitational search algorithm. Information Sciences, 179(13), 2232–2248.

Rojas, R. (2013). Neural Networks: A Systematic Introduction. Springer Science & Business Media, Berlin.

Roy, A., Midya, S., Majumder, K., Phadikar, S., and Dasgupta, A. (2017). Optimized secondary user selection for quality of service enhancement of two-tier multi-user cognitive radio network: A game theoretic approach. Computer Networks, 123, 1–18.

Russell, S.J. and Norvig, P. (2016). Artificial Intelligence: A Modern Approach. Pearson Malaysia, Kuala Lumpur.

Salahdine, F., Kaabouch, N., and El Ghazi, H. (2017a). Techniques for dealing with uncertainty in cognitive radio networks. IEEE 7th Annual Computing and Communication Workshop and Conference. 9–11 January 2017, Las Vegas, USA, 1–6.

Salahdine, F., Kaabouch, N., and El Ghazi, H. (2017b). A Bayesian recovery technique with Toeplitz matrix for compressive spectrum sensing in cognitive radio networks. International Journal of Communication Systems (IJCS), 30(15), 1–9.

Saoucha, N.A. and Benmammar, B. (2017). Adapting radio resources in multicarrier cognitive radio using discrete firefly approach. International Journal of Wireless and Mobile Computing, 13(1), 39–44.

Shu, Z., Qian, Y., Yang, Y., and Sharif, H. (2016). A game theoretic approach for energy-efficient communications in multi-hop cognitive radio networks. Wireless Communications and Mobile Computing, 16(14), 2131–2143.

Supraja, P. and Pitchai, R. (2019). Spectrum prediction in cognitive radio with hybrid optimized neural network. Mobile Networks and Applications, 24(2), 357–364.

Surafel, L.T. and Ngnotchouye, J.M.T. (2017). Firefly algorithm for discrete optimization problems: A survey. KSCE Journal of Civil Engineering, 21(2), 535–545.

Tang, M. and Xin, Y. (2016). Energy efficient power allocation in cognitive radio network using coevolution chaotic particle swarm optimization. Computer Networks, 100, 1–11.

Tounsi, A. and Babes, M. (2017). An efficient joint spectrum and power allocation in cognitive radio networks using a modified firefly algorithm. International Journal of Communication Networks and Distributed Systems, 19(2), 214–236.

Tuan, P. and Koo, I. (2016). Throughput maximisation by optimising detection thresholds in full-duplex cognitive radio networks. IET Communications, 10(11), 1355–1364.

Tuan, P. and Koo, I. (2017). Robust weighted sum harvested energy maximization for SWIPT cognitive radio networks based on particle swarm optimization. Sensors, 17(10), 2275.

Umar, R. and Sheikh, A.U.H. (2012). Cognitive radio oriented wireless networks: Challenges and solutions. Proceedings of the International Conference on Multimedia Computing and Systems. 10–12 May 2012. Tangier, Morocco, 992–997.

Wang, B. and Liu, K.J.R. (2011). Advances in cognitive radio networks: A survey. IEEE J. Selected Topics Signal Process, 5(1), 5–23.

Wang, D. and Yang, Z. (2016a). A novel spectrum sensing scheme combined with machine learning. 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics. 15–17 October 2016, Datong, China, 1293–1297.

Wang, D. and Yang, Z. (2016b). An advanced scheme with decision tree for the improvement of spectrum sensing efficiency in dynamic network. 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics. 15–17 October 2016, Datong, China, 1288–1292.

Wiering, M. and Van Otterlo, M. (2012). Reinforcement learning. Adaptation, Learning, and Optimization, 12, 3.

Woods, W.A. (1986). Important issues in knowledge representation. Proceedings of the IEEE, 74(1), 1322–1334.

Wyglinski, A.M., Nekovee, M. and Hou, T. (eds) (2009). Cognitive Radio Communications and Networks: Principles and Practice. Academic Press, Cambridge.

Xingjian, L. et al. (2018). Intelligent power control for spectrum sharing in cognitive radios: A deep reinforcement learning approach. IEEE Access, 6, 25463–25473.

Yang, X.-S. (2005). Engineering optimizations via nature-inspired virtual bee algorithms. International Work-Conference on the Interplay Between Natural and Artificial Computation. Las Palmas, Canary Islands, Spain, 317–323.

Yang, X.-S. and Deb, S. (2017). Cuckoo search state-of-the-art and opportunities. IEEE 4th International Conference on Soft Computing & Machine Intelligence. 23–24 November 2017, Mauritius, 55–59.

Yin, Z., Wang, Y., and Wu, C. (2017). A multiagent collaborative model for bayesian opportunistic channel accessibility in railway cognitive radio. International Journal of Performability Engineering (IJPE), 13(4), 479–489.

Zadeh, L.A. (1965). Fuzzy sets. Information and Control, 8(3), 338–353.

Zaheer, M., Uzma, M., Asif, A., and Qureshi, I. M. (2016). Interference control in cognitive radio using joint beamforming and power optimization by applying artificial bee colony. 19th International Multi-Topic Conference. 5–6 December 2016. Islamabad, Pakistan, 1–6.

Zhai, L. and Wang. H. (2017). Crowdsensing task assignment based on particle swarm optimization in cognitive radio networks. Wireless Communications and Mobile Computing (WCMC). 4687974, 1–9.

Zhang, M., Diao, M., and Guo, L. (2017). Convolutional neural networks for automatic cognitive radio waveform recognition. IEEE Access, 5, 11074–11082.

Zhao, Y. and Morales-Tirado, L. (2012). Cognitive radio technology: Principles and practice. International Conference on Computing, Networking and Communications. 30 January–2 February 2012. Maui, USA, 650–654.

Zhu, J., Jiang, D., Ba, S., and Zhang, Y. (2017). A game-theoretic power control mechanism based on hidden Markov model in cognitive wireless sensor network with imperfect information. Neurocomputing, 220, 76–83.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.141.202.187