Automatic Modulation Classification Exploiting Hybrid Machine Learning Network

It is a research hot spot in cognitive electronic warfare systems to classify the electromagnetic signals of a radar or communication system according to their modulation characteristics. We construct a multilayer hybrid machine learning network for the classification of seven types of signals in different modulation. We extract the signal modulation features exploiting a set of algorithms such as time-frequency analysis, discrete Fourier transform, and instantaneous autocorrelation and accomplish automatic modulation classification using naive Bayesian and support vector machine in a hybrid manner. The parameters in the network for classification are determined automatically in the training process. The numerical simulation results indicate that the proposed network accomplishes the classification accurately.


Introduction
The automatic modulation classification (AMC) of the electromagnetic signals of radar and communication systems is an important function in modern electronic warfare systems [1][2][3][4].The AMC process in a cognitive electronic warfare system for radar and communication signal surveillance is shown in Figure 1.The AMC mainly consists of feature extraction and modulation classification [5][6][7].Modulation feature extraction is composed of a series of transform and analysis algorithms in time domain, frequency domain or time-frequency domain, such as time-frequency analysis [8], cyclic cumulant [9][10][11][12] and radar ambiguity function [13], and so on.The classification processing consists of various pattern recognition and machine learning algorithms, such as support vector machine (SVM) [14], deep learning [15], and clustering algorithms [16].
Pattern recognition has made a lot of progress and gained extensive applications in the field of computer vision, where SVM, artificial neural network and deep learning are widely used to realize image classification [17][18][19].The AMC of the communication and radar signals can be regarded as an important branch of pattern recognition.A series of results have been reported in a range of open literatures.
To recognize the radar emitter signal in [20], the feature is extracted by signal fuzzy function slice and singular value decomposition, and modulation classification is obtained by the utilization of a kind of fuzzy clustering classification approach.In [21], a combination of the "Rihaczek distribution and Hough transform" algorithms is introduced to extract the features of the signals in time-frequency domain, and the AMC of radar signals in quadrature amplitude modulation (QAM) and phased shift keying (PSK) modulation is achieved.In [22], wavelet transform and manifold learning are employed to realize feature extraction and highdimensional data dimensionality reduction respectively, and the nearest neighbor algorithm is adopted to realize classification of five types of signals (binary amplitude shift keying (2ASK), binary frequency shift keying (2FSK), binary phase shift keying (BPSK), linear frequency modulation (LFM) and Clock Pulse (CP).A type of feature extraction method using blind channel estimation and cumulant is proposed in [23], and classification of the modulated signals is realized by a type of multiclassification algorithm based on maximum likelihood.[24] accomplishes feature extraction and modulation classification using high-order cumulant and binary tree based SVM, and verifies its performance using various signals such as 2ASK, 4ASK, quadrature phase shift keying (QPSK), 2FSK and 4-frequency shift keying (4FSK).[25] gives the modulation feature by the exploitation of a type of time-frequency image method based on local binary mode, and identifies the radar signals by SVM algorithm.In [26], genetic programming is carried out for extracting useful features, and the nearest neighbor algorithm is selected for the classification of certain types of radar signals.
It can be seen from the above literatures and several other related literatures [27][28][29][30][31][32] that there are several common problems in study of this field: (1) The articles usually focus on signals emitted from either radar or communication systems, while electronic warfare systems in application may receive signals from both of these systems.That is to say, the current studied AMC algorithms lack a universal study on the classification of the radar and communication signals simultaneously.(2) The changes of the signal parameters are not considered in constructing the data sample libraries for training and test, though it has been demonstrated that these changes have a significant influence on the performance of the classification algorithms.(3) Without exploring the different multifeatures of the signal modulations, the AMC approaches proposed in many of the open literatures are based on only one feature, which makes the classification more difficult to implement.(4) Most studies overemphasize the use of advanced classification algorithms, while ignoring the study on signal feature extraction techniques [33].The characteristics of most types of digital modulation used in modern communication and radar systems have significant differences, which can be obtained according to their definitions using certain feature extraction algorithms.The lack of these features actually increases the difficulties of the subsequent classification algorithms.
Different from the above research works, this paper emphasizes feature extraction and classification of the radar and communication signal simultaneously.Multidimensional modulation features in time-frequency domain, frequency domain, and envelope domain and phase domain are extracted with the utilization of a set of feature extraction algorithms, so as to find the differences of signals in different modulation as much as possible.It aims to reduce the pressure of the classification algorithm effectively and improve the classification accuracy.At the same time, a multilayer hybrid classification network is constructed for the classification exploiting multiple features, and its effectiveness to improve the classification performance is tested by seven types of radar and communication signals commonly used in practical systems, including BPSK, QPSK, 16-quadrature amplitude modulation (16QAM), LFM, single frequency (SF), 2FSK, and 4FSK.
The rest of this paper is organized as follows.Section 2 gives the structure of the hybrid machine learning algorithm, and describes the principle of the signal modulation feature extraction and classification network.Section 3 analyzes the feature extraction algorithms for the signal set {BPSK, QPSK, 16QAM, LFM, SF, 2FSK, and 4FSK}, including timefrequency analysis, instantaneous auto-correlation (IA), and discrete Fourier transform (DFT).Section 4 analyzes the idea of the dimension reduction of the modulation features when using PCA and SVM for classification.Performance evaluation of the feature extraction and classification network is discussed in Section 5.The Conclusions are drawn in Section 6.

The Principle of the Hybrid Classification Network
Aiming at classifying the signal set {BPSK, QPSK, 16QAM, LFM, SF, 2FSK, and 4FSK}, this paper proposes a hybrid AMC network consisting of a variety of modulation feature extraction algorithms and machine learning classification methods.The overall framework of the hybrid AMC network is shown in Figure 2.
( ) Classification of {SF, LFM} and {BPSK, QPSK, QAM, FSK, and FSK}.According to Figure 2, short-time Fourier transform (STFT) algorithm is used in the first-layer of the network to extract the standard deviation of the first-order difference of the time-frequency spectrum peaks (recorded as  1 ), identifying {SF, LFM} from {BPSK, QPSK, 16QAM, 2FSK, and 4FSK}.
According to the signal definition of SF and LFM, the first-order difference in time-frequency spectrum of these two signals is a fixed constant.Hence, the values of  1 for these two modulation types are approximately zeroes, which is quite different from the large  1 values of the other  The number of the frequency peaks is extracted and recorded as  3 .According to the decision threshold  3 , which can be obtained by training, {2FSK4FSK} can be identified from the signal set {BPSK, QPSK, and 16QAM}.
( ) Classification of {BPSK}, {QPSK}, and { QAM}.In the third-layer of the network, the remaining signal set {BPSK, QPSK, 16QAM} of the right branch is classified.Three features, including standard deviation of envelope, zerocrossing ratio, and standard deviation of the real part of IA, are extracted for classification.By determining the principal components with the contribution rate, PCA algorithm is used to reduce the dimensionality of the features from 3dimension to two-dimension, making it suitable for the application of the one-to-one method of SVM.Regarding QPSK as a positive class, BPSK and 16QAM are sequentially substituted into the SVM classifier as a negative class to find the support vector.Two optimal boundaries are determined in light of the position of the support vector, and the classification of BPSK, QPSK, and 16QAM is accomplished finally.
The classification structure can be regarded as a machine learning network based on sample training and test.It is necessary to construct a large learning sample aggregate for extracting the modulation features of the above seven signals and determine the multiple thresholds for the multilayer classification during the training process.In the test phase, the predetermined thresholds are used for different class, and the correct recognition rate of each class is achieved in the end.

Feature Extraction Algorithms
The extracted modulation features are mainly based on the differences of radar and communication signals in timefrequency spectrum, frequency spectrum and phase modulation.Different algorithms are used to extract different features for signals of different class.The time-frequency feature is extracted by using STFT, which identifies the signal set {SF, LFM} from the other signals.SF and LFM are discriminated by the feature of the real part of IA.According to the number of frequency peaks which obtained based on DFT, 2FSK and 4FSK are identified from the signal set {BPSK, QPSK, and 16QAM}.The features based on the real part of IA are used to distinguish between BPSK, QPSK, and 16QAM.The following section will discuss the feature extraction algorithms including STFT, IA, and DFT.
. .Feature Extraction Based on STFT.Time-frequency feature of the signals can be extracted by STFT.For a discrete signal () at discrete time instant , its STFT is given by where  represents the discrete frequency and   is the total frequency number. refers to time delay and ℎ( − ) denotes the Rectangular window function.In (1), the nonstationary signal can be regarded as the superposition of a series of short-time stationary signals, which highlights the varying characteristics of the original signal frequency with time delay.The peak of the frequency along time delay dimension can be given by where  denotes the total number of windows and () represents the maximum frequency peak corresponding to the th window.The frequency peak of each time window can be extracted.The difference of frequency peak corresponding to two adjacent time windows can be given by The standard deviation of the difference in (3) can be given by where  refers to the mean of  and  = (1/( − 1)) ∑ −1 =1 ().The STFT of the seven waveform types with Signal-to-Noise Ratio (SNR) equal to 20 dB are illustrated in Figure 3.As can be seen from Figure 3(a), the frequency of SF is the same since it has only one frequency.The difference between the adjacent frequencies for SF is constant, so the standard deviation is zero.From Figure 3(b) we can see that the frequency of LFM changes linearly, which leads to a constant difference between the two adjacent frequencies.Hence, the standard deviation is also zero.Figures 3(c) and 3(d) show that the frequencies of 2FSK and 4FSK are variable, which means that the difference between the adjacent frequencies is not constant.Figures 3(e), 3(f), and 3(g) show that when there is a phase variation for BPSK, QPSK, or 16QAM signal, the instantaneous frequency has a large disturb, which leads to fluctuations between the adjacent frequencies.Hence, the standard deviations of the signals such as BPSK, QPSK, 16QAM, 2FSK, and 4FSK will be larger than those of the SF and LFM.In this case, SF and LFM can be identified from the other signals by setting the standard deviation threshold  1 .

. . Feature Extraction Using IA [ ]. Features based on
IA can be extracted to distinguish between {SF, LFM} and {BPSK, QPSK, 16QAM}, respectively.The IA of a discrete signal () is of the following form: where  refers to time delay.The difference between the definition of IA and auto-correlation function lies in that there is no time integration in the calculation of IA.The advantage of using IA is that it retains the instantaneous phase information of the signal.The IA expressions of some of the signals are analyzed below.
(1) The signal of SF is of the following form  where  is the amplitude of the signal,  0 refers to the carrier frequency, and  0 represents the initial phase of the signal.The real part of IA of SF can be given by where  is the number of samples of the signal.It can be seen from ( 7) that if  is certain, the IA of SF is related only to the carrier frequency, which is a constant.Hence, the output of the real part of IA is a direct-current (DC) signal as shown in Figure 4(a).

Mathematical Problems in Engineering
(2) The signal of LFM is of the following form: where  is the slope of frequency modulation.The real part of IA of LFM can be given by It can be seen from ( 9) that the output of the real part of IA is an alternating current (AC) signal of frequency , which is shown in Figure 4(b).
(3) The expression of PSK can be given by where   denotes the discrete phase of a code group representing BPSK or QPSK.For BPSK, the value of   is 0 or .
For QPSK, the value of   is 0, /2, , or 3/2.The real part of IA of the PSK signal is of the following form: where  is the number of samples within one code and  < .
The real part of IA is DC within the same code period.In different code period, it can be divided into two cases: the adjacent code is the same ( +1 −  = 0) or different ( +1 −  ̸ = 0).For BPSK shown in Figure 4(c), the real part of IA is a two-value transition, of which  +1 −   = 0 corresponds to a positive transitions and  +1 −   = ± corresponds to a negative transition.However, there is a status of  +1 −   = ±/2 for QPSK, for which the real part of IA is zero (the projection on the real axis).Therefore, the real part of IA for QPSK is a three-value output, which is illustrated in Figure 4(d).
(4) The signal of 16QAMcan be expressed as where   refers to the amplitude of the code group.The real part of IA of 16QAM is of the following form: From ( 13) we can see that when  is constant, the output of IA is DC in the same code period.However, it causes a phase jump  +1 −   and amplitude transition  +1 ⋅   between different code period.Hence, the output of the IA for 16QAM is a multivalue transition; see Figure 4(e).
Two features are extracted based on the real part of the IA.
Feature .Define the standard deviation of IA as where () is the value of the real part based on IA at time instant  and  = (1/  ) ∑   =1 () represents the mean of ().The standard deviation for SF signal will be small since the fluctuation of its IA is small.However, the IA of LFM fluctuates greatly; i.e., the standard deviation is larger.Under this circumstance, SF and LFM can be identified by setting the standard deviation threshold  2 of the real part of IA.
Feature .Define zero-crossing ratio as where Num{⋅} denotes a counter and  1 refers to a small range belonging to zero (such as −0.001 <  1 < 0.001).As shown in the Figure 4, a binary jump occurs for the IA of BPSK, meaning that there is no zero in the output.However, the IA of QPSK is of a three-value transition form with a large number of zeroes in the output.The IA of 16QAM is similar to QPSK.Therefore, the difference of zero-crossing ratio between BPSK and {QPSK, 16QAM} signals can be used as a classification feature.
. .Feature Extraction Based on DFT.For the remaining signal set {BPSK, QPSK, 16QAM, 2FSK, 4FSK}, the frequency spectrum features of the signals are extracted using DFT.According to the signal definitions, the peaks of 2FSK and 4FSK are 2 and 4 within the bandwidth, respectively.However, there are much more peaks for BPSK, QPSK and 16QAM.
Since frequency peaks of the signal set {BPSK, QPSK, 16QAM, 2FSK, and 4FSK} are different, the number of frequency peaks based on DFT can be extracted as a typical feature.For a discrete signal (), its DFT can be given by ()  −(2/  ) ,  = 0, 1, 2, . . . ,  − 1, (16)   where  represents the discrete frequency and   is the total frequency number.The number of peak can be defined as where  . .Feature Extraction Based on Signal Envelope.The multilevel amplitude of the 16QAM signal is quite different from the constant envelope BPSK and QPSK signal.Hence, envelope features in time domain can be used to classify BPSK, QPSK, and 16QAM.For a discrete signal (), the standard deviation of the envelope can be defined as where  = (1/  ) ∑   =1 |()| represents the mean of the instantaneous envelope.

Analysis of SVM Based on PCA Dimensionality Reduction
Three features are extracted for the classification of BPSK, QPSK and 16QAM so as to ensure the classification accuracy under various conditions.Due to the large number of features, the classification tends to be complicated.If the three features can be replaced by the two features, SVM can be used to classify the three modulated signals in the Twodimensional (2D) feature space.Therefore, PCA algorithm is used to perform principal component analysis on the Threedimensional (3D) features, extracting principal components in features and reducing the dimension of features.
. .PCA Algorithm.The PCA algorithm transforms the original data with possible correlation into a set of new data with linear independence of each dimension through linear transformation, and it can be used to extract the principal feature components of the data, thereby achieving the purpose of dimensionality reduction [36].The main idea is to map the  1 dimensional features to  2 dimension ( 2 <  1 ), which is a completely new orthogonal feature called the principal component.It can be easily understood that PCA can be used to find the most useful linear combination, i.e., those new features with relatively large discrimination to achieve the purpose of reducing the dimension.
There are two basic requirements for PCA dimensionality reduction.First of all, the projections of the samples in the principal component direction are required to be as dispersed as possible.The more dispersed projections the larger the variance of the samples; i.e., more useful information is carried in the reduced dimension projections.Secondly, the distances from the sample points to the principal component direction are required to be as small as possible, i.e., the errors can be reduced as much as possible.The steps of the PCA dimensionality reduction algorithm [37] for  1 -dimensional modulation feature samples are summarized as follows: (1) Arrange the modulation feature samples into matrix X of  (sample numbers) rows and  1 columns.
(2) Process the sample data, recorded as , including zero-meanization and normalization.
(3) For the processed sample data, its covariance matrix can be given by where [⋅]  refers to transposition operation.
(4) According to calculate the eigenvalue   and the eigenvector   of   .Arrange the eigenvalues from large to small, and the corresponding eigenvectors are also arranged from large to small.
(5) The contribution rate is defined as where  1 is the original sample data dimension and  2 is the sample data dimension after dimensionality reduction.The new matrix  ( = [ 1 , ⋅ ⋅ ⋅   2 ]) called projection matrix is composed of the feature vectors corresponding to the first  2 eigenvalues.
(6) Determine the projection data of the original feature data in the projection matrix, and then its principal component can be given by  = .(22) . .One-to-One Multiclassification Method Based on SVM.SVM is originally an effective binary-class classification method, and its basic model is defined as a linear classifier with largest interval in feature space.For multiclassification problems, SVM can also achieve classification in an one-tomany mode, one-to-one mode, etc.In this paper, the oneto-one mode of SVM is employed due to its simplicity.The flowchart for three-class classification using SVM based on one-to-one method is shown in Figure 5, and it will be used to classify BPSK, QPSK, and 16QAM.The basic classification principle of SVM is summarized below.The discriminant function of implementing SVM is given by [38] where  is the training sample input after dimensionality reduction using PCA,  refers to a weight vector,   (±1) denotes a category label, and  is an offset.Its interval is given by Its geometric interval is given by The purpose of SVM is to find the optimal  0 and  0 , which is to maximize the geometric interval , i.e., to minimize .The problem can be transformed into where   is a vector in  = [ 1 ,  2 , . . .,   ]  .Applying Lagrange multiplication into ( 26), then we get where   denotes a nonnegative Lagrange multiplier.Calculate partial derivative of  and  respectively and make them equal to zero; then we get Convert it to a dual problem and the target signal can be given by min  1 2 According to (29), the optimal Lagrange multiplier  0, can be obtained.Then optimal weight  0 can be given by Furthermore, according to () =    +  = ∓1, the optimal bias can be given by Finally, the objective function of the optimal classification can be given by where sign(⋅) is a symbolic function.It can be seen from the above analysis that the determination of the optimal weight vector is determined only by the optimal Lagrange multiplier, the training samples, and their categories.The position of the support vector and the offset are determined through training using the 2D feature data processed by PCA.Finally, the optimal classification boundary is found to achieve the correct classification for the test samples.The objective of classifying BPSK, QPSK, and 16QAM can be accomplished using the above classification process as depicted in Figure 5.By specifying a signal as a positive class, the rest of the other two signals are treated as negative classes, and finally the one-to-one method is used to classify the multiple signals.Through the above feature analysis, QPSK can be designated as a positive class, and BPSK and 16QAM are sequentially regarded as a negative class.The basic SVM is used for twice to make the two optimal classification boundaries, which can accurately identify the three signals to achieve the classification.

Simulation Analysis
. .Performance Analysis without Fading Channel Effect . . .Simulation Setup.In order to verify the performance of the hybrid classifying network, we did the following simulations, including training phase and testing phase.As known to all, bandwidth, code rate and SNR have a much more significant influence on the signal features in comparison with sampling frequency and carrier frequency.Hence, the signal classes for training and testing are simulated by changing BW, CR, and SNR instead of FS and FC for simplicity.In the training phase, the SNRs of the seven types of modulated signals are set to [10dB, 20dB, 30dB], respectively, and the total number of samples is set to 5000.The timing offset is 0.1s.The parameters for different kinds of signals are shown in Table 1, where TW, BW, CR, FC, and FS stand for time-width, bandwidth, code rate, carrier frequency, and sample frequency, respectively.There are 450 data segments serving as sample data for each type of signal modulation.
. . .Setting the resholds and Optimal Boundary Lines.In the first-layer of the network, the standard deviation features of the difference of the frequency peaks based on STFT are extracted and shown in Figure 6.Since SF has only one frequency, the difference between adjacent frequencies is approximately zero.Hence, the standard deviation are also nearly zeros.For LFM with linear frequency variation, the difference between the adjacent frequencies is constant, leading to a zero value standard deviation.For 2FSK and 4FSK, the difference between the adjacent frequencies leads to large standard deviations.For the remaining BPSK, QPSK and 16QAM with phase jumps, fluctuations in the difference between adjacent frequencies are the main reasons for large standard deviations.
Through training, the standard deviation threshold  1 is set as 0.4 according to the naive Bayesian algorithm [34].As  shown in Figure 6, {2FSK, 4FSK, BPSK, QPSK, and 16QAM} is above the boundary line and {SF, LFM} is below the boundary line.
In the left-branch of the second-layer training, standard deviation characteristics based on the real part of IA are extracted to classify SF and LFM.The real part of IA of SF is a DC level, whereas LFM corresponds to an AC signal.Hence, the standard deviation between the two types of modulation is quite different, as shown in Figure 7. Through training, the threshold  2 of standard deviation of the real part of IA can be set to 0.52 according to the naive Bayesian algorithm.As shown in Figure 7, LFM is above the boundary line, whereas SF is below the boundary line.
In the right-branch of the second-layer, the remaining signal set {BPSK, QPSK, 16QAM, 2FSK, and 4FSK} is classified by the features based on DFT.BPSK, QPSK, and 16QAM have multiple peaks within the bandwidth, and the number of peaks increases from 20 to 340 when the BW increases as shown in Figure 8.However, the peak numbers of 2FSK and 4FSK are distributed around 2 and 4, respectively, which means that the thresholds  3 can be set to 2 and 4. In this case, 2FSK and 4FSK can be identified from the signal set {BPSK, QPSK, and QAM}.In the third-layer of the network, the signal set {BPSK, QPSK, and 16QAM} is trained by multiclassification method of SVM based on PCA feature dimension reduction.
The main features employed include standard deviation of the envelope, zero-crossing ratio of the IA, and standard deviation of the IA.As can be seen from Figures 9-11, the distinguishing characteristics of the signals are much more obvious with the increase of the SNR.The 3D features are analyzed using PCA to make dimension degradation.From Figure 12 we can see that the contribution rate is still over 97% after the dimensions reduces to 2D.It indicates that the new 2D features can reflect more than 97% of the original 3D features.In other words, the new 2D features can replace the original 3D features with little loss.
The new 2D feature data is used as the training set, and the one-to-one method is substituted into the SVM for classification the support vectors (the positions of the circles in Figure 13) are found, thereby determining the optimal boundary 1.According to the optimal boundary 1, the recognition of BPSK and QPSK is attained.The second step is to classify QPSK and 16QAM.If QPSK is specified as a positive class, then 16QAM is used as a negative class.The 2D new features of the two signal classes are substituted into the basic SVM for training.The optimal boundary 2 is determined after the positions of the support vectors are found.According to the optimal boundary 2, the recognition of QPSK and 16QAM are obtained.The classification results are shown in Figure 13, from which we can see that BPSK, QPSK, and 16QAM can be accurately identified by the two optimal boundary lines.
. . .Performance Analysis.During the testing phase, the correct recognition rates of the signal set {BPSK, QPSK, 16QAM, LFM, SF, 2FSK, and 4FSK} at different SNRs are shown in Table 2.It can be seen from Table 2 that the correct recognition rate of the signals improves with the increase of . .Performance Analysis under Fading Channel Conditions.Multipath effect of a channel usually leads to serious distortion on the received signal, causing serious degradation on the AMC algorithm.A fading channel is taken into account to analyze the performance of the proposed classification network in this simulation.The received signal model in the fading channel circumstance can be written as where () is the transmitted signal, () is the additive white Gaussian noise, and ℎ(),  = 0, 1, . . .,  − 1 are the  fading channel coefficients.The channel ℎ() is considered nonrandom and assumed to be Rayleigh fading.The channel coefficients are randomly generated with variance 0.05 in the simulation except for ℎ(0) = 1.Other simulation conditions are the same as the above simulation.The correct recognition rates of the signal set {BPSK, QPSK, 16QAM, LFM, SF, 2FSK, and 4FSK} at different SNRs under fading channel are shown in Table 3.Compared with Table 2, the correct recognition rate of each signal decreases.SF and LFM go down a bit just about 1%, while 2FSK and 4FSK fall approximately 2%.Especially, the descending value of BPSK, QPSK, and 16QAM can reach about 6%.The result of the comparison indicates that the performance of the classification network in fading channel has a slighter decrease than the scenarios without a fading channel.

. . Performance Comparison with Algorithm in [ ].
The classification of QAM signal in the third layer is an important part in the proposed network, whereas diverse methodologies have been explored in how to classify the QAM signal class.The AMC algorithm based on high-order cyclostationarity proposed in [9] is a classic algorithm for QAM signal classification and has good classification effect and superior performance.This paper applies the second-order instantaneous autocorrelation algorithm to realize AMC, and its performance is compared with the one in [9].The adopted signals include BPSK, QPSK, and 16QAM.Figure 14 plots the total recognition performance of {BPSK, QPSK, and 16QAM} of the proposed algorithm and that of the algorithm in [9].A comparison of these curves shows that the two algorithms have similar performance in classification.The advantage of the instantaneous autocorrelation is less complexity in comparison with that of the high-order cyclostationarity approach.

Conclusion
This paper proposes an AMC network for the classification of radar and communication signals.In general, a three-layer classification network is employed, consisting of a series of feature extraction and classification methods such as STFT, DFT, IA, PCA, SVM, and naive Bayesian algorithm.Through the training of the large sample data, the setting of the classification thresholds of the machine learning algorithms is automatically realized.During the sample construction process, the comprehensive coverage of signal samples is attained by changing the key parameters such as code rate and bandwidth.The simulation results show that the correct recognition rate of the seven types of modulated signals can reach over 94% at SNR of 10dB and above if channel distortion is not considered.For fading channel scenarios, a degradation of the correct recognition rate of about 6% is observed as a performance comparison study.

Figure 1 :
Figure 1: The structure of an AMC system for radar and communication signal surveillance.

Figure 2 :
Figure 2: Diagram of the classification network.
| ⋅ | refers to the modulo operation and  2 the threshold of frequency peak, taking 0.7 times of the maximum value.Since the number of peaks of 2FSK and 4FSK is smaller than the other three signals, 2FSK and 4FSK can be identified from other signals by setting the frequency peak threshold  3 .

Figure 4 :
Figure 4: The real part of IA of the signals.

Figure 5 :
Figure 5: Three-class classification of SVM based on one-to-one method.

Figure 6 :
Figure 6: Standard deviation of the difference of the STFT peak.

Figure 7 :Figure 8 :
Figure 7: Standard deviation of real part of IA of SF and LFM.

Table 1 :
Parameters for different types of modulated signals.

Table 2 :
Correct recognition rate at different SNRs.

Table 3 :
Correct recognition rate at different SNRs with fading channel.