Feature Recognition of Crop Growth Information in Precision Farming

1Department of Electronic and Information Engineering, Henan University of Animal Husbandry and Economy, Zhengzhou 450044, China 2College of Electrical Engineering, Henan University of Technology, Zhengzhou 450001, China 3School of Information Engineer, Henan Institute of Science and Technology, Xinxiang 453003, China 4Department of Applied Physics, Xi’an University of Technology, Xi’an 710048, China


Introduction
The plant electrical signal is a kind of physiological signal relating to plants' physiological process and their internal information transmission which is a kind of stress response of plants to environmental changes.Electrical signals can not only transmit remote information from point to point on a plant rapidly, but also respond to external stimuli such as light, temperature, burn, and salt sensitively; even these changes take place much earlier than other kinds of physiological and morphological changes usually [1,2].Through the research of plant electrical signals, the information expressed by plants in the natural world is deciphered by human beings.Therefore, an automatic adoptive control system conducted by plant electrical signals is established, which realizes the automatic production controlling system based on the concept of energy-saving and emission-reducing when manufacturing crops and vegetables [3].
With hundreds of years of studying plants' electrical signals, researchers have developed a set of plant signal models and physiological effects relating to this, from the stage where they can only identify the existence of those signals [4].However, relevant research of this field has not been deeply engaged by domestic and foreign scholars yet.Basic features of plants' electrical signals and their spectral distribution still remain uncertain.Recently, wavelet transformation is widely applied in the research of faint electrical signals such as EEG, ECG, and EMG, but it was seldom introduced to study plants' electrical signals.
As the good time-frequency localization characteristic wavelet transformation is acquired, it is very suitable for nonstationary and time-varying signal analysis; all varieties of wavelet entropy based on this feature are the combination of wavelet transform and the information entropy principle [5].Wavelet transformation combines the unique advantages in processing irregular signal and the statistical characteristics of figuring the complexity of information entropy.So the wavelet entropy has unique advantages in nonstationary and time-varying signal feature extraction.Although feature information of signal will be more adequate as the feature index increases, the complexity and difficulty of analysis may also be enhanced.In fact, there is a certain information correlation between different characteristic indexes of a same signal.In order to solve the related problems owing to the increase of information indexes, less independent index will be applied to reflect the information of the original index.We can use the principal component analysis (PCA) as the treatment.The principal component analysis method is used to reduce the number of the original number of evaluation indicators into a small number of comprehensive indicators (principal component), using the idea of dimension reduction.The indexes that are not related to each other can retain the majority of information in the original index, simplifying the complexity of the problem and achieving good results in the process of solving many practical problems.Therefore, the identification of plant electrical signals is proposed, based on the advantages of these two methods.
In this article, the concept of signal wavelet is decomposed and wavelet coefficients are reconstructed first, while transforming the original signal to a number of subsignals that have equal length and contain different frequency; then we extract wavelet energy entropy, wavelet singular entropy, and wavelet variance entropy of each subsignal to construct as the plant electrical feature; then the dimension reduction idea of principal component analysis (PCA) is used to treat a plenty of original data with dimension reduction, eliminating redundant information among features and obtaining a small number of comprehensive principal component indexes independent of each other; finally, identification of plant signal is realized by adopting the method of BP neural network.

Material and Osmotic
Stress.Maize seeds (Zhengdan 985) were provided by the Agricultural Seed Market of Henan.The seeds were germinated on wet filter paper in an incubator where a constant temperature of 30 ∘ C can be maintained.Once the seeds have germinated, they would be transplanted into a pot filled with soil.The soil contained some essential nutrients to supply the growth of plants; it was composed of 50% peat, 35% clay, and 15% humus (pH 6.0) and grown under a light intensity of 60mol m −2 s −1 provided by mercury halide lamps in a 14/10 h light/dark period, at a watering cycle of 3 days with 1% KH 2 PO 4 solution.To apply osmotic stress, 15% PEG6000 was introduced to solve this problem when the maize plants were at the 4-leaf stage.The electrical signals of maize leaves were measured after various periods of sampling under osmotic stress.Corn which grew in a normal condition is set as a comparing group.The leaf relative water content will be measured, and the corresponding electrical signals will be collected when a mild stress, moderate stress, and severe stress are imposed, respectively.About 10 groups of data will be collected in each case.

Measuring Instrument.
A new biological function experiment system (designed by Henan University of Technology, the integrated circuit Laboratory [6]) is introduced to measure electrical signals.The input impedance is 400 MΩ, and every input channel is photo-electrically isolated to decrease outside interference.The equivalent input noise voltage of the instrument was less than 1V.The measuring system is shown in Figure 1.The constant temperature control system, shielding box, and measuring instrument were connected to each other.The sampling frequency of the system and the frequency of the low-pass filter were 5 kHz and 400 Hz, respectively, and the frequency of the high-pass filter was 0.053 Hz.The unit of electrical potential energy is 1V and the unit of time is seconds.During the sampling, the temperature of the environment was 20 ∘ C while the humidity was 40-50%.
The electrode was a high-sensitivity silver AMBU ECG electrode.Before the experiment, the results of a test about the electrodes showed that the system can detect signals between 0 and 50Hz, while the fluctuations were less than 0.5%, which demonstrated the high electrode stability that this system acquires.To measure the plant electrical signal, the A electrode was placed on the front surface of the leaf and the B electrode on the back (leaf length, 25 cm; leaf width, 2 cm).The distance between two electrodes was 15 cm.The C electrode was connected to the copper sheet, which was covered by the soil.The measuring electrode was connected to the signal measurement system, and the electrical signal was monitored and recorded by a computer.The environment temperature was regulated by the constant temperature controller [7].
Signal recognition is the differential of the related information of signals, such as the fact that test corn leaves electrical signal were collected at different drought stress.The identified signals are often the same kind of signals.
The same kind of signals acquires different information due to different environments or man-made factors; the information of signal only represents a small percentage of all information, but it is the difference between categories of main information signal characteristics.With this idea, the paper is determined to find a suitable method to extract the characteristic of the category information, and we mark the reflected characteristics of the main information of the signal as similar characteristics, while those subtle but being hard to extract features are marked as category.In this paper, the study of the plant electrical signal feature extraction originates from this concept, and the work is divided into two parts: The first step is to find characteristics of the signal information that are clearly presented.The second step is extracting new features based on the first step.The proportion of each attribute characterization of information is different, and the category information characteristics are obtained.

Feature Extraction of Plant Electrical Signals Based on Multiwavelet Entropy
In the information theory, entropy is a description that describes the degree of uncertainty in the system.If a signal source is treated as a material system, the randomness and uncertainty the signal source will be greater if the output information expands, and the entropy will also be as great as the disorder.This is the reason why the information entropy is regarded as the measurement of system disorder degree [8].If the coefficient matrix of wavelet transformation is processed as a probability distribution sequence, the sparsity degree of coefficient matrix will be reflected by entropy value obtained from probability distribution sequence, that is, the order degree of signal probability distribution.Nevertheless, the probability distribution to random distribution of signals will be closer since the entropy value is greater [9].

Definition of Three Kinds of Wavelet Entropy
3.1.1.Wavelet Energy Entropy.Wavelet energy entropy is a statistic analysis that analyzes the energy distribution of analyzed signal on each frequency band, using the scale coefficient of wavelet transformation as a benchmark to divide the signal energy.Finally, a quantitative entropy value reflects the distribution complexity of signal energy. =  1 ,  2 , . . .,   is supposed as wavelet energy spectrum of signal () on  scales; thus a division of signal energy is formed on  scale domain.According to the feature of orthogonal wavelet, the sum power  in a time window is equal to the sum of every component power   .Suppose  =   / ( = 1, 2, . . ., ); then ∑   = 1.Therefore, define wavelet energy entropy as ln(  ) = 0 when   = 0.

Wavelet Singular Entropy.
The theory of singularity value decomposition can easily extract the fundamental modal feature of analyzed matrix.Wavelet singular entropy adopts the advantages of SVD theory to extract analyzed signal's wavelet transformation coefficient matrix feature, reflecting the time domain distribution feature of analyzed signal.
Suppose that signal's wavelet decomposition in the scale  ( = 1, 2, . . ., ) is  () ; thus the decomposition in  scales can constitute an  ×  matrix .According to SVD theory, for an  ×  matrix , there must exist an  ×  dimensional matrix , an  ×  dimensional matrix , and an  ×  dimensional matrix Λ, decomposing matrix  as where diagonal matrix Λ's main diagonal element   ( = 1, 2, . . ., ) is not negative and in descending order, that is,  1 ≥  2 ≥ ⋅ ⋅ ⋅ ≥   ≥ 0. These diagonal elements are singular value of wavelet transformation result  × .When the signals have no noise or high signal-to-noise ratio, according to signal's singular decomposition theory, their main diagonal singular value will be only a small number but not zero.The signal wavelet decomposition result matrix's singular value also meets similar rules.The less the signal's frequency component, the less the number of wavelet decomposition result's singular value that is not zero.In order to quantify the feature of signal's frequency component and distribution feature, wavelet singular entropy is defined as where Δ  is  th incremental wavelet singular entropy.
Treating wavelet transformation result matrix with singular value decomposition is equivalent to mapping wavelet space associated with each other to linearly independent feature space.Wavelet singular entropy can discriminate signals with different time domain distribution in a quantifiable way.The greater the signal's complexity and uncertainty, the greater the value of quantify wavelet singular entropy.Complexity 3.1.3.Wavelet Variance Entropy.For a variance spectrum {  () = (1/)‖  ‖ 2 },  = 1, 2, ⋅ ⋅ ⋅ , obtained from a given signal's wavelet decomposition, among which   is wavelet coefficient's variance spectrum under  scale.Wavelet variance entropy is defined as follows: where   =   ()/ ∑    () indicates the variance probability under  scale.Step 1. Deal signal () with two-layer wavelet decomposition.Two high-frequency coefficient cd1, cd2 and second layer low-frequency coefficient ca2 are extracted, respectively.

The
Step 2. Reconstruct wavelet decomposition coefficient and extract signals among every frequency range.  ( = 1, 2, 3) is used, respectively, to indicate reconstruction signal of cd1, cd2, and cd3.Therefore, entire reconstruction signal  in two-layer wavelet decomposition can be described as Step 3. Calculate wavelet energy entropy, wavelet variance entropy, and wavelet singular entropy of each reconstruction signal.  (),   (), and  V () are used, respectively, to indicate wavelet energy entropy, wavelet singular entropy, and wavelet variance entropy, among which  = 1, 2, 3.
Step 4. Reconstruction feature vector: all three wavelet entropy values are expressed in scientific notation.Extract their effective numbers to construct multiple wavelet entropy's feature vector , which is shown as follows: In this article, db5 wavelet that has compact support and orthogonality is used to treat corn electrical signal with wavelet decomposition, adopting kinds of discussed wavelet entropy extraction algorithm to construct feature vector.This kind of feature vector combines with different entropy's features, thus describing the original signal's information content in a comprehensive way.

Principal Component Analysis of Multiple Wavelet Entropy Feature
Principal component analysis is not only a statistic analysis method, but also a data compression and feature information extraction method.It will concentrate information which disperses in a set of variables to a few comprehensive indexes (principal component) independent of each other.Every principal component is linear combination of original variables.Every principal component is in an orthogonal relationship with each other, in order to achieve data dimension reduction and redundant information elimination.Set the original sample data of 10×9 matrix of the multiwavelet entropy feature of the corn in good growing condition as an example.The theory of using principal component analysis to realize data compression and principal component extraction is shown as follows [10].
Step 1. Data standardization process: suppose that the original sample data matrix of the multiwavelet entropy feature of the corn in normal growing condition is  = ((, )) 10×9 , among which  = 1, 2, . . ., 10,  = 1, 2, ⋅ ⋅ ⋅ 9, and (, ) is the signal's jth feature index.The expression of data standardization process is where   is sample mean and   is sample standard deviation.Their calculation expressions are as follows: After standard treatment, the normalized matrix  * is obtained.
Step 2. Calculate correlation coefficient matrix.Calculate the correlation coefficient matrix  = ( (,) ) 9×9 of normalized matrix  * .The calculation expression of  (,) is Step 3. Calculate the feature value and corresponding feature vector of correlation coefficient matrix.Let | − | = 0 and 9 nonnegative feature value   can be calculated.They are the variance of principal component.The feature value is sorted by descending order.The corresponding feature vector is set as  1 ,  2 , . . .,  9 ; thus the  th principal component expression of original sample data is Step 4. Determine the number of principal components.The principle of selected subordinate component is as follows: original sample date  corresponds to feature value   and accumulated contribution rate  = ∑  =1 / ∑ 9 =1   of  principal component is over 85%, indicating this  is the principal component representing the similar feature of original signal.Select the rest 9- subordinate component as the category feature of identification signal.

Time Domain Characteristics of Electrical Signal in Maize
Leaf under Osmotic Stress.Plant electrical signals are very weak and are easily masked by background noise.Although the measuring instrument and the shield method used in this study can reduce external interference, there is still some background noise.To obtain the most accurate power spectrum of signals in maize leaves, we used a wavelet transform method to denoise the original signal.The time domain waveform of the electrical signal in maize leaf is shown in Figures 2(a), 2(b), 2(c), and 2(d).This waveform was decomposed through a db5 wavelet at seven scales, soft threshold denoised, and reconstructed.The electrical signals were sampled separately at the normal growing condition, the mild drought stress, the moderate drought stress, and the severe drought stress.As it can be seen from calculation of each table, the accumulated contribution rate of first three principals in normal growing condition group is 85.38% when  = 3, the accumulated contribution rate of first three principals in mild drought stress group is 98.02%, the accumulated contribution rate of first three principals in moderate drought stress group is 87.03%, and the accumulated contribution rate of first three principals in severe drought stress group is 96.38%.Therefore, select the last six components as category feature in this  Four groups of multiwavelet entropy of corn electrical signal are taken into expression (12) to calculate category feature.The specific calculation process is omitted here.Finally, the category feature can be seen in Tables 5, 6, 7, and 8.

Classification of Feature Value by BP Neural Network.
Feature extraction is that multiwavelet entropy of plant electric signal is constructed as feature vector, through principal component analysis to get a few independent principal component indexes and extract the important information which has classification significance.However, classifier design is aimed at making feature vector fall into different categories and use it realize the automatic identification of signals.In practical applications, the number of nodes input layers in the network should be determined according to the dimensions of the input data, and the number of nodes output layers in the network is according to the design of the classifier requirement in general.If the network is used as classifier, the number of node output layers should be to classification categories.The number of hidden layers can use one, two, or more hidden layers to construct the network structure.The more hidden layers of network, the stronger the mapping function, but the training time is very long.Number of hidden layer nodes selected is too little, the network will not have enough freedom to better fit the training data, the network is not strong or not training to come out, and then it cannot even identify samples that have not been met before, where fault tolerance is poor.If the number of hidden layer nodes is too high, the network training time is too long, and the error is not necessarily the best.In general, it can only rely on experience and test to determine the number of hidden layer nodes.
BP network is a kind of multilayer network of forward propagation.Apart from having input layer and output layer, it also has one or more hidden layers.A 3-layer BP neural network can be done for any n-dimensional to m-dimensional mapping [2,[11][12][13][14][15][16] .The network training method is called the error backpropagation method.It makes use of the minimum error between the actual output of the network and the expected output (the target vector) to the network's multilayer connection weights and then after the forward layer by layer correction.The method of successive correction of the weight vector is called successive correction method.Four sets of feature vectors for each class of signals are selected randomly in the experiment, which are taken into BP neural network to process 20000 times learning, and the error is close to 0.01, and the structure of selection is 3layer BP neural network 4-15-4.The transfer function of the computing unit in the network is trained by the traingdm type function.The results of the experiment are shown in Table 9.

Analysis of Result.
The proposed method is applied to classify the electrical signals of maize leaves in four cases.It could be seen that the recognition rate of the normal growing group and the moderate drought stress group was lower than that in the mild drought stress group and the severe drought stress group.Similarly, it can be seen by the contribution rate in the normal growth and moderate drought stress groups of similar information contained in the class features compared to the other two groups.So the category feature is not obvious, and recognition rate is not as that of the two groups.In general, the average recognition rate of the method is 93.75%, and the recognition rate of it is not only higher than that of just using multiwavelet entropy structure feature; recognition rate is 83%.And the running efficiency of the program is fast, more than half [17,18].
When this method is classified in BP neural network, it will be able to achieve the set target for each training 7000-8000 times.And when the feature of multiwavelet entropy is identified, training 20000 times to achieve the set target probability is only 90%.It can be seen that it is feasible to use the multiwavelet entropy and principal component analysis to identify the plant electrical signals.Multiwavelet entropy can characterize the information contained in plants' electrical signals.The principal component analysis can not only reduce the dimension of feature vectors and remove redundant information, but also get the classification features of the plant signal, which makes the recognition rate of the classifier greatly improved.
This method also has the advantages of practical application, and the common signal recognition method is the identification of a single signal, but this method is to identify a set of signal samples at the same time.In facility agriculture, it is obviously inappropriate for the detection of a signal to represent a regional crop.Therefore, a large number of repeated experiments are required to calculate the features of the regional crop.A regional crop signal is correlated; as a result, the duplicate information was also tested in repeated trials, and this does not conform to the principle of efficiency.The method proposed in this paper solves this problem.Not only can it identify the data samples of a region at the same time, but also the principal component analysis has removed the correlation between signals.In addition, it has high recognition rate, which can make the early warning and the diagnosis of the plant under the external environment stress more effective.

Conclusion
In view of the recognition problems of weak, complex, and strong random plant electrical signals, a signal feature extraction method based on multiwavelet entropy and principal component analysis is proposed; the conclusions can be drawn as follows.
(1) All kinds of wavelet entropy are organic combination consisting of wavelet transformation, entropy principle, and related theories; it is a blend of three unique advantages in signal processing; thus it can extract and analyze the nonstationary signals effectively; fusion feature of wavelet entropy can make a good characterization to all of information of plant electrical signal.
(2) Make full use of wavelet transform and principal component analysis.Wavelet transformation time-frequency resolution features are used for signal refinement and signal feature extraction in frequency.Using principal component analysis can not only convert multiwavelet entropy feature index to a handful of independent indexes while eliminating redundant information among features, but also pass through the contribution degree classifying main information and secondary information of the signal.Therefore, we can obtain the class feature, reduce the eigenvector dimensions, simplify the classifier design, and improve the classifier recognition rate.
(3) The proposed approach is used to recognize a variety of conditions of corn leaf electrical signals, it obtains satisfactory results of this classification, and the experimental results verify the feasibility and the validity of this method.
To sum up, more plants for more extensive research shall be used in further study, while the application scope of this method shall be broadened, and we should make more sufficient data mining of plant electric signals to extract the better feature parameters.Based on the above, we have built a practical recognition system of plant electric signals successfully.

Data Availability
The data used to support the findings of this study are available from the corresponding author upon request.

Figure 2 :
Figure 2: Time domain waveform of electrical signal of maize leaf under osmotic stress: (a) the normal growing condition; (b) the mild drought stress; (c) the moderate drought stress; (d) the severe drought stress.
Leaf of Maize.Fast Fourier transform was used to analyze the spectrum of the electrical signal in maize leaves.The power spectrum distributions at various intervals under osmotic stress are shown in Figure3.

5. 3 .
Feature Vector.The original sample date of multiwavelet entropy of corn electrical signal under 4 conditions is 10 × 9 matrix, whose feature values are shown in Tables 1, 2, 3, and 4.

Figure 3 :
Figure 3: Power spectrum distribution of electrical signals in maize leaf under osmotic stress.(a) The normal growing condition; (b) the mild drought stress; (c) the moderate drought stress; (d) the severe drought stress.
Feature Extraction Method of Multiple WaveletEntropy.Different wavelet entropy reflects signal's distribution statistic feature from different angles.Due to the existence of uncertainty and incompleteness of feature information, especially when the case of error or loss of information occurs during the procession of signal transmission, not every kind of wavelet entropy can completely reflect signal's time-domain feature.Aiming at different types of signals, different kinds of wavelet entropy algorithms have different advantages and features.If we extract, classify, and identify plant electrical signals with single wavelet entropy, it would be hard to deal with signal's uncertainty and changeability interference, which cannot guarantee rightness and reliability of classification.As is shown below, a given signal constructs multiple wavelet entropy feature vectors; detailed steps are as follows.

Table 1 :
The multiwavelet entropy of corn's multiwavelet entropy of the normal growing condition group.

Table 2 :
The feature vector of corn's multiwavelet entropy of the mild drought stress group.

Table 3 :
The feature vector of corn's multiwavelet entropy of the moderate drought stress group.

Table 4 :
The feature vector of corn's multiwavelet entropy of the severe drought stress group.

Table 5 :
The category feature of multiwavelet entropy of the normal growing condition group.

Table 6 :
The category feature of multiwavelet entropy of the mild drought stress group.

Table 7 :
The category feature of multiwavelet entropy of the moderate drought stress group.

Table 8 :
The category feature of multiwavelet entropy of the severe drought stress group.

Table 9 :
The experimental result of BP neural network.