Pulse Wave Cycle Features Analysis of Different Blood Pressure Grades in the Elderly

Background and Objective The same range of blood pressure values may reflect different vascular functions, especially in the elderly. Therefore, a single blood pressure value may not comprehensively reveal cardiovascular function. This study focused on identifying pulse wave features in the elderly that can be used to show functional differences when blood pressure values are in the same range. Methods First, pulse data were preprocessed and pulse cycles were segmented. Second, time domain, higher-order statistics, and energy features of wavelet packet decomposition coefficients were extracted. Finally, useful pulse wave features were evaluated using a feature selection and classifier design. Results A total of 6,075 pulse wave cycles were grouped into 3 types according to different blood pressure levels and each group was divided into 2 categories according to a history of hypertension. The classification accuracy of feature selection in the 3 groups was 97.91%, 95.24%, and 92.28%, respectively. Conclusion Selected features could be appropriately used to analyze cardiovascular function in the elderly and can serve as the basis for research on a cardiovascular risk assessment model based on Traditional Chinese Medicine pulse diagnosis.


Introduction
Pulse diagnosis is very important in Traditional Chinese Medicine (TCM), and research has focused on obtaining objective evidence for the technique [1,2]. Pulse waves include objective information used in TCM pulse diagnosis. The time domain features of pulse waves have physiological significance and reflect the duration and amplitude of percussion waves, tidal waves, and dicrotic waves. The frequency and time domain features can reflect the disease state [3], especially in cardiovascular disease [4]. Early research [5] found that time domain features reflect hypertension. Recent studies have sought to identify a scientific correlation between pulse patterns (wiry pulse, slippery pulse, and others) and blood pressure using a computational approach. For example, the association between pulse waves and hemodynamic parameters has been studied in hypertensive patients [6,7], and research has shown that blood pressure values can be predicted by pulse waves [8]. Pulse waves and blood pressure values are closely associated [9][10][11]. According to TCM, most hypertensive patients have a wiry pulse [8]. A wiry pulse is also commonly found in normotensive elderly, especially in those over 60 years old. Moreover, the elderly are at higher risk for hypertension. In the elderly, it is unclear how to distinguish between hypertensive patients taking blood pressure medication and normotensive, using pulse waves with the same blood pressure values. In our previous studies, a series of features, including time domain (TD), energy ( ), and higher-order statistics (HOS) features of wavelet packet decomposition coefficients (WPDC), were used in pulse classification of health versus subhealth and atherosclerosis versus nonarteriosclerosis [12]. The results have proved the feasibility of above features in pulse analysis. So we hypothesized that time domain (TD), energy ( ), and higher-order statistics (HOS) features of wavelet packet decomposition coefficients (WPDC) in the pulse wave cycle, which can identify signal characteristics [13], may reveal differences in pulse waves within the same range of blood pressure values in hypertensive and normotensive. This study focused on individuals over 60 years of age to identify useful features in the pulse wave cycle that can demonstrate differences between hypertensive and normotensive, within the same range of blood pressure values. In this paper, firstly, the methods were introduced including the pulse data acquisition, preprocessing and pulse wave cycle segmenting, and feature extraction. Secondly, the experiments design and result were described. Thirdly, some details on experiments result discussion were given. Finally, the summary was presented.

Methods
Pulse data acquisition, preprocessing, pulse wave cycle segmentation, and feature extraction and classifier evaluation were performed. The general flow diagram is shown in Figure 1.

Data Acquisition.
Data were collected from elderly volunteer subjects who presented for physical examinations at the community health service center in Pudong New District of Shanghai. The subjects were allowed to rest for 3-5 minutes before data collection and were instructed to sit, breathe quietly, relax the upper arm, extend the forearm, and flex the shoulder and elbow to about 120 ∘ , with the left wrist on a pulse pillow. Then, our specially developed TCM pulse bracelet [14] was placed over the Guan position in the left hand to capture the best pulse signals for 10 s.
Subjects were excluded from analysis if they lacked complete data for control or outcome variables or had significant heart disease.
A total of 770 subjects met the inclusion criteria and provided 10 s of pulse data for grouping of pulse wave cycles into NG, HerG, and HestG categories, according to blood pressure values. NG subjects had a baseline systolic blood pressure < 120 mmHg or diastolic blood pressure < 80 mmHg. HerG subjects had a baseline systolic blood pressure of 130 to 139 mmHg or diastolic blood pressure 80 to 89 mmHg. HestG subjects had a baseline systolic blood pressure > 140 mmHg or diastolic blood pressure > 90 mmHg. Each group was divided into 2 classes by history of hypertension (yes or no). Pulse wave cycle data were obtained for 10 s in all subjects during data preprocessing.

Data Preprocessing.
Baseline wandering of original pulse data was removed with a high-pass filter in the sampling device. A bandpass from 0.5 Hz to 30 Hz filter was used to smooth waves affected by tremor or breathing. A Shannon Energy Envelope, Hilbert Transform (SEEHT) extractor was used for the percussion wave and beginning of the pulse wave cycle, as it was thought to be better than other extractors for wider, small pulse waves, or sudden changes in wave amplitude [12,15]. A pulse wave cycle was defined as the interval between two initial sets of pulse data.
SEEHT extractor for the percussion was showed in more detail in [15] (Figure 2). Firstly, a bandpass filter with 1∼ 4 Hz is designed to exclude other peaks and emphasize the percussion wave. Secondly, the data after bandpass filter are transformed by the Shannon Energy Envelope formula. However, wrist pulse signals between 1 Hz and 4 Hz are restrained by differentiated signals. SEE signals based on differentiated signals bring abrupt changes because the other waves are amplified in differentiation. The major local maxima of smooth SEE indicate approximate locations of the percussion waves. Hence, for detecting the percussion waves, a low-pass filtering is used for smoothing SEE to reduce the complexity of searching the local maxima. Thirdly, Hilbert Transform is used for finding the peaks. And then, the moving average filter signals after Hilbert Transform, which removed the lowfrequency drift, locate the peaks by zero-cross point from positive shaft to negative shaft. Finally, the real peaks of the   percussion wave are the maximum within 0.25 s in the pulse data after bandpass filter with 0.5∼30 Hz. Although the SEEHT method had shown good results for extraction of the percussion wave and the beginning of a pulse wave cycle, an error was observed in segmentation. This is basically due to morphological diversity in the pulse wave cycle. To eliminate the influence of segmentation error on the experimental results, noise in pulse wave cycles was excluded by visual inspection. For example, in Figure 3, there is a pulse signal (blue line) with low quality. The red asterisks are the percussion wave detected by SEEHT method. The red cycle is the start point of a pulse cycle and the end point of prior pulse cycle. So it is pulse cycle segmentation from one red cycle to next one. There are three error segmentations (red box) because of noise, so for every pulse sample segmentations result, we find out the error parts by visual inspection and delete that to ensure the effectiveness of pulse cycles in subsequent research.

Feature Extraction.
To identify differences in pulse wave cycles between elderly hypertensive and normotensive, TD, , and HOS features of WPDC were extracted after preprocessing.

Time Domain Feature Extraction.
A standard pulse wave is made up of 3 components: the percussion wave, tidal wave, and dicrotic wave. TD features include the duration and amplitude of the inflection point of 3 waves, which were extracted using a previously described method named Shap Threshold Value (STV) method ( Figure 4). STV method, which was described in more detail in pages 32-35 of [12], mainly contains two steps. First is that the pulse wave cycles are classified into eight pulse cycles (in Figure 5) by the shape according to expert experience and domain knowledge. Second step is detecting the inflection point in every shape using different threshold values.
Most TD features have clear physiological significance. In this study, 20 TD features ( Figure 4) were chosen for analysis including 6 duration features ( 1, 2, 3, 4, 5, ), 5 amplitude features (ℎ1, ℎ2, ℎ3, ℎ4, ℎ5, ℎ1/ 1, ℎ3/ℎ1, ℎ4/ℎ1), 4 width features ( 31, 51, 31/ , 51/ ), and 2 area features (As, Ad). The meaning of above features is showed in Table 1. 1 The duration of the percussion wave 2 2 The duration of the beginning of the tidal wave 3 3 The duration of the crest of the tidal wave 4 4 The duration of the beginning of the dicrotic wave 5 5 The duration of the crest of the dicrotic wave 6 ℎ1 The amplitude of the percussion wave 7 ℎ2 The amplitude of the beginning of the tidal wave 8 ℎ3 The amplitude of the crest of the tidal wave 9 ℎ4 The amplitude of the beginning of the dicrotic wave 10 ℎ5 The amplitude of the crest of the dicrotic wave 11 31 The width in 1/3 amplitude of the percussion wave 12 51 The width in 1/5 amplitude of the percussion wave 13 31/ The ratio of 31 and t 14 51/ The ratio of 51 and t 15 ℎ1/ 1 The ratio of ℎ1 and 1 16 ℎ3/ℎ1 The ratio of ℎ3 and ℎ1 17 ℎ4/ℎ1 The ratio of ℎ4 and ℎ1 18 A pulse cycle duration 19 As The systolic pulse wave area in 4 20 Ad The diastolic pulse wave area between the end of 4 and the end of

Wavelet Packet Decomposition.
The discrete wavelet transform (DWT) only decomposes low-frequency components (approximations: A). The wavelet packet method, which is an expansion of the DWT method, can increase a wide variety of skills and power of the wavelet transform [16]. Wavelet packet decomposition (WPD) utilizes both lowfrequency and high-frequency components (details: D). In WPD, the approximation achieved from the first level is split into new detail and approximation components, and this process is then repeated. Mother wavelet functions are important for wavelet packet coefficients and classification accuracy of extracted features. It was reported that the best feature set was obtained with the db6 wavelet function [17]. Therefore, this study chose the db6 wavelet function as the mother wavelet function to estimate the wavelet packet coefficients. The number of decomposition levels was set at  4. Therefore, 30 subbands were obtained for the fourth level of WPD. Figure 6 shows the fourth level of the WPD tree of pulse wave cycles.

Higher-Order Statistics and Energy Entropy.
Higherorder statistics (HOS) have been applied successfully to extract features for classification [13]. In signal processing, many signals, especially nonlinearities, cannot be examined properly by second-order statistical methods. Therefore, higher-order statistical methods have been developed. While first-and second-order statistics contain mean and variance, nonlinear combinations of higher-order moments contain cumulants [18]. Let ( ) is real, discrete time random process. The moments of ( ) are defined as the coefficients in Taylor series expansion of the moment generating function [19].
For zero mean discrete time signals, moments and cumulants are defined as [13] where [⋅] is defined as the expectation operation and (⋅) is the random process.
The second characteristic function of ( ), defined as [13] is called the cumulant generating function, and the coefficients in its Taylor expansion are the th-order cumulants of ( ), represented as ( 1 , 2 , . . . , ). The cumulants are defined as [13] 2 ( ) = 2 ( ) , The second-, third-, and fourth-order cumulants are calculated for each pulse cycle taking lag 0, which means that the value of , , equals zero. The zero-lag cumulants have special names: 2 (0) is the variance and is denoted by 2 ; 3(0, 0) and 4(0, 0, 0) are denoted by 3 and 4 known as skewness and kurtosis, respectively. In this study, the HOS methods are used to extract new and fewer number of features from the wavelet packet decomposition coefficients There were 30 subbands for the 4 levels as noted. Three features were extracted for each subband using HOS. We calculate HOS methods, second-, third-, and fourth-order cumulants including using cumulants functions in MATLAB 2013a: In addition, Shannon entropy was used to calculate the energy of WPDC with the following entropy function in MATLAB 2013a: where represents the wavelet packet decomposition coefficients of every pulse cycle. Thus, 30 features ( 1, 2 ⋅ ⋅ ⋅ 30) and 90 HOS features (HOS1, HOS2 ⋅ ⋅ ⋅ HOS90) were obtained for analysis.

Feature Selection.
CfsSubsetEval and BestFirst were used for feature selection; these are built-in attribute evaluator and search methods in WEKA 3.8. CfsSubsetEval evaluates the worth of a subset of features by considering the individual predictive ability of each feature along with the degree of redundancy. Subsets of features that are highly correlated with the class while having low intercorrelation are preferred. The BestFirst method searches the space of attribute subsets by greedy hillclimbing augmented with a backtracking facility. Setting the number of consecutive nonimproving nodes allowed control of the level of backtracking. BestFirst may start with an empty set of attributes and search forward, with a full set of attributes and search backward, or at any point and search in both directions. The process is shown as Algorithm 1. [20], which is the most effective and common nonparametric method in pattern recognition classification, was used for evaluation of the effectiveness of all features. -NN is independent of statistical distribution of training examples and classifies objects by computing their distance to the training examples in the feature space. The object is assigned to the class most common among its -Nearest Neighbors. In this study, when = 1, the object is simply assigned to the class of its nearest neighbor.

Classification. -Nearest Neighbor ( -NN)
To compare the results of classification, the statistical definitions used were as follows:

Experimental Results
After data preprocessing and noise removal, 6,075 pulse wave cycles were analyzed for the NG, HerG, and HestG groups, and the hypertension history and nonhypertension history classes. The 6 data sets are detailed in Table 2.
In every experiment, 20 TD features, 90 HOS features, and 30 features were selected with 10-fold cross-validation. The occurrence frequency of these features was designated as OF. For example, a 5-fold selected feature was chosen 5 times in 10-fold selection (OF = 5/10 × 100 = 50%). All chosen features with different OFs were divided into different combinations by eliminating lower values. Finally, 1-NN was used to verify different feature combinations. The experimental software platform for 1-NN and feature selection was Weka 3.8. All features were extracted in MATLAB 2013a.

Discussion
In the NG experiment, 7 features were selected for best performance: age, BMI, 51/ , , HOS29, HOS81, and 15 ( Figure 8); these were found at least 9 times in 10-fold selection (OF ≥ 90    TD features between normal values in the hypertension and nonhypertension groups. For example, in the NG experiment, an increase in age was accompanied by an increase in 51/ in the group with normal blood pressure values and a history of hypertension ( Figure 11(a)), but there was no consistent change in those without a history of hypertension ( Figure 11(b)). Most of the selected HOS and features of WPDC had low-frequency components. One feature from the first level, 2 from the second level, 1 from the third level, and 4 from the fourth level ( Figure 12) were selected as best features. The selected features included 3 third cumulants, 2 fourth cumulants, and 3 features.
Each subband level after WPD contained second-, third-, and fourth-order cumulants. Red box denotes selected features in NG; blue box denotes selected features in HerG; green box denotes selected features in HestG.
In TCM theory, the pulse type changes from slippery to wiry with age. The consensus among TCM physicians is that hypertensive patients have a wiry pulse. Research has shown a correlation between the rank of a wiry pulse and different levels of hypertension. Two types of wiry pulse (healthy elderly wiry pulse and hypertensive wiry pulse) show a distinct difference. The classification accuracy showed a decreasing trend as blood pressure values increased (97.91% in HG, 95.24% in HerG, and 92.28% in HestG). Because of the normal blood pressure values in the NG group, there are essentially 2 classes of a wiry pulse: the healthy elderly wiry pulse and the hypertensive wiry pulse. However, in the HerG and HestG groups, the pulse wave in those without a hypertension history reflected the features of a hypertensive wiry pulse. Thus, the classification accuracy in the HerG group was lower than that in the NG group, and the accuracy in the HestG group was lower than that in the HerG group. The features selected in the classification all achieved accuracy of greater than 92.28% in the 3 groups. Although the features of a hypertensive wiry pulse were present in 2 classes (hypertension and nonhypertension history) in the HerG and HestG groups, the selected features can also reflect cardiovascular function under conditions of sustained hypertension.

Conclusion
In elderly individuals, pulse wave cycle features in the same blood pressure range show significant differences according