1. Introduction

CMMM

Computational and Mathematical Methods in Medicine

1748-6718 1748-670X

Hindawi Publishing Corporation

10.1155/2015/846942

846942

Research Article

NMFBFS: A NMF-Based Feature Selection Method in Identifying Pivotal Clinical Symptoms of Hepatocellular Carcinoma

http://orcid.org/0000-0001-5781-3465

Zhiwei

^{1, 2} Meng

Guanmin

³ Huang

Deshuang

¹ Yue

Xiaoqiang

⁴ Wang

Bing

^{1, 5,6} Huang

Tao

Machine Learning & Systems Biology Lab

School of Electronics and Information Engineering

Tongji University

4800 Caoan Road

Shanghai 201804

China

tongji.edu.cn

School of Information Engineering

Zhejiang A&F University

88 Huancheng North Road

Linan 311300

China

zjfc.edu.cn

Department of Clinical Laboratory

Tongde Hospital of Zhejiang Province

234th Gucui Road

Hangzhou 310012

China

zjtongde.com

⁴

Department of Traditional Chinese Medicine

Changzheng Hospital

Second Military Medical University

415 Fengyang Road

Shanghai 200003

China

smmu.edu.cn

⁵

The Advanced Research Institute of Intelligent Sensing Network

Tongji University

4800 Caoan Road

Shanghai 201804

China

tongji.edu.cn

⁶

The Key Laboratory of Embedded System and Service Computing

Tongji University

4800 Caoan Road

Shanghai 201804

China

tongji.edu.cn

2015

12102015

2015 22 04 2015 20 06 2015 02 07 2015 12102015

2015

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Background. Hepatocellular carcinoma (HCC) is a highly aggressive malignancy. Traditional Chinese Medicine (TCM), with the characteristics of syndrome differentiation, plays an important role in the comprehensive treatment of HCC. This study aims to develop a nonnegative matrix factorization- (NMF-) based feature selection approach (NMFBFS) to identify potential clinical symptoms for HCC patient stratification. Methods. The NMFBFS approach consisted of three major steps. Firstly, statistics-based preliminary feature screening was designed to detect and remove irrelevant symptoms. Secondly, NMF was employed to infer redundant symptoms. Based on NMF-derived basis matrix, we defined a novel similarity measurement of intersymptoms. Finally, we converted each group of redundant symptoms to a new single feature so that the dimension was further reduced. Results. Based on a clinical dataset consisting of 407 patient samples of HCC with 57 symptoms, NMFBFS approach detected 8 irrelevant symptoms and then identified 16 redundant symptoms within 6 groups. Finally, an optimal feature subset with 39 clinical features was generated after compressing the redundant symptoms by groups. The validation of classification performance shows that these 39 features obviously improve the prediction accuracy of HCC patients. Conclusions. Compared with other methods, NMFBFS has obvious advantages in identifying important clinical features of HCC.

1. Introduction

Hepatocellular carcinoma (HCC) is the third most common cause of cancer-related death worldwide and the leading cause of death in patients with the complication of cirrhosis [1, 2]. The occurrence of HCC is larvaceous and short of specific symptoms [3, 4]. Its diagnosis depends on biopsy, imaging examination such as Doppler ultrasound, computed tomography, magnetic resonance imaging, and blood test [5, 6]. Once the patients with HCC see doctors, the disease has often entered its late stage, losing the chance of resection. Hence, seeking simple methods to predict HCC and its clinical stage is very meaningful and helpful to improve the diagnosis of HCC.

As one of the most popular complementary and alternative medicine modalities, Traditional Chinese Medicine (TCM) plays an active role in treatment of malignant tumors including HCC in Chinese and some East Asian countries [7, 8]. Unlike modern medicine, the diagnosis and treatment of TCM depend on the analysis of symptoms and signs of HCC collected by inspection, auscultation and olfaction, inquiry, and pulse taking and palpation [8]. TCM regards specific combination of symptoms and signs as a TCM syndrome, which is the main basis for treatment; and it can be also used to guide clinical diagnosis of HCC. Our previous work proposed a hierarchical feature selection (PSOHFS) model to quickly identify the potential HCC syndromes from a TCM clinical dataset [9], by which all the original symptoms were classified into several groups according to the categories of clinical observations, and each symptom group was then converted into a syndrome signature to reduce the searching space of feature selection. But the limitation of this method is that the interactions among symptoms which belong to different categories (aspects) were ignored. Therefore, the current challenge is to design an efficient feature selection approach for high-dimensional TCM data with consideration of clinical significance.

In this study, a nonnegative matrix factorization- (NMF- [10]) based feature selection (NMFBFS) method was proposed to select pivotal clinical symptoms for HCC diagnoses. A TCM clinical dataset was used in this work, which consisted of 407 HCC patients with 57 clinical symptoms. Each patient sample is labeled with a clinical-staging symbol which indicates the severity of certain patient. Firstly, the preliminary screening with statistical method was designed to detect irrelevant symptoms from the full symptom set. Secondly, the process of NMF was implemented after eliminating the irrelevant symptoms. Based on the NMF-derived basis matrix, we defined a similarity measure to infer redundant symptoms by calculating the distance and correlation among the symptoms. Finally, the secondary dimension reduction was implemented based on the inferred groups of redundant symptoms. We converted each symptom group to a new feature (named “mixed feature”) if these symptoms represent similar distribution patterns on the sample space. The experiment results show that 39 novel features inferred by NMFBFS obviously improve the accuracy of diagnosis of HCC clinical samples. Moreover, NMFBFS-derived 39 optimal clinical features included some well-known common symptoms of HCC patients. Comparing to three representative feature selection methods (ReliefF [11], mRMR [12], and Elastic Net [13]), our proposed approach showed the best performance to identify optimal clinical features for HCC patients.

2. Materials and Methods 2.1. Experimental Data 2.1.1. Description

In this work, the questionnaire survey dataset of HCC includes 407 samples within two years, and each patient was observed on 57 clinical symptoms (Table 1). Each patient sample is labeled with a symbol of clinical stage, which is related to TCM pattern of syndrome and indicates the severity degree of HCC. According to the international staging system [14], there are three stages and two substages in each phase in this dataset. The aim of our work is to identify the symptom signatures, which are related to three clinical stages: phases I, II, and III, and the larger value indicates that stronger positive symptom occurred. Within our dataset, all the original symptoms are described by two types of data: binary (0 or 1) or integer (0, 1, 2, 3, …). For example, the type of symptom “tinnitus” is binary (0 or 1), which means two possible states: occurrence (positive) or nonoccurrence (nonpositive). Another example is “sleeplessness” whose value can be 0, 1, 2, or 3. The larger the value is, the stronger the positive state will be. A symptom does not appear positive if its value equals zero.

Table 1

The description of original clinical data of HCC patients.

Sex	Phase I (82)		Phase II (195)		Phase III (130)
Sex	PhaseIA	PhaseIB	PhaseIIA	PhaseIIB	PhaseIIIA	PhaseIIIB
Male	33	27	50	115	95	10
Female	12	10	10	20	16	9

2.1.2. Data Preprocessing

Refinement of Feature Set. Our original dataset consists of 407 HCC patient samples (Table 1). The first step of preprocessing is to remove the useless features because they provide no useful information for the following classification. If a feature is constant on all the observed samples, it can be considered as useless feature. For our dataset, some symptoms, such as “pale tongue” and “slow pulse,” were removed out because there is no any observed patient positive on these symptoms. After removing this kind of features, a refined clinical dataset with 407 samples and 57 symptoms ( V 1 , … , V 57 ) can be obtained.

Simplification of Clinical Staging. The clinical staging of HCC patients in our original dataset was marked with collections “IA,” “IB,” “IIA,” “IIB,” “IIIA,” and “IIIB.” For identifying the symptom signatures related to three clinical stages, all the samples would be relabeled as three classes. Here, we remarked class label “1” for the samples labeled “IA” and “IB.” In a similar way, class label “2” is used for “IIA” and “IIB” and “3” is for “IIIA” and “IIIB.” Finally, all the 407 clinical samples can be distributed in three categories: 82 samples in phase I, 195 in phase II, and 130 in phase III. The details of the refined dataset were described in Table 1.

2.2. Feature Selection

Feature selection can be organized into three categories, depending on how they interact with the construction of model. Filter methods employ a criterion to evaluate each feature individually and are independent of the model [15]. Among them, feature ranking is a common method which involves ranking all the features based on a certain measurement and selecting a feature subset which contains high ranked features [16]. However, one of the drawbacks of ranking methods is that the selected subset might not be optimal in that a redundant subset might be obtained. Wrapper methods involve combination searches through the feature space, guided by the predicting performance of a model [17]. Heuristic search is widely used in wrapper methods as searching strategy which can produce good results and is computationally feasible; however, they often yield local optimum results. For an embedded method, the feature search process is embedded into classification algorithm, so that the learning process and the feature selection process cannot be separated [18].

2.3. Nonnegative Matrix Factorization

Nonnegative matrix factorization (NMF) aims to obtain a linear representation of multivariate data under nonnegativity constraints. These constraints lead to a part-based representation because only additive, not subtractive, combinations of the original data are allowed [19]. In general, NMF can be used to describe hundreds to thousands of features in a dataset in terms of a small number of metafeatures, particularly in gene expression profiles analysis [20–22].

Let X be n × p nonnegative matrix; that is, each element x i j ≥ 0 in X . Nonnegative matrix factorization (NMF) consists in finding an approximation (1) X ≈ W H , where the basis matrix W and the mixture coefficient matrix H are n × r and r × p nonnegative matrices, respectively, where r > 0 and r ≪ m i n ⁡ ( n , p ) . The objective behind the small value of r is to summarize and split the information contained in X into r factors (also called “basis” or “metafeature”). The matrix H has the same number of samples but much smaller number of features rather than matrix X . Therefore, the metafeature expression patterns in H usually provide a robust clustering of samples [22].

The main approach to NMF is for solving estimate matrices W and H as a local minimum: (2) [ D ( X , W H ) + R ( W , H )] W , H ≥ 0 min ⁡ , where D is a loss function that measures the quality of the approximation which is usually based on either the Frobenius distance or the Kullback-Leibler divergence [19]. R is an optional regularization function, defined to enforce desirable properties on matrices W and H , such as smoothness or sparse [23, 24].

In our study, the loss function in NMF is based on Kullback-Leibler divergence [25]. The above function R was defined as follows: (3) R W , H = F 1 W + F 2 H , where F 1 W and F 2 H are regulation functions for W and H , respectively. Here, we applied Tikhonov smoothness regularization [26] for W in (4) F 1 W = 1 2 ∑ i , j W i j - c 2 , where c is a constant positive or zero. In addition, we applied sparsity-enforcing regularization [26] for H in (5) F 2 H = 1 2 ∑ j H . j 2 2 - α 2 H . j 1 2 2 . In formula (5), H . j is j th row of H . H . j 2 2 and H . j 1 2 define the l 2 -norm and l 1 -norm of H . j . The algorithm proposed by Lee is a well-established method to solve the optimization of NMF [27].

2.4. NMF-Based Feature Selection

In this study, our proposed NMF-based feature selection (NMFBFS) approach can be seen as a two-stage filter method. In the first stage, preliminary screening is implemented to detect irrelevant symptoms and remove them from the whole feature set. In the second stage, NMF clusters the redundant symptoms which potentially have similar patterns into different groups, and each group is then transformed into new single features to reduce the dimension. Obviously, the process of NMFBFS is independent of classifier and can quickly infer the optimal feature subset even in the high-dimensional dataset. The flowchart of NMFBFS is shown in Figure 1.

Figure 1

The flowchart of the proposed approach.

2.4.1. Removing the Irrelevant Symptoms

In our questionnaire, all the symptoms were defined by clinical doctors, which covered many aspects of patients. However, the relevance weight of each feature for distinguishing samples among the clinical stages was not quantitatively studied. In machine learning, the irrelevant features provide no useful information in any context and always scarcely contribute to patient stratification [28]. If the sample size is large, it is meaningful to quickly detect the irrelevant symptoms by calculating the frequencies of positive. Here, we calculated the ratio (frequency) of presence (positive) of each symptom on the samples in every clinical stage. If the frequencies of certain symptom in all the clinical stages are very low, which indicates that this symptom hardly appears positive in most of patients, therefore it is considered as an irrelevant symptom. After removing the irrelevant symptoms from the original dataset, the rest of symptoms are considered as relevant features, which are potentially related to at least one class of patients (or one clinical stage).

2.4.2. Identifying Redundant Symptoms Based on NMF

After the irrelevant symptoms had been removed, nonnegative matrix factorization was applied on the dataset X ( n × p ). For a given rank r , the matrix X can be decomposed to basis matrix W and coefficient matrix H . Usually, the value of rank r is much smaller than the number of features ( n ) and the sample number ( p ), so that there is at least one dimension in both W and H being very small. The widespread appliances of NMF in biclustering further indicate that basis matrix W can be used for feature clustering and coefficient matrix H is used for sample clustering, respectively [20, 21]. In our study, the number of samples is much larger than the dimensionality; hence, directly calculating distance or correlation to measure the similarity between original features (symptoms) on all the samples will lead to biases because some features might represent local similar patterns on a part of samples. Fortunately, the basis matrix W represents the compressed sample space of matrix X , which facilitates uncovering the difference between features. Here, we introduced two features ( v i and v j ) in original dataset X as an example to clarify the basic idea of this step. According to the definition of NMF, we can easily know (6) x i = w i × H , x j = w j × H , where x i and x j are i th and j th rows of matrix X ; w i and w j are i th and j th rows of matrix W . The following can be easily found. (1) If w i ≈ w j , then x i ≈ x j ; (2) if w i = k w j , then x i = k x j , where k is a constant. Furthermore, if i th row w i in matrix W is very close to w j , the feature v i might have a similar pattern as v j on all the samples. Therefore, we defined a novel similarity measurement in formula (7) to approximately evaluate redundancy between the two original symptoms via matrix W : (7) sim v i , v j ≈ sim w i , w j = sim_dist w i , w j + sim_corr w i , w j 2 , where (8) sim_dist w i , w j = 1 - w i - w j × w i - w j T Max ⁡ D , (9) sim_corr w i , w j = w i - w - × w j - w - T w i - w - × w i - w - T × w j - w - × w j - w - T . Formula (8) uses distance-based similarity, which indicates how two corresponding features are close to each other; and formula (9) adopts correlation-based similarity, which describes similar patterns of two original features. Hence, our developed similarity measurement considered distance and correlation between features at the same time. Max ⁡ D in formula (8) is the maximal distance value in all pairs of ( w i , w j ). Based on the above definition of similarity, we further calculated the similarity matrix S M X using all the basis rows in W ( S M X i , j = s i m v i , v j ), where element S M X i , j denotes the similarity between original features i and j . Given a threshold θ ( 0 < θ < 1 ), we can screen all the redundant features by groups with S M X i , j > θ .

2.4.3. Transformation of Redundant Symptoms by Group

In the above section, all the redundant symptoms were screened out and were organized into different groups. For each symptom group, a new mixed feature was extracted as the representation of the whole group and replaced all the original features within this group. Therefore, NMFBFS-inferred optimal feature subset includes two parts: nonredundant original features and new generated mixed features (see Figure 1). There are two strategies that can be used to transform the redundant symptom groups to mixed features.

(1) Calculate the mean vector from all the redundant symptoms as in (10) x N F = mean x r 1 , x r 2 , … , x r n , where x r 1 , x r 2 , … , and x r n are the feature vectors of original dataset X and are determined as redundant symptoms in a group. n denotes the number of inferred redundant symptoms in a group. The vector x N F of new single feature v N F was averaged on that group.

(2) Randomly select a vector from one of redundant symptoms as (11) x N F ∈ x r 1 , x r 2 , … , x r n . In our study, we transformed the groups of redundant symptoms to new mixed features by using formula (10). After this step, the feature space of the clinical dataset was further reduced so that the optimal feature subset rarely included redundant features.

3. Simulation Design

Firstly, we calculated the frequencies of each original symptom appearing positive at each clinical stage and then removed the irrelevant symptoms if their frequency values were very low.

Secondly, a representative sample set was screened out for NMF analysis. In our dataset, the number of samples in three phases of HCC varies a lot, that is, from 82, 130 to 195. If the whole dataset is used, a class imbalance problem will be caused [29–31]. In addition, the sex ratio of patients is also seriously unbalanced in the original dataset (Table 1). For avoiding the bias caused by imbalance of samples, we selected 40 samples from each clinical phase with equal proportion of male and female (20 : 20) to construct a representative clinical dataset D R (120 samples in total) for the following NMF analysis. Considering the fact that each original sample has a class label which corresponds to clinical stage of that patient, for all the original samples (407), we can actually get a preliminary participation of samples as three clusters, which can also be considered as a trained KNN clustering model [32]. We then defined the center of each cluster, which is the mean vector of all the samples in the same cluster. Given a large value of K , we input each center of cluster into the above KNN model and keep the output consistent with the corresponding class label of the center. Based on the K -nearest neighbors, we can finally screen out 40 representative samples (20 males and 20 females) of each clinical stage according to Euclidean distance.

Finally, several redundant symptom groups were identified. Then we transformed each redundant symptom group into a new mixed feature. Combining all the nonredundant original features with new generated mixed features, we obtained an optimal clinical symptom subset of HCC. At last, the classification performance of this feature subset was further validated by least squares support vector machines (LSSVM) [33, 34].

Experimental Parameters. At first, we set a frequency threshold to identify the irrelevant symptoms. The NMF R package [35] was then employed as a computational framework for nonnegative matrix factorization algorithms in R . For this method, the optimal rank r should be determined firstly. Currently there are several approaches that had been proposed to determine the optimal value of r [36, 37]. In our study, two methods, that is, cophenetic coefficient [36] and RSS curve [37], had been adopted to determine the optimal rank r range from 2 to 7. After obtaining the results of NMF with optimal r , we calculated the similarity matrix S M X with all the basis rows and inferred the redundant symptoms with a threshold θ = 0.95 , which meet the following conditions: s i m _ c o r r ( w i , w j ) ≥ 0.95 and s i m _ d i s t ( w i , w j ) ≥ 0.95 in formulas (7)–(9). Finally, a LSSVM classifier had been implemented to validate the classification performance of inferred optimal symptom subset. In the LSSVM multiclass model, Gaussian RBF kernel was employed, and the kernel parameters σ 2 and γ were determined by grid search [38]. In our grid search, we set σ 2 = 10 a and γ = 10 b . Variable a changes from −1 to 5 with step 0.25, and variable b changes from −1 to 4 with step 0.2. Therefore, we have the range of [ 0.1,100000 ] for σ 2 and the range of [ 0.1,10000 ] for γ . Totally, there are 24 levels for the value of σ 2 and 25 levels for γ . In other words, there are 600 pairs of σ 2 , γ tested when training a LSSVM classifier. To find an optimal value of σ 2 , γ , we used 5-fold cross-validation to evaluate the classification accuracy of LSSVM model.

4. Results and Discussion

Firstly, we calculated the frequencies of positive for all the original symptoms (57) at each clinical stage (see Supplementary Table S1 available online at http://dx.doi.org/10.1155/2015/846942). Eight irrelevant symptoms were judged as irrelevant features (threshold: 10%). From Table 2, we can clearly see that these symptoms appeared on few patients (less than 10% in each clinical stage) in the clinical observation and therefore they were considered as noisy features in the process of diagnosis. Because the total number of samples is large (407), we considered that the eight irrelevant symptoms identified with statistical analysis are very reliable. A part of symptoms shown in Table 2 was proved by previous studies. For example, Lai et al. concluded that no association is detected between “emotional depression” and the risk of hepatocellular carcinoma in older people in Taiwan [39, 40]. In addition, Peng et al. studied 169 Chinese patients with HCC; only three patients presented with hydrothorax, which also indicated that this symptom was not a key symptom in the process of liver cancer development [41, 42]. In addition, “edema in lower extremities” is undoubtedly a well-known symptom of HCC patients in clinic [43]; however, it was considered an irrelevant symptom in this study because it rarely appeared in all the three stages of our data. Increasing the observed samples or reducing the threshold will make it as a candidate symptom.

Table 2

Eight irrelevant symptoms were screened with threshold 10%. Each of them is rarely positive in each phase.

Symptoms	Phase I		Phase II		Phase III
Symptoms	Phase IA	Phase IB	Phase IIA	Phase IIB	Phase IIIA	Phase IIIB
Pale white lip [ V 1 ]	0	5.41%	6.67%	5.19%	4.5%	0
Edema in lower extremities [ V 16 ]	2.22%	8.1%	1.67%	5.19%	3.6%	0
Lack of urine output [ V 41 ]	0	2.7%	0	0	5.41%	0
Emotional depression [ V 43 ]	4.44%	0	5%	8.89%	6.31%	5.26%
Head body trapped heavy [ V 47 ]	0	2.7%	3.33%	2.22%	2.7%	0
Hydrothorax [ V 51 ]	6.67%	2.7%	1.67%	3.7%	2.7%	0
Rapid pulse [ V 55 ]	4.44%	2.7%	1.67%	0.74%	5.41%	5.26%
Uneven pulse [ V 56 ]	4.44%	5.41%	8.33%	3.7%	3.6%	0

Secondly, the calculation of NMF was implemented after removing all the detected irrelevant symptoms. According to the description in “Simulation Design”, NMF was applied on the representative matrix D R with 120 HCC samples, which uniformly covered three clinical phases. Figure 2(a) represents the fact that D R is a sparse matrix, in which large partition of elements is zero (no positive), such as symptom V 6 shown in Figure 2(b). However, there are also some symptoms that were positive on many patients, such as symptom V 25 shown in Figure 2(c). Matrix D R does not show obvious subtypes and patterns; hence, it is hard to compare the similarity directly between symptoms with the row vectors of D R since the number of samples is still very large. In this study, we used NMF to compress the representative matrix D R and to reveal the distribution patterns of features (symptoms) on fewer samples. Before the calculation of NMF, a critical parameter should be firstly determined: the value of factorization rank r . According to Brunet’s method, the first value of r for which the cophenetic coefficient starts decreasing is the optimal one [36]. Frigyesi and Höglund suggested choosing the first value where the RSS curve presents an inflection point [37]. Based on these two methods, we determined that “3” is a reasonable value of rank r for the clinical data matrix D R . The curves shown in Figure 3 also confirm this conclusion. Nonnegative matrix factorization was then implemented on the matrix D R ( 49 × 120 ) with rank 3. It also indicates that the number of metafeatures (basis) equals 3.

Figure 2

The heatmap of the representative clinical dataset D R . (a) The heatmap of D R with 49 symptoms and 120 samples. (b) The distribution patterns of symptoms V 6 , V 8 , V 28 , V 37 , and V 53 indicate that the frequencies of positive are low. (c) The distribution patterns of symptoms V 46 , V 42 , and V 25 indicate that the frequencies of positive are high.

(a) (b) (c)

Figure 3

Estimation of the optimal rank r .

(a) (b)

Figure 4 represents the final results of NMF which included the basis matrix W ( 49 × 3 ) and mixture coefficient H ( 3 × 120 ). Each row in matrix W uses a compressed pattern to approximatively represent the distribution of a symptom on all the original samples. Comparing with matrix D R shown in Figure 2, the obvious difference in matrix W is that there are several groups of features revealing similar patterns in the compressed sample space, such as V 40 and V 36 in Figure 4. According to Figure 2(a), we can find that the distance between the vectors of symptoms V 40 and V 36 in D R is also close; furthermore, the compressed patterns of V 40 and V 36 in matrix W ( w 40 and w 36 ) in Figure 4 facilitate easier identifying of redundant features which have very similar distribution patterns.

Figure 4

The result of NMF on the dataset D R . The left side indicates the visualization of matrix W ( 49 ∗ 3 ), and right side denotes matrix H ( 3 ∗ 120 ).

(a) (b)

The matrix H has the same number of samples but much smaller number of metafeatures (basis) rather than original matrix X [36]. Therefore, the metafeature expression patterns in H usually provide a robust clustering of samples. Given the j th column in H as H j = [ h j 1 , h j 2 , h j 3 ] T , we determined that j th clinical sample is placed into k th cluster if max ⁡ H j = H j ( k ) , where k ∈ { 1,2 , 3 } . Hence, we used matrix H to group all the samples into 3 clusters, which correspond to 3 bases (metafeature). Figure 5 shows that there are great overlaps between the clinical-staging markers (a priori knowledge of class labels) and indexes of basis components (metafeatures) on the 120 original clinical samples included in dataset D R .

Figure 5

The relationships between NMF-derived basis components and clinical stages of samples.

In matrix W , each column also corresponds to a metafeature or basis (see Figure 4). Entry w i j in matrix W is the coefficient of original feature i in metafeature (basis) j [36]. Therefore, an original feature i relates to certain basis j if w i j is the largest entry in row i of matrix W . From Figure 4, we can clearly see that the original symptom features participating in the same basis have similar expression patterns rather than that in other bases. Table 3 represents the symptoms which are related to all basis components. Combination of Figure 5 and Table 3 further indicates that the “basis 1” related symptoms are very related to the clinical samples of phase II, and “basis 2” and “basis 3” related symptoms are very related for phase I and phase III, respectively. This finding contributes to identifying clinical phase-specific important symptoms via NMF. Moreover, the partition of 49 clinical symptoms shown in Table 3 was well supported by some related studies. For example, nausea is observed as a common adverse effect in HCC patients in phase I [44]. The symptoms ascites, anorexia, fever, and jaundice often occurred in phase II [43, 45–48]. The symptoms “yellow complexion” and “yellow skin and eye” shown in Table 3 are obvious appearances of jaundice. For phase III, pain is the most obvious characteristic in HCC patients [49]. There are three pain-related symptoms presented in Table 3: “pain in shoulder and back,” “chest pain,” and “distending pain in hypochondrium.” Moreover, fatigue and weakness were also common in HCC patients [43]. Together, these findings suggest that NMF with an optimal rank can reveal the latent associations between the potential symptom features and clinical phases.

Table 3

The NMF-derived participation of the symptoms to each corresponding basis component.

Basis components	Number of symptoms	The names of symptoms
Basis 1	16	Varicose veins [ V 7 ]; yellow complexion [ V 11 ]; yellow skin and eye [ V 13 ]; stomach pain [ V 31 ]; dry stool [ V 38 ]; feeling thirsty [ V 27 ]; hot flash [ V 20 ]; doing belly full bilge [ V 33 ]; fullness in stomach [ V 32 ]; block under the rib [ V 49 ]; chills [ V 18 ]; fever [ V 19 ]; spider telangiectasia in liver palm [ V 15 ]; ascites [ V 50 ]; yellow greasiness [ V 9 ]; anorexia [ V 34 ]

Basis 2	17	Nausea [ V 35 ]; pulse slip [ V 54 ]; petechial and ecchymosis tongue [ V 6 ]; white slip [ V 8 ]; chest distress [ V 28 ]; semiliquid stool [ V 37 ]; weak pulse [ V 53 ];night sweat [ V 22 ]; dirty mouth [ V 17 ]; red tongue [ V 3 ]; thready pulse [ V 57 ];sticky greasy coating [ V 10 ]; purple tongue [ V 4 ]; stringy pulse [ V 52 ]; pale white lip [ V 2 ]; large and teeth-printed tongue [ V 5 ]; gloomy complexion [ V 14 ]

Basis 3	16	Tinnitus [ V 24 ]; dizziness [ V 23 ]; pain in shoulder and back [ V 48 ]; chest pain [ V 29 ]; distending pain in hypochondrium [ V 30 ]; bitter taste [ V 26 ]; insomnia [ V 42 ]; appearance with stained yellow [ V 12 ]; yellow urine [ V 40 ]; hiccup [ V 36 ]; soreness and weakness of waist and knees [ V 44 ]; dry throat [ V 25 ]; feverishness in palms and soles [ V 45 ]; spontaneous perspiration [ V 21 ]; night urination much [ V 39 ]; physically and mentally fatigued [ V 46 ]

Just as mentioned above in “Simulation Design,” several groups of redundant features were then screened out according to a given threshold θ = 0.95 (Table 4). We obtained two redundant symptom groups from each basis component, which indicates that the redundant symptoms included in the same group also might have similar patterns in the original sample space. Here, we take Figures 2(b)-2(c) as examples to collaborate the effectiveness of our method. Figure 2(b) represents the distribution of positive of five symptoms in the dataset D R . These five symptoms ( V 6 , V 8 , V 28 , V 37 , and V 53 ) were identified as basis 2 related features, and they are most possibly belonging to phase I (Table 4). Although each of the row vectors in Figure 2(b) is not completely equal, they all represent relative lower frequency of positive ( 15.17 ± 3.25 % ) and their local distribution patterns are similar in a way. Comparing the corresponding rows of these five symptoms in matrix W in Figure 4, we found that the compressed patterns of these symptoms are very similar. Similarly, the symptoms ( V 46 , V 42 , and V 25 ) are potentially related to basis 3, the frequency of positive for each is over 50%, and the mean value of positive for these three symptoms is 1.77, which further indicate that they might be related to some patients whose conditions are very serious. Although the symptoms V 46 , V 42 , and V 25 were not identified as redundant symptoms with given threshold (0.95), their compressed patterns in matrix W in Figure 4 also suggested that their patterns were very close. In summary, we considered a fact that the matrix W facilitates evaluating the difference among symptoms, and matrix H can validate the high degree of correlation between class labels of samples and basis indexes. After inferring the redundant symptoms with given threshold, we combined each symptoms’ group together and converted it into a new feature (named mixed feature). Finally, we obtained 39 clinical features ( F S 1 ) of HCC as the optimal feature subset, which consisted of two parts: 33 original symptom features ( F S 2 ) and 6 new mixed features ( F S 3 ) (Table 5). Based on the analysis of results of NMF, the feature space of original dataset was further reduced.

Table 4

The mean similarity values about the pairs of redundant symptoms within the same groups.

Basis components	The screened redundant symptoms	Distance-based similaritysim_dist ( w i , w j )	Correlation-based similaritysim_corr ( w i , w j )
Basis 1	V 38 , V 27 , V 20	0.9672	1.0
Basis 1	V 19 , V 15	0.9507	1.0

Basis 2	V 35 , V 54	0.9685	0.9960
Basis 2	V 6 , V 8 , V 53 , V 37 , V 28	0.9628	1.0

Basis 3	V 48 , V 29	0.9686	1.0
Basis 3	V 44 , V 45	0.9520	0.9926

Table 5

The NMF-driven potential clinical features of HCC (threshold: 0.95).

Basis components	Original features	Mixed features	Description about mixed features
Basis 1	V 7 ; V 11 ; V 13 ; V 31 ; V 33 ; V 32 ; V 49 ; V 18 ; V 50 ; V 9 ; V 34	M 11 M 12	Converted from V 38 , V 27 , V 20 and V 19 , V 15 , respectively.

Basis 2	V 22 ; V 17 ; V 3 ; V 57 ; V 2 ; V 10 ; V 4 ; V 52 ; V 5 ; V 14	M 21 M 22	Converted from V 35 , V 54 and V 6 , V 8 , V 53 , V 37 , V 28 , respectively.

Basis 3	V 24 ; V 23 ; V 30 ; V 26 ; V 42 ; V 12 ; V 40 ; V 36 ; V 25 ; V 21 ; V 39 ; V 46	M 31 M 32	Converted from V 48 , V 29 and V 44 , V 45 , respectively.

Number of features	33	6	Total: 39 features

For evaluating the potential of NMFBFS-inferred optimal feature subset, we firstly tested the classification accuracy of three candidate feature subsets F S 1 , F S 2 , and O F S on the training set (120 representative samples). F S 1 and F S 2 were generated by feature selection with the threshold θ (0.95). O F S denoted 49 original symptom features in the dataset D R . Table 6 indicates that the 39 optimal features, which covered 33 original symptom features and 6 new mixed features, result in the best classification accuracy on the training samples. The performance of F S 2 was much better than O F S ; however, it was still worse than F S 1 because the new mixed features also have important contributions to classification.

Table 6

Classification accuracy among three feature subsets on the training set (120 representative samples). FS₁ was obtained by the proposed approach with a given threshold ( θ = 0.95 ), in which 33 original symptom features and 6 new mixed features were included. FS₂ denotes the above-mentioned 33 original symptom features ( FS 2 ⊂ FS 1 ). OFS indicates all the 49 symptoms before NMF calculation.

Feature subsets	Dimension	Classification accuracy in LSSVM (%)
FS₁	39	80.002 ± 9.95
FS₂	33	77.50 ± 12.36
OFS	49	72.50 ± 11.64

We then compared the performance of our NMFBFS with three well-known feature selection methods (ReliefF [11], mRMR [12], and Elastic Net [13]). ReliefF was implemented using MATLAB function. “mRMRe” and “elasticnet” R packages were applied for mRMR and Elastic Net based feature selection, respectively. Supplementary Figure S1 represents the ReliefF-based feature ranking. Supplementary Figure S2 represents the Elastic Net ( λ = 0.5 ) solution paths for feature selection. We selected Top 20 features and Top 40 features as two candidate feature subsets for each method to evaluate their classification performances: F S R F 20 and F S R F 40 generated from ReliefF; F S M R 20 and F S M R 40 inferred from mRMR; F S E N 20 and F S E N 40 inferred from Elastic Net. Table 7 represents the classification performance of the above six candidate feature subsets and the NMFBFS-derived optimal feature subset F S 1 on the training set (120 representative samples). The results indicate that NMFBFS-inferred feature subset has the best classification accuracy in training samples.

Table 7

Classification accuracy of inferred optimal feature subset via NMFBFS, ReliefF, mRMR, and Elastic Net on the training set.

Methods	Feature subset	Dimension	Classification accuracy in LSSVM (%)
NMFBFS	F S 1	39	80.002 ± 9.95

ReliefF	FS_RF20	20	65.00 ± 10.03
ReliefF	FS_RF40	40	73.33 ± 15.76

mRMR	FS_MR20	20	70.83 ± 12.5
mRMR	FS_MR40	40	74.17 ± 9.03

Elastic Net	FS_EN20	20	70.00 ± 11.56
Elastic Net	FS_EN40	40	76.67 ± 10.46

Except 120 representative training samples which were screened out to implement the NMF analysis, the remaining samples can be used to test the classification accuracy of optimal feature subset. We randomly selected 40 samples (10 : 20 : 10 for each clinical stage) from the rest of the samples and then evaluated the classification accuracy of inferred feature subset by each method (NMFBFS, ReliefF, mRMR, and Elastic Net). Table 8 shows the differences among all these methods, and it can be found that the optimal feature subset inferred by our proposed method has the best generalization performance.

Table 8

Classification accuracy of inferred optimal feature subset via NMFBFS, ReliefF, mRMR, and Elastic Net on the testing set.

Methods	Feature subset	Dimension	Classification accuracy in LSSVM (%)
NMFBFS	F S 1	39	79.65 ± 6.48

ReliefF	FS_RF20	20	50.71 ± 1.22
ReliefF	FS_RF40	40	76.43 ± 8.27

mRMR	FS_MR20	20	63.79 ± 1.22
mRMR	FS_MR40	40	77.14 ± 9.18

Elastic Net	FS_EN20	20	67.57 ± 4.09
Elastic Net	FS_EN40	40	78.38 ± 9.62

Finally, the more important thing is that the selection of threshold θ determines how many groups of redundant symptoms will be screened out. Here, we further discussed the effects of threshold θ to the optimal feature subsets on the classification performance. Table 9 shows the differences among three optimal feature subsets inferred by the proposed approach with different values for threshold θ . From Table 9, we can obviously see that the bigger value of θ will screen redundant symptoms strictly, which leads to less similar symptoms that would be obtained. With a smaller value of θ , much more symptoms can be categorized into the same groups; hence, the original feature space will be sharply reduced by our approach. Table 9 denotes that, with the decrease of θ , the size of optimal feature subset was narrowed down but the classification accuracy was also decreased. These results suggest that a bigger value of θ will result in less redundant symptoms and therefore induce a larger size of optimal feature subset; oppositely, smaller θ can provide more redundant symptoms and sharply reduce the feature dimension. An extreme case is that θ equals “0,” which means that we can get one mixed feature for each basis and the size of optimal feature subset is equal to the number of bases. In a word, how to determine the value of θ depends on the size of optimal feature subset and its corresponding classification performance.

Table 9

The performance of classification for the inferred optimal feature subsets with different threshold θ .

The values of θ	Original symptom features	New mixed features	Total number of features	Classification accuracy (%)
θ = 0.95	33	6	39	80.002 ± 9.95
θ = 0.90	21	9	30	70.83 ± 6.59
θ = 0.85	10	8	18	70.00 ± 4.56

5. Conclusions

In this study, we developed the NMFBFS approach to efficiently extract the important clinical symptoms of HCC from clinical observation data. NMFBFS is a two-stage filter method for feature selection as follows. (1) In the first stage, preliminary screening is implemented to detect and remove the irrelevant features; (2) in the second stage, NMF was applied to identify the redundant features by groups which might represent similar distribution patterns. Each redundant symptom group was then transformed into a new mixed feature so that the dimension of dataset was further reduced.

The application of NMFBFS on a clinical dataset of HCC proved the effectiveness of this approach. The optimal clinical features derived from NMFBFS approach contained many well-recognized symptoms of HCC patients. Moreover, this study also provides a general computational framework of a novel feature selection approach to efficiently extract the optimal feature subset from a high-dimensional dataset.

Abbreviations

HCC:

Hepatocellular carcinoma

TCM:

Traditional Chinese Medicine

NMF:

Nonnegative matrix factorization

LSSVM:

Least squares support vector machines

KNN:

K -nearest neighbor.

Conflict of Interests

The authors declare that they have no competing interests.

Authors’ Contribution

Zhiwei Ji and Guanmin Meng contributed equally to this work.

Acknowledgments

This work was supported by the National Science Foundation of China (nos. 61472282 and 61133010). The data in this work was collected by the Changhai Hospital, Shanghai, China.

Bosch

F. X.

Ribes

Cléries

Díaz

Epidemiology of hepatocellular carcinoma

Clinics in Liver Disease 2005 9 2 191 211

10.1016/j.cld.2004.12.009

2-s2.0-17044371509

Center

M. M.

Jemal

Smith

R. A.

Ward

Worldwide variations in colorectal cancer

CA—Cancer Journal for Clinicians 2009 59 6 366 378

10.3322/caac.20038

2-s2.0-73049112438

El-Serag

H. B.

Hepatocellular carcinoma

The New England Journal of Medicine 2011 365 12 1118 1127

10.1056/nejmra1001683

2-s2.0-80053088189

A new prognostic system for hepatocellular carcinoma: a retrospective study of 435 patients: the Cancer of the Liver Italian Program (CLIP) investigators

Hepatology 1998 28 3 751 755

2-s2.0-0031782818

Miller

Schwartz

L. H.

D'Angelica

The use of Imaging in the diagnosis and staging of hepatobiliary malignancies

Surgical Oncology Clinics of North America 2007 16 2 343 368

10.1016/j.soc.2007.04.001

2-s2.0-34249874594

Forner

Bruix

Diagnosis of hepatic nodules 20 mm or smaller in cirrhosis: prospective validation of the noninvasive diagnostic criteria for hepatocellular carcinoma—reply

Hepatology 2008 47 6 2146 2147

Liao

Y.-H.

Lin

C.-C.

T.-C.

Lin

J.-G.

Utilization pattern of traditional Chinese medicine for liver cancer patients in Taiwan

BMC Complementary and Alternative Medicine 2012 12, article 146

10.1186/1472-6882-12-146

2-s2.0-84865640882

Mourad

Sinoquet

Leray

Probabilistic graphical models for genetic association studies

Briefings in Bioinformatics 2012 13 1 20 33

bbr015

10.1093/bib/bbr015

2-s2.0-84855682582

Wang

Identifying potential clinical syndromes of hepatocellular carcinoma using PSO-based hierarchical feature selection algorithm

BioMed Research International 2014 2014 12

127572

10.1155/2014/127572

2-s2.0-84897565065

J.-X.

Zhai

C.-M.

Y.-Q.

Face aging simulation and recognition based on NMF algorithm with sparseness constraints

Neurocomputing 2013 116 250 259

10.1016/j.neucom.2012.08.030

2-s2.0-84878479062

Liang

J. N.

Yang

Winstanley

Invariant optimal feature selection: a distance discriminant and feature ranking based solution

Pattern Recognition 2008 41 5 1429 1439

10.1016/j.patcog.2007.10.018

2-s2.0-38349127958

Peng

H. C.

Long

F. H.

Ding

Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy

IEEE Transactions on Pattern Analysis and Machine Intelligence 2005 27 8 1226 1238

10.1109/tpami.2005.159

2-s2.0-24344458137

Zou

Hastie

Regularization and variable selection via the elastic net

Journal of the Royal Statistical Society Series B: Statistical Methodology 2005 67 2 301 320

10.1111/j.1467-9868.2005.00503.x

MR2137327

2-s2.0-16244401458

Wildi

Pestalozzi

B. C.

McCormack

Clavien

P.-A.

Critical evaluation of the different staging systems for hepatocellular carcinoma

The British Journal of Surgery 2004 91 4 400 408

10.1002/bjs.4554

2-s2.0-1842845112

Sharma

Imoto

Miyano

A filter based feature selection algorithm using null space of covariance matrix for DNA microarray gene expression data

Current Bioinformatics 2012 7 3 289 294

10.2174/157489312802460802

2-s2.0-84866672652

Bellal

Elghazel

Aussem

A semi-supervised feature ranking method with ensemble learning

Pattern Recognition Letters 2012 33 10 1426 1433

10.1016/j.patrec.2012.03.001

2-s2.0-84860387759

Chang

H.-W.

Chiu

Y.-H.

Kao

H.-Y.

Yang

C.-H.

W.-H.

Comparison of classification algorithms with wrapper-based feature selection for predicting osteoporosis outcome based on genetic factors in a Taiwanese women population

International Journal of Endocrinology 2013 2013 8

850735

10.1155/2013/850735

2-s2.0-84873361818

Imani

M. B.

Keyvanpour

M. R.

Azmi

A novel embedded feature selection method: a comparative study in the application of text categorization

Applied Artificial Intelligence 2013 27 5 408 427

10.1080/08839514.2013.774211

2-s2.0-84878720298

Zdunek

Cichocki

Nonnegative matrix factorization with constrained second-order optimization

Signal Processing 2007 87 8 1904 1916

10.1016/j.sigpro.2007.01.024

ZBL1186.94391

2-s2.0-34247173538

Chang

Wang

Ashby

Zhou

Zhang

Huang

eMBI: boosting gene expression-based clustering for cancer subtypes

Cancer Informatics 2014 13 supplement 2 105 112

10.4137/cin.s13777

Zheng

C.-H.

Huang

D.-S.

Zhang

Kong

X.-Z.

Tumor clustering using nonnegative matrix factorization with gene selection

IEEE Transactions on Information Technology in Biomedicine 2009 13 4 599 607

10.1109/titb.2009.2018115

2-s2.0-67749108622

Zheng

C.-H.

T.-Y.

Zhang

Shiu

C.-K.

Wang

H.-Q.

Tumor classification based on non-negative matrix factorization using gene expression data

IEEE Transactions on Nanobioscience 2011 10 2 86 93

10.1109/TNB.2011.2144998

2-s2.0-80051776578

Cichocki

Lee

Kim

Y.-D.

Choi

Non-negative matrix factorization with α-divergence

Pattern Recognition Letters 2008 29 9 1433 1440

10.1016/j.patrec.2008.02.016

2-s2.0-43249131130

Zdunek

Cichocki

Nonnegative matrix factorization with quadratic programming

Neurocomputing 2008 71 10–12 2309 2320

10.1016/j.neucom.2007.01.013

2-s2.0-44649157722

Lee

D. D.

Seung

H. S.

Algorithms for non-negative matrix factorization

Proceedings of the Advances in Neural Information Processing Systems (NIPS '01)

2001

Theys

Lantéri

Richard

SGM to solve NMF—application to hyperspectral data

New Concepts in Imaging: Optical and Statistical Models 2013 59 357 379 EAS Publications Series

10.1051/eas/1359016

Casalino

del Buono

Mencar

Subtractive clustering for seeding non-negative matrix factorizations

Information Sciences 2014 257 369 387

10.1016/j.ins.2013.05.038

MR3131801

2-s2.0-84888641788

Vignolo

L. D.

Milone

D. H.

Scharcanski

Feature selection for face recognition based on multi-objective evolutionary wrappers

Expert Systems with Applications 2013 40 13 5077 5084

10.1016/j.eswa.2013.03.032

2-s2.0-84878287376

Anand

Pugalenthi

Fogel

G. B.

Suganthan

P. N.

An approach for classification of highly imbalanced data using weighting and undersampling

Amino Acids 2010 39 5 1385 1391

10.1007/s00726-010-0595-2

2-s2.0-78449268828

Bria

Karssemeijer

Tortorella

Learning from unbalanced data: a cascade-based approach for detecting clustered microcalcifications

Medical Image Analysis 2014 18 2 241 252

10.1016/j.media.2013.10.014

2-s2.0-84888787427

Cao

Zhao

D. Z.

Zaiane

Hybrid probabilistic sampling with random subspace for imbalanced data learning

Intelligent Data Analysis 2014 18 6 1089 1108

10.3233/ida-140686

2-s2.0-84911098802

Shubair

Ramadass

Altyeb

A. A.

KENFIS: kNN-based evolving neuro-fuzzy inference system for computer worms detection

Journal of Intelligent and Fuzzy Systems 2014 26 4 1893 1908

10.3233/ifs-130868

2-s2.0-84897723050

Wang

H.-Q.

Sun

F.-C.

Cai

Y.-N.

Ding

L.-G.

Chen

An unbiased LSSVM model for classification and regression

Soft Computing 2010 14 2 171 180

10.1007/s00500-009-0435-z

ZBL1191.68604

2-s2.0-70349275231

Mustaffa

Yusof

LSSVM parameters tuning with enhanced artificial bee colony

International Arab Journal of Information Technology 2014 11 3 236 242

2-s2.0-84900003454

Ngom

The non-negative matrix factorization toolbox for biological data mining

Source Code for Biology and Medicine 2013 8 1, article 10

10.1186/1751-0473-8-10

2-s2.0-84876097941

Brunet

J.-P.

Tamayo

Golub

T. R.

Mesirov

J. P.

Metagenes and molecular pattern discovery using matrix factorization

Proceedings of the National Academy of Sciences of the United States of America 2004 101 12 4164 4169

10.1073/pnas.0308531101

2-s2.0-1642529511

Frigyesi

Höglund

Non-negative matrix factorization for the analysis of complex gene expression data: identification of clinically relevant tumor subtypes

Cancer Informatics 2008 6 275 292

2-s2.0-49649102048

L. F.

Wang

Jiao

L. C.

Multiple parameter selection for LS-SVM using smooth leave-one-out error

Advances in Neural Networks—ISNN 2005 2005 3496

Berlin, Germany

Springer

851 856 Lecture Notes in Computer Science

10.1007/11427391_136

Lai

S.-W.

Chen

H.-J.

Lin

C.-L.

Liao

K.-F.

No correlation between Alzheimer's disease and risk of hepatocellular carcinoma in older people: an observation in Taiwan

Geriatrics & Gerontology International 2014 14 1 231 232

10.1111/ggi.12141

2-s2.0-84892165822

S.-M.

Lee

Y.-J.

Y.-W.

Liu

C.-J.

Chen

T.-J.

Fuh

J.-L.

Wang

S.-J.

Does Alzheimer's disease protect against cancers? A nationwide population-based study

Neuroepidemiology 2012 40 1 42 49

10.1159/000341411

2-s2.0-84867458469

Peng

S.-Y.

Feng

X.-D.

Liu

Y.-B.

Qian

H.-R.

J.-T.

Wang

J.-W.

Fang

H.-Q.

Cao

L.-P.

Shen

H.-W.

J.-J.

Cai

X.-J.

Y.-P.

Surgical treatment of hepatocellular carcinoma originating from caudate lobe

Zhonghua Wai Ke Za Zhi 2005 43 1 49 52

2-s2.0-25144518775

Peng

S. Y.

J. T.

Liu

Y. B.

Cai

X. J.

Mou

Y. P.

Feng

X. D.

Wang

J. W.

Qian

H. R.

Hong

D. F.

Wang

X. B.

Fang

H. Q.

Cao

L. P.

Chen

Peng

C. H.

Liu

F. B.

Xue

J. F.

Surgical treatment of hepatocellular carcinoma originating from caudate lobe—a report of 39 cases

Journal of Gastrointestinal Surgery 2006 10 3 371 378

10.1016/j.gassur.2005.09.026

2-s2.0-33644536758

Lin

M.-H.

P.-Y.

Tsai

S.-T.

Lin

C.-L.

Chen

T.-W.

Hwang

S.-J.

Hospice palliative care for patients with hepatocellular carcinoma in Taiwan

Palliative Medicine 2004 18 2 93 99

10.1191/0269216304pm851oa

2-s2.0-1642277253

Fujiyama

Shibata

Maeda

Tanaka

Noumaru

Sato

Tomita

Phase I clinical study of a novel lipophilic platinum complex (SM-11355) in patients with hepatocellular carcinoma refractory to cisplatin/lipiodol

British Journal of Cancer 2003 89 9 1614 1619

10.1038/sj.bjc.6601318

2-s2.0-0344009740

Zhao

Liu

Cao

Ren

Zhang

Ren

A randomized phase II study of autologous cytokine-induced killer cells in treatment of hepatocelluar carcinoma

Journal of Clinical Immunology 2014 34 2 194 203

10.1007/s10875-013-9976-0

2-s2.0-84898920952

Ciombor

K. K.

Feng

Benson

A. B.

III Su

Horton

Short

S. P.

Kauh

J. S. W.

Staley

Mulcahy

Powell

Amiri

K. I.

Richmond

Berlin

Phase II trial of bortezomib plus doxorubicin in hepatocellular carcinoma (E6202): a trial of the Eastern Cooperative Oncology Group

Investigational New Drugs 2014 32 5 1017 1027

10.1007/s10637-014-0111-8

2-s2.0-84901578278

Henderson

Feun

Van Veldhuizen

Gold

Zheng

Ryan

Blaszkowsky

L. S.

Chen

Costa

Rosenzweig

Nierodzik

Hochster

Muggia

Abbadessa

Lewis

Zhu

A. X.

Phase II study of darinaparsin in patients with advanced hepatocellular carcinoma

Investigational New Drugs 2010 28 5 670 676

10.1007/s10637-009-9286-9

2-s2.0-77956060553

Lin

J.-J.

Jin

C.-N.

Zheng

M.-L.

Ouyang

X.-N.

Zeng

J.-X.

Dai

X.-H.

Clinical study on treatment of primary hepatocellular carcinoma by Shenqi mixture combined with microwave coagulation

Chinese Journal of Integrative Medicine 2005 11 2 104 110

10.1007/BF02836465

2-s2.0-22144465154

Doffoël

Bonnetain

Bouché

Vetter

Abergel

Fratté

Grangé

J. D.

Stremsdoerfer

Blanchi

Bronowicki

J. P.

Caroli-Bosc

F. X.

Causse

Masskouri

Rougier

Bedenne

Multicentre randomised phase III trial comparing Tamoxifen alone or with Transarterial Lipiodol Chemoembolisation for unresectable hepatocellular carcinoma in cirrhotic patients (Federation Francophone de Cancerologie Digestive 9402)

European Journal of Cancer 2008 44 4 528 538

10.1016/j.ejca.2008.01.004

2-s2.0-40249116266