1. Introduction

CMMM

Computational and Mathematical Methods in Medicine

1748-6718 1748-670X

Hindawi Publishing Corporation

750151

10.1155/2012/750151

750151

Research Article

Machine Learning Approach to Extract Diagnostic and Prognostic Thresholds: Application in Prognosis of Cardiovascular Mortality

Mena

Luis J.

¹ Orozco

Eber E.

¹ Felix

Vanessa G.

¹ Ostos

Rodolfo

¹ Melgarejo

Jesus

² Maestre

Gladys E.

^{2, 3} Barreto

Guilherme de Alencar

Department of Computer Engineering

Polytechnic University of Sinaloa

82199 Mazatlan, SIN

Mexico

upsin.edu.mx

Institute for Biological Research and Cardiovascular Institute

Faculty of Medicine

University of Zulia

Maracaibo 4002

Venezuela

luz.edu.ve

Departments of Psychiatry and Neurology

and the Gertrude H. Sergievsky Center

Columbia University

New York

NY 10032

USA

columbia.edu

2012

9 8 2012

2012 30 03 2012 25 06 2012 03 07 2012

2012

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Machine learning has become a powerful tool for analysing medical domains, assessing the importance of clinical parameters, and extracting medical knowledge for outcomes research. In this paper, we present a machine learning method for extracting diagnostic and prognostic thresholds, based on a symbolic classification algorithm called REMED. We evaluated the performance of our method by determining new prognostic thresholds for well-known and potential cardiovascular risk factors that are used to support medical decisions in the prognosis of fatal cardiovascular diseases. Our approach predicted 36% of cardiovascular deaths with 80% specificity and 75% general accuracy. The new method provides an innovative approach that might be useful to support decisions about medical diagnoses and prognoses.

1. Introduction

Machine learning (ML) disciplines provide computational methods and learning mechanisms that can help generate new knowledge from large databases. Applications of ML are useful for constructing approaches to solving problems of classification, prediction, recognition patterns, and knowledge extraction, where the data take the form of a set of examples, and the output takes the form of prediction of new examples [1, 2]. In this sense, ML can provide techniques and tools that help solve diagnostic and prognostic problems in medical domains, where the input is a dataset with characteristics of the subjects, and the output is a diagnosis or prognosis of a specific disease [3]. Although diagnosis and prognosis are relatively straightforward ML problems, clinical decision-making using ML applications is not yet widely used by the medical community [4], because such a complex task requires not only accuracy, but also the confidence of physician specialists about the functional use of ML approaches in the medical field.

To successfully implement an ML application in problems related to clinical decisions, it is necessary to consider some specific requirements [4, 5]. For example, the prediction of disease progression is generally associated with the evolution of certain risk factors; in the case of some chronic diseases (e.g., cancer, cardiovascular diseases, and diabetes), the risk factors include nonchangeable characteristics, such as age or gender. The use of such nonchangeable qualities to predict the onset of a disease might not be as useful for avoiding evolution of the disease, because currently there is no medical treatment for modifying these biological characteristics. Thus, ML applications usually focus on changeable qualities, which make the prognostic task more difficult and complex.

Another important aspect to consider is the need to obtain interpretable approximations, in order to provide medical staff with useful information about the given problem. This is typically achieved using symbolic learning methods (e.g., decision trees and rules systems), which allow decisions to be explained in an easily comprehensible manner. However, the use of a symbolic learning algorithm to obtain a more comprehensible model frequently sacrifices accuracy in the prediction.

Another problem that often hinders high overall performance in the analysis of medical datasets is that generally these exhibit an unbalanced class distribution [6], which include a majority or negative class of healthy people (normal data) and a minority or positive class of sick people (the important class) with higher cost of erroneous classification. The latter usually has a higher rate of misclassification, because the performance of standard ML algorithms tends to be overwhelmed by the majority class, ignoring the minority class examples and obtaining results with acceptable accuracy and specificity (healthy subjects diagnosed correctly), but low sensitivity (sick subjects diagnosed correctly).

In addition to developing ML approaches that result in good overall performance and provide medical staff with interpretable prognostic information, providing the ability to support decisions and to reduce the number of medical tests for a reliable prognosis are also desirable. A measure of reliability of the diagnosis or prognosis is also important, because this would give medical staff sufficient confidence to put the new approach into practice. On the other hand, it is also desirable to have an approach that can provide reliable predictions based on a small amount of information about the patient, because collection of that information is often expensive, possibly subject to privacy issues, time consuming, and possibly harmful to the patient [4].

The present study focused on the implementation of a ML method to support medical decisions in the prognosis of fatal cardiovascular diseases, which are ranked among the top ten in the global disease burden [7]. The goal was to solve previously identified problems, through interdisciplinary work that included the collection and preprocessing of data from an ambulatory blood pressure (ABP) monitoring study [8], the implementation of a current ML algorithm with specific application to medical diagnosis and prognosis [9], and the identification of new prognostic thresholds for risk factors of cardiovascular mortality.

2. Methods 2.1. Ambulatory Blood Pressure Monitoring

Currently available ABP monitors are fully automatic and portable devices (Figure 1) that can record BP for 24 hours or longer, while patients go about their normal daily activities [10]. This BP measurement technique provides a better estimate of risk in an individual patient than the traditional method, because it removes variability among individual observers, avoids the “white coat” effect (the transient but variable elevation of BP in a medical environment) [11] and the “masked hypertension” (normotensive by clinic measurement and hypertensive by ambulatory measurement) [12] and includes the inherent variability of BP [13]. Detailed descriptions of the ABP measurement methods are provided in previous reports of the Maracaibo Aging Study (MAS) [8, 14, 15].

Figure 1

Ambulatory blood pressure monitoring procedure.

2.2. Subjects

The MAS is an ongoing population-based, longitudinal study that includes 2500 subjects older than 55 years, residing in the Santa Lucia County, Maracaibo, Venezuela. All participants underwent extensive clinical and laboratory examinations and randomly selected individuals also underwent ABP monitoring. Informed consent was obtained from the subjects who agreed to participate, and from a close family member when doubts existed about the competence of the subject. The ethical review board of the Institute of Cardiovascular Diseases of the University of Zulia approved the protocol.

2.3. Cardiovascular Risk Factors

The leading global risk factor for mortality is high BP, which is responsible for 13% of deaths globally. Eight changeable risk factors (alcohol use, tobacco use, high BP, high body mass index, high cholesterol, high blood glucose, low fruit and vegetable intake, and physical inactivity) account for 61% of cardiovascular deaths. Combined, these same risk factors account for over three quarters of ischaemic heart disease, the leading cause of death worldwide [16].

However, investigators continue to look for new and emerging risk factors for cardiovascular disease. Recent ABP monitoring studies using a novel variability index [14] reported significant relationships between high BP variability (BPV) and cardiovascular outcomes [17–19]. BPV is a multifaceted phenomenon, influenced by the interaction between external emotional stimuli, such as stress and anxiety, and internal cardiovascular mechanisms that can vary from heartbeat to heartbeat. However, the complexity of BPV makes analysis difficult, and its independent contribution as a predictor of cardiovascular outcomes is not yet clear [20]. The present study aimed to identify new prognostic thresholds of risk factors for cardiovascular mortality, including high BP (the most significant cardiovascular predictor) and abnormal BPV (a potential independent predictor).

To estimate 24-hour BP level, we computed the weighed mean of valid BP readings (WBP) using the time interval between successive valid measurements as weighting factors [18]. In the case of BPV over 24 hours, we calculated the Average Real Variability (ARV) index [14] using (1): (1)ARV=1∑wk∑k=1n-1wk×|BPk-BPk-1|, where n is the number of valid BP readings, k ranges from 1 to n−1, and wk is the time interval between BPk and BPk-1.

2.4. Machine Learning Approach

We implemented an interdisciplinary ML method that encompassed all stages of knowledge extraction from databases (data preprocessing, attribute selection, data mining, and knowledge extraction), to examine the application of ML to support clinical decisions (Figure 2).

Figure 2

Machine learning method proposed.

To improve the accuracy of predictions for affected subjects (positive class), we used the Rule Extraction for MEdical Diagnosis (REMED) algorithm [9], a symbolic one-class classification approach that implements internal bias strategies during the learning process [21]. REMED employs three main procedures in the knowledge extraction process: (1) selection of attributes, (2) selection of initial partitions, and (3) construction of classification rules.

First, REMED attempts to select the best combination of relevant attributes, using a simple logistic regression model. This is a standard method of analysis in medical research that uses the odds ratio metric [22] to determine if there is a significant association (P<0.01) between a considered attribute and the positive class. REMED then begins to build initial partitions (exclusionary and exhaustive conditions) to maximize sensitivity and maintain acceptable accuracy without significantly decreasing specificity. Finally, REMED uses the respective partitions for each selected attribute to construct a system of rules that includes m conditions (one for each selected attribute) in the following way:

If Condition 1 <relation> p1

and Condition 2 <relation> p2

and Condition j <relation> pj and ⋯⋯⋯

and Condition m <relation> pm

then class = 1

Else class = 0,

where <relation> is either ≥ or ≤ depending on whether j is positively or negatively associated with the positive class through pj (partition for attribute j).

To avoid overfitting during the training and testing phase, REMED implements the k-fold cross validation technique, which is based on randomly shuffling sample vectors among training and testing spaces [23]. REMED also maintains the approximate imbalance of the original dataset through the k iterations.

2.5. Data Preprocessing and Attributes Selection

Based on current medical guidelines [24], we only included participants that had ABP recordings of good technical quality. Therefore, subjects with <40 BP readings during the 24-hour ABP period were excluded. Systolic BP readings values >260 mmHg or <70 mmHg, and diastolic BP readings >150 mmHg or <40 mmHg were considered outliers or erroneous values and discarded. The treatment of missing values was addressed with predictive techniques, specifically multiple linear regression analysis considering medical criterions.

Only continuous and changeable attributes were considered in the knowledge extraction process. Continuous attributes have a higher degree of uncertainty than discrete attributes, because discrete attributes are usually binary in the clinical environment (e.g., smoker versus nonsmoker), and their associations with specific diseases are almost always well known. We also excluded age, which is a nonchangeable attribute. The attributes considered in the initial ML analysis were body mass index (BMI), serum cholesterol level, 24-hour heart rate, and systolic and diastolic 24-hour WBP and ARV.

3. Results 3.1. Dataset

The minable dataset was composed of 551 observations with 7 attributes, with only 43 missing values (1.1%) in the serum cholesterol attribute. The missing data were estimated from the regression slope on sex and age, according to the criteria of physician specialists. The sample included 374 women (67.8%) and 170 patients (30.9%) undergoing treatment with antihypertensive drugs (Table 1). The average number of BP readings was 65.1 (5th to 95th percentile = 51.5−77.5), indicating good quality ABP recordings. Mean age was 67.1±8 years. At enrolment, 61 participants (11.1%) had a history of cardiovascular disease; 100 (18.1%) had a history of diabetes mellitus, of whom 59 (59%) were undergoing diabetes treatment; 86 (15.6%) were current smokers; 174 (31.6%) reported intake of alcohol. The average total cholesterol level was 5.5±1.3 mmol L⁻¹, and BMI averaged 27.1±5.6 kg m⁻². Mean 24-hour systolic WBP was 133.8±16.6 mmHg, and diastolic WBP was 76.1±10 mmHg. Average heart rate was 73.7±9.8 bpm.

Table 1

Baseline characteristics.

	Frequency in percent or median
Demographic variables
Men, % (n)	32.1 (177)
Age, years	67.1 ± 8
Race, % (n)
Mixed	73.1 (404)
Caucasian	22.2 (122)
African-Venezuelan	4 (22)
Natives	0.5 (3)
Use of antihypertensive drugs, % (n)	30.9 (170)
Use of anti-diabetic drugs, % (n)	11.1 (61)
History of cardiovascular disease, % (n)	11.5 (63)
Diagnosis of diabetes mellitus, % (n)	18.1 (100)
Lifestyle, physical and lipid factors
Smoking current status, % (n)	15.6 (86)
Drinking current status, % (n)	31.6 (174)
Body max index, kg/m²	27.1 ± 5.6
Total serum cholesterol, mmol/L	5.5 ± 1.3
24-hour ambulatory measurements
Systolic blood pressure, mm Hg	133.8 ± 16.6
Diastolic blood pressure, mm Hg	76.1 ± 10
Heart rate, bpm	73.7 ± 9.8

The median follow-up period was 7.1±3.7 years (5th to 95th percentile = 1.7−12.3 years). Only the participants that died from cardiovascular diseases (n=61) were classified as positive examples. Cardiovascular mortality included 10 strokes and 51 cardiac deaths for a high event rate of 15.5 per 1000 person-years. The imbalance ratio between the positive (affected) and negative (unaffected) class was approximately of 1 : 9.

3.2. Machine Learning Process 3.2.1. Selection of Attributes

Using the simple logistic regression model, REMED found only two attributes significantly associated with the positive class: systolic WBP (P=0.008) and ARV (P=0.0001). However, other well-known cardiovascular risk factors, such as serum cholesterol level, BMI, and diastolic WBP [16, 25], were considered in further analyses.

3.2.2. Rule System

To provide medical staff with more information and comprehensible models, we used REMED to build several simple rule systems, which included individual and combined predictions of the more significant attributes (systolic WBP and ARV), as well as the combined predictions with the additional risk factors.

3.3. Performance

The confusion matrix from the predictions of the system rule, combining only high systolic ARV and WBP and using 10-fold cross-validation, indicated that REMED performed at 0.36 sensitivity, correctly diagnosing more than 35% of the cardiovascular deaths (Table 2). REMED focuses on improving sensitivity over specificity, because in the case of medical diagnosis/prognosis, the cost of misclassification of false negatives (FN, i.e., sick subjects diagnosed incorrectly) is higher than that of false positives (FP, healthy subjects diagnosed incorrectly), because more specific medical tests could discover the FP error, but an FN could cause a life-threatening condition and possibly lead to death [26]. Additionally, to compare the performance of our approach in terms of reliable prediction, we selected from the WEKA framework [2] the ML approach that better performed with our dataset: the Naïve Bayes classifier, which is one of the most effective and efficient classification algorithms and has been successfully applied to many medical problems [27, 28]. The performance of all classifiers is showed in Table 3.

Table 2

Confusion matrix of REMED predictions.

		Predictive class
		Positive	Negative
Actual class	Positive	22	39
Actual class	Negative	98	392

Table 3

Performance of classifiers.

Classifiers	Sensitivity	Specificity	Accuracy
I f systolic ARV ≥ 9.6 then 1 Else 0	55.7%	60.4%	59.9%
If systolic WBP ≥ 134.6 then 1 Else 0	52.5%	58.8%	58.08%
If systolic ARV ≥ 9.6 and systolic WBP ≥ 137then 1 Else 0	36.1%	80.0%	75.1%
If systolic ARV ≥ 9.6 and systolic WBP ≥ 138.6 and cholesterol ≥ 5.5 then 1 Else 0	8.2%	93.3%	83.8%
If systolic ARV ≥ 10.4 and systolic WBP ≥ 139.8 and BMI ≥ 27.3 then 1 Else 0	9.8%	93.3%	84.0%
If systolic ARV ≥ 9.6 and systolic WBP ≥ 137 and diastolic WBP ≥ 78.4then 1 Else 0	22.9%	87.5%	80.4%
Naïve Bayes	11.48%	95.92%	86.57%

4. Discussion

Use of the REMED algorithm selecting only the more significant attributes provided some of the desired features for solving medical diagnosis/prognosis problems: (1) good overall performance for imbalanced datasets, with 36.1% of sensitivity, 80% specificity, and 75.1% general accuracy; (2) comprehensible prognostic information, based on a rule system with a high degree of abstraction (only one rule to predict positive class examples, independent of the number of instances and initial attributes); (3) the ability to provide the medical staff with sufficient confidence to use the rule system in practice, because it was based on attributes with high confidence levels (>99%), estimated with a standard method of medical analysis; (4) the ability to reduce the number of medical tests necessary to obtain a reliable diagnosis/prognosis, because a simple logistic regression model was used to select attributes strongly associated with the specific disease.

The ML approach generated a new prognostic threshold for cardiovascular mortality: systolic WBP ≥ 137 mmHg, which is lower than the currently proposed by hypertension guidelines (≥140 mmHg) and in agreement with recent ABP studies [29, 30], but with the advantage that our analysis was fully automated and had a smaller sample. Moreover, our ML approach generated a new prognostic threshold for abnormal systolic ARV (≥9.6 mmHg). Together, these new thresholds could provide improved predictions of cardiovascular mortality.

Both systolic WBP and ARV were independent predictors of cardiovascular mortality, performed >50% of sensitivity, but sacrificed significantly in specificity and general accuracy (≤60%). The addition of other well-known cardiovascular risk factors decreased considerably the accuracy in the prediction of affected subjects (<23%). Therefore, the use of logistic regression for the selection of significant attributes (>99%) could be an effective strategy in this stage of ML analysis in medical datasets.

Undoubtedly, one of the most important goals of the application of ML in the medical field is to generate new knowledge, providing the medical community with tools to develop novel points of view about any given problem. In our case, for example, although previous medical studies determined possible ranges of a low and high BPV measured whit ARV through statistical methods (median and quartiles analysis) [17, 18], our work is pioneer proposing a prognostic threshold for abnormal systolic ARV (≥9.6 mmHg). This threshold has a good performance as an independent a composed predictor of fatal cardiovascular events. The use of this threshold should facilitate new fields of investigation regarding BPV and its prognostic relevance.

We do not claim that our ML analysis using REMED is the ultimate solution for medical diagnosis/prognosis problems from unbalanced datasets, because it is necessary to implement modifications that improve REMED’s predictive capacity in terms of sensitivity (≥50%) without significantly deteriorating its specificity. However, we obtained better results than the Naïve Bayes classifier (11.48%), which is considered as a benchmark algorithm that in any medical domain has to be tried before any other advanced method [27]. Therefore, we believe that our approach could improve performance in these medical tasks, and increase the confidence of the medical community in the use of ML approaches to support clinical decisions.

Acknowledgments

The authors are grateful to the referees for their detailed review on the paper and thoughtful comments. This paper was supported by the Secretaria de Educación Pública, México DF, México (PROMEP/103-5/11/4145). The Maracaibo Aging Study was funded by the Venezuelan Grant FONACIT G-97000726, FundaConCiencia, and by Award no. R01AG036469 from the National Institute on Aging.

Alpaydin

Introduction to Machine Learning 2010 2nd

Cambridge, Mass, USA

The MIT Press

Witten

I. H.

Frank

Hall

M. A.

Data Mining: Practical Machine Learning Tools and Techniques 2011 3rd

Burlington, Mass, USA

Morgan Kaufmann

Karpagavalli

Jamuna

K. S.

Vijaya

M. S.

Machine learning approach for preoperative anaesthetic risk prediction

International Journal of Recent Trends in Engineering 2009 1 2 19 22

Kononenko

Machine learning for medical diagnosis: history, state of the art and perspective

Artificial Intelligence in Medicine 2001 23 1 89 109

2-s2.0-0034922742

10.1016/S0933-3657(01)00077-X

Bosnić

Kononenko

Estimation of individual prediction reliability using the local sensitivity analysis

Applied Intelligence 2008 29 3 187 203

2-s2.0-54249164497

10.1007/s10489-007-0084-9

Chawla

N. V.

Japkowicz

Kolcz

Editorial: special issue on learning from imbalanced data sets

ACM SIGKDD Explorations 2004 6 1 1 6

Lopez

A. D.

Mathers

C. D.

Ezzati

Jamison

D. T.

Murray

C. J.

Global and regional burden of disease and risk factors, 2001: systematic analysis of population health data

The Lancet 2006 367 9524 1747 1757

2-s2.0-33646799069

10.1016/S0140-6736(06)68770-9

Maestre

G. E.

Pino-Ramírez

Molero

A. E.

Silva

E. R.

Zambrano

Falque

Gamero

M. P.

Sulbarán

T. A.

The maracaibo aging study: population and methodological issues

Neuroepidemiology 2002 21 4 194 201

2-s2.0-0036281836

10.1159/000059524

Mena

Gonzalez

J. A.

Symbolic one-class learning from imbalanced datasets: application in medical diagnosis

International Journal on Artificial Intelligence Tools 2009 18 2 273 309

2-s2.0-65849151044

10.1142/S0218213009000135

Pickering

T. G.

Shimbo

Haas

Ambulatory blood-pressure monitoring

New England Journal of Medicine 2006 354 22 2316 2374

2-s2.0-33744501815

10.1056/NEJMra060433

Pickering

T. G.

James

G. D.

Boddie

Harshfield

G. A.

Blank

Laragh

J. H.

How common is white coat hypertension?

Journal of the American Medical Association 1988 259 2 225 228

2-s2.0-0023839194

Frattola

Parati

Cuspidi

Albini

Mancia

Prognostic value of 24-hour blood pressure variability

Journal of Hypertension 1993 11 10 1133 1137

2-s2.0-0027441842

10.1097/00004872-199310000-00019

Pickering

T. G.

Davidson

Gerin

Schwartz

J. E.

Masked hypertension

Hypertension 2002 40 6 795 796

2-s2.0-0036898048

10.1161/01.HYP.0000038733.08436.98

Mena

Pintos

Queipo

N. V.

Aizpúrua

J. A.

Maestre

Sulbarán

A reliable index for the prognostic significance of blood pressure variability

Journal of Hypertension 2005 23 3 505 511

2-s2.0-18844396779

Mena

Melgarejo

J. D.

Chavez

Pineda

Calmon

Silva

E. R.

Maestre

G. E.

Relevance of blood pressure variability among the elderly: findings from the Maracaibo aging study

Journal of Hypertension 2011 29

e312

World Health organization

Global Health Risks-Mortality and Burden of Disease Attributable to Selected Major Risk 2009

Geneva, Switzerland

World Health Organization

Pierdomenico

S. D.

Di Nicola

Esposito

A. L.

Di Mascio

Ballone

Lapenna

Cuccurullo

Prognostic value of different indices of blood pressure variability in hypertensive patients

American Journal of Hypertension 2009 22 8 842 847

2-s2.0-68149124683

10.1038/ajh.2009.103

Hansen

T. W.

Thijs

Boggia

Kikuya

Björklund-Bodegård

Richart

Ohkubo

Jeppesen

Torp-Pedersen

Dolan

Kuznetsova

Stolarz-Skrzypek

Tikhonoff

Malyutina

Casiglia

Nikitin

Lind

Sandoya

Kawecka-Jaszcz

Imai

Wang

Ibsen

O'Brien

Staessen

J. A.

Prognostic value of reading-to-reading blood pressure variability over 24 hours in 8938 subjects from 11 populations

Hypertension 2010 55 4 1049 1057

2-s2.0-77950463099

10.1161/HYPERTENSIONAHA.109.140798

Veerabhadrappa

Diaz

K. M.

Feairheller

D. L.

Sturgeon

K. M.

Williamson

Crabbe

D. L.

Kashem

Ahrensfield

Brown

M. D.

Enhanced blood pressure variability in a high cardiovascular risk group of African Americans: FIT4Life Study

Journal of the American Society of Hypertension 2010 4 4 187 195

2-s2.0-77957719578

10.1016/j.jash.2010.04.005

Hansen

T. W.

Staessen

J. A.

Blood pressure variability remains an elusive predictor of cardiovascular outcome

American Journal of Hypertension 2009 22 1 3 4

2-s2.0-57749202544

10.1038/ajh.2008.322

Kotsiantis

Kanellopoulos

Pintelas

Handling imbalanced datasets: a review

GESTS International Transactions on Computer Science and Engineering 2006 30 1 25 36

Zheng

Srihari

Feature selection for text categorization on imbalanced data

ACM SIGKDD Explorations 2004 6 80 89

10.1145/1007730.1007741

Kohavi

A study of cross-validation and bootstrap for accuracy estimation and model selection

Proceedings of the 14th International Conference on Artificial Intelligence

1995

1137 1143

Mancia

De Backer

Dominiczak

Cifkova

Fagard

Germano

Grassi

Heagerty

A. M.

Kjeldsen

S. E.

Laurent

Narkiewicz

Ruilope

Rynkiewicz

Schmieder

R. E.

Boudier

H. A. J. S.

Zanchetti

2007 guidelines for the management of arterial hypertension: the task force for the management of arterial hypertension of the European Society of Hypertension (ESH) and of the European Society of Cardiology (ESC)

Journal of Hypertension 2007 25 6 1105 1187

2-s2.0-34250350040

10.1097/HJH.0b013e3281fc975a

Weiss

G. M.

Mining with rarity a unifying frame-work

ACM SIGKDD Explorations 2004 1 7 19

Cui

Blumenthal

R. S.

Flaws

J. A.

Whiteman

M. K.

Langenberg

Bachorik

P. S.

Bush

T. L.

Non-high-density lipoprotein cholesterol level as a predictor of cardiovascular disease mortality

Archives of Internal Medicine 2001 161 11 1413 1419

2-s2.0-0034912018

Al-Aidaroos

K. M.

Bakar

A. A.

Othman

Medical data classification with Naive Bayes approach

Information Technology Journal 2012 11 9 1166 1174

10.3923/itj.2012.1166.1174

Kukar

Grošelj

Reliable diagnostics for coronary artery disease

Proceedings of the 15th IEEE Symposium on Computer-Based Medical Systems

June 2002

7 12

2-s2.0-0036069257

Kikuya

Hansen

T. W.

Thijs

Björklund-Bodegård

Kuznetsova

Ohkubo

Richart

Torp-Pedersen

Lind

Ibsen

Imai

Staessen

J. A.

Diagnostic thresholds for ambulatory blood pressure monitoring based on 10-year cardiovascular risk

Circulation 2007 115 16 2145 2152

2-s2.0-34247547816

10.1161/CIRCULATIONAHA.106.662254

Hansen

T. W.

Kikuya

Thijs

Boggia

Björklund-Bodegârd

Torp-Pedersen

Jeppesen

Ibsen

Staessen

J. A.

Diagnostic thresholds for ambulatory blood pressure moving lower: a review based on a meta-analysis-clinical implications

Journal of Clinical Hypertension 2008 10 5 377 381

2-s2.0-50349085694

10.1111/j.1751-7176.2008.07681.x