Data Mining based Methodologies for Cardiac Risk Patterns Identification

V. G. Almeida, J. Borba, T. Pereira, H. C. Pereira, J. Cardoso, C. Correia

2013

Abstract

Cardiovascular diseases (CVDs) are the leading cause of death in the world. The pulse wave analysis provides a new insight in the analysis of these pathologies, while data mining techniques can contribute for an efficient diagnostic method. Amongst the various available techniques, artificial neural networks (ANNs) are well established in biomedical applications and have numerous successful classification applications. Also, clustering procedures have proven to be very useful in assessing different risk groups in terms of cardiovascular function in healthy populations. In this paper, a robust data mining approach was performed for cardiac risk patterns identification. Eight classifiers were tested: C4.5, Random Forest, RIPPER, Naïve Bayes, Bayesian Network, Multy-layer perceptron (MLP) (1 and 2-hidden layers) and radial basis function (RBF). As for clustering procedures, k-means clustering (using Euclidean distance) and expectation-maximization (EM) were the chosen algorithms. Two datasets were used as case studies to perform classification and clustering analysis. The accuracy values are good with intervals between 88.05% and 97.15%. The clustering techniques were essential in the analysis of a dataset where little information was available, allowing the identification of different clusters that represent different risk group in terms cardiovascular function. The three cluster analysis has allowed the characterization of distinctive features for each of the clusters. Reflected wave time (T_RP) and systolic wave time (T_SP) were the selected features for clusters visualization. Data mining methodologies have proven their usefulness in screening studies due to its descriptive and predictive power.

Download


Paper Citation


in Harvard Style

G. Almeida V., Borba J., Pereira T., C. Pereira H., Cardoso J. and Correia C. (2013). Data Mining based Methodologies for Cardiac Risk Patterns Identification . In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2013) ISBN 978-989-8565-35-8, pages 127-133. DOI: 10.5220/0004222701270133

in Bibtex Style

@conference{bioinformatics13,
author={V. G. Almeida and J. Borba and T. Pereira and H. C. Pereira and J. Cardoso and C. Correia},
title={Data Mining based Methodologies for Cardiac Risk Patterns Identification},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2013)},
year={2013},
pages={127-133},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004222701270133},
isbn={978-989-8565-35-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2013)
TI - Data Mining based Methodologies for Cardiac Risk Patterns Identification
SN - 978-989-8565-35-8
AU - G. Almeida V.
AU - Borba J.
AU - Pereira T.
AU - C. Pereira H.
AU - Cardoso J.
AU - Correia C.
PY - 2013
SP - 127
EP - 133
DO - 10.5220/0004222701270133