Comparative Study of Query Performance in a Remote Health Framework using Cassandra and Hadoop

Himadri Sekhar Ray, Kausik Naguri, Poly Sil Sen, Nandini Mukherjee

2016

Abstract

With recent advances in distributed processing, sensor networks, cloud computing and similar technologies, big data has gained importance, and a number of big data applications can now be envisaged that could not be conceptualised earlier. As technologists have focused on storing, processing and managing big data, a number of big data solutions have emerged. The objective of this paper is to study two such solutions, namely Hadoop and Cassandra, in order to assess their suitability for healthcare applications. The paper considers a data model for a remote health framework and demonstrates mappings of the data model onto Hadoop and Cassandra. The data model follows popular national and international standards for Electronic Health Records. The paper shows that, in order to obtain an efficient mapping of a given data model onto a big data solution such as Cassandra, sample queries must be considered. Health data is stored in Hadoop as XML files, considering the same set of queries. The performance of these queries in Hadoop is observed first, and then the performance of executing the same queries on the same experimental setup using Hadoop and Cassandra is compared. YCSB guidelines are followed to design the experiments. The study provides insight into the applicability of big data solutions in the healthcare domain.

Paper Citation


in Harvard Style

Ray H., Naguri K., Sil Sen P. and Mukherjee N. (2016). Comparative Study of Query Performance in a Remote Health Framework using Cassandra and Hadoop. In Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 5: HEALTHINF, (BIOSTEC 2016) ISBN 978-989-758-170-0, pages 330-337. DOI: 10.5220/0005706803300337

in Bibtex Style

@conference{healthinf16,
author={Himadri Sekhar Ray and Kausik Naguri and Poly Sil Sen and Nandini Mukherjee},
title={Comparative Study of Query Performance in a Remote Health Framework using Cassandra and Hadoop},
booktitle={Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 5: HEALTHINF, (BIOSTEC 2016)},
year={2016},
pages={330-337},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005706803300337},
isbn={978-989-758-170-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 5: HEALTHINF, (BIOSTEC 2016)
TI - Comparative Study of Query Performance in a Remote Health Framework using Cassandra and Hadoop
SN - 978-989-758-170-0
AU - Ray H.
AU - Naguri K.
AU - Sil Sen P.
AU - Mukherjee N.
PY - 2016
SP - 330
EP - 337
DO - 10.5220/0005706803300337