SUSTAINABILITY OF HADOOP CLUSTERS

Luis Bautista, Alain April

2011

Abstract

Hadoop is a set of utilities and frameworks for the development and storage of distributed applications in cloud computing, the core component of which is the Hadoop Distributed File System (HDFS). NameNode is a key element of its architecture, and also its “single point of failure”. To address this issue, we propose a replication mechanism that will protect the NameNode data in case of failure. The proposed solution involves two distinct components: the creation of a BackupNode cluster that will use a leader election function to replace the NameNode, and a mechanism to replicate and synchronize the file system namespace that is used as a recovery point.

Download


Paper Citation


in Harvard Style

Bautista L. and April A. (2011). SUSTAINABILITY OF HADOOP CLUSTERS . In Proceedings of the 1st International Conference on Cloud Computing and Services Science - Volume 1: CLOSER, ISBN 978-989-8425-52-2, pages 587-590. DOI: 10.5220/0003332705870590

in Bibtex Style

@conference{closer11,
author={Luis Bautista and Alain April},
title={SUSTAINABILITY OF HADOOP CLUSTERS},
booktitle={Proceedings of the 1st International Conference on Cloud Computing and Services Science - Volume 1: CLOSER,},
year={2011},
pages={587-590},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003332705870590},
isbn={978-989-8425-52-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 1st International Conference on Cloud Computing and Services Science - Volume 1: CLOSER,
TI - SUSTAINABILITY OF HADOOP CLUSTERS
SN - 978-989-8425-52-2
AU - Bautista L.
AU - April A.
PY - 2011
SP - 587
EP - 590
DO - 10.5220/0003332705870590