Clustering As a result of the rapid development in cloud computing, it & fundamental to investigate the performance of extraordinary Hadoop MapReduce purposes and to realize the performance bottleneck in a cloud cluster that contributes to higher or diminish performance. It is usually primary to research the underlying hardware in cloud cluster servers to permit the optimization of program and hardware to achieve the highest performance feasible. Hadoop is founded on MapReduce, which is among the most popular programming items for huge knowledge analysis in a parallel computing environment. In this paper, we reward a particular efficiency analysis, characterization, and evaluation of Hadoop MapReduce Word Count utility. The main aim of this paper is to give implements of Hadoop map-reduce programming by giving a hands-on experience in developing Hadoop based Word-Count and Apriori application. Word count problem using Hadoop Map Reduce framework. The Apriori Algorithm has been used for finding frequent item set using Map Reduce framework.
References
• Samneet Singh and Yan Liu,“A Cloud Service Architecture for Analyzing Big Monitoring Data”,ISSNll1007-0214ll05/10llpp55-70 Volume 21, Number 1, February 2016
• JOSEPH A. ISSA, “Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount”, Received November 19, 2015, accepted December 12, 2015, date of publication December 18, 2015, date of current version December 29, 2015.
• Yaxiong Zhao, Jie Wu, and Cong Liu, “Dache: A Data Aware Caching for Big-Data Applications Using the MapReduce Framework”,ISSNll10070214ll05/10llpp39-50 Volume 19, Number 1, February 2014
• Zhuoyao Zhang Ludmila Cherkasova, “Benchmarking Approach for Designing a MapReduce Performance Model”, ICPE’13, April 21-24, 2013
• Nikzad Babaii Rizvandi, Albert Y. Zomaya , Ali Javadzadeh Boloori, Javid Taheri1, “On Modeling Dependency between MapReduce Configuration Parameters and Total Execution Time”, 2012
• Nikzad Babaii Rizvandi, Javid Taheri1, Reza Moraveji, Albert Y. Zomaya, “On Modelling and Prediction of Total CPU Usage for Applications in MapReduce Enviornments”, 2011.
• Baratloo, M. Karaul, Z. Kedem, and P.Wyckoff, ``Charlotte: Meta computing on theWeb,'' in Proc. 9th Int. Conf. Parallel Distrib. Comput. Syst., 1996, pp. 1_13.
• J. Bent, D. Thain, A. C. Arpaci-Dusseau, R. H. Arpaci-Dusseau, and M. Livny, ``Explicit control in the batch-aware distributed _le system,'' in Proc. 1st USENIX Symp. Netw. Syst. Design Implement. (NSDI), Mar. 2004, pp. 365_378.
• Fox, S. D. Gribble, Y. Chawathe, E. A. Brewer, and P. Gauthier,``Cluster-based scalable network services,'' in Proc. 16th ACMSymp. Oper. Syst. Principles, Saint-Malo, France, 1997, pp. 78_91.
• S. Ghemawat, H. Gobioff, and S.-T. Leung, ``The Google _le system,'' in Proc. 19th Symp. Oper. Syst. Principles, New York, NY, USA, 2003, pp. 29_43.