Analysis of information management and scheduling technology in hadoop

TitleAnalysis of information management and scheduling technology in hadoop
Publication TypeJournal Article
Year of Publication2014
AuthorsWeihua, M, Hong, Z, Qianmu, L, Bin, X
JournalJournal of Digital Information Management
Volume12
Issue2
Pagination133 - 138
Date Published2014
KeywordsCloud computing, Hadoop, Map Reduce, Task Scheduling
Abstract

Development of big data computing has brought many changes to society and social life is constantly digitized. 'How to handle vast amounts of data' has become a more and more fashionable topic. Hadoop is a distributed computing software framework, which includes HDFS and MapReduce distributed computing method, make distributed processing huge amounts of data possible. Then job scheduler determines the efficiency of Hadoop clusters and user experience. Under the premise of familiar with the mechanism of Hadoop's running tasks, make a full analysis of the existing Hadoop task scheduling algorithm, such as FIFO-Scheduler, Capacity-Scheduler, FairShare-Scheduler and LATE-Scheduler, found that the existing scheduling algorithms do not perceive the performance of the computing node, so that it cannot assign different tasks depending on the machine performance in heterogeneous Hadoop cluster.

URLhttp://www.scopus.com/inward/record.url?eid=2-s2.0-84903118321&partnerID=40&md5=9c6201daf048bd5fc8d52c577eeb6126

Collaborative Partner

Institute of Electronic and Information Technology (IEIT)

Collaborative Partner

Collaborative Partner