Title | Spanning tree method for minimum communication costs in grouped virtual mapreduce cluster |
Publication Type | Journal Article |
Year of Publication | 2013 |
Authors | Yang, Y, Long, X, Shi, B |
Journal | Journal of Digital Information Management |
Volume | 11 |
Issue | 3 |
Pagination | 213 - 219 |
Date Published | 2013 |
Keywords | Cloud computing, MapReduce, Spanning tree, Virtual machine |
Abstract | MapReduce and virtual clusters are powerful tools in the era of big data and cloud computing. Combining these two emerging technologies gives a system feasible scalability, easy management, fast deployment, and high efficiency. However, the I/O bottleneck of virtualization technologies may seriously degrade the performance of a MapReduce cluster that runs I/O-intensive applications. In this paper, we analyze the advantages and disadvantages of combining virtualization technology with a MapReduce cluster. We also analyze the communication model of each and build a communication cost model. We then propose a novel minimum-weight spanning tree algorithm for constructing a virtual MapReduce cluster with lower communication costs. By constructing the minimum-weight spanning tree, we derive a method for selecting local masters and grouping the cluster. Theoretical simulation and experimental results demonstrate that our method can greatly reduce communication costs, with a performance improvement of up to approximately 40.4%. |
URL | http://www.scopus.com/inward/record.url?eid=2-s2.0-84880774380&partnerID=40&md5=a8db5f920e0bbee748a4977aecbefdf2 |