I. Introduction
One of the recent developments in high performance and cloud computing has been the emergence of the map-reduce model, originated by Google [8]. Besides the use of mapreduce for real commercial applications, map-reduce has sparked a large volume of research related to APIs for data-intensive computing, their implementations, and application/algorithm studies. Multiple projects have studied improving the API or implementations [13], [24], [30], [32], [37], [22], [29], [25], [23]. At the same time, several studies have evaluated the suitability of the map-reduce model for a variety of applications and on different computing environments [10], [27], [36], [11], [9], [7], [20].