Loading [MathJax]/extensions/MathMenu.js
A fast estimation of shortest path distance for power-law network predominant cloud service | IEEE Conference Publication | IEEE Xplore

A fast estimation of shortest path distance for power-law network predominant cloud service


Abstract:

The estimation of the shortest path between two vertices in a network graphs is an important issue for many applications in the real world, which may be road networks, so...Show More

Abstract:

The estimation of the shortest path between two vertices in a network graphs is an important issue for many applications in the real world, which may be road networks, social collaboration networks, biological networks and so forth. The short response time, the less space cost and the high accuracy are three critical evaluation metrics in the approximate calculation of the shortest path. To achieve a quick and accurate calculation of an approximate shortest path distance, the typical method that takes a non-trivial approach calculates the distance between every pair of vertices in advance and records the calculation results with an n×n matrix, where n is the number of vertexes in the network graph. Unfortunately, it hard for this method function because it is difficult to sustain the huge amount of storage space and time consuming preprocessing for the big size of the network graph in practice. Unlike many efforts that have been made to minimize the costs of time and space in enhancing the accuracy of the shortest path calculation, this paper proposes an estimation scheme of the shortest path calculation for power-law graphs, due to the fact that many networks in the real applications are power-law networks. The theoretical analysis indicates that the schema is successfully optimized so that the query time can be an approximation constant and the calculation result of the approximate shortest path distance is almost 2 to 3 times as long as the actual shortest path distance between a pair of vertices. A simulation experiment also validates the analysis and demonstrates the feasibility of the proposed scheme.
Date of Conference: 03-06 December 2012
Date Added to IEEE Xplore: 04 February 2013
ISBN Information:
Conference Location: Taipei, Taiwan
References is not available for this document.

I. Introduction

In most cases, cloud computing has the advantage over handling data intensive applications as large size graph processing. With storage becoming cheaper and the need to store and retrieve large amounts of graph data, developing systems to handle trillions of graph data requiring tera/peta bytes of disk space is no longer a distant prospect. As yet, many efforts have been done in this aspect. Lichtenwalter presented the design and implementation of a master-worker framework for easily computing distributed graph issues [1]. The framework automatically divides and distributes the workload and manages completion using an arbitrary number of heterogeneous computational resources. Jiang described a system, Extended MATE or Ex-MATE, which supports this alternate API with reduction objects of arbitrary sizes [2]. He evaluated this system using three graph mining applications and compare its performance to that of PEGASUS, a graph mining system implemented based on the original map-reduce API and its Hadoop implementation. Yang proposed a unified distributed method in solving some critical graph mining problems on top of a cluster system with the help of MapReduce [3]. These problems include graph transformation, sub-graph partition, maximal clique enumeration, connected component finding and community detection. Liu et. al. used Hadoop, an open source implementation of MapReduce, to conduct a series of analyses on large-scale social networks including several distributions, clustering coefficient and diameter [4].

Select All
1.
R. Lichtenwalter, "DisNet: A Framework for Distributed Graph Computation," 2011 International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 263-270, 2011.
2.
Wei Jiang, "Ex-MATE: Data Intensive Computing with Large Reduction Objects and Its Application to Graph Mining," 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), pp.475-484, 2011.
3.
Shengqi Yang, "Efficient Dense Structure Mining Using MapReduce," IEEE International Conference on Data Mining Workshops ( ICDMW 09), pp. 332-337, 2009.
4.
Guojun Liu, Ming Zhang and Fei Yan, "International Conference on Computational Aspects of Social Networks (CASoN)," pp. 487-490, 2010.
5.
W.K. Chan, Lijun Mei and Zhenyu Zhang, "Modeling and Testing of Cloud Applications," IEEE Asia-Pacific Services Computing Conference (APSCC 2009), pp. 111-118, 2009.
6.
Luh Yen, Marco Saerens, Amin Mantrach and Masashi Shimbo, "A Family of Dissimilarity Measures between Nodes Generalizing both the Shortest-path and the Commute-time Distances," Proceedings of 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD08), pp. 785-793, 2008.
7.
Jiefeng Cheng and Jeffrey Xu Yu, "On-line Exact Shortest Distance Query Processing," Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology (EDBT09), pp. 481-491, 2009.
8.
Michalis Potamias, Francesco Bonchi, Carlos Castillo and Aristides Gionis, "Fast Shortest Path Distance Estimation in Large Networks," In Proceeding of the 18th ACM Conference on Information and Knowledge Management (CIKM), pp. 867-876, 2009.
9.
Atish Das Sarma, Sreenivas Gollapudi, Marc Najork and Rina Panigrahy, "A Sketch-Based Distance Oracle for Web-Scale Graphs," Proceedings of International Conference on Web Search and Data Mining (WSDM), pp. 401-410, 2010.
10.
Sandeep Sen, "Approximating Shortest Paths in Graphs," WALCOM: Algorithms and Computation, LNCS, vol. 5431, pp. 32-43, 2009.
11.
Christian Sommer, "Approximate Shortest Path and Distance Queries in Networks," PhD thesis, The University of Tokyo, 2010.
12.
Mikkel Thorup and Uri Zwick, "Approximate Distance Oracles," Journal of ACM, vol. 52, no. 1, pp. 1-24, 2005.
13.
Rachit Agarwal, P. Brighten Godfrey, and Sariel Har-Peled, "Approximate Distance Queries and Compact Routing in Sparse Graphs," Proceedings of 30th the IEEE Computer and Communications Societies (INFOCOM2011), pp.1754-1762, 2011.
14.
Christian Sommer, Elad Verbin and Wei Yu, "Distance Oracles for Sparse Graphs," 50th Annual IEEE Symposium on Foundations of Computer Science (FOCS), pp. 703-712, 2009.
15.
Michalis Faloutsos, Petros Faloutsos and Christos Faloutsos, "On Power-law Relationships of the Internet Topology," Computer Communications Review, vol. 29, pp.251-262, 1999.
16.
Reka Albert, Hawoong Jeong and Albert-Laszlo Barabasi, "Dimeter of the World Wide Web," Nature, vol. 401, pp.130-131, 1999.
17.
Dmitri Krioukov, Kevin Fall and Xiaowei Yang, "Compact Routing on Internet-Like Graphs," Proceedings of 30th the IEEE Computer and Communications Societies (INFOCOM2011), pp.209-219, 2004.
18.
Wei Chen, Christian Sommer, Shang-Hua Teng and Yajun Wang, "A Compact Routing Scheme and Approximate Distance Oracle for Power-Law Graphs," Technical Report MSR-TR-2009-84, Microsoft Research, July 2009.
19.
Arthur Brady and Lenore Cowen, "Compact Routing On Power Law Graphs with Additive Stretch," Proceedings of 8th Workshop on Algorithm Engineering and Experiments (ALENEX 06), pp. 119-128, 2006.
20.
William Aiello, Fan Chung and Linyuan Lu, "A Random Graph Model for Massive Graphs," Proceedings of 32th ACM symposium on Theory of computing (STOC 00), pp. 171-180, 2000.
21.
Fan Chung, Fan Chung, Fan Chung, Linyuan Lu and Linyuan Lu, "The Average Distances in Random Graphs with Given Expected Degrees," Proceedings of the National Academy of Science of the United States of America (PNAS), vol. 99, pp.15879-15882, 2002.
22.
Linyuan Lincoln Lu, "Probabilistic Methods in Massive Graphs and Internet Computing," Ph D thesis, University of California San Diego, 2002.
23.
M. Fredman, J. Komlos, and E. Szemeredi, "Storing a sparse table with O(1) worst case access time," Journal of the ACM, 31(3):538-544, 1984.
Contact IEEE to Subscribe

References

References is not available for this document.