Abstract
Recently much attention has been paid to semantic overlay networks for information retrieval in large scale peer-to-peer networks, and much research work on semantic overlay protocols and searching algorithms has been done and the results indicate that semantic overlay is efficient for content searching in peer-to-peer networks. However, very limited work has been done to analyze and evaluate the characteristics of semantic overlay networks. In this paper we identify a natural property of semantic overlay networks, the community structure. We propose a mathematical model to evaluate the property of community structure of semantic P2P overlay networks. A heuristic algorithm is designed to optimize the community structure. Using the evaluation model we compare the SemreX semantic overlay with the Gnutella network. Results demonstrate that a SemreX overlay network has the distinctive community structure feature, while a Gnutella-like network does not. We also simulate a simple flooding protocol in both overlays to show that the overlay with community structure is more efficient for content searching.
Similar content being viewed by others
References
Shen H T, Shu Y, Yu B. Efficient semantic-based content search in P2P network. IEEE Trans Knowl Data Eng, 2004, 16: 813–826
Stoica I, Morris R, Karger D, et al. Chord: a scalable peer-to-peer lookup service for internet application. In: Proceedings of ACM SIGCOMM’01, San Diego, California, USA, 2001
Ratnasamy S, Francis P, Handley M, et al. A scalable content-addressable network. In: Proceedings of ACM SIGCOMM’ 01, San Diego, California, USA, 2001
Li J, Loo B T, Hellerstein J M, et al. On the feasibility of peer-to-peer web indexing and search. In: Proceedings of IPTPS, Berkeley, CA, USA, 2003
Reynolds P, Vahdat A. Efficient peer-to-peer keyword searching. In: Proceedings of Middleware, Rio de Janeiro, Brazil, 2003
Gnawali O D. A keyword-set search system for peer-to-peer networks. Master’s thesis, Massachusetts Institute of Technology, 2002
Bender M, Michel S, Triantafillou P, et al. P2P content search: give the web back to the people. In: Proceedings of the 5th International Workshop on Peer-to-Peer Systems (IPTPS’06), Santa Barbara, CA, USA, 2006
Gkantsidis C, Mihail M, Saberi A. Random walks in peer-to-peer networks. In: Proceedings of IEEE INFOCOM’04, Hong Kong, China, 2004
Sripanidkulchai K, Maggs B, Zhang H. Efficient content location using interest-based locality in peer-to-peer systems. In: Proceedings of IEEE INFOCOM, San Francisco, California, USA, 2003
Nejdl W, Wolf B, Qu C, et al. Edutella: a peer-to-peer networking infrastructure based on rdf. In: Proceedings of the 11th World Wide Web Conference (WWW’02), Hawaii, USA, 2002
Nejdl W, Wolpers M, Siberski W, et al. Super-peer-based routing and clustering strategies for rdf-based peer-to-peer networks. In: Proceedings of the 12th World Wide Web Conference (WWW’03), Budapest, Hungary, 2003
Haase P, Broekstra J, Ehrig M, et al. Bibster: a semantic-based bibliographic peer-to-peer system. In: Proceedings of the 2004 International Semantic Web Conference (ISWC’04), Hiroshima, Japan, 2004
Li M, Lee W C, Sivasubramaniam A, et al. SSW: A small-world-based overlay for peer-to-peer search. IEEE Trans Paral Distr Syst, 2008, 19: 735–749
Newman M. Modularity and community structure in networks. Proc Nat Acad Sci (PNAS), 2006, 103: 8577–8582
Flake G W, Lawrence S R, Giles C L, et al. Self-organization and identification of web communities. IEEE Comput, 2002, 35: 66–71
Newman M E J. Coauthorship networks and patterns of scientific collaboration. Proc Nat Acad Sci, 2004, 101: 5200–5205
Jin H, Chen H, Ning X M. SemreX: a semantic peer-to-peer system for literature documents retrieval. In: Proceedings of the 1st Asian Semantic Web Conference (ASWC’06), Beijing, China, 2006
Jin H, Chen H. SemreX: efficient search in semantic overlay for literature retrieval. Future Gener Comput Syst, 2008, 24: 475–488
Ning X M, Jin H, Chen H. Efficient search for peer-to-peer information retrieval using semantic small world. In: Proceedings of the 15th International World Wide Web Conference (WWW’06), Edinburgh Scotland, 2006
ACM Topics. http://www.acm.org/class/1998
Yu Y, Jin H. Building a semantic P2P scientific references sharing system with jxta. In: Proceedings of the Eighth Asia Pacific Web Conference (APWEB’06), Harbin, China, 2006
Yuhua L, Bandar Z A, McLean D. An approach for measuring semantic similarity between words using multiple information sources. IEEE Trans Knowl Data Eng, 2003, 15: 871–882
Iamnitchi A, Ripeanu M, Foster I. Small-world file-sharing communities. In: Proceedings of IEEE INFOCOM’04, Hong Kong, 2004
Clauset A, Newman M E J, Moore C. Finding community structure in very large networks. Phys Rev E, 2004, 70: 0066111
Radicch F, Castellano C, Cecconi F, et al. Defining and identifying communities in networks. Proc Nat Acad Sci (PNAS), 2004, 101: 2658–2663
Zhou D, Manavoglu E, Li J, et al. Probabilistic models for discovering e-communities. In: Proceedings of the 15th international World Wide Web confernece (WWW’06), Edinburgh Scotland, 2006
Barrat A, Barthelemy M, Pastor-Satorras R, et al. The architecture of complex weighted networks. Proc Nat Acad Sci, 2004, 101: 3747–3752
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chen, H., Jin, H. Finding and evaluating the community structure in semantic peer-to-peer overlay networks. Sci. China Inf. Sci. 54, 1340–1351 (2011). https://doi.org/10.1007/s11432-011-4296-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11432-011-4296-6