Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Reducing network traffic in unstructured P2P systems using Top-k queries

  • Published:
Distributed and Parallel Databases Aims and scope Submit manuscript

Abstract

A major problem of unstructured P2P systems is their heavy network traffic. This is caused mainly by high numbers of query answers, many of which are irrelevant for users. One solution to this problem is to use Top-k queries whereby the user can specify a limited number (k) of the most relevant answers. In this paper, we present FD, a (Fully Distributed) framework for executing Top-k queries in unstructured P2P systems, with the objective of reducing network traffic. FD consists of a family of algorithms that are simple but effective. FD is completely distributed, does not depend on the existence of certain peers, and addresses the volatility of peers during query execution. We validated FD through implementation over a 64-node cluster and simulation using the BRITE topology generator and SimJava. Our performance evaluation shows that FD can achieve major performance gains in terms of communication and response time.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. K. Aberer and J. Wu, “Framework for decentralized ranking in web information retrieval,” in Proc. of the 5th Asia Pacific Web Conference (APWeb), 2003.

  2. K. Aberer et al., “P-Grid: A self-organizing structured P2P system,” SIGMOD Record, vol. 32, no. 3, 2003.

  3. R. Akbarinia, V. Martins, E. Pacitti and P. Valduriez, “Design and implementation of Atlas P2P architecture,” Global Data Management, R. Baldoni, G. Cortese and F. Davide (Eds.), IOS Press, 2006.

  4. R. Akbarinia and V. Martins, “Data management in the APPA P2P system,” Int. Workshop on High-Performance Data Management in Grid Environments (HPDGrid), 2006.

  5. BRITE, http://www.cs.bu.edu/brite/.

  6. M.J. Carey and D. Kossmann, On saying “Enough Already!,” SIGMOD Conf., 1997.

  7. S. Chaudhuri and L. Gravano, “Evaluating Top-k selection queries,” VLDB Conf., 1999.

  8. I. Clarke et al., “Protecting free expression online with freenet,” IEEE Internet Computing, vol. 6, no. 1, 2002.

  9. F.M. Cuenca-Acuna, C. Peery, R.P. Martin, and T.D. Nguyen, “PlanetP: Using gossiping to build content addressable peer-to-peer information sharing communities,” IEEE Int. Symp. on High Performance Distributed Computing (HPDC), 2003.

  10. R. Fagin, J. Lotem, and M. Naor, “Optimal aggregation algorithms for middleware,” J. Comput. Syst. Sci., vol. 66, no. 4, 2003.

  11. Gnutella. http://www.gnutelliums.com/.

  12. F. Howell and R. McNab, “SimJava: A discrete event simulation package for Java with applications in computer systems modeling,” Int. Conf. on Web-based Modelling and Simulation, Society for Computer Simulation, San Diego, CA, 1998.

  13. Kazaa. http://www.kazaa.com/.

  14. S. Michel, P. Triantafillou, and G. Weikum, “KLEE: A framework for distributed top-k query algorithms,” VLDB Conf., 2005.

  15. B. Ooi, Y. Shu, and K.-L. Tan, “Relational data sharing in peer-based data management systems,” SIGMOD Record, vol. 32, no. 3, 2003.

  16. S. Ratnasamy, P. Francis, M. Handley, R.M. Karp, and S. Shenker, “A scalable content-addressable network,” in Proc. of SIGCOMM, 2001.

  17. M. Ripeanu and I. Foster, “Mapping the gnutella network: Macroscopic properties of large-scale peer-to-peer systems,” IPTPS, 2002.

  18. M. Ripeanu, A. Iamnitchi, and I. Foster, “Mapping the Gnutella network,” IEEE Internet Computing, vol. 6, no. 1, 2002.

  19. S. Saroiu, P. Gummadi, and S. Gribble, “A measurement study of peer-to-peer file sharing systems,” in Proc. of Multimedia Computing and Networking (MMCN), 2002.

  20. I. Stoica, R. Morris, D.R. Karger, M.F. Kaashoek, and H. Balakrishnan, “Chord: A scalable peer-to-peer lookup service for internet applications,” in Proc. of SIGCOMM, 2001.

  21. I. Tatarinov et al., “The Piazza peer data management project,” SIGMOD Record, vol. 32, no. 3, 2003.

  22. U. Thaden, W. Siberski, W.T. Balke, and W. Nejdl, “Top-k query evaluation for schema-based peer-to-peer networks,” Int. Semantic Web Conf. (ISWC), 2004.

  23. M. Theobald, G. Weikum, and R. Schenkel, “Top-k query evaluation with probabilistic guarantees,” VLDB Conf., 2004.

  24. B. Yang and H. Garcia-Molina, “Improving search in peer-to-peer networks,” in Proc. Int. Conf. on Distributed Computing Systems (ICDCS), 2002.

  25. C. Yu, G. Philip, and W. Meng, “Distributed Top-n query processing with possibly uncooperative local systems,” VLDB Conf., 2003.

  26. C. Yu, P. Sharma, W. Meng, and Y. Qin, “Databases selection for processing k nearest neighbors queries in distributed environments,” ACM/IEEE-CS joint Conf. on DL, 2001.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Reza Akbarinia.

Additional information

Recommended by: Sunil Prabhakar

Work partially funded by the ARA Massive Data of the Agence Nationale de la Recherche.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Akbarinia, R., Pacitti, E. & Valduriez, P. Reducing network traffic in unstructured P2P systems using Top-k queries. Distrib Parallel Databases 19, 67–86 (2006). https://doi.org/10.1007/s10619-006-8313-5

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10619-006-8313-5

Keywords