DOI: 10.1145/1183579.1183588

ALVIS peers: a scalable full-text peer-to-peer retrieval engine

Published: 11 November 2006

Abstract

We present Alvis peers, a full-text P2P retrieval engine designed to offer retrieval performance comparable to centralized solutions while scaling to a very large number of peers. It is the result of our research efforts within the ALVIS project (European FP6 STREP project ALVIS, http://www.alvis.info/), which aims at building a truly distributed semantic search engine. To cope with the problem of unscalable bandwidth consumption in the P2P network, the engine implements a novel retrieval model that indexes highly discriminative keys (HDKs): terms and term sets appearing in a limited number of collection documents. Our prototype is a fully functional retrieval engine built over a structured P2P network. It includes a component for HDK-based indexing and retrieval, and a distributed content-based ranking module. Such an integrated system represents a substantial contribution to the design and development of realistic P2P retrieval systems.
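
The HDK idea in the abstract (index only terms and term sets that occur in a limited number of documents) can be illustrated with a small single-machine sketch. This is not the Alvis peers implementation: the names `select_hdks`, `lookup`, `df_max`, and `max_key_size` are hypothetical, the document-frequency threshold is an assumed parameter, and the rule that skips a term set when one of its subsets is already discriminative is a simplifying assumption. In a structured P2P network each selected key would presumably be stored at the peer responsible for it; that distribution step is omitted here.

```python
from collections import defaultdict
from itertools import combinations


def select_hdks(docs, df_max=2, max_key_size=2):
    """Return {key: doc ids} for term sets found in at most df_max documents.

    docs maps a document id to its text. A key is a sorted tuple of terms.
    All names and thresholds here are illustrative assumptions.
    """
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        terms = sorted(set(text.lower().split()))
        for size in range(1, max_key_size + 1):
            for key in combinations(terms, size):
                postings[key].add(doc_id)

    hdks = {}
    # Visit smaller keys first so the subset check below sees them.
    for key in sorted(postings, key=len):
        if len(postings[key]) > df_max:
            continue  # too frequent to be discriminative
        # Assumed redundancy rule: skip a key if a proper subset is already an HDK.
        if any(sub in hdks
               for r in range(1, len(key))
               for sub in combinations(key, r)):
            continue
        hdks[key] = postings[key]
    return hdks


def lookup(hdks, query):
    """Union the posting lists of every indexed key contained in the query."""
    terms = sorted(set(query.lower().split()))
    hits = set()
    for size in range(1, len(terms) + 1):
        for key in combinations(terms, size):
            hits |= hdks.get(key, set())
    return hits


docs = {
    1: "peer to peer retrieval",
    2: "peer to peer network",
    3: "full text retrieval engine",
    4: "peer retrieval engine",
}
hdks = select_hdks(docs, df_max=2, max_key_size=2)
result = lookup(hdks, "peer retrieval")
```

In this toy collection the single terms "peer" and "retrieval" each occur in three documents and are therefore not indexed, while the pair ("peer", "retrieval") occurs in only two and is kept, so a query for both terms is answered from one short posting list.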


Published In

P2PIR '06: Proceedings of the international workshop on Information retrieval in peer-to-peer networks
November 2006
66 pages
ISBN: 1595935274
DOI: 10.1145/1183579

Publisher

Association for Computing Machinery, New York, NY, United States


Author Tags

1. architecture
2. distributed information retrieval
3. peer-to-peer information systems
4. scalability

Qualifiers

• Article

Conference

CIKM06: Conference on Information and Knowledge Management
November 11, 2006
Arlington, Virginia, USA

Cited By

• (2017) Distributed Search Efficiency and Robustness in Service oriented Multi-agent Networks. Proceedings of the 2017 International Conference on Management Engineering, Software Engineering and Service Sciences, pp. 9-18. DOI: 10.1145/3034950.3034975. Online publication date: 14-Jan-2017
• (2016) Peer-to-Peer Full-Text Keyword Search of the Web. Networked Systems, pp. 263-277. DOI: 10.1007/978-3-319-26850-7_18. Online publication date: 23-Mar-2016
• (2015) A feasible MapReduce peer-to-peer framework for distributed computing applications. Vietnam Journal of Computer Science 2:1, pp. 57-66. DOI: 10.1007/s40595-014-0031-8. Online publication date: 1-Feb-2015
• (2013) Studying the clustering paradox and scalability of search in highly distributed environments. ACM Transactions on Information Systems 31:2, pp. 1-36. DOI: 10.1145/2457465.2457468. Online publication date: 17-May-2013
• (2012) Decentralized Search and the Clustering Paradox in Large Scale Information Networks. Next Generation Search Engines, pp. 29-46. DOI: 10.4018/978-1-4666-0330-1.ch002. Online publication date: 2012
• (2012) Peer-to-Peer Information Retrieval. ACM Transactions on Information Systems 30:2, pp. 1-34. DOI: 10.1145/2180868.2180871. Online publication date: 1-May-2012
• (2011) CoFeed: privacy‐preserving Web search recommendation based on collaborative aggregation of interest feedback. Software: Practice and Experience 43:10, pp. 1165-1184. DOI: 10.1002/spe.1127. Online publication date: 6-Oct-2011
• (2010) Scalability of findability. Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval, pp. 74-81. DOI: 10.1145/1835449.1835465. Online publication date: 19-Jul-2010
• (2010) HAPS. Journal of Computer Science and Technology 25:3, pp. 482-498. DOI: 10.1007/s11390-010-9339-8. Online publication date: 1-May-2010
• (2010) Collaborative ranking and profiling. Proceedings of the 10th IFIP WG 6.1 international conference on Distributed Applications and Interoperable Systems, pp. 226-242. DOI: 10.1007/978-3-642-13645-0_17. Online publication date: 7-Jun-2010
