Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
Skip header Section
The Probabilistic Relevance FrameworkDecember 2009
Publisher:
  • Now Publishers Inc.
  • P.O. Box 1024
  • Hanover
  • MA
  • United States
ISBN:978-1-60198-308-4
Published:17 December 2009
Pages:
70
Skip Bibliometrics Section
Reflects downloads up to 12 Nov 2024Bibliometrics
Skip Abstract Section
Abstract

The Probabilistic Relevance Framework (PRF) is a formal framework for document retrieval, grounded in work done in the 1970-80s, which led to the development of one of the most successful text-retrieval algorithms, BM25. In recent years, research in the PRF has yielded new retrieval models capable of taking into account structure and link-graph information. Again, this has led to one of the most successful web-search and corporate-search algorithms, BM25F. The Probabilistic Relevance Framework: BM25 and Beyond presents the PRF from a conceptual point of view, describing the probabilistic modelling assumptions behind the framework and the different ranking algorithms that result from its application: the binary independence model, relevance feedback models, BM25, BM25F. Besides presenting a full derivation of the PRF ranking algorithms, it provides many insights about document retrieval in general, and points to many open challenges in this area. It also discusses the relation between the PRF and other statistical models for IR, and covers some related topics, such as the use of non-textual features, and parameter optimization for models with free parameters. The Probabilistic Relevance Framework: BM25 and Beyond is self-contained and accessible to anyone with basic knowledge of probability and inference.

Cited By

  1. ACM
    Zhao W, Liu J, Ren R and Wen J (2023). Dense Text Retrieval Based on Pretrained Language Models: A Survey, ACM Transactions on Information Systems, 42:4, (1-60), Online publication date: 31-Jul-2024.
  2. ACM
    Dao H, Deng Y, Le D and Liao L Broadening the View: Demonstration-augmented Prompt Learning for Conversational Recommendation Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, (785-795)
  3. ACM
    Yang G, Zhou Y, Yang W, Yue T, Chen X and Chen T (2024). How Important Are Good Method Names in Neural Code Generation? A Model Robustness Perspective, ACM Transactions on Software Engineering and Methodology, 33:3, (1-35), Online publication date: 31-Mar-2024.
  4. Wang Y, Hu M, Huang Z, Li D, Luo W, Yang D and Lu X A canonicalization-enhanced known fact-aware framework for open knowledge graph link prediction Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, (2332-2342)
  5. ACM
    Li M, Popa D, Chagnon J, Cinar Y and Gaussier E (2023). The Power of Selecting Key Blocks with Local Pre-ranking for Long Document Information Retrieval, ACM Transactions on Information Systems, 41:3, (1-35), Online publication date: 31-Jul-2023.
  6. Sun Y, Song J, Song X and Hou J (2023). Research on question retrieval method for community question answering, Multimedia Tools and Applications, 82:16, (24309-24325), Online publication date: 1-Jul-2023.
  7. Kuang M, Chen Z, Wang W, Kang L, Yan Q, Tang M and Hao P Multi-task Learning Based Keywords Weighted Siamese Model for Semantic Retrieval Advances in Knowledge Discovery and Data Mining, (86-98)
  8. Ezzini S, Abualhaija S, Arora C and Sabetzadeh M AI-Based Question Answering Assistance for Analyzing Natural-Language Requirements Proceedings of the 45th International Conference on Software Engineering, (1277-1289)
  9. ACM
    Yao J, Liu Z, Yang J, Dou Z, Xie X and Wen J (2022). CDSM: Cascaded Deep Semantic Matching on Textual Graphs Leveraging Ad-hoc Neighbor Selection, ACM Transactions on Intelligent Systems and Technology, 14:2, (1-24), Online publication date: 30-Apr-2023.
  10. ACM
    Zou L, Lu W, Liu Y, Cai H, Chu X, Ma D, Shi D, Sun Y, Cheng Z, Gu S, Wang S and Yin D (2022). Pre-trained Language Model-based Retrieval and Ranking for Web Search, ACM Transactions on the Web, 17:1, (1-36), Online publication date: 28-Feb-2023.
  11. Wang Y, Hou Y, Wang H, Miao Z, Wu S, Sun H, Chen Q, Xia Y, Chi C, Zhao G, Liu Z, Xie X, Sun H, Deng W, Zhang Q and Yang M A neural corpus indexer for document retrieval Proceedings of the 36th International Conference on Neural Information Processing Systems, (25600-25614)
  12. Li Z, Guo R and Kumar S Decoupled context processing for context augmented language modeling Proceedings of the 36th International Conference on Neural Information Processing Systems, (21698-21710)
  13. Zou L, Mao H, Chu X, Tang J, Wang S, Ye W and Yin D A large scale search dataset for unbiased learning to rank Proceedings of the 36th International Conference on Neural Information Processing Systems, (1127-1139)
  14. López J and Cuadrado J (2022). An efficient and scalable search engine for models, Software and Systems Modeling (SoSyM), 21:5, (1715-1737), Online publication date: 1-Oct-2022.
  15. Bouzayane S and Aberkane A (2022). Visual Chatbot for Knowledge Transfer, International Journal of Knowledge-Based Organizations, 12:2, (1-13), Online publication date: 1-Apr-2022.
  16. ACM
    Zhao X, Li Z, Wu S, Zhan Y and Zhang C Deep Text Matching in Medical Question Answering System Proceedings of the 2021 ACM International Conference on Intelligent Computing and its Emerging Applications, (134-138)
  17. Lindgren E, Reddi S, Guo R and Kumar S Efficient training of retrieval models using negative cache Proceedings of the 35th International Conference on Neural Information Processing Systems, (4134-4146)
  18. ACM
    Xu J, Lei Z, Wang H, Niu Z, Wu H, Che W, Huang J and Liu T (2021). Coherent Dialog Generation with Query Graph, ACM Transactions on Asian and Low-Resource Language Information Processing, 20:6, (1-23), Online publication date: 30-Nov-2021.
  19. Li J, Li Y, Li G, Hu X, Xia X and Jin Z EditSum Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, (155-166)
  20. Suri S, Ilyas I, Ré C and Rekatsinas T (2022). Ember, Proceedings of the VLDB Endowment, 15:3, (699-712), Online publication date: 1-Nov-2021.
  21. ACM
    Hashemi H, Zamani H and Croft W Learning Multiple Intent Representations for Search Queries Proceedings of the 30th ACM International Conference on Information & Knowledge Management, (669-679)
  22. ACM
    Zhang H, Santos A and Freire J DSDD Proceedings of the 30th ACM International Conference on Information & Knowledge Management, (2527-2536)
  23. ACM
    Zhang X and Yang Q DML: Dynamic Multi-Granularity Learning for BERT-Based Document Reranking Proceedings of the 30th ACM International Conference on Information & Knowledge Management, (3642-3646)
  24. ACM
    Li J, Yang M and Li C CLC-RS Proceedings of the 30th ACM International Conference on Information & Knowledge Management, (4734-4738)
  25. Heo J, Lee S, Min S, Park Y, Jung S, Ham T and Lee J BOSS Proceedings of the 48th Annual International Symposium on Computer Architecture, (279-291)
  26. Westermann H, Savelka J and Benyekhlef K Paragraph Similarity Scoring and Fine-Tuned BERT for Legal Information Retrieval and Entailment New Frontiers in Artificial Intelligence, (269-285)
  27. ACM
    Gomes T and Ladeira M A new conceptual framework for enhancing legal information retrieval at the Brazilian Superior Court of Justice Proceedings of the 12th International Conference on Management of Digital EcoSystems, (26-29)
  28. Cohan A and Goharian N (2018). Scientific document summarization via citation contextualization and scientific discourse, International Journal on Digital Libraries, 19:2-3, (287-303), Online publication date: 1-Sep-2018.
  29. Alharbi A, Li Y and Xu Y An Extended Random-Sets Model for Fusion-Based Text Feature Selection Advances in Knowledge Discovery and Data Mining, (126-138)
  30. Imhof M and Braschler M (2018). A study of untrained models for multimodal information retrieval, Information Retrieval, 21:1, (81-106), Online publication date: 1-Feb-2018.
  31. ACM
    Tang Y, Huang W, Liu Q, Tung A, Wang X, Yang J and Zhang B QALink Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, (1359-1368)
  32. ACM
    Alharbi A, Li Y and Xu Y Topical term weighting based on extended random sets for relevance feature selection Proceedings of the International Conference on Web Intelligence, (654-661)
  33. Mills C and Haiduc S The impact of retrieval direction on IR-based traceability link recovery Proceedings of the 39th International Conference on Software Engineering: New Ideas and Emerging Results Track, (51-54)
  34. Diaz-Mosquera J, Sanabria P, Neyem A, Parra D and Navon J Enriching capstone project-based learning experiences using a crowdsourcing recommender engine Proceedings of the 4th International Workshop on CrowdSourcing in Software Engineering, (25-29)
  35. Zou B, Lampos V, Liang S, Ren Z, Yilmaz E and Cox I A Concept Language Model for Ad-hoc Retrieval Proceedings of the 26th International Conference on World Wide Web Companion, (885-886)
  36. Mitra B, Diaz F and Craswell N Learning to Match using Local and Distributed Representations of Text for Web Search Proceedings of the 26th International Conference on World Wide Web, (1291-1299)
  37. Wang Y, Wu C and Tsai R (2016). Cross-language article linking with different knowledge bases using bilingual topic model and translation features, Knowledge-Based Systems, 111:C, (228-236), Online publication date: 1-Nov-2016.
  38. Khennak I and Drias H (2016). A Firefly Algorithm-based Approach for Pseudo-Relevance Feedback, Journal of Medical Systems, 40:11, (1-15), Online publication date: 1-Nov-2016.
  39. ACM
    Tutek M, Glavas G, Šnajder J, Milić-Frayling N and Dalbelo Basic B Detecting and Ranking Conceptual Links between Texts Using a Knowledge Base Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, (2077-2080)
  40. ACM
    Rekabsaz N, Lupu M, Hanbury A and Zuccon G Generalizing Translation Models in the Probabilistic Relevance Framework Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, (711-720)
  41. Vaidyanathan R, Das S and Srivastava N (2016). Query Expansion based on Central Tendency and PRF for Monolingual Retrieval, International Journal of Information Retrieval Research, 6:4, (30-50), Online publication date: 1-Oct-2016.
  42. ACM
    Xu B, Ye D, Xing Z, Xia X, Chen G and Li S Predicting semantically linkable knowledge in developer online forums via convolutional neural network Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, (51-62)
  43. ACM
    Omari A, Carmel D, Rokhlenko O and Szpektor I Novelty based Ranking of Human Answers for Community Questions Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, (215-224)
  44. ACM
    Rekabsaz N Enhancing Information Retrieval with Adapted Word Embedding Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, (1169-1169)
  45. Nalisnick E, Mitra B, Craswell N and Caruana R Improving Document Ranking with Dual Word Embeddings Proceedings of the 25th International Conference Companion on World Wide Web, (83-84)
  46. Cummins R A Study of Retrieval Models for Long Documents and Queries in Information Retrieval Proceedings of the 25th International Conference on World Wide Web, (795-805)
  47. Hsieh C, Yang L, Wei H, Naaman M and Estrin D Immersive Recommendation Proceedings of the 25th International Conference on World Wide Web, (51-62)
  48. ACM
    Pal D, Mitra M and Bhattacharya S Improving Pseudo Relevance Feedback in the Divergence from Randomness Model Proceedings of the 2015 International Conference on The Theory of Information Retrieval, (325-328)
  49. ACM
    Rodriguez Perez J and Jose J On Microblog Dimensionality and Informativeness Proceedings of the 2015 International Conference on The Theory of Information Retrieval, (211-220)
  50. ACM
    Mirhosseini S, Zuccon G, Koopman B, Nguyen A and Lawley M Medical Free-Text to Concept Mapping as an Information Retrieval Problem Proceedings of the 19th Australasian Document Computing Symposium, (93-96)
  51. ACM
    Albakour M, Macdonald C and Ounis I On sparsity and drift for effective real-time filtering in microblogs Proceedings of the 22nd ACM international conference on Information & Knowledge Management, (419-428)
Contributors
  • University College London
  • Yahoo Research Barcelona

Recommendations