Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Diversifying Citation Recommendations

Published: 15 December 2014 Publication History

Abstract

Literature search is one of the most important steps of academic research. With more than 100,000 papers published each year just in computer science, performing a complete literature search becomes a Herculean task. Some of the existing approaches and tools for literature search cannot compete with the characteristics of today’s literature, and they suffer from ambiguity and homonymy. Techniques based on citation information are more robust to the mentioned issues. Thus, we recently built a Web service called the advisor, which provides personalized recommendations to researchers based on their papers of interest. Since most recommendation methods may return redundant results, diversifying the results of the search process is necessary to increase the amount of information that one can reach via an automated search. This article targets the problem of result diversification in citation-based bibliographic search, assuming that the citation graph itself is the only information available and no categories or intents are known. The contribution of this work is threefold. We survey various random walk--based diversification methods and enhance them with the direction awareness property to allow users to reach either old, foundational (possibly well-cited and well-known) research papers or recent (most likely less-known) ones. Next, we propose a set of novel algorithms based on vertex selection and query refinement. A set of experiments with various evaluation criteria shows that the proposed γ-RLM algorithm performs better than the existing approaches and is suitable for real-time bibliographic search in practice.

References

[1]
Rakesh Agrawal, Sreenivas Gollapudi, Alan Halverson, and Samuel Ieong. 2009. Diversifying search results. In Proceedings of the 2nd ACM International Conference on Web Search and Data Mining. 5--14.
[2]
Sergey Brin and Lawrence Page. 1998. The anatomy of a large-scale hypertextual Web search engine. In Proceedings of the 7th International Conference on World Wide Web. 107--117.
[3]
Jaime Carbonell and Jade Goldstein. 1998. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 335--336.
[4]
Ben Carterette. 2009. An analysis of NP-completeness in novelty and diversity ranking. In Proceedings of the 2nd International Conference on Theory of Information Retrieval: Advances in Information Retrieval Theory. 200--211.
[5]
Charles L. A. Clarke, Maheedhar Kolla, Gordon V. Cormack, Olga Vechtomova, Azin Ashkan, Stefan Büttcher, and Ian MacKinnon. 2008. Novelty and diversity in information retrieval evaluation. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 659--666.
[6]
Marina Drosou and Evaggelia Pitoura. 2010. Search result diversification. ACM SIGMOD Record 39, 1, 41--47.
[7]
Sreenivas Gollapudi and Aneesh Sharma. 2009. An axiomatic approach for result diversification. In Proceedings of the 18th International Conference on World Wide Web. 381--390.
[8]
Marco Gori and Augusto Pucci. 2006. Research paper recommender systems: A random-walk based approach. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence. 778--781
[9]
Taher H. Haveliwala. 2002. Topic-sensitive PageRank. In Proceedings of the 11th International Conference on World Wide Web. 517--526.
[10]
Maxwell M. Kessler. 1963. Bibliographic coupling between scientific papers. American Documentation 14, 10--25.
[11]
Onur Küçüktunç, Kamer Kaya, Erik Saule, and Ümit V. Çatalyürek. 2012a. Fast recommendation on bibliographic networks. In Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. 480--487.
[12]
Onur Küçüktunç, Erik Saule, Kamer Kaya, and Ümit V. Çatalyürek. 2012b. Direction awareness in citation recommendation. In Proceedings of the 6th International Workshop on Ranking in Databases (DBRank’12) in conjunction with VLDB’12.
[13]
Onur Küçüktunç, Erik Saule, Kamer Kaya, and Ümit V. Çatalyürek. 2013. Diversified recommendation on graphs: Pitfalls, measures, and algorithms. In Proceedings of the 22nd International Conference on World Wide Web. 715--726.
[14]
Ni Lao and William Cohen. 2010. Relational retrieval using a combination of path-constrained random walks. Machine Learning 81, 1, 53--67.
[15]
Steve Lawrence, C. Lee Giles, and Kurt Bollacker. 1999. Digital libraries and autonomous citation indexing. Computer 32, 6, 67--71.
[16]
Jiang Li and Peter Willett. 2009. ArticleRank: A PageRank-based alternative to numbers of citations for analyzing citation networks. ASLIB Proceedings 61, 6, 605--618.
[17]
Rong-Hua Li and Jeffrey X. Yu. 2011. Scalable diversified ranking on large graphs. In Proceedings of the 11th IEEE International Conference on Data Mining. 1152--1157.
[18]
Yicong Liang, Qing Li, and Tieyun Qian. 2011. Finding relevant papers based on citation relations. In Proceedings of the 12th International Conference on Web-Age Information Management. 403--414.
[19]
David Liben-Nowell and Jon M. Kleinberg. 2007. The link-prediction problem for social networks. Journal of the American Society for Information Science and Technology 58, 7, 1019--1031.
[20]
Bin Liu and Hosagrahar V. Jagadish. 2009. Using trees to depict a forest. Proceedings of the VLDB Endowment 2, 1, 133--144.
[21]
Nan Ma, Jiancheng Guan, and Yi Zhao. 2008. Bringing PageRank to the citation analysis. Information Processing and Management 44, 2, 800--810.
[22]
Qiaozhu Mei, Jian Guo, and Dragomir Radev. 2010. DivRank: The interplay of prestige and diversity in information networks. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1009--1018.
[23]
H. P. F. Peters, Robert R. Braam, and Anthony F. J. van Raan. 1995. Cognitive resemblance and citation relations in chemical engineering publications. Journal of the American Society for Information Science 46, 1, 9--21.
[24]
Gerard Salton. 1963. Associative document retrieval techniques using bibliographic information. Journal of the ACM 10, 4, 440--457.
[25]
Henry Small. 1973. Co-citation in the scientific literature: A new measure of the relationship between two documents. Journal of the Association for Information Science 24, 4, 265--269.
[26]
Trevor Strohman, W. Bruce Croft, and David Jensen. 2007. Recommending citations for academic papers. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 705--706.
[27]
Hanghang Tong, Christos Faloutsos, and Jia-Yu Pan. 2006. Fast random walk with restart and its applications. In Proceedings of the 6th International Conference on Data Mining. 613--622.
[28]
Hanghang Tong, Jingrui He, Zhen Wen, Ravi Konuru, and Ching-Yung Lin. 2011. Diversified ranking on large graphs: An optimization viewpoint. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1028--1036.
[29]
Erik Vee, Utkarsh Srivastava, Jayavel Shanmugasundaram, Prashant Bhat, and Sihem Amer-Yahia. 2008. Efficient computation of diverse query results. In Proceedings of the 24th IEEE International Conference on Data Engineering. 228--236.
[30]
Michael J. Welch, Junghoo Cho, and Christopher Olston. 2011. Search result diversity for informational queries. In Proceedings of the 20th International Conference on World Wide Web. 237--246.
[31]
Cheng Xiang Zhai, William W. Cohen, and John Lafferty. 2003. Beyond independent relevance: Methods and evaluation metrics for subtopic retrieval. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 10--17.
[32]
Xiaojin Zhu, Andrew B. Goldberg, Jurgen Van Gael, and David Andrzejewski. 2007. Improving diversity in ranking using absorbing random walks. In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics. 97--104.
[33]
Xiaofei Zhu, Jiafeng Guo, Xueqi Cheng, Pan Du, and Hua-Wei Shen. 2011. A unified framework for recommending diverse and relevant queries. In Proceedings of the 20th International Conference on World Wide Web. 37--46.
[34]
Cai-Nicolas Ziegler, Sean M. McNee, Joseph A. Konstan, and Georg Lausen. 2005. Improving recommendation lists through topic diversification. In Proceedings of the 14th International Conference on World Wide Web. 22--32.
[35]
Guido Zuccon, Leif Azzopardi, Dell Zhang, and Jun Wang. 2012. Top-k retrieval using facility location analysis. In Proceedings of the 34th European Conference on Advances in Information Retrieval. 305--316.

Cited By

View all
  • (2024)Personalized global citation recommendation with diversification awarenessScientometrics10.1007/s11192-024-05057-5129:7(3625-3657)Online publication date: 1-Jul-2024
  • (2023)Exploiting Contextual Word Embedding for Identification of Important Citations: Incorporating Section-Wise Citation Counts and Metadata FeaturesIEEE Access10.1109/ACCESS.2023.332003811(114044-114060)Online publication date: 2023
  • (2023)Personalized literature recommendation based on heterogeneous entity academic networkJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2023.10164935:8(101649)Online publication date: Sep-2023
  • Show More Cited By

Index Terms

  1. Diversifying Citation Recommendations

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Intelligent Systems and Technology
    ACM Transactions on Intelligent Systems and Technology  Volume 5, Issue 4
    Special Sections on Diversity and Discovery in Recommender Systems, Online Advertising and Regular Papers
    January 2015
    390 pages
    ISSN:2157-6904
    EISSN:2157-6912
    DOI:10.1145/2699158
    • Editor:
    • Huan Liu
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 December 2014
    Accepted: 01 November 2013
    Revised: 01 July 2013
    Received: 01 September 2012
    Published in TIST Volume 5, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Bibliographic search
    2. direction awareness
    3. diversity

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)15
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 01 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Personalized global citation recommendation with diversification awarenessScientometrics10.1007/s11192-024-05057-5129:7(3625-3657)Online publication date: 1-Jul-2024
    • (2023)Exploiting Contextual Word Embedding for Identification of Important Citations: Incorporating Section-Wise Citation Counts and Metadata FeaturesIEEE Access10.1109/ACCESS.2023.332003811(114044-114060)Online publication date: 2023
    • (2023)Personalized literature recommendation based on heterogeneous entity academic networkJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2023.10164935:8(101649)Online publication date: Sep-2023
    • (2023)An anatomization of research paper recommender systemEngineering Applications of Artificial Intelligence10.1016/j.engappai.2022.105641118:COnline publication date: 1-Feb-2023
    • (2022)Incremental Refinement of Relevance Rankings: Introducing a New Method Supported with Pennant RetrievalTurk Kutuphaneciligi - Turkish Librarianship10.24146/tk.1062751Online publication date: 10-Apr-2022
    • (2021)A New Citation Recommendation Strategy Based on Term Functions in Related Studies SectionJournal of Data and Information Science10.2478/jdis-2021-00226:3(75-98)Online publication date: 9-May-2021
    • (2020)A review of citation recommendation: from textual content to enriched contextScientometrics10.1007/s11192-019-03336-0Online publication date: 3-Jan-2020
    • (2019)Geographic Diversification of Recommended POIs in Frequently Visited AreasACM Transactions on Information Systems10.1145/336250538:1(1-39)Online publication date: 17-Oct-2019
    • (2019)SwICS: Section-Wise In-Text Citation ScoreIEEE Access10.1109/ACCESS.2019.29423227(137090-137102)Online publication date: 2019
    • (2018)Recommending Scientific PapersProceedings of the 1st International Conference on Digital Tools & Uses Congress10.1145/3240117.3240123(1-4)Online publication date: 3-Oct-2018
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media