Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3209978.3210032acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Cross-language Citation Recommendation via Hierarchical Representation Learning on Heterogeneous Graph

Published: 27 June 2018 Publication History
  • Get Citation Alerts
  • Abstract

    While the volume of scholarly publications has increased at a frenetic pace, accessing and consuming the useful candidate papers, in very large digital libraries, is becoming an essential and challenging task for scholars. Unfortunately, because of language barrier, some scientists (especially the junior ones or graduate students who do not master other languages) cannot efficiently locate the publications hosted in a foreign language repository. In this study, we propose a novel solution, cross-language citation recommendation via Hierarchical Representation Learning on Heterogeneous Graph (HRLHG), to address this new problem. HRLHG can learn a representation function by mapping the publications, from multilingual repositories, to a low-dimensional joint embedding space from various kinds of vertexes and relations on a heterogeneous graph. By leveraging both global (task specific) plus local (task independent) information as well as a novel supervised hierarchical random walk algorithm, the proposed method can optimize the publication representations by maximizing the likelihood of locating the important cross-language neighborhoods on the graph. Experiment results show that the proposed method can not only outperform state-of-the-art baseline models, but also improve the interpretability of the representation model for cross-language citation recommendation task.

    References

    [1]
    Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio . 2015. Neural machine translation by jointly learning to align and translate Proceedings of the International Conference on Learning Representations (ICLR).
    [2]
    Yoshua Bengio, Aaron Courville, and Pascal Vincent . 2013. Representation learning: A review and new perspectives. IEEE transactions on pattern analysis and machine intelligence Vol. 35, 8 (2013), 1798--1828.
    [3]
    Shaosheng Cao, Wei Lu, and Qiongkai Xu . 2015. Grarep: Learning graph representations with global structural information Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. ACM, 891--900.
    [4]
    Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li . 2007. Learning to rank: from pairwise approach to listwise approach Proceedings of the 24th international conference on Machine learning. ACM, 129--136.
    [5]
    José Augusto de Azevedo, Joaquim Jo ao ER Silvestre Madeira, Ernesto Q Vieira Martins, and Filipe Manuel A Pires . 1990. A shortest paths ranking algorithm. In Proceedings of the Annual Conference of Associazione Italiana di Ricerca Operativa: Models and Methods for Decision Support (AIRO'90). 1--8.
    [6]
    Yuxiao Dong, Nitesh V Chawla, and Ananthram Swami . 2017. metapath2vec: Scalable representation learning for heterogeneous networks Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 135--144.
    [7]
    Tao-yang Fu, Wang-Chien Lee, and Zhen Lei . 2017. HIN2Vec: Explore Meta-paths in Heterogeneous Information Networks for Representation Learning. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. ACM, 1797--1806.
    [8]
    Giorgio Gallo and Stefano Pallottino . 1986. Shortest path methods: A unifying approach. Netflow at Pisa (1986), 38--64.
    [9]
    Aditya Grover and Jure Leskovec . 2016. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 855--864.
    [10]
    Jiafeng Guo, Yixing Fan, Qingyao Ai, and W Bruce Croft . 2016. A deep relevance matching model for ad-hoc retrieval Proceedings of the 25th ACM International on Conference on Information and Knowledge Management. ACM, 55--64.
    [11]
    Qi He, Jian Pei, Daniel Kifer, Prasenjit Mitra, and Lee Giles . 2010. Context-aware citation recommendation. In Proceedings of the 19th international conference on World wide web. ACM, 421--430.
    [12]
    Zhuoren Jiang, Xiaozhong Liu, and Liangcai Gao . 2015. Chronological Citation Recommendation with Information-Need Shifting Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. ACM, 1291--1300.
    [13]
    Thomas N Kipf and Max Welling . 2016. Semi-Supervised Classification with Graph Convolutional Networks. arXiv preprint arXiv:1609.02907 (2016).
    [14]
    Ni Lao and William W Cohen . 2010. Relational retrieval using a combination of path-constrained random walks. Machine learning Vol. 81, 1 (2010), 53--67.
    [15]
    Xiaozhong Liu, Yingying Yu, Chun Guo, and Yizhou Sun . 2014. Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. ACM, 121--130.
    [16]
    Sean M McNee, Istvan Albert, Dan Cosley, Prateep Gopalkrishnan, Shyong K Lam, Al Mamunur Rashid, Joseph A Konstan, and John Riedl . 2002. On the recommending of citations for research papers Proceedings of the 2002 ACM conference on Computer supported cooperative work. ACM, 116--125.
    [17]
    Donald Metzler and W Bruce Croft . 2007. Linear feature-based models for information retrieval. Information Retrieval Vol. 10, 3 (2007), 257--274.
    [18]
    Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean . 2013 a. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).
    [19]
    Tomas Mikolov, Quoc V Le, and Ilya Sutskever . 2013 b. Exploiting similarities among languages for machine translation. arXiv preprint arXiv:1309.4168 (2013).
    [20]
    Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean . 2013 c. Distributed representations of words and phrases and their compositionality Advances in neural information processing systems. 3111--3119.
    [21]
    Bryan Perozzi, Rami Al-Rfou, and Steven Skiena . 2014. Deepwalk: Online learning of social representations Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 701--710.
    [22]
    Xiang Ren, Jialu Liu, Xiao Yu, Urvashi Khandelwal, Quanquan Gu, Lidan Wang, and Jiawei Han . 2014. Cluscite: Effective citation recommendation by information network-based clustering. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 821--830.
    [23]
    Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl . 2001. Item-based collaborative filtering recommendation algorithms Proceedings of the 10th international conference on World Wide Web. ACM, 285--295.
    [24]
    Xiaolin Shi, Jure Leskovec, and Daniel A McFarland . 2010. Citing for high impact. In Proceedings of the 10th annual joint conference on Digital libraries. ACM, 49--58.
    [25]
    Xiaoyuan Su and Taghi M Khoshgoftaar . 2009. A survey of collaborative filtering techniques. Advances in artificial intelligence Vol. 2009 (2009), 4.
    [26]
    Yizhou Sun, Jiawei Han, Xifeng Yan, Philip S Yu, and Tianyi Wu . 2011. Pathsim: Meta path-based top-k similarity search in heterogeneous information networks. Proceedings of the VLDB Endowment Vol. 4, 11 (2011), 992--1003.
    [27]
    Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei . 2015. Line: Large-scale information network embedding. In Proceedings of the 24th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 1067--1077.
    [28]
    Jie Tang and Jing Zhang . 2009. A discriminative approach to topic-based citation recommendation. Advances in Knowledge Discovery and Data Mining (2009), 572--579.
    [29]
    Xuewei Tang, Xiaojun Wan, and Xun Zhang . 2014. Cross-language context-aware citation recommendation in scientific articles. In Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval. ACM, 817--826.
    [30]
    Chengxiang Zhai and John Lafferty . 2001. A study of smoothing methods for language models applied to ad hoc information retrieval. In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 334--342.

    Cited By

    View all
    • (2024)A Graph-Based Keyword Extraction Method for Academic Literature Knowledge Graph ConstructionMathematics10.3390/math1209134912:9(1349)Online publication date: 29-Apr-2024
    • (2024)Professional Network Matters: Connections Empower Person-Job FitProceedings of the 17th ACM International Conference on Web Search and Data Mining10.1145/3616855.3635852(96-105)Online publication date: 4-Mar-2024
    • (2024)A Comprehensive Survey on Deep Graph Representation LearningNeural Networks10.1016/j.neunet.2024.106207173(106207)Online publication date: May-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGIR '18: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval
    June 2018
    1509 pages
    ISBN:9781450356572
    DOI:10.1145/3209978
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 27 June 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. citation recommendation
    2. cross-language
    3. heterogeneous graph representation learning

    Qualifiers

    • Research-article

    Funding Sources

    • Health & Medical Collaborative Innovation Project of Guangzhou City, China
    • Opening Project of State Key Laboratory of Digital Publishing Technology
    • National Natural Science Foundation of China
    • Guangdong Province Frontier and Key Technology Innovative Grant

    Conference

    SIGIR '18
    Sponsor:

    Acceptance Rates

    SIGIR '18 Paper Acceptance Rate 86 of 409 submissions, 21%;
    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)31
    • Downloads (Last 6 weeks)0

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A Graph-Based Keyword Extraction Method for Academic Literature Knowledge Graph ConstructionMathematics10.3390/math1209134912:9(1349)Online publication date: 29-Apr-2024
    • (2024)Professional Network Matters: Connections Empower Person-Job FitProceedings of the 17th ACM International Conference on Web Search and Data Mining10.1145/3616855.3635852(96-105)Online publication date: 4-Mar-2024
    • (2024)A Comprehensive Survey on Deep Graph Representation LearningNeural Networks10.1016/j.neunet.2024.106207173(106207)Online publication date: May-2024
    • (2024)Towards human-like perception: Learning structural causal model in heterogeneous graphInformation Processing & Management10.1016/j.ipm.2023.10360061:2(103600)Online publication date: Mar-2024
    • (2024)Graph Representation Learning for Recommendation Systems: A Short ReviewAdvances in Information Systems, Artificial Intelligence and Knowledge Management10.1007/978-3-031-51664-1_3(33-48)Online publication date: 20-Jan-2024
    • (2024)Citation Recommendation Employing Proximity-Based Heterogeneous Network EmbeddingsIntelligent Systems and Applications10.1007/978-3-031-47721-8_32(477-495)Online publication date: 10-Jan-2024
    • (2023)A Systematic Review of Citation Recommendation Over the Past Two DecadesInternational Journal on Semantic Web & Information Systems10.4018/IJSWIS.32407119:1(1-22)Online publication date: 1-Jun-2023
    • (2023)Top-N Knowledge Concept Recommendations in MOOCs Using a Neural Co-Attention ModelIEEE Access10.1109/ACCESS.2023.327860911(51214-51228)Online publication date: 2023
    • (2023)A new method for recommendation based on embedding spectral clustering in heterogeneous networks (RESCHet)Expert Systems with Applications10.1016/j.eswa.2023.120699231(120699)Online publication date: Nov-2023
    • (2023)Heterogeneous deep graph convolutional network with citation relational BERT for COVID-19 inline citation recommendationExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.118841213:PAOnline publication date: 1-Mar-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media