DOI: 10.1145/1273496.1273526
Article

Unsupervised prediction of citation influences

Published: 20 June 2007

Abstract

Publication repositories contain an abundance of information about the evolution of scientific research areas. We address the problem of creating a visualization of a research area that describes the flow of topics between papers, quantifies the impact that papers have on each other, and helps to identify key contributions. To this end, we devise a probabilistic topic model that explains the generation of documents; the model incorporates the aspects of topical innovation and topical inheritance via citations. We evaluate the model's ability to predict the strength of influence of citations against manually rated citations.
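
The abstract names the two ingredients of the model (topical innovation and topical inheritance via citations) without spelling out the generative process. The following is a minimal sketch of one way to read that description, not the authors' specification: each word of a citing document either inherits its topic from a cited paper, chosen in proportion to an influence weight, or draws it from the document's own innovation mixture. All names and parameters (`inherit_prob`, `influence`, `cited_topic_mix`, `innovation_topic_mix`, `topic_word`) and the toy numbers are assumptions made for illustration.

```python
import random


def sample_categorical(probs, rng):
    """Draw an index i with probability proportional to probs[i]."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def generate_citing_document(n_words, inherit_prob, influence,
                             cited_topic_mix, innovation_topic_mix,
                             topic_word, rng):
    """Generate word ids for one citing document (illustrative sketch only).

    inherit_prob         -- assumed probability that a word inherits its topic
    influence            -- assumed influence weight of each cited paper
    cited_topic_mix      -- one topic distribution per cited paper
    innovation_topic_mix -- the citing document's own topic distribution
    topic_word           -- one word distribution per topic
    """
    words = []
    inherited_from = [0] * len(influence)  # words attributed to each cited paper
    for _ in range(n_words):
        if rng.random() < inherit_prob:
            # Inheritance: pick a cited paper, then a topic from its mixture.
            cited = sample_categorical(influence, rng)
            topic = sample_categorical(cited_topic_mix[cited], rng)
            inherited_from[cited] += 1
        else:
            # Innovation: pick a topic from the document's own mixture.
            topic = sample_categorical(innovation_topic_mix, rng)
        words.append(sample_categorical(topic_word[topic], rng))
    return words, inherited_from


# Toy setup: 2 cited papers, 2 topics, vocabulary of 3 word ids.
rng = random.Random(0)
words, inherited_from = generate_citing_document(
    n_words=20,
    inherit_prob=0.7,
    influence=[0.8, 0.2],
    cited_topic_mix=[[0.9, 0.1], [0.2, 0.8]],
    innovation_topic_mix=[0.5, 0.5],
    topic_word=[[0.7, 0.2, 0.1], [0.1, 0.2, 0.7]],
    rng=rng,
)
print(words, inherited_from)
```

In a sketch like this, the per-paper counts in `inherited_from` play the role of the quantity one would want to infer: papers that account for more inherited words are the more influential citations. Inference itself (recovering these weights from observed text) is not shown here.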

    Published In

    ICML '07: Proceedings of the 24th international conference on Machine learning
    June 2007
    1233 pages
    ISBN: 9781595937933
    DOI: 10.1145/1273496
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    • Machine Learning Journal

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Qualifiers

    • Article

    Conference

    ICML '07 & ILP '07

    Acceptance Rates

    Overall Acceptance Rate 140 of 548 submissions, 26%

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months): 81
    • Downloads (Last 6 weeks): 17
    Reflects downloads up to 08 Feb 2025

    Citations

    Cited By

    • (2024) Flexible Distribution Approaches to Enhance Regression and Deep Topic Modelling Techniques. Expert Systems. DOI: 10.1111/exsy.13789. Online publication date: 25-Nov-2024
    • (2024) Investigating Safety Awareness in Assembly Operations via Mixed Reality Technology. IISE Transactions on Occupational Ergonomics and Human Factors, pp. 1-17. DOI: 10.1080/24725838.2024.2431112. Online publication date: 4-Dec-2024
    • (2023) Topic Selection Using Conceptual Distance: How to Select Topics that are Interesting but Unfamiliar to Users. IEEJ Journal of Industry Applications, 12:4, pp. 588-595. DOI: 10.1541/ieejjia.22006784. Online publication date: 1-Jul-2023
    • (2023) Does the Market of Citations Reward Reproducible Work? Proceedings of the 2023 ACM Conference on Reproducibility and Replicability, pp. 89-96. DOI: 10.1145/3589806.3600041. Online publication date: 27-Jun-2023
    • (2023) Advancing Multinomial Regression and Topic Modeling with Beta-Liouville Distributions. 2023 International Conference on Machine Learning and Applications (ICMLA), pp. 1928-1935. DOI: 10.1109/ICMLA58977.2023.00292. Online publication date: 15-Dec-2023
    • (2023) Revisiting Citation Prediction with Cluster-Aware Text-Enhanced Heterogeneous Graph Neural Networks. 2023 IEEE 39th International Conference on Data Engineering (ICDE), pp. 682-695. DOI: 10.1109/ICDE55515.2023.00058. Online publication date: Apr-2023
    • (2023) Generalized Dirichlet-Multinomial Regression: Leveraging Arbitrary Features for Topic Modelling. 2023 IEEE International Conference on High Performance Computing & Communications, Data Science & Systems, Smart City & Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys), pp. 884-891. DOI: 10.1109/HPCC-DSS-SmartCity-DependSys60770.2023.00128. Online publication date: 17-Dec-2023
    • (2023) New Data Representation and Simulate Over Social Media Using Bayes Classifier of Machine Learning. Energy Systems, Drives and Automations, pp. 515-526. DOI: 10.1007/978-981-99-3691-5_45. Online publication date: 21-Aug-2023
    • (2023) Extracting the evolutionary backbone of scientific domains: The semantic main path network analysis approach based on citation context analysis. Journal of the Association for Information Science and Technology, 74:5, pp. 546-569. DOI: 10.1002/asi.24748. Online publication date: 21-Mar-2023
    • (2022) Analyzing the generalizability of the network-based topic emergence identification method. Semantic Web, 13:3, pp. 423-439. DOI: 10.3233/SW-212951. Online publication date: 1-Jan-2022
