DOI: 10.5555/1610075.1610091

Automatic classification of citation function

Published: 22 July 2006

Abstract

Citation function is defined as the author's reason for citing a given paper (e.g. acknowledgement of the use of the cited method). The automatic recognition of the rhetorical function of citations in scientific text has many applications, from improvement of impact factor calculations to text summarisation and more informative citation indexers. We show that our annotation scheme for citation function is reliable, and present a supervised machine learning framework to automatically classify citation function, using both shallow and linguistically-inspired features. We find, amongst other things, a strong relationship between citation function and sentiment classification.
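
The abstract does not say which learner or which features are used, so the following is only a minimal sketch of supervised classification from shallow features, not the authors' system. It assumes a word-unigram Naive Bayes model with add-one smoothing; the class name, the three labels ("use", "weakness", "neutral"), and the training sentences are all hypothetical and invented for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(sentence):
    # Shallow feature (assumption): lowercase word unigrams of the citing sentence.
    return sentence.lower().split()


class NaiveBayesCitationClassifier:
    """Hypothetical multinomial Naive Bayes classifier with add-one smoothing."""

    def fit(self, sentences, labels):
        self.label_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for sentence, label in zip(sentences, labels):
            self.word_counts[label].update(tokenize(sentence))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, sentence):
        total = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.label_counts.items():
            # Log prior plus smoothed log likelihood of each known word.
            score = math.log(n / total)
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for word in tokenize(sentence):
                if word in self.vocab:  # unseen words are skipped
                    score += math.log((self.word_counts[label][word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Invented toy data; CITE stands in for the citation marker.
TRAIN = [
    ("we use the parser of CITE", "use"),
    ("we adopt the method of CITE", "use"),
    ("the approach of CITE fails on long sentences", "weakness"),
    ("CITE suffers from data sparseness", "weakness"),
    ("CITE describe a related algorithm", "neutral"),
    ("parsing is discussed in CITE", "neutral"),
]

clf = NaiveBayesCitationClassifier().fit(*zip(*TRAIN))
print(clf.predict("we use the tagger of CITE"))
```

A real system would need an annotated corpus and richer features than unigrams; the toy data here only shows the shape of the training and prediction steps.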

Published In

EMNLP '06: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
July 2006
648 pages
ISBN: 1932432736

Publisher

Association for Computational Linguistics

United States

Qualifiers

  • Research-article

Acceptance Rates

EMNLP '06 paper acceptance rate: 73 of 234 submissions (31%)
Overall acceptance rate: 73 of 234 submissions (31%)

Article Metrics

  • Downloads (last 12 months): 129
  • Downloads (last 6 weeks): 16
Reflects downloads up to 24 Jan 2025

Cited By

  • (2024) The Impact of CHIIR Publications: A Study of Eight Years of CHIIR. Proceedings of the 2024 Conference on Human Information Interaction and Retrieval, pages 34-44. DOI: 10.1145/3627508.3638338. Online publication date: 10-Mar-2024
  • (2024) Mining Semantic Relations in Data References to Understand the Roles of Research Data in Academic Literature. Proceedings of the 2023 ACM/IEEE Joint Conference on Digital Libraries, pages 215-227. DOI: 10.1109/JCDL57899.2023.00039. Online publication date: 26-Jun-2024
  • (2024) CitePrompt: Using Prompts to Identify Citation Intent in Scientific Papers. Proceedings of the 2023 ACM/IEEE Joint Conference on Digital Libraries, pages 51-55. DOI: 10.1109/JCDL57899.2023.00017. Online publication date: 26-Jun-2024
  • (2023) Prompting Strategies for Citation Classification. Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, pages 1127-1137. DOI: 10.1145/3583780.3615018. Online publication date: 21-Oct-2023
  • (2023) Impact-Oriented Contextual Scholar Profiling using Self-Citation Graphs. Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pages 4572-4583. DOI: 10.1145/3580305.3599845. Online publication date: 6-Aug-2023
  • (2023) Neural machine translation for in-text citation classification. Journal of the Association for Information Science and Technology, 74:10, pages 1229-1240. DOI: 10.1002/asi.24817. Online publication date: 7-Sep-2023
  • (2021) Genealogical Tree Construction of Research Paper. Proceedings of the 3rd ACM India Joint International Conference on Data Science & Management of Data (8th ACM IKDD CODS & 26th COMAD), pages 435-435. DOI: 10.1145/3430984.3431056. Online publication date: 2-Jan-2021
  • (2020) Argument Mining. Computational Linguistics, 45:4, pages 765-818. DOI: 10.1162/coli_a_00364. Online publication date: 1-Jan-2020
  • (2020) An Authoritative Approach to Citation Classification. Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, pages 337-340. DOI: 10.1145/3383583.3398617. Online publication date: 1-Aug-2020
  • (2020) Identifying Referential Intention with Heterogeneous Contexts. Proceedings of The Web Conference 2020, pages 962-972. DOI: 10.1145/3366423.3380175. Online publication date: 20-Apr-2020
