Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2396761.2398627acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
poster

Infobox suggestion for Wikipedia entities

Published: 29 October 2012 Publication History

Abstract

Given the sheer amount of work and expertise required in authoring Wikipedia articles, automatic tools that help Wikipedia contributors in generating and improving content are valuable. This paper presents our initial step towards building a full-fledged author assistant, particularly for suggesting infobox templates for articles. We build SVM classifiers to suggest infobox template types, among a large number of possible types, to Wikipedia articles without infoboxes. Different from prior works on Wikipedia article classification which deal with only a few label classes for named entity recognition, the much larger 337-class setup in our study is geared towards realistic deployment of infobox suggestion tool. We also emphasize testing on articles without infoboxes, due to that labeled and unlabeled data exhibit different distributions of features, which departs from the typical assumption that they are drawn from the same underlying population.

References

[1]
K. Bollacker, C. Evans, P. Paritosh, T. Sturge, and J. Taylor. Freebase: a collaboratively created graph database for structuring human knowledge. In SIGMOD, 2008.
[2]
W. Dakka and S. Cucerzan. Augmenting Wikipedia with named entity tags. In IJCNLP, 2008.
[3]
M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten. The weka data mining software: an update. SIGKDD Explor. Newsl., 11(1), Nov. 2009.
[4]
R. Kaptein and J. Kamps. Using links to classify Wikipedia pages. Advances in Focused Retrieval, 2009.
[5]
Nadeau, David, Sekine, and Satoshi. A survey of named entity recognition and classification. Linguisticae Investigationes, 30(1):3--26, January 2007.
[6]
J. Nothman, J. R. Curran, and T. Murphy. Transforming Wikipedia into named entity training data. In Proceedings of the Australian Language Technology Workshop, 2008.
[7]
A. E. Richman, P. Schone, and F. G. G. Meade. Mining Wiki resources for multilingual named entity recognition. In ACL, 2008.
[8]
I. Saleh, K. Darwish, and A. Fahmy. Classifying wikipedia articles into ne's using svm's with threshold adjustment. In Proceedings of the 2010 Named Entities Workshop.
[9]
M. Tkatchenko, A. Ulanov, and A. Simanovsky. Classifying wikipedia entities into fine-grained classes. In International Conference on Data Engineering Workshops, 2011.
[10]
A. Toral and R. Munoz. A proposal to automatically build and maintain gazetteers for named entity recognition by using Wikipedia. In Proceedings of the EACL Workshop on New Text, 2006.
[11]
Y. Watanabe, M. Asahara, and Y. Matsumoto. A graph-based approach to named entity categorization in Wikipedia using conditional random fields. In EMNLP-CoNLL, 2007.
[12]
F. Wu and D. S. Weld. Autonomously semantifying wikipedia. In CIKM, 2007.
[13]
F. Wu and D. S. Weld. Automatically refining the wikipedia infobox ontology. In WWW, 2008.

Cited By

View all
  • (2019)Con2KG-A Large-scale Domain-Specific Knowledge GraphProceedings of the 30th ACM Conference on Hypertext and Social Media10.1145/3342220.3344931(287-288)Online publication date: 12-Sep-2019
  • (2017)"Tell me more" using Ladders in WikipediaProceedings of the 20th International Workshop on the Web and Databases10.1145/3068839.3068847(1-6)Online publication date: 14-May-2017
  • (2015)BibliographyMultimedia Ontology10.1201/b18639-16(237-259)Online publication date: 17-Jun-2015
  • Show More Cited By

Index Terms

  1. Infobox suggestion for Wikipedia entities

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management
    October 2012
    2840 pages
    ISBN:9781450311564
    DOI:10.1145/2396761
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 October 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. text classification
    2. wikipedia

    Qualifiers

    • Poster

    Conference

    CIKM'12
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 07 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2019)Con2KG-A Large-scale Domain-Specific Knowledge GraphProceedings of the 30th ACM Conference on Hypertext and Social Media10.1145/3342220.3344931(287-288)Online publication date: 12-Sep-2019
    • (2017)"Tell me more" using Ladders in WikipediaProceedings of the 20th International Workshop on the Web and Databases10.1145/3068839.3068847(1-6)Online publication date: 14-May-2017
    • (2015)BibliographyMultimedia Ontology10.1201/b18639-16(237-259)Online publication date: 17-Jun-2015
    • (2015)An Unsupervised Approach for Identifying the Infobox Template of Wikipedia ArticleProceedings of the 2015 IEEE 18th International Conference on Computational Science and Engineering (CSE)10.1109/CSE.2015.47(334-338)Online publication date: 21-Oct-2015
    • (2014)WiiClusterProceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management10.1145/2661829.2661840(2033-2035)Online publication date: 3-Nov-2014
    • (2013)Automatic Mapping of Wikipedia Templates for Fast Deployment of Localised DBpedia DatasetsProceedings of the 13th International Conference on Knowledge Management and Knowledge Technologies10.1145/2494188.2494196(1-8)Online publication date: 4-Sep-2013
    • (2013)Towards an Automatic Creation of Localized Versions of DBpediaProceedings of the 12th International Semantic Web Conference - Part I10.1007/978-3-642-41335-3_31(494-509)Online publication date: 21-Oct-2013

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media