Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3397271.3401315acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

A Case Study of Multi-class Classification with Diversified Precision Recall Requirements for Query Disambiguation

Published: 25 July 2020 Publication History
  • Get Citation Alerts
  • Abstract

    We introduce a new metric for measuring the performance of multi-class classifiers. This metric is a generalization of the f1 score that is defined on binary classifiers, and offers significant improvement over other generalizations such as micro- and macro-averaging. In particular, one can select coefficients that weight the per-class precision and recall, as well as the overall class importance, with a robust mathematical interpretation. When certain parameters are selected our metric yields macro-averaged statistic as a special case. We demonstrate the efficacy of this metric on an application in genealogical search.

    References

    [1]
    Martin Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian Goodfellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Jozefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan Mané, Rajat Monga, Sherry Moore, Derek Murray, Chris Olah, Mike Schuster, Jonathon Shlens, Benoit Steiner, Ilya Sutskever, Kunal Talwar, Paul Tucker, Vincent Vanhoucke, Vijay Vasudevan, Fernanda Viégas, Oriol Vinyals, Pete Warden, Martin Wattenberg, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. http://tensorflow.org/ Software available from tensorflow.org.
    [2]
    Vincent Van Asch. 2013. Macro-and micro-averaged evaluation measures [ [ BASIC DRAFT ] ].
    [3]
    Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. (2018). https://arxiv.org/abs/1810.04805
    [4]
    Peng Jiang, Yingrui Yang, Gann Bierner, Fengjie Alex Li, Ruhan Wang, and Azadeh Moghtaderi. 2019. Family History Discovery through Search at Ancestry. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'19). Association for Computing Machinery, New York, NY, USA, 1389--1390. https://doi.org/10.1145/3331184.3331430
    [5]
    Juri Opitz and Sebastian Burst. 2019. Macro F1 and Macro F1. arXiv preprint arXiv:1911.03347 (2019).
    [6]
    Matthew E. Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee, and Luke Zettlemoyer. 2018. Deep contextualized word representations. (2018). https://arxiv.org/abs/1802.05365
    [7]
    Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever. 2018. Language Models are Unsupervised Multitask Learners. (2018). https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf
    [8]
    Tommaso Teofili. 2018. Deep Learning for Search. Manning Early Access Program.
    [9]
    C.J. Van Rijsbergen. 1979. Information Retrieval. Butterworths. 78040725 https://books.google.com/books?id=t-pTAAAAMAAJ

    Cited By

    View all
    • (2024)A comparative study of the neural network models for the stock market data classification—A multicriteria optimization approachExpert Systems with Applications10.1016/j.eswa.2023.122287238(122287)Online publication date: Mar-2024
    • (2023)DTLR-CS: Deep tensor low rank channel cross fusion neural network for reproductive cell segmentationPLOS ONE10.1371/journal.pone.029472718:11(e0294727)Online publication date: 30-Nov-2023
    • (2022)Comparative Analysis Between Macro and Micro-Accuracy in Imbalance Dataset for Movie Review ClassificationProceedings of Seventh International Congress on Information and Communication Technology10.1007/978-981-19-2394-4_8(83-93)Online publication date: 12-Jul-2022

    Index Terms

    1. A Case Study of Multi-class Classification with Diversified Precision Recall Requirements for Query Disambiguation

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
        July 2020
        2548 pages
        ISBN:9781450380164
        DOI:10.1145/3397271
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 25 July 2020

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. multi-class classification
        2. query disambiguation
        3. query intent

        Qualifiers

        • Short-paper

        Conference

        SIGIR '20
        Sponsor:

        Acceptance Rates

        Overall Acceptance Rate 792 of 3,983 submissions, 20%

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)47
        • Downloads (Last 6 weeks)12

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)A comparative study of the neural network models for the stock market data classification—A multicriteria optimization approachExpert Systems with Applications10.1016/j.eswa.2023.122287238(122287)Online publication date: Mar-2024
        • (2023)DTLR-CS: Deep tensor low rank channel cross fusion neural network for reproductive cell segmentationPLOS ONE10.1371/journal.pone.029472718:11(e0294727)Online publication date: 30-Nov-2023
        • (2022)Comparative Analysis Between Macro and Micro-Accuracy in Imbalance Dataset for Movie Review ClassificationProceedings of Seventh International Congress on Information and Communication Technology10.1007/978-981-19-2394-4_8(83-93)Online publication date: 12-Jul-2022

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media