Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3487553.3524261acmconferencesArticle/Chapter ViewAbstractPublication PageswebconfConference Proceedingsconference-collections
poster

Effectiveness of Data Augmentation to Identify Relevant Reviews for Product Question Answering

Published: 16 August 2022 Publication History
  • Get Citation Alerts
  • Abstract

    With the rapid growth of e-commerce and an increasing number of questions posted on the Question Answer (QA) platforms of e-commerce websites, there is a need for providing automated answers to questions. In this paper, we use transformer-based review ranking models which provide a ranked list of reviews as a potential answer to a new question. Since no explicit training data is available, we exploit the product reviews along with available QA pairs to learn a relevance function between a question and a review sentence. Further, we present a data augmentation technique by fine-tuning the T5 model to generate new questions from customer reviews by considering the summary of the review as an answer and the review as the document. We conduct experiments on a real-world dataset from three categories in Amazon.com. To assess the performance of the models, we use the annotated question review dataset from RIKER [13]. Experimental results show that Deberta-RR model with the augmentation technique outperforms the current state-of-the-art model by 5.84%, 4.38%, 3.96%, and 2.96% on average in nDCG@1, nDCG@3, nDCG@5, and nDCG@10, respectively.

    References

    [1]
    Akari Asai and Hannaneh Hajishirzi. 2020. Logic-Guided Data Augmentation and Regularization for Consistent Question Answering. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.
    [2]
    Johannes Bjerva, Nikita Bhutani, Behzad Golshan, Wang-Chiew Tan, and Isabelle Augenstein. 2020. SubjQA: A Dataset for Subjectivity and Review Comprehension. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. 5480–5494.
    [3]
    Long Chen, Ziyu Guan, Wei Zhao, Wanqing Zhao, Xiaopeng Wang, Zhou Zhao, and Huan Sun. 2019. Answer Identification from Product Reviews for User Questions by Multi-Task Attentive Networks. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 45–52.
    [4]
    Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 4171–4186.
    [5]
    Pengcheng He, Xiaodong Liu, Jianfeng Gao, and Weizhu Chen. 2020. Deberta: Decoding-enhanced bert with disentangled attention. ArXiv abs/2006.03654(2020).
    [6]
    Julian McAuley and Alex Yang. 2016. Addressing Complex and Subjective Product-Related Queries with Customer Reviews. In Proceedings of the 25th International Conference on World Wide Web. 625–635.
    [7]
    Jianmo Ni, Jiacheng Li, and Julian McAuley. 2019. Justifying Recommendations using Distantly-Labeled Reviews and Fine-Grained Aspects. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 188–197.
    [8]
    Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. 2020. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. Journal of Machine Learning Research 21, 140 (2020), 1–67.
    [9]
    Stephen Robertson and Hugo Zaragoza. 2009. The Probabilistic Relevance Framework: BM25 and Beyond. Found. Trends Inf. Retr. 3, 4 (April 2009), 333–389.
    [10]
    Adams Wei Yu, David Dohan, Quoc Le, Thang Luong, Rui Zhao, and Kai Chen. 2018. Fast and Accurate Reading Comprehension by Combining Self-Attention and Convolution. In International Conference on Learning Representations, Vol. 2.
    [11]
    Qian Yu, Wai Lam, and Zihao Wang. 2018. Responding E-commerce Product Questions via Exploiting QA Collections and Reviews. In Proceedings of the 27th International Conference on Computational Linguistics. 2192–2203.
    [12]
    Shiwei Zhang, Jey Han Lau, Xiuzhen Zhang, Jeffrey Chan, and Cécile Paris. 2019. Discovering Relevant Reviews for Answering Product-Related Queries. In 2019 IEEE International Conference on Data Mining (ICDM). 1468–1473.
    [13]
    Jie Zhao, Ziyu Guan, and Huan Sun. 2019. Riker: Mining Rich Keyword Representations for Interpretable Product Question Answering. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 1389–1398.

    Cited By

    View all
    • (2023)A Comprehensive Review of Arabic Question Answering DatasetsNeural Information Processing10.1007/978-981-99-8126-7_22(278-289)Online publication date: 13-Nov-2023

    Index Terms

    1. Effectiveness of Data Augmentation to Identify Relevant Reviews for Product Question Answering

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          WWW '22: Companion Proceedings of the Web Conference 2022
          April 2022
          1338 pages
          ISBN:9781450391306
          DOI:10.1145/3487553
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 16 August 2022

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. Data Augmentation
          2. Product Question Answering
          3. Review Ranking

          Qualifiers

          • Poster
          • Research
          • Refereed limited

          Conference

          WWW '22
          Sponsor:
          WWW '22: The ACM Web Conference 2022
          April 25 - 29, 2022
          Virtual Event, Lyon, France

          Acceptance Rates

          Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)54
          • Downloads (Last 6 weeks)6
          Reflects downloads up to 09 Aug 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2023)A Comprehensive Review of Arabic Question Answering DatasetsNeural Information Processing10.1007/978-981-99-8126-7_22(278-289)Online publication date: 13-Nov-2023

          View Options

          Get Access

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format.

          HTML Format

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media