Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Effective Selection of a Compact and High-Quality Review Set with Information Preservation

Published: 10 December 2019 Publication History
  • Get Citation Alerts
  • Abstract

    Consumers increasingly make informed buying decisions based on reading online reviews for products and services. Due to the large volume of available online reviews, consumers hardly have the time and patience to read them all. This article aims to select a compact set of high-quality reviews that can cover a specific set of product features and related consumer sentiments. Selecting such a subset of reviews can significantly save the time spent on reading reviews while preserving the information needed. A unique review selection problem is defined and modeled as a bi-objective combinatorial optimization problem, which is then transformed into a minimum-cost set cover problem that is NP-complete. Several approximation algorithms are then designed, which can sustain performance guarantees in polynomial time. Our effective selection algorithms can also be upgraded to handle dynamic situations. Comprehensive experiments conducted on twelve real datasets demonstrate that the proposed algorithms significantly outperform benchmark methods by generating a more compact review set with much lower computational cost. The number of reviews selected is much smaller compared to the quantity of all available reviews, and the selection efficiency is deeply increased by accelerating strategies, making it very practical to adopt the methods in real-world online applications.

    Supplementary Material

    a15-chen-app.pdf (chen.zip)
    Supplemental movie, appendix, image and software files for, Effective Selectionof a Compact and High-Quality Review Set with Information Preservation

    References

    [1]
    Charu C. Aggarwal and ChengXiang Zhai. 2012. A survey of text clustering algorithms. In Mining Text Data. Springer, 77--128.
    [2]
    Stephan Borzsony, Donald Kossmann, and Konrad Stocker. 2001. The skyline operator. In Proc.eedings of the 17th IInternational Conference on Data Engineering. IEEE, Los Alamitos, CA, 421--430.
    [3]
    Roque Enrique López Condori and Thiago Alexandre Salgueiro Pardo. 2017. Opinion summarization methods: Comparing and extending extractive and abstractive approaches. Expert Systems with Applications 78 (2017), 124--134.
    [4]
    Dingzhu Du, Ker I. Ko, and Xiaodong Hu. 2011. Design and Analysis of Approximation Algorithms. Vol. 62. Springer Science & Business Media.
    [5]
    Matthias Ehrgott. 2006. Multicriteria Optimization. Springer Science & Business Media.
    [6]
    Matthias Ehrgott and Xavier Gandibleux. 2000. A survey and annotated bibliography of multiobjective combinatorial optimization. OR-Spektrum 22, 4 (2000), 425--460.
    [7]
    eMarketer. 2016. Consumers like reading online reviews, not writing them. https://www.emarketer.com/Article/Consumers-Like-Reading-Online-Reviews-Not-Writing-Them/1014242.
    [8]
    Mahak Gambhir and Vishal Gupta. 2017. Recent automatic text summarization techniques: A survey. Artificial Intelligence Review 47, 1 (2017), 1--66.
    [9]
    Anindya Ghose and Panagiotis G. Ipeirotis. 2011. Estimating the helpfulness and economic impact of product reviews: Mining text and reviewer characteristics. IEEE Transactions on Knowledge and Data Engineering 23, 10 (2011), 1498--1512.
    [10]
    Xunhua Guo, Qiang Wei, Guoqing Chen, Jin Zhang, and Dandan Qiao. 2017. Extracting representative information on intra-organizational blogging platforms. MIS Quarterly 41, 4 (2017), 1105--1127.
    [11]
    Hong Hong, Di Xu, G. Alan Wang, and Weiguo Fan. 2017. Understanding the determinants of online review helpfulness: A meta-analytic investigation. Decision Support Systems 102 (2017), 1--11.
    [12]
    David S. Johnson. 1973. Approximation algorithms for combinatorial problems. In Proceedings of the 5th ACM Symposium on Theory of Computing. ACM, New York, NY, 38--49.
    [13]
    Richard M. Karp. 1972. Reducibility among combinatorial problems. In Complexity of Computer Computations. Springer, 85--103.
    [14]
    Shirlee-Ann Knight and Janice Burn. 2005. Developing a framework for assessing information quality on the World Wide Web. Informing Science 8 (2005), 1--14.
    [15]
    Theodoros Lappas, Mark Crovella, and Evimaria Terzi. 2012. Selecting a characteristic set of reviews. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 832--840.
    [16]
    Jongwuk Lee and Seungwon Hwang. 2010. QSkycube: Efficient Skycube computation using point-based space partitioning. Proceedings of the VLDB Endowment 4, 3 (2010), 185--196.
    [17]
    Yang Liu, Xiangji Huang, Aijun An, and Xiaohui Yu. 2008. Modeling and predicting the helpfulness of online reviews. In Proceedings of the 8th International Conference on Data Mining (ICDM’08). IEEE, Los Alamitos, CA, 443--452.
    [18]
    Steven Loria, P. Keen, M. Honnibal, R. Yankovsky, D. Karesh, E. Dempsey, W. Childs, et al. 2018. TextBlob: Simplified text processing. Retrieved November 13, 2019 from https://textblob.readthedocs.io/en/dev/.
    [19]
    Baojun Ma, Qiang Wei, Guoqing Chen, Jin Zhang, and Xunhua Guo. 2017. Content and structure coverage: Extracting a diverse information subset. INFORMS J.ournal on Computing 29, 4 (2017), 660--675.
    [20]
    Thanh-Son Nguyen, Hady W. Lauw, and Panayiotis Tsaparas. 2015. Review selection using micro-reviews. IEEE Transactions on Knowledge and Data Engineering 27, 4 (2015), 1098--1111.
    [21]
    Thanh-Son Nguyen, Hady W. Lauw, and Panayiotis Tsaparas. 2017. Micro-review synthesis for multi-entity summarization. Data Mining and Knowledge Discovery 31, 5 (2017), 1189--1217.
    [22]
    Dimitris Papadias, Yufei Tao, Greg Fu, and Bernhard Seeger. 2005. Progressive skyline computation in database systems. ACM Transactions on Database Systems 30, 1 (2005), 41--82.
    [23]
    Debanjan Paul, Sudeshna Sarkar, Muthusamy Chelliah, Chetan Kalyan, and Prajit Prashant Sinai Nadkarni. 2017. Recommendation of high quality representative reviews in e-commerce. In Proceedings of the10th ACM Conference on Recommender Systems. ACM, New York, NY, 311--315.
    [24]
    Kim Schouten and Flavius Frasincar. 2015. Survey on aspect-level sentiment analysis. IEEE Transactions on Knowledge and Data Engineering 28, 3 (2015), 813--830.
    [25]
    Kian Lee Tan, Pin Kwang Eng, and Beng Chin Ooi. 2001. Efficient progressive skyline computation. In Proceedings of the 27th International Conference on Very Large Data Bases (VLDB’01), Vol. 1. 301--310.
    [26]
    Panayiotis Tsaparas, Alexandros Ntoulas, and Evimaria Terzi. 2011. Selecting a comprehensive set of reviews. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 168--176.
    [27]
    Vijay V. Vazirani. 2013. Approximation Algorithms. Springer Science 8 Business Media.
    [28]
    Nana Xu, Hongyan Liu, Jiawei Chen, Jun He, and Xiaoyong Du. 2014. Selecting a representative set of diverse quality reviews automatically. In Proceedings of the SIAM International Conference on Data Mining (SDM’14). 488--496.
    [29]
    Wenzhe Yu, Rong Zhang, Xiaofeng He, and Chaofeng Sha. 2013. Selecting a diversified set of reviews. In Proceedings of the Asia-Pacific Web Conference. 721--733.
    [30]
    Zunqiang Zhang, Guoqing Chen, Jin Zhang, Xunhua Guo, and Qiang Wei. 2016. Providing consistent opinions from online reviews: A heuristic stepwise optimization approach. INFORMS Journal on Computing 28, 2 (2016), 236--250.

    Cited By

    View all
    • (2023)A Review Selection Method Based on Consumer Decision Phases in E-commerceACM Transactions on Information Systems10.1145/358726542:1(1-27)Online publication date: 30-Mar-2023
    • (2022)Survey on Visual Analysis of Event Sequence DataIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2021.310041328:12(5091-5112)Online publication date: 1-Dec-2022
    • (2022)A deep recommendation model of cross-grained sentiments of user reviews and ratingsInformation Processing and Management: an International Journal10.1016/j.ipm.2021.10284259:2Online publication date: 1-Mar-2022
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Management Information Systems
    ACM Transactions on Management Information Systems  Volume 10, Issue 4
    December 2019
    98 pages
    ISSN:2158-656X
    EISSN:2158-6578
    DOI:10.1145/3374918
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 December 2019
    Accepted: 01 October 2019
    Revised: 01 August 2019
    Received: 01 December 2018
    Published in TMIS Volume 10, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Review selection
    2. approximation algorithms
    3. dynamic updating
    4. information preservation

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    • The MOE Project of Key Research Institute of Humanities and Social Sciences at Universities
    • National Natural Science Foundation of China

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)14
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 27 Jul 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)A Review Selection Method Based on Consumer Decision Phases in E-commerceACM Transactions on Information Systems10.1145/358726542:1(1-27)Online publication date: 30-Mar-2023
    • (2022)Survey on Visual Analysis of Event Sequence DataIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2021.310041328:12(5091-5112)Online publication date: 1-Dec-2022
    • (2022)A deep recommendation model of cross-grained sentiments of user reviews and ratingsInformation Processing and Management: an International Journal10.1016/j.ipm.2021.10284259:2Online publication date: 1-Mar-2022
    • (2022)An orthogonal-space-learning-based method for selecting semantically helpful reviewsElectronic Commerce Research and Applications10.1016/j.elerap.2022.10115453(101154)Online publication date: May-2022
    • (2021)POI Recommendation Method Using Deep Learning in Location-Based Social NetworksWireless Communications & Mobile Computing10.1155/2021/91208642021Online publication date: 1-Jan-2021

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media