Abstract
Before making high-consideration purchase decisions, shoppers generally need to identify and evaluate products’ key differentiating features or attributes. Many customers, however, lack the knowledge required to do so for all product domains. In this work, we investigate and analyze alternatives for identifying important product attributes, which customers can then use to compare candidate products. We propose an unsupervised attribute-ranking approach ReBARC, that combines both objective data from structured product catalogs, and subjective information from unstructured customer reviews, to suggest to the shopper the most important attributes to consider. Our detailed analysis of product attribute importance across various domains on a shopping website shows that ReBARC significantly outperforms prior efforts judged by both automated and human evaluation metrics. We also analyze the correlation and overlap between key product attributes detected by ReBARC, and those visible to customers during online product search.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bing, L., Wong, T., Lam, W.: Unsupervised extraction of popular product attributes from e-commerce web sites by considering customer reviews. ACM TOIT (2016)
Buhrmester, M., Kwang, T., Gosling, S.: Amazon’s mechanical Turk: a new source of inexpensive, yet high-quality data? (2016)
Cai, F., de Rijke, M.: A survey of query auto completion in information retrieval. Found. Trends Inf. Retrieval (2016)
Campos, R., Mangaravite, V., Pasquali, A., Jorge, A., Nunes, C., Jatowt, A.: Yake! keyword extraction from single documents using multiple local features. Inf. Sci. (2020)
Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: ACM SIGIR (1998)
Carmel, D., Lewin-Eytan, L., Maarek, Y.: Product question answering using customer generated content-research challenges. In: ACM SIGIR (2018)
Chen, G., Tian, Y., Song, Y.: Joint aspect extraction and sentiment analysis with directional graph convolutional networks. In: COLING (2020)
Chen, S., Li, C., Ji, F., Zhou, W., Chen, H.: Driven answer generation for product-related questions in e-commerce. In: ACM WSDM (2019)
Cohen, J.: A coefficient of agreement for nominal scales. Educ. Psychol. Meas. 20(1), 37–46 (1960)
Da’u, A., Salim, N.: Aspect extraction on user textual reviews using multi-channel convolutional neural network. PeerJ Comput. Sci. 5 (2019)
Ghani, R., Probst, K., Liu, Y., Krema, M., Fano, A.: Text mining for product attribute extraction. ACM SIGKDD Explor. Newsl. (2006)
Giannakopoulos, A., Musat, C., Hossmann, A., Baeriswyl, M.: Unsupervised aspect term extraction with B-LSTM & CRF using automatically labelled datasets. arXiv preprint arXiv:1709.05094 (2017)
He, R., Lee, W.S., Ng, H.T., Dahlmeier, D.: An unsupervised neural attention model for aspect extraction. In: ACL (2017)
Hirschmeier, S., Egger, M.: Social product search-enhancing product search with mined (sparse) product features (2018)
Huynh, V.P., Papotti, P.: A benchmark for fact checking algorithms built on knowledge bases. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp. 689–698 (2019)
Järvelin, K., Kekäläinen, J.: Cumulated gain-based evaluation of IR techniques. TOIS (2002)
Kozareva, Z., Li, Q., Zhai, K., Guo, W.: Recognizing salient entities in shopping queries. In: ACL (2016)
Lafferty, J., McCallum, A., Pereira, F.C.: Conditional random fields: probabilistic models for segmenting and labeling sequence data (2001)
Liu, B., Hu, M., Cheng, J.: Opinion observer: analyzing and comparing opinions on the web. In: WWW (2005)
Liu, Y., et al.: Roberta: a robustly optimized Bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019)
Luo, L., et al.: Unsupervised neural aspect extraction with sememes. In: IJCAI (2019)
More, A.: Attribute extraction from product titles in ecommerce. arXiv preprint arXiv:1608.04670 (2016)
Ni, J., Li, J., McAuley, J.: Justifying recommendations using distantly-labeled reviews and fine-grained aspects. In: EMNLP-IJCNLP (2019)
Petrovski, P., Bizer, C.: Extracting attribute-value pairs from product specifications on the web. In: International Conference on Web Intelligence (2017)
Pontiki, M., et al.: SemEval-2016 task 5: aspect based sentiment analysis. In: Proceedings of the 10th International Workshop on Semantic Evaluation (2016)
Pontiki, M., Galanis, D., Pavlopoulos, J., Papageorgiou, H., Androutsopoulos, I., Manandhar, S.: SemEval-2014 task 4: aspect based sentiment analysis. In: Proceedings of the 8th International Workshop on Semantic Evaluation (2014)
Popescu, A., Etzioni, O.: Extracting product features and opinions from reviews. In: Kao, A., Poteet, S.R. (eds.) Natural Language Processing and Text Mining. Springer, London (2007). https://doi.org/10.1007/978-1-84628-754-1_2
Probst, K., Ghani, R., Krema, M., Fano, A., Liu, Y.: Semi-supervised learning of attribute-value pairs from product descriptions. In: IJCAI (2007)
Putthividhya, D., Hu, J.: Bootstrapped named entity recognition for product attribute extraction. In: EMNLP (2011)
Reimers, N., Gurevych, I.: Sentence-Bert: sentence embeddings using Siamese Bert-networks. In: EMNLP-IJCNLP (2019)
Retail, T.: They say they want a revolution - price water house (2016). https://www.pwc.es/es/publicaciones/retail-y-consumo/assets/total-retail-2016.pdf
Socher, R., et al.: Recursive deep models for semantic compositionality over a sentiment treebank. In: EMNLP (2013)
Thorne, J., Vlachos, A.: Evidence-based factual error correction. arXiv preprint arXiv:2106.01072 (2021)
Tulkens, S., van Cranenburgh, A.: Embarrassingly simple unsupervised aspect extraction. ArXiv abs/2004.13580 (2020)
Vedula, N., Parthasarathy, S.: Face-keg: fact checking explained using knowledge graphs. In: Proceedings of the 14th ACM International Conference on Web Search and Data Mining, pp. 526–534 (2021)
Wu, B., Cheng, X., Wang, Y., Guo, Y., Song, L.: Simultaneous product attribute name and value extraction from web pages. In: ACM WI-IAT (2009)
Xu, H., Liu, B., Shu, L., Yu, P.S.: Double embeddings and CNN-based sequence labeling for aspect extraction. ArXiv abs/1805.04601 (2018)
Yang, Y., Chen, W., Li, Z., He, Z., Zhang, M.: Distantly supervised NER with partial annotation learning and reinforcement learning. In: International Conference on Computational Linguistics (2018)
Zheng, G., Mukherjee, S., Dong, X., Li, F.: OpenTag: open attribute value extraction from product profiles. In: ACM SIGKDD (2018)
Zhu, M.: Recall, precision and average precision. Department of Statistics and Actuarial Science, University of Waterloo, Waterloo (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Vedula, N., Collins, M., Agichtein, E., Rokhlenko, O. (2022). What Matters for Shoppers: Investigating Key Attributes for Online Product Comparison. In: Hagen, M., et al. Advances in Information Retrieval. ECIR 2022. Lecture Notes in Computer Science, vol 13186. Springer, Cham. https://doi.org/10.1007/978-3-030-99739-7_27
Download citation
DOI: https://doi.org/10.1007/978-3-030-99739-7_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-99738-0
Online ISBN: 978-3-030-99739-7
eBook Packages: Computer ScienceComputer Science (R0)