Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Analyzing, Detecting, and Exploiting Sentiment in Web Queries

Published: 01 December 2013 Publication History

Abstract

The Web contains an increasing amount of biased and opinionated documents on politics, products, and polarizing events. In this article, we present an indepth analysis of Web search queries for controversial topics, focusing on query sentiment. To this end, we conduct extensive user assessments and discriminative term analyses, as well as a sentiment analysis using the SentiWordNet thesaurus, a lexical resource containing sentiment annotations. Furthermore, in order to detect the sentiment expressed in queries, we build different classifiers based on query texts, query result titles, and snippets. We demonstrate the virtue of query sentiment detection in two different use cases. First, we define a query recommendation scenario that employs sentiment detection of results to recommend additional queries for polarized queries issued by search engine users. The second application scenario is controversial topic discovery, where query sentiment classifiers are employed to discover previously unknown topics that trigger both highly positive and negative opinions among the users of a search engine. For both use cases, the results of our evaluations on real-world data are promising and show the viability and potential of query sentiment analysis in practical scenarios.

References

[1]
Ahmad, K. 2011. Affective Computing and Sentiment Analysis: Emotion, Metaphor and Terminology (Text, Speech and Language Technology) 1st Ed. Springer.
[2]
Aktolga, E. and Allan, J. 2011. Reranking search results for sparse queries. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management. 173--182.
[3]
Allan, J. 2002. Topic Detection and Tracking: Event-Based Information Organization. Kluwer Academic Publishers.
[4]
Anagnostopoulos, A., Becchetti, L., Castillo, C., and Gionis, A. 2010. An optimization framework for query recommendation. In Proceedings of the 3rd ACM International Conference on Web Search and Data Mining. 161--170.
[5]
Awadallah, R., Ramanath, M., and Weikum, G. 2012. Harmony and dissonance: Organizing the people’s voices on political controversies. In Proceedings of the 5th ACM International Conference on Web Search and Data Mining. 523--532.
[6]
Baeza-Yates, R., Hurtado, C., and Mendoza, M. 2004. Query recommendation using query logs in search engines. In Proceedings of the International Conference on Current Trends in Database Technology. Lecture Notes in Computer Science, vol. 3268. Springer-Verlag, Berlin, Heidelberg, 588--596.
[7]
Bar-Yossef, Z. and Gurevich, M. 2008. Mining search engine query logs via suggestion sampling. Proc. VLDB Endow. 1, 1, 54--65.
[8]
Bar-Yossef, Z. and Kraus, N. 2011. Context-sensitive query auto-completion. In Proceedings of the 20th International Conference on World Wide Web. 107--116.
[9]
Bermingham, A. and Smeaton, A. F. 2010. Classifying sentiment in microblogs: Is brevity an advantage? In Proceedings of the 19th ACM International Conference on Information and Knowledge Management. 1833--1836.
[10]
Broccolo, D., Marcon, L., Nardini, F. M., Perego, R., and Silvestri, F. 2012. Generating suggestions for queries in the long tail with an inverted index. Inf. Process. Manage. 48, 2, 326--339.
[11]
Broder, A. 2002. A taxonomy of Web search. SIGIR Forum 36, 2, 3--10.
[12]
Broder, A. Z., Fontoura, M., Gabrilovich, E., Joshi, A., Josifovski, V., and Zhang, T. 2007. Robust classification of rare queries using Web knowledge. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 231--238.
[13]
Cao, H., Hu, D. H., Shen, D., Jiang, D., Sun, J.-T., Chen, E., and Yang, Q. 2009. Context-aware query classification. In Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 3--10.
[14]
Chang, C.-C. and Lin, C.-J. 2011. Libsvm: A library for support vector machines. ACM Trans. Intell. Syst. Technol. 2, 3, 27:1--27:27.
[15]
Chelaru, S., Altingovde, I. S., and Siersdorfer, S. 2012. Analyzing the polarity of opinionated queries. In Proceedings of the 34th European Conference on IR Research. Lecture Notes in Computer Science, vol. 7224. Springer-Verlag, Berlin, Heidelberg, 463--467.
[16]
Demartini, G. and Siersdorfer, S. 2010. Dear search engine: What’s your opinion about...?: Sentiment analysis for semantic enrichment of Web search results. In Proceedings of the 3rd International Semantic Search Workshop. 4:1--4:7.
[17]
Denecke, K. 2009. Are sentiwordnet scores suited for multi-domain sentiment classification? In Proceedings of the 4th IEEE International Conference on Digital Information Management. 33--38.
[18]
Esuli, A. and Sebastiani, F. 2006. Sentiwordnet: A publicly available lexical resource for opinion mining. In Proceedings of the 5th Conference on Language Resources and Evaluation. 417--422.
[19]
Fellbaum, C., Ed. 1998. WordNet: An Electronic Lexical Database. MIT Press, Cambridge, MA.
[20]
Fonseca, B. M., Golgher, P. B., de Moura, E. S., and Ziviani, N. 2003. Using association rules to discover search engines related queries. In Proceedings of the 1st Conference on Latin American Web Congress. IEEE Computer Society, 66--71.
[21]
Goorha, S. and Ungar, L. 2010. Discovery of significant emerging trends. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 57--64.
[22]
Gwet, K. 2010. Handbook of Inter-Rater Reliability 2nd Ed. Advanced Analytics, LLC.
[23]
Gyllstrom, K. and Moens, M.-F. 2011. Clash of the typings: Finding controversies and children’s topics within queries. In Proceedings of the 33rd European Conference on IR Research. Lecture Notes in Computer Science, vol. 6611. Springer-Verlag, Berlin, Heidelberg, 80--91.
[24]
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., and Witten, I. H. 2009. The weka data mining software: An update. SIGKDD Explor. 11, 1, 10--18.
[25]
Hatzivassiloglou, V. and McKeown, K. 1995. A quantitative evaluation of linguistic tests for the automatic prediction of semantic markedness. In Proceedings of the 33rd Annual Meeting of the ACL. 197--204.
[26]
Hatzivassiloglou, V. and McKeown, K. 1997. Predicting the semantic orientation of adjectives. In Proceedings of the 35th Annual Meeting of the ACL. 174--181.
[27]
Jain, A. and Mishne, G. 2010. Organizing query completions for Web search. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management. 1169--1178.
[28]
Kang, I.-H. and Kim, G. 2003. Query type classification for Web document retrieval. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 64--71.
[29]
Kittur, A., Suh, B., Pendleton, B. A., and Chi, E. H. 2007. He says, she says: Conflict and coordination in Wikipedia. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 453--462.
[30]
Kucuktunc, O., Cambazoglu, B. B., Weber, I., and Ferhatosmanoglu, H. 2012. A large-scale sentiment analysis for Yahoo! answers. In Proceedings of the 5th ACM International Conference on Web Search and Data Mining. 633--642.
[31]
Li, X., Wang, Y.-Y., and Acero, A. 2008. Learning query intent from regularized click graphs. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 339--346.
[32]
Manning, C. D., Raghavan, P., and Schütze, H. 2008. Introduction to Information Retrieval. Cambridge University Press.
[33]
O’Connor, B., Balasubramanyan, R., Routledge, B. R., and Smith, N. A. 2010. From tweets to polls: Linking text sentiment to public opinion time series. In Proceedings of the 4th International Conference on Weblogs and Social Media.
[34]
Orimaye, S. O., Alhashmi, S. M., and Siew, E.-G. 2011. Frequency of sentential contexts vs. frequency of query terms in opinion retrieval. In Proceedings of the 7th International Conference on Web Information Systems and Technologies, J. Cordeiro and J. Filipe Eds., SciTePress, 607--610.
[35]
Pak, A. and Paroubek, P. 2010. Twitter as a corpus for sentiment analysis and opinion mining. In Proceedings of the 7th Conference on International Language Resources and Evaluation.
[36]
Pan, S. J., Ni, X., Sun, J.-T., Yang, Q., and Chen, Z. 2010. Cross-domain sentiment classification via spectral feature alignment. In Proceedings of the 19th International Conference on World Wide Web. 751--760.
[37]
Pang, B. and Lee, L. 2008. Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2, 1--2.
[38]
Pang, B., Lee, L., and Vaithyanathan, S. 2002. Thumbs up?: Sentiment classification using machine learning techniques. In Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing - Volume 10. Association for Computational Linguistics, 79--86.
[39]
Pass, G., Chowdhury, A., and Torgeson, C. 2006. A picture of search. In Proceedings of the 1st International Conference on Scalable Information Systems.
[40]
Pera, M. S., Qumsiyeh, R., and Ng, Y.-K. 2011. A query-based multi-document sentiment summarizer. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management. 1071--1076.
[41]
Preis, T., Moat, H. S., and Stanley, H. E. 2013. Quantifying trading behavior in financial markets using Google trends. Sci. Rep. 3.
[42]
Radlinski, F., Szummer, M., and Craswell, N. 2010. Inferring query intent from reformulations and clicks. In Proceedings of the 19th International Conference on World Wide Web. 1171--1172.
[43]
Ripberger, J. T. 2011. Capturing curiosity: Using internet search trends to measure public attentiveness. Policy Stud. J. 39, 2, 239--259.
[44]
Shen, D., Li, Y., Li, X., and Zhou, D. 2009. Product query classification. In Proceedings of the 18th ACM Conference on Information and Knowledge Management. 741--750.
[45]
Shokouhi, M. and Radinsky, K. 2012. Time-sensitive query auto-completion. In Proceedings of the 35th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 601--610.
[46]
Siersdorfer, S., Chelaru, S., Nejdl, W., and San Pedro, J. 2010. How useful are your comments?: Analyzing and predicting Youtube comments and comment ratings. In Proceedings of the 19th International Conference on World Wide Web. 891--900.
[47]
Song, Y., Zhou, D., and He, L.-w. 2011. Post-ranking query suggestion by diversifying search results. In Proceedings of the 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 815--824.
[48]
Szpektor, I., Gionis, A., and Maarek, Y. 2011. Improving recommendation for long-tail queries via templates. In Proceedings of the 20th International Conference on World Wide Web. 47--56.
[49]
Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., and Kappas, A. 2010. Sentiment in short strength detection informal text. J. Am. Soc. Inf. Sci. Technol. 61, 12, 2544--2558.
[50]
Thomas, M., Pang, B., and Lee, L. 2006. Get out the vote: Determining support or opposition from congressional floor-debate transcripts. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 327--335.
[51]
Turney, P. D. 2002. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics. 417--424.
[52]
Turney, P. D. and Littman, M. L. 2002. Unsupervised learning of semantic orientation from a hundred-billion-word corpus. Tech. rep. egb-1094. National Research Council Canada.
[53]
Vuong, B.-Q., Lim, E.-P., Sun, A., Le, M.-T., Lauw, H. W., and Chang, K. 2008. On ranking controversies in Wikipedia: Models and evaluation. In Proceedings of the International Conference on Web Search and Data Mining. 171--182.
[54]
Vural, A. G., Cambazoglu, B. B., and Senkul, P. 2012. Sentiment-focused Web crawling. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management. 2020--2024.
[55]
Weber, I., Garimella, V. R. K., and Borra, E. 2012. Mining Web query logs to analyze political issues. In Proceedings of the 3rd Annual ACM Web Science Conference. 330--334.
[56]
Wilkinson, E. 2012. Climate change: Environmental issues vs leadership. http://www.wateo.org/2012/01/02/climate-change-environmental-issues-vs-leadership-by-elisa-wilkinson/.
[57]
Zaragoza, H., Cambazoglu, B. B., and Baeza-Yates, R. A. 2010. Web search solved?: All result rankings the same? In Proceedings of the 19th ACM International Conference on Information and Knowledge Management. 529--538.

Cited By

View all
  • (2023)Semantic Web technologies and bias in artificial intelligence: A systematic literature reviewSemantic Web10.3233/SW-22304114:4(745-770)Online publication date: 24-Apr-2023
  • (2023)Into the Unknown: Exploration of Search Engines’ Responses to Users with Depression and AnxietyACM Transactions on the Web10.1145/358028317:4(1-29)Online publication date: 11-Jul-2023
  • (2023)Web Page Evaluation and Opinion Formation on Controversial Search TopicsLeveraging Generative Intelligence in Digital Libraries: Towards Human-Machine Collaboration10.1007/978-981-99-8085-7_17(188-203)Online publication date: 4-Dec-2023
  • Show More Cited By

Index Terms

  1. Analyzing, Detecting, and Exploiting Sentiment in Web Queries

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on the Web
    ACM Transactions on the Web  Volume 8, Issue 1
    December 2013
    204 pages
    ISSN:1559-1131
    EISSN:1559-114X
    DOI:10.1145/2560539
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 December 2013
    Accepted: 01 September 2013
    Revised: 01 September 2013
    Received: 01 September 2012
    Published in TWEB Volume 8, Issue 1

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Opinionated queries
    2. Web search
    3. sentiment analysis

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)16
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 17 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Semantic Web technologies and bias in artificial intelligence: A systematic literature reviewSemantic Web10.3233/SW-22304114:4(745-770)Online publication date: 24-Apr-2023
    • (2023)Into the Unknown: Exploration of Search Engines’ Responses to Users with Depression and AnxietyACM Transactions on the Web10.1145/358028317:4(1-29)Online publication date: 11-Jul-2023
    • (2023)Web Page Evaluation and Opinion Formation on Controversial Search TopicsLeveraging Generative Intelligence in Digital Libraries: Towards Human-Machine Collaboration10.1007/978-981-99-8085-7_17(188-203)Online publication date: 4-Dec-2023
    • (2022)Identifying Argumentative Questions in Web Search LogsProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531864(2393-2399)Online publication date: 6-Jul-2022
    • (2022)BMPInformation Sciences: an International Journal10.1016/j.ins.2022.04.039603:C(262-288)Online publication date: 1-Jul-2022
    • (2022)Opinion mining in online social media: a surveySocial Network Analysis and Mining10.1007/s13278-021-00855-812:1Online publication date: 11-Jan-2022
    • (2021)Sentiment analysis-based method for matching creative agri-product scheme demanders and suppliers: A case study from ChinaComputers and Electronics in Agriculture10.1016/j.compag.2021.106196186(106196)Online publication date: Jul-2021
    • (2020)Hindi EmotionNetACM Transactions on Asian and Low-Resource Language Information Processing10.1145/338333019:4(1-35)Online publication date: 7-Jun-2020
    • (2020)Measurement of Hotel Service Quality Based on Online Comment Sentiment Analysis2020 Eighth International Conference on Advanced Cloud and Big Data (CBD)10.1109/CBD51900.2020.00025(89-95)Online publication date: Dec-2020
    • (2017)Study on smart care service for the aged based on context awareness2017 International Conference on Progress in Informatics and Computing (PIC)10.1109/PIC.2017.8359559(289-293)Online publication date: Dec-2017
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media