Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1645953.1646047acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Product query classification

Published: 02 November 2009 Publication History

Abstract

Web query classification is an effective way to understand Web user intents, which can further improve Web search and online advertising relevance. However, Web queries are usually very short which cannot fully reflect their meanings. What is more, it is quite hard to obtain enough training data for training accurate classifiers. Therefore, previous work on query classification has focused on two issues. One is how to represent Web queries through query expansion. The other is how to increase the amount of training data. In this paper, we took product query classification as an example, which is to classify Web queries into a predefined product taxonomy, and systematically studied the impact of query expansion and the size of training data. We proposed two methods of enriching Web queries and three approaches of collecting training data. Thereafter, we conducted a series of experiments to compare the classification performance of using different combinations of training data and query representations over a real data set. The data set consists of hundreds of thousands queries collected from a popular commercial search engine. From the experiments, we found some interesting observations, which were not discussed before. Finally, we proposed an effective and efficient product query classification method based on our observations.

References

[1]
S. M. Beitzel, E. C. Jensen, D. D. Lewis, A. Chowdhury, and O. Frieder. Automatic classification of web queries using very large unlabeled query logs. TOIS, 25(2):9, 2007.
[2]
A. Broder. A taxonomy of web search. SIGIR Forum, 36(2):3--10, 2002.
[3]
A. Broder, M. Fontoura, E. Gabrilovich, A. Joshi, V. Josifovski, and T. Zhang. Robust classification of rare queries using web knowledge. In SIGIR '07, 2007.
[4]
S.-L. Chuang, H.-T. Pu, and L.-F. Chien. Automatic subject categorization of query terms for filtering sensitive queries in web multimedia search. In PCM '01, pages 825--830, 2001.
[5]
J. M. Crego, A. de Gispert, and J. B. Marino. The TALP ngram-based SMT system for IWSLT'05. In Proc. International Workshop Spoken Language Translation, Pittsburgh, 2005.
[6]
H. K. Dai, L. Zhao, Z. Nie, J.-R. Wen, L. Wang, and Y. Li. Detecting online commercial intention (oci). In WWW '06, pages 829--837, 2006.
[7]
L. Gravano, V. Hatzivassiloglou, and R. Lichtenstein. Categorizing web queries according to geographical locality. In CIKM '03, pages 325--333, 2003.
[8]
B. J. Jansen. The effect of query complexity on web searching results. Information Research, 6(1), 2000.
[9]
I.-H. Kang and G. Kim. Query type classification for web document retrieval. In SIGIR '03, pages 64--71, 2003.
[10]
U. Lee, Z. Liu, and J. Cho. Automatic identification of user goals in web search. In WWW '05, pages 391--400, New York, NY, USA, 2005. ACM Press.
[11]
X. Li, Y.-C. Ju, G. Zweig, and A. Acero. Language modeling for voice search: a machine translation approach. In Proc. International Conf. on Audio, Speech and Signal Processing, 2008.
[12]
X. Li, Y.-Y. Wang, and A. Acero. Learning query intent from regularized click graphs. In SIGIR '08, 2008.
[13]
R. Manmatha, A. Feng, and J. Allan. A critical examination of tdt's cost function. In SIGIR '02, pages 403--404, New York, NY, USA, 2002. ACM Press.
[14]
C. Manning and H. Schütze. Foundations of Statistical Natural Language Processing. MIT Press., Cambridge, Ma, 1999.
[15]
C. D. Manning, P. Raghavan, and H. Schütze. Introduction To Information Retrieval. Cambridge University Press, 2008.
[16]
B. Martins, M. J. Silva, S. Freitas, and A. P. Afonso. Handling locations in search engine queries. In GIR '06, 2006.
[17]
Q. Mei, D. Zhou, and K. Church. Query suggestion using hitting time. In CIKM '08, October 2008.
[18]
K. Nigam, J. Lafferty, and A. K. Mccallum. Using maximum entropy for text classification. pages 61--67, 1999.
[19]
H.-T. Pu, S.-L. Chuang, and C. Yang. Subject categorization of query terms for exploring web users' search interests. JASIST, 53(8):617--630, 2002.
[20]
C. V. Rijsbergen. Information Retrieval. Butterworths, London, second edition edition, 1979.
[21]
D. E. Rose and D. Levinson. Understanding user goals in web search. In WWW '04, pages 13--19, New York, NY, USA, 2004. ACM.
[22]
F. Sha and F. Pereira. Shallow parsing with conditional random fields. In NAACL '03, pages 134--141, Morristown, NJ, USA, 2003. Association for Computational Linguistics.
[23]
D. Shen, R. Pan, J.-T. Sun, J. J. Pan, K. Wu, J. Yin, and Q. Yang. Query enrichment for web-query classification. TOIS, 24(3):320--352, 2006.
[24]
A. Spink, D. Wolfram, M. B. J. Jansen, and T. Saracevic. Searching the web: the public and their queries. JASIST, 52(3):226--234, 2001.
[25]
L. Wang, C. Wang, X. Xie, J. Forman, Y. Lu, W.-Y. Ma, and Y. Li. Detecting dominant locations from search queries. In SIGIR '05, pages 424--431, 2005.

Cited By

View all
  • (2024)Towards Better Seach Query Classification with Distribution-Diverse Multi-Expert Knowledge Distillation in JD Ads SearchProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3680049(4786-4794)Online publication date: 21-Oct-2024
  • (2024)Hierarchical Query Classification in E-commerce SearchCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3648332(338-345)Online publication date: 13-May-2024
  • (2023)HCL4QC: Incorporating Hierarchical Category Structures Into Contrastive Learning for E-commerce Query ClassificationProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614907(3647-3656)Online publication date: 21-Oct-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management
November 2009
2162 pages
ISBN:9781605585123
DOI:10.1145/1645953
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 November 2009

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. product query classification
  2. query enrichment

Qualifiers

  • Research-article

Conference

CIKM '09
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)15
  • Downloads (Last 6 weeks)1
Reflects downloads up to 27 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Towards Better Seach Query Classification with Distribution-Diverse Multi-Expert Knowledge Distillation in JD Ads SearchProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3680049(4786-4794)Online publication date: 21-Oct-2024
  • (2024)Hierarchical Query Classification in E-commerce SearchCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3648332(338-345)Online publication date: 13-May-2024
  • (2023)HCL4QC: Incorporating Hierarchical Category Structures Into Contrastive Learning for E-commerce Query ClassificationProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614907(3647-3656)Online publication date: 21-Oct-2023
  • (2021)Modeling Across-Context Attention For Long-Tail Query Classification in E-commerceProceedings of the 14th ACM International Conference on Web Search and Data Mining10.1145/3437963.3441822(58-66)Online publication date: 8-Mar-2021
  • (2020)Query Classification with Multi-objective Backoff OptimizationProceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3397271.3401320(1925-1928)Online publication date: 25-Jul-2020
  • (2020)Query ClassificationQuery Understanding for Search Engines10.1007/978-3-030-58334-7_2(15-41)Online publication date: 2-Dec-2020
  • (2013)Analyzing, Detecting, and Exploiting Sentiment in Web QueriesACM Transactions on the Web10.1145/25355258:1(1-28)Online publication date: 1-Dec-2013
  • (2013)Cost-sensitive learning for large-scale hierarchical classificationProceedings of the 22nd ACM international conference on Information & Knowledge Management10.1145/2505515.2505582(1351-1360)Online publication date: 27-Oct-2013
  • (2013)Impact of query intent and search context on clickthrough behavior in sponsored searchKnowledge and Information Systems10.1007/s10115-012-0485-x34:2(425-452)Online publication date: 1-Feb-2013
  • (2012)Confidence-aware graph regularization with heterogeneous pairwise featuresProceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval10.1145/2348283.2348410(951-960)Online publication date: 12-Aug-2012
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media