Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2872427.2883047acmotherconferencesArticle/Chapter ViewAbstractPublication PageswebconfConference Proceedingsconference-collections
research-article
Public Access

Behavior Driven Topic Transition for Search Task Identification

Published: 11 April 2016 Publication History
  • Get Citation Alerts
  • Abstract

    Search tasks in users' query sequences are dynamic and interconnected. The formulation of search tasks can be influenced by multiple latent factors such as user characteristics, product features and search interactions, which makes search task identification a challenging problem. In this paper, we propose an unsupervised approach to identify search tasks via topic membership along with topic transition probabilities, thus it becomes possible to interpret how user's search intent emerges and evolves over time. Moreover, a novel hidden semi-Markov model is introduced to model topic transitions by considering not only the semantic information of queries but also the latent search factors originated from user search behaviors. A variational inference algorithm is developed to identify remarkable search behavior patterns, typical topic transition tracks, and the topic membership of each query from query logs. The learned topic transition tracks and the inferred topic memberships enable us to identify both small search tasks, where a user searches the same topic, and big search tasks, where a user searches a series of related topics. We extensively evaluate the proposed approach and compare with several state-of-the-art search task identification methods on both synthetic and real-world query log data, and experimental results illustrate the effectiveness of our proposed model.

    References

    [1]
    E. Agichtein, R. W. White, S. T. Dumais, and P. N. Bennet. Search, interrupted: understanding and predicting search task continuation. In Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval, pages 315--324. ACM, 2012.
    [2]
    L. M. Aiello, D. Donato, U. Ozertem, and F. Menczer. Behavior-driven clustering of queries into topics. In Proceedings of the 20th ACM international conference on Information and knowledge management, CIKM '11, pages 1373--1382, New York, NY, USA, 2011. ACM.
    [3]
    AOL. http://gregsadetsky.com/aol-data/.
    [4]
    A. H. Awadallah, R. W. White, P. Pantel, S. T. Dumais, and Y.-M. Wang. Supporting complex search tasks. In Proceedings of the ACM International Conference on Information and Knowledge Management, ACM, New York, NY, pages 1--10, 2014.
    [5]
    R. Baeza-yates, C. Hurtado, and M. Mendoza. Query recommendation using query logs in search engines. In International Workshop on Clustering Information over the Web, Creete, pages 588--596. Springer, 2004.
    [6]
    V. Barbu and N. Limnios. Semi-Markov Chains and Hidden Semi-Markov Models Toward Applications: Their Use in Reliability and DNA Analysis. Springer Publishing Company, Incorporated, 1 edition, 2008.
    [7]
    D. Blei and M. Jordan. Variational inference for dirichlet process mixtures. In Bayesian Analysis, volume 1, pages 121--144, 2005.
    [8]
    A. Broder. A taxonomy of web search. In ACM Sigir forum, volume 36, pages 3--10. ACM, 2002.
    [9]
    H. Cao, D. Jiang, J. Pei, Q. He, Z. Liao, E. Chen, and H. Li. Context-aware query suggestion by mining click-through and session data. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 875--883. ACM, 2008.
    [10]
    A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological), pages 1--38, 1977.
    [11]
    N. Ford, D. Miller, and N. Moss. The role of individual differences in internet searching: An empirical study. Journal of the American Society for Information Science and technology, 52(12):1049--1066, 2001.
    [12]
    N. Ford, D. Miller, and N. Moss. Web search strategies and human individual differences: Cognitive and demographic factors, internet attitudes, and approaches. Journal of the american society for information science and technology, 56(7):741--756, 2005.
    [13]
    D. He and A. Göker. Detecting session boundaries from web user logs. In Proceedings of the BCS-IRSG 22nd annual colloquium on information retrieval research, pages 57--66, 2000.
    [14]
    D. He, A. Göker, and D. J. Harper. Combining evidence for automatic web session identification. Information Processing & Management, 38(5):727--742, 2002.
    [15]
    W. Hua, Y. Song, H. Wang, and X. Zhou. Identifying users' topical tasks in web search. In Proceedings of the sixth ACM international conference on Web search and data mining, pages 93--102. ACM, 2013.
    [16]
    L. Jie, S. Lamkhede, R. Sapra, E. Hsu, H. Song, and Y. Chang. A unified search federation system based on online user feedback. In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1195--1203. ACM, 2013.
    [17]
    F. Johansson, T. F\"ardig, V. Jethava, and S. Marinov. Intent-aware temporal query modeling for keyword suggestion. In Proceedings of the 5th Ph.D. Workshop on Information and Knowledge, PIKM '12, pages 83--86, New York, NY, USA, 2012. ACM.
    [18]
    R. Jones and K. L. Klinkner. Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs. In Proceedings of the 17th ACM conference on Information and knowledge management, pages 699--708. ACM, 2008.
    [19]
    A. Kotov, P. N. Bennett, R. W. White, S. T. Dumais, and J. Teevan. Modeling and analysis of cross-session search tasks. In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval, pages 5--14. ACM, 2011.
    [20]
    L. Li, H. Deng, A. Dong, Y. Chang, and H. Zha. Identifying and labeling search tasks via query-based hawkes processes. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 731--740. ACM, 2014.
    [21]
    L. Li, H. Deng, A. Dong, Y. Chang, H. Zha, and R. Baeza-Yates. Analyzing user's sequential behavior in query auto-completion via markov processes. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '15, pages 123--132, New York, NY, USA, 2015. ACM.
    [22]
    Z. Liao, Y. Song, L.-w. He, and Y. Huang. Evaluating the effectiveness of search task trails. In Proceedings of the 21st international conference on World Wide Web, pages 489--498. ACM, 2012.
    [23]
    C. Lucchese, S. Orlando, R. Perego, F. Silvestri, and G. Tolomei. Identifying task-based sessions in search engine query logs. In Proceedings of the fourth ACM international conference on Web search and data mining, pages 277--286. ACM, 2011.
    [24]
    S. Ozmutlua and C. G. Cosara. Defining topic boundaries in search engine transaction logs using genetic algorithms. In Applied Artificial Intelligence, volume 23, pages 910--931, 2009.
    [25]
    X. Shen, S. Dumais, and E. Horvitz. Analysis of topic dynamics in web search. In Special Interest Tracks and Posters of the 14th International Conference on World Wide Web, WWW '05, pages 1102--1103, New York, NY, USA, 2005. ACM.
    [26]
    A. Spink, S. Koshman, M. Park, C. Field, and B. J. Jansen. Multitasking web search on vivisimo. com. In Information Technology: Coding and Computing, 2005. ITCC 2005. International Conference on, volume 2, pages 486--490. IEEE, 2005.
    [27]
    A. Spink, M. Park, B. J. Jansen, and J. Pedersen. Multitasking during web search sessions. Information Processing & Management, 42(1):264--275, 2006.
    [28]
    H. Wang, Y. Song, M.-W. Chang, X. He, R. W. White, and W. Chu. Learning to extract cross-session search tasks. In Proceedings of the 22nd international conference on World Wide Web, pages 1353--1364. International World Wide Web Conferences Steering Committee, 2013.
    [29]
    R. W. White, P. N. Bennett, and S. T. Dumais. Predicting short-term interests using activity-based search context. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management, CIKM '10, pages 1009--1018, New York, NY, USA, 2010. ACM.
    [30]
    R. W. White, W. Chu, A. Hassan, X. He, Y. Song, and H. Wang. Enhancing personalized search by mining and modeling task behavior. In Proceedings of the 22nd international conference on World Wide Web, pages 1411--1420. International World Wide Web Conferences Steering Committee, 2013.
    [31]
    R. W. White, S. T. Dumais, and J. Teevan. Characterizing the influence of domain expertise on web search behavior. In Proceedings of the Second ACM International Conference on Web Search and Data Mining, pages 132--141. ACM, 2009.
    [32]
    B. M. Wildemuth. The effects of domain knowledge on search tactic formulation. Journal of the american society for information science and technology, 55(3):246--258, 2004.
    [33]
    S.-Z. Yu. Hidden semi-markov models. Artificial Intelligence, 174(2):215--243, 2009.
    [34]
    X. Zhang and P. Mitra. Learning topical transition probabilities in click through data with regression models. In Procceedings of the 13th International Workshop on the Web and Databases, WebDB '10, pages 11:1--11:6, New York, NY, USA, 2010. ACM.
    [35]
    Z. Zhang and O. Nasraoui. Mining search engine query logs for query recommendation. In Proceedings of the 15th international conference on World Wide Web, WWW '06, pages 1039--1040, New York, NY, USA, 2006. ACM.

    Cited By

    View all
    • (2023)Recommending tasks based on search queries and missionsNatural Language Engineering10.1017/S1351324923000219(1-25)Online publication date: 17-May-2023
    • (2022)Application of Transaction Minimization Artificial Intelligent Algorithm for Weblog Analysis2022 International Conference on Smart and Sustainable Technologies in Energy and Power Sectors (SSTEPS)10.1109/SSTEPS57475.2022.00052(175-177)Online publication date: Nov-2022
    • (2021)Extracting the Implicit Search States from Explicit Behavioral Signals in Complex Search TasksProceedings of the Association for Information Science and Technology10.1002/pra2.58758:1(854-856)Online publication date: 13-Oct-2021
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '16: Proceedings of the 25th International Conference on World Wide Web
    April 2016
    1482 pages
    ISBN:9781450341431

    Sponsors

    • IW3C2: International World Wide Web Conference Committee

    In-Cooperation

    Publisher

    International World Wide Web Conferences Steering Committee

    Republic and Canton of Geneva, Switzerland

    Publication History

    Published: 11 April 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Markov model
    2. search behavior
    3. search task identification

    Qualifiers

    • Research-article

    Funding Sources

    • NSF

    Conference

    WWW '16
    Sponsor:
    • IW3C2
    WWW '16: 25th International World Wide Web Conference
    April 11 - 15, 2016
    Québec, Montréal, Canada

    Acceptance Rates

    WWW '16 Paper Acceptance Rate 115 of 727 submissions, 16%;
    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)31
    • Downloads (Last 6 weeks)3
    Reflects downloads up to

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Recommending tasks based on search queries and missionsNatural Language Engineering10.1017/S1351324923000219(1-25)Online publication date: 17-May-2023
    • (2022)Application of Transaction Minimization Artificial Intelligent Algorithm for Weblog Analysis2022 International Conference on Smart and Sustainable Technologies in Energy and Power Sectors (SSTEPS)10.1109/SSTEPS57475.2022.00052(175-177)Online publication date: Nov-2022
    • (2021)Extracting the Implicit Search States from Explicit Behavioral Signals in Complex Search TasksProceedings of the Association for Information Science and Technology10.1002/pra2.58758:1(854-856)Online publication date: 13-Oct-2021
    • (2020)Guided exploration of user groupsProceedings of the VLDB Endowment10.14778/3397230.339724213:9(1469-1482)Online publication date: 26-Jun-2020
    • (2018)Search action sequence modeling with long short-term memory for search task success evaluationProceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial Intelligence10.5555/3504035.3504303(2200-2207)Online publication date: 2-Feb-2018
    • (2018)CA-LSTMThe 41st International ACM SIGIR Conference on Research & Development in Information Retrieval10.1145/3209978.3210087(1101-1104)Online publication date: 27-Jun-2018
    • (2018)Publication Popularity Modeling via Adversarial Learning of Profile-Specific Dynamic ProcessIEEE Access10.1109/ACCESS.2018.28096876(19984-19992)Online publication date: 2018
    • (2017)A probabilistic approach to event log completenessExpert Systems with Applications: An International Journal10.1016/j.eswa.2017.03.03980:C(263-272)Online publication date: 1-Sep-2017
    • (2017)Do Topic Shift and Query Reformulation Patterns Correlate in Academic Search?Advances in Information Retrieval10.1007/978-3-319-56608-5_12(146-159)Online publication date: 8-Apr-2017
    • (2016)Framing Mobile Information NeedsProceedings of the 25th ACM International on Conference on Information and Knowledge Management10.1145/2983323.2983654(2131-2136)Online publication date: 24-Oct-2016
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media