Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Generating exact- and ranked partially-matched answers to questions in advertisements

Published: 01 November 2011 Publication History

Abstract

Taking advantage of the Web, many advertisements (ads for short) websites, which aspire to increase client's transactions and thus profits, offer searching tools which allow users to (i) post keyword queries to capture their information needs or (ii) invoke form-based interfaces to create queries by selecting search options, such as a price range, filled-in entries, check boxes, or drop-down menus. These search mechanisms, however, are inadequate, since they cannot be used to specify a natural-language query with rich syntactic and semantic content, which can only be handled by a question answering (QA) system. Furthermore, existing ads websites are incapable of evaluating arbitrary Boolean queries or retrieving partially-matched answers that might be of interest to the user whenever a user's search yields only a few or no results at all. In solving these problems, we present a QA system for ads, called CQAds, which (i) allows users to post a natural-language question Q for retrieving relevant ads, if they exist, (ii) identifies ads as answers that partially-match the requested information expressed in Q, if insufficient or no answers to Q can be retrieved, which are ordered using a similarity-ranking approach, and (iii) analyzes incomplete or ambiguous questions to perform the "best guess" in retrieving answers that "best match" the selection criteria specified in Q. CQAds is also equipped with a Boolean model to evaluate Boolean operators that are either explicitly or implicitly specified in Q, i.e., with or without Boolean operators specified by the users, respectively. CQAds is easy to use, scalable to all ads domains, and more powerful than search tools provided by existing ads websites, since its query-processing strategy retrieves relevant ads of higher quality and quantity. We have verified the accuracy of CQAds in retrieving ads on eight ads domains and compared its ranking strategy with other well-known ranking approaches.

References

[1]
B. Allison. An improved hierarchical bayesian model of language for document classification. In COLING, pages 25--32, 2008.
[2]
M. Bilotti, P. Oglivie, J. Callan, and E. Nyberg. Structured retrieval for question answering. In ACM SIGIR, pages 351--358, 2007.
[3]
R. Burke, K. Hammnod, V. Kulyykin, S. Lytinen, N. Tomuro, and S. Schoenberg. Question answering from frequently-asked question files: Experiences with the faq finder system. AI Magazine, 18(2), 1997.
[4]
H. Chung, Y. Song, K. Han, D. Yoon, J. Lee, and H. Rim. A practical qa system in restricted domains. In ACL Workshop on QA in Restricted Domains, pages 39--45, 2004.
[5]
W. Croft, D. Metzler, and T. Strohman. Search Engines: Information Retrieval in Practice. Addison Wesley, 2010.
[6]
G. Das, V. Hristidis, N. Kapoor, and S. Sudarshan. Ordering the attributes of query results. In ACM SIGMOD, pages 395--406, 2006.
[7]
P. Dubois. MySQL, 3 rd Ed. Pearson Education, 2005.
[8]
O. Ferrandez, R. Izquierdo, S. Ferrandez, and J. Vicedo. Addressing ontology-based question answering with collections of user queries. IPM, 45(2): 175--188, 2009.
[9]
iProspect. iprospect.com/premiumPDFs/WhitePaper_2006_SearchEngineUserBehavior.pdf, 2006.
[10]
W. Kießling. Foundations of preferences in database systems. In VLDB, pages 311--322, 2002.
[11]
J. Koberstein and Y.-K. Ng. Using word clusters to detect similar web documents. In KSEM, pages 215--228, 2006.
[12]
C. Manning, P. Raghavan, and H. Schutze. Introduction to Information Retrieval. Cambridge University Press, 2008.
[13]
X. Meng, Z. Ma, and L. Yan. Answering approximate queries over autonomous web databases. In WWW, pages 1021--1030, 2009.
[14]
D. Moldovan, M. Paşca, S. Harabagiu, and M. Surdeanu. Performance issues and error analysis in an open-domain question answering system. TOIS, 21(2):133--154, 2003.
[15]
U. Nambiar and S. Kambhampati. Answering imprecise queries over autonomous web databases. In ICDE, pages 45--54, 2006.
[16]
M. Pera, W. Lund, and Y.-K. Ng. A sophisticated library search strategy using folksonomies and similarity matches. JASIST, 60(7):1392--1406, 2009.
[17]
M. Pera, R. Qumsiyeh, and Y.-K. Ng. Web-based closed-domain data extraction on online advertisements. Submitted to Journal of Information Systems, July 2011.
[18]
L. Pizzato and D. Molla. Indexing on semantic roles for question answering. In COLING, pages 74--81, 2008.
[19]
N. Ross and D. Wolfram. End-user searching on the internet: An analysis of term pair topics submitted to the excite search engine. JASIS, 51(10):949--958, 2002.
[20]
R. Terol, P. Martinez-Barco, and M. Palomar. A knowledge based method for the medical question answering problem. Computers in Biology and Medicine, 10:1511--1521, 2007.
[21]
M. Vargas-Vera and M. Lytras. Aqua: A closed-domain question answering system. ISM, 27(3):217--225, 2010.
[22]
D. Wang. A domain-specific question answering system based on ontology and question templates. In ACIS SNPD, pages 151--156, 2010.
[23]
Z. Wang and X. Luo. A semantic pattern for restricted domain chinese question answering. In IMCLC, pages 1333--1338, 2009.

Cited By

View all
  • (2013)A group recommender for movies based on content similarity and popularityInformation Processing and Management: an International Journal10.1016/j.ipm.2012.07.00749:3(673-687)Online publication date: 1-May-2013
  1. Generating exact- and ranked partially-matched answers to questions in advertisements

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 5, Issue 3
    November 2011
    117 pages

    Publisher

    VLDB Endowment

    Publication History

    Published: 01 November 2011
    Published in PVLDB Volume 5, Issue 3

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 09 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2013)A group recommender for movies based on content similarity and popularityInformation Processing and Management: an International Journal10.1016/j.ipm.2012.07.00749:3(673-687)Online publication date: 1-May-2013

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media