Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1242572.1242580acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
Article

Learning information intent via observation

Published: 08 May 2007 Publication History

Abstract

Users in an organization frequently request help by sending request messages to assistants that express information intent: an intention to update data in an information system. Human assistants spend a significant amount of time and effort processing these requests. For example, human resource assistants process requests to update personnel records, and executive assistants process requests to schedule conference rooms or to make travel reservations. To process the intent of a request, assistants read the request and then locate, complete, and submit a form that corresponds to the expressed intent. Automatically or semi-automatically processing the intent expressed in a request on behalf of an assistant would ease the mundane and repetitive nature of this kind of work.For a well-understood domain, a straightforward application of natural language processing techniques can be used to build an intelligent form interface to semi-automatically process information intent request messages. However, high performance parsers are based on machine learning algorithms that require a large corpus of examples that have been labeled by an expert. The generation of a labeled corpus of requests is a major barrier to the construction of a parser. In this paper, we investigate the construction of a natural language processing system and an intelligent form system that observes an assistant processing requests. The intelligent form system then generates a labeled training corpus by interpreting the observations. This paper reports on the measurement of the performance of the machine learning algorithms based on real data. The combination of observations, machine learning and interaction design produces an effective intelligent form interface based on natural language processing.

References

[1]
William W. Cohen, Einat Minkov, Anthony Tomasic, Learning to Understand Web Site Update Requests, in Proceedings of IJCAI, 2005, pp 1028--1033.
[2]
Anthony Tomasic, John Zimmerman, Isaac Simmons, Linking Messages and Form Requests, in Intelligent User Interfaces (IUI), 2006, pp 78--85.
[3]
Ray Mooney, Learning semantic parsers: An important but under-studied problem, in Working notes of the AAAI spring symposium on language learning, 2004.
[4]
Anthony Tomasic, William Cohen, Susan Fussell, John Zimmerman, Marina Kobayashi, Einat Minkov, Nathan Halstead, Ravi Mosur, and Jason Hum, Learning to Navigate Web Forms, in Workshop on Information Integration on the Web (IIWEB), 2004.
[5]
Andrea Lockerd, Huy Pham, Taly Sharon, and Ted Selker, Mr.Web: An Automated Interactive Webmaster, in Proceedings of the Conference on Computer/Human Interaction (CHI), 2003.
[6]
William W. Cohen, Minorthird: Methods for Identifying Names and Ontological Relations in Text using Heuristics for Inducing Regularities from Data, http://minorthird.sourceforge.net.
[7]
John Zimmerman, Anthony Tomasic, Isaac Simmons, Ian Hargraves, Ken Mohnkern, Jason Cornwell, Robert Martin McGuire, VIO: a mixed-initiative approach to learning and automating procedural update tasks, in Proceedings of the Conference on Computer/Human Interaction (CHI), 2007.
[8]
Eric Horvitz. Principles of Mixed-Initiative User Interfaces. Proceedings of Conference on Human Factors in Computing Systems (CHI), Pittsburgh, PA, May 1999.
[9]
Victoria Bellotti, Nicolas Ducheneaut, Mark Howard, Ian Smith, Taking Email to Task: The Design and Evaluation of a Task Management Centered E-Mail Tool, Proceedings of Conference on Computer/Human Interaction (CHI), 2003, 345--352.
[10]
Oren Etzioni, Alon Halevy, Henry Levy, and Luke McDowell, Semantic Email: Adding Lightweight Data Manipulation Capabilities to the Email Habitat. International Workshop on the Web and Databases (WebDB), June 12-13, 2003, San Diego, California.
[11]
Alon Halevy, Oren Etzioni, AnHai Doan, Zachary Ives, Jayant Madhavan, Luke McDowell, Igor Tatarinov, Crossing the Structure Chasm in Conference on Innovated Data Systems Research (CIDR), 2003.
[12]
Bozena Thompson, Frederick Thompson, Introducing ASK, A Simple Knowledgeable System, Proceedings of the First Conference on Applied Natural Language Processing, Santa Monica, California, 1983.
[13]
Frank Meng, A Natural Language Interface for Information Retrieval from Forms on the World Wide Web, Proceedings of the 20th International Conference on Information Systems, Charlotte, North Carolina, 1999.
[14]
Philip Cohen, et. al., Synergistic Use of Direct Manipulation and Natural Language, in Proceedings of the Conference on Computer/Human Interaction (CHI), 1989.
[15]
William W. Cohen, Pradeep Ravikumar, Stephen Fienberg, A Comparison of String Distance Metrics for Name-Matching Tasks in Workshop on Information Integration on the Web (IIWEB), 2003.
[16]
Terry Winograd, Understanding Natural Language, Ph.D. thesis, Academic Press, 1972.
[17]
Rohit J. Kate, Yuk Wah Wong, Raymond J. Mooney, Learning to Transform Natural to Formal Languages. Proceedings of AAAI, 2005.
[18]
Einat Minkov, Richard C. Wang, Anthony Tomasic, William W. Cohen, NER Systems that Suit User's Preferences: Adjusting the Recall-Precision Trade-off for Entity Extraction in HLT/NAACL, 2006.
[19]
SecondString. http://secondstring.sourceforge.net/
[20]
Lafferty, J., McCallum, A., Pereira, F., Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data, Proceedings of 18th International Conf. on Machine Learning (ICML), 2001.
[21]
Robert E. Schapire, Yoram Singer, Improved boosting algorithms using confidence-rated predictions in Proceedings of the Eleventh Annual Conference on Computational Learning Theory, pages 80--91, 1998.
[22]
W. Cohen, V. Carvalho, and T. Mitchell, Learning to classify email into "speech acts", in Conference Empirical Methods in Natural Language Processing, 2004.
[23]
Mark Dredze, Tessa Lau, Nicholas Kushmerick, Automatically Classifying Emails into Activities, Intelligent User Interfaces (IUI) 2006.
[24]
Jianqiang Shen, Lida Li, Thomas G. Dietterich, Jonathan L. Herlocker, A hybrid learning system for recognizing user tasks from desktop activities and email messages, Intelligent User Interfaces (IUI), 2006.
[25]
Ivan Felligi, Alan Sunter, "A theory for record linkage", Journal of the American Statistical Society, 64:1183--1210, 1969.

Cited By

View all
  • (2012)User Models for Adaptive Information Retrieval on the WebInternational Journal of Adaptive, Resilient and Autonomic Systems10.4018/jaras.20120701013:3(1-19)Online publication date: 1-Jul-2012
  • (2011)A transfer approach to detecting disease reporting events in blog social mediaProceedings of the 22nd ACM conference on Hypertext and hypermedia10.1145/1995966.1996001(271-280)Online publication date: 6-Jun-2011
  • (2011)Self-supervised detection of disease reporting events in outbreak reports2011 IEEE International Conference on Information Reuse & Integration10.1109/IRI.2011.6009584(416-421)Online publication date: Aug-2011
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WWW '07: Proceedings of the 16th international conference on World Wide Web
May 2007
1382 pages
ISBN:9781595936547
DOI:10.1145/1242572
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 May 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. domestication
  2. information intent
  3. weak labeling

Qualifiers

  • Article

Conference

WWW'07
Sponsor:
WWW'07: 16th International World Wide Web Conference
May 8 - 12, 2007
Alberta, Banff, Canada

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 10 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2012)User Models for Adaptive Information Retrieval on the WebInternational Journal of Adaptive, Resilient and Autonomic Systems10.4018/jaras.20120701013:3(1-19)Online publication date: 1-Jul-2012
  • (2011)A transfer approach to detecting disease reporting events in blog social mediaProceedings of the 22nd ACM conference on Hypertext and hypermedia10.1145/1995966.1996001(271-280)Online publication date: 6-Jun-2011
  • (2011)Self-supervised detection of disease reporting events in outbreak reports2011 IEEE International Conference on Information Reuse & Integration10.1109/IRI.2011.6009584(416-421)Online publication date: Aug-2011
  • (2008)RADARProceedings of the 23rd national conference on Artificial intelligence - Volume 310.5555/1620270.1620274(1287-1293)Online publication date: 13-Jul-2008

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media