DOI: 10.1145/1185448.1185517
Article

VTQuest: a voice-based multimodal web-based software system for maps and directions

Published: 10 March 2006

Abstract

Finding one's way around a large university campus can be difficult. We developed VTQuest (http://sunfish.cs.vt.edu/VTQuestV), a web-based software system, to address this problem for the campus of Virginia Tech (http://www.vt.edu/). VTQuest enables (a) multimodal interaction with voice, mouse, and keyboard, (b) browsing the campus map, (c) locating a building on the campus map by name, abbreviation, category, or within a given distance, (d) locating a room on the floor plan of a building, and (e) obtaining walking directions from one building to another. VTQuest provides these capabilities for 103 buildings, with floor plans for most of them. VTQuest is built on the Java 2 Platform, Enterprise Edition (J2EE) and uses Scalable Vector Graphics (SVG) and Speech Application Language Tags (SALT). SVG enables zooming into the maps without loss of image quality. The voice interface offers a variety of features, including an extensive grammar and out-of-turn interaction.
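The claim that SVG allows zooming without quality loss can be illustrated with a minimal sketch. It is not taken from VTQuest; the function names and the sample map size below are hypothetical. It assumes the map is an SVG document whose visible region is controlled by the standard `viewBox` attribute. Because the renderer redraws the vector shapes for whatever region the `viewBox` selects, zooming only means choosing a smaller region, with no raster image to resample.

```python
def zoom_view_box(view_box, factor, cx, cy):
    """Return a viewBox zoomed by `factor` about the point (cx, cy).

    view_box is (min_x, min_y, width, height) in SVG user units.
    A factor above 1 zooms in, a factor between 0 and 1 zooms out.
    Hypothetical helper for illustration only.
    """
    if factor <= 0:
        raise ValueError("factor must be positive")
    min_x, min_y, width, height = view_box
    # A smaller visible region rendered into the same viewport means magnification.
    new_width = width / factor
    new_height = height / factor
    # Keep (cx, cy) at the same relative position inside the viewport.
    new_min_x = cx - (cx - min_x) / factor
    new_min_y = cy - (cy - min_y) / factor
    return (new_min_x, new_min_y, new_width, new_height)


def view_box_attr(view_box):
    """Format a viewBox tuple as the value of the SVG viewBox attribute."""
    return " ".join(f"{value:g}" for value in view_box)


# Assumed 1000 x 800 campus map, zoomed 2x about its center.
zoomed = zoom_view_box((0, 0, 1000, 800), 2, 500, 400)
print(view_box_attr(zoomed))
```

Setting the resulting string as the `viewBox` of the root `svg` element would show the central quarter of the map at twice the scale.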



Published In

ACMSE '06: Proceedings of the 44th annual ACM Southeast Conference
March 2006
823 pages
ISBN:1595933158
DOI:10.1145/1185448
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. J2EE
  2. client-server software
  3. multimodal user interface
  4. scalable vector graphics
  5. voice user interface
  6. web-based software

Qualifiers

  • Article

Conference

ACM SE06: ACM Southeast Regional Conference
March 10 - 12, 2006
Melbourne, Florida

Acceptance Rates

Overall Acceptance Rate 81 of 137 submissions, 59%


Cited By

  • (2015) Investigating accessibility on web-based maps. ACM SIGAPP Applied Computing Review 15:2 (17-26). DOI: 10.1145/2815169.2815171. Online publication date: 14-Aug-2015
  • (2015) Evaluation of web accessibility on the maps domain. Proceedings of the 30th Annual ACM Symposium on Applied Computing (157-162). DOI: 10.1145/2695664.2695771. Online publication date: 13-Apr-2015
  • (2008) Interaction techniques for the analysis of complex data on high-resolution displays. Proceedings of the 10th international conference on Multimodal interfaces (21-28). DOI: 10.1145/1452392.1452399. Online publication date: 20-Oct-2008
  • (2007) Enabling rapid development of multimodal data entry applications. Proceedings of the 2007 OTM confederated international conference on On the move to meaningful internet systems - Volume Part I (377-386). DOI: 10.5555/1780909.1780980. Online publication date: 25-Nov-2007
