Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1878803.1878833acmconferencesArticle/Chapter ViewAbstractPublication PagesassetsConference Proceedingsconference-collections
research-article

Are synthesized video descriptions acceptable?

Published: 25 October 2010 Publication History

Abstract

We conducted a series of experiments to assess the feasibility of synthesized narrations to describe online videos. To reduce the cultural bias, we included adult blind or low-vision participants from Japan and the U.S. in the main study. Our research also includes a follow-up study we conducted in Japan to assess the effectiveness of synthesized video descriptions in realistic situations. The results showed that synthesized video descriptions were generally accepted in both countries. We also found that appropriate technology support allowed a novice describer to make effective video descriptions. Based on these results, we discuss the implications for developing a technology platform for describing online videos.

References

[1]
Chapdelaine, C. and Gagnon, L. Accessible Videodescription On-Demand. In Proceedings of ASSETS '09, ACM, 2009, pp. 221--222.
[2]
Ely, R., Wall Emerson, R., Maggiore, T., Rothberg, M., O'Connell, T., and Hudson, L. Increased Content Knowledge of Students with Visual Impairments as a Result of Extended Descriptions. Journal of Special Education Technology, 21(3), 2006, pp. 31--43.
[3]
Gould, B., Ferrell, K. A., and O'Connell, T. Accessible Science: How to Describe STEM Images. AER Journal: Research and Practice in Visual Impairment & Blindness, 2(1), 2009, pp. 52--54.
[4]
Kobayashi, M., Fukuda, K., Takagi, H., and Asakawa, C. Providing Synthesized Audio Description for Online Videos. In Proceedings of ASSETS '09, ACM, 2009, pp. 249--250.
[5]
Miyashita, H., Sato, D., Takagi, H., and Asakawa, C. aiBrowser for Multimedia: Introducing Multimedia Content Accessibility for Visually Impaired Users. In Proceedings of ASSETS '07, ACM, 2007, pp. 91--98.
[6]
Miyashita, H., Sato, D., Takagi, H., and Asakawa, C. Making Multimedia Content Accessible for Screen Reader Users, In Proceedings of W4A '07, ACM, 2007, pp. 126--127.
[7]
Pitrelli, J. F., Eide, E. M., Bakis, R., Fernandez, R., Hamza, W., and Picheny, M. A. The IBM Expressive Text-to-Speech Synthesis System for American English. IEEE Trans. on Audio, Speech and Language Processing, 14(4), 2006, pp. 1099--1108.
[8]
Viswanathan, M. and Viswanathan, M. Measuring Speech Quality for Text-to-Speech Systems: Development and Assessment of a Modified Mean Opinion Score (MOS) Scale, Computer Speech & Language, 19(1), 2005, pp. 55--83.
[9]
Demos of HTML5 Video and Audio Tag Accessibility, http://www.annodex.net/~silvia/itext/
[10]
Guidelines for Older Persons and Persons with Disabilities - Information and Communications Equipment, Software and Services - Part 3: Web Content (JIS X 8341-3), Japanese Standards Association.
[11]
Section 508 of the Rehabilitation Act, http://www.section508.gov/
[12]
Web Contents Accessibility Guidelines (WCAG) 2.0, http://www.w3.org/TR/WCAG20/

Cited By

View all
  • (2024)Knocking on doors: The use of blogging sites by visually impaired people in the USA preliminary studyConvergence: The International Journal of Research into New Media Technologies10.1177/13548565241261963Online publication date: 14-Jun-2024
  • (2024)SPICA: Interactive Video Content Exploration through Augmented Audio Descriptions for Blind or Low-Vision ViewersProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642632(1-18)Online publication date: 11-May-2024
  • (2024)Enhancing movie experience by speech rate design of audio descriptionUniversal Access in the Information Society10.1007/s10209-024-01178-zOnline publication date: 4-Dec-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ASSETS '10: Proceedings of the 12th international ACM SIGACCESS conference on Computers and accessibility
October 2010
346 pages
ISBN:9781605588810
DOI:10.1145/1878803
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 October 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. audio description
  2. online videos
  3. speech synthesis
  4. text-to-speech (tts)
  5. video description
  6. web accessibility

Qualifiers

  • Research-article

Conference

ASSETS '10
Sponsor:

Acceptance Rates

Overall Acceptance Rate 436 of 1,556 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)26
  • Downloads (Last 6 weeks)0
Reflects downloads up to 26 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Knocking on doors: The use of blogging sites by visually impaired people in the USA preliminary studyConvergence: The International Journal of Research into New Media Technologies10.1177/13548565241261963Online publication date: 14-Jun-2024
  • (2024)SPICA: Interactive Video Content Exploration through Augmented Audio Descriptions for Blind or Low-Vision ViewersProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642632(1-18)Online publication date: 11-May-2024
  • (2024)Enhancing movie experience by speech rate design of audio descriptionUniversal Access in the Information Society10.1007/s10209-024-01178-zOnline publication date: 4-Dec-2024
  • (2023)Machine Generation of Audio Description for Blind and Visually Impaired PeopleACM Transactions on Accessible Computing10.1145/359095516:2(1-28)Online publication date: 24-Jun-2023
  • (2023)Accessibility Research in Digital Audiovisual Media: What Has Been Achieved and What Should Be Done Next?Proceedings of the 2023 ACM International Conference on Interactive Media Experiences10.1145/3573381.3596159(94-114)Online publication date: 12-Jun-2023
  • (2023)Supporting Novices Author Audio Descriptions via Automatic FeedbackProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3581023(1-18)Online publication date: 19-Apr-2023
  • (2022)CrossA11y: Identifying Video Accessibility Issues via Cross-modal GroundingProceedings of the 35th Annual ACM Symposium on User Interface Software and Technology10.1145/3526113.3545703(1-14)Online publication date: 29-Oct-2022
  • (2022)OmniScribe: Authoring Immersive Audio Descriptions for 360° VideosProceedings of the 35th Annual ACM Symposium on User Interface Software and Technology10.1145/3526113.3545613(1-14)Online publication date: 29-Oct-2022
  • (2022)Accessibility-Related Publication Distribution in HCI Based on a Meta-AnalysisExtended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491101.3519701(1-28)Online publication date: 27-Apr-2022
  • (2022)Cost-effective and Collaborative Methods to Author Video’s Scene Description for Blind People.Extended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491101.3503814(1-5)Online publication date: 27-Apr-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media