research-article

Are synthesized video descriptions acceptable?

Authors:

Chieko AsakawaAuthors Info & Claims

ASSETS '10: Proceedings of the 12th international ACM SIGACCESS conference on Computers and accessibility

Pages 163 - 170

https://doi.org/10.1145/1878803.1878833

Published: 25 October 2010 Publication History

Get Access

Abstract

We conducted a series of experiments to assess the feasibility of synthesized narrations to describe online videos. To reduce the cultural bias, we included adult blind or low-vision participants from Japan and the U.S. in the main study. Our research also includes a follow-up study we conducted in Japan to assess the effectiveness of synthesized video descriptions in realistic situations. The results showed that synthesized video descriptions were generally accepted in both countries. We also found that appropriate technology support allowed a novice describer to make effective video descriptions. Based on these results, we discuss the implications for developing a technology platform for describing online videos.

References

[1]

Chapdelaine, C. and Gagnon, L. Accessible Videodescription On-Demand. In Proceedings of ASSETS '09, ACM, 2009, pp. 221--222.

Digital Library

Google Scholar

[2]

Ely, R., Wall Emerson, R., Maggiore, T., Rothberg, M., O'Connell, T., and Hudson, L. Increased Content Knowledge of Students with Visual Impairments as a Result of Extended Descriptions. Journal of Special Education Technology, 21(3), 2006, pp. 31--43.

Crossref

Google Scholar

[3]

Gould, B., Ferrell, K. A., and O'Connell, T. Accessible Science: How to Describe STEM Images. AER Journal: Research and Practice in Visual Impairment & Blindness, 2(1), 2009, pp. 52--54.

Google Scholar

[4]

Kobayashi, M., Fukuda, K., Takagi, H., and Asakawa, C. Providing Synthesized Audio Description for Online Videos. In Proceedings of ASSETS '09, ACM, 2009, pp. 249--250.

Digital Library

Google Scholar

[5]

Miyashita, H., Sato, D., Takagi, H., and Asakawa, C. aiBrowser for Multimedia: Introducing Multimedia Content Accessibility for Visually Impaired Users. In Proceedings of ASSETS '07, ACM, 2007, pp. 91--98.

Digital Library

Google Scholar

[6]

Miyashita, H., Sato, D., Takagi, H., and Asakawa, C. Making Multimedia Content Accessible for Screen Reader Users, In Proceedings of W4A '07, ACM, 2007, pp. 126--127.

Digital Library

Google Scholar

[7]

Pitrelli, J. F., Eide, E. M., Bakis, R., Fernandez, R., Hamza, W., and Picheny, M. A. The IBM Expressive Text-to-Speech Synthesis System for American English. IEEE Trans. on Audio, Speech and Language Processing, 14(4), 2006, pp. 1099--1108.

Digital Library

Google Scholar

[8]

Viswanathan, M. and Viswanathan, M. Measuring Speech Quality for Text-to-Speech Systems: Development and Assessment of a Modified Mean Opinion Score (MOS) Scale, Computer Speech & Language, 19(1), 2005, pp. 55--83.

Crossref

Google Scholar

[9]

Demos of HTML5 Video and Audio Tag Accessibility, http://www.annodex.net/~silvia/itext/

Google Scholar

[10]

Guidelines for Older Persons and Persons with Disabilities - Information and Communications Equipment, Software and Services - Part 3: Web Content (JIS X 8341-3), Japanese Standards Association.

Google Scholar

[11]

Section 508 of the Rehabilitation Act, http://www.section508.gov/

Google Scholar

[12]

Web Contents Accessibility Guidelines (WCAG) 2.0, http://www.w3.org/TR/WCAG20/

Google Scholar

Cited By

View all

Emara I(2024)Knocking on doors: The use of blogging sites by visually impaired people in the USA preliminary studyConvergence: The International Journal of Research into New Media Technologies10.1177/13548565241261963Online publication date: 14-Jun-2024
https://doi.org/10.1177/13548565241261963
Ning ZWimer BJiang KChen KBan JTian YZhao YLi T(2024)SPICA: Interactive Video Content Exploration through Augmented Audio Descriptions for Blind or Low-Vision ViewersProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642632(1-18)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613904.3642632
Nakajima SOkochi NMitobe K(2024)Enhancing movie experience by speech rate design of audio descriptionUniversal Access in the Information Society10.1007/s10209-024-01178-zOnline publication date: 4-Dec-2024
https://doi.org/10.1007/s10209-024-01178-z
Show More Cited By

Index Terms

Are synthesized video descriptions acceptable?
1. Social and professional topics
  1. Professional topics
    1. Computing profession
      1. Assistive technologies
  2. User characteristics
    1. People with disabilities

Recommendations

Toward Automatic Audio Description Generation for Accessible Videos
CHI '21: Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems

Video accessibility is essential for people with visual impairments. Audio descriptions describe what is happening on-screen, e.g., physical actions, facial expressions, and scene changes. Generating high-quality audio descriptions requires a lot of ...
Human-in-the-Loop Machine Learning to Increase Video Accessibility for Visually Impaired and Blind Users
DIS '20: Proceedings of the 2020 ACM Designing Interactive Systems Conference

Video accessibility is crucial for blind and visually impaired individuals for education, employment, and entertainment purposes. However, professional video descriptions are costly and time-consuming. Volunteer-created video descriptions could be a ...
Providing synthesized audio description for online videos
Assets '09: Proceedings of the 11th international ACM SIGACCESS conference on Computers and accessibility

We describe an initial attempt to develop a common platform for adding an audio description (AD) to an online video so that blind and visually impaired people can enjoy such material. A speech synthesis technology allows content providers to offer the ...

Comments

Information & Contributors

Information

Published In

ASSETS '10: Proceedings of the 12th international ACM SIGACCESS conference on Computers and accessibility

October 2010

346 pages

ISBN:9781605588810

DOI:10.1145/1878803

General Chair:
Armando Barreto
Florida International University, USA
,
Program Chair:
Vicki L. Hanson
University of Dundee, UK

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 October 2010

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

ASSETS '10

Sponsor:

SIGACCESS

ASSETS '10: The 12th International ACM SIGACCESS Conference on Computers and Accessibility

October 25 - 27, 2010

Florida, Orlando, USA

Acceptance Rates

Overall Acceptance Rate 436 of 1,556 submissions, 28%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

27
Total Citations
View Citations
347
Total Downloads

Downloads (Last 12 months)26
Downloads (Last 6 weeks)0

Reflects downloads up to 26 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Emara I(2024)Knocking on doors: The use of blogging sites by visually impaired people in the USA preliminary studyConvergence: The International Journal of Research into New Media Technologies10.1177/13548565241261963Online publication date: 14-Jun-2024
https://doi.org/10.1177/13548565241261963
Ning ZWimer BJiang KChen KBan JTian YZhao YLi T(2024)SPICA: Interactive Video Content Exploration through Augmented Audio Descriptions for Blind or Low-Vision ViewersProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642632(1-18)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613904.3642632
Nakajima SOkochi NMitobe K(2024)Enhancing movie experience by speech rate design of audio descriptionUniversal Access in the Information Society10.1007/s10209-024-01178-zOnline publication date: 4-Dec-2024
https://doi.org/10.1007/s10209-024-01178-z
Campos VGonçalves LRibeiro WAraújo TDo Rego TFigueiredo PVieira SCosta TMoraes CCruz AAraújo FSouza Filho G(2023)Machine Generation of Audio Description for Blind and Visually Impaired PeopleACM Transactions on Accessible Computing10.1145/359095516:2(1-28)Online publication date: 24-Jun-2023
https://dl.acm.org/doi/10.1145/3590955
Nevsky ANeate TSimperl EVatavu R(2023)Accessibility Research in Digital Audiovisual Media: What Has Been Achieved and What Should Be Done Next?Proceedings of the 2023 ACM International Conference on Interactive Media Experiences10.1145/3573381.3596159(94-114)Online publication date: 12-Jun-2023
https://dl.acm.org/doi/10.1145/3573381.3596159
Natalie RTseng JKacorri HHara K(2023)Supporting Novices Author Audio Descriptions via Automatic FeedbackProceedings of the 2023 CHI Conference on Human Factors in Computing Systems10.1145/3544548.3581023(1-18)Online publication date: 19-Apr-2023
https://dl.acm.org/doi/10.1145/3544548.3581023
Liu XWang RLi DChen XPavel A(2022)CrossA11y: Identifying Video Accessibility Issues via Cross-modal GroundingProceedings of the 35th Annual ACM Symposium on User Interface Software and Technology10.1145/3526113.3545703(1-14)Online publication date: 29-Oct-2022
https://dl.acm.org/doi/10.1145/3526113.3545703
Chang RTing CHung CLee WChen LChao YChen BGuo A(2022)OmniScribe: Authoring Immersive Audio Descriptions for 360° VideosProceedings of the 35th Annual ACM Symposium on User Interface Software and Technology10.1145/3526113.3545613(1-14)Online publication date: 29-Oct-2022
https://dl.acm.org/doi/10.1145/3526113.3545613
Colley MKränzle TRukzio E(2022)Accessibility-Related Publication Distribution in HCI Based on a Meta-AnalysisExtended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491101.3519701(1-28)Online publication date: 27-Apr-2022
https://dl.acm.org/doi/10.1145/3491101.3519701
Natalie R(2022)Cost-effective and Collaborative Methods to Author Video’s Scene Description for Blind People.Extended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491101.3503814(1-5)Online publication date: 27-Apr-2022
https://dl.acm.org/doi/10.1145/3491101.3503814
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Index Terms

Recommendations

Toward Automatic Audio Description Generation for Accessible Videos

Human-in-the-Loop Machine Learning to Increase Video Accessibility for Visually Impaired and Blind Users

Providing synthesized audio description for online videos

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations