Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3586183.3606830acmconferencesArticle/Chapter ViewAbstractPublication PagesuistConference Proceedingsconference-collections
research-article

Front Row: Automatically Generating Immersive Audio Representations of Tennis Broadcasts for Blind Viewers

Published: 29 October 2023 Publication History

Abstract

Blind and low-vision (BLV) people face challenges watching sports due to the lack of accessibility of sports broadcasts. Currently, BLV people rely on descriptions from TV commentators, radio announcers, or their friends to understand the game. These descriptions, however, do not allow BLV viewers to visualize the action by themselves. We present Front Row, a system that automatically generates an immersive audio representation of sports broadcasts, specifically tennis, allowing BLV viewers to more directly perceive what is happening in the game. Front Row first recognizes gameplay from the video feed using computer vision, then renders players’ positions and shots via spatialized (3D) audio cues. User evaluations with 12 BLV participants show that Front Row gives BLV viewers a more accurate understanding of the game compared to TV and radio, enabling viewers to form their own opinions on players’ moods and strategies. We discuss future implications of Front Row and illustrate several applications, including a Front Row plug-in for video streaming platforms to enable BLV people to visualize the action in sports videos across the Web.

References

[1]
Action Audio. 2021. Making Sports Broadcasts Accessible to People Living With Blindness or Low Vision. https://action-audio.com/
[2]
American Council of the Blind. 2022. The Audio Description Project. https://adp.acb.org/guidelines.html
[3]
Katrin Angerbauer, Nils Rodrigues, Rene Cutura, Seyda Öney, Nelusa Pathmanathan, Cristina Morariu, Daniel Weiskopf, and Michael Sedlmair. 2022. Accessibility for Color Vision Deficiencies: Challenges and Findings of a Large Scale Study on Paper Figures. In CHI Conference on Human Factors in Computing Systems. ACM, New Orleans LA USA, 1–23. https://doi.org/10.1145/3491102.3502133
[4]
Saki Asakawa, João Guerreiro, Daisuke Sato, Hironobu Takagi, Dragan Ahmetovic, Desi Gonzalez, Kris M. Kitani, and Chieko Asakawa. 2019. An Independent and Interactive Museum Experience for Blind People. In Proceedings of the 16th International Web for All Conference. ACM, San Francisco CA USA, 1–9. https://doi.org/10.1145/3315002.3317557
[5]
Saki Asakawa and Amy Hurst. 2021. “What just happened?”: Understanding Non-visual Watching Sports Experiences. In The 23rd International ACM SIGACCESS Conference on Computers and Accessibility. ACM, Virtual Event USA, 1–3. https://doi.org/10.1145/3441852.3476525
[6]
Cynthia L. Bennett, Cole Gleason, Morgan Klaus Scheuerman, Jeffrey P. Bigham, Anhong Guo, and Alexandra To. 2021. “It’s Complicated”: Negotiating Accessibility and (Mis)Representation in Image Descriptions of Race, Gender, and Disability. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. ACM, Yokohama Japan, 1–19. https://doi.org/10.1145/3411764.3445498
[7]
Virginia Braun and Victoria Clarke. 2006. Using thematic analysis in psychology. Qualitative Research in Psychology 3, 2 (Jan. 2006), 77–101. https://doi.org/10.1191/1478088706qp063oa
[8]
Matthew Butler, Leona M Holloway, Samuel Reinders, Cagatay Goncu, and Kim Marriott. 2021. Technology Developments in Touch-Based Accessible Graphics: A Systematic Review of Research 2010-2020. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems(CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 278. https://doi.org/10.1145/3411764.3445207
[9]
Ruei-Che Chang, Chao-Hsien Ting, Chia-Sheng Hung, Wan-Chen Lee, Liang-Jin Chen, Yu-Tzu Chao, Bing-Yu Chen, and Guo Anhong. 2022. OmniScribe: Authoring Immersive Audio Descriptions for 360 ° Videos. (2022), 14.
[10]
Zhutian Chen, Shuainan Ye, Xiangtong Chu, Haijun Xia, Hui Zhang, Huamin Qu, and Yingcai Wu. 2022. Augmenting Sports Videos with VisCommentator. IEEE Transactions on Visualization and Computer Graphics 28, 1 (Jan. 2022), 824–834. https://doi.org/10.1109/TVCG.2021.3114806
[11]
Morgan Cottril. February 12, 2020. The Importance of Sports in Culture. https://fghsnews.com/2603/diversity/the-importance-of-sports-in-culture/
[12]
N. Dalal and B. Triggs. 2005. Histograms of Oriented Gradients for Human Detection. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), Vol. 1. IEEE, San Diego, CA, USA, 886–893. https://doi.org/10.1109/CVPR.2005.177
[13]
Expedio Design. 2019. Footbraile. https://www.expediodesign.com/portfolio-footbraille
[14]
Olutayo Falase, Alexa F. Siu, and Sean Follmer. 2019. Tactile Code Skimmer: A Tool to Help Blind Programmers Feel the Structure of Code. In The 21st International ACM SIGACCESS Conference on Computers and Accessibility. ACM, Pittsburgh PA USA, 536–538. https://doi.org/10.1145/3308561.3354616
[15]
John C. Flanagan. 1954. The critical incident technique. Psychological Bulletin 51, 4 (1954), 327–358.
[16]
John Garhammer and Harvey Newton. 2013. Applied Video Analysis for Coaches: Weightlifting Examples. International Journal of Sports Science & Coaching 8, 3 (Sept. 2013), 581–594. https://doi.org/10.1260/1747-9541.8.3.581 Publisher: SAGE Publications.
[17]
Anurag Ghosh and C. V. Jawahar. 2018. SmartTennisTV: Automatic indexing of tennis videos. arXiv:1801.01430 [cs] (Jan. 2018). http://arxiv.org/abs/1801.01430 arXiv:1801.01430.
[18]
Anurag Ghosh, Suriya Singh, and C. V. Jawahar. 2017. Towards Structured Analysis of Broadcast Badminton Videos. arXiv:1712.08714 [cs] (Dec. 2017). http://arxiv.org/abs/1712.08714 arXiv:1712.08714.
[19]
Cole Gleason, Amy Pavel, Himalini Gururaj, Kris Kitani, and Jeffrey Bigham. 2020. Making GIFs Accessible. In The 22nd International ACM SIGACCESS Conference on Computers and Accessibility(ASSETS ’20). Association for Computing Machinery, New York, NY, USA, 1–10. https://doi.org/10.1145/3373625.3417027
[20]
Cole Gleason, Amy Pavel, Emma McCamey, Christina Low, Patrick Carrington, Kris M. Kitani, and Jeffrey P. Bigham. 2020. Twitter A11y: A Browser Extension to Make Twitter Images Accessible. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. ACM, Honolulu HI USA, 1–12. https://doi.org/10.1145/3313831.3376728
[21]
Cagatay Goncu and Daniel J. Finnegan. 2021. ‘Did You See That!?’ Enhancing the Experience of Sports Media Broadcast for Blind People. In Human-Computer Interaction – INTERACT 2021. Vol. 12932. Springer International Publishing, Cham, 396–417. https://doi.org/10.1007/978-3-030-85623-6_24
[22]
Cagatay Goncu, Anuradha Madugalla, Simone Marinai, and Kim Marriott. 2015. Accessible On-Line Floor Plans. In Proceedings of the 24th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, Florence Italy, 388–398. https://doi.org/10.1145/2736277.2741660
[23]
Cagatay Goncu and Kim Marriott. 2011. GraVVITAS: Generic Multi-touch Presentation of Accessible Graphics. In Human-Computer Interaction – INTERACT 2011, David Hutchison, Takeo Kanade, Josef Kittler, Jon M. Kleinberg, Friedemann Mattern, John C. Mitchell, Moni Naor, Oscar Nierstrasz, C. Pandu Rangan, Bernhard Steffen, Madhu Sudan, Demetri Terzopoulos, Doug Tygar, Moshe Y. Vardi, Gerhard Weikum, Pedro Campos, Nicholas Graham, Joaquim Jorge, Nuno Nunes, Philippe Palanque, and Marco Winckler (Eds.). Vol. 6946. Springer Berlin Heidelberg, Berlin, Heidelberg, 30–48. https://doi.org/10.1007/978-3-642-23774-4_5
[24]
Leo A. Goodman. 1961. Snowball Sampling. The Annals of Mathematical Statistics 32, 1 (1961), 148–170. https://www.jstor.org/stable/2237615
[25]
Alex Graves. 2014. Generating Sequences With Recurrent Neural Networks. http://arxiv.org/abs/1308.0850 arXiv:1308.0850 [cs].
[26]
Giles Hamilton-Fletcher, Marianna Obrist, Phil Watten, Michele Mengucci, and Jamie Ward. 2016. "I Always Wanted to See the Night Sky": Blind User Preferences for Sensory Substitution Devices. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. ACM, San Jose California USA, 2162–2174. https://doi.org/10.1145/2858036.2858241
[27]
Sandra G. Hart and Lowell E. Staveland. 1988. Development of NASA-TLX (Task Load Index): Results of Empirical and Theoretical Research. In Advances in Psychology, Peter A. Hancock and Najmedin Meshkati (Eds.). Human Mental Workload, Vol. 52. North-Holland, 139–183. https://doi.org/10.1016/S0166-4115(08)62386-9
[28]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735–1780. Publisher: MIT press.
[29]
Leona M Holloway, Cagatay Goncu, Alon Ilsar, Matthew Butler, and Kim Marriott. 2022. Infosonics: Accessible Infographics for People who are Blind using Sonification and Voice. In CHI Conference on Human Factors in Computing Systems. ACM, New Orleans LA USA, 1–13. https://doi.org/10.1145/3491102.3517465
[30]
Yu-Chuan Huang, I-No Liao, Ching-Hsuan Chen, Tsì-Uí İk, and Wen-Chih Peng. 2019. TrackNet: A Deep Learning Network for Tracking High-speed and Tiny Objects in Sports Applications*. In 2019 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS). 1–8. https://doi.org/10.1109/AVSS.2019.8909871 ISSN: 2643-6213.
[31]
Kenneth A. Hunt, Terry Bristol, and R. Edward Bashaw. 1999. A conceptual approach to classifying sports fans. The Journal of Services Marketing 13, 6 (1999), 439–452. https://doi.org/10.1108/08876049910298720
[32]
[32] Hawk-Eye Innovations. 2001. https://www.hawkeyeinnovations.com/
[33]
IrisVision. Retrieved July 15, 2022. IrisVision. https://irisvision.com/product/
[34]
Hiroo Iwata, Hiroaki Yano, Fumitaka Nakaizumi, and Ryo Kawamura. 2001. Project FEELEX: Adding Haptic Surface to Graphics. In Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques - SIGGRAPH ’01. ACM Press, Not Known, 469–476. https://doi.org/10.1145/383259.383314
[35]
Gaurav Jain, Basel Hindi, Connor Courtien, Conrad Wyrick, Xin Yi Therese Xu, Michael C Malcolm, and Brian A. Smith. 2023. Towards Accessible Sports Broadcasts for Blind and Low-Vision Viewers. In Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems. ACM, Hamburg Germany, 1–7. https://doi.org/10.1145/3544549.3585610
[36]
Grant Jarvie, James Thornton, and Hector Mackie. 2017. Sport, Culture and Society: An Introduction (3 ed.). Routledge, Third edition. | Abingdon, Oxon ; New York, NY : Routledge is an imprint of the Taylor & Francis Group, an Informa Business, [2017].
[37]
Glenn Jocher et al.April 2021. YOLOv5. https://ultralytics.com/yolov5
[38]
Shaun K. Kane, Meredith Ringel Morris, and Jacob O. Wobbrock. 2013. Touchplates: Low-Cost Tactile Overlays for Visually Impaired Touch Screen Users. In Proceedings of the 15th International ACM SIGACCESS Conference on Computers and Accessibility. ACM, Bellevue Washington, 1–8. https://doi.org/10.1145/2513383.2513442
[39]
Jaewook Lee, Jaylin Herskovitz, Yi-Hao Peng, and Anhong Guo. 2022. ImageExplorer: Multi-Layered Touch Exploration to Encourage Skepticism Towards Imperfect AI-Generated Image Captions. In CHI Conference on Human Factors in Computing Systems. ACM, New Orleans LA USA, 1–15. https://doi.org/10.1145/3491102.3501966
[40]
Franklin Mingzhe Li, Lotus Zhang, Maryam Bandukda, Abigale Stangl, Kristen Shinohara, Leah Findlater, and Patrick Carrington. 2023. Understanding Visual Arts Experiences of Blind People. https://doi.org/10.1145/3544548.3580941 arXiv:2301.12687 [cs].
[41]
Thomas Lin. 2012. Hitting the Court, With an Ear on the Ball. The New York Times (June 2012). https://www.nytimes.com/2012/06/05/science/a-game-of-tennis-tests-notions-of-blindness.html
[42]
Xingyu Liu, Patrick Carrington, Xiang ’Anthony’ Chen, and Amy Pavel. 2021. What Makes Videos Accessible to Blind and Visually Impaired People?. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. ACM, Yokohama Japan, 1–14. https://doi.org/10.1145/3411764.3445233
[43]
Jonathan Long, Evan Shelhamer, and Trevor Darrell. 2015. Fully Convolutional Networks for Semantic Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2015), 3431–3440. https://openaccess.thecvf.com/content_cvpr_2015/html/Long_Fully_Convolutional_Networks_2015_CVPR_paper.html
[44]
Kelly Mack, Danielle Bragg, Meredith Ringel Morris, Maarten W. Bos, Isabelle Albi, and Andrés Monroy-Hernández. 2020. Social App Accessibility for Deaf Signers. Proceedings of the ACM on Human-Computer Interaction 4, CSCW2 (Oct. 2020), 1–31.
[45]
Kelly Mack, Edward Cutrell, Bongshin Lee, and Meredith Ringel Morris. 2021. Designing Tools for High-Quality Alt Text Authoring. In The 23rd International ACM SIGACCESS Conference on Computers and Accessibility(ASSETS ’21). Association for Computing Machinery, New York, NY, USA, 1–14. https://doi.org/10.1145/3441852.3471207
[46]
Anuradha Madugalla, Kim Marriott, Simone Marinai, Samuele Capobianco, and Cagatay Goncu. 2020. Creating Accessible Online Floor Plans for Visually Impaired Readers. ACM Transactions on Accessible Computing 13, 4 (Oct. 2020), 1–37. https://doi.org/10.1145/3410446
[47]
J. Matas, C. Galambos, and J. Kittler. 1998. Progressive Probabilistic Hough Transform. In Procedings of the British Machine Vision Conference 1998. British Machine Vision Association, Southampton, 26.1–26.10. https://doi.org/10.5244/C.12.26
[48]
Tom McEwan and Ben Weerts. 2007. ALT Text and Basic Accessibility. https://doi.org/10.14236/ewic/HCI2007.64
[49]
Meredith Ringel Morris, Jazette Johnson, Cynthia L. Bennett, and Edward Cutrell. 2018. Rich Representations of Visual Content for Screen Reader Users. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. ACM, Montreal QC Canada, 1–11. https://doi.org/10.1145/3173574.3173633
[50]
Vishnu Nair, Jay L Karp, Samuel Silverman, Mohar Kalra, Hollis Lehv, Faizan Jamil, and Brian A. Smith. 2021. NavStick: Making Video Games Blind-Accessible via the Ability to Look Around. In The 34th Annual ACM Symposium on User Interface Software and Technology. ACM, Virtual Event USA, 538–551. https://doi.org/10.1145/3472749.3474768
[51]
Yuri Nishikawa, Hitoshi Sato, and Jun Ozawa. 2018. Multiple sports player tracking system based on graph optimization using low-cost cameras. In 2018 IEEE International Conference on Consumer Electronics (ICCE). 1–4. https://doi.org/10.1109/ICCE.2018.8326126 ISSN: 2158-4001.
[52]
NVivo. 1997. NVivo. https://www.qsrinternational.com/nvivo-qualitative-data-analysis-software/home
[53]
Hiroyuki Ohshima, Makoto Kobayashi, and Shigenobu Shimada. 2021. Development of Blind Football Play-by-play System for Visually Impaired Spectators: Tangible Sports. In Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems. ACM, Yokohama Japan, 1–6. https://doi.org/10.1145/3411763.3451737
[54]
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. 2011. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research 12 (2011), 2825–2830.
[55]
Peter Meijer. Retrieved August 2022. The vOICe. https://www.seeingwithsound.com/
[56]
Bridget Pettitt, Katharine Sharpe, and Steven Cooper. 1996. AUDETEL: Enhancing television for visually impaired people. British Journal of Visual Impairment 14, 2 (May 1996), 48–52. https://doi.org/10.1177/026461969601400202 Publisher: SAGE Publications Ltd.
[57]
Venkatesh Potluri, Tadashi E Grindeland, Jon E. Froehlich, and Jennifer Mankoff. 2021. Examining Visual Semantic Understanding in Blind and Low-Vision Technology Users. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. ACM, Yokohama Japan, 1–14. https://doi.org/10.1145/3411764.3445040
[58]
Denise Prescher, Jens Bornschein, Wiebke Kohlmann, and Gerhard Weber. 2018. Touching Graphical Applications: Bimanual Tactile Interaction on the HyperBraille Pin-Matrix Display. Universal Access in the Information Society 17, 2 (June 2018), 391–409. https://doi.org/10.1007/s10209-017-0538-8
[59]
Arthur A Raney and Jennings Bryant. 2006. Handbook of Sports and Media. Chapter 19: Why we watch and enjoy mediated sports.
[60]
Kyle Rector, Keith Salmon, Dan Thornton, Neel Joshi, and Meredith Ringel Morris. 2017. Eyes-Free Art: Exploring Proxemic Audio Interfaces For Blind and Low Vision Art Engagement. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 1, 3 (Sept. 2017), 1–21. https://doi.org/10.1145/3130958
[61]
Andreas Reichinger, Stefan Maierhofer, and Werner Purgathofer. 2011. High-Quality Tactile Paintings. Journal on Computing and Cultural Heritage 4, 2 (Nov. 2011), 1–13. https://doi.org/10.1145/2037820.2037822
[62]
Santander. 2019. Fieeld. https://www.santander.com/en/press-room/press-releases/santander-presents-fieeld-a-deviceenabling- blind-people-to-watch-football-using-their-fingertips
[63]
Ather Sharif, Olivia H. Wang, Alida T. Muongchan, Katharina Reinecke, and Jacob O. Wobbrock. 2022. VoxLens: Making Online Data Visualizations Accessible with an Interactive JavaScript Plug-In. In CHI Conference on Human Factors in Computing Systems. ACM, New Orleans LA USA, 1–19. https://doi.org/10.1145/3491102.3517431
[64]
Roy Shilkrot, Jochen Huber, Connie Liu, Pattie Maes, and Suranga Chandima Nanayakkara. 2014. FingerReader: a wearable device to support text reading on the go. In CHI ’14 Extended Abstracts on Human Factors in Computing Systems. ACM, Toronto Ontario Canada, 2359–2364. https://doi.org/10.1145/2559206.2581220
[65]
Roy Shilkrot, Jochen Huber, Wong Meng Ee, Pattie Maes, and Suranga Chandima Nanayakkara. 2015. FingerReader: A Wearable Device to Explore Printed Text on the Go. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems. ACM, Seoul Republic of Korea, 2363–2372. https://doi.org/10.1145/2702123.2702421
[66]
Jaeeun Shin, Jundong Cho, and Sangwon Lee. 2020. Please Touch Color: Tactile-Color Texture Design for The Visually Impaired. In Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems. ACM, Honolulu HI USA, 1–7. https://doi.org/10.1145/3334480.3383003
[67]
Alexa Siu, Gene S-H Kim, Sile O’Modhrain, and Sean Follmer. 2022. Supporting Accessible Data Visualization Through Audio Data Narratives. In CHI Conference on Human Factors in Computing Systems. ACM, New Orleans LA USA, 1–19. https://doi.org/10.1145/3491102.3517678
[68]
Alexa F. Siu, Son Kim, Joshua A. Miele, and Sean Follmer. 2019. shapeCAD: An Accessible 3D Modelling Workflow for the Blind and Visually-Impaired Via 2.5D Shape Displays. In The 21st International ACM SIGACCESS Conference on Computers and Accessibility. ACM, Pittsburgh PA USA, 342–354. https://doi.org/10.1145/3308561.3353782
[69]
Brian A. Smith and Shree K. Nayar. 2018. The RAD: Making Racing Games Equivalently Accessible to People Who Are Blind. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems(CHI ’18). Association for Computing Machinery, New York, NY, USA, 1–12. https://doi.org/10.1145/3173574.3174090
[70]
Joel Snyder. 2005. Audio description: The visual made verbal. International Congress Series 1282 (Sept. 2005), 935–939. https://doi.org/10.1016/j.ics.2005.05.215
[71]
Nancy Staggers and David Kobus. 2000. Comparing Response Time, Errors, and Satisfaction Between Text-based and Graphical User Interfaces During Nursing Order Tasks. Journal of the American Medical Informatics Association : JAMIA 7, 2 (2000), 164–176. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC61470/
[72]
Abigale Stangl, Meredith Ringel Morris, and Danna Gurari. 2020. "Person, Shoes, Tree. Is the Person Naked?" What People with Vision Impairments Want in Image Descriptions. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems(CHI ’20). Association for Computing Machinery, New York, NY, USA, 1–13. https://doi.org/10.1145/3313831.3376404
[73]
Lee Stearns, Victor DeSouza, Jessica Yin, Leah Findlater, and Jon E. Froehlich. 2017. Augmented Reality Magnification for Low Vision Users with the Microsoft Hololens and a Finger-Worn Camera. In Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility. ACM, Baltimore Maryland USA, 361–362. https://doi.org/10.1145/3132525.3134812
[74]
Takamasa Tsunoda, Yasuhiro Komori, Masakazu Matsugu, and Tatsuya Harada. 2017. Football Action Recognition Using Hierarchical LSTM. In 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). IEEE, Honolulu, HI, USA, 155–163. https://doi.org/10.1109/CVPRW.2017.25
[75]
Valve Corporation. 2018. Steam Audio. https://valvesoftware.github.io/steam-audio/
[76]
Roman Voeikov, Nikolay Falaleev, and Ruslan Baikulov. 2020. TTNet: Real-time temporal and spatial video analysis of table tennis. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). IEEE, Seattle, WA, USA, 3866–3874. https://doi.org/10.1109/CVPRW50498.2020.00450
[77]
Yujia Wang, Wei Liang, Haikun Huang, Yongqi Zhang, Dingzeyu Li, and Lap-Fai Yu. 2021. Toward Automatic Audio Description Generation for Accessible Videos. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. ACM, Yokohama Japan, 1–12. https://doi.org/10.1145/3411764.3445347
[78]
Yanan Wang, Ruobin Wang, Crescentia Jung, and Yea-Seul Kim. 2022. What makes web data tables accessible? Insights and a tool for rendering accessible tables for people with visual impairments. In CHI Conference on Human Factors in Computing Systems. ACM, New Orleans LA USA, 1–20. https://doi.org/10.1145/3491102.3517469
[79]
World Wide Web Consortium (W3C). 2022. Making Audio and Video Media Accessible. https://www.w3.org/WAI/media/av/
[80]
World Wide Web Consortium (W3C). 2022. W3C Image Concepts. https: //www.w3.org/WAI/tutorials/images/
[81]
Bosun Xie. 2013. Head-Related Transfer Function and Virtual Auditory Display: Second Edition. J. Ross Publishing. Google-Books-ID: fvDLCgAAQBAJ.
[82]
Mingrui Ray Zhang, Mingyuan Zhong, and Jacob O. Wobbrock. 2022. Ga11y: An Automated GIF Annotation System for Visually Impaired Users. In CHI Conference on Human Factors in Computing Systems. ACM, New Orleans LA USA, 1–16. https://doi.org/10.1145/3491102.3502092

Cited By

View all
  • (2024)MIMOSA: Human-AI Co-Creation of Computational Spatial Audio Effects on VideosProceedings of the 16th Conference on Creativity & Cognition10.1145/3635636.3656189(156-169)Online publication date: 23-Jun-2024
  • (2024)SPICA: Interactive Video Content Exploration through Augmented Audio Descriptions for Blind or Low-Vision ViewersProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642632(1-18)Online publication date: 11-May-2024
  • (2024)“It’s Kind of Context Dependent”: Understanding Blind and Low Vision People’s Video Accessibility Preferences Across Viewing ScenariosProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642238(1-20)Online publication date: 11-May-2024

Index Terms

  1. Front Row: Automatically Generating Immersive Audio Representations of Tennis Broadcasts for Blind Viewers

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology
      October 2023
      1825 pages
      ISBN:9798400701320
      DOI:10.1145/3586183
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 29 October 2023

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Visual impairments
      2. accessibility
      3. computer vision
      4. sports

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Funding Sources

      Conference

      UIST '23

      Acceptance Rates

      Overall Acceptance Rate 842 of 3,967 submissions, 21%

      Upcoming Conference

      UIST '24

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)286
      • Downloads (Last 6 weeks)24
      Reflects downloads up to 21 Sep 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)MIMOSA: Human-AI Co-Creation of Computational Spatial Audio Effects on VideosProceedings of the 16th Conference on Creativity & Cognition10.1145/3635636.3656189(156-169)Online publication date: 23-Jun-2024
      • (2024)SPICA: Interactive Video Content Exploration through Augmented Audio Descriptions for Blind or Low-Vision ViewersProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642632(1-18)Online publication date: 11-May-2024
      • (2024)“It’s Kind of Context Dependent”: Understanding Blind and Low Vision People’s Video Accessibility Preferences Across Viewing ScenariosProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642238(1-20)Online publication date: 11-May-2024

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media