Front Row: Automatically Generating Immersive Audio Representations of Tennis Broadcasts for Blind Viewers

Authors:

Connor Courtien,

Xin Yi Therese Xu,

Michael Malcolm,

Brian A. Smith

UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology

Article No.: 39, Pages 1 - 17

https://doi.org/10.1145/3586183.3606830

Published: 29 October 2023

Abstract

Blind and low-vision (BLV) people face challenges watching sports due to the lack of accessibility of sports broadcasts. Currently, BLV people rely on descriptions from TV commentators, radio announcers, or their friends to understand the game. These descriptions, however, do not allow BLV viewers to visualize the action by themselves. We present Front Row, a system that automatically generates an immersive audio representation of sports broadcasts, specifically tennis, allowing BLV viewers to more directly perceive what is happening in the game. Front Row first recognizes gameplay from the video feed using computer vision, then renders players’ positions and shots via spatialized (3D) audio cues. User evaluations with 12 BLV participants show that Front Row gives BLV viewers a more accurate understanding of the game compared to TV and radio, enabling viewers to form their own opinions on players’ moods and strategies. We discuss future implications of Front Row and illustrate several applications, including a Front Row plug-in for video streaming platforms to enable BLV people to visualize the action in sports videos across the Web.
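
As a rough illustration of the second stage described in the abstract (rendering a tracked court position as a spatialized audio cue), the following minimal sketch maps a point on the court to an azimuth and then to left/right stereo gains using constant-power panning. It is not the Front Row implementation: the listener position, the function names `court_to_azimuth` and `constant_power_gains`, and the use of simple stereo panning in place of full 3D audio rendering are all assumptions made for illustration.

```python
import math

# Standard singles court dimensions in metres.
COURT_LENGTH = 23.77
COURT_WIDTH = 8.23

def court_to_azimuth(x, y, listener=(COURT_WIDTH / 2, -3.0)):
    """Azimuth (radians) of court point (x, y) as heard from the listener.

    Assumption: the listener sits 3 m behind the near baseline, centred,
    facing the far baseline. 0 is straight ahead, positive is to the right.
    """
    dx = x - listener[0]
    dy = y - listener[1]
    return math.atan2(dx, dy)

def constant_power_gains(azimuth, max_azimuth=math.pi / 2):
    """Map an azimuth to (left, right) gains with constant-power panning."""
    # Clamp the azimuth, then normalise it to a pan position in [0, 1].
    a = max(-max_azimuth, min(max_azimuth, azimuth))
    pan = (a / max_azimuth + 1) / 2
    # cos/sin keep left^2 + right^2 == 1, so loudness stays constant.
    theta = pan * math.pi / 2
    return math.cos(theta), math.sin(theta)

if __name__ == "__main__":
    # Hypothetical tracked positions: (label, x, y) in court coordinates.
    for label, x, y in [("near player", 1.0, 1.0), ("far player", 7.0, 22.0)]:
        left, right = constant_power_gains(court_to_azimuth(x, y))
        print(f"{label}: left={left:.2f} right={right:.2f}")
```

A production system would more likely drive an HRTF-based spatializer so that distance and elevation are also conveyed; the panning law here only captures left/right placement.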

Cited By

Ning Z, Zhang Z, Ban J, Jiang K, Gan R, Tian Y, and Li T. 2024. MIMOSA: Human-AI Co-Creation of Computational Spatial Audio Effects on Videos. In Proceedings of the 16th Conference on Creativity & Cognition, 156–169. Online publication date: 23-Jun-2024. https://dl.acm.org/doi/10.1145/3635636.3656189

Ning Z, Wimer B, Jiang K, Chen K, Ban J, Tian Y, Zhao Y, and Li T. 2024. SPICA: Interactive Video Content Exploration through Augmented Audio Descriptions for Blind or Low-Vision Viewers. In Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems, 1–18. Online publication date: 11-May-2024. https://dl.acm.org/doi/10.1145/3613904.3642632

Jiang L, Jung C, Phutane M, Stangl A, and Azenkot S. 2024. “It’s Kind of Context Dependent”: Understanding Blind and Low Vision People’s Video Accessibility Preferences Across Viewing Scenarios. In Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems, 1–20. Online publication date: 11-May-2024. https://dl.acm.org/doi/10.1145/3613904.3642238

Index Terms

Front Row: Automatically Generating Immersive Audio Representations of Tennis Broadcasts for Blind Viewers
1. Human-centered computing
  1. Accessibility
    1. Accessibility systems and tools
    2. Accessibility technologies

Published In

UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology

October 2023

1825 pages

ISBN:9798400701320

DOI:10.1145/3586183

Editors: Sean Follmer (Stanford University, USA), Jeff Han, Jürgen Steimle (Saarland University, Germany), Nathalie Henry Riche (Microsoft Research, USA)

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

National Science Foundation

Conference

UIST '23

UIST '23: The 36th Annual ACM Symposium on User Interface Software and Technology

October 29 - November 1, 2023

San Francisco, CA, USA

Acceptance Rates

Overall Acceptance Rate 842 of 3,967 submissions, 21%
