Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleMay 2024
Stress Classification Model Using Speech: An Ambulatory Protocol-Based Database Study
Artificial Intelligence for Neuroscience and Emotional SystemsPages 245–252https://doi.org/10.1007/978-3-031-61140-7_24AbstractChronic stress poses a significant risk to health, potentially leading to long-term diseases such as cancer and diabetes. Analyzing stress through speech presents a promising avenue, as it offers accessibility and scalability using only a ...
- research-articleJune 2024
Singing for the Missing: Bringing the Body Back to AI Voice and Speech Technologies
MOCO '24: Proceedings of the 9th International Conference on Movement and ComputingArticle No.: 2, Pages 1–12https://doi.org/10.1145/3658852.3659065Technological advancements in deep learning for speech and voice have contributed to a recent expansion in applications for voice cloning, synthesis and generation. Invisibilised stakeholders in this expansion are numerous absent bodies, whose voices ...
- research-articleMay 2024Honorable Mention
Seeking Soulmate via Voice: Understanding Promises and Challenges of Online Synchronized Voice-Based Mobile Dating
CHI '24: Proceedings of the 2024 CHI Conference on Human Factors in Computing SystemsArticle No.: 921, Pages 1–14https://doi.org/10.1145/3613904.3642860Online dating has become a popular way for individuals to connect with potential romantic partners. Many dating apps use personal profiles that include a headshot and self-description, allowing users to present themselves and search for compatible ...
- research-articleMay 2024
Uncovering Human Traits in Determining Real and Spoofed Audio: Insights from Blind and Sighted Individuals
CHI '24: Proceedings of the 2024 CHI Conference on Human Factors in Computing SystemsArticle No.: 949, Pages 1–14https://doi.org/10.1145/3613904.3642817This paper explores how blind and sighted individuals perceive real and spoofed audio, highlighting differences and similarities between the groups. Through two studies, we find that both groups focus on specific human traits in audio–such as accents, ...
- research-articleAugust 2024
Jasay: Towards Voice Commands in Projectional Editors
IDE '24: Proceedings of the 1st ACM/IEEE Workshop on Integrated Development EnvironmentsPages 30–34https://doi.org/10.1145/3643796.3648449Permanent disabilities or temporary injuries (e.g., RSI) hinder the activity of writing code. The interaction modality of voice is a viable substitute or complement for typing on a keyboard. This paper describes the design of Jasay, a prototype tool that ...
-
- short-paperMarch 2024Best Student Paper
Development of a Socially Cognizant Robotic Campus Guide
- Benjamin Greenberg,
- Daniel Nakhimovich,
- Richard Magnotti,
- Hriday Purohit,
- Sanskar Shah,
- Aniket Satish Kulkarni,
- Uriel Gonzalez-Bravo,
- Noah R. Carver
HRI '24: Companion of the 2024 ACM/IEEE International Conference on Human-Robot InteractionPages 1229–1232https://doi.org/10.1145/3610978.3641263A robotic system to help lost students find their way around a college campus was designed, built, and tested. Socially cognizant design practices, including stakeholder engagement, and interdisciplinary team-building, were practiced. Users can interact ...
- research-articleMay 2024
An implementation of searchable video player
International Journal of Computational Vision and Robotics (IJCVR), Volume 14, Issue 3Pages 325–337https://doi.org/10.1504/ijcvr.2024.138324This paper introduces an Android app, SVPlayer, that searches for scenes in a video. To search for scenes in a video, SVPlayer extracts voice from the video, converts it into text, and searches for words in the text. Voice is converted to text in units ...
- posterDecember 2023
Effects of Presentation Modalities in Virtual Museum Guides on Agent Impressions and Painting Evaluations.
HAI '23: Proceedings of the 11th International Conference on Human-Agent InteractionPages 446–448https://doi.org/10.1145/3623809.3623958As virtual experiences have become more common in recent years, guide services in virtual spaces are expected to become more popular. In this study, we examined the effect of the modalities of agents guiding visitors while viewing paintings in an online ...
- research-articleNovember 2023
Voice-Face Homogeneity Tells Deepfake
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 20, Issue 3Article No.: 76, Pages 1–22https://doi.org/10.1145/3625231Detecting forgery videos is highly desirable due to the abuse of deepfake. Existing detection approaches contribute to exploring the specific artifacts in deepfake videos and fit well on certain data. However, the growing technique on these artifacts ...
- research-articleOctober 2023
Rethinking Voice-Face Correlation: A Geometry View
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 2458–2467https://doi.org/10.1145/3581783.3611779Previous works on voice-face matching and voice-guided face synthesis demonstrate strong correlations between voice and face, but mainly rely on coarse semantic cues such as gender, age, and emotion. In this paper, we aim to investigate the capability of ...
- research-articleOctober 2023
Benefits of Community Voice: A Framework for Understanding Inclusion of Community Voice in HCI4D
Proceedings of the ACM on Human-Computer Interaction (PACMHCI), Volume 7, Issue CSCW2Article No.: 325, Pages 1–26https://doi.org/10.1145/3610174Community voice is widely used in computer-supported cooperative work (CSCW) and human-computer interaction (HCI) work with underserved communities. However, the term is unresolved, denoting disparate activities, methods, and phenomena that are at their ...
- research-articleOctober 2023
AI Consent Futures: A Case Study on Voice Data Collection with Clinicians
Proceedings of the ACM on Human-Computer Interaction (PACMHCI), Volume 7, Issue CSCW2Article No.: 316, Pages 1–30https://doi.org/10.1145/3610107As new forms of data capture emerge to power new AI applications, questions abound about the ethical implications of these data collection practices. In this paper, we present clinicians' perspectives on the prospective benefits and harms of voice data ...
- ArticleJuly 2023
Effects of Visual and Personality Impressions on the Voices Matched to Animated Characters
Human Interface and the Management of InformationPages 431–444https://doi.org/10.1007/978-3-031-35132-7_33AbstractThis paper analyzes the relationship between the visual aspects of characters and the voice properties. Experiments indicate that humans employ different sets of features to evaluate voice impressions when illustrated characters are displayed in ...
- research-articleApril 2023
Corsetto: A Kinesthetic Garment for Designing, Composing for, and Experiencing an Intersubjective Haptic Voice
- Ozgun Kilic Afsar,
- Yoav Luft,
- Kelsey Cotton,
- Ekaterina R. Stepanova,
- Claudia Núñez-Pacheco,
- Rebecca Kleinberger,
- Fehmi Ben Abdesslem,
- Hiroshi Ishii,
- Kristina Höök
CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing SystemsArticle No.: 181, Pages 1–23https://doi.org/10.1145/3544548.3581294We present a novel intercorporeal experience – an intersubjective haptic voice. Through an autobiographical design inquiry, based on singing techniques from the classical opera tradition, we created Corsetto, a kinesthetic garment for transferring ...
- research-articleMarch 2023
Guiding Oral Conversations: How to Nudge Users Towards Asking Questions?
CHIIR '23: Proceedings of the 2023 Conference on Human Information Interaction and RetrievalPages 34–42https://doi.org/10.1145/3576840.3578291How could an envisioned voice-based conversational information system assist the information seeker when the seeker does not know how to continue the conversation? The system could explicitly suggest a question to ask after each of its responses, but ...
- extended-abstractOctober 2022
Read Your Voice: A Playful Interactive Sound Encoder/Decoder
MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 7243–7244https://doi.org/10.1145/3503161.3549974Read Your Voice is a playful interactive multimedia system that allows the user to record a sound, encode it as an image, and then play it back using his smartphone, while controlling the speed and direction of playback.
- research-articleApril 2022
Expressive Auditory Gestures in a Voice-Based Pedagogical Agent
CHI '22: Proceedings of the 2022 CHI Conference on Human Factors in Computing SystemsArticle No.: 163, Pages 1–13https://doi.org/10.1145/3491102.3517599In this paper, we explore how expressive auditory gestures added to the speech of a pedagogical agent influence the human-agent relationship and learning outcomes. In a between-subjects experiment, 41 participants assumed the role of a tutor to teach a ...
- extended-abstractApril 2022
Feminist Voices about Ecological Issues in HCI
- Marie Louise Juul Søndergaard,
- Gopinaath Kannabiran,
- Simran Chopra,
- Nadia Campo Woytuk,
- Dilrukshi Gamage,
- Ebtisam Alabdulqader,
- Heather McKinnon,
- Heike Winschiers-Theophilus,
- Shaowen Bardzell
CHI EA '22: Extended Abstracts of the 2022 CHI Conference on Human Factors in Computing SystemsArticle No.: 90, Pages 1–7https://doi.org/10.1145/3491101.3503717Even though issues such as climate change, pollution, and declining biodiversity impact us all, people with historically disenfranchised and socio-politically marginalized (HDSM) identities often bear the harsher brunt of ecological crises and suffer ...
- research-articleApril 2022
Fostering Engagement of Underserved Communities with Credible Health Information on Social Media
- Agha Ali Raza,
- Mustafa Naseem,
- Namoos Hayat Qasmi,
- Shan Randhawa,
- Fizzah Malik,
- Behzad Taimur,
- Sacha St-Onge Ahmad,
- Sarojini Hirshleifer,
- Arman Rezaee,
- Aditya Vashistha
WWW '22: Proceedings of the ACM Web Conference 2022Pages 3718–3727https://doi.org/10.1145/3485447.3512267The COVID-19 pandemic has necessitated rapid top-down dissemination of reliable and actionable information. This presents unique challenges in engaging low-literate communities that live in poverty and lack access to the Internet. We describe the design ...
- research-articleMarch 2022
Robo-Identity: Exploring Artificial Identity and Emotion via Speech Interactions
- Guy Laban,
- Sebastien Le Maguer,
- Minha Lee,
- Dimosthenis Kontogiorgos,
- Samantha Reig,
- Ilaria Torre,
- Ravi Tejwani,
- Matthew J. Dennis,
- Andre Pereira
HRI '22: Proceedings of the 2022 ACM/IEEE International Conference on Human-Robot InteractionPages 1265–1268Following the success of the first edition of Robo-Identity, the second edition will provide an opportunity to expand the discussion about artificial identity. This year, we are focusing on emotions that are expressed through speech and voice. Synthetic ...