Keyword: voice : Search

Article

Stress Classification Model Using Speech: An Ambulatory Protocol-Based Database Study

Artificial Intelligence for Neuroscience and Emotional SystemsPages 245–252https://doi.org/10.1007/978-3-031-61140-7_24

Abstract

Chronic stress poses a significant risk to health, potentially leading to long-term diseases such as cancer and diabetes. Analyzing stress through speech presents a promising avenue, as it offers accessibility and scalability using only a ...

research-article

Open Access

Singing for the Missing: Bringing the Body Back to AI Voice and Speech Technologies

MOCO '24: Proceedings of the 9th International Conference on Movement and ComputingArticle No.: 2, Pages 1–12https://doi.org/10.1145/3658852.3659065

Technological advancements in deep learning for speech and voice have contributed to a recent expansion in applications for voice cloning, synthesis and generation. Invisibilised stakeholders in this expansion are numerous absent bodies, whose voices ...

research-article

Honorable Mention

Seeking Soulmate via Voice: Understanding Promises and Challenges of Online Synchronized Voice-Based Mobile Dating

CHI '24: Proceedings of the 2024 CHI Conference on Human Factors in Computing SystemsArticle No.: 921, Pages 1–14https://doi.org/10.1145/3613904.3642860

Online dating has become a popular way for individuals to connect with potential romantic partners. Many dating apps use personal profiles that include a headshot and self-description, allowing users to present themselves and search for compatible ...

research-article

Uncovering Human Traits in Determining Real and Spoofed Audio: Insights from Blind and Sighted Individuals

CHI '24: Proceedings of the 2024 CHI Conference on Human Factors in Computing SystemsArticle No.: 949, Pages 1–14https://doi.org/10.1145/3613904.3642817

This paper explores how blind and sighted individuals perceive real and spoofed audio, highlighting differences and similarities between the groups. Through two studies, we find that both groups focus on specific human traits in audio–such as accents, ...

research-article

Open Access

Jasay: Towards Voice Commands in Projectional Editors

IDE '24: Proceedings of the 1st ACM/IEEE Workshop on Integrated Development EnvironmentsPages 30–34https://doi.org/10.1145/3643796.3648449

Permanent disabilities or temporary injuries (e.g., RSI) hinder the activity of writing code. The interaction modality of voice is a viable substitute or complement for typing on a keyboard. This paper describes the design of Jasay, a prototype tool that ...

short-paper

Open Access

Best Student Paper

Development of a Socially Cognizant Robotic Campus Guide

HRI '24: Companion of the 2024 ACM/IEEE International Conference on Human-Robot InteractionPages 1229–1232https://doi.org/10.1145/3610978.3641263

A robotic system to help lost students find their way around a college campus was designed, built, and tested. Socially cognizant design practices, including stakeholder engagement, and interdisciplinary team-building, were practiced. Users can interact ...

research-article

An implementation of searchable video player

International Journal of Computational Vision and Robotics (IJCVR), Volume 14, Issue 3Pages 325–337https://doi.org/10.1504/ijcvr.2024.138324

This paper introduces an Android app, SVPlayer, that searches for scenes in a video. To search for scenes in a video, SVPlayer extracts voice from the video, converts it into text, and searches for words in the text. Voice is converted to text in units ...

poster

Effects of Presentation Modalities in Virtual Museum Guides on Agent Impressions and Painting Evaluations.

Mari Saito

HAI '23: Proceedings of the 11th International Conference on Human-Agent InteractionPages 446–448https://doi.org/10.1145/3623809.3623958

As virtual experiences have become more common in recent years, guide services in virtual spaces are expected to become more popular. In this study, we examined the effect of the modalities of agents guiding visitors while viewing paintings in an online ...

research-article

Voice-Face Homogeneity Tells Deepfake

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 20, Issue 3Article No.: 76, Pages 1–22https://doi.org/10.1145/3625231

Detecting forgery videos is highly desirable due to the abuse of deepfake. Existing detection approaches contribute to exploring the specific artifacts in deepfake videos and fit well on certain data. However, the growing technique on these artifacts ...

research-article

Rethinking Voice-Face Correlation: A Geometry View

MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 2458–2467https://doi.org/10.1145/3581783.3611779

Previous works on voice-face matching and voice-guided face synthesis demonstrate strong correlations between voice and face, but mainly rely on coarse semantic cues such as gender, age, and emotion. In this paper, we aim to investigate the capability of ...

research-article

Open Access

Benefits of Community Voice: A Framework for Understanding Inclusion of Community Voice in HCI4D

Proceedings of the ACM on Human-Computer Interaction (PACMHCI), Volume 7, Issue CSCW2Article No.: 325, Pages 1–26https://doi.org/10.1145/3610174

Community voice is widely used in computer-supported cooperative work (CSCW) and human-computer interaction (HCI) work with underserved communities. However, the term is unresolved, denoting disparate activities, methods, and phenomena that are at their ...

research-article

Open Access

AI Consent Futures: A Case Study on Voice Data Collection with Clinicians

Proceedings of the ACM on Human-Computer Interaction (PACMHCI), Volume 7, Issue CSCW2Article No.: 316, Pages 1–30https://doi.org/10.1145/3610107

As new forms of data capture emerge to power new AI applications, questions abound about the ethical implications of these data collection practices. In this paper, we present clinicians' perspectives on the prospective benefits and harms of voice data ...

Article

Effects of Visual and Personality Impressions on the Voices Matched to Animated Characters

Human Interface and the Management of InformationPages 431–444https://doi.org/10.1007/978-3-031-35132-7_33

Abstract

This paper analyzes the relationship between the visual aspects of characters and the voice properties. Experiments indicate that humans employ different sets of features to evaluate voice impressions when illustrated characters are displayed in ...

research-article

Open Access

Corsetto: A Kinesthetic Garment for Designing, Composing for, and Experiencing an Intersubjective Haptic Voice

CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing SystemsArticle No.: 181, Pages 1–23https://doi.org/10.1145/3544548.3581294

We present a novel intercorporeal experience – an intersubjective haptic voice. Through an autobiographical design inquiry, based on singing techniques from the classical opera tradition, we created Corsetto, a kinesthetic garment for transferring ...

research-article

Open Access

Guiding Oral Conversations: How to Nudge Users Towards Asking Questions?

CHIIR '23: Proceedings of the 2023 Conference on Human Information Interaction and RetrievalPages 34–42https://doi.org/10.1145/3576840.3578291

How could an envisioned voice-based conversational information system assist the information seeker when the seeker does not know how to continue the conversation? The system could explicitly suggest a question to ask after each of its responses, but ...

extended-abstract

Read Your Voice: A Playful Interactive Sound Encoder/Decoder

MM '22: Proceedings of the 30th ACM International Conference on MultimediaPages 7243–7244https://doi.org/10.1145/3503161.3549974

Read Your Voice is a playful interactive multimedia system that allows the user to record a sound, encode it as an image, and then play it back using his smartphone, while controlling the speed and direction of playback.

research-article

Expressive Auditory Gestures in a Voice-Based Pedagogical Agent

CHI '22: Proceedings of the 2022 CHI Conference on Human Factors in Computing SystemsArticle No.: 163, Pages 1–13https://doi.org/10.1145/3491102.3517599

In this paper, we explore how expressive auditory gestures added to the speech of a pedagogical agent influence the human-agent relationship and learning outcomes. In a between-subjects experiment, 41 participants assumed the role of a tutor to teach a ...

extended-abstract

Feminist Voices about Ecological Issues in HCI

CHI EA '22: Extended Abstracts of the 2022 CHI Conference on Human Factors in Computing SystemsArticle No.: 90, Pages 1–7https://doi.org/10.1145/3491101.3503717

Even though issues such as climate change, pollution, and declining biodiversity impact us all, people with historically disenfranchised and socio-politically marginalized (HDSM) identities often bear the harsher brunt of ecological crises and suffer ...

research-article

Fostering Engagement of Underserved Communities with Credible Health Information on Social Media

WWW '22: Proceedings of the ACM Web Conference 2022Pages 3718–3727https://doi.org/10.1145/3485447.3512267

The COVID-19 pandemic has necessitated rapid top-down dissemination of reliable and actionable information. This presents unique challenges in engaging low-literate communities that live in poverty and lack access to the Internet. We describe the design ...

research-article

Robo-Identity: Exploring Artificial Identity and Emotion via Speech Interactions

HRI '22: Proceedings of the 2022 ACM/IEEE International Conference on Human-Robot InteractionPages 1265–1268

Following the success of the first edition of Robo-Identity, the second edition will provide an opportunity to expand the discussion about artificial identity. This year, we are focusing on emotions that are expressed through speech and voice. Synthetic ...

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Paper Award

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder

Upcoming Conferences