Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–1 of 1 results for author: Yoo, K M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2402.05706  [pdf, other

    cs.CL cs.SD eess.AS

    Integrating Paralinguistics in Speech-Empowered Large Language Models for Natural Conversation

    Authors: Heeseung Kim, Soonshin Seo, Kyeongseok Jeong, Ohsung Kwon, Soyoon Kim, Jungwhan Kim, Jaehong Lee, Eunwoo Song, Myungwoo Oh, Jung-Woo Ha, Sungroh Yoon, Kang Min Yoo

    Abstract: Recent work shows promising results in expanding the capabilities of large language models (LLM) to directly understand and synthesize speech. However, an LLM-based strategy for modeling spoken dialogs remains elusive, calling for further investigation. This paper introduces an extensive speech-text LLM framework, the Unified Spoken Dialog Model (USDM), designed to generate coherent spoken respons… ▽ More

    Submitted 26 August, 2024; v1 submitted 8 February, 2024; originally announced February 2024.