Search
Search Results
-
A new speaker-diarization technology with denoising spectral-LSTM for online automatic multi-dialogue recording
In AI pandemic applications, the online automatic AI recording apparatus for official councils such as court trials, business conferences and...
-
Spoken dialog summarization system with HAPPINESS/SUFFERING factor recognition
This work presents a spoken dialog summarization system with HAPPINESS/SUFFERING factor recognition. The semantic content is compressed and...
-
Hemodialysis vascular access stenosis detection using auditory spectro-temporal features of phonoangiography
For end-stage renal disease patients undergoing hemodialysis, thrombosis caused by stenosis hinders the long-term use of vascular access. However,...
-
Computer-Assisted Auscultation: Patent Ductus Arteriosus Detection Based on Auditory Time–frequency Analysis
This study presents a computer-assisted auscultation approach for patent ductus arteriosus (PDA) detection. PDA is a frequent congenital heart...
-
Second Heart Sound (S2) Decomposition by Hilbert Vibration Decomposition (HVD) for Affective Signal Modeling and Learning
This article presents a novel signal decomposition method, Hilbert vibration decomposition (HVD), for analyzing one of the major heart sound... -
Multivoxel analysis for functional magnetic resonance imaging (fMRI) based on time-series and contextual information: relationship between maternal love and brain regions as a case study
This study explores the relationship between maternal love and brain regions by using functional magnetic resonance imaging (fMRI). Also, a novel...
-
Assistive Listening System Using a Human-Like Auditory Processing Algorithm
Enhancing the quality of hearing perception in noisy environments plays a significant role to improve life quality of elderly persons and hearing... -
Advances in Web-Based Learning -- ICWL 2013 12th International Conference, Kenting, Taiwan, October 6-9, 2013, Proceedings
This book constitutes the refereed proceedings of the 12th International Conference on Web-Based Learning, ICWL 2013, held in Kenting, Taiwan, in... -
Speech-driven talking face using embedded confusable system for real time mobile multimedia
This paper presents a real-time speech-driven talking face system which provides low computational complexity and smoothly visual sense. A novel...
-
Novel Mutual Information Analysis of Attentive Motion Entropy Algorithm for Sports Video Summarization
This study presents a novel summarization method, which utilizes attentive motion analysis, mutual information, and segmental spectro-temporal... -
Enhanced long-range personal identification based on multimodal information of human features
This work presents an enhanced long-range personal identification scheme using multimodal information of human features. Multimodal information...
-
User-centric incremental learning model of dynamic personal identification for mobile devices
This study presents a user-centric incremental learning model based on the proposed output selection strategy (OSS) and multiview body direction...
-
Blind Signal Separation with Speech Enhancement
A new speech enhancement architecture using convolutive blind signal separation (CBSS) and subspace-based speech enhancement is presented. The... -
A Framework Design for Human-Robot Interaction
Multimodal human-robot interaction integrates various physical communication channels for face-to-face interaction. However, face-to-face interaction... -
Real World Speech Processing
Real World Speech Processingbrings together in one place important contributions and up-to-date research results in this fast-moving area. The... -
SVM-Based Sound Classification Based on MPEG-7 Audio LLDs and Related Enhanced Features
In this paper, we present a support vector machine (SVM) based sound classifier using MPEG-7 audio low-level descriptors and related enhanced... -
Kernel-Based Lip Shape Clustering with Phoneme Recognition for Real-Time Voice Driven Talking Face
This work describes a real-time voice driven method using which a speaker’s lip shape is synchronized with the corresponding speech signal, for a low... -
Dynamic Fixed-Point Arithmetic Design of Embedded SVM-Based Speaker Identification System
This work proposes a dynamic fixed-point arithmetic design for SVM-based speaker identification in embedded environment. The whole speaker... -
Sports Video Summarization Based on Salient Motion Entropy and Information Analysis
In this study, we presented a novel summarization method for generating sports video abstracts, which utilized motion entropy analysis and mutual...