default search action
Masashi Unoki
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j49]Anuwat Chaiwongyen, Suradej Duangpummet, Jessada Karnjana, Waree Kongprawechnon, Masashi Unoki:
Potential of Speech-Pathological Features for Deepfake Speech Detection. IEEE Access 12: 121958-121970 (2024) - [j48]Khalid Zaman, Islam J. A. M. Samiul, Melike Sah, Cem Direkoglu, Shogo Okada, Masashi Unoki:
Hybrid Transformer Architectures With Diverse Audio Features for Deepfake Speech Classification. IEEE Access 12: 149221-149237 (2024) - [j47]Yuning Liu, Di Zhou, Aijun Li, Jianwu Dang, Shogo Okada, Masashi Unoki:
Investigation of Social Factor in Conversational Entrainments. IEEE Access 12: 165507-165524 (2024) - [j46]Nan Li, Longbiao Wang, Meng Ge, Masashi Unoki, Sheng Li, Jianwu Dang:
Robust voice activity detection using an auditory-inspired masked modulation encoder based convolutional attention network. Speech Commun. 157: 103024 (2024) - [i6]Kai Li, Khalid Zaman, Xingfeng Li, Masato Akagi, Masashi Unoki:
Machine Anomalous Sound Detection Using Spectral-temporal Modulation Representations Derived from Machine-specific Filterbanks. CoRR abs/2409.05319 (2024) - [i5]Kai Li, Masato Akagi, Yongwei Li, Masashi Unoki:
Modeling and Estimation of Vocal Tract and Glottal Source Parameters Using ARMAX-LF Model. CoRR abs/2410.04704 (2024) - 2023
- [j45]Lijun Wang, Suradej Duangpummet, Masashi Unoki:
Blind Estimation of Speech Transmission Index and Room Acoustic Parameters by Using Extended Model of Room Impulse Response Derived From Speech Signals. IEEE Access 11: 49431-49444 (2023) - [j44]Yasuji Ota, Masashi Unoki:
Anomalous Sound Detection for Industrial Machines Using Acoustical Features Related to Timbral Metrics. IEEE Access 11: 70884-70897 (2023) - [j43]Kai Li, Xugang Lu, Masato Akagi, Masashi Unoki:
Contributions of Jitter and Shimmer in the Voice for Fake Audio Detection. IEEE Access 11: 84689-84698 (2023) - [j42]Khalid Zaman, Melike Sah, Cem Direkoglu, Masashi Unoki:
A Survey of Audio Classification Using Deep Learning. IEEE Access 11: 106620-106649 (2023) - [j41]Huy Nguyen, Tuan Vu Ho, Masato Akagi, Masashi Unoki:
Phase-Aware Speech Enhancement With Complex Wiener Filter. IEEE Access 11: 141573-141584 (2023) - [j40]Candy Olivia Mawalim, Shogo Okada, Yukiko I. Nakano, Masashi Unoki:
Personality trait estimation in group discussions using multimodal analysis and speaker embedding. J. Multimodal User Interfaces 17(2): 47-63 (2023) - [j39]Yang Liu, Haoqin Sun, Wenbo Guan, Yuqi Xia, Yongwei Li, Masashi Unoki, Zhen Zhao:
A Discriminative Feature Representation Method Based on Cascaded Attention Network With Adversarial Strategy for Speech Emotion Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1063-1074 (2023) - [j38]Xingfeng Li, Xiaohan Shi, Desheng Hu, Yongwei Li, Qingchen Zhang, Zhengxia Wang, Masashi Unoki, Masato Akagi:
Music Theory-Inspired Acoustic Representation for Speech Emotion Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2534-2547 (2023) - [j37]Weitao Yuan, Shengbei Wang, Jianming Wang, Masashi Unoki, Wenwu Wang:
Unsupervised Deep Unfolded Representation Learning for Singing Voice Separation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3206-3220 (2023) - [c122]Dung Kim Tran, Masato Akagi, Masashi Unoki:
Increasing Speech Intelligibility by Mimicking Professional Announcers' Voices and Its Physical Correlates. APSIPA ASC 2023: 1187-1192 - [c121]Xiajie Zhou, Candy Olivia Mawalim, Benita Angela Titalim, Masashi Unoki:
Incorporating the Digit Triplet Test in A Lightweight Speech Intelligibility Prediction for Hearing Aids. APSIPA ASC 2023: 1593-1600 - [c120]Haowei Cheng, Candy Olivia Mawalim, Kai Li, Lijun Wang, Masashi Unoki:
Analysis of Spectro-Temporal Modulation Representation for Deep-Fake Speech Detection. APSIPA ASC 2023: 1822-1829 - [c119]Anuwat Chaiwongyen, Suradej Duangpummet, Jessada Karnjana, Waree Kongprawechnon, Masashi Unoki:
Deepfake-speech Detection with Pathological Features and Multilayer Perceptron Neural Network. APSIPA ASC 2023: 2182-2188 - [c118]Taiyang Guo, Sixia Li, Shunsuke Kidani, Shogo Okada, Masashi Unoki:
Contribution of modulation spectral features for cross-lingual speech emotion recognition under noisy reverberant conditions. APSIPA ASC 2023: 2221-2227 - [c117]Kai Li, Dung Kim Tran, Xugang Lu, Masato Akagi, Masashi Unoki:
Data-driven Non-uniform Filterbanks Based on F-ratio for Machine Anomalous Sound Detection. EUSIPCO 2023: 201-205 - [c116]Candy Olivia Mawalim, Benita Angela Titalim, Shogo Okada, Masashi Unoki:
Auditory Model Optimization with Wavegram-CNN and Acoustic Parameter Models for Nonintrusive Speech Intelligibility Prediction in Hearing Aids. EUSIPCO 2023: 211-215 - [c115]Weitao Yuan, Yuren Bian, Shengbei Wang, Masashi Unoki, Wenwu Wang:
An Improved Optimal Transport Kernel Embedding Method with Gating Mechanism for Singing Voice Separation and Speaker Identification. ICASSP 2023: 1-5 - [c114]Yasufumi Uezu, Sicheng Wang, Teruki Toya, Masashi Unoki:
Consonant-emphasis Method Incorporating Robust Consonant-section Detection to Improve Intelligibility of Bone-conducted speech. INTERSPEECH 2023: 849-853 - [c113]Quoc-Huy Nguyen, Nguyen Le Minh, Masashi Unoki:
Speaker Verification Using Distance Based on Principal Component Analysis for Household Scenario Adaptation. RIVF 2023: 441-446 - [c112]Phan Thi Ha Duong, Vo Nguyen Quoc Bao, Masashi Unoki:
RIVF'23 Message from Host University. RIVF 2023: xii-xiii - [i4]Takuto Isoyama, Shunsuke Kidani, Masashi Unoki:
Computational models of sound-quality metrics using method for calculating loudness with gammatone/gammachirp auditory filterbank. CoRR abs/2305.13213 (2023) - 2022
- [j36]Candy Olivia Mawalim, Kasorn Galajit, Jessada Karnjana, Shunsuke Kidani, Masashi Unoki:
Speaker anonymization by modifying fundamental frequency and x-vector singular value. Comput. Speech Lang. 73: 101326 (2022) - [j35]Takuto Isoyama, Shunsuke Kidani, Masashi Unoki:
Blind Speech Watermarking Method with Frame Self-Synchronization Based on Spread-Spectrum Using Linear Prediction Residue. Entropy 24(5): 677 (2022) - [j34]Di Zhou, Gaoyan Zhang, Jianwu Dang, Masashi Unoki, Xin Liu:
Detection of Brain Network Communities During Natural Speech Comprehension From Functionally Aligned EEG Sources. Frontiers Comput. Neurosci. 16 (2022) - [c111]Kai Li, Quoc-Huy Nguyen, Yasuji Ota, Masashi Unoki:
Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Using Temporal Modulation Features on Gammatone Auditory Filterbank. DCASE 2022 - [c110]Quoc-Huy Nguyen, Masashi Unoki:
Bone-conducted Speech Enhancement Using Vector-quantized Variational Autoencoder and Gammachirp Filterbank Cepstral Coefficients. EUSIPCO 2022: 21-25 - [c109]Kai Li, Xugang Lu, Masato Akagi, Jianwu Dang, Sheng Li, Masashi Unoki:
Relationship Between Speakers' Physiological Structure and Acoustic Speech Signals: Data-Driven Study Based on Frequency-Wise Attentional Neural Network. EUSIPCO 2022: 379-383 - [c108]Kai Yang, Zhuo Zhang, Gaoyan Zhang, Masashi Unoki, Jianwu Dang, Longbiao Wang:
An Improved Stimulus Reconstruction Method for EEG-Based Short-Time Auditory Attention Detection. ICONIP (5) 2022: 267-277 - [c107]Tuan Vu Ho, Quoc Huy Nguyen, Masato Akagi, Masashi Unoki:
Vector-quantized Variational Autoencoder for Phase-aware Speech Enhancement. INTERSPEECH 2022: 176-180 - [c106]Nan Li, Meng Ge, Longbiao Wang, Masashi Unoki, Sheng Li, Jianwu Dang:
Global Signal-to-noise Ratio Estimation Based on Multi-subband Processing Using Convolutional Neural Network. INTERSPEECH 2022: 361-365 - [c105]Kai Li, Sheng Li, Xugang Lu, Masato Akagi, Meng Liu, Lin Zhang, Chang Zeng, Longbiao Wang, Jianwu Dang, Masashi Unoki:
Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection. INTERSPEECH 2022: 664-668 - [c104]Teruki Toya, Wenyu Zhu, Maori Kobayashi, Kenichi Nakamura, Masashi Unoki:
Method for improving the word intelligibility of presented speech using bone-conduction headphones. INTERSPEECH 2022: 759-763 - [c103]Huy Nguyen, Kai Li, Masashi Unoki:
Automatic Mean Opinion Score Estimation with Temporal Modulation Features on Gammatone Filterbank for Speech Assessment. INTERSPEECH 2022: 4526-4530 - [c102]Di Zhou, Masashi Unoki, Gaoyan Zhang, Jianwu Dang:
Reconstruction of speech spectrogram based on non-invasive EEG signal. ISCSLP 2022: 275-279 - [c101]Yuning Liu, Di Zhou, Masashi Unoki, Jianwu Dang, Aijun Li:
Dialogue scenario classification based on social factors. ISCSLP 2022: 379-383 - 2021
- [j33]Dung Kim Tran, Masashi Unoki:
Matching Pursuit and Sparse Coding for Auditory Representation. IEEE Access 9: 167084-167095 (2021) - [j32]Candy Olivia Mawalim, Masashi Unoki:
Speech Watermarking Method Using McAdams Coefficient Based on Random Forest Learning. Entropy 23(10): 1246 (2021) - [j31]Zhichao Peng, Jianwu Dang, Masashi Unoki, Masato Akagi:
Multi-resolution modulation-filtered cochleagram feature for LSTM-based dimensional emotion recognition from speech. Neural Networks 140: 261-273 (2021) - [j30]Weitao Yuan, Bofei Dong, Shengbei Wang, Masashi Unoki, Wenwu Wang:
Evolving Multi-Resolution Pooling CNN for Monaural Singing Voice Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 807-822 (2021) - [c100]Kai Li, Masashi Unoki, Yongwei Li, Jianwu Dang, Masato Akagi:
Study on Simultaneous Estimation of Glottal Source and Vocal Tract Parameters by ARMAX-LF Model for Speech Analysis/Synthesis. APSIPA ASC 2021: 36-43 - [c99]Shengbei Wang, Weitao Yuan, Zhen Zhang, Jianming Wang, Masashi Unoki:
Tampering Detection for Speech Signals Using Synchronization Code and LSF-based Watermarks. APSIPA ASC 2021: 1621-1626 - [c98]Candy Olivia Mawalim, Masashi Unoki:
Improving Security in McAdams Coefficient-Based Speaker Anonymization by Watermarking Method. APSIPA ASC 2021: 1627-1633 - [c97]Kasorn Galajit, Jessada Karnjana, Pakinee Aimmanee, Masashi Unoki:
Hybridization of speech information hiding and encryption for double-layer security in speech communication. APSIPA ASC 2021: 1634-1639 - [c96]Suradej Duangpummet, Jessada Karnjana, Waree Kongprawechnon, Masashi Unoki:
Blind Estimation of Room Acoustic Parameters and Speech Transmission Index using MTF-based CNNs. EUSIPCO 2021: 181-185 - [c95]Shengbei Wang, Weitao Yuan, Zhen Zhang, Jianming Wang, Masashi Unoki:
Synchronous Multi-Bit Audio Watermarking Based on Phase Shifting. ICASSP 2021: 2700-2704 - [c94]Nan Li, Longbiao Wang, Masashi Unoki, Sheng Li, Rui Wang, Meng Ge, Jianwu Dang:
Robust Voice Activity Detection Using a Masked Auditory Encoder Based Convolutional Neural Network. ICASSP 2021: 6828-6832 - [c93]Weitao Yuan, Shengbei Wang, Xiangrui Li, Masashi Unoki, Wenwu Wang:
Crossfire Conditional Generative Adversarial Networks for Singing Voice Extraction. Interspeech 2021: 3041-3045 - [c92]Taiyang Guo, Jianwu Dang, Gaoyan Zhang, Bin Zhao, Masashi Unoki:
Frequency-specific Brain Network Dynamics during Perceiving Real Words and Pseudowords. ISCSLP 2021: 1-5 - [i3]Suradej Duangpummet, Jessada Karnjana, Waree Kongprawechnon, Masashi Unoki:
Blind Estimation of Room Acoustic Parameters and Speech Transmission Index using MTF-based CNNs. CoRR abs/2103.07904 (2021) - [i2]Candy Olivia Mawalim, Masashi Unoki:
Improving Security in McAdams Coefficient-Based Speaker Anonymization by Watermarking Method. CoRR abs/2107.07223 (2021) - 2020
- [j29]Zhichao Peng, Xingfeng Li, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi:
Speech Emotion Recognition Using 3D Convolutions and Attention-Based Sliding Recurrent Networks With Auditory Front-Ends. IEEE Access 8: 16560-16572 (2020) - [j28]Reiya Namikawa, Masashi Unoki:
Non-Blind Speech Watermarking Method Based on Spread-Spectrum Using Linear Prediction Residue. IEICE Trans. Inf. Syst. 103-D(1): 63-66 (2020) - [j27]Shengbei Wang, Weitao Yuan, Masashi Unoki:
Multi-Subspace Echo Hiding Based on Time-Frequency Similarities of Audio Signals. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2349-2363 (2020) - [c91]Thuan Van Ngo, Tuan Vu Ho, Masashi Unoki, Rieko Kubo, Masato Akagi:
Enhancement of speech intelligibility under noisy reverberant conditions based on modulation spectrum concept. APSIPA 2020: 753-758 - [c90]Candy Olivia Mawalim, Shengbei Wang, Masashi Unoki:
Speech Information Hiding by Modification of LSF Quantization Index in CELP Codec. APSIPA 2020: 1321-1330 - [c89]Suradej Duangpummet, Phrimphissa Kraikhun, Chatrin Phunruangsakao, Jessada Karnjana, Masashi Unoki, Waree Kongprawechnon:
Speech Privacy Protection based on Optimal Controlling Estimated Speech Transmission Index in Noisy Reverberant Environments. EUSIPCO 2020: 76-80 - [c88]Bin Zhao, Jianwu Dang, Gaoyan Zhang, Masashi Unoki:
Cortical Oscillatory Hierarchy for Natural Sentence Processing. INTERSPEECH 2020: 125-129 - [c87]Candy Olivia Mawalim, Kasorn Galajit, Jessada Karnjana, Masashi Unoki:
X-Vector Singular Value Modification and Statistical-Based Decomposition with Ensemble Regression Modeling for Speaker Anonymization System. INTERSPEECH 2020: 1703-1707 - [i1]Weitao Yuan, Bofei Dong, Shengbei Wang, Masashi Unoki, Wenwu Wang:
Evolving Multi-Resolution Pooling CNN for Monaural Singing Voice Separation. CoRR abs/2008.00816 (2020)
2010 – 2019
- 2019
- [j26]Shengbei Wang, Weitao Yuan, Jianming Wang, Masashi Unoki:
Speech Watermarking Based on Source-filter Model of Speech Production. J. Inf. Hiding Multim. Signal Process. 10(4): 517-534 (2019) - [j25]Weitao Yuan, Boxin He, Shengbei Wang, Jianming Wang, Masashi Unoki:
Enhanced feature network for monaural singing voice separation. Speech Commun. 106: 1-6 (2019) - [j24]Shengbei Wang, Weitao Yuan, Jianming Wang, Masashi Unoki:
Detection of speech tampering using sparse representations and spectral manipulations based information hiding. Speech Commun. 112: 1-14 (2019) - [j23]Weitao Yuan, Shengbei Wang, Xiangrui Li, Masashi Unoki, Wenwu Wang:
A Skip Attention Mechanism for Monaural Singing Voice Separation. IEEE Signal Process. Lett. 26(10): 1481-1485 (2019) - [c86]Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi:
Dimensional Emotion Recognition from Speech Using Modulation Spectral Features and Recurrent Neural Networks. APSIPA 2019: 524-528 - [c85]Suradej Duangpummet, Jessada Karnjana, Waree Kongprawechnon, Masashi Unoki:
A Robust Method for Blindly Estimating Speech Transmission Index using Convolutional Neural Network with Temporal Amplitude Envelope. APSIPA 2019: 1208-1214 - [c84]Candy Olivia Mawalim, Shogo Okada, Yukiko I. Nakano, Masashi Unoki:
Multimodal BigFive Personality Trait Analysis Using Communication Skill Indices and Multiple Discussion Types Dataset. HCI (13) 2019: 370-383 - [c83]Weitao Yuan, Shengbei Wang, Xiangrui Li, Masashi Unoki, Wenwu Wang:
Proximal Deep Recurrent Neural Network for Monaural Singing Voice Separation. ICASSP 2019: 286-290 - [c82]Shengbei Wang, Weitao Yuan, Jianming Wang, Masashi Unoki:
Inaudible Speech Watermarking Based on Self-compensated Echo-hiding and Sparse Subspace Clustering. ICASSP 2019: 2632-2636 - [c81]Boxin He, Shengbei Wang, Weitao Yuan, Jianming Wang, Masashi Unoki:
Data Augmentation for Monaural Singing Voice Separation Based on Variational Autoencoder-Generative Adversarial Network. ICME 2019: 1354-1359 - [c80]Teruki Toya, Peter Birkholz, Masashi Unoki:
Estimates of Transmission Characteristics Related to Perception of Bone-Conducted Speech Using Real Utterances and Transcutaneous Vibration on Larynx. SPECOM 2019: 491-500 - 2018
- [c79]Shengbei Wang, Weitao Yuan, Jianming Wang, Masashi Unoki:
Speech Watermarking Based on Robust Principal Component Analysis and Formant Manipulations. ICASSP 2018: 2082-2086 - [c78]Nguyen Khanh Bui, Daisuke Morikawa, Masashi Unoki:
Method of Estimating Direction of Arrival of Sound Source for Monaural Hearing Based on Temporal Modulation Perception. ICASSP 2018: 5014-5018 - [c77]Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi:
Auditory-Inspired End-to-End Speech Emotion Recognition Using 3D Convolutional Recurrent Neural Networks Based on Spectral-Temporal Representation. ICME 2018: 1-6 - [c76]Kasorn Galajit, Jessada Karnjana, Pakinee Aimmanee, Masashi Unoki:
Digital Audio Watermarking Method Based on Singular Spectrum Analysis with Automatic Parameter Estimation Using a Convolutional Neural Network. IIH-MSP (2) 2018: 63-73 - [c75]Takuto Isoyama, Masashi Unoki:
Noise Suppression Method Based on Modulation Spectrum Analysis. SPECOM 2018: 234-244 - 2017
- [j22]Masashi Unoki, Akikazu Miyazaki, Shota Morita, Masato Akagi:
Method of Blindly Estimating Speech Transmission Index in Noisy Reverberant Environments. J. Inf. Hiding Multim. Signal Process. 8(6): 1430-1445 (2017) - [j21]Shota Morita, Xugang Lu, Masashi Unoki, Masato Akagi:
Method of Estimating Signal-to-Noise Ratio Based on Optimal Design for Sub-band Voice Activity Detection. J. Inf. Hiding Multim. Signal Process. 8(6): 1446-1459 (2017) - [c74]Jessada Karnjana, Kasorn Galajit, Pakinee Aimmanee, Chai Wutiwiwatchai, Masashi Unoki:
Speech watermarking scheme based on singular-spectrum analysis for tampering detection and identification. APSIPA 2017: 193-202 - [c73]Surasak Boonkla, Masashi Unoki, Chai Wutiwiwatchai, Stanislav S. Makhanov:
F0 estimation using empirical mode decomposition and complex cepstrum analysis in reverberant environments. APSIPA 2017: 980-986 - [c72]Masashi Unoki, Yuta Kashihara, Maori Kobayashi, Masato Akagi:
Study on method for protecting speech privacy by actively controlling speech transmission index in simulated room. APSIPA 2017: 1199-1204 - [c71]Zhichao Peng, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi:
Speech emotion recognition using multichannel parallel convolutional recurrent neural networks based on gammatone auditory filterbank. APSIPA 2017: 1750-1755 - [c70]Zhi Zhu, Ryota Miyauchi, Yukiko Araki, Masashi Unoki:
Feasibility of vocal emotion conversion on modulation spectrogram for simulated cochlear implants. EUSIPCO 2017: 1834-1838 - [c69]Dung Kim Tran, Masashi Unoki:
Study on Speech Representation Based on Spikegram for Speech Fingerprints. IIH-MSP (2) 2017: 153-160 - [c68]Kenichiro Miwa, Masashi Unoki:
Robust Method for Estimating F0 of Complex Tone Based on Pitch Perception of Amplitude Modulated Signal. INTERSPEECH 2017: 2311-2315 - 2016
- [j20]Nhut Minh Ngo, Masashi Unoki:
Method of Audio Watermarking Based on Adaptive Phase Modulation. IEICE Trans. Inf. Syst. 99-D(1): 92-101 (2016) - [j19]Yang Liu, Shota Morita, Masashi Unoki:
MTF-Based Kalman Filtering with Linear Prediction for Power Envelope Restoration in Noisy Reverberant Environments. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 99-A(2): 560-569 (2016) - [j18]Jessada Karnjana, Masashi Unoki, Pakinee Aimmanee, Chai Wutiwiwatchai:
Singular-Spectrum Analysis for Digital Audio Watermarking with Automatic Parameterization and Parameter Estimation. IEICE Trans. Inf. Syst. 99-D(8): 2109-2120 (2016) - [j17]Surasak Boonkla, Masashi Unoki, Stanislav S. Makhanov, Chai Wutiwiwatchai:
Speech Analysis Method Based on Source-Filter Model Using Multivariate Empirical Mode Decomposition. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 99-A(10): 1762-1773 (2016) - [j16]Jessada Karnjana, Masashi Unoki, Pakinee Aimmanee, Chai Wutiwiwatchai:
Audio Watermarking Scheme Based on Singular Spectrum Analysis and Psychoacoustic Model with Self-Synchronization. J. Electr. Comput. Eng. 2016: 5067313:1-5067313:15 (2016) - [j15]Yang Liu, Naushin Nower, Shota Morita, Masashi Unoki:
Speech enhancement of instantaneous amplitude and phase for applications in noisy reverberant environments. Speech Commun. 84: 1-14 (2016) - [j14]Shota Morita, Masashi Unoki, Xugang Lu, Masato Akagi:
Robust Voice Activity Detection Based on Concept of Modulation Transfer Function in Noisy Reverberant Environments. J. Signal Process. Syst. 82(2): 163-173 (2016) - [c67]Jessada Karnjana, Masashi Unoki, Pakinee Aimmanee, Chai Wutiwiwatchai:
SSA-based audio-information-hiding scheme with psychoacoustic model. APSIPA 2016: 1-10 - [c66]Jianwu Dang, Shengbei Wang, Masashi Unoki:
Investigations into vowel and consonant structures in articulatory and auditory spaces using Laplacian eigenmaps. ICASSP 2016: 5355-5359 - [c65]Zhi Zhu, Ryota Miyauchi, Yukiko Araki, Masashi Unoki:
Modulation Spectral Features for Predicting Vocal Emotion Recognition by Simulated Cochlear Implants. INTERSPEECH 2016: 262-266 - [c64]Yang Liu, Naushin Nower, Shota Morita, Masashi Unoki:
Robust front-end for speech recognition by human and machine in noisy reverberant environments: The effect of phase information. ISCSLP 2016: 1-5 - [c63]Surasak Boonkla, Masashi Unoki, Stanislav S. Makhanov:
Robust Speech Analysis Based on Source-Filter Model Using Multivariate Empirical Mode Decomposition in Noisy Environments. SPECOM 2016: 580-587 - 2015
- [j13]Shengbei Wang, Masashi Unoki:
Speech Watermarking Method Based on Formant Tuning. IEICE Trans. Inf. Syst. 98-D(1): 29-37 (2015) - [j12]Masashi Unoki, Ryota Miyauchi:
Robust, Blindly-Detectable, and Semi-Reversible Technique of Audio Watermarking Based on Cochlear Delay Characteristics. IEICE Trans. Inf. Syst. 98-D(1): 38-48 (2015) - [j11]Shengbei Wang, Ryota Miyauchi, Masashi Unoki, Nam Soo Kim:
Tampering Detection Scheme for Speech Signals using Formant Enhancement based Watermarking. J. Inf. Hiding Multim. Signal Process. 6(6): 1264-1283 (2015) - [j10]Naushin Nower, Yang Liu, Masashi Unoki:
Restoration scheme of instantaneous amplitude and phase using Kalman filter with efficient linear prediction for speech enhancement. Speech Commun. 70: 13-27 (2015) - [c62]Jessada Karnjana, Pakinee Aimmanee, Masashi Unoki, Chai Wutiwiwatchai:
An audio watermarking scheme based on automatic parameterized singular-spectrum analysis using differential evolution. APSIPA 2015: 543-551 - [c61]Yang Liu, Naushin Nower, Yonghong Yan, Masashi Unoki:
Restoration of instantaneous amplitude and phase of speech signal in noisy reverberant environments. EUSIPCO 2015: 879-883 - [c60]Nhut Minh Ngo, Brian Michael Kurkoski, Masashi Unoki:
Robust and reliable audio watermarking based on dynamic phase coding and error control coding. EUSIPCO 2015: 2276-2280 - [c59]Nhut Minh Ngo, Masashi Unoki:
Robust and reliable audio watermarking based on phase coding. ICASSP 2015: 345-349 - [c58]Erick Christian Garcia Alvarez, Shengbei Wang, Masashi Unoki:
An Automatic Watermarking in CELP Speech Codec Based on Formant Tuning. IIH-MSP 2015: 160-163 - [c57]Daisuke Morikawa, Masaru Ando, Masashi Unoki:
Feasibility of Estimating Direction of Arrival Based on Monaural Modulation Spectrum. IIH-MSP 2015: 384-387 - [c56]Shogo Masaya, Masashi Unoki:
Complex tensor factorization in modulation frequency domain for single-channel speech enhancement. INTERSPEECH 2015: 1765-1769 - 2014
- [j9]Nhut Minh Ngo, Masashi Unoki, Ryota Miyauchi, Yôiti Suzuki:
Data Hiding Scheme for Amplitude Modulation Radio Broadcasting Systems. J. Inf. Hiding Multim. Signal Process. 5(3): 324-341 (2014) - [c55]Shengbei Wang, Masashi Unoki:
Watermarking of speech signals based on formant enhancement. EUSIPCO 2014: 1257-1261 - [c54]Naushin Nower, Yang Liu, Masashi Unoki:
Restoration of instantaneous amplitude and phase using Kalman filter for speech enhancement. ICASSP 2014: 4633-4637 - [c53]Shengbei Wang, Masashi Unoki:
Hybrid Speech Watermarking Based on Formant Enhancement and Cochlear Delay. IIH-MSP 2014: 272-275 - [c52]Shengbei Wang, Masashi Unoki, Nam Soo Kim:
Formant enhancement based speech watermarking for tampering detection. INTERSPEECH 2014: 1366-1370 - [c51]Shota Morita, Masashi Unoki, Xugang Lu, Masato Akagi:
Robust voice activity detection based on concept of modulation transfer function in noisy reverberant environments. ISCSLP 2014: 108-112 - [c50]Surasak Boonkla, Masashi Unoki, Stanislav S. Makhanov, Chai Wutiwiwatchai:
Speech analysis method based on source-filter model using multivariate empirical mode decomposition in log-spectrum domain. ISCSLP 2014: 555-559 - [c49]Shota Morita, Xugang Lu, Masashi Unoki:
Signal to noise ratio estimation based on an optimal design of subband voice activity detection. ISCSLP 2014: 560-564 - [c48]Nhut Minh Ngo, Masashi Unoki:
Watermarking for Digital Audio Based on Adaptive Phase Modulation. IWDW 2014: 105-119 - [c47]Jessada Karnjana, Masashi Unoki, Pakinee Aimmanee, Chai Wutiwiwatchai:
An Audio Watermarking Scheme Based on Singular-Spectrum Analysis. IWDW 2014: 145-159 - [c46]Kazushi Nishimoto, Akari Ikenoue, Masashi Unoki:
iDAF-drum: Supporting Practice of Drumstick Control by Exploiting Insignificantly Delayed Auditory Feedback. KICSS 2014: 483-497 - 2013
- [j8]Xugang Lu, Masashi Unoki, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Controlling Tradeoff Between Approximation Accuracy and Complexity of a Smooth Function in a Reproducing Kernel Hilbert Space for Noise Reduction. IEEE Trans. Signal Process. 61(3): 601-610 (2013) - [c45]Masashi Unoki, Tomohiro Ikeda, Kyohei Sasaki, Ryota Miyauchi, Masato Akagi, Nam Soo Kim:
Blind method of estimating speech transmission index in room acoustics based on concept of modulation transfer function. ChinaSIP 2013: 308-312 - [c44]Shin Jae Kang, Chang Woo Han, Kang Hyun Lee, Nam Soo Kim, Masashi Unoki:
IMM-based feature compensation robust to slowly time-varying noise and reverberation. ChinaSIP 2013: 313-317 - [c43]Masashi Unoki, Kyohei Sasaki, Ryota Miyauchi, Masato Akagi, Nam Soo Kim:
Blind method of estimating speech transmission index from reverberant speech signals. EUSIPCO 2013: 1-5 - [c42]Kiho Cho, Soo Hyun Bae, In Kyu Choi, Nam Soo Kim, Masashi Unoki:
Robust Audio Data Hiding Method Based on Phase of Modulated Complex Lapped Transform. IIH-MSP 2013: 263-266 - [c41]Shengbei Wang, Masashi Unoki:
Watermarking Method for Speech Signals Based on Modifications to LSFs. IIH-MSP 2013: 283-286 - [c40]Kenichiro Miwa, Masashi Unoki:
Study on Method for Estimating F0 of Steady Complex Tone in Noisy Reverberant Environments. IIH-MSP 2013: 456-459 - [c39]Yasuaki Kanai, Shota Morita, Masashi Unoki:
Concurrent processing of voice activity detection and noise reduction using empirical mode decomposition and modulation spectrum analysis. INTERSPEECH 2013: 742-746 - [c38]Yang Liu, Masashi Unoki:
MTF based Kalman filtering with linear prediction for power envelope restoration. ISPACS 2013: 198-203 - 2012
- [c37]Nhut Minh Ngo, Masashi Unoki, Ryota Miyauchi, Yôiti Suzuki:
Data-hiding Scheme for Digital-audio in Amplitude Modulation Domain. IIH-MSP 2012: 114-117 - [c36]Masashi Unoki, Ryota Miyauchi:
Detection of Tampering in Speech Signals with Inaudible Watermarking Technique. IIH-MSP 2012: 118-121 - [c35]Xugang Lu, Masashi Unoki, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Controlling the tradeoff property in a regularization framework for noise reduction. ISCSLP 2012: 201-205 - [c34]Masashi Unoki, Xugang Lu:
Unified denoising and dereverberation method used in restoration of MTF-based power envelope. ISCSLP 2012: 215-219 - [c33]Yasuaki Kanai, Masashi Unoki:
Robust voice activity detection using empirical mode decomposition and modulation spectrum analysis. ISCSLP 2012: 400-404 - [c32]Masashi Unoki, Kazushi Nishimoto:
Improvements to Creativity in Singing Abilities Based on Perspective of Studies on Interaction between Speech Production and Auditory Perception. KICSS 2012: 157-160 - 2011
- [j7]Xugang Lu, Masashi Unoki, Satoshi Nakamura:
Sub-band temporal modulation envelopes and their normalization for automatic speech recognition in reverberant environments. Comput. Speech Lang. 25(3): 571-584 (2011) - [j6]Masashi Unoki, Kuniaki Imabeppu, Daiki Hamada, Atsushi Haniu, Ryota Miyauchi:
Embedding Limitations with Digital-audio Watermarking Method Based on Cochlear Delay Characteristics. J. Inf. Hiding Multim. Signal Process. 2(1): 1-23 (2011) - [j5]Xugang Lu, Shigeki Matsuda, Masashi Unoki, Satoshi Nakamura:
Temporal modulation normalization for robust speech feature extraction and recognition. Multim. Tools Appl. 52(1): 187-199 (2011) - [c31]Masashi Unoki, Ryota Miyauchi:
Reversible Watermarking for Digital Audio Based on Cochlear Delay Characteristics. IIH-MSP 2011: 314-317 - [c30]Masashi Unoki, Xugang Lu, Rico Petrick, Shota Morita, Masato Akagi, Rüdiger Hoffmann:
Voice Activity Detection in MTF-Based Power Envelope Restoration. INTERSPEECH 2011: 2609-2612 - [c29]Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Adaptive Regularization Framework for Robust Voice Activity Detection. INTERSPEECH 2011: 2653-2656 - 2010
- [j4]Xugang Lu, Shigeki Matsuda, Masashi Unoki, Satoshi Nakamura:
Temporal contrast normalization and edge-preserved smoothing of temporal modulation structures of speech for robust speech recognition. Speech Commun. 52(1): 1-11 (2010) - [c28]Masashi Unoki, Toshizo Kosugi, Atsushi Haniu, Ryota Miyauchi:
Design of IIR All-Pass Filter Based on Cochlear Delay to Reduce Embedding Limitations. IIH-MSP 2010: 526-529 - [c27]Rico Petrick, Thomas Fehér, Masashi Unoki, Rüdiger Hoffmann:
Methods for robust speech recognition in reverberant environments: a comparison. INTERSPEECH 2010: 582-585 - [c26]Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Voice activity detection in a reguarized reproducing kernel hilbert space. INTERSPEECH 2010: 3086-3089 - [c25]Xugang Lu, Masashi Unoki, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Speech enhancement as a functional approximation and generalization. ISCSLP 2010: 18-22
2000 – 2009
- 2009
- [j3]Dongwen Ying, Masashi Unoki, Xugang Lu, Jianwu Dang:
Speech Enhancement Based on Noise Eigenspace Projection. IEICE Trans. Inf. Syst. 92-D(5): 1137-1145 (2009) - [c24]Masashi Unoki, Yutaka Yamasaki, Masato Akagi:
MTF-based power envelope restoration in noisy reverberant environments. EUSIPCO 2009: 228-232 - [c23]Xugang Lu, Shigeki Matsuda, Masashi Unoki, Tohru Shimizu, Satoshi Nakamura:
Temporal contrast normalization and edge-preserved smoothing on temporal modulation structure for robust speech recognition. ICASSP 2009: 4573-4576 - [c22]Kuniaki Imabeppu, Daiki Hamada, Masashi Unoki:
Embedding Limitations with Audio-watermarking Method Based on Cochlear-delay Characteristics. IIH-MSP 2009: 82-85 - [c21]Xugang Lu, Masashi Unoki, Satoshi Nakamura:
Subband temporal modulation spectrum normalization for automatic speech recognition in reverberant environments. INTERSPEECH 2009: 2503-2506 - [c20]Xugang Lu, Masashi Unoki, Satoshi Nakamura:
Normalization on the modulation spectrum of the subband temporal envelopes for automatic speech recognition in reverberant environments. IUCS 2009: 247-254 - 2008
- [c19]Masashi Unoki, Sota Hiramatsu:
MTF-based method of blind estimation of reverberation time in room acoustics. EUSIPCO 2008: 1-5 - [c18]Masashi Unoki, Toshihiro Hosorogiya, Yuichi Ishimoto:
Comparative evaluations of robust and accurate F0 estimates in reverberant environments. ICASSP 2008: 4569-4572 - [c17]Masashi Unoki, Daiki Hamada:
Audio Watermarking Method Based on the Cochlear Delay Characteristics. IIH-MSP 2008: 616-619 - [c16]Rico Petrick, Masashi Unoki, Anish Mittal, Carlos Segura, Rüdiger Hoffmann:
A comprehensive study on the effects of room reverberation on fundamental frequency estimation. INTERSPEECH 2008: 131-134 - [c15]Rico Petrick, Xugang Lu, Masashi Unoki, Masato Akagi, Rüdiger Hoffmann:
Robust front end processing for speech recognition in reverberant environments: utilization of speech characteristics. INTERSPEECH 2008: 658-661 - 2007
- [c14]Thang Tat Vu, Germine Seide, Masashi Unoki, Masato Akagi:
Method of LP-based blind restoration for improving intelligibility of bone-conducted speech. INTERSPEECH 2007: 966-969 - [c13]Takeshi Saitou, Masataka Goto, Masashi Unoki, Masato Akagi:
Vocal conversion from speaking voice to singing voice using STRAIGHT. INTERSPEECH 2007: 4005-4006 - 2006
- [c12]Xugang Lu, Masashi Unoki, Masato Akagi:
A robust feature extraction based on the MTF concept for speech recognition in reverberant environment. INTERSPEECH 2006 - 2005
- [j2]Takeshi Saitou, Masashi Unoki, Masato Akagi:
Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis. Speech Commun. 46(3-4): 405-417 (2005) - [c11]Masashi Unoki, Masaaki Kubo, Atsushi Haniu, Masato Akagi:
A model for selective segregation of a target instrument sound from the mixed sound of various instruments. INTERSPEECH 2005: 2097-2100 - 2004
- [c10]Masashi Unoki, Masato Toi, Masato Akagi:
A speech dereverberation method based on the MTF concept using adaptive time-frequency divisions. EUSIPCO 2004: 1689-1692 - [c9]Takeshi Saitou, Naoya Tsuji, Masashi Unoki, Masato Akagi:
Analysis of acoustic features affecting "singing-ness" and its application to singing-voice synthesis from speaking-voice. INTERSPEECH 2004: 1925-1928 - 2003
- [c8]Masashi Unoki, Masashi Furukawa, Keigo Sakata, Masato Akagi:
A method based on the MTF concept for dereverberating the power envelope from the reverberant signal. ICASSP (1) 2003: 888-891 - [c7]Masashi Unoki, Masaaki Kubo, Masato Akagi:
A model for selective segregation of a target instrument sound from the mixed sound of various instruments. ICMC 2003 - [c6]Masashi Unoki, Keigo Sakata, Masato Akagi:
A speech dereverberation method based on the MTF concept. INTERSPEECH 2003: 1417-1420 - 2001
- [c5]Yuichi Ishimoto, Masashi Unoki, Masato Akagi:
A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency. INTERSPEECH 2001: 2439-2442
1990 – 1999
- 1999
- [j1]Masashi Unoki, Masato Akagi:
A method of signal extraction from noisy signal based on auditory scene analysis. Speech Commun. 27(3-4): 261-279 (1999) - [c4]Masashi Unoki, Masato Akagi:
Segregation of vowel in background noise using the model of segregating two acoustic sources based on auditory scene analysis. EUROSPEECH 1999: 2575-2578 - 1998
- [c3]Toshio Irino, Masashi Unoki:
A time-varying, analysis/synthesis auditory filterbank using the gammachirp. ICASSP 1998: 3653-3656 - [c2]Masashi Unoki, Masato Akagi:
Signal extraction from noisy signal based on auditory scene analysis. ICSLP 1998 - 1997
- [c1]Masashi Unoki, Masato Akagi:
A method of signal extraction from noisy signal. EUROSPEECH 1997: 2587-2590
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-02 22:34 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint