Spatial reconstructed local attention Res2Net with F0 subband for fake speech detection
References
Recommendations
Analysis and modeling of F0 contours for cantonese text-to-speech
For the generation of highly natural synthetic speech, the control of prosody is of primary importance. The fundamental frequency (F0) is one of the most important components of speech prosody. This research investigates the variation of F0 in ...
Subband fusion of complex spectrogram for fake speech detection
AbstractThe phase information was shown useful in fake speech detection. However, the most common reason why phase-based features are not widely used is phase wrapping. This makes the original phase hard to model directly. Therefore, it remains a ...
Highlights- A subband fusion of complex spectrogram is proposed for fake speech detection.
- We model different subbands of complex spectrogram respectively and fuse finally.
- Experimental results show that our proposed method is very effective.
Supervised and unsupervised separation of convolutive speech mixtures using f0 and formant frequencies
In this paper we discuss the role of fundamental frequency f0 and formants F1, F2 and F3 of the speech signal in supervised and unsupervised source separation of real recorded convolutive speech mixtures. Initially supervised source separation is ...
Comments
Information & Contributors
Information
Published In
Publisher
Elsevier Science Ltd.
United Kingdom
Publication History
Author Tags
Qualifiers
- Research-article
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 0Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0