Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- surveyNovember 2024
Backdoor Attacks against Voice Recognition Systems: A Survey
ACM Computing Surveys (CSUR), Volume 57, Issue 3Article No.: 78, Pages 1–35https://doi.org/10.1145/3701985Voice Recognition Systems (VRSs) employ deep learning for speech recognition and speaker recognition. They have been widely deployed in various real-world applications, from intelligent voice assistance to telephony surveillance and biometric ...
- research-articleMarch 2024
Rethinking multi‐spatial information for transferable adversarial attacks on speaker recognition systems
CAAI Transactions on Intelligence Technology (CIT2), Volume 9, Issue 3Pages 620–631https://doi.org/10.1049/cit2.12295AbstractAdversarial attacks have been posing significant security concerns to intelligent systems, such as speaker recognition systems (SRSs). Most attacks assume the neural networks in the systems are known beforehand, while black‐box attacks are ...
- ArticleNovember 2023
Improving Speaker Recognition by Time-Frequency Domain Feature Enhanced Method
PRICAI 2023: Trends in Artificial IntelligencePages 362–367https://doi.org/10.1007/978-981-99-7022-3_33AbstractMany existing speaker recognition algorithms have the problem that single-domain feature extraction cannot represent the speech characteristics well, and this problem will affect the accuracy of speaker recognition. To solve this problem, we ...
- research-articleNovember 2023
Spoofing Detection for Personal Voice Assistants
SensorsS&P: Proceedings of the First International Workshop on Security and Privacy of Sensing SystemsPages 1–7https://doi.org/10.1145/3628356.3630114Personal Voice Assistants (PVAs) are common acoustic sensing systems that are used as a speech-based controller for critical systems making them vulnerable to speech spoofing attacks. Prior research has focused on the discrimination of genuine and ...
- ArticleJune 2023
Enhancing Transferability of Adversarial Audio in Speaker Recognition Systems
AbstractAlthough deep neural networks have demonstrated state-of-the-art performance in several tasks such as speaker recognition among others, they are highly vulnerable to adversarial attacks. These attacks involve the transformation of the original ...
-
- research-articleJanuary 2023
Push the Limit of Adversarial Example Attack on Speaker Recognition in Physical Domain
SenSys '22: Proceedings of the 20th ACM Conference on Embedded Networked Sensor SystemsPages 710–724https://doi.org/10.1145/3560905.3568518The integration of deep learning on Speaker Recognition (SR) advances its development and wide deployment, but also introduces the emerging threat of adversarial examples. However, only a few existing studies investigate its practical threat in physical ...
- research-articleJune 2022
Bias in Automated Speaker Recognition
FAccT '22: Proceedings of the 2022 ACM Conference on Fairness, Accountability, and TransparencyPages 230–247https://doi.org/10.1145/3531146.3533089Automated speaker recognition uses data processing to identify speakers by their voice. Today, automated speaker recognition is deployed on billions of smart devices and in services such as call centres. Despite their wide-scale deployment and known ...
- research-articleJanuary 2022
Research on x-vector speaker recognition algorithm based on Kaldi
International Journal of Computing Science and Mathematics (IJCSM), Volume 15, Issue 3Pages 199–212https://doi.org/10.1504/ijcsm.2022.124725This paper presents a convolutional neural network with an attention mechanism for analysing the spectrogram in an x-vector based speaker recognition system. First, the convolutional neural network (CNN) is used to extract the features of the ...
- research-articleJanuary 2022
Configuring artificial neural network using optimisation techniques for speaker voice recognition
International Journal of Bioinformatics Research and Applications (IJBRA), Volume 18, Issue 1-2Pages 101–112https://doi.org/10.1504/ijbra.2022.121765Speaker recognition is proposed in this work using artificial neural network (ANN) and optimisation technique which finds wide variety of applications. Mel-frequency cepstral coefficient (MFCC) and linear prediction-filter coefficients (LPC) coefficients ...
- research-articleNovember 2021
Black-box Adversarial Attacks on Commercial Speech Platforms with Minimal Information
CCS '21: Proceedings of the 2021 ACM SIGSAC Conference on Computer and Communications SecurityPages 86–107https://doi.org/10.1145/3460120.3485383Adversarial attacks against commercial black-box speech platforms, including cloud speech APIs and voice control devices, have received little attention until recent years. Constructing such attacks is difficult mainly due to the unique characteristics ...
- research-articleNovember 2020
Deconstructing Human-assisted Video Transcription and Annotation for Legislative Proceedings
Digital Government: Research and Practice (DGOV), Volume 1, Issue 3Article No.: 19, Pages 1–24https://doi.org/10.1145/3395316Legislative proceedings present a rich source of multidimensional information that is crucial to citizens and journalists in a democratic system. At present, no fully automated solution exists that is capable of capturing all the necessary information ...
- research-articleMarch 2020
Practical Adversarial Attacks Against Speaker Recognition Systems
HotMobile '20: Proceedings of the 21st International Workshop on Mobile Computing Systems and ApplicationsPages 9–14https://doi.org/10.1145/3376897.3377856Unlike other biometric-based user identification methods (e.g., fingerprint and iris), speaker recognition systems can identify individuals relying on their unique voice biometrics without requiring users to be physically present. Therefore, speaker ...
- research-articleJanuary 2020
Multi-Task Learning Based End-to-End Speaker Recognition
SPML '19: Proceedings of the 2019 2nd International Conference on Signal Processing and Machine LearningPages 56–61https://doi.org/10.1145/3372806.3372818Recently, there has been an increasing interest in end-to-end speaker recognition that directly take raw speech waveform as input without any hand-crafted features such as FBANK and MFCC. SincNet is a recently developed novel convolutional neural ...
- articleJanuary 2019
Joint PLDA for simultaneous modeling of two factors
Probabilistic linear discriminant analysis (PLDA) is a method used for biometric problems like speaker or face recognition that models the variability of the samples using two latent variables, one that depends on the class of the sample and another one ...
- articleJanuary 2019
Multisource Speech Analysis for Speaker Recognition
Pattern Recognition and Image Analysis (SPPRIA), Volume 29, Issue 1Pages 181–193https://doi.org/10.1134/S1054661818040260On a comprehensive speech database, speaker recognition characteristics are compared under the usage of various voice-source models. Inverse problems to find a source via vowel speech segments are solved on the base of a special speech-production model ...
- research-articleNovember 2018
ProMETheus: An Intelligent Mobile Voice Meeting Minutes System
MobiQuitous '18: Proceedings of the 15th EAI International Conference on Mobile and Ubiquitous Systems: Computing, Networking and ServicesPages 392–401https://doi.org/10.1145/3286978.3286995In this paper, we focus on designing and developing ProMETheus, an intelligent system for meeting minutes generated from audio data. The first task in ProMETheus is to recognize the speakers from noisy audio data. Speaker recognition algorithm is used ...
- research-articleJune 2018
Ultralow power acoustic feature-scoring using gaussian I-V transistors
DAC '18: Proceedings of the 55th Annual Design Automation ConferenceArticle No.: 77, Pages 1–6https://doi.org/10.1145/3195970.3196133This paper discusses energy-efficient acoustic feature-scoring using transistors with Gaussian-shaped Ids-Vgs. Acoustic feature-scoring is a critical step in speech recognition tasks such as speaker recognition. Suited to the transistor, we discuss a ...
- posterMay 2018
Gaining efficiency in human assisted transcription and speech annotation in legislative proceedings
dg.o '18: Proceedings of the 19th Annual International Conference on Digital Government Research: Governance in the Data AgeArticle No.: 117, Pages 1–2https://doi.org/10.1145/3209281.3209410We present a study using the Digital Democracy transcription tool. Human transcribers work to up-level and annotate California state legislative proceedings using the tool. Four phases of UI and functionality improvements are introduced and for each ...
- surveyMay 2018
Deep Learning for Biometrics: A Survey
ACM Computing Surveys (CSUR), Volume 51, Issue 3Article No.: 65, Pages 1–34https://doi.org/10.1145/3190618In the recent past, deep learning methods have demonstrated remarkable success for supervised learning tasks in multiple domains including computer vision, natural language processing, and speech processing. In this article, we investigate the impact of ...
- research-articleApril 2017
Hardware Architectures for Embedded Speaker Recognition Applications: A Survey
ACM Transactions on Embedded Computing Systems (TECS), Volume 16, Issue 3Article No.: 78, Pages 1–28https://doi.org/10.1145/2975161Authentication technologies based on biometrics, such as speaker recognition, are attracting more and more interest thanks to the elevated level of security offered by these technologies. Despite offering many advantages, such as remote use and low ...