Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation

Published: 01 December 2019

Abstract

The huge amount of information stored in audio and video repositories makes search on speech (SoS) a priority research area nowadays. Within SoS, Query-by-Example Spoken Term Detection (QbE STD) aims to retrieve data from a speech repository given a spoken query. Research in this area is continuously fostered by the organization of QbE STD evaluations. This paper presents a multi-domain, internationally open evaluation for QbE STD in Spanish. The evaluation aims at retrieving the speech files that contain the queries, providing their start and end times along with a score that reflects the confidence given to the detection. Three Spanish speech databases covering different domains have been employed in the evaluation: the MAVIR database, which comprises a set of talks from workshops; the RTVE database, which includes broadcast television (TV) shows; and the COREMAH database, which contains spontaneous conversations between two speakers about different topics. The evaluation has been designed carefully so that several analyses of the main results can be carried out. We present the evaluation itself, the three databases, the evaluation metrics, the systems submitted to the evaluation, the results, and detailed post-evaluation analyses based on several query properties (within-vocabulary/out-of-vocabulary queries, single-word/multi-word queries, and native/foreign queries). Fusion results of the primary systems submitted to the evaluation are also presented. Three different teams took part in the evaluation, and ten different systems were submitted. The results suggest that the QbE STD task remains challenging and that system performance is highly sensitive to changes in the data domain. Nevertheless, QbE STD strategies are able to outperform text-based STD in unseen data domains.
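
To make the task concrete, the sketch below shows a minimal subsequence dynamic time warping (DTW) search, a common template-matching approach in the QbE STD literature: given frame-level features for a spoken query and for an utterance, it returns the best-matching region together with its start and end times and a confidence score. This is an illustrative assumption rather than any system submitted to the evaluation; the cosine local distance, the 10 ms frame shift, and the length-normalised score are placeholder choices.

```python
# Minimal sketch of subsequence DTW for query-by-example spoken term detection.
# Illustrative only: feature extraction, distance, and scoring are placeholder
# assumptions, not the configuration of any system submitted to the evaluation.
import numpy as np

FRAME_SHIFT_S = 0.01  # assumed 10 ms frame shift


def cosine_distance(query_feats, utt_feats):
    """Pairwise cosine distance between query frames (m x d) and utterance frames (n x d)."""
    q = query_feats / (np.linalg.norm(query_feats, axis=1, keepdims=True) + 1e-8)
    u = utt_feats / (np.linalg.norm(utt_feats, axis=1, keepdims=True) + 1e-8)
    return 1.0 - q @ u.T  # shape (m, n)


def subsequence_dtw(query_feats, utt_feats):
    """Find the best match of the whole query inside the utterance.

    The query axis must be traversed completely, while the path may start and
    end anywhere on the utterance axis (free start and end).
    Returns (score, start_seconds, end_seconds); higher score means more confident.
    """
    dist = cosine_distance(query_feats, utt_feats)
    m, n = dist.shape
    acc = np.full((m, n), np.inf)          # accumulated path cost
    start = np.zeros((m, n), dtype=int)    # utterance frame where each path began

    acc[0, :] = dist[0, :]                 # free start on the utterance axis
    start[0, :] = np.arange(n)
    for i in range(1, m):
        for j in range(n):
            steps = [(acc[i - 1, j], start[i - 1, j])]                  # vertical step
            if j > 0:
                steps.append((acc[i - 1, j - 1], start[i - 1, j - 1]))  # diagonal step
                steps.append((acc[i, j - 1], start[i, j - 1]))          # horizontal step
            best_cost, best_start = min(steps, key=lambda s: s[0])
            acc[i, j] = dist[i, j] + best_cost
            start[i, j] = best_start

    end_frame = int(np.argmin(acc[-1, :]))     # free end on the utterance axis
    score = -acc[-1, end_frame] / m            # length-normalised cost, negated
    start_s = start[-1, end_frame] * FRAME_SHIFT_S
    end_s = (end_frame + 1) * FRAME_SHIFT_S
    return score, start_s, end_s


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    utterance = rng.normal(size=(500, 39))    # e.g., 5 s of MFCC-like features
    query = utterance[200:260]                # a query that occurs inside the utterance
    print(subsequence_dtw(query, utterance))  # detection roughly between 2.0 s and 2.6 s
```

A complete QbE STD system would run such a search (or a more refined detector) over every file in the repository, keep the highest-scoring regions per query, and report, for each detection, the speech file, the start and end times, and a calibrated confidence score, which is the output the evaluation asks participants to provide.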

Published In

EURASIP Journal on Audio, Speech, and Music Processing, Volume 2019, Issue 1
December 2019, 399 pages
ISSN: 1687-4714
EISSN: 1687-4722

Publisher

Hindawi Limited, London, United Kingdom

Author Tags

  1. International evaluation
  2. Query-by-Example Spoken Term Detection
  3. Search on speech
  4. Spanish language
