default search action
Mireia Díez
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j13]Federico Landini, Mireia Díez, Themos Stafylakis, Lukás Burget:
DiaPer: End-to-End Neural Diarization With Perceiver-Based Attractors. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3450-3465 (2024) - [c45]Jiangyu Han, Federico Landini, Johan Rohdin, Mireia Díez, Lukás Burget, Yuhang Cao, Heng Lu, Jan Cernocký:
Diacorrect: Error Correction Back-End for Speaker Diarization. ICASSP 2024: 11181-11185 - [c44]Dominik Klement, Mireia Díez, Federico Landini, Lukás Burget, Anna Silnova, Marc Delcroix, Naohiro Tawara:
Discriminative Training of VBx Diarization. ICASSP 2024: 11871-11875 - [c43]Lin Zhang, Themos Stafylakis, Federico Landini, Mireia Díez, Anna Silnova, Lukás Burget:
Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information? Odyssey 2024: 123-130 - [i13]Lin Zhang, Themos Stafylakis, Federico Landini, Mireia Díez, Anna Silnova, Lukás Burget:
Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information? CoRR abs/2402.19325 (2024) - [i12]Lin Zhang, Xin Wang, Erica Cooper, Mireia Díez, Federico Landini, Nicholas W. D. Evans, Junichi Yamagishi:
Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio. CoRR abs/2406.07816 (2024) - [i11]Jiangyu Han, Federico Landini, Johan Rohdin, Anna Silnova, Mireia Díez, Lukás Burget:
Leveraging Self-Supervised Learning for Speaker Diarization. CoRR abs/2409.09408 (2024) - 2023
- [c42]Federico Landini, Mireia Díez, Alicia Lozano-Diez, Lukás Burget:
Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization. ICASSP 2023: 1-5 - [c41]Marc Delcroix, Naohiro Tawara, Mireia Díez, Federico Landini, Anna Silnova, Atsunori Ogawa, Tomohiro Nakatani, Lukás Burget, Shoko Araki:
Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization. INTERSPEECH 2023: 3477-3481 - [i10]Marc Delcroix, Naohiro Tawara, Mireia Díez, Federico Landini, Anna Silnova, Atsunori Ogawa, Tomohiro Nakatani, Lukás Burget, Shoko Araki:
Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization. CoRR abs/2305.13580 (2023) - [i9]Jiangyu Han, Federico Landini, Johan Rohdin, Mireia Díez, Lukás Burget, Yuhang Cao, Heng Lu, Jan Cernocký:
DiaCorrect: Error Correction Back-end For Speaker Diarization. CoRR abs/2309.08377 (2023) - [i8]Dominik Klement, Mireia Díez, Federico Landini, Lukás Burget, Anna Silnova, Marc Delcroix, Naohiro Tawara:
Discriminative Training of VBx Diarization. CoRR abs/2310.02732 (2023) - [i7]Federico Landini, Mireia Díez, Themos Stafylakis, Lukás Burget:
DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors. CoRR abs/2312.04324 (2023) - 2022
- [j12]Federico Landini, Ján Profant, Mireia Díez, Lukás Burget:
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks. Comput. Speech Lang. 71: 101254 (2022) - [c40]Martin Kocour, Jahnavi Umesh, Martin Karafiát, Jan Svec, Fernando López, Jordi Luque, Karel Benes, Mireia Díez, Igor Szöke, Karel Veselý, Lukás Burget, Jan Cernocký:
BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge. IberSPEECH 2022: 276-280 - [c39]Murali Karthick Baskar, Tim Herzig, Diana Nguyen, Mireia Díez, Tim Polzehl, Lukás Burget, Jan Cernocký:
Speaker adaptation for Wav2vec2 based dysarthric ASR. INTERSPEECH 2022: 3403-3407 - [c38]Federico Landini, Alicia Lozano-Diez, Mireia Díez, Lukás Burget:
From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization. INTERSPEECH 2022: 5095-5099 - [i6]Murali Karthick Baskar, Tim Herzig, Diana Nguyen, Mireia Díez, Tim Polzehl, Lukás Burget, Jan Honza Cernocký:
Speaker adaptation for Wav2vec2 based dysarthric ASR. CoRR abs/2204.00770 (2022) - [i5]Federico Landini, Alicia Lozano-Diez, Mireia Díez, Lukás Burget:
From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization. CoRR abs/2204.00890 (2022) - [i4]Federico Landini, Mireia Díez, Alicia Lozano-Diez, Lukás Burget:
Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization. CoRR abs/2211.06750 (2022) - 2021
- [c37]Federico Landini, Ondrej Glembek, Pavel Matejka, Johan Rohdin, Lukás Burget, Mireia Díez, Anna Silnova:
Analysis of the but Diarization System for Voxconverse Challenge. ICASSP 2021: 5819-5823 - 2020
- [j11]Johan Rohdin, Anna Silnova, Mireia Díez, Oldrich Plchot, Pavel Matejka, Lukás Burget, Ondrej Glembek:
End-to-end DNN based text-independent speaker recognition for long and short utterances. Comput. Speech Lang. 59: 22-35 (2020) - [j10]Pavel Matejka, Oldrich Plchot, Ondrej Glembek, Lukás Burget, Johan Rohdin, Hossein Zeinali, Ladislav Mosner, Anna Silnova, Ondrej Novotný, Mireia Díez, Jan Honza Cernocký:
13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE. Comput. Speech Lang. 63: 101035 (2020) - [j9]Mireia Díez, Lukás Burget, Federico Landini, Jan Cernocký:
Analysis of Speaker Diarization Based on Bayesian HMM With Eigenvoice Priors. IEEE ACM Trans. Audio Speech Lang. Process. 28: 355-368 (2020) - [c36]Mireia Díez, Lukás Burget, Federico Landini, Shuai Wang, Honza Cernocký:
Optimizing Bayesian Hmm Based X-Vector Clustering for the Second Dihard Speech Diarization Challenge. ICASSP 2020: 6519-6523 - [c35]Federico Landini, Shuai Wang, Mireia Díez, Lukás Burget, Pavel Matejka, Katerina Zmolíková, Ladislav Mosner, Anna Silnova, Oldrich Plchot, Ondrej Novotný, Hossein Zeinali, Johan Rohdin:
But System for the Second Dihard Speech Diarization Challenge. ICASSP 2020: 6529-6533 - [c34]Jahangir Alam, Gilles Boulianne, Lukás Burget, Mohamed Dahmane, Mireia Díez Sánchez, Alicia Lozano-Diez, Ondrej Glembek, Pierre-Luc St-Charles, Marc Lalonde, Pavel Matejka, Petr Mizera, João Monteiro, Ladislav Mosner, Cedric Noiseux, Ondrej Novotný, Oldrich Plchot, Johan Rohdin, Anna Silnova, Josef Slavícek, Themos Stafylakis, Shuai Wang, Hossein Zeinali:
Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. Odyssey 2020: 289-295 - [i3]Federico Landini, Ondrej Glembek, Pavel Matejka, Johan Rohdin, Lukás Burget, Mireia Díez, Anna Silnova:
Analysis of the BUT Diarization System for VoxConverse Challenge. CoRR abs/2010.11718 (2020) - [i2]Federico Landini, Ján Profant, Mireia Díez, Lukás Burget:
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: theory, implementation and analysis on standard tasks. CoRR abs/2012.14952 (2020)
2010 – 2019
- 2019
- [c33]Mireia Díez, Lukás Burget, Shuai Wang, Johan Rohdin, Jan Cernocký:
Bayesian HMM Based x-Vector Clustering for Speaker Diarization. INTERSPEECH 2019: 346-350 - 2018
- [c32]Johan Rohdin, Anna Silnova, Mireia Díez, Oldrich Plchot, Pavel Matejka, Lukás Burget:
End-to-End DNN Based Speaker Recognition Inspired by I-Vector and PLDA. ICASSP 2018: 4874-4878 - [c31]Mireia Díez, Federico Landini, Lukás Burget, Johan Rohdin, Anna Silnova, Katerina Zmolíková, Ondrej Novotný, Karel Veselý, Ondrej Glembek, Oldrich Plchot, Ladislav Mosner, Pavel Matejka:
BUT System for DIHARD Speech Diarization Challenge 2018. INTERSPEECH 2018: 2798-2802 - [c30]Oldrich Plchot, Pavel Matejka, Ondrej Novotný, Sandro Cumani, Alicia Lozano-Diez, Josef Slavícek, Mireia Díez, Frantisek Grézl, Ondrej Glembek, Mounika Kamsali, Anna Silnova, Lukás Burget, Lucas Ondel, Santosh Kesiraju, Johan Rohdin:
Analysis of BUT-PT Submission for NIST LRE 2017. Odyssey 2018: 47-53 - [c29]Mireia Díez, Lukás Burget, Pavel Matejka:
Speaker Diarization based on Bayesian HMM with Eigenvoice Priors. Odyssey 2018: 147-154 - 2017
- [c28]Karel Veselý, Murali Karthick Baskar, Mireia Díez, Karel Benes:
MGB-3 but system: Low-resource ASR on Egyptian YouTube data. ASRU 2017: 368-373 - [c27]Oldrich Plchot, Pavel Matejka, Anna Silnova, Ondrej Novotný, Mireia Díez Sánchez, Johan Rohdin, Ondrej Glembek, Niko Brümmer, Albert Swart, Jesús Jorrín-Prieto, Paola García, Luis Buera, Patrick Kenny, Md. Jahangir Alam, Gautam Bhattacharya:
Analysis and Description of ABC Submission to NIST SRE 2016. INTERSPEECH 2017: 1348-1352 - [c26]Pavel Matejka, Ondrej Novotný, Oldrich Plchot, Lukás Burget, Mireia Díez Sánchez, Jan Cernocký:
Analysis of Score Normalization in Multilingual Speaker Recognition. INTERSPEECH 2017: 1567-1571 - [i1]Johan Rohdin, Anna Silnova, Mireia Díez, Oldrich Plchot, Pavel Matejka, Lukás Burget:
End-to-end DNN Based Speaker Recognition Inspired by i-vector and PLDA. CoRR abs/1710.02369 (2017) - 2016
- [j8]Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel:
KALAKA-3: a database for the assessment of spoken language recognition technology on YouTube audios. Lang. Resour. Evaluation 50(2): 221-243 (2016) - 2014
- [j7]Mireia Díez, Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel:
On the Complementarity of Phone Posterior Probabilities for Improved Speaker Recognition. IEEE Signal Process. Lett. 21(6): 649-652 (2014) - [j6]Mireia Díez, Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel:
On the Projection of PLLRs for Unbounded Feature Distributions in Spoken Language Recognition. IEEE Signal Process. Lett. 21(9): 1073-1077 (2014) - [c25]Luis Javier Rodríguez-Fuentes, Amparo Varona, Mikel Peñagarikano, Germán Bordel, Mireia Díez:
High-performance Query-by-Example Spoken Term Detection on the SWS 2013 evaluation. ICASSP 2014: 7819-7823 - [c24]Mireia Díez, Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel:
Optimizing PLLR Features for Spoken Language Recognition. ICPR 2014: 779-784 - [c23]Mireia Díez, Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel:
New insight into the use of phone log-likelihood ratios as features for language recognition. INTERSPEECH 2014: 1841-1845 - [c22]Mireia Díez, Mikel Peñagarikano, Germán Bordel, Amparo Varona, Luis Javier Rodríguez-Fuentes:
On the complementarity of short-time fourier analysis windows of different lengths for improved language recognition. INTERSPEECH 2014: 3032-3036 - [c21]Oldrich Plchot, Mireia Díez, Mehdi Soufifar, Lukás Burget:
PLLR features in language recognition system for RATS. INTERSPEECH 2014: 3047-3051 - [c20]Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel:
KALAKA-3: a database for the recognition of spoken European languages on YouTube audios. LREC 2014: 443-449 - [c19]Luis Javier Rodríguez-Fuentes, Amparo Varona, Mikel Peñagarikano, Germán Bordel, Mireia Díez:
GTTS-EHU Systems for QUESST at MediaEval 2014. MediaEval 2014 - 2013
- [j5]Mireia Díez, Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel:
Language Recognition on Albayzin 2010 LRE using PLLR features. Proces. del Leng. Natural 51: 153-160 (2013) - [c18]Elie Khoury, Bostjan Vesnicer, Javier Franco-Pedroso, Ricardo P. V. Violato, Z. Boulkcnafet, L. M. Mazaira Fernandez, Mireia Díez, J. Kosmala, Houssemeddine Khemiri, T. Cipr, Rahim Saeidi, Manuel Günther, Jerneja Zganec-Gros, Rubén Zazo-Candil, Flávio Olmos Simões, Messaoud Bengherabi, Agustín Álvarez Marquina, Mikel Peñagarikano, Alberto Abad, M. Boulayemen, Petr Schwarz, David A. van Leeuwen, Javier Gonzalez-Dominguez, Mário Uliani Neto, Elhocine Boutellaa, Pedro Gómez Vilda, Amparo Varona, Dijana Petrovska-Delacrétaz, Pavel Matejka, Joaquín González-Rodríguez, Tiago Freitas Pereira, Farid Harizi, Luis Javier Rodríguez-Fuentes, Laurent El Shafey, Marcus A. Angeloni, Germán Bordel, Gérard Chollet, Sébastien Marcel:
The 2013 speaker recognition evaluation in mobile environment. ICB 2013: 1-8 - [c17]Mireia Díez, Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel:
Dimensionality reduction of phone log-likelihood ratio features for spoken language recognition. INTERSPEECH 2013: 64-68 - [c16]Luis Javier Rodríguez-Fuentes, Niko Brümmer, Mikel Peñagarikano, Amparo Varona, Germán Bordel, Mireia Díez:
The albayzin 2012 language recognition evaluation. INTERSPEECH 2013: 1497-1501 - [c15]Mireia Díez, Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel:
Using phone log-likelihood ratios as features for speaker recognition. INTERSPEECH 2013: 2504-2508 - [c14]Jesús Antonio Villalba López, Mireia Díez, Amparo Varona, Eduardo Lleida:
Handling recordings acquired simultaneously over multiple channels with PLDA. INTERSPEECH 2013: 2509-2513 - [c13]Luis Javier Rodríguez-Fuentes, Amparo Varona, Mikel Peñagarikano, Germán Bordel, Mireia Díez:
GTTS Systems for the SWS Task at MediaEval 2013. MediaEval 2013 - 2012
- [c12]Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel, Alberto Abad, David Martínez González, Jesús Antonio Villalba López, Alfonso Ortega, Eduardo Lleida:
The BLZ Submission to the NIST 2011 LRE: Data Collection, System Development and Performance. INTERSPEECH 2012: 38-41 - [c11]Mikel Peñagarikano, Amparo Varona, Luis Javier Rodríguez-Fuentes, Mireia Díez, Germán Bordel:
The EHU Systems for the NIST 2011 Language Recognition Evaluation. INTERSPEECH 2012: 2045-2048 - [c10]Mikel Peñagarikano, Amparo Varona, Mireia Díez, Luis Javier Rodríguez-Fuentes, Germán Bordel:
Study of Different Backends in a State-Of-the-Art Language Recognition System. INTERSPEECH 2012: 2049-2052 - [c9]Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel, Mireia Díez:
Using Time-Synchronous Phone Co-occurrences in a SVM-Phonotactic Dialect Recognition System. INTERSPEECH 2012: 2069-2072 - [c8]Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel:
KALAKA-2: a TV Broadcast Speech Database for the Recognition of Iberian Languages in Clean and Noisy Environments. LREC 2012: 99-105 - [c7]Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel, Mireia Díez:
GTTS System for the Spoken Web Search Task at MediaEval 2012. MediaEval 2012 - [c6]Luis Javier Rodríguez-Fuentes, Amparo Varona, Mireia Díez, Mikel Peñagarikano, Germán Bordel:
Evaluation of spoken language recognition technology using broadcast speech: performance and challenges. Odyssey 2012: 194-201 - [c5]Mireia Díez, Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel:
On the use of phone log-likelihood ratios as features in spoken language recognition. SLT 2012: 274-279 - 2011
- [j4]Amparo Varona, Silvia Nieto, Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Germán Bordel, Mireia Díez:
A Spoken Document Retrieval System for TV Broadcast News in Spanish and Basque. Proces. del Leng. Natural 47: 75-83 (2011) - [j3]Luis Javier Rodríguez-Fuentes, Amparo Varona, Mikel Peñagarikano, Mireia Díez, Germán Bordel:
Spoken language recognition in conversational telephone speech and TV broadcast news (GLOSA). Proces. del Leng. Natural 47: 349-350 (2011) - [c4]Luis Javier Rodríguez, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel, David Martínez González, Jesús Antonio Villalba López, Antonio Miguel, Alfonso Ortega, Eduardo Lleida, Alberto Abad, Oscar Koller, Isabel Trancoso, Paula Lopez-Otero, Laura Docío Fernández, Carmen García-Mateo, Rahim Saeidi, Mehdi Soufifar, Tomi Kinnunen, Torbjørn Svendsen, Pasi Fränti:
Multi-site heterogeneous system fusions for the Albayzin 2010 Language Recognition Evaluation. ASRU 2011: 377-382 - [c3]Mireia Díez, Mikel Peñagarikano, Amparo Varona, Luis Javier Rodríguez-Fuentes, Germán Bordel:
On the Use of Dot Scoring for Speaker Diarization. IbPRIA 2011: 612-619 - [c2]Luis Javier Rodríguez, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel:
The Albayzin 2010 Language Recognition Evaluation. INTERSPEECH 2011: 1529-1532 - 2010
- [j2]Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Mireia Díez, Germán Bordel:
Verification of the four Spanish official languages on TV show recordings. Proces. del Leng. Natural 45: 95-103 (2010) - [j1]Amparo Varona, Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Silvia Nieto, Mireia Díez, Germán Bordel:
Search and access to information contained in the speech of multimedia resources. Proces. del Leng. Natural 45: 317-318 (2010) - [c1]Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Germán Bordel, Amparo Varona, Mireia Díez:
KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems. LREC 2010
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-22 21:13 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint