
Showing 1–50 of 229 results for author: Hansen, L

  1. arXiv:2407.20936  [pdf, other]

    quant-ph

    Non-classical excitation of a solid-state quantum emitter

    Authors: Lena M. Hansen, Francesco Giorgino, Lennart Jehle, Lorenzo Carosini, Juan Camilo López Carreño, Iñigo Arrazola, Philip Walther, Juan C. Loredo

    Abstract: The interaction between a single emitter and a single photon is a fundamental aspect of quantum optics. This interaction allows for the study of various quantum processes, such as emitter-mediated single-photon scattering and effective photon-photon interactions. However, empirical observations of this scenario and its dynamics are rare, and in most cases, only partial approximations to the fully…

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: 13 pages, 8 figures

  2. arXiv:2407.19677  [pdf, other]

    cs.CY cs.CR cs.SD eess.AS

    Navigating the United States Legislative Landscape on Voice Privacy: Existing Laws, Proposed Bills, Protection for Children, and Synthetic Data for AI

    Authors: Satwik Dutta, John H. L. Hansen

    Abstract: Privacy is a hot topic for policymakers across the globe, including the United States. Evolving advances in AI and emerging concerns about the misuse of personal data have pushed policymakers to draft legislation on trustworthy AI and privacy protection for its citizens. This paper presents the state of the privacy legislation at the U.S. Congress and outlines how voice data is considered as part…

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: 5 pages, 2 figures, accepted at the Interspeech SynData4GenAI 2024 workshop

    ACM Class: I.2; J.1

  3. arXiv:2407.05959  [pdf, other]

    eess.SY

    Time Series Dataset for Modeling and Forecasting of $N_2O$ in Wastewater Treatment

    Authors: Laura Debel Hansen, Anju Rani, Mikkel Algren Stokholm-Bjerregaard, Peter Alexander Stentoft, Daniel Ortiz Arroyo, Petar Durdevic

    Abstract: In this paper, we present two years of high-resolution nitrous oxide ($N_2O$) measurements for time series modeling and forecasting in wastewater treatment plants (WWTP). The dataset comprises frequent, real-time measurements from a full-scale WWTP, with a sample interval of 2 minutes, making it ideal for developing models for real-time operation and control. This comprehensive bio-chemical datase…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 10 pages, 4 figures. This publication accompanies the Mendeley dataset available at this URL (version 1): https://data.mendeley.com/datasets/xmbxhscgpr/1

  4. arXiv:2407.04982  [pdf]

    cond-mat.mtrl-sci physics.geo-ph

    Microstructural and Micromechanical Evolution of Olivine Aggregates During Transient Creep

    Authors: Harison S. Wiesman, Thomas Breithaupt, David Wallis, Lars N. Hansen

    Abstract: To examine the microstructural evolution that occurs during transient creep, we deformed olivine aggregates to different strains that spanned the initial transient deformation. Two sets of samples with different initial grain sizes of 5 $μ$m and 20 $μ$m were deformed in torsion at T = 1523 K, P = 300 MPa, and a constant shear strain rate of 1.5 $\times$ 10$^{-4}$ s$^{-1}$. Both sets of samples exp…

    Submitted 6 July, 2024; originally announced July 2024.

  5. arXiv:2407.04291  [pdf, other]

    eess.AS cs.LG

    We Need Variations in Speech Synthesis: Sub-center Modelling for Speaker Embeddings

    Authors: Ismail Rasim Ulgen, Carlos Busso, John H. L. Hansen, Berrak Sisman

    Abstract: In speech synthesis, modeling of rich emotions and prosodic variations present in human voice are crucial to synthesize natural speech. Although speaker embeddings have been widely used in personalized speech synthesis as conditioning inputs, they are designed to lose variation to optimize speaker recognition accuracy. Thus, they are suboptimal for speech synthesis in terms of modeling the rich va…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Submitted to IEEE Signal Processing Letters

  6. arXiv:2406.16364  [pdf]

    q-bio.OT

    The unpaved road towards efficient selective breeding in insects for food and feed

    Authors: Laura Skrubbeltrang Hansen, Stine Frey Laursen, Simon Bahrndorff, Jesper Givskov Sørensen, Goutam Sahana, Torsten Nygaard Kristensen, Hanne Marie Nielsen

    Abstract: Insect production for food and feed presents a promising supplement to ensure food safety and address the adverse impacts of agriculture on climate and environment in the future. However, optimisation is required for insect production to realise its full potential. This can be by targeted improvement of traits of interest through selective breeding, an approach which has so far been underexplored…

    Submitted 26 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  7. arXiv:2406.09981  [pdf, other]

    cs.LG cs.AI cs.CV

    Challenges in explaining deep learning models for data with biological variation

    Authors: Lenka Tětková, Erik Schou Dreier, Robin Malm, Lars Kai Hansen

    Abstract: Much machine learning research progress is based on developing models and evaluating them on a benchmark dataset (e.g., ImageNet for images). However, applying such benchmark-successful methods to real-world data often does not work as expected. This is particularly the case for biological data where we expect variability at multiple time and spatial scales. In this work, we are using grain data a…

    Submitted 14 June, 2024; originally announced June 2024.

  8. arXiv:2405.19746  [pdf, other]

    cs.CV

    DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation

    Authors: Ron Keuth, Lasse Hansen, Maren Balks, Ronja Jäger, Anne-Nele Schröder, Ludger Tüshaus, Mattias Heinrich

    Abstract: Purpose: Semantic segmentation and landmark detection are fundamental tasks of medical image processing, facilitating further analysis of anatomical objects. Although deep learning-based pixel-wise classification has set a new-state-of-the-art for segmentation, it falls short in landmark detection, a strength of shape-based approaches. Methods: In this work, we propose a dense image-to-shape rep…

    Submitted 30 May, 2024; originally announced May 2024.

  9. arXiv:2405.12017  [pdf, other]

    cond-mat.mes-hall physics.optics

    Spectrally resolved free electron-light coupling strength in a transition metal dichalcogenide

    Authors: Niklas Müller, Soufiane el Kabil, Gerrit Vosse, Lina Hansen, Christopher Rathje, Sascha Schäfer

    Abstract: Recent advancements in electron microscopy have introduced innovative techniques enabling the inelastic interaction of fast electrons with tightly confined and intense light fields. These techniques, commonly summarized under the term photon-induced nearfield electron microscopy now offer unprecedented capabilities for a precise mapping of the characteristics of optical near-fields with remarkable…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 19 pages, 3 figures, 40 references

  10. arXiv:2405.05049  [pdf]

    cs.CL

    Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources

    Authors: Lasse Hyldig Hansen, Nikolaj Andersen, Jack Gallifant, Liam G. McCoy, James K Stone, Nura Izath, Marcela Aguirre-Jerez, Danielle S Bitterman, Judy Gichoya, Leo Anthony Celi

    Abstract: Background Advancements in Large Language Models (LLMs) hold transformative potential in healthcare, however, recent work has raised concern about the tendency of these models to produce outputs that display racial or gender biases. Although training data is a likely source of such biases, exploration of disease and demographic associations in text data at scale has been limited. Methods We cond…

    Submitted 8 May, 2024; originally announced May 2024.

  11. arXiv:2404.07711  [pdf, other]

    cs.CV

    OpenTrench3D: A Photogrammetric 3D Point Cloud Dataset for Semantic Segmentation of Underground Utilities

    Authors: Lasse H. Hansen, Simon B. Jensen, Mark P. Philipsen, Andreas Møgelmose, Lars Bodum, Thomas B. Moeslund

    Abstract: Identifying and classifying underground utilities is an important task for efficient and effective urban planning and infrastructure maintenance. We present OpenTrench3D, a novel and comprehensive 3D Semantic Segmentation point cloud dataset, designed to advance research and development in underground utility surveying and mapping. OpenTrench3D covers a completely novel domain for public 3D point…

    Submitted 11 April, 2024; originally announced April 2024.

  12. Knowledge graphs for empirical concept retrieval

    Authors: Lenka Tětková, Teresa Karen Scheidt, Maria Mandrup Fogh, Ellen Marie Gaunby Jørgensen, Finn Årup Nielsen, Lars Kai Hansen

    Abstract: Concept-based explainable AI is promising as a tool to improve the understanding of complex models at the premises of a given user, viz. as a tool for personalized explainability. An important class of concept-based explainability methods is constructed with empirically defined concepts, indirectly defined through a set of positive and negative examples, as in the TCAV approach (Kim et al., 2018)…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Preprint. Accepted to The 2nd World Conference on eXplainable Artificial Intelligence

  13. arXiv:2403.12866  [pdf, other]

    quant-ph

    Purifying photon indistinguishability through quantum interference

    Authors: Carlos F. D. Faurby, Lorenzo Carosini, Huan Cao, Patrik I. Sund, Lena M. Hansen, Francesco Giorgino, Andrew B. Villadsen, Stefan N. van den Hoven, Peter Lodahl, Stefano Paesani, Juan C. Loredo, Philip Walther

    Abstract: Indistinguishability between photons is a key requirement for scalable photonic quantum technologies. We experimentally demonstrate that partly distinguishable single photons can be purified to reach near-unity indistinguishability by the process of quantum interference with ancillary photons followed by heralded detection of a subset of them. We report on the indistinguishability of the purified…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 14 pages, 7 figures

  14. arXiv:2403.00293  [pdf, other]

    eess.AS cs.LG cs.SD

    Efficient Adapter Tuning of Pre-trained Speech Models for Automatic Speaker Verification

    Authors: Mufan Sang, John H. L. Hansen

    Abstract: With excellent generalization ability, self-supervised speech models have shown impressive performance on various downstream speech tasks in the pre-training and fine-tuning paradigm. However, as the growing size of pre-trained models, fine-tuning becomes practically unfeasible due to heavy computation and storage overhead, as well as the risk of overfitting. Adapters are lightweight modules inser…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: Accepted to ICASSP 2024

  15. arXiv:2401.06091  [pdf, other]

    cs.LG stat.ME

    A Closer Look at AUROC and AUPRC under Class Imbalance

    Authors: Matthew B. A. McDermott, Lasse Hyldig Hansen, Haoran Zhang, Giovanni Angelotti, Jack Gallifant

    Abstract: In machine learning (ML), a widespread adage is that the area under the precision-recall curve (AUPRC) is a superior metric for model comparison to the area under the receiver operating characteristic (AUROC) for binary classification tasks with class imbalance. This paper challenges this notion through novel mathematical analysis, illustrating that AUROC and AUPRC can be concisely related in prob…

    Submitted 18 April, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  16. arXiv:2311.18364  [pdf, other]

    cs.CL cs.LG cs.SI

    Hubness Reduction Improves Sentence-BERT Semantic Spaces

    Authors: Beatrix M. G. Nielsen, Lars Kai Hansen

    Abstract: Semantic representations of text, i.e. representations of natural language which capture meaning by geometry, are essential for areas such as information retrieval and document grouping. High-dimensional trained dense vectors have received much attention in recent years as such representations. We investigate the structure of semantic spaces that arise from embeddings made with Sentence-BERT and f…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted at NLDL 2024

  17. arXiv:2311.08878  [pdf, other]

    eess.AS cs.SD

    Multi-objective Non-intrusive Hearing-aid Speech Assessment Model

    Authors: Hsin-Tien Chiang, Szu-Wei Fu, Hsin-Min Wang, Yu Tsao, John H. L. Hansen

    Abstract: Without the need for a clean reference, non-intrusive speech assessment methods have caught great attention for objective evaluations. While deep learning models have been used to develop non-intrusive speech assessment methods with promising results, there is limited research on hearing-impaired subjects. This study proposes a multi-objective non-intrusive hearing-aid speech assessment model, cal…

    Submitted 15 November, 2023; originally announced November 2023.

  18. arXiv:2311.07264  [pdf, other]

    cs.CL

    Danish Foundation Models

    Authors: Kenneth Enevoldsen, Lasse Hansen, Dan S. Nielsen, Rasmus A. F. Egebæk, Søren V. Holm, Martin C. Nielsen, Martin Bernstorff, Rasmus Larsen, Peter B. Jørgensen, Malte Højmark-Bertelsen, Peter B. Vahlstrup, Per Møldrup-Dalum, Kristoffer Nielbo

    Abstract: Large language models, sometimes referred to as foundation models, have transformed multiple fields of research. However, smaller languages risk falling behind due to high training costs and small incentives for large companies to train these models. To combat this, the Danish Foundation Models project seeks to provide and maintain open, well-documented, and high-quality foundation models for the…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 4 pages, 2 tables

  19. MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition

    Authors: Jiamin Xie, John H. L. Hansen

    Abstract: In this paper, we present MixRep, a simple and effective data augmentation strategy based on mixup for low-resource ASR. MixRep interpolates the feature dimensions of hidden representations in the neural network that can be applied to both the acoustic feature input and the output of each layer, which generalizes the previous MixSpeech method. Further, we propose to combine the mixup with a regula…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted to Interspeech 2023

  20. arXiv:2310.16981  [pdf, other]

    cs.LG

    Reimagining Synthetic Tabular Data Generation through Data-Centric AI: A Comprehensive Benchmark

    Authors: Lasse Hansen, Nabeel Seedat, Mihaela van der Schaar, Andrija Petrovic

    Abstract: Synthetic data serves as an alternative in training machine learning models, particularly when real-world data is limited or inaccessible. However, ensuring that synthetic data mirrors the complex nuances of real-world data is a challenging task. This paper addresses this issue by exploring the potential of integrating data-centric AI techniques which profile the data to guide the synthetic data g…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Presented at NeurIPS 2023 (Datasets & Benchmarks). *Hansen & Seedat contributed equally

  21. arXiv:2310.13200  [pdf, other]

    econ.GN cs.LG

    A Deep Learning Analysis of Climate Change, Innovation, and Uncertainty

    Authors: Michael Barnett, William Brock, Lars Peter Hansen, Ruimeng Hu, Joseph Huang

    Abstract: We study the implications of model uncertainty in a climate-economics framework with three types of capital: "dirty" capital that produces carbon emissions when used for production, "clean" capital that generates no emissions but is initially less productive than dirty capital, and knowledge capital that increases with R&D investment and leads to technological innovation in green sector productiv…

    Submitted 19 October, 2023; originally announced October 2023.

  22. arXiv:2310.11004  [pdf, other]

    eess.AS eess.SP

    Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition

    Authors: Shahram Ghorbani, John H. L. Hansen

    Abstract: Accurately classifying accents and assessing accentedness in non-native speakers are both challenging tasks due to the complexity and diversity of accent and dialect variations. In this study, embeddings from advanced pre-trained language identification (LID) and speaker identification (SID) models are leveraged to improve the accuracy of accent classification and non-native accentedness assessmen…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Submitted to The Journal of the Acoustical Society of America

  23. arXiv:2309.09688  [pdf, other]

    physics.flu-dyn physics.geo-ph

    Granular dilatancy and non-local fluidity of partially molten rock

    Authors: Richard F. Katz, John F. Rudge, Lars N. Hansen

    Abstract: Partially molten rock is a densely packed, melt-saturated, granular medium, but it has seldom been considered in these terms. In this manuscript, we extend the continuum theory of partially molten rock to incorporate the physics of granular media. Our formulation includes dilatancy in a viscous constitutive law and introduces a non-local fluidity. We analyse the resulting poro-viscous--granular th…

    Submitted 27 November, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 31 pages, 9 figures, 4 appendices

    Journal ref: Journal of Fluid Mechanics, 2023

  24. A photonic source of heralded GHZ states

    Authors: H. Cao, L. M. Hansen, F. Giorgino, L. Carosini, P. Zahalka, F. Zilk, J. C. Loredo, P. Walther

    Abstract: Generating large multiphoton entangled states is of main interest due to enabling universal photonic quantum computing and all-optical quantum repeater nodes. These applications exploit measurement-based quantum computation using cluster states. Remarkably, it was shown that photonic cluster states of arbitrary size can be generated by using feasible heralded linear optics fusion gates that act on…

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: 6 pages, 5 figures

    Journal ref: Phys. Rev. Lett. 132, 130604 (2024)

  25. arXiv:2307.12745  [pdf, ps, other]

    cs.LG eess.SP stat.ML

    Concept-based explainability for an EEG transformer model

    Authors: Anders Gjølbye Madsen, William Theodor Lehn-Schiøler, Áshildur Jónsdóttir, Bergdís Arnardóttir, Lars Kai Hansen

    Abstract: Deep learning models are complex due to their size, structure, and inherent randomness in training procedures. Additional complexity arises from the selection of datasets and inductive biases. Addressing these challenges for explainability, Kim et al. (2018) introduced Concept Activation Vectors (CAVs), which aim to understand deep models' internal states in terms of human-aligned concepts. These…

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: To appear in proceedings of 2023 IEEE International workshop on Machine Learning for Signal Processing

  26. arXiv:2306.16997  [pdf, other]

    cs.CV

    Unsupervised 3D registration through optimization-guided cyclical self-training

    Authors: Alexander Bigalke, Lasse Hansen, Tony C. W. Mok, Mattias P. Heinrich

    Abstract: State-of-the-art deep learning-based registration methods employ three different learning strategies: supervised learning, which requires costly manual annotations, unsupervised learning, which heavily relies on hand-crafted similarity metrics designed by domain experts, or learning from synthetic data, which introduces a domain shift. To overcome the limitations of these strategies, we propose a…

    Submitted 20 July, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: accepted at MICCAI 2023

  27. arXiv:2306.06524  [pdf, other]

    eess.AS cs.CL cs.SD

    What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model

    Authors: Mu Yang, Ram C. M. C. Shekar, Okim Kang, John H. L. Hansen

    Abstract: This study is focused on understanding and quantifying the change in phoneme and prosody information encoded in the Self-Supervised Learning (SSL) model, brought by an accent identification (AID) fine-tuning task. This problem is addressed based on model probing. Specifically, we conduct a systematic layer-wise analysis of the representations of the Transformer layers on a phoneme correlation task…

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: Accepted by Interspeech 2023

  28. arXiv:2306.03009  [pdf, other]

    stat.ML cs.LG stat.AP

    Using Sequences of Life-events to Predict Human Lives

    Authors: Germans Savcisens, Tina Eliassi-Rad, Lars Kai Hansen, Laust Mortensen, Lau Lilleholt, Anna Rogers, Ingo Zettler, Sune Lehmann

    Abstract: Over the past decade, machine learning has revolutionized computers' ability to analyze text through flexible computational models. Due to their structural similarity to written language, transformer-based architectures have also shown promise as tools to make sense of a range of multi-variate sequences from protein-structures, music, electronic health records to weather-forecasts. We can also rep…

    Submitted 5 June, 2023; originally announced June 2023.

    Journal ref: Nature Computational Science 4 (2024) 43-56

  29. arXiv:2306.00561  [pdf, other]

    cs.SD cs.AI eess.AS

    Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners

    Authors: Sarthak Yadav, Sergios Theodoridis, Lars Kai Hansen, Zheng-Hua Tan

    Abstract: In this work, we propose a Multi-Window Masked Autoencoder (MW-MAE) fitted with a novel Multi-Window Multi-Head Attention (MW-MHA) module that facilitates the modelling of local-global interactions in every decoder transformer block through attention heads of several distinct local and global windows. Empirical results on ten downstream audio tasks show that MW-MAEs consistently outperform standar…

    Submitted 1 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  30. arXiv:2305.20017  [pdf, ps, other]

    quant-ph cond-mat.mes-hall physics.optics

    Controlling the Photon Number Coherence of Solid-state Quantum Light Sources for Quantum Cryptography

    Authors: Yusuf Karli, Daniel A. Vajner, Florian Kappe, Paul C. A. Hagen, Lena M. Hansen, René Schwarz, Thomas K. Bracht, Christian Schimpf, Saimon F. Covre da Silva, Philip Walther, Armando Rastelli, Vollrath Martin Axt, Juan C. Loredo, Vikas Remesh, Tobias Heindel, Doris E. Reiter, Gregor Weihs

    Abstract: Quantum communication networks rely on quantum cryptographic protocols including quantum key distribution (QKD) using single photons. A critical element regarding the security of QKD protocols is the photon number coherence (PNC), i.e. the phase relation between the zero and one-photon Fock state, which critically depends on the excitation scheme. Thus, to obtain flying qubits with the desired pro…

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: 17 pages

    Journal ref: npj Quantum Inf 10, 17 (2024)

  31. arXiv:2305.17154  [pdf, other]

    cs.LG cs.AI

    On convex decision regions in deep network representations

    Authors: Lenka Tětková, Thea Brüsch, Teresa Karen Scheidt, Fabian Martin Mager, Rasmus Ørtoft Aagaard, Jonathan Foldager, Tommy Sonne Alstrøm, Lars Kai Hansen

    Abstract: Current work on human-machine alignment aims at understanding machine-learned latent spaces and their correspondence to human representations. Gärdenfors' conceptual spaces is a prominent framework for understanding human representations. Convexity of object regions in conceptual spaces is argued to promote generalizability, few-shot learning, and interpersonal alignment. Based on these insights…

    Submitted 6 October, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  32. Programmable multi-photon quantum interference in a single spatial mode

    Authors: Lorenzo Carosini, Virginia Oddi, Francesco Giorgino, Lena M. Hansen, Benoit Seron, Simone Piacentini, Tobias Guggemos, Iris Agresti, Juan Carlos Loredo, Philip Walther

    Abstract: The interference of non-classical states of light enables quantum-enhanced applications reaching from metrology to computation. Most commonly, the polarisation or spatial location of single photons are used as addressable degrees-of-freedom for turning these applications into praxis. However, the scale-up for the processing of a large number of photons of such architectures is very resource demand…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 8 pages, 5 figures

  33. Single-active-element demultiplexed multi-photon source

    Authors: Lena M. Hansen, Lorenzo Carosini, Lennart Jehle, Francesco Giorgino, Romane Houvenaghel, Michal Vyvlecka, Juan C. Loredo, Philip Walther

    Abstract: Temporal-to-spatial demultiplexing routes non-simultaneous events of the same spatial mode to distinct output trajectories. This technique has now been widely adopted because it gives access to higher-number multi-photon states when exploiting solid-state quantum emitters. However, implementations so far have required an always-increasing number of active elements, rapidly facing resource constrai…

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: 7 pages, 7 figures

    Journal ref: Optica Quantum 1(1), 1-5 (2023)

  34. Robustness of Visual Explanations to Common Data Augmentation

    Authors: Lenka Tětková, Lars Kai Hansen

    Abstract: As the use of deep neural networks continues to grow, understanding their behaviour has become more crucial than ever. Post-hoc explainability methods are a potential solution, but their reliability is being called into question. Our research investigates the response of post-hoc visual explanations to naturally occurring transformations, often referred to as augmentations. We anticipate explanati…

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: Accepted to The 2nd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2023

  35. arXiv:2303.17719  [pdf, other]

    cs.CV cs.LG

    Why is the winner the best?

    Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Sharib Ali, Vincent Andrearczyk, Marc Aubreville, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, Jorge Bernal, Sebastian Bodenstedt, Alessandro Casella, Veronika Cheplygina, Marie Daum, Marleen de Bruijne, Adrien Depeursinge, Reuben Dorent, Jan Egger, David G. Ellis, Sandy Engelhardt, Melanie Ganz, et al. (100 additional authors not shown)

    Abstract: International benchmarking competitions have become fundamental for the comparative performance assessment of image analysis methods. However, little attention has been given to investigating what can be learnt from these competitions. Do they really generate scientific progress? What are common and successful participation strategies? What makes a solution superior to a competing method? To addre…

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: accepted to CVPR 2023

  36. arXiv:2302.08639  [pdf, other]

    eess.AS cs.LG cs.SD

    Improving Transformer-based Networks With Locality For Automatic Speaker Verification

    Authors: Mufan Sang, Yong Zhao, Gang Liu, John H. L. Hansen, Jian Wu

    Abstract: Recently, Transformer-based architectures have been explored for speaker embedding extraction. Although the Transformer employs the self-attention mechanism to efficiently model the global interaction between token embeddings, it is inadequate for capturing short-range local context, which is essential for the accurate extraction of speaker information. In this study, we enhance the Transformer wi…

    Submitted 28 February, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: Accepted to ICASSP 2023

  37. Calibration and data analysis routines for nanoindentation with spherical tips

    Authors: Diana Avadanii, Anna Kareer, Lars Hansen, Angus Wilkinson

    Abstract: Instrumented spherical nanoindentation with a continuous stiffness measurement has gained increased popularity in material science studies in brittle and ductile materials alike. These investigations span hypotheses related to a wide range of microphysics involving grain boundaries, twins, dislocation densities, ion-induced damage and more. These studies rely on the implementation of different met…

    Submitted 5 February, 2023; originally announced February 2023.

  38. arXiv:2301.06916  [pdf, other]

    cs.CL cs.LG cs.SD stat.AP

    Automated speech- and text-based classification of neuropsychiatric conditions in a multidiagnostic setting

    Authors: Lasse Hansen, Roberta Rocca, Arndis Simonsen, Alberto Parola, Vibeke Bliksted, Nicolai Ladegaard, Dan Bang, Kristian Tylén, Ethan Weed, Søren Dinesen Østergaard, Riccardo Fusaroli

    Abstract: Speech patterns have been identified as potential diagnostic markers for neuropsychiatric conditions. However, most studies only compare a single clinical group to healthy controls, whereas clinical practice often requires differentiating between multiple potential diagnoses (multiclass settings). To address this, we assembled a dataset of repeated recordings from 420 participants (67 with major d…

    Submitted 31 January, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: 24 pages, 5 figures

  39. arXiv:2301.05983  [pdf, other]

    stat.ML cs.LG

    On the role of Model Uncertainties in Bayesian Optimization

    Authors: Jonathan Foldager, Mikkel Jordahn, Lars Kai Hansen, Michael Riis Andersen

    Abstract: Bayesian optimization (BO) is a popular method for black-box optimization, which relies on uncertainty as part of its decision-making process when deciding which experiment to perform next. However, not much work has addressed the effect of uncertainty on the performance of the BO algorithm and to what extent calibrated uncertainties improve the ability to find the global optimum. In this work, we…

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: 14 pages, 4 figures, 2 tables

  40. TextDescriptives: A Python package for calculating a large variety of metrics from text

    Authors: Lasse Hansen, Ludvig Renbo Olsen, Kenneth Enevoldsen

    Abstract: TextDescriptives is a Python package for calculating a large variety of metrics from text. It is built on top of spaCy and can be easily integrated into existing workflows. The package has already been used for analysing the linguistic stability of clinical texts, creating features for predicting neuropsychiatric conditions, and analysing linguistic goals of primary school students. This paper des…

    Submitted 28 March, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: 3 pages, 0 figures. Submitted to Journal of Open Source Software

    Journal ref: Journal of Open Source Software, 8(84), 5153 (2023)

  41. arXiv:2212.08568

    cs.CV cs.LG

    Biomedical image analysis competitions: The state of current participation practice

    Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, et al. (331 additional authors not shown)

    Abstract: The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis,…

    Submitted 12 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  42. arXiv:2211.12632

    eess.AS cs.LG cs.SD eess.SP

    Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation

    Authors: Vinay Kothapally, John H. L. Hansen

    Abstract: Several speech processing systems have demonstrated considerable performance improvements when deep complex neural networks (DCNN) are coupled with self-attention (SA) networks. However, the majority of DCNN-based studies on speech dereverberation that employ self-attention do not explicitly account for the inter-dependencies between real and imaginary features when computing attention. In this st…

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: Interspeech 2022: ISCA Best Student Paper Award Finalist

  43. arXiv:2211.12623

    eess.AS cs.LG cs.SD

    SkipConvGAN: Monaural Speech Dereverberation using Generative Adversarial Networks via Complex Time-Frequency Masking

    Authors: Vinay Kothapally, J. H. L. Hansen

    Abstract: With the advancements in deep learning approaches, the performance of speech enhancing systems in the presence of background noise has shown significant improvements. However, improving the system's robustness against reverberation is still a work in progress, as reverberation tends to cause loss of formant structure due to smearing effects in time and frequency. A wide range of deep learning-bas…

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (Volume: 30)

  44. Anatomy-guided domain adaptation for 3D in-bed human pose estimation

    Authors: Alexander Bigalke, Lasse Hansen, Jasper Diesel, Carlotta Hennigs, Philipp Rostalski, Mattias P. Heinrich

    Abstract: 3D human pose estimation is a key component of clinical monitoring systems. The clinical applicability of deep pose estimation models, however, is limited by their poor generalization under domain shifts along with their need for sufficient labeled training data. As a remedy, we present a novel domain adaptation method, adapting a model from a labeled source to a shifted unlabeled target domain. O…

    Submitted 4 July, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: accepted at Medical Image Analysis

    Journal ref: Medical Image Analysis 89, 2023, 102887

  45. arXiv:2211.10565

    eess.AS cs.HC cs.LG cs.SD

    Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting

    Authors: Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen

    Abstract: In the context of keyword spotting (KWS), the replacement of handcrafted speech features by learnable features has not yielded superior KWS performance. In this study, we demonstrate that filterbank learning outperforms handcrafted speech features for KWS whenever the number of filterbank channels is severely decreased. Reducing the number of channels might yield certain KWS performance drop, but…

    Submitted 23 February, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

  46. arXiv:2211.09913

    cs.SD cs.AI eess.AS

    Multi-source Domain Adaptation for Text-independent Forensic Speaker Recognition

    Authors: Zhenyu Wang, John H. L. Hansen

    Abstract: Adapting speaker recognition systems to new environments is a widely-used technique to improve a well-performing model learned from large-scale data towards task-specific small-scale data scenarios. However, previous studies focus on single domain adaptation, which neglects a more practical scenario where training data are collected from multiple acoustic domains needed in forensic scenarios. Au…

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING

  47. Audio Anti-spoofing Using a Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning

    Authors: Zhenyu Wang, John H. L. Hansen

    Abstract: Automatic speaker verification systems are vulnerable to a variety of access threats, prompting research into the formulation of effective spoofing detection systems to act as a gate to filter out such spoofing attacks. This study introduces a simple attention module to infer 3-dim attention weights for the feature map in a convolutional layer, which then optimizes an energy function to determine…

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: Interspeech 2022

  48. arXiv:2211.02051

    eess.AS cs.SD

    Fearless Steps Challenge Phase-1 Evaluation Plan

    Authors: Aditya Joglekar, John H. L. Hansen

    Abstract: The Fearless Steps Challenge 2019 Phase-1 (FSC-P1) is the inaugural Challenge of the Fearless Steps Initiative hosted by the Center for Robust Speech Systems (CRSS) at the University of Texas at Dallas. The goal of this Challenge is to evaluate the performance of state-of-the-art speech and language systems for large task-oriented teams with naturalistic audio in challenging environments. Research…

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: Document Generated in February 2019 for conducting the Fearless Steps Challenge Phase-1 and its associated ISCA Interspeech-2019 Special Session

  49. arXiv:2208.02778

    eess.AS cs.SD

    Attention and DCT based Global Context Modeling for Text-independent Speaker Recognition

    Authors: Wei Xia, John H. L. Hansen

    Abstract: Learning an effective speaker representation is crucial for achieving reliable performance in speaker verification tasks. Speech signals are high-dimensional, long, and variable-length sequences containing diverse information at each time-frequency (TF) location. The standard convolutional layer that operates on neighboring local regions often fails to capture the complex TF global information. Ou…

    Submitted 23 August, 2023; v1 submitted 4 August, 2022; originally announced August 2022.

  50. arXiv:2207.04540

    eess.AS cs.LG cs.SD

    Multi-Frequency Information Enhanced Channel Attention Module for Speaker Representation Learning

    Authors: Mufan Sang, John H. L. Hansen

    Abstract: Recently, attention mechanisms have been applied successfully in neural network-based speaker verification systems. Incorporating the Squeeze-and-Excitation block into convolutional neural networks has achieved remarkable performance. However, it uses global average pooling (GAP) to simply average the features along time and frequency dimensions, which is incapable of preserving sufficient speaker…

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: Accepted to Interspeech 2022