Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–5 of 5 results for author: Chien, H S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.03177  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Pre-trained Model Representations and their Robustness against Noise for Speech Emotion Analysis

    Authors: Vikramjit Mitra, Vasudha Kowtha, Hsiang-Yun Sherry Chien, Erdrin Azemi, Carlos Avendano

    Abstract: Pre-trained model representations have demonstrated state-of-the-art performance in speech recognition, natural language processing, and other applications. Speech models, such as Bidirectional Encoder Representations from Transformers (BERT) and Hidden units BERT (HuBERT), have enabled generating lexical and acoustic representations to benefit speech recognition applications. We investigated the… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: 5 pages, conference

  2. arXiv:2211.02625  [pdf, other

    eess.SP cs.LG

    MAEEG: Masked Auto-encoder for EEG Representation Learning

    Authors: Hsiang-Yun Sherry Chien, Hanlin Goh, Christopher M. Sandino, Joseph Y. Cheng

    Abstract: Decoding information from bio-signals such as EEG, using machine learning has been a challenge due to the small data-sets and difficulty to obtain labels. We propose a reconstruction-based self-supervised learning model, the masked auto-encoder for EEG (MAEEG), for learning EEG representations by learning to reconstruct the masked EEG features using a transformer architecture. We found that MAEEG… ▽ More

    Submitted 27 October, 2022; originally announced November 2022.

    Comments: 10 pages, 5 figures, accepted by Workshop on Learning from Time Series for Health, NeurIPS2022 as poster presentation

  3. arXiv:2207.03334  [pdf, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge Distillation

    Authors: Vikramjit Mitra, Hsiang-Yun Sherry Chien, Vasudha Kowtha, Joseph Yitan Cheng, Erdrin Azemi

    Abstract: Estimating dimensional emotions, such as activation, valence and dominance, from acoustic speech signals has been widely explored over the past few years. While accurate estimation of activation and dominance from speech seem to be possible, the same for valence remains challenging. Previous research has shown that the use of lexical information can improve valence estimation performance. Lexical… ▽ More

    Submitted 2 July, 2022; originally announced July 2022.

    Comments: 5 pages, 3 figures, Interspeech 2022

  4. arXiv:2105.05944  [pdf, other

    cs.LG

    Slower is Better: Revisiting the Forgetting Mechanism in LSTM for Slower Information Decay

    Authors: Hsiang-Yun Sherry Chien, Javier S. Turek, Nicole Beckage, Vy A. Vo, Christopher J. Honey, Ted L. Willke

    Abstract: Sequential information contains short- to long-range dependencies; however, learning long-timescale information has been a challenge for recurrent neural networks. Despite improvements in long short-term memory networks (LSTMs), the forgetting mechanism results in the exponential decay of information, limiting their capacity to capture long-timescale information. Here, we propose a power law forge… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: 16 pages, 10 figures

  5. arXiv:2012.06717  [pdf, other

    cs.CL

    Mapping the Timescale Organization of Neural Language Models

    Authors: Hsiang-Yun Sherry Chien, Jinhan Zhang, Christopher. J. Honey

    Abstract: In the human brain, sequences of language input are processed within a distributed and hierarchical architecture, in which higher stages of processing encode contextual information over longer timescales. In contrast, in recurrent neural networks which perform natural language processing, we know little about how the multiple timescales of contextual information are functionally organized. Therefo… ▽ More

    Submitted 17 March, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: 23 pages, 4 main figures, 10 appendix figures; published as a conference paper at ICLR 2021