Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–26 of 26 results for author: Goh, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.15308  [pdf, ps, other

    eess.SP cs.LG

    Label-Efficient Sleep Staging Using Transformers Pre-trained with Position Prediction

    Authors: Sayeri Lala, Hanlin Goh, Christopher Sandino

    Abstract: Sleep staging is a clinically important task for diagnosing various sleep disorders, but remains challenging to deploy at scale because it because it is both labor-intensive and time-consuming. Supervised deep learning-based approaches can automate sleep staging but at the expense of large labeled datasets, which can be unfeasible to procure for various settings, e.g., uncommon sleep disorders. Wh… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

    Comments: 4 pages, 1 figure. This was work was presented at the IEEE International Conference on AI for Medicine, Health, and Care 2024

  2. arXiv:2404.06087  [pdf, other

    quant-ph cond-mat.dis-nn cond-mat.stat-mech cs.DS

    The Overlap Gap Property limits limit swapping in QAOA

    Authors: Mark Xin Hong Goh

    Abstract: The Quantum Approximate Optimization Algorithm (QAOA) is a quantum algorithm designed for Combinatorial Optimization Problem (COP). We show that if a COP with an underlying Erdös--Rényi hypergraph exhibits the Overlap Gap Property (OGP), then a random regular hypergraph exhibits it as well. Given that Max-$q$-XORSAT on an Erdös--Rényi hypergraph is known to exhibit the OGP, and since the performan… ▽ More

    Submitted 6 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: 22 pages, 2 figures

  3. arXiv:2401.15914  [pdf, other

    cs.CV cs.AI

    Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization

    Authors: Yuhang Zang, Hanlin Goh, Josh Susskind, Chen Huang

    Abstract: Existing vision-language models exhibit strong generalization on a variety of visual domains and tasks. However, such models mainly perform zero-shot recognition in a closed-set manner, and thus struggle to handle open-domain visual concepts by design. There are recent finetuning methods, such as prompt learning, that not only study the discrimination between in-distribution (ID) and out-of-distri… ▽ More

    Submitted 15 April, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: ICLR 2024

  4. arXiv:2312.04000  [pdf, other

    cs.LG cs.CV

    LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures

    Authors: Vimal Thilak, Chen Huang, Omid Saremi, Laurent Dinh, Hanlin Goh, Preetum Nakkiran, Joshua M. Susskind, Etai Littwin

    Abstract: Joint embedding (JE) architectures have emerged as a promising avenue for acquiring transferable data representations. A key obstacle to using JE methods, however, is the inherent challenge of evaluating learned representations without access to a downstream task, and an annotated dataset. Without efficient and reliable evaluation, it is difficult to iterate on architectural and training choices f… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Technical report

  5. arXiv:2310.14545  [pdf

    cs.CL

    Harnessing ChatGPT for thematic analysis: Are we ready?

    Authors: V Vien Lee, Stephanie C. C. van der Lubbe, Lay Hoon Goh, Jose M. Valderas

    Abstract: ChatGPT is an advanced natural language processing tool with growing applications across various disciplines in medical research. Thematic analysis, a qualitative research method to identify and interpret patterns in data, is one application that stands to benefit from this technology. This viewpoint explores the utilization of ChatGPT in three core phases of thematic analysis within a medical con… ▽ More

    Submitted 23 October, 2023; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: 23 pages, 7 figures, 3 tables, 1 textbox

  6. arXiv:2309.05927  [pdf, other

    cs.LG cs.AI eess.SP

    Frequency-Aware Masked Autoencoders for Multimodal Pretraining on Biosignals

    Authors: Ran Liu, Ellen L. Zippi, Hadi Pouransari, Chris Sandino, Jingping Nie, Hanlin Goh, Erdrin Azemi, Ali Moin

    Abstract: Leveraging multimodal information from biosignals is vital for building a comprehensive representation of people's physical and mental states. However, multimodal biosignals often exhibit substantial distributional shifts between pretraining and inference datasets, stemming from changes in task specification or variations in modality compositions. To achieve effective pretraining in the presence o… ▽ More

    Submitted 18 April, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: Extended version of ICLR 2024 Learning from Time Series for Health workshop

  7. arXiv:2303.03679  [pdf, other

    cs.LG cs.CV

    MAST: Masked Augmentation Subspace Training for Generalizable Self-Supervised Priors

    Authors: Chen Huang, Hanlin Goh, Jiatao Gu, Josh Susskind

    Abstract: Recent Self-Supervised Learning (SSL) methods are able to learn feature representations that are invariant to different data augmentations, which can then be transferred to downstream tasks of interest. However, different downstream tasks require different invariances for their best performance, so the optimal choice of augmentations for SSL depends on the target task. In this paper, we aim to lea… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: ICLR 2023

  8. arXiv:2301.11856  [pdf, other

    cs.LG stat.ML

    ActiveLab: Active Learning with Re-Labeling by Multiple Annotators

    Authors: Hui Wen Goh, Jonas Mueller

    Abstract: In real-world data labeling applications, annotators often provide imperfect labels. It is thus common to employ multiple annotators to label data with some overlap between their examples. We study active learning in such settings, aiming to train an accurate classifier by collecting a dataset with the fewest total annotations. Here we propose ActiveLab, a practical method to decide what to label… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Journal ref: ICLR 2023 Workshop on Trustworthy ML

  9. arXiv:2211.02625  [pdf, other

    eess.SP cs.LG

    MAEEG: Masked Auto-encoder for EEG Representation Learning

    Authors: Hsiang-Yun Sherry Chien, Hanlin Goh, Christopher M. Sandino, Joseph Y. Cheng

    Abstract: Decoding information from bio-signals such as EEG, using machine learning has been a challenge due to the small data-sets and difficulty to obtain labels. We propose a reconstruction-based self-supervised learning model, the masked auto-encoder for EEG (MAEEG), for learning EEG representations by learning to reconstruct the masked EEG features using a transformer architecture. We found that MAEEG… ▽ More

    Submitted 27 October, 2022; originally announced November 2022.

    Comments: 10 pages, 5 figures, accepted by Workshop on Learning from Time Series for Health, NeurIPS2022 as poster presentation

  10. arXiv:2210.06812  [pdf, other

    cs.LG cs.HC stat.ML

    CROWDLAB: Supervised learning to infer consensus labels and quality scores for data with multiple annotators

    Authors: Hui Wen Goh, Ulyana Tkachenko, Jonas Mueller

    Abstract: Real-world data for classification is often labeled by multiple annotators. For analyzing such data, we introduce CROWDLAB, a straightforward approach to utilize any trained classifier to estimate: (1) A consensus label for each example that aggregates the available annotations; (2) A confidence score for how likely each consensus label is correct; (3) A rating for each annotator quantifying the o… ▽ More

    Submitted 27 January, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Journal ref: NeurIPS 2022 Human in the Loop Learning Workshop

  11. arXiv:2210.02445  [pdf, other

    eess.IV cs.CV cs.LG

    Localizing Anatomical Landmarks in Ocular Images using Zoom-In Attentive Networks

    Authors: Xiaofeng Lei, Shaohua Li, Xinxing Xu, Huazhu Fu, Yong Liu, Yih-Chung Tham, Yangqin Feng, Mingrui Tan, Yanyu Xu, Jocelyn Hui Lin Goh, Rick Siow Mong Goh, Ching-Yu Cheng

    Abstract: Localizing anatomical landmarks are important tasks in medical image analysis. However, the landmarks to be localized often lack prominent visual features. Their locations are elusive and easily confused with the background, and thus precise localization highly depends on the context formed by their surrounding areas. In addition, the required precision is usually higher than segmentation and obje… ▽ More

    Submitted 22 December, 2022; v1 submitted 25 September, 2022; originally announced October 2022.

  12. arXiv:2209.13156  [pdf, other

    cs.CV cs.AI

    Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents

    Authors: Yao-Hung Hubert Tsai, Hanlin Goh, Ali Farhadi, Jian Zhang

    Abstract: The perception system in personalized mobile agents requires developing indoor scene understanding models, which can understand 3D geometries, capture objectiveness, analyze human behaviors, etc. Nonetheless, this direction has not been well-explored in comparison with models for outdoor environments (e.g., the autonomous driving system that includes pedestrian prediction, car detection, traffic s… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: Submitted to ICRA2023

  13. arXiv:2207.13751  [pdf, other

    cs.CV cs.GR cs.LG

    GAUDI: A Neural Architect for Immersive 3D Scene Generation

    Authors: Miguel Angel Bautista, Pengsheng Guo, Samira Abnar, Walter Talbott, Alexander Toshev, Zhuoyuan Chen, Laurent Dinh, Shuangfei Zhai, Hanlin Goh, Daniel Ulbricht, Afshin Dehghan, Josh Susskind

    Abstract: We introduce GAUDI, a generative model capable of capturing the distribution of complex and realistic 3D scenes that can be rendered immersively from a moving camera. We tackle this challenging problem with a scalable yet powerful approach, where we first optimize a latent representation that disentangles radiance fields and camera poses. This latent representation is then used to learn a generati… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: Project webpage: https://github.com/apple/ml-gaudi

  14. arXiv:2207.07611  [pdf, other

    cs.LG cs.CV cs.SD eess.AS

    Position Prediction as an Effective Pretraining Strategy

    Authors: Shuangfei Zhai, Navdeep Jaitly, Jason Ramapuram, Dan Busbridge, Tatiana Likhomanenko, Joseph Yitan Cheng, Walter Talbott, Chen Huang, Hanlin Goh, Joshua Susskind

    Abstract: Transformers have gained increasing popularity in a wide range of applications, including Natural Language Processing (NLP), Computer Vision and Speech Recognition, because of their powerful representational capacity. However, harnessing this representational capacity effectively requires a large amount of data, strong regularization, or both, to mitigate overfitting. Recently, the power of the Tr… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: Accepted to ICML 2022

  15. arXiv:2107.13966  [pdf

    cs.CY cs.LG cs.RO

    Artificial Intelligence in Achieving Sustainable Development Goals

    Authors: Hoe-Han Goh

    Abstract: This perspective illustrates some of the AI applications that can accelerate the achievement of SDGs and also highlights some of the considerations that could hinder the efforts towards them. This emphasizes the importance of establishing standard AI guidelines and regulations for the beneficial applications of AI.

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: 10 pages, 1 figure, under evaluation as a Perspective in Science

  16. arXiv:2107.00364  [pdf, other

    cs.LG

    Implicit Acceleration and Feature Learning in Infinitely Wide Neural Networks with Bottlenecks

    Authors: Etai Littwin, Omid Saremi, Shuangfei Zhai, Vimal Thilak, Hanlin Goh, Joshua M. Susskind, Greg Yang

    Abstract: We analyze the learning dynamics of infinitely wide neural networks with a finite sized bottle-neck. Unlike the neural tangent kernel limit, a bottleneck in an otherwise infinite width network al-lows data dependent feature learning in its bottle-neck representation. We empirically show that a single bottleneck in infinite networks dramatically accelerates training when compared to purely in-finit… ▽ More

    Submitted 2 July, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

  17. arXiv:2106.12747  [pdf

    cs.LG

    Automated Agriculture Commodity Price Prediction System with Machine Learning Techniques

    Authors: Zhiyuan Chen, Howe Seng Goh, Kai Ling Sin, Kelly Lim, Nicole Ka Hei Chung, Xin Yu Liew

    Abstract: The intention of this research is to study and design an automated agriculture commodity price prediction system with novel machine learning techniques. Due to the increasing large amounts historical data of agricultural commodity prices and the need of performing accurate prediction of price fluctuations, the solution has largely shifted from statistical methods to machine learning area. However,… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: This paper has been submitted to Advances in Science, Technology and Engineering Systems Journal

  18. arXiv:2105.14103  [pdf, other

    cs.LG cs.CL cs.CV

    An Attention Free Transformer

    Authors: Shuangfei Zhai, Walter Talbott, Nitish Srivastava, Chen Huang, Hanlin Goh, Ruixiang Zhang, Josh Susskind

    Abstract: We introduce Attention Free Transformer (AFT), an efficient variant of Transformers that eliminates the need for dot product self attention. In an AFT layer, the key and value are first combined with a set of learned position biases, the result of which is multiplied with the query in an element-wise fashion. This new operation has a memory complexity linear w.r.t. both the context size and the di… ▽ More

    Submitted 21 September, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

  19. arXiv:2105.08140  [pdf, other

    cs.LG

    Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning

    Authors: Yue Wu, Shuangfei Zhai, Nitish Srivastava, Joshua Susskind, Jian Zhang, Ruslan Salakhutdinov, Hanlin Goh

    Abstract: Offline Reinforcement Learning promises to learn effective policies from previously-collected, static datasets without the need for exploration. However, existing Q-learning and actor-critic based off-policy RL algorithms fail when bootstrapping from out-of-distribution (OOD) actions or states. We hypothesize that a key missing ingredient from the existing methods is a proper treatment of uncertai… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: To appear in ICML 2021

  20. arXiv:2007.04871  [pdf, other

    cs.LG eess.SP stat.ML

    Subject-Aware Contrastive Learning for Biosignals

    Authors: Joseph Y. Cheng, Hanlin Goh, Kaan Dogrusoz, Oncel Tuzel, Erdrin Azemi

    Abstract: Datasets for biosignals, such as electroencephalogram (EEG) and electrocardiogram (ECG), often have noisy labels and have limited number of subjects (<100). To handle these challenges, we propose a self-supervised approach based on contrastive learning to model biosignals with a reduced reliance on labeled data and with fewer subjects. In this regime of limited labels and subjects, intersubject va… ▽ More

    Submitted 30 June, 2020; originally announced July 2020.

  21. arXiv:2002.04764  [pdf, other

    cs.LG stat.ML

    Capsules with Inverted Dot-Product Attention Routing

    Authors: Yao-Hung Hubert Tsai, Nitish Srivastava, Hanlin Goh, Ruslan Salakhutdinov

    Abstract: We introduce a new routing algorithm for capsule networks, in which a child capsule is routed to a parent based only on agreement between the parent's state and the child's vote. The new mechanism 1) designs routing via inverted dot-product attention; 2) imposes Layer Normalization as normalization; and 3) replaces sequential iterative routing with concurrent iterative routing. When compared to pr… ▽ More

    Submitted 26 February, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

    Comments: ICLR 2020

  22. arXiv:1912.04212  [pdf, other

    stat.ML cs.LG eess.IV

    Solving Bayesian Inverse Problems via Variational Autoencoders

    Authors: Hwan Goh, Sheroze Sheriffdeen, Jonathan Wittmer, Tan Bui-Thanh

    Abstract: In recent years, the field of machine learning has made phenomenal progress in the pursuit of simulating real-world data generation processes. One notable example of such success is the variational autoencoder (VAE). In this work, with a small shift in perspective, we leverage and adapt VAEs for a different purpose: uncertainty quantification in scientific inverse problems. We introduce UQ-VAE: a… ▽ More

    Submitted 28 December, 2021; v1 submitted 5 December, 2019; originally announced December 2019.

    Journal ref: Proceedings of Machine Learning Research 145 (2021) 1-40

  23. arXiv:1912.03310  [pdf, other

    cs.LG cs.CV cs.GR cs.NE stat.ML

    Geometric Capsule Autoencoders for 3D Point Clouds

    Authors: Nitish Srivastava, Hanlin Goh, Ruslan Salakhutdinov

    Abstract: We propose a method to learn object representations from 3D point clouds using bundles of geometrically interpretable hidden units, which we call geometric capsules. Each geometric capsule represents a visual entity, such as an object or a part, and consists of two components: a pose and a feature. The pose encodes where the entity is, while the feature encodes what it is. We use these capsules to… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

  24. arXiv:1508.02496  [pdf, other

    cs.CV cs.IR

    A Practical Guide to CNNs and Fisher Vectors for Image Instance Retrieval

    Authors: Vijay Chandrasekhar, Jie Lin, Olivier Morère, Hanlin Goh, Antoine Veillard

    Abstract: With deep learning becoming the dominant approach in computer vision, the use of representations extracted from Convolutional Neural Nets (CNNs) is quickly gaining ground on Fisher Vectors (FVs) as favoured state-of-the-art global image descriptors for image instance retrieval. While the good performance of CNNs for image classification are unambiguously recognised, which of the two has the upper… ▽ More

    Submitted 25 August, 2015; v1 submitted 11 August, 2015; originally announced August 2015.

    Comments: Deep Convolutional Neural Networks for instance retrieval, Fisher Vectors, instance retrieval

  25. arXiv:1501.07738  [pdf, other

    cs.CV

    Co-Regularized Deep Representations for Video Summarization

    Authors: Olivier Morère, Hanlin Goh, Antoine Veillard, Vijay Chandrasekhar, Jie Lin

    Abstract: Compact keyframe-based video summaries are a popular way of generating viewership on video sharing platforms. Yet, creating relevant and compelling summaries for arbitrarily long videos with a small number of keyframes is a challenging task. We propose a comprehensive keyframe-based summarization framework combining deep convolutional neural networks and restricted Boltzmann machines. An original… ▽ More

    Submitted 30 January, 2015; originally announced January 2015.

    Comments: Video summarization, deep convolutional neural networks, co-regularized restricted Boltzmann machines

  26. arXiv:1501.04711  [pdf, other

    cs.CV cs.IR

    DeepHash: Getting Regularization, Depth and Fine-Tuning Right

    Authors: Jie Lin, Olivier Morere, Vijay Chandrasekhar, Antoine Veillard, Hanlin Goh

    Abstract: This work focuses on representing very high-dimensional global image descriptors using very compact 64-1024 bit binary hashes for instance retrieval. We propose DeepHash: a hashing scheme based on deep networks. Key to making DeepHash work at extremely low bitrates are three important considerations -- regularization, depth and fine-tuning -- each requiring solutions specific to the hashing proble… ▽ More

    Submitted 19 January, 2015; originally announced January 2015.