Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–43 of 43 results for author: Chan, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16866  [pdf, other

    cs.CV

    Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models

    Authors: Jierun Chen, Fangyun Wei, Jinjing Zhao, Sizhe Song, Bohuai Wu, Zhuoxuan Peng, S. -H. Gary Chan, Hongyang Zhang

    Abstract: Referring expression comprehension (REC) involves localizing a target instance based on a textual description. Recent advancements in REC have been driven by large multimodal models (LMMs) like CogVLM, which achieved 92.44% accuracy on RefCOCO. However, this study questions whether existing benchmarks such as RefCOCO, RefCOCO+, and RefCOCOg, capture LMMs' comprehensive capabilities. We begin with… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.15613  [pdf, other

    cs.LG cs.GR math.AT

    MOUNTAINEER: Topology-Driven Visual Analytics for Comparing Local Explanations

    Authors: Parikshit Solunke, Vitoria Guardieiro, Joao Rulff, Peter Xenopoulos, Gromit Yeuk-Yin Chan, Brian Barr, Luis Gustavo Nonato, Claudio Silva

    Abstract: With the increasing use of black-box Machine Learning (ML) techniques in critical applications, there is a growing demand for methods that can provide transparency and accountability for model predictions. As a result, a large number of local explainability methods for black-box models have been developed and popularized. However, machine learning explanations are still hard to evaluate and compar… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Author version of article accepted to IEEE Transactions on Visualization and Computer Graphics

  3. arXiv:2405.03218  [pdf, other

    cs.CV

    Elevator, Escalator or Neither? Classifying Pedestrian Conveyor State Using Inertial Navigation System

    Authors: Tianlang He, Zhiqiu Xia, S. -H. Gary Chan

    Abstract: Classifying a pedestrian in one of the three conveyor states of "elevator," "escalator" and "neither" is fundamental to many applications such as indoor localization and people flow analysis. We estimate, for the first time, the pedestrian conveyor state given the inertial navigation system (INS) readings of accelerometer, gyroscope and magnetometer sampled from the phone. Our problem is challengi… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  4. Epigraphics: Message-Driven Infographics Authoring

    Authors: Tongyu Zhou, Jeff Huang, Gromit Yeuk-Yin Chan

    Abstract: The message a designer wants to convey plays a pivotal role in directing the design of an infographic, yet most authoring workflows start with creating the visualizations or graphics first without gauging whether they fit the message. To address this gap, we propose Epigraphics, a web-based authoring system that treats an "epigraph" as the first-class object, and uses it to guide infographic asset… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 18 pages, 9 figures

  5. arXiv:2403.09124  [pdf, other

    cs.CV

    Single Domain Generalization for Crowd Counting

    Authors: Zhuoxuan Peng, S. -H. Gary Chan

    Abstract: Due to its promising results, density map regression has been widely employed for image-based crowd counting. The approach, however, often suffers from severe performance degradation when tested on data from unseen scenarios, the so-called "domain shift" problem. To address the problem, we investigate in this work single domain generalization (SDG) for crowd counting. The existing SDG approaches a… ▽ More

    Submitted 5 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024

  6. arXiv:2402.12335  [pdf, other

    physics.chem-ph cs.LG

    Image Super-resolution Inspired Electron Density Prediction

    Authors: Chenghan Li, Or Sharir, Shunyue Yuan, Garnet K. Chan

    Abstract: Drawing inspiration from the domain of image super-resolution, we view the electron density as a 3D grayscale image and use a convolutional residual network to transform a crude and trivially generated guess of the molecular density into an accurate ground-state quantum mechanical density. We find that this model outperforms all prior density prediction approaches. Because the input is itself a re… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  7. arXiv:2312.13223  [pdf, other

    cs.CV

    StableKD: Breaking Inter-block Optimization Entanglement for Stable Knowledge Distillation

    Authors: Shiu-hong Kao, Jierun Chen, S. H. Gary Chan

    Abstract: Knowledge distillation (KD) has been recognized as an effective tool to compress and accelerate models. However, current KD approaches generally suffer from an accuracy drop and/or an excruciatingly long distillation process. In this paper, we tackle the issue by first providing a new insight into a phenomenon that we call the Inter-Block Optimization Entanglement (IBOE), which makes the conventio… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  8. arXiv:2312.00540  [pdf, other

    cs.LG cs.AI stat.ML

    Target-agnostic Source-free Domain Adaptation for Regression Tasks

    Authors: Tianlang He, Zhiqiu Xia, Jierun Chen, Haoliang Li, S. -H. Gary Chan

    Abstract: Unsupervised domain adaptation (UDA) seeks to bridge the domain gap between the target and source using unlabeled target data. Source-free UDA removes the requirement for labeled source data at the target to preserve data privacy and storage. However, work on source-free UDA assumes knowledge of domain gap distribution, and hence is limited to either target-aware or classification task. To overcom… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: Accepted by ICDE 2024

  9. arXiv:2310.11959  [pdf, other

    cs.LG cs.AI

    A Multi-Scale Decomposition MLP-Mixer for Time Series Analysis

    Authors: Shuhan Zhong, Sizhe Song, Weipeng Zhuo, Guanyao Li, Yang Liu, S. -H. Gary Chan

    Abstract: Time series data, including univariate and multivariate ones, are characterized by unique composition and complex multi-scale temporal variations. They often require special consideration of decomposition and multi-scale modeling to analyze. Existing deep learning methods on this best fit to univariate time series only, and have not sufficiently considered sub-series modeling and decomposition com… ▽ More

    Submitted 24 March, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted for VLDB 2024

  10. arXiv:2309.17336  [pdf, other

    cs.CV cs.RO

    Robust 3D Object Detection from LiDAR-Radar Point Clouds via Cross-Modal Feature Augmentation

    Authors: Jianning Deng, Gabriel Chan, Hantao Zhong, Chris Xiaoxuan Lu

    Abstract: This paper presents a novel framework for robust 3D object detection from point clouds via cross-modal hallucination. Our proposed approach is agnostic to either hallucination direction between LiDAR and 4D radar. We introduce multiple alignments on both spatial and feature levels to achieve simultaneous backbone refinement and hallucination generation. Specifically, spatial alignment is proposed… ▽ More

    Submitted 12 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted to ICRA 2024. 8 pages, 4 figures. Equal contribution for Gabriel Chan and Hantao Zhong, listed randomly

  11. arXiv:2307.12987  [pdf, other

    cs.MA

    Efficient Behavior-consistent Calibration for Multi-agent Market Simulation

    Authors: Tianlang He, Keyan Lu, Chang Xu, Yang Liu, Weiqing Liu, S. -H. Gary Chan, Jiang Bian

    Abstract: Order-driven market simulation mimics the trader behaviors to generate order streams to support interactive studies of financial strategies. In market simulator, the multi-agent approach is commonly adopted due to its explainability. Existing multi-agent systems employ heuristic search to generate order streams, which is inefficient for large-scale simulation. Furthermore, the search-based behavio… ▽ More

    Submitted 5 June, 2023; originally announced July 2023.

  12. arXiv:2307.12219  [pdf, other

    cs.LG

    Improving Out-of-Distribution Robustness of Classifiers via Generative Interpolation

    Authors: Haoyue Bai, Ceyuan Yang, Yinghao Xu, S. -H. Gary Chan, Bolei Zhou

    Abstract: Deep neural networks achieve superior performance for learning from independent and identically distributed (i.i.d.) data. However, their performance deteriorates significantly when handling out-of-distribution (OoD) data, where the training and test are drawn from different distributions. In this paper, we explore utilizing the generative models as a data augmentation source for improving out-of-… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

  13. arXiv:2307.05914  [pdf, other

    cs.NI cs.LG eess.SP

    FIS-ONE: Floor Identification System with One Label for Crowdsourced RF Signals

    Authors: Weipeng Zhuo, Ka Ho Chiu, Jierun Chen, Ziqi Zhao, S. -H. Gary Chan, Sangtae Ha, Chul-Ho Lee

    Abstract: Floor labels of crowdsourced RF signals are crucial for many smart-city applications, such as multi-floor indoor localization, geofencing, and robot surveillance. To build a prediction model to identify the floor number of a new RF signal upon its measurement, conventional approaches using the crowdsourced RF signals assume that at least few labeled signal samples are available on each floor. In t… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Accepted by IEEE ICDCS 2023

  14. arXiv:2304.09182  [pdf, other

    cs.LG cs.AI

    A Deep Learning Framework for Traffic Data Imputation Considering Spatiotemporal Dependencies

    Authors: Li Jiang, Ting Zhang, Qiruyi Zuo, Chenyu Tian, George P. Chan, Wai Kin, Chan

    Abstract: Spatiotemporal (ST) data collected by sensors can be represented as multi-variate time series, which is a sequence of data points listed in an order of time. Despite the vast amount of useful information, the ST data usually suffer from the issue of missing or incomplete data, which also limits its applications. Imputation is one viable solution and is often used to prepossess the data for further… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: accepted at ICITE 2022

  15. arXiv:2303.11230  [pdf, other

    cs.SI cs.LG stat.ML

    Fitting Low-rank Models on Egocentrically Sampled Partial Networks

    Authors: Angus Chan, Tianxi Li

    Abstract: The statistical modeling of random networks has been widely used to uncover interaction mechanisms in complex systems and to predict unobserved links in real-world networks. In many applications, network connections are collected via egocentric sampling: a subset of nodes is sampled first, after which all links involving this subset are recorded; all other information is missing. Compared with the… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

  16. arXiv:2303.03667  [pdf, other

    cs.CV

    Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks

    Authors: Jierun Chen, Shiu-hong Kao, Hao He, Weipeng Zhuo, Song Wen, Chul-Ho Lee, S. -H. Gary Chan

    Abstract: To design fast neural networks, many works have been focusing on reducing the number of floating-point operations (FLOPs). We observe that such reduction in FLOPs, however, does not necessarily lead to a similar level of reduction in latency. This mainly stems from inefficiently low floating-point operations per second (FLOPS). To achieve faster networks, we revisit popular operators and demonstra… ▽ More

    Submitted 21 May, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023

  17. arXiv:2302.12324  [pdf, other

    cs.CL

    Summaries as Captions: Generating Figure Captions for Scientific Documents with Automated Text Summarization

    Authors: Chieh-Yang Huang, Ting-Yao Hsu, Ryan Rossi, Ani Nenkova, Sungchul Kim, Gromit Yeuk-Yin Chan, Eunyee Koh, Clyde Lee Giles, Ting-Hao 'Kenneth' Huang

    Abstract: Good figure captions help paper readers understand complex scientific figures. Unfortunately, even published papers often have poorly written captions. Automatic caption generation could aid paper writers by providing good starting captions that can be refined for better quality. Prior work often treated figure caption generation as a vision-to-language task. In this paper, we show that it can be… ▽ More

    Submitted 11 August, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted by INLG-2023

  18. arXiv:2212.14077  [pdf, other

    cs.LG cs.DM cs.SI

    A Hypergraph Neural Network Framework for Learning Hyperedge-Dependent Node Embeddings

    Authors: Ryan Aponte, Ryan A. Rossi, Shunan Guo, Jane Hoffswell, Nedim Lipka, Chang Xiao, Gromit Chan, Eunyee Koh, Nesreen Ahmed

    Abstract: In this work, we introduce a hypergraph representation learning framework called Hypergraph Neural Networks (HNN) that jointly learns hyperedge embeddings along with a set of hyperedge-dependent embeddings for each node in the hypergraph. HNN derives multiple embeddings per node in the hypergraph where each embedding for a node is dependent on a specific hyperedge of that node. Notably, HNN is acc… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

  19. arXiv:2212.12040  [pdf, ps, other

    cs.SI cs.LG

    Graph Learning with Localized Neighborhood Fairness

    Authors: April Chen, Ryan Rossi, Nedim Lipka, Jane Hoffswell, Gromit Chan, Shunan Guo, Eunyee Koh, Sungchul Kim, Nesreen K. Ahmed

    Abstract: Learning fair graph representations for downstream applications is becoming increasingly important, but existing work has mostly focused on improving fairness at the global level by either modifying the graph structure or objective function without taking into account the local neighborhood of a node. In this work, we formally introduce the notion of neighborhood fairness and develop a computation… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

  20. arXiv:2212.11296  [pdf, other

    quant-ph cs.LG cs.NE

    Towards Neural Variational Monte Carlo That Scales Linearly with System Size

    Authors: Or Sharir, Garnet Kin-Lic Chan, Anima Anandkumar

    Abstract: Quantum many-body problems are some of the most challenging problems in science and are central to demystifying some exotic quantum phenomena, e.g., high-temperature superconductors. The combination of neural networks (NN) for representing quantum states, coupled with the Variational Monte Carlo (VMC) algorithm, has been shown to be a promising method for solving such problems. However, the run-ti… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: Appeared on NeurIPS 2022 AI for Science Workshop (a non-archival poster presentation)

  21. arXiv:2210.07895  [pdf, other

    cs.NI

    GRAFICS: Graph Embedding-based Floor Identification Using Crowdsourced RF Signals

    Authors: Weipeng Zhuo, Ziqi Zhao, Ka Ho Chiu, Shiju Li, Sangtae Ha, Chul-Ho Lee, S. -H. Gary Chan

    Abstract: We study the problem of floor identification for radiofrequency (RF) signal samples obtained in a crowdsourced manner, where the signal samples are highly heterogeneous and most samples lack their floor labels. We propose GRAFICS, a graph embedding-based floor identification system. GRAFICS first builds a highly versatile bipartite graph model, having APs on one side and signal samples on the othe… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted by IEEE ICDCS 2022

  22. arXiv:2210.07889  [pdf, other

    cs.NI

    Semi-supervised Learning with Network Embedding on Ambient RF Signals for Geofencing Services

    Authors: Weipeng Zhuo, Ka Ho Chiu, Jierun Chen, Jiajie Tan, Edmund Sumpena, S. -H. Gary Chan, Sangtae Ha, Chul-Ho Lee

    Abstract: In applications such as elderly care, dementia anti-wandering and pandemic control, it is important to ensure that people are within a predefined area for their safety and well-being. We propose GEM, a practical, semi-supervised Geofencing system with network EMbedding, which is based only on ambient radio frequency (RF) signals. GEM models measured RF signal records as a weighted bipartite graph.… ▽ More

    Submitted 8 March, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: A conference version of this paper will appear in IEEE ICDE 2023

  23. arXiv:2205.00819  [pdf, other

    cs.CY cs.AI

    A Novel Approach to Fairness in Automated Decision-Making using Affective Normalization

    Authors: Jesse Hoey, Gabrielle Chan

    Abstract: Any decision, such as one about who to hire, involves two components. First, a rational component, i.e., they have a good education, they speak clearly. Second, an affective component, based on observables such as visual features of race and gender, and possibly biased by stereotypes. Here we propose a method for measuring the affective, socially biased, component, thus enabling its removal. That… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

  24. arXiv:2203.10489  [pdf, other

    cs.CV

    TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing

    Authors: Jierun Chen, Tianlang He, Weipeng Zhuo, Li Ma, Sangtae Ha, S. -H. Gary Chan

    Abstract: As convolution has empowered many smart applications, dynamic convolution further equips it with the ability to adapt to diverse inputs. However, the static and dynamic convolutions are either layout-agnostic or computation-heavy, making it inappropriate for layout-specific applications, e.g., face recognition and medical image segmentation. We observe that these applications naturally exhibit the… ▽ More

    Submitted 22 March, 2022; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  25. arXiv:2201.03817  [pdf, other

    cs.HC

    Tackling Multipath and Biased Training Data for IMU-Assisted BLE Proximity Detection

    Authors: Tianlang He, Jiajie Tan, Weipeng Zhuo, Maximilian Printz, S. -H. Gary Chan

    Abstract: Proximity detection is to determine whether an IoT receiver is within a certain distance from a signal transmitter. Due to its low cost and high popularity, Bluetooth low energy (BLE) has been used to detect proximity based on the received signal strength indicator (RSSI). To address the fact that RSSI can be markedly influenced by device carriage states, previous works have incorporated RSSI with… ▽ More

    Submitted 11 January, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

  26. arXiv:2201.02155  [pdf, other

    cs.LG

    Topological Representations of Local Explanations

    Authors: Peter Xenopoulos, Gromit Chan, Harish Doraiswamy, Luis Gustavo Nonato, Brian Barr, Claudio Silva

    Abstract: Local explainability methods -- those which seek to generate an explanation for each prediction -- are becoming increasingly prevalent due to the need for practitioners to rationalize their model outputs. However, comparing local explainability methods is difficult since they each generate outputs in various scales and dimensions. Furthermore, due to the stochastic nature of some explainability me… ▽ More

    Submitted 6 January, 2022; originally announced January 2022.

  27. arXiv:2201.00008  [pdf, other

    cs.LG cs.AI

    A Lightweight and Accurate Spatial-Temporal Transformer for Traffic Forecasting

    Authors: Guanyao Li, Shuhan Zhong, S. -H. Gary Chan, Ruiyuan Li, Chih-Chieh Hung, Wen-Chih Peng

    Abstract: We study the forecasting problem for traffic with dynamic, possibly periodical, and joint spatial-temporal dependency between regions. Given the aggregated inflow and outflow traffic of regions in a city from time slots 0 to t-1, we predict the traffic at time t at any region. Prior arts in the area often consider the spatial and temporal dependencies in a decoupled manner or are rather computatio… ▽ More

    Submitted 3 May, 2022; v1 submitted 30 December, 2021; originally announced January 2022.

  28. arXiv:2109.02038  [pdf, other

    cs.LG

    NAS-OoD: Neural Architecture Search for Out-of-Distribution Generalization

    Authors: Haoyue Bai, Fengwei Zhou, Lanqing Hong, Nanyang Ye, S. -H. Gary Chan, Zhenguo Li

    Abstract: Recent advances on Out-of-Distribution (OoD) generalization reveal the robustness of deep learning models against distribution shifts. However, existing works focus on OoD algorithms, such as invariant risk minimization, domain generalization, or stable learning, without considering the influence of deep model architectures on OoD generalization, which may lead to sub-optimal performance. Neural A… ▽ More

    Submitted 5 September, 2021; originally announced September 2021.

    Comments: Accepted by ICCV2021

  29. arXiv:2106.05850  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Matrix Completion with Model-free Weighting

    Authors: Jiayi Wang, Raymond K. W. Wong, Xiaojun Mao, Kwun Chuen Gary Chan

    Abstract: In this paper, we propose a novel method for matrix completion under general non-uniform missing structures. By controlling an upper bound of a novel balancing error, we construct weights that can actively adjust for the non-uniformity in the empirical risk without explicitly modeling the observation probabilities, and can be computed efficiently via convex optimization. The recovered matrix based… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021

  30. arXiv:2105.09684  [pdf, other

    cs.CV

    Crowd Counting by Self-supervised Transfer Colorization Learning and Global Prior Classification

    Authors: Haoyue Bai, Song Wen, S. -H. Gary Chan

    Abstract: Labeled crowd scene images are expensive and scarce. To significantly reduce the requirement of the labeled images, we propose ColorCount, a novel CNN-based approach by combining self-supervised transfer colorization learning and global prior classification to leverage the abundantly available unlabeled data. The self-supervised colorization branch learns the semantics and surface texture of the i… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

  31. arXiv:2104.13946  [pdf, other

    cs.CV

    Motion-guided Non-local Spatial-Temporal Network for Video Crowd Counting

    Authors: Haoyue Bai, S. -H. Gary Chan

    Abstract: We study video crowd counting, which is to estimate the number of objects (people in this paper) in all the frames of a video sequence. Previous work on crowd counting is mostly on still images. There has been little work on how to properly extract and take advantage of the spatial-temporal correlation between neighboring frames in both short and long ranges to achieve high estimation accuracy for… ▽ More

    Submitted 28 April, 2021; originally announced April 2021.

  32. arXiv:2101.04442  [pdf, other

    cs.CV eess.IV

    Joint Demosaicking and Denoising in the Wild: The Case of Training Under Ground Truth Uncertainty

    Authors: Jierun Chen, Song Wen, S. -H. Gary Chan

    Abstract: Image demosaicking and denoising are the two key fundamental steps in digital camera pipelines, aiming to reconstruct clean color images from noisy luminance readings. In this paper, we propose and study Wild-JDD, a novel learning framework for joint demosaicking and denoising in the wild. In contrast to previous works which generally assume the ground truth of training data is a perfect reflectio… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: Accepted by AAAI2021

  33. arXiv:2012.15685  [pdf, other

    cs.CV

    A Survey on Deep Learning-based Single Image Crowd Counting: Network Design, Loss Function and Supervisory Signal

    Authors: Haoyue Bai, Jiageng Mao, S. -H. Gary Chan

    Abstract: Single image crowd counting is a challenging computer vision problem with wide applications in public safety, city planning, traffic management, etc. With the recent development of deep learning techniques, crowd counting has aroused much attention and achieved great success in recent years. This survey is to provide a comprehensive summary of recent advances on deep learning-based crowd counting… ▽ More

    Submitted 11 July, 2022; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: Neurocomputing minor revision. Project page is at https://github.com/HaoyueBaiZJU/A-Recent-Systematic-Survey-for-Crowd-Counting

  34. arXiv:2012.09382  [pdf, other

    cs.LG

    DecAug: Out-of-Distribution Generalization via Decomposed Feature Representation and Semantic Augmentation

    Authors: Haoyue Bai, Rui Sun, Lanqing Hong, Fengwei Zhou, Nanyang Ye, Han-Jia Ye, S. -H. Gary Chan, Zhenguo Li

    Abstract: While deep learning demonstrates its strong ability to handle independent and identically distributed (IID) data, it often suffers from out-of-distribution (OoD) generalization, where the test data come from another distribution (w.r.t. the training one). Designing a general OoD generalization framework to a wide range of applications is challenging, mainly due to possible correlation shift and di… ▽ More

    Submitted 16 December, 2020; originally announced December 2020.

    Comments: Accepted by AAAI2021

  35. arXiv:2009.05944  [pdf, other

    cs.CR

    vContact: Private WiFi-based Contact Tracing with Virus Lifespan

    Authors: Guanyao Li, Siyan Hu, Shuhan Zhong, Wai Lun Tsui, S. -H. Gary Chan

    Abstract: Covid-19 is primarily spread through contact with the virus which may survive on surfaces with lifespan of more than hours. To curb its spread, it is hence of vital importance to detect and quarantine those who have been in contact with the virus for sustained period of time, the so-called close contacts. In this work, we study, for the first time, automatic contact detection when the virus has a… ▽ More

    Submitted 26 January, 2021; v1 submitted 13 September, 2020; originally announced September 2020.

  36. arXiv:2007.10614  [pdf, other

    cs.HC

    Melody: Generating and Visualizing Machine Learning Model Summary to Understand Data and Classifiers Together

    Authors: Gromit Yeuk-Yin Chan, Enrico Bertini, Luis Gustavo Nonato, Brian Barr, Claudio T. Silva

    Abstract: With the increasing sophistication of machine learning models, there are growing trends of developing model explanation techniques that focus on only one instance (local explanation) to ensure faithfulness to the original model. While these techniques provide accurate model interpretability on various data primitive (e.g., tabular, image, or text), a holistic Explainable Artificial Intelligence (X… ▽ More

    Submitted 21 July, 2020; originally announced July 2020.

  37. arXiv:2007.10609  [pdf, other

    cs.HC

    SUBPLEX: Towards a Better Understanding of Black Box Model Explanations at the Subpopulation Level

    Authors: Jun Yuan, Gromit Yeuk-Yin Chan, Brian Barr, Kyle Overton, Kim Rees, Luis Gustavo Nonato, Enrico Bertini, Claudio T. Silva

    Abstract: Understanding the interpretation of machine learning (ML) models has been of paramount importance when making decisions with societal impacts such as transport control, financial activities, and medical diagnosis. While current model interpretation methodologies focus on using locally linear functions to approximate the models or creating self-explanatory models that give explanations to each inpu… ▽ More

    Submitted 5 May, 2024; v1 submitted 21 July, 2020; originally announced July 2020.

  38. arXiv:1909.03839  [pdf, other

    cs.CV

    Crowd Counting on Images with Scale Variation and Isolated Clusters

    Authors: Haoyue Bai, Song Wen, S. -H. Gary Chan

    Abstract: Crowd counting is to estimate the number of objects (e.g., people or vehicles) in an image of unconstrained congested scenes. Designing a general crowd counting algorithm applicable to a wide range of crowd images is challenging, mainly due to the possibly large variation in object scales and the presence of many isolated small clusters. Previous approaches based on convolution operations with mul… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: Accepted at International Conference on Computer Vision (ICCV) 2019 Workshop

  39. arXiv:1907.09146  [pdf, other

    cs.GR cs.HC

    Motion Browser: Visualizing and Understanding Complex Upper Limb Movement Under Obstetrical Brachial Plexus Injuries

    Authors: Gromit Yeuk-Yin Chan, Luis Gustavo Nonato, Alice Chu, Preeti Raghavan, Viswanath Aluru, Claudio T. Silva

    Abstract: The brachial plexus is a complex network of peripheral nerves that enables sensing from and control of the movements of the arms and hand. Nowadays, the coordination between the muscles to generate simple movements is still not well understood, hindering the knowledge of how to best treat patients with this type of peripheral nerve injury. To acquire enough information for medical data analysis, p… ▽ More

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: IEEE Transactions on Visualization and Computer Graphics (VAST 2019, to appear)

  40. arXiv:1903.02082  [pdf, other

    cs.NE cs.LG stat.ML

    DA-LSTM: A Long Short-Term Memory with Depth Adaptive to Non-uniform Information Flow in Sequential Data

    Authors: Yifeng Zhang, Ka-Ho Chow, S. -H. Gary Chan

    Abstract: Much sequential data exhibits highly non-uniform information distribution. This cannot be correctly modeled by traditional Long Short-Term Memory (LSTM). To address that, recent works have extended LSTM by adding more activations between adjacent inputs. However, the approaches often use a fixed depth, which is at the step of the most information content. This one-size-fits-all worst-case approach… ▽ More

    Submitted 18 January, 2019; originally announced March 2019.

  41. arXiv:1811.08069  [pdf, other

    cs.LG stat.ML

    Representation Learning of Pedestrian Trajectories Using Actor-Critic Sequence-to-Sequence Autoencoder

    Authors: Ka-Ho Chow, Anish Hiranandani, Yifeng Zhang, S. -H. Gary Chan

    Abstract: Representation learning of pedestrian trajectories transforms variable-length timestamp-coordinate tuples of a trajectory into a fixed-length vector representation that summarizes spatiotemporal characteristics. It is a crucial technique to connect feature-based data mining with trajectory data. Trajectory representation is a challenging problem, because both environmental constraints (e.g., wall… ▽ More

    Submitted 19 November, 2018; originally announced November 2018.

  42. arXiv:1705.06463  [pdf, other

    cs.CL

    Universal Dependencies Parsing for Colloquial Singaporean English

    Authors: Hongmin Wang, Yue Zhang, GuangYong Leonard Chan, Jie Yang, Hai Leong Chieu

    Abstract: Singlish can be interesting to the ACL community both linguistically as a major creole based on English, and computationally for information extraction and sentiment analysis of regional social media. We investigate dependency parsing of Singlish by constructing a dependency treebank under the Universal Dependencies scheme, and then training a neural network model by integrating English syntactic… ▽ More

    Submitted 18 May, 2017; originally announced May 2017.

    Comments: Accepted by ACL 2017

  43. arXiv:1211.4767  [pdf, ps, other

    cs.MM

    Collaborative P2P Streaming of Interactive Live Free Viewpoint Video

    Authors: Dongni Ren, S. -H. Gary Chan, Gene Cheung, Vicky Zhao, Pascal Frossard

    Abstract: We study an interactive live streaming scenario where multiple peers pull streams of the same free viewpoint video that are synchronized in time but not necessarily in view. In free viewpoint video, each user can periodically select a virtual view between two anchor camera views for display. The virtual view is synthesized using texture and depth videos of the anchor views via depth-image-based re… ▽ More

    Submitted 20 November, 2012; originally announced November 2012.