Showing 1–50 of 194 results for author: Wong, C

Searching in archive cs.

  1. arXiv:2406.15991  [pdf, other]

    cs.CY cs.HC

    TikTok Engagement Traces Over Time and Health Risky Behaviors: Combining Data Linkage and Computational Methods

    Authors: Xinyan Zhao, Chau-Wai Wong

    Abstract: Digital technologies and social algorithms are revolutionizing the media landscape, altering how we select and consume health information. Extending the selectivity paradigm with research on social media engagement, the convergence perspective, and algorithmic impact, this study investigates how individuals' liked TikTok videos on various health-risk topics are associated with their vaping and dri…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 12 pages. Under review

  2. arXiv:2406.09630  [pdf, other]

    cs.CV cs.LG

    Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition

    Authors: Mehreen Saeed, Adrian Chan, Anupam Mijar, Joseph Moukarzel, Georges Habchi, Carlos Younes, Amin Elias, Chau-Wai Wong, Akram Khater

    Abstract: We present the Manuscripts of Handwritten Arabic (Muharaf) dataset, which is a machine learning dataset consisting of more than 1,600 historic handwritten page images transcribed by experts in archival Arabic. Each document image is accompanied by spatial polygonal coordinates of its text lines as well as basic page elements. This dataset was compiled to advance the state of the art in handwritten…

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2406.08411  [pdf, other]

    cs.CL cs.AI cs.HC

    Tailoring Generative AI Chatbots for Multiethnic Communities in Disaster Preparedness Communication: Extending the CASA Paradigm

    Authors: Xinyan Zhao, Yuan Sun, Wenlin Liu, Chau-Wai Wong

    Abstract: This study is among the first to develop different prototypes of generative AI (GenAI) chatbots powered by GPT 4 to communicate hurricane preparedness information to diverse residents. Drawing from the Computers Are Social Actors (CASA) paradigm and the literature on disaster vulnerability and cultural tailoring, this study conducted a between-subjects experiment with 441 Black, Hispanic, and Cauc…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 21 pages

    MSC Class: 68U15

  4. arXiv:2405.20429  [pdf, other]

    cs.DB

    Quantum Preference Query

    Authors: Hao Liu, Xiaotian You, Raymond Chi-Wing Wong

    Abstract: Given a large dataset of many tuples, it is hard for users to pick out their preferred tuples. Thus, the preference query problem, which is to find the most preferred tuples from a dataset, is widely discussed in the database area. In this problem, a utility function is given by the user to evaluate to what extent the user prefers a tuple. However, considering a dataset consisting of N tuples, the…

    Submitted 30 May, 2024; originally announced May 2024.

  5. arXiv:2405.20416  [pdf, other]

    cs.DB

    First Tree-like Quantum Data Structure: Quantum B+ Tree

    Authors: Hao Liu, Xiaotian You, Raymond Chi-Wing Wong

    Abstract: Quantum computing is a popular topic in computer science, which has recently attracted many studies in various areas such as machine learning and network. However, the topic of quantum data structures seems neglected. There is an open problem in the database area: Can we improve existing data structures by quantum techniques? Consider a dataset of key-record pairs. Given an interval as a query ran…

    Submitted 30 May, 2024; originally announced May 2024.

  6. arXiv:2405.17940  [pdf, other]

    cs.RO cs.AI

    World Models for General Surgical Grasping

    Authors: Hongbin Lin, Bin Li, Chun Wai Wong, Juan Rojas, Xiangyu Chu, Kwok Wai Samuel Au

    Abstract: Intelligent vision control systems for surgical robots should adapt to unknown and diverse objects while being robust to system disturbances. Previous methods did not meet these requirements due to mainly relying on pose estimation and feature tracking. We propose a world-model-based deep reinforcement learning framework "Grasp Anything for Surgery" (GAS), that learns a pixel-level visuomotor poli…

    Submitted 28 May, 2024; originally announced May 2024.

    Journal ref: Robotics: Science and Systems 2024

  7. arXiv:2405.05241  [pdf, other]

    cs.CV cs.LG

    BenthicNet: A global compilation of seafloor images for deep learning applications

    Authors: Scott C. Lowe, Benjamin Misiuk, Isaac Xu, Shakhboz Abdulazizov, Amit R. Baroi, Alex C. Bastos, Merlin Best, Vicki Ferrini, Ariell Friedman, Deborah Hart, Ove Hoegh-Guldberg, Daniel Ierodiaconou, Julia Mackin-McLaughlin, Kathryn Markey, Pedro S. Menandro, Jacquomo Monk, Shreya Nemani, John O'Brien, Elizabeth Oh, Luba Y. Reshitnyk, Katleen Robert, Chris M. Roelfsema, Jessica A. Sameoto, Alexandre C. G. Schimel, Jordan A. Thomson, et al. (4 additional authors not shown)

    Abstract: Advances in underwater imaging enable the collection of extensive seafloor image datasets that are necessary for monitoring important benthic ecosystems. The ability to collect seafloor imagery has outpaced our capacity to analyze it, hindering expedient mobilization of this crucial environmental information. Recent machine learning approaches provide opportunities to increase the efficiency with…

    Submitted 8 May, 2024; originally announced May 2024.

  8. arXiv:2404.13135  [pdf, other]

    cs.RO

    Hybrid Continuum-Eversion Robot: Precise Navigation and Decontamination in Nuclear Environments using Vine Robot

    Authors: Mohammed Al-Dubooni, Cuebong Wong, Kaspar Althoefer

    Abstract: Soft growing vine robots show great potential for navigation and decontamination tasks in the nuclear industry. This paper introduces a novel hybrid continuum-eversion robot designed to address certain challenges in relation to navigating and operating within pipe networks and enclosed remote vessels. The hybrid robot combines the flexibility of a soft eversion robot with the precision of a contin…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 7 pages, 8 figures, conference

  9. arXiv:2404.11816  [pdf, other]

    cs.LG

    Tailoring Generative Adversarial Networks for Smooth Airfoil Design

    Authors: Joyjit Chattoraj, Jian Cheng Wong, Zhang Zexuan, Manna Dai, Xia Yingzhi, Li Jichao, Xu Xinxing, Ooi Chin Chun, Yang Feng, Dao My Ha, Liu Yong

    Abstract: In the realm of aerospace design, achieving smooth curves is paramount, particularly when crafting objects such as airfoils. Generative Adversarial Network (GAN), a widely employed generative AI technique, has proven instrumental in synthesizing airfoil designs. However, a common limitation of GAN is the inherent lack of smoothness in the generated airfoil surfaces. To address this issue, we prese…

    Submitted 17 April, 2024; originally announced April 2024.

  10. arXiv:2404.10194  [pdf, other]

    cs.SE cs.HC

    Impostor Syndrome in Final Year Computer Science Students: An Eye Tracking and Biometrics Study

    Authors: Alyssia Chen, Carol Wong, Katy Tarrit, Anthony Peruma

    Abstract: Imposter syndrome is a psychological phenomenon that affects individuals who doubt their skills and abilities, despite possessing the necessary competencies. This can lead to a lack of confidence and poor performance. While research has explored the impacts of imposter syndrome on students and professionals in various fields, there is limited knowledge on how it affects code comprehension in softw…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted at: 18th International Conference, AC 2024, Held as Part of the 26th HCI International Conference, HCII 2024

  11. arXiv:2404.07135  [pdf, other]

    cs.CL cs.AI

    Towards Robustness of Text-to-Visualization Translation against Lexical and Phrasal Variability

    Authors: Jinwei Lu, Yuanfeng Song, Haodi Zhang, Chen Zhang, Raymond Chi-Wing Wong

    Abstract: Text-to-Vis is an emerging task in the natural language processing (NLP) area that aims to automatically generate data visualizations from natural language questions (NLQs). Despite their progress, existing text-to-vis models often heavily rely on lexical matching between words in the questions and tokens in data schemas. This overreliance on lexical matching may lead to a diminished level of mode…

    Submitted 11 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  12. arXiv:2404.00032  [pdf, other]

    cs.HC cs.CV eess.IV

    Deployment of Deep Learning Model in Real World Clinical Setting: A Case Study in Obstetric Ultrasound

    Authors: Chun Kit Wong, Mary Ngo, Manxi Lin, Zahra Bashir, Amihai Heen, Morten Bo Søndergaard Svendsen, Martin Grønnebæk Tolsgaard, Anders Nymark Christensen, Aasa Feragen

    Abstract: Despite the rapid development of AI models in medical image analysis, their validation in real-world clinical settings remains limited. To address this, we introduce a generic framework designed for deploying image-based AI models in such settings. Using this framework, we deployed a trained model for fetal ultrasound standard plane detection, and evaluated it in real-time sessions with both novic…

    Submitted 22 March, 2024; originally announced April 2024.

    Comments: 10 pages

  13. arXiv:2403.08002  [pdf, other]

    cs.CL cs.CV

    Towards a clinically accessible radiology foundation model: open-access and lightweight, with automated evaluation

    Authors: Juan Manuel Zambrano Chaves, Shih-Cheng Huang, Yanbo Xu, Hanwen Xu, Naoto Usuyama, Sheng Zhang, Fei Wang, Yujia Xie, Mahmoud Khademi, Ziyi Yang, Hany Awadalla, Julia Gong, Houdong Hu, Jianwei Yang, Chunyuan Li, Jianfeng Gao, Yu Gu, Cliff Wong, Mu Wei, Tristan Naumann, Muhao Chen, Matthew P. Lungren, Akshay Chaudhari, Serena Yeung-Levy, Curtis P. Langlotz, et al. (2 additional authors not shown)

    Abstract: The scaling laws and extraordinary performance of large foundation models motivate the development and utilization of such models in biomedicine. However, despite early promising results on some biomedical benchmarks, there are still major challenges that need to be addressed before these models can be used in real-world clinics. Frontier general-domain models such as GPT-4V still have significant…

    Submitted 26 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  14. arXiv:2403.00192  [pdf, other]

    cs.IT

    Block-MDS QC-LDPC Codes for Information Reconciliation in Key Distribution

    Authors: Lev Tauz, Debarnab Mitra, Jayanth Shreekumar, Murat Can Sarihan, Chee Wei Wong, Lara Dolecek

    Abstract: Quantum key distribution (QKD) is a popular protocol that provides information theoretically secure keys to multiple parties. Two important post-processing steps of QKD are 1) the information reconciliation (IR) step, where parties reconcile mismatches in generated keys through classical communication, and 2) the privacy amplification (PA) step, where parties distill their common key into a new se…

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 7 pages, 1 figure, submitted to the International Symposium on Information Theory (ISIT) 2024

  15. arXiv:2402.08294  [pdf, other]

    cs.CV

    Learning semantic image quality for fetal ultrasound from noisy ranking annotation

    Authors: Manxi Lin, Jakob Ambsdorf, Emilie Pi Fogtmann Sejer, Zahra Bashir, Chun Kit Wong, Paraskevas Pegios, Alberto Raheli, Morten Bo Søndergaard Svendsen, Mads Nielsen, Martin Grønnebæk Tolsgaard, Anders Nymark Christensen, Aasa Feragen

    Abstract: We introduce the notion of semantic image quality for applications where image quality relies on semantic requirements. Working in fetal ultrasound, where ranking is challenging and annotations are noisy, we design a robust coarse-to-fine model that ranks images based on their semantic image quality and endow our predicted rankings with an uncertainty estimate. To annotate rankings on training dat…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Extended version of the accepted paper at ISBI 2024

  16. arXiv:2401.16247  [pdf, other]

    cs.CL cs.CY

    Towards Red Teaming in Multimodal and Multilingual Translation

    Authors: Christophe Ropers, David Dale, Prangthip Hansanti, Gabriel Mejia Gonzalez, Ivan Evtimov, Corinne Wong, Christophe Touret, Kristina Pereyra, Seohyun Sonia Kim, Cristian Canton Ferrer, Pierre Andrews, Marta R. Costa-jussà

    Abstract: Assessing performance in Natural Language Processing is becoming increasingly complex. One particular challenge is the potential for evaluation datasets to overlap with training data, either directly or indirectly, which can lead to skewed results and overestimation of model performance. As a consequence, human evaluation is gaining increasing interest as a means to assess the performance and reli…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2312.05187

    ACM Class: I.2.7

  17. arXiv:2401.07654  [pdf, other]

    cs.CV

    Foundation Models for Biomedical Image Segmentation: A Survey

    Authors: Ho Hin Lee, Yu Gu, Theodore Zhao, Yanbo Xu, Jianwei Yang, Naoto Usuyama, Cliff Wong, Mu Wei, Bennett A. Landman, Yuankai Huo, Alberto Santamaria-Pang, Hoifung Poon

    Abstract: Recent advancements in biomedical image analysis have been significantly driven by the Segment Anything Model (SAM). This transformative technology, originally developed for general-purpose computer vision, has found rapid application in medical image processing. Within the last year, marked by over 100 publications, SAM has demonstrated its prowess in zero-shot learning adaptations for medical im…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 22 pages, 4 figures, 7 tables

  18. arXiv:2312.15006  [pdf, other]

    cs.AI cs.CL cs.LG

    Assessing the Impact of Prompting Methods on ChatGPT's Mathematical Capabilities

    Authors: Yuhao Chen, Chloe Wong, Hanwen Yang, Juan Aguenza, Sai Bhujangari, Benthan Vu, Xun Lei, Amisha Prasad, Manny Fluss, Eric Phuong, Minghao Liu, Raja Kumar, Vanshika Vats, James Davis

    Abstract: This study critically evaluates the efficacy of prompting methods in enhancing the mathematical reasoning capability of large language models (LLMs). The investigation uses three prescriptive prompting methods - simple, persona, and conversational prompting - known for their effectiveness in enhancing the linguistic tasks of LLMs. We conduct this analysis on OpenAI's LLM chatbot, ChatGPT-3.5, on e…

    Submitted 20 February, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

  19. arXiv:2312.08034  [pdf, other]

    eess.IV cs.CR cs.CV cs.LG

    Individualized Deepfake Detection Exploiting Traces Due to Double Neural-Network Operations

    Authors: Mushfiqur Rahman, Runze Liu, Chau-Wai Wong, Huaiyu Dai

    Abstract: In today's digital landscape, journalists urgently require tools to verify the authenticity of facial images and videos depicting specific public figures before incorporating them into news stories. Existing deepfake detectors are not optimized for this detection task when an image is associated with a specific and identifiable individual. This study focuses on the deepfake detection of facial ima…

    Submitted 13 December, 2023; originally announced December 2023.

  20. arXiv:2312.05187  [pdf, other]

    cs.CL cs.SD eess.AS

    Seamless: Multilingual Expressive and Streaming Speech Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, et al. (40 additional authors not shown)

    Abstract: Large-scale automatic speech translation systems today lack key features that help machine-mediated communication feel seamless when compared to human-to-human dialogue. In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion. First, we contribute an improved version of the massively multilingual and multimodal SeamlessM4…

    Submitted 8 December, 2023; originally announced December 2023.

  21. Analysis of Coding Gain Due to In-Loop Reshaping

    Authors: Chau-Wai Wong, Chang-Hong Fu, Mengting Xu, Guan-Ming Su

    Abstract: Reshaping, a point operation that alters the characteristics of signals, has been shown capable of improving the compression ratio in video coding practices. Out-of-loop reshaping that directly modifies the input video signal was first adopted as the supplemental enhancement information (SEI) for the HEVC/H.265 without the need to alter the core design of the video codec. VVC/H.266 further improve…

    Submitted 19 June, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Published in IEEE Transactions on Image Processing

  22. arXiv:2312.03243  [pdf, other]

    cs.NE cs.CE cs.LG

    Generalizable Neural Physics Solvers by Baldwinian Evolution

    Authors: Jian Cheng Wong, Chin Chun Ooi, Abhishek Gupta, Pao-Hsiung Chiu, Joshua Shao Zheng Low, My Ha Dao, Yew-Soon Ong

    Abstract: Physics-informed neural networks (PINNs) are at the forefront of scientific machine learning, making possible the creation of machine intelligence that is cognizant of physical laws and able to accurately simulate them. In this paper, the potential of discovering PINNs that generalize over an entire family of physics tasks is studied, for the first time, through a biological lens of the Baldwin ef…

    Submitted 5 December, 2023; originally announced December 2023.

  23. Image-Based Soil Organic Carbon Remote Sensing from Satellite Images with Fourier Neural Operator and Structural Similarity

    Authors: Ken C. L. Wong, Levente Klein, Ademir Ferreira da Silva, Hongzhi Wang, Jitendra Singh, Tanveer Syeda-Mahmood

    Abstract: Soil organic carbon (SOC) sequestration is the transfer and storage of atmospheric carbon dioxide in soils, which plays an important role in climate change mitigation. SOC concentration can be improved by proper land use, thus it is beneficial if SOC can be estimated at a regional or global scale. As multispectral satellite data can provide SOC-related information such as vegetation and soil prope…

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: This paper was accepted by the 2023 IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2023)

  24. arXiv:2311.09581  [pdf, other]

    cs.CL

    DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation

    Authors: Yiqing Xie, Sheng Zhang, Hao Cheng, Pengfei Liu, Zelalem Gero, Cliff Wong, Tristan Naumann, Hoifung Poon, Carolyn Rose

    Abstract: Medical text generation aims to assist with administrative work and highlight salient information to support decision-making. To reflect the specific requirements of medical text, in this paper, we propose a set of metrics to evaluate the completeness, conciseness, and attribution of the generated text at a fine-grained level. The metrics can be computed by various types of evaluators including in…

    Submitted 18 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  25. arXiv:2311.03032  [pdf, other]

    cs.RO

    Reconfigurable, Transformable Soft Pneumatic Actuator with Tunable 3D Deformations for Dexterous Soft Robotics Applications

    Authors: Dickson Chiu Yu Wong, Mingtan Li, Shijie Kang, Lifan Luo, Hongyu Yu

    Abstract: Numerous soft actuators based on PneuNet design have already been proposed and extensively employed across various soft robotics applications in recent years. Despite their widespread use, a common limitation of most existing designs is that their action is pre-determined during the fabrication process, thereby restricting the ability to modify or alter their function during operation. To address…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Submitted to Soft Robotics Journal. 12 pages, 10 figures

  26. arXiv:2311.01301  [pdf, other]

    cs.LG cs.AI stat.ME

    TRIALSCOPE: A Unifying Causal Framework for Scaling Real-World Evidence Generation with Biomedical Language Models

    Authors: Javier González, Cliff Wong, Zelalem Gero, Jass Bagga, Risa Ueno, Isabel Chien, Eduard Oravkin, Emre Kiciman, Aditya Nori, Roshanthi Weerasinghe, Rom S. Leidner, Brian Piening, Tristan Naumann, Carlo Bifulco, Hoifung Poon

    Abstract: The rapid digitization of real-world data offers an unprecedented opportunity for optimizing healthcare delivery and accelerating biomedical discovery. In practice, however, such data is most abundantly available in unstructured forms, such as clinical notes in electronic medical records (EMRs), and it is generally plagued by confounders. In this paper, we present TRIALSCOPE, a unifying framework…

    Submitted 6 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: 6 Figures, 22 Pages, 3 Tables

  27. arXiv:2310.17894  [pdf, other]

    cs.CL cs.AI

    Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey

    Authors: Weixu Zhang, Yifei Wang, Yuanfeng Song, Victor Junqiu Wei, Yuxing Tian, Yiyan Qi, Jonathan H. Chan, Raymond Chi-Wing Wong, Haiqin Yang

    Abstract: The emergence of natural language processing has revolutionized the way users interact with tabular data, enabling a shift from traditional query languages and manual plotting to more intuitive, language-based interfaces. The rise of large language models (LLMs) such as ChatGPT and its successors has further advanced this field, opening new avenues for natural language processing techniques. This…

    Submitted 19 May, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: 20 pages, 4 figures, 5 tables. Accepted by IEEE TKDE

  28. arXiv:2310.08873  [pdf, other]

    cs.RO cs.AI

    Interactive Navigation in Environments with Traversable Obstacles Using Large Language and Vision-Language Models

    Authors: Zhen Zhang, Anran Lin, Chun Wai Wong, Xiangyu Chu, Qi Dou, K. W. Samuel Au

    Abstract: This paper proposes an interactive navigation framework by using large language and vision-language models, allowing robots to navigate in environments with traversable obstacles. We utilize the large language model (GPT-3.5) and the open-set Vision-language Model (Grounding DINO) to create an action-aware costmap to perform effective path planning without fine-tuning. With the large models, we ca…

    Submitted 12 March, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted by 2024 IEEE International Conference on Robotics and Automation (ICRA), 7 pages, 8 figures

  29. arXiv:2310.05370  [pdf, other]

    cs.CV

    SocialCircle: Learning the Angle-based Social Interaction Representation for Pedestrian Trajectory Prediction

    Authors: Conghao Wong, Beihao Xia, Ziqian Zou, Yulong Wang, Xinge You

    Abstract: Analyzing and forecasting trajectories of agents like pedestrians and cars in complex scenes has become more and more significant in many intelligent systems and applications. The diversity and uncertainty in socially interactive behaviors among a rich variety of agents make this task more challenging than other deterministic computer vision tasks. Researchers have made a lot of efforts to quantif…

    Submitted 26 March, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: CVPR 2024 accepted

  30. HartleyMHA: Self-Attention in Frequency Domain for Resolution-Robust and Parameter-Efficient 3D Image Segmentation

    Authors: Ken C. L. Wong, Hongzhi Wang, Tanveer Syeda-Mahmood

    Abstract: With the introduction of Transformers, different attention-based models have been proposed for image segmentation with promising results. Although self-attention allows capturing of long-range dependencies, it suffers from a quadratic complexity in the image size especially in 3D. To avoid the out-of-memory error during training, input size reduction is usually required for 3D segmentation, but th…

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: This paper was accepted by the International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI 2023). arXiv admin note: text overlap with arXiv:2310.03872

  31. FNOSeg3D: Resolution-Robust 3D Image Segmentation with Fourier Neural Operator

    Authors: Ken C. L. Wong, Hongzhi Wang, Tanveer Syeda-Mahmood

    Abstract: Due to the computational complexity of 3D medical image segmentation, training with downsampled images is a common remedy for out-of-memory errors in deep learning. Nevertheless, as standard spatial convolution is sensitive to variations in image resolution, the accuracy of a convolutional neural network trained with downsampled images can be suboptimal when applied on the original resolution. To…

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: This paper was accepted by the IEEE International Symposium on Biomedical Imaging (ISBI) 2023

  32. arXiv:2309.12218  [pdf, other]

    cs.IR cs.LG

    SR-PredictAO: Session-based Recommendation with High-Capability Predictor Add-On

    Authors: Ruida Wang, Raymond Chi-Wing Wong, Weile Tan

    Abstract: Session-based recommendation, aiming at making the prediction of the user's next item click based on the information in a single session only even in the presence of some random user's behavior, is a complex problem. This complex problem requires a high-capability model of predicting the user's next action. Most (if not all) existing models follow the encoder-predictor paradigm where all studies f…

    Submitted 20 September, 2023; originally announced September 2023.

  33. arXiv:2309.07921  [pdf, other]

    cs.CV

    OpenIllumination: A Multi-Illumination Dataset for Inverse Rendering Evaluation on Real Objects

    Authors: Isabella Liu, Linghao Chen, Ziyang Fu, Liwen Wu, Haian Jin, Zhong Li, Chin Ming Ryan Wong, Yi Xu, Ravi Ramamoorthi, Zexiang Xu, Hao Su

    Abstract: We introduce OpenIllumination, a real-world dataset containing over 108K images of 64 objects with diverse materials, captured under 72 camera views and a large number of different illuminations. For each image in the dataset, we provide accurate camera parameters, illumination ground truth, and foreground segmentation masks. Our dataset enables the quantitative evaluation of most inverse renderin…

    Submitted 1 February, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

  34. arXiv:2309.07650  [pdf, other]

    cs.CL

    Automatic Data Visualization Generation from Chinese Natural Language Questions

    Authors: Yan Ge, Victor Junqiu Wei, Yuanfeng Song, Jason Chen Zhang, Raymond Chi-Wing Wong

    Abstract: Data visualization has emerged as an effective tool for getting insights from massive datasets. Due to the hardness of manipulating the programming languages of data visualization, automatic data visualization generation from natural languages (Text-to-Vis) is becoming increasingly popular. Despite the plethora of research effort on the English Text-to-Vis, studies have yet to be conducted on data…

    Submitted 14 September, 2023; originally announced September 2023.

  35. arXiv:2308.02180  [pdf, other]

    cs.CL cs.LG

    Scaling Clinical Trial Matching Using Large Language Models: A Case Study in Oncology

    Authors: Cliff Wong, Sheng Zhang, Yu Gu, Christine Moung, Jacob Abel, Naoto Usuyama, Roshanthi Weerasinghe, Brian Piening, Tristan Naumann, Carlo Bifulco, Hoifung Poon

    Abstract: Clinical trial matching is a key process in health delivery and discovery. In practice, it is plagued by overwhelming unstructured data and unscalable manual processing. In this paper, we conduct a systematic study on scaling clinical trial matching using large language models (LLMs), with oncology as the focus area. Our study is grounded in a clinical trial matching system currently in test deplo…

    Submitted 18 August, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: 24 pages, 5 figures, accepted at Machine Learning for Healthcare (MLHC) 2023

  36. arXiv:2307.16013  [pdf, other]

    cs.AI cs.CL cs.DB

    Marrying Dialogue Systems with Data Visualization: Interactive Data Visualization Generation from Natural Language Conversations

    Authors: Yuanfeng Song, Xuefang Zhao, Raymond Chi-Wing Wong

    Abstract: Data visualization (DV) has become the prevailing tool in the market due to its effectiveness into illustrating insights in vast amounts of data. To lower the barrier of using DVs, automatic DV tasks, such as natural language question (NLQ) to visualization translation (formally called text-to-vis), have been investigated in the research community. However, text-to-vis assumes the NLQ to be well-o…

    Submitted 29 July, 2023; originally announced July 2023.

  37. arXiv:2307.06439  [pdf, other]

    cs.CL cs.AI

    Distilling Large Language Models for Biomedical Knowledge Extraction: A Case Study on Adverse Drug Events

    Authors: Yu Gu, Sheng Zhang, Naoto Usuyama, Yonas Woldesenbet, Cliff Wong, Praneeth Sanapathi, Mu Wei, Naveen Valluri, Erika Strandberg, Tristan Naumann, Hoifung Poon

    Abstract: Large language models (LLMs), such as GPT-4, have demonstrated remarkable capabilities across a wide range of tasks, including health applications. In this paper, we study how LLMs can be used to scale biomedical knowledge curation. We find that while LLMs already possess decent competency in structuring biomedical text, by distillation into a task-specific student model through self-supervised le…

    Submitted 12 July, 2023; originally announced July 2023.

  38. arXiv:2307.04336  [pdf]

    cs.AI cs.LG cs.SI

    Source-Aware Embedding Training on Heterogeneous Information Networks

    Authors: Tsai Hor Chan, Chi Ho Wong, Jiajun Shen, Guosheng Yin

    Abstract: Heterogeneous information networks (HINs) have been extensively applied to real-world tasks, such as recommendation systems, social networks, and citation networks. While existing HIN representation learning methods can effectively learn the semantic and structural features in the network, little awareness was given to the distribution discrepancy of subgraphs within a single HIN. However, we find…

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: Published in Data Intelligence 2023

  39. arXiv:2306.05436  [pdf, other]

    stat.AP cs.CY

    Remaining Useful Life Modelling with an Escalator Health Condition Analytic System

    Authors: Inez M. Zwetsloot, Yu Lin, Jiaqi Qiu, Lishuai Li, William Ka Fai Lee, Edmond Yin San Yeung, Colman Yiu Wah Yeung, Chris Chun Long Wong

    Abstract: The refurbishment of an escalator is usually linked with its design life as recommended by the manufacturer. However, the actual useful life of an escalator should be determined by its operating condition which is affected by the runtime, workload, maintenance quality, vibration, etc., rather than age only. The objective of this project is to develop a comprehensive health condition analytic syste…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 14 pages, 12 figures, 7 tables

  40. arXiv:2306.00890  [pdf, other]

    cs.CV cs.CL

    LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day

    Authors: Chunyuan Li, Cliff Wong, Sheng Zhang, Naoto Usuyama, Haotian Liu, Jianwei Yang, Tristan Naumann, Hoifung Poon, Jianfeng Gao

    Abstract: Conversational generative AI has demonstrated remarkable promise for empowering biomedical practitioners, but current investigations focus on unimodal text. Multimodal conversational AI has seen rapid progress by leveraging billions of image-text pairs from the public web, but such general-domain vision-language models still lack sophistication in understanding and conversing about biomedical imag…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 17 pages; Website: https://aka.ms/llava-med

  41. arXiv:2305.11199  [pdf, other]

    q-bio.QM cs.LG

    At-Admission Prediction of Mortality and Pulmonary Embolism in COVID-19 Patients Using Statistical and Machine Learning Methods: An International Cohort Study

    Authors: Munib Mesinovic, Xin Ci Wong, Giri Shan Rajahram, Barbara Wanjiru Citarella, Kalaiarasu M. Peariasamy, Frank van Someren Greve, Piero Olliaro, Laura Merson, Lei Clifton, Christiana Kartsonaki, ISARIC Characterisation Group

    Abstract: By September, 2022, more than 600 million cases of SARS-CoV-2 infection have been reported globally, resulting in over 6.5 million deaths. COVID-19 mortality risk estimators are often, however, developed with small unrepresentative samples and with methodological limitations. It is highly important to develop predictive tools for pulmonary embolism (PE) in COVID-19 patients as one of the most seve…

    Submitted 18 May, 2023; originally announced May 2023.

  42. arXiv:2305.06403  [pdf, other]

    cs.RO

    Sensor Observability Analysis for Maximizing Task-Space Observability of Articulated Robots

    Authors: Christopher Yee Wong, Wael Suleiman

    Abstract: We propose a novel performance metric for articulated robots with distributed directional sensors called the sensor observability analysis (SOA). These robot-mounted distributed directional sensors (e.g., joint torque sensors) change their individual sensing directions as the joints move. SOA transforms individual sensor axes in joint space to provide the cumulative sensing quality of these senso…

    Submitted 29 May, 2024; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: 15 pages, 11 figures, journal paper. arXiv admin note: substantial text overlap with arXiv:2206.10798

  43. arXiv:2305.00956  [pdf, other]

    cs.IT

    Non-Binary LDPC Code Design for Energy-Time Entanglement Quantum Key Distribution

    Authors: Debarnab Mitra, Lev Tauz, Murat Can Sarihan, Chee Wei Wong, Lara Dolecek

    Abstract: In energy-time entanglement Quantum Key Distribution (QKD), two users extract a shared secret key from the arrival times (discretized as symbols) of entangled photon pairs. In prior work, Zhou et al. proposed a multi-level coding (MLC) scheme that splits the observed symbols into bit layers and utilizes binary Low-Density Parity-Check (LDPC) codes for reconciliation of the symbols. While binary LD…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: 5 pages, 4 figures, submitted to International Symposium on Topics in Coding

  44. arXiv:2304.14937  [pdf, other]

    cs.CV

    Contactless hand tremor amplitude measurement using smartphones: development and pilot evaluation

    Authors: James Bungay, Osasenaga Emokpae, Samuel D. Relton, Jane Alty, Stefan Williams, Hui Fang, David C. Wong

    Abstract: Background: Physiological tremor is defined as an involuntary and rhythmic shaking. Tremor of the hand is a key symptom of multiple neurological diseases, and its frequency and amplitude differ according to both disease type and disease progression. In routine clinical practice, tremor frequency and amplitude are assessed by expert rating using a 0 to 4 integer scale. Such ratings are subjective…

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE EMBC 2023, Sydney (pre-refereed version)

  45. An Automatic Guidance and Quality Assessment System for Doppler Imaging of Umbilical Artery

    Authors: Chun Kit Wong, Manxi Lin, Alberto Raheli, Zahra Bashir, Morten Bo Søndergaard Svendsen, Martin Grønnebæk Tolsgaard, Aasa Feragen, Anders Nymark Christensen

    Abstract: Examination of the umbilical artery with Doppler ultrasonography is performed to investigate blood supply to the fetus through the umbilical cord, which is vital for the monitoring of fetal health. Such examination involves several steps that must be performed correctly: identifying suitable sites on the umbilical artery for the measurement, acquiring the blood flow curve in the form of a Doppler…

    Submitted 6 July, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: Fetal Ultrasound, Umbilical Artery, Doppler Ultrasound

    Journal ref: ASMUS 2023. Simplifying Medical Ultrasound pp 13-22. Lecture Notes in Computer Science, vol 14337

  46. arXiv:2304.05106  [pdf, other]

    cs.CV

    Another Vertical View: A Hierarchical Network for Heterogeneous Trajectory Prediction via Spectrums

    Authors: Conghao Wong, Beihao Xia, Qinmu Peng, Xinge You

    Abstract: With the fast development of AI-related techniques, the applications of trajectory prediction are no longer limited to easier scenes and trajectories. More and more heterogeneous trajectories with different representation forms, such as 2D or 3D coordinates, 2D or 3D bounding boxes, and even high-dimensional human skeletons, need to be analyzed and forecasted. Among these heterogeneous trajectorie…

    Submitted 11 April, 2023; originally announced April 2023.

  47. arXiv:2303.11899  [pdf, other]

    cs.AI

    Large-Scale Traffic Signal Control Using Constrained Network Partition and Adaptive Deep Reinforcement Learning

    Authors: Hankang Gu, Shangbo Wang, Xiaoguang Ma, Dongyao Jia, Guoqiang Mao, Eng Gee Lim, Cheuk Pong Ryan Wong

    Abstract: Multi-agent Deep Reinforcement Learning (MADRL) based traffic signal control has become a popular research topic in recent years. To alleviate the scalability issue of completely centralized RL techniques and the non-stationarity issue of completely decentralized RL techniques on large-scale traffic networks, some literature utilizes a regional control approach where the whole network is firstly part…

    Submitted 7 September, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

  48. arXiv:2303.00915  [pdf, other]

    cs.CV cs.CL

    BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs

    Authors: Sheng Zhang, Yanbo Xu, Naoto Usuyama, Hanwen Xu, Jaspreet Bagga, Robert Tinn, Sam Preston, Rajesh Rao, Mu Wei, Naveen Valluri, Cliff Wong, Andrea Tupini, Yu Wang, Matt Mazzola, Swadheen Shukla, Lars Liden, Jianfeng Gao, Matthew P. Lungren, Tristan Naumann, Sheng Wang, Hoifung Poon

    Abstract: Biomedical data is inherently multimodal, comprising physical measurements and natural language narratives. A generalist biomedical AI model needs to simultaneously process different modalities of data, including text and images. Therefore, training an effective generalist biomedical model requires high-quality multimodal data, such as parallel image-text pairs. Here, we present PMC-15M, a novel d…

    Submitted 16 January, 2024; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: The models are released at https://aka.ms/biomedclip

  49. arXiv:2302.01753  [pdf]

    cs.CR

    A Process Model to Improve Information Security Governance in Organisations

    Authors: Chee Kong Wong

    Abstract: Information security governance (ISG) is a relatively new and under-researched topic. A review of literature shows the lack of an ISG framework or model that can help the implementation of ISG. This research aims to introduce an empirically grounded ISG process model as a practical reference to facilitate the implementation of ISG in organisations. This research has adopted an exploratory resear…

    Submitted 26 January, 2023; originally announced February 2023.

    Comments: 313 pages, PhD Thesis

  50. arXiv:2302.01518  [pdf, other]

    cs.LG cs.CE physics.flu-dyn

    LSA-PINN: Linear Boundary Connectivity Loss for Solving PDEs on Complex Geometry

    Authors: Jian Cheng Wong, Pao-Hsiung Chiu, Chinchun Ooi, My Ha Dao, Yew-Soon Ong

    Abstract: We present a novel loss formulation for efficient learning of complex dynamics from governing physics, typically described by partial differential equations (PDEs), using physics-informed neural networks (PINNs). In our experiments, existing versions of PINNs are seen to learn poorly in many problems, especially for complex geometries, as it becomes increasingly difficult to establish appropriate…

    Submitted 2 March, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: 11 pages, 7 figures

    Journal ref: 2023 International Joint Conference on Neural Networks (IJCNN)