Showing 1–50 of 167 results for author: Gao, G

Searching in archive cs. Search in all archives.
  1. arXiv:2406.16987  [pdf]

    eess.SP cs.LG

    AI for Equitable Tennis Training: Leveraging AI for Equitable and Accurate Classification of Tennis Skill Levels and Training Phases

    Authors: Gyanna Gao, Hao-Yu Liao, Zhenhong Hu

    Abstract: Numerous studies have demonstrated the manifold benefits of tennis, such as increasing overall physical and mental health. Unfortunately, many children and youth from low-income families are unable to engage in this sport mainly due to financial constraints such as private lesson expenses as well as logistical concerns to and back from such lessons and clinics. While several tennis self-training s… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 21 pages, 9 figures, 1 table

  2. arXiv:2406.16873  [pdf, other]

    eess.SP cs.AI cs.LG cs.RO

    A Survey of Machine Learning Techniques for Improving Global Navigation Satellite Systems

    Authors: Adyasha Mohanty, Grace Gao

    Abstract: Global Navigation Satellite Systems (GNSS)-based positioning plays a crucial role in various applications, including navigation, transportation, logistics, mapping, and emergency services. Traditional GNSS positioning methods are model-based and they utilize satellite geometry and the known properties of satellite signals. However, model-based methods have limitations in challenging environments a… ▽ More

    Submitted 29 March, 2024; originally announced June 2024.

    Comments: Under consideration for EURASIP Journal on Advances in Signal Processing

  3. arXiv:2406.16679  [pdf, other]

    cs.RO

    Multi-Robot Collaborative Localization and Planning with Inter-Ranging

    Authors: Derek Knowles, Adam Dai, Grace Gao

    Abstract: Robots often use feature-based image tracking to identify their position in their surrounding environment; however, feature-based image tracking is prone to errors in low-textured and poorly lit environments. Specifically, we investigate a scenario where robots are tasked with exploring the surface of the Moon and are required to have an accurate estimate of their position to be able to correctly… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  4. arXiv:2406.09759  [pdf, other]

    cs.RO

    Autonomous Constellation Fault Monitoring with Inter-satellite Links: A Rigidity-Based Approach

    Authors: Keidai Iiyama, Daniel Neamati, Grace Gao

    Abstract: To address the need for robust positioning, navigation, and timing services in lunar and Martian environments, this paper proposes a novel fault detection framework for satellite constellations using inter-satellite ranging (ISR). Traditional fault monitoring methods rely on intense monitoring from ground-based stations, which are impractical for lunar and Martian missions due to cost constraints.… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Submitted to ION GNSS+ 2024 Conference

  5. arXiv:2406.07061  [pdf, other]

    eess.IV cs.CV

    Triage of 3D pathology data via 2.5D multiple-instance learning to guide pathologist assessments

    Authors: Gan Gao, Andrew H. Song, Fiona Wang, David Brenes, Rui Wang, Sarah S. L. Chow, Kevin W. Bishop, Lawrence D. True, Faisal Mahmood, Jonathan T. C. Liu

    Abstract: Accurate patient diagnoses based on human tissue biopsies are hindered by current clinical practice, where pathologists assess only a limited number of thin 2D tissue slices sectioned from 3D volumetric tissue. Recent advances in non-destructive 3D pathology, such as open-top light-sheet microscopy, enable comprehensive imaging of spatially heterogeneous tissue morphologies, offering the feasibili… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: CVPR CVMI 2024

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024, pp. 6955-6965

  6. arXiv:2405.15227  [pdf, other]

    cs.RO

    Neural Elevation Models for Terrain Mapping and Path Planning

    Authors: Adam Dai, Shubh Gupta, Grace Gao

    Abstract: This work introduces Neural Elevation Models (NEMos), which adapt Neural Radiance Fields to a 2.5D continuous and differentiable terrain model. In contrast to traditional terrain representations such as digital elevation models, NEMos can be readily generated from imagery, a low-cost data source, and provide a lightweight representation of terrain through an implicit continuous and differentiable… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  7. arXiv:2405.05474  [pdf]

    cs.HC

    (Dis)placed Contributions: Uncovering Hidden Hurdles to Collaborative Writing Involving Non-Native Speakers, Native Speakers, and AI-Powered Editing Tools

    Authors: Yimin Xiao, Yuewen Chen, Naomi Yamashita, Yuexi Chen, Zhicheng Liu, Ge Gao

    Abstract: Content creation today often takes place via collaborative writing. A longstanding interest of CSCW research lies in understanding and promoting the coordination between co-writers. However, little attention has been paid to individuals who write in their non-native language and to co-writer groups involving them. We present a mixed-method study that fills the above gap. Our participants included… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  8. arXiv:2404.15269  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Aligning LLM Agents by Learning Latent Preference from User Edits

    Authors: Ge Gao, Alexey Taymanov, Eduardo Salinas, Paul Mineiro, Dipendra Misra

    Abstract: We study interactive learning of LLM-based language agents based on user edits made to the agent's output. In a typical setting such as writing assistants, the user interacts with a language agent to generate a response given a context, and may optionally edit the agent response to personalize it based on their latent preference, in addition to improving the correctness. The edit feedback is natur… ▽ More

    Submitted 9 June, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

  9. arXiv:2404.13409  [pdf, other]

    cs.HC

    "I Wish There Were an AI": Challenges and AI Potential in Cancer Patient-Provider Communication

    Authors: Ziqi Yang, Xuhai Xu, Bingsheng Yao, Jiachen Li, Jennifer Bagdasarian, Guodong Gao, Dakuo Wang

    Abstract: Patient-provider communication has been crucial to cancer patients' survival after their cancer treatments. However, the research community and patients themselves often overlook the communication challenges after cancer treatments as they are overshadowed by the severity of the patient's illness and the variety and rarity of the cancer disease itself. Meanwhile, the recent technical advances in A… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 18 pages, 2 figures, submission to CSCW'24

  10. arXiv:2404.13273  [pdf, other]

    cs.CV cs.LG

    Multi-feature Reconstruction Network using Crossed-mask Restoration for Unsupervised Anomaly Detection

    Authors: Junpu Wang, Guili Xu, Chunlei Li, Guangshuai Gao, Yuehua Cheng

    Abstract: Unsupervised anomaly detection using only normal samples is of great significance for quality inspection in industrial manufacturing. Although existing reconstruction-based methods have achieved promising results, they still face two problems: poorly distinguishable information in image reconstruction and well-reconstructed anomalies caused by the model's over-generalization ability. To overcome the above i… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  11. arXiv:2404.12617  [pdf, other]

    cs.RO

    Greedy Detection and Exclusion of Multiple Faults using Euclidean Distance Matrices

    Authors: Derek Knowles, Grace Gao

    Abstract: Numerous methods have been proposed for global navigation satellite system (GNSS) receivers to detect faulty GNSS signals. One such fault detection and exclusion (FDE) method is based on the mathematical concept of Euclidean distance matrices (EDMs). This paper outlines a greedy approach that uses an improved Euclidean distance matrix-based fault detection and exclusion algorithm. The novel greedy… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Submitted to NAVIGATION: Journal of the Institute of Navigation

  12. arXiv:2404.09155  [pdf, other]

    cs.LG cs.AI cs.CL

    Mitigating Heterogeneity among Factor Tensors via Lie Group Manifolds for Tensor Decomposition Based Temporal Knowledge Graph Embedding

    Authors: Jiang Li, Xiangdong Su, Yeyun Gong, Guanglai Gao

    Abstract: Recent studies have highlighted the effectiveness of tensor decomposition methods in the Temporal Knowledge Graphs Embedding (TKGE) task. However, we found that inherent heterogeneity among factor tensors in tensor decomposition significantly hinders the tensor fusion process and further limits the performance of link prediction. To overcome this limitation, we introduce a novel method that maps f… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  13. arXiv:2404.08854  [pdf, other]

    cs.RO

    gnss_lib_py: Analyzing GNSS Data with Python

    Authors: Derek Knowles, Ashwin Vivek Kanhere, Daniel Neamati, Grace Gao

    Abstract: This paper presents gnss_lib_py, a Python library used to parse, analyze, and visualize data from a variety of GNSS (Global Navigation Satellite Systems) data sources. The gnss_lib_py library's ease of use, modular capabilities, testing coverage, and extensive documentation make it an attractive tool not only for scientific and industry users wanting a quick, out-of-the-box solution but also for a… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Submitted to the SoftwareX journal

  14. arXiv:2404.06180  [pdf, other]

    cs.CV

    YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images

    Authors: Chenguang Liu, Guangshuai Gao, Ziyue Huang, Zhenghui Hu, Qingjie Liu, Yunhong Wang

    Abstract: Detecting objects from aerial images poses significant challenges due to the following factors: 1) Aerial images typically have very large sizes, generally with millions or even hundreds of millions of pixels, while computational resources are limited. 2) Small object size leads to insufficient information for effective detection. 3) Non-uniform object distribution leads to computational resource… ▽ More

    Submitted 16 June, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: accepted to TITS

  15. Fragmented Moments, Balanced Choices: How Do People Make Use of Their Waiting Time?

    Authors: Jian Zheng, Ge Gao

    Abstract: Everyone spends some time waiting every day. HCI research has developed tools for boosting productivity while waiting. However, little is known about how people naturally spend their waiting time. We conducted an experience sampling study with 21 working adults who used a mobile app to report their daily waiting time activities over two weeks. The aim of this study is to understand the activities… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 14 pages. 6 figures. Published at ACM CHI'24

    ACM Class: H.5.m

  16. arXiv:2403.13310  [pdf, other]

    cs.IR cs.LG cs.LO

    A Semantic Search Engine for Mathlib4

    Authors: Guoxiong Gao, Haocheng Ju, Jiedong Jiang, Zihan Qin, Bin Dong

    Abstract: The interactive theorem prover, Lean, enables the verification of formal mathematical proofs and is backed by an expanding community. Central to this ecosystem is its mathematical library, mathlib4, which lays the groundwork for the formalization of an expanding range of mathematical theories. However, searching for theorems in mathlib4 can be challenging. To successfully search in mathlib4, users… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  17. arXiv:2403.06064  [pdf, other]

    cs.LG cs.AI cs.CL

    L^2GC: Lorentzian Linear Graph Convolutional Networks for Node Classification

    Authors: Qiuyu Liang, Weihua Wang, Feilong Bao, Guanglai Gao

    Abstract: Linear Graph Convolutional Networks (GCNs) are used to classify the nodes in graph data. However, we note that most existing linear GCN models perform neural network operations in Euclidean space, which does not explicitly capture the tree-like hierarchical structure exhibited in real-world datasets that are modeled as graphs. In this paper, we attempt to introduce hyperbolic space into linear GCN an… ▽ More

    Submitted 14 June, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  18. arXiv:2403.05817  [pdf, other]

    cs.CV

    SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection

    Authors: Gang Zhang, Junnan Chen, Guohuan Gao, Jianmin Li, Si Liu, Xiaolin Hu

    Abstract: LiDAR-based 3D object detection plays an essential role in autonomous driving. Existing high-performing 3D object detectors usually build dense feature maps in the backbone network and prediction head. However, the computational costs introduced by the dense feature maps grow quadratically as the perception range increases, making these models hard to scale up to long-range detection. Some recent… ▽ More

    Submitted 22 April, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024 (Oral)

  19. arXiv:2403.04096  [pdf, ps, other]

    cs.HC

    Assisting International Migrants with Everyday Information Seeking: From the Providers' Lens

    Authors: Yongle Zhang, Ge Gao

    Abstract: International migrants face difficulties obtaining information for a quality life and well-being in the host country. Prior research indicates that international migrants often seek information from their co-national cohort or contacts from the same country. The downside of this practice, however, is that people can end up clustering in a small-world environment, hindering the information seekers'… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  20. arXiv:2403.02274  [pdf, other]

    cs.RO cs.LG

    NatSGD: A Dataset with Speech, Gestures, and Demonstrations for Robot Learning in Natural Human-Robot Interaction

    Authors: Snehesh Shrestha, Yantian Zha, Saketh Banagiri, Ge Gao, Yiannis Aloimonos, Cornelia Fermuller

    Abstract: Recent advancements in multimodal Human-Robot Interaction (HRI) datasets have highlighted the fusion of speech and gesture, expanding robots' capabilities to absorb explicit and implicit HRI insights. However, existing speech-gesture HRI datasets often focus on elementary tasks, like object pointing and pushing, revealing limitations in scaling to intricate domains and prioritizing human command d… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  21. arXiv:2402.13876  [pdf, other]

    cs.CV

    Scene Prior Filtering for Depth Map Super-Resolution

    Authors: Zhengxue Wang, Zhiqiang Yan, Ming-Hsuan Yang, Jinshan Pan, Jian Yang, Ying Tai, Guangwei Gao

    Abstract: Multi-modal fusion is vital to the success of super-resolution of depth maps. However, commonly used fusion strategies, such as addition and concatenation, fall short of effectively bridging the modal gap. As a result, guided image filtering methods have been introduced to mitigate this issue. Nevertheless, it is observed that their filter kernels usually encounter significant texture interference… ▽ More

    Submitted 23 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 14 pages

  22. arXiv:2402.01681  [pdf, other]

    cs.CL cs.AI

    Emojis Decoded: Leveraging ChatGPT for Enhanced Understanding in Social Media Communications

    Authors: Yuhang Zhou, Paiheng Xu, Xiyao Wang, Xuan Lu, Ge Gao, Wei Ai

    Abstract: Emojis, which encapsulate semantics beyond mere words or phrases, have become prevalent in social network communications. This has spurred increasing scholarly interest in exploring their attributes and functionalities. However, emoji-related research and application face two primary challenges. First, researchers typically rely on crowd-sourcing to annotate emojis in order to understand their sen… ▽ More

    Submitted 16 February, 2024; v1 submitted 22 January, 2024; originally announced February 2024.

    Comments: 12 pages, 2 page appendix

  23. arXiv:2401.04429  [pdf, other]

    cs.AI cs.MA

    i-Rebalance: Personalized Vehicle Repositioning for Supply Demand Balance

    Authors: Haoyang Chen, Peiyan Sun, Qiyuan Song, Wanyuan Wang, Weiwei Wu, Wencan Zhang, Guanyu Gao, Yan Lyu

    Abstract: Ride-hailing platforms have been facing the challenge of balancing demand and supply. Existing vehicle reposition techniques often treat drivers as homogeneous agents and relocate them deterministically, assuming compliance with the reposition. In this paper, we consider a more realistic and driver-centric scenario where drivers have unique cruising preferences and can decide whether to take the r… ▽ More

    Submitted 2 April, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

  24. arXiv:2401.02292  [pdf, other]

    cs.CV

    GridFormer: Point-Grid Transformer for Surface Reconstruction

    Authors: Shengtao Li, Ge Gao, Yudong Liu, Yu-Shen Liu, Ming Gu

    Abstract: Implicit neural networks have emerged as a crucial technology in 3D surface reconstruction. To reconstruct continuous surfaces from discrete point clouds, encoding the input points into regular grid features (plane or volume) has been commonly employed in existing approaches. However, these methods typically use the grid as an index for uniformly scattering point features. Compared with the irregu… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  25. arXiv:2312.14931  [pdf]

    cs.DB

    A Parallel IFC Normalization Algorithm for Incremental Storage and Version Control

    Authors: Han Liu, Ge Gao, Ming Gu

    Abstract: Industry Foundation Classes (IFC) files are commonly used for data exchange of Building Information Models (BIMs). Due to the equivalent transformations in the graph structure of IFC data, it is a challenge to perform version comparison and incremental storage on IFC files. In this paper, an IFC normalization method is proposed, which can reduce the influence of the equivalent transformations, so… ▽ More

    Submitted 12 September, 2023; originally announced December 2023.

    Comments: in: 30th International Workshop on Intelligent Computing in Engineering (EG-ICE), 2023: 511-520

  26. arXiv:2312.13977  [pdf, other]

    cs.CV

    NeuSurf: On-Surface Priors for Neural Surface Reconstruction from Sparse Input Views

    Authors: Han Huang, Yulun Wu, Junsheng Zhou, Ge Gao, Ming Gu, Yu-Shen Liu

    Abstract: Recently, neural implicit functions have demonstrated remarkable results in the field of multi-view reconstruction. However, most existing methods are tailored for dense views and exhibit unsatisfactory performance when dealing with sparse views. Several latest methods have been proposed for generalizing implicit reconstruction to address the sparse view reconstruction task, but they still suffer… ▽ More

    Submitted 21 December, 2023; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024. Project page: https://alvin528.github.io/NeuSurf/

  27. Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation

    Authors: Tianhao Peng, Ge Gao, Heming Sun, Fan Zhang, David Bull

    Abstract: In recent years, end-to-end learnt video codecs have demonstrated their potential to compete with conventional coding algorithms in terms of compression efficiency. However, most learning-based video compression models are associated with high computational complexity and latency, in particular at the decoder side, which limits their deployment in practical applications. In this paper, we present a… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Report number: 2312.02605

  28. arXiv:2312.00093  [pdf, other]

    cs.CV cs.GR cs.LG

    GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs

    Authors: Gege Gao, Weiyang Liu, Anpei Chen, Andreas Geiger, Bernhard Schölkopf

    Abstract: As pretrained text-to-image diffusion models become increasingly powerful, recent efforts have been made to distill knowledge from these text-to-image pretrained models for optimizing a text-guided 3D model. Most of the existing methods generate a holistic 3D model from a plain text input. This can be problematic when the text describes a complex scene with multiple objects, because the vectorized… ▽ More

    Submitted 10 June, 2024; v1 submitted 30 November, 2023; originally announced December 2023.

    Comments: CVPR 2024 (18 pages, 11 figures, https://graphdreamer.github.io/)

  29. arXiv:2311.16114  [pdf]

    cs.CV cs.AI cs.LG

    Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Incomplete Data Scenarios

    Authors: Qi Fan, Haolin Zuo, Rui Liu, Zheng Lian, Guanglai Gao

    Abstract: Multimodal emotion recognition (MER) in practical scenarios is significantly challenged by the presence of missing or incomplete data across different modalities. To overcome these challenges, researchers have aimed to simulate incomplete conditions during the training phase to enhance the system's overall robustness. Traditional methods have often involved discarding data or substituting data seg… ▽ More

    Submitted 7 May, 2024; v1 submitted 21 September, 2023; originally announced November 2023.

  30. arXiv:2310.20234  [pdf, other]

    cs.CV

    HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds

    Authors: Gang Zhang, Junnan Chen, Guohuan Gao, Jianmin Li, Xiaolin Hu

    Abstract: 3D object detection in point clouds is important for autonomous driving systems. A primary challenge in 3D object detection stems from the sparse distribution of points within the 3D scene. Existing high-performance methods typically employ 3D sparse convolutional neural networks with small kernels to extract features. To reduce computational costs, these methods resort to submanifold sparse convo… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

  31. arXiv:2310.16924  [pdf, other]

    cs.CL cs.HC

    Physician Detection of Clinical Harm in Machine Translation: Quality Estimation Aids in Reliance and Backtranslation Identifies Critical Errors

    Authors: Nikita Mehandru, Sweta Agrawal, Yimin Xiao, Elaine C Khoong, Ge Gao, Marine Carpuat, Niloufar Salehi

    Abstract: A major challenge in the practical use of Machine Translation (MT) is that users lack guidance to make informed decisions about when to rely on outputs. Progress in quality estimation research provides techniques to automatically assess MT quality, but these techniques have primarily been evaluated in vitro by comparison against human judgments outside of a specific context of use. This paper eval… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  32. arXiv:2310.11834  [pdf, other]

    cs.CV

    HB-net: Holistic bursting cell cluster integrated network for occluded multi-objects recognition

    Authors: Xudong Gao, Xiao Guang Gao, Jia Rong, Xiaowei Chen, Xiang Liao, Jun Chen

    Abstract: Within the realm of image recognition, a specific category of multi-label classification (MLC) challenges arises when objects within the visual field may occlude one another, demanding simultaneous identification of both occluded and occluding objects. Traditional convolutional neural networks (CNNs) can tackle these challenges; however, those models tend to be bulky and can only attain modest lev… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  33. arXiv:2310.07718  [pdf]

    cs.NI physics.optics

    Long-term and Real-time High-speed Underwater Wireless Optical Communications in Deep Sea

    Authors: Jialiang Zhang, Sujing Wang, Ziqi Ma, Guanjun Gao, Yonggang Guo, Fei Zhang, Shanguo Huang, Jie Zhang

    Abstract: Seafloor observation networks can perform all-weather, long-term, continuous, real-time, and in-situ observation of the ocean by combining various observation methods including cabled seafloor nodes, self-contained nodes, as well as mobile platforms, where reliable and long-term high-speed underwater wireless communication becomes an essential demand. Recently, underwater wireless optical communicati… ▽ More

    Submitted 13 December, 2023; v1 submitted 23 July, 2023; originally announced October 2023.

  34. arXiv:2310.07123  [pdf, other]

    cs.LG cs.AI

    Off-Policy Evaluation for Human Feedback

    Authors: Qitong Gao, Ge Gao, Juncheng Dong, Vahid Tarokh, Min Chi, Miroslav Pajic

    Abstract: Off-policy evaluation (OPE) is important for closing the gap between offline training and evaluation of reinforcement learning (RL), by estimating performance and/or rank of target (evaluation) policies using offline trajectories only. It can improve the safety and efficiency of data collection and policy testing procedures in situations where online deployments are expensive, such as healthcare.… ▽ More

    Submitted 14 October, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  35. arXiv:2310.06534  [pdf]

    stat.ML cs.LG

    Disk failure prediction based on multi-layer domain adaptive learning

    Authors: Guangfu Gao, Peng Wu, Hussain Dawood

    Abstract: Large scale data storage is susceptible to failure. As disks are damaged and replaced, traditional machine learning models, which rely on historical data to make predictions, struggle to accurately predict disk failures. This paper presents a novel method for predicting disk failures by leveraging multi-layer domain adaptive learning techniques. First, disk data with numerous faults is selected as… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  36. arXiv:2310.06368  [pdf, other]

    cs.CV

    CoinSeg: Contrast Inter- and Intra- Class Representations for Incremental Segmentation

    Authors: Zekang Zhang, Guangyu Gao, Jianbo Jiao, Chi Harold Liu, Yunchao Wei

    Abstract: Class incremental semantic segmentation aims to strike a balance between the model's stability and plasticity by maintaining old knowledge while adapting to new concepts. However, most state-of-the-art methods use the freeze strategy for stability, which compromises the model's plasticity. In contrast, releasing parameter training for plasticity could lead to the best performance for all categories… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted by ICCV 2023

  37. arXiv:2310.05353  [pdf, ps, other]

    math.CO cs.CC math.DS

    Complexity of null dynamical systems and Sauer--Shelah lemmas

    Authors: Guorong Gao, Jie Ma, Mingyuan Rong, Tuan Tran

    Abstract: The topological entropy of a topological dynamical system, introduced in a foundational paper by Adler, Konheim and McAndrew [Trans. Am. Math. Soc., 1965], is a nonnegative number that measures the uncertainty or disorder of the system. Compared with positive entropy systems, zero entropy systems are much less understood. In order to distinguish between zero entropy systems, Huang and Ye [Adv. Ma… ▽ More

    Submitted 11 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  38. arXiv:2310.04407  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Policy-Gradient Training of Language Models for Ranking

    Authors: Ge Gao, Jonathan D. Chang, Claire Cardie, Kianté Brantley, Thorsten Joachim

    Abstract: Text retrieval plays a crucial role in incorporating factual knowledge for decision making into language processing pipelines, ranging from chat-based web search to question answering systems. Current state-of-the-art text retrieval models leverage pre-trained large language models (LLMs) to achieve competitive performance, but training LLM-based retrievers via typical contrastive losses requires… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  39. arXiv:2309.15490  [pdf, other]

    cs.CV

    Survey on Deep Face Restoration: From Non-blind to Blind and Beyond

    Authors: Wenjie Li, Mei Wang, Kai Zhang, Juncheng Li, Xiaoming Li, Yuhang Zhang, Guangwei Gao, Weihong Deng, Chia-Wen Lin

    Abstract: Face restoration (FR) is a specialized field within image restoration that aims to recover low-quality (LQ) face images into high-quality (HQ) face images. Recent advances in deep learning technology have led to significant progress in FR methods. In this paper, we begin by examining the prevalent factors responsible for real-world LQ images and introduce degradation techniques used to synthesize… ▽ More

    Submitted 8 October, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Face restoration, Survey, Deep learning, Non-blind/Blind, Joint restoration tasks, Facial priors

  40. arXiv:2309.09357  [pdf, other]

    cs.CL cs.AI cs.HC

    Talk2Care: Facilitating Asynchronous Patient-Provider Communication with Large-Language-Model

    Authors: Ziqi Yang, Xuhai Xu, Bingsheng Yao, Shao Zhang, Ethan Rogers, Stephen Intille, Nawar Shara, Guodong Gordon Gao, Dakuo Wang

    Abstract: Despite the plethora of telehealth applications to assist home-based older adults and healthcare providers, basic messaging and phone calls are still the most common communication methods, which suffer from limited availability, information loss, and process inefficiencies. One promising solution to facilitate patient-provider communication is to leverage large language models (LLMs) with their po… ▽ More

    Submitted 3 February, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: Under submission to IMWUT'23, 26 pages

    MSC Class: 68U35 ACM Class: H.5.2; I.2.7

  41. arXiv:2309.08649  [pdf]

    cs.CV

    An inspection technology of inner surface of the fine hole based on machine vision

    Authors: Rongfang He, Weibin Zhang, Guofang Gao

    Abstract: Fine holes are an important structural component of industrial components, and their inner surface quality is closely related to their function. In order to detect the quality of the inner surface of the fine hole, a special optical measurement system was investigated in this paper. A sight pipe is employed to guide the external illumination light into the fine hole and output the relevant images si… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  42. arXiv:2308.16360  [pdf, other]

    cs.CY cs.HC cs.LG

    Emoji Promotes Developer Participation and Issue Resolution on GitHub

    Authors: Yuhang Zhou, Xuan Lu, Ge Gao, Qiaozhu Mei, Wei Ai

    Abstract: Although remote working was increasingly adopted during the pandemic, many are concerned about its low efficiency. Text-based communication lacks non-verbal cues such as facial expressions and body language, which hinders effective communication and negatively impacts work outcomes. Prevalent on social media platforms, emojis, as alternative non-verbal cues, are… ▽ More

    Submitted 16 April, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Accepted by the 18th International AAAI Conference on Web and Social Media (ICWSM 2024)

  43. arXiv:2307.07848  [pdf, ps, other

    cs.DS cs.DC

    Fully Scalable MPC Algorithms for Clustering in High Dimension

    Authors: Artur Czumaj, Guichen Gao, Shaofeng H. -C. Jiang, Robert Krauthgamer, Pavel Veselý

    Abstract: We design new parallel algorithms for clustering in high-dimensional Euclidean spaces. These algorithms run in the Massively Parallel Computation (MPC) model, and are fully scalable, meaning that the local memory in each machine may be $n^σ$ for arbitrarily small fixed $σ>0$. Importantly, the local memory may be substantially smaller than the number of clusters $k$, yet all our algorithms are fast…

    Submitted 14 November, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

  44. arXiv:2307.04692  [pdf, other

    eess.SP cs.RO eess.SY

    Spoofing-Resilient LiDAR-GPS Factor Graph Localization with Chimera Authentication

    Authors: Adam Dai, Tara Minda, Ashwin Kanhere, Grace Gao

    Abstract: Many vehicle platforms typically use sensors such as LiDAR or camera for locally-referenced navigation with GPS for globally-referenced navigation. However, due to the unencrypted nature of GPS signals, all civilian users are vulnerable to spoofing attacks, where a malicious spoofer broadcasts fabricated signals and causes the user to track a false position fix. To protect against such GPS spoofi…

    Submitted 10 July, 2023; originally announced July 2023.

  45. arXiv:2306.14580  [pdf, other

    cs.CL cs.AI

    TransERR: Translation-based Knowledge Graph Embedding via Efficient Relation Rotation

    Authors: Jiang Li, Xiangdong Su, Fujun Zhang, Guanglai Gao

    Abstract: This paper presents a translation-based knowledge graph embedding method via efficient relation rotation (TransERR), a straightforward yet effective alternative to traditional translation-based knowledge graph embedding models. Different from the previous translation-based models, TransERR encodes knowledge graphs in the hypercomplex-valued space, thus enabling it to possess a higher degree of tr…

    Submitted 9 March, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

  46. HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation

    Authors: Ho Man Kwan, Ge Gao, Fan Zhang, Andrew Gower, David Bull

    Abstract: Learning-based video compression is currently a popular research topic, offering the potential to compete with conventional standard video codecs. In this context, Implicit Neural Representations (INRs) have previously been used to represent and compress image and video content, demonstrating relatively high decoding speed compared to other methods. However, existing INR-based methods have failed…

    Submitted 26 January, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

  47. arXiv:2305.17193  [pdf

    q-bio.SC cs.AI cs.CV cs.LG physics.bio-ph q-bio.QM

    AI-based analysis of super-resolution microscopy: Biological discovery in the absence of ground truth

    Authors: Ivan R. Nabi, Ben Cardoen, Ismail M. Khater, Guang Gao, Timothy H. Wong, Ghassan Hamarneh

    Abstract: Super-resolution microscopy, or nanoscopy, enables the use of fluorescent-based molecular localization tools to study molecular structure at the nanoscale level in the intact cell, bridging the mesoscale gap to classical structural biology methodologies. Analysis of super-resolution data by artificial intelligence (AI), such as machine learning, offers tremendous potential for discovery of new bio…

    Submitted 27 May, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 26 pages, 4 figures

  48. arXiv:2305.16353  [pdf, other

    cs.SD cs.AI cs.CL

    Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion

    Authors: Rui Liu, Jinhua Zhang, Guanglai Gao, Haizhou Li

    Abstract: Audio Deepfake Detection (ADD) aims to detect the fake audio generated by text-to-speech (TTS), voice conversion (VC) and replay, etc., which is an emerging topic. Traditionally we take the mono signal as input and focus on robust feature extraction and effective classifier design. However, the dual-channel stereo information in the audio signal also includes important cues for deepfake, which has…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: To appear at InterSpeech2023

  49. arXiv:2305.12473  [pdf, other

    cs.CL cs.AI cs.LG

    Continually Improving Extractive QA via Human Feedback

    Authors: Ge Gao, Hung-Ting Chen, Yoav Artzi, Eunsol Choi

    Abstract: We study continually improving an extractive question answering (QA) system via human user feedback. We design and deploy an iterative approach, where information-seeking users ask questions, receive model-predicted answers, and provide feedback. We conduct experiments involving thousands of user interactions under diverse setups to broaden the understanding of learning from feedback over time. Ou…

    Submitted 3 November, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  50. arXiv:2305.10201  [pdf

    cs.AI cs.CY

    Echoes of Biases: How Stigmatizing Language Affects AI Performance

    Authors: Yizhi Liu, Weiguang Wang, Guodong Gordon Gao, Ritu Agarwal

    Abstract: Electronic health records (EHRs) serve as an essential data source for the envisioned artificial intelligence (AI)-driven transformation in healthcare. However, clinician biases reflected in EHR notes can lead to AI models inheriting and amplifying these biases, perpetuating health disparities. This study investigates the impact of stigmatizing language (SL) in EHR notes on mortality prediction us…

    Submitted 12 June, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: 54 pages, 9 figures