Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 176 results for author: Gong, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.20906  [pdf, other

    cs.CL cs.AI physics.data-an

    Automated Review Generation Method Based on Large Language Models

    Authors: Shican Wu, Xiao Ma, Dehui Luo, Lulu Li, Xiangcheng Shi, Xin Chang, Xiaoyun Lin, Ran Luo, Chunlei Pei, Zhi-Jian Zhao, Jinlong Gong

    Abstract: Literature research, vital for scientific advancement, is overwhelmed by the vast ocean of available information. Addressing this, we propose an automated review generation method based on Large Language Models (LLMs) to streamline literature processing and reduce cognitive load. In case study on propane dehydrogenation (PDH) catalysts, our method swiftly generated comprehensive reviews from 343 a… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: 16 pages, 3 figures, 3 tables

  2. arXiv:2407.18267  [pdf, other

    cs.AR cs.AI cs.LG

    MCU-MixQ: A HW/SW Co-optimized Mixed-precision Neural Network Design Framework for MCUs

    Authors: Junfeng Gong, Cheng Liu, Long Cheng, Huawei Li, Xiaowei Li

    Abstract: Mixed-precision neural network (MPNN) that utilizes just enough data width for the neural network processing is an effective approach to meet the stringent resources constraints including memory and computing of MCUs. Nevertheless, there is still a lack of sub-byte and mixed-precision SIMD operations in MCU-class ISA and the limited computing capability of MCUs remains underutilized, which further… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

  3. arXiv:2407.17745  [pdf, other

    cs.CL

    Beyond Entity Alignment: Towards Complete Knowledge Graph Alignment via Entity-Relation Synergy

    Authors: Xiaohan Fang, Chaozhuo Li, Yi Zhao, Qian Zang, Litian Zhang, Jiquan Peng, Xi Zhang, Jibing Gong

    Abstract: Knowledge Graph Alignment (KGA) aims to integrate knowledge from multiple sources to address the limitations of individual Knowledge Graphs (KGs) in terms of coverage and depth. However, current KGA models fall short in achieving a ``complete'' knowledge graph alignment. Existing models primarily emphasize the linkage of cross-graph entities but overlook aligning relations across KGs, thereby prov… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  4. arXiv:2407.14982  [pdf, other

    cs.CV cs.AI

    GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation

    Authors: Jingzhi Gong, Sisi Li, Giordano d'Aloisio, Zishuo Ding, Yulong Ye, William B. Langdon, Federica Sarro

    Abstract: Tuning the parameters and prompts for improving AI-based text-to-image generation has remained a substantial yet unaddressed challenge. Hence we introduce GreenStableYolo, which improves the parameters and prompts for Stable Diffusion to both reduce GPU inference time and increase image generation quality using NSGA-II and Yolo. Our experiments show that despite a relatively slight trade-off (18… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

    Comments: This paper is published in the SSBSE Challenge Track 2024

  5. arXiv:2407.02706  [pdf, other

    cs.SE cs.AI

    Pushing the Boundary: Specialising Deep Configuration Performance Learning

    Authors: Jingzhi Gong

    Abstract: Software systems often have numerous configuration options that can be adjusted to meet different performance requirements. However, understanding the combined impact of these options on performance is often challenging, especially with limited real-world data. To tackle this issue, deep learning techniques have gained popularity due to their ability to capture complex relationships even with limi… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: This PhD thesis was submitted in May 2024

  6. arXiv:2407.00115  [pdf, other

    cs.LG cs.AI

    Instance Temperature Knowledge Distillation

    Authors: Zhengbo Zhang, Yuxi Zhou, Jia Gong, Jun Liu, Zhigang Tu

    Abstract: Knowledge distillation (KD) enhances the performance of a student network by allowing it to learn the knowledge transferred from a teacher network incrementally. Existing methods dynamically adjust the temperature to enable the student network to adapt to the varying learning difficulties at different learning stages of KD. KD is a continuous process, but when adjusting the temperature, these meth… ▽ More

    Submitted 7 July, 2024; v1 submitted 27 June, 2024; originally announced July 2024.

    ACM Class: I.4.0

  7. arXiv:2406.11589  [pdf, other

    cs.SE cs.AI cs.IR

    CoSQA+: Enhancing Code Search Dataset with Matching Code

    Authors: Jing Gong, Yanghui Wu, Linxi Liang, Zibin Zheng, Yanlin Wang

    Abstract: Semantic code search, retrieving code that matches a given natural language query, is an important task to improve productivity in software engineering. Existing code search datasets are problematic: either using unrealistic queries, or with mismatched codes, and typically using one-to-one query-code pairing, which fails to reflect the reality that a query might have multiple valid code matches. T… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 11 pages, 4 figures, conference

    ACM Class: I.2.7; D.2.3

  8. arXiv:2406.11253  [pdf, other

    cs.CV

    Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space

    Authors: Yuan Wang, Zhao Wang, Junhao Gong, Di Huang, Tong He, Wanli Ouyang, Jile Jiao, Xuetao Feng, Qi Dou, Shixiang Tang, Dan Xu

    Abstract: In this paper, we introduce a novel path to $\textit{general}$ human motion generation by focusing on 2D space. Traditional methods have primarily generated human motions in 3D, which, while detailed and realistic, are often limited by the scope of available 3D motion data in terms of both the size and the diversity. To address these limitations, we exploit extensive availability of 2D motion data… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 22 pages, 11figures, 17 tables

  9. arXiv:2406.09467  [pdf, other

    cs.HC

    "I see it as a wellspring for my positive and upward journey in life.": Understanding Current Practices of Assistive Technology's Customized Modification in China

    Authors: Kexin Yang, Junyi Wu, Haokun Xin, Jiangtao Gong

    Abstract: Due to the significant differences in physical conditions and living environments of people with disabilities, standardized assistive technologies (ATs) often fail to meet their needs. Modified AT, especially DIY (Do It Yourself) ATs, are a popular solution in many high-income countries, but there is a lack of documentation for low- and middle-income areas, especially in China, where the culture o… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    MSC Class: H.5.2

    Journal ref: CSCW2024

  10. arXiv:2406.04985  [pdf, ps, other

    eess.SP cs.ET

    Hybrid Beamforming Design for RSMA-assisted mmWave Integrated Sensing and Communications

    Authors: Jun Gong, Wenchi Cheng, Jiangzhou Wang, Jingqing Wang

    Abstract: Integrated sensing and communications (ISAC) has been considered one of the new paradigms for sixth-generation (6G) wireless networks. In the millimeter-wave (mmWave) ISAC system, hybrid beamforming (HBF) is considered an emerging technology to exploit the limited number of radio frequency (RF) chains in order to reduce the system hardware cost and power consumption. However, the HBF structure red… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  11. arXiv:2406.04449  [pdf, other

    cs.CL cs.CV

    MAIRA-2: Grounded Radiology Report Generation

    Authors: Shruthi Bannur, Kenza Bouzid, Daniel C. Castro, Anton Schwaighofer, Sam Bond-Taylor, Maximilian Ilse, Fernando Pérez-García, Valentina Salvatelli, Harshita Sharma, Felix Meissen, Mercy Ranjit, Shaury Srivastav, Julia Gong, Fabian Falck, Ozan Oktay, Anja Thieme, Matthew P. Lungren, Maria Teodora Wetscherek, Javier Alvarez-Valle, Stephanie L. Hyland

    Abstract: Radiology reporting is a complex task that requires detailed image understanding, integration of multiple inputs, including comparison with prior imaging, and precise language generation. This makes it ideal for the development and use of generative multimodal models. Here, we extend report generation to include the localisation of individual findings on the image - a task we call grounded report… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 44 pages, 20 figures

  12. arXiv:2405.17765  [pdf, other

    cs.CV

    PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild

    Authors: Kun Yuan, Hongbo Liu, Mading Li, Muyi Sun, Ming Sun, Jiachao Gong, Jinhua Hao, Chao Zhou, Yansong Tang

    Abstract: Video quality assessment (VQA) is a challenging problem due to the numerous factors that can affect the perceptual quality of a video, \eg, content attractiveness, distortion type, motion pattern, and level. However, annotating the Mean opinion score (MOS) for videos is expensive and time-consuming, which limits the scale of VQA datasets, and poses a significant obstacle for deep learning-based me… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: CVPR 2024, 11 pages, 4 figures, 7 tables

  13. arXiv:2405.15763  [pdf, other

    cs.CV

    FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis

    Authors: Ke Fan, Junshu Tang, Weijian Cao, Ran Yi, Moran Li, Jingyu Gong, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Lizhuang Ma

    Abstract: Text-to-motion synthesis is a crucial task in computer vision. Existing methods are limited in their universality, as they are tailored for single-person or two-person scenarios and can not be applied to generate motions for more individuals. To achieve the number-free motion synthesis, this paper reconsiders motion generation and proposes to unify the single and multi-person motion by the conditi… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  14. arXiv:2405.12663  [pdf, other

    cs.GR cs.CV

    LAGA: Layered 3D Avatar Generation and Customization via Gaussian Splatting

    Authors: Jia Gong, Shenyu Ji, Lin Geng Foo, Kang Chen, Hossein Rahmani, Jun Liu

    Abstract: Creating and customizing a 3D clothed avatar from textual descriptions is a critical and challenging task. Traditional methods often treat the human body and clothing as inseparable, limiting users' ability to freely mix and match garments. In response to this limitation, we present LAyered Gaussian Avatar (LAGA), a carefully designed framework enabling the creation of high-fidelity decomposable a… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  15. arXiv:2405.11272  [pdf, other

    cs.IR cs.AI

    Double Correction Framework for Denoising Recommendation

    Authors: Zhuangzhuang He, Yifan Wang, Yonghui Yang, Peijie Sun, Le Wu, Haoyue Bai, Jinqi Gong, Richang Hong, Min Zhang

    Abstract: As its availability and generality in online services, implicit feedback is more commonly used in recommender systems. However, implicit feedback usually presents noisy samples in real-world recommendation scenarios (such as misclicks or non-preferential behaviors), which will affect precise user preference learning. To overcome the noisy samples problem, a popular solution is based on dropping no… ▽ More

    Submitted 27 May, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD 2024

  16. arXiv:2404.17820  [pdf, other

    cs.RO cs.AI cs.LG

    Motion planning for off-road autonomous driving based on human-like cognition and weight adaptation

    Authors: Yuchun Wang, Cheng Gong, Jianwei Gong, Peng Jia

    Abstract: Driving in an off-road environment is challenging for autonomous vehicles due to the complex and varied terrain. To ensure stable and efficient travel, the vehicle requires consideration and balancing of environmental factors, such as undulations, roughness, and obstacles, to generate optimal trajectories that can adapt to changing scenarios. However, traditional motion planners often utilize a fi… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Journal ref: Journal of Field Robotics,2024,1-22

  17. Beyond Imitation: A Life-long Policy Learning Framework for Path Tracking Control of Autonomous Driving

    Authors: C. Gong, C. Lu, Z. Li, Z. Liu, J. Gong, X. Chen

    Abstract: Model-free learning-based control methods have recently shown significant advantages over traditional control methods in avoiding complex vehicle characteristic estimation and parameter tuning. As a primary policy learning method, imitation learning (IL) is capable of learning control policies directly from expert demonstrations. However, the performance of IL policies is highly dependent on the d… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Journal ref: IEEE Transactions on Vehicular Technology 2024 Pages 1-14

  18. arXiv:2404.12141  [pdf, other

    q-bio.BM cs.LG

    MolCRAFT: Structure-Based Drug Design in Continuous Parameter Space

    Authors: Yanru Qu, Keyue Qiu, Yuxuan Song, Jingjing Gong, Jiawei Han, Mingyue Zheng, Hao Zhou, Wei-Ying Ma

    Abstract: Generative models for structure-based drug design (SBDD) have shown promising results in recent years. Existing works mainly focus on how to generate molecules with higher binding affinity, ignoring the feasibility prerequisites for generated 3D poses and resulting in false positives. We conduct thorough studies on key factors of ill-conformational problems when applying autoregressive methods and… ▽ More

    Submitted 27 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Accepted to ICML 2024

  19. arXiv:2404.06892  [pdf, other

    cs.CV

    SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving

    Authors: Diankun Zhang, Guoan Wang, Runwen Zhu, Jianbo Zhao, Xiwu Chen, Siyu Zhang, Jiahao Gong, Qibin Zhou, Wenyuan Zhang, Ningzi Wang, Feiyang Tan, Hangning Zhou, Ziyao Xu, Haotian Yao, Chi Zhang, Xiaojun Liu, Xiaoguang Di, Bin Li

    Abstract: End-to-End paradigms use a unified framework to implement multi-tasks in an autonomous driving system. Despite simplicity and clarity, the performance of end-to-end autonomous driving methods on sub-tasks is still far behind the single-task methods. Meanwhile, the widely used dense BEV features in previous end-to-end methods make it costly to extend to more modalities or tasks. In this paper, we p… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  20. arXiv:2404.04579  [pdf, other

    cs.HC

    TeleAware Robot: Designing Awareness-augmented Telepresence Robot for Remote Collaborative Locomotion

    Authors: Ruyi Li, Yaxin Zhu, Min Liu, Yihang Zeng, Shanning Zhuang, Jiayi Fu, Yi Lu, Guyue Zhou, Can Liu, Jiangtao Gong

    Abstract: Telepresence robots can be used to support users to navigate an environment remotely and share the visiting experience with their social partners. Although such systems allow users to see and hear the remote environment and communicate with their partners via live video feed, this does not provide enough awareness of the environment and their remote partner's activities. In this paper, we introduc… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 33 pages, 12 figures

    MSC Class: H.5.2

    Journal ref: IMUWT 2024

  21. arXiv:2404.02700  [pdf, other

    cs.IT

    Optimizing Peak Age of Information in MEC Systems: Computing Preemption and Non-preemption

    Authors: Jianhang Zhu, Jie Gong

    Abstract: The freshness of information in real-time monitoring systems has received increasing attention, with Age of Information (AoI) emerging as a novel metric for measuring information freshness. In many applications, update packets need to be computed before being delivered to a destination. Mobile edge computing (MEC) is a promising approach for efficiently accomplishing the computing process, where t… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  22. arXiv:2404.00925  [pdf, other

    cs.CV cs.CL

    LLMs are Good Sign Language Translators

    Authors: Jia Gong, Lin Geng Foo, Yixuan He, Hossein Rahmani, Jun Liu

    Abstract: Sign Language Translation (SLT) is a challenging task that aims to translate sign videos into spoken language. Inspired by the strong translation capabilities of large language models (LLMs) that are trained on extensive multilingual text corpora, we aim to harness off-the-shelf LLMs to handle SLT. In this paper, we regularize the sign videos to embody linguistic characteristics of spoken language… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024

  23. arXiv:2403.18969   

    cs.CL cs.AI cs.IT cs.LG

    A Survey on Large Language Models from Concept to Implementation

    Authors: Chen Wang, Jin Zhao, Jiaqi Gong

    Abstract: Recent advancements in Large Language Models (LLMs), particularly those built on Transformer architectures, have significantly broadened the scope of natural language processing (NLP) applications, transcending their initial use in chatbot technology. This paper investigates the multifaceted applications of these models, with an emphasis on the GPT series. This exploration focuses on the transform… ▽ More

    Submitted 27 May, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Section 3 lacks to clarity and accuracy in defining the applications and capabilities of LLMs. More rework needs to be done on illustrate how LLMs being used in cross-domains

  24. arXiv:2403.16257  [pdf, other

    cs.CV

    Unlearning Backdoor Threats: Enhancing Backdoor Defense in Multimodal Contrastive Learning via Local Token Unlearning

    Authors: Siyuan Liang, Kuanrong Liu, Jiajun Gong, Jiawei Liang, Yuan Xun, Ee-Chien Chang, Xiaochun Cao

    Abstract: Multimodal contrastive learning has emerged as a powerful paradigm for building high-quality features using the complementary strengths of various data modalities. However, the open nature of such systems inadvertently increases the possibility of backdoor attacks. These attacks subtly embed malicious behaviors within the model during training, which can be activated by specific triggers in the in… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 6 pages, 2 figures

  25. arXiv:2403.16159  [pdf, other

    cs.HC

    Designing Child-Centric AI Learning Environments: Insights from LLM-Enhanced Creative Project-Based Learning

    Authors: Siyu Zha, Yuehan Qiao, Qingyu Hu, Zhongsheng Li, Jiangtao Gong, Yingqing Xu

    Abstract: Project-based learning (PBL) is an instructional method that is very helpful in nurturing students' creativity, but it requires significant time and energy from both students and teachers. Large language models (LLMs) have been proven to assist in creative tasks, yet much controversy exists regarding their role in fostering creativity. This paper explores the potential of LLMs in PBL settings, wit… ▽ More

    Submitted 5 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  26. arXiv:2403.15441  [pdf, other

    physics.chem-ph cs.AI cs.LG q-bio.BM

    Unified Generative Modeling of 3D Molecules via Bayesian Flow Networks

    Authors: Yuxuan Song, Jingjing Gong, Yanru Qu, Hao Zhou, Mingyue Zheng, Jingjing Liu, Wei-Ying Ma

    Abstract: Advanced generative model (e.g., diffusion model) derived from simplified continuity assumptions of data distribution, though showing promising progress, has been difficult to apply directly to geometry generation applications due to the multi-modality and noise-sensitive nature of molecule geometry. This work introduces Geometric Bayesian Flow Networks (GeoBFN), which naturally fits molecule geom… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: ICLR 2024

  27. arXiv:2403.13351  [pdf, other

    cs.CV

    OrthCaps: An Orthogonal CapsNet with Sparse Attention Routing and Pruning

    Authors: Xinyu Geng, Jiaming Wang, Jiawei Gong, Yuerong Xue, Jun Xu, Fanglin Chen, Xiaolin Huang

    Abstract: Redundancy is a persistent challenge in Capsule Networks (CapsNet),leading to high computational costs and parameter counts. Although previous works have introduced pruning after the initial capsule layer, dynamic routing's fully connected nature and non-orthogonal weight matrices reintroduce redundancy in deeper layers. Besides, dynamic routing requires iterating to converge, further increasing c… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 8 pages

  28. arXiv:2403.11368  [pdf, other

    cs.RO cs.AI

    Driving Style Alignment for LLM-powered Driver Agent

    Authors: Ruoxuan Yang, Xinyue Zhang, Anais Fernandez-Laaksonen, Xin Ding, Jiangtao Gong

    Abstract: Recently, LLM-powered driver agents have demonstrated considerable potential in the field of autonomous driving, showcasing human-like reasoning and decision-making abilities.However, current research on aligning driver agent behaviors with human driving styles remains limited, partly due to the scarcity of high-quality natural language data from human driving behaviors.To address this research ga… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    MSC Class: 68T42

  29. arXiv:2403.11057  [pdf, other

    cs.CV cs.RO

    Large Language Models Powered Context-aware Motion Prediction in Autonomous Driving

    Authors: Xiaoji Zheng, Lixiu Wu, Zhijie Yan, Yuanrong Tang, Hao Zhao, Chen Zhong, Bokui Chen, Jiangtao Gong

    Abstract: Motion prediction is among the most fundamental tasks in autonomous driving. Traditional methods of motion forecasting primarily encode vector information of maps and historical trajectory data of traffic participants, lacking a comprehensive understanding of overall traffic semantics, which in turn affects the performance of prediction tasks. In this paper, we utilized Large Language Models (LLMs… ▽ More

    Submitted 29 July, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: 6 pages,4 figures

    MSC Class: 68T45

  30. arXiv:2403.10887  [pdf, other

    cs.CV

    LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival

    Authors: Yuanxin Zhao, Mi Zhang, Bingnan Yang, Zhan Zhang, Jiaju Kang, Jianya Gong

    Abstract: Image-text retrieval (ITR) plays a significant role in making informed decisions for various remote sensing (RS) applications. Nonetheless, creating ITR datasets containing vision and language modalities not only requires significant geo-spatial sampling area but also varing categories and detailed descriptions. To this end, we introduce an image caption dataset LuojiaHOG, which is geospatial-awar… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  31. arXiv:2403.08002  [pdf, other

    cs.CL cs.CV

    Towards a clinically accessible radiology foundation model: open-access and lightweight, with automated evaluation

    Authors: Juan Manuel Zambrano Chaves, Shih-Cheng Huang, Yanbo Xu, Hanwen Xu, Naoto Usuyama, Sheng Zhang, Fei Wang, Yujia Xie, Mahmoud Khademi, Ziyi Yang, Hany Awadalla, Julia Gong, Houdong Hu, Jianwei Yang, Chunyuan Li, Jianfeng Gao, Yu Gu, Cliff Wong, Mu Wei, Tristan Naumann, Muhao Chen, Matthew P. Lungren, Akshay Chaudhari, Serena Yeung-Levy, Curtis P. Langlotz , et al. (2 additional authors not shown)

    Abstract: The scaling laws and extraordinary performance of large foundation models motivate the development and utilization of such models in biomedicine. However, despite early promising results on some biomedical benchmarks, there are still major challenges that need to be addressed before these models can be used in real-world clinics. Frontier general-domain models such as GPT-4V still have significant… ▽ More

    Submitted 26 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  32. arXiv:2403.07386  [pdf, other

    cs.IT

    Multi-source Scheduling and Resource Allocation for Age-of-Semantic-Importance Optimization in Status Update Systems

    Authors: Lunyuan Chen, Jie Gong

    Abstract: In recent years, semantic communication is progressively emerging as an effective means of facilitating intelligent and context-aware communication. However, current researches seldom simultaneously consider the reliability and timeliness of semantic communication, where scheduling and resource allocation (SRA) plays a crucial role. In contrast, conventional age-based approaches cannot seamlessly… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 6 pages, 5 figures, accepted by IEEE WCNC wksp 2024

  33. arXiv:2403.03322  [pdf, other

    cs.SE cs.AI cs.LG

    Deep Configuration Performance Learning: A Systematic Survey and Taxonomy

    Authors: Jingzhi Gong, Tao Chen

    Abstract: Performance is arguably the most crucial attribute that reflects the quality of a configurable software system. However, given the increasing scale and complexity of modern software, modeling and predicting how various configurations can impact performance becomes one of the major challenges in software maintenance. As such, performance is often modeled without having a thorough knowledge of the s… ▽ More

    Submitted 2 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Submitted to TOSEM survey track

  34. arXiv:2403.01740  [pdf, other

    cs.CV

    DEMOS: Dynamic Environment Motion Synthesis in 3D Scenes via Local Spherical-BEV Perception

    Authors: Jingyu Gong, Min Wang, Wentao Liu, Chen Qian, Zhizhong Zhang, Yuan Xie, Lizhuang Ma

    Abstract: Motion synthesis in real-world 3D scenes has recently attracted much attention. However, the static environment assumption made by most current methods usually cannot be satisfied especially for real-time motion synthesis in scanned point cloud scenes, if multiple dynamic objects exist, e.g., moving persons or vehicles. To handle this problem, we propose the first Dynamic Environment MOtion Synthe… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  35. arXiv:2403.00724  [pdf, other

    cs.CL cs.CV

    Few-Shot Relation Extraction with Hybrid Visual Evidence

    Authors: Jiaying Gong, Hoda Eldardiry

    Abstract: The goal of few-shot relation extraction is to predict relations between name entities in a sentence when only a few labeled instances are available for training. Existing few-shot relation extraction methods focus on uni-modal information such as text only. This reduces performance when there are no clear contexts between the name entities described in text. We propose a multi-modal few-shot rela… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 16 pages, 5 figures

    Journal ref: LREC-COLING 2024

  36. arXiv:2402.16915  [pdf, other

    cs.LG cs.AI

    More Than Routing: Joint GPS and Route Modeling for Refine Trajectory Representation Learning

    Authors: Zhipeng Ma, Zheyan Tu, Xinhai Chen, Yan Zhang, Deguo Xia, Guyue Zhou, Yilun Chen, Yu Zheng, Jiangtao Gong

    Abstract: Trajectory representation learning plays a pivotal role in supporting various downstream tasks. Traditional methods in order to filter the noise in GPS trajectories tend to focus on routing-based methods used to simplify the trajectories. However, this approach ignores the motion details contained in the GPS data, limiting the representation capability of trajectory representation learning. To fil… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  37. Understanding Human-AI Collaboration in Music Therapy Through Co-Design with Therapists

    Authors: Jingjing Sun, Jingyi Yang, Guyue Zhou, Yucheng Jin, Jiangtao Gong

    Abstract: The rapid development of musical AI technologies has expanded the creative potential of various musical activities, ranging from music style transformation to music generation. However, little research has investigated how musical AIs can support music therapists, who urgently need new technology support. This study used a mixed method, including semi-structured interviews and a participatory desi… ▽ More

    Submitted 15 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 20 pages, 7 figures

    MSC Class: H.5.2

    Journal ref: CHI2024

  38. "It Must Be Gesturing Towards Me": Gesture-Based Interaction between Autonomous Vehicles and Pedestrians

    Authors: Xiang Chang, Zihe Chen, Xiaoyan Dong, Yuxin Cai, Tingmin Yan, Haolin Cai, Zherui Zhou, Guyue Zhou, Jiangtao Gong

    Abstract: Interacting with pedestrians understandably and efficiently is one of the toughest challenges faced by autonomous vehicles (AVs) due to the limitations of current algorithms and external human-machine interfaces (eHMIs). In this paper, we design eHMIs based on gestures inspired by the most popular method of interaction between pedestrians and human drivers. Eight common gestures were selected to c… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 26 pages,22 figures

    MSC Class: H.5.2

    Journal ref: CHI2024

  39. arXiv:2402.08802  [pdf, other

    cs.IR

    Multi-Label Zero-Shot Product Attribute-Value Extraction

    Authors: Jiaying Gong, Hoda Eldardiry

    Abstract: E-commerce platforms should provide detailed product descriptions (attribute values) for effective product search and recommendation. However, attribute value information is typically not available for new products. To predict unseen attribute values, large quantities of labeled training data are needed to train a traditional supervised learning model. Typically, it is difficult, time-consuming, a… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 12 pages, 4 figures, WWW2024

  40. arXiv:2402.08420  [pdf, other

    cs.HC

    Vision-Based Hand Gesture Customization from a Single Demonstration

    Authors: Soroush Shahi, Cori Tymoszek Park, Richard Kang, Asaf Liberman, Oron Levy, Jun Gong, Abdelkareem Bedri, Gierad Laput

    Abstract: Hand gesture recognition is becoming a more prevalent mode of human-computer interaction, especially as cameras proliferate across everyday devices. Despite continued progress in this field, gesture customization is often underexplored. Customization is crucial since it enables users to define and demonstrate gestures that are more natural, memorable, and accessible. However, customization require… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    ACM Class: H.5.2; I.4

  41. arXiv:2402.04290  [pdf, other

    cs.LG cs.AI

    CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling

    Authors: Junchao Gong, Lei Bai, Peng Ye, Wanghan Xu, Na Liu, Jianhua Dai, Xiaokang Yang, Wanli Ouyang

    Abstract: Precipitation nowcasting based on radar data plays a crucial role in extreme weather prediction and has broad implications for disaster management. Despite progresses have been made based on deep learning, two key challenges of precipitation nowcasting are not well-solved: (i) the modeling of complex precipitation system evolutions with different scales, and (ii) accurate forecasts for extreme pre… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  42. arXiv:2402.03714  [pdf, other

    cs.HC cs.LG

    Advancing Location-Invariant and Device-Agnostic Motion Activity Recognition on Wearable Devices

    Authors: Rebecca Adaimi, Abdelkareem Bedri, Jun Gong, Richard Kang, Joanna Arreaza-Taylor, Gerri-Michelle Pascual, Michael Ralph, Gierad Laput

    Abstract: Wearable sensors have permeated into people's lives, ushering impactful applications in interactive systems and activity recognition. However, practitioners face significant obstacles when dealing with sensing heterogeneities, requiring custom models for different platforms. In this paper, we conduct a comprehensive evaluation of the generalizability of motion models across sensor locations. Our a… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  43. arXiv:2402.03183  [pdf, other

    cs.SE cs.AI cs.LG cs.PF

    Predicting Configuration Performance in Multiple Environments with Sequential Meta-learning

    Authors: Jingzhi Gong, Tao Chen

    Abstract: Learning and predicting the performance of given software configurations are of high importance to many software engineering activities. While configurable software systems will almost certainly face diverse running environments (e.g., version, hardware, and workload), current work often either builds performance models under a single environment or fails to properly handle data from diverse setti… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: This paper has been accepted by FSE'24

  44. arXiv:2402.00059  [pdf, other

    cs.LG cs.AI physics.ao-ph

    FengWu-GHR: Learning the Kilometer-scale Medium-range Global Weather Forecasting

    Authors: Tao Han, Song Guo, Fenghua Ling, Kang Chen, Junchao Gong, Jingjia Luo, Junxia Gu, Kan Dai, Wanli Ouyang, Lei Bai

    Abstract: Kilometer-scale modeling of global atmosphere dynamics enables fine-grained weather forecasting and decreases the risk of disastrous weather and climate activity. Therefore, building a kilometer-scale global forecast model is a persistent pursuit in the meteorology domain. Active international efforts have been made in past decades to improve the spatial resolution of numerical weather models. Non… ▽ More

    Submitted 28 January, 2024; originally announced February 2024.

    Comments: 19 pages

  45. arXiv:2401.08126  [pdf, other

    cs.NI

    Octopus: A Fair Packet Delivery Service

    Authors: Junzhi Gong, Yuliang Li, Devdeep Ray, KK Yap, Nandita Dukkipati

    Abstract: The packet delivery fairness is critical in many applications in the cloud, such as exchange systems, consensus protocols, and online gaming applications. However, due to nonidentical and dynamic packet forwarding paths, as well as many in-network queuing delays, supporting packet delivery fairness is challenging in a shared compute environment. In this paper, we present Octopus, the first general… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  46. arXiv:2401.06199  [pdf, other

    q-bio.QM cs.AI cs.LG

    xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein

    Authors: Bo Chen, Xingyi Cheng, Pan Li, Yangli-ao Geng, Jing Gong, Shen Li, Zhilei Bei, Xu Tan, Boyan Wang, Xin Zeng, Chiming Liu, Aohan Zeng, Yuxiao Dong, Jie Tang, Le Song

    Abstract: Protein language models have shown remarkable success in learning biological information from protein sequences. However, most existing models are limited by either autoencoding or autoregressive pre-training objectives, which makes them struggle to handle protein understanding and generation tasks concurrently. We propose a unified protein language model, xTrimoPGLM, to address these two types of… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  47. arXiv:2312.07168  [pdf, other

    cs.LG cs.AI

    Equivariant Flow Matching with Hybrid Probability Transport

    Authors: Yuxuan Song, Jingjing Gong, Minkai Xu, Ziyao Cao, Yanyan Lan, Stefano Ermon, Hao Zhou, Wei-Ying Ma

    Abstract: The generation of 3D molecules requires simultaneously deciding the categorical features~(atom types) and continuous features~(atom coordinates). Deep generative models, especially Diffusion Models (DMs), have demonstrated effectiveness in generating feature-rich geometries. However, existing DMs typically suffer from unstable probability dynamics with inefficient sampling speed. In this paper, we… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  48. arXiv:2311.15156  [pdf, other

    cs.LG cs.AI q-bio.GN

    xTrimoGene: An Efficient and Scalable Representation Learner for Single-Cell RNA-Seq Data

    Authors: Jing Gong, Minsheng Hao, Xingyi Cheng, Xin Zeng, Chiming Liu, Jianzhu Ma, Xuegong Zhang, Taifeng Wang, Le Song

    Abstract: Advances in high-throughput sequencing technology have led to significant progress in measuring gene expressions at the single-cell level. The amount of publicly available single-cell RNA-seq (scRNA-seq) data is already surpassing 50M records for humans with each record measuring 20,000 genes. This highlights the need for unsupervised representation learning to fully ingest these data, yet classic… ▽ More

    Submitted 24 February, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS 2023

  49. arXiv:2311.13752  [pdf, other

    cs.CV cs.AI

    3D-MIR: A Benchmark and Empirical Study on 3D Medical Image Retrieval in Radiology

    Authors: Asma Ben Abacha, Alberto Santamaria-Pang, Ho Hin Lee, Jameson Merkow, Qin Cai, Surya Teja Devarakonda, Abdullah Islam, Julia Gong, Matthew P. Lungren, Thomas Lin, Noel C Codella, Ivan Tarapov

    Abstract: The increasing use of medical imaging in healthcare settings presents a significant challenge due to the increasing workload for radiologists, yet it also offers opportunity for enhancing healthcare outcomes if effectively leveraged. 3D image retrieval holds potential to reduce radiologist workloads by enabling clinicians to efficiently search through diagnostically similar or otherwise relevant c… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  50. arXiv:2311.04150   

    cs.HC

    What Makes a Fantastic Passenger-Car Driver in Urban Contexts?

    Authors: Yueteng Yu, Zhijie Yi, Xinyu Yang, Mengdi Chu, Junrong Lu, Xiang Chang, Yiyao Liu, Jingli Qin, Ye Jin, Jialin Song, Xingrui Gu, Jirui Yuan, Guyue Zhou, Jiangtao Gong

    Abstract: The accurate evaluation of the quality of driving behavior is crucial for optimizing and implementing autonomous driving technology in practice. However, there is no comprehensive understanding of good driving behaviors currently. In this paper, we sought to understand driving behaviors from the perspectives of both drivers and passengers. We invited 10 expert drivers and 14 novice drivers to comp… ▽ More

    Submitted 12 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: Part of the content of the paper will be modified. One of the authors has recommended its withdrawal due to personal reasons