Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 223 results for author: Hua, Y

.
  1. arXiv:2408.11288  [pdf

    cs.AI

    Applying and Evaluating Large Language Models in Mental Health Care: A Scoping Review of Human-Assessed Generative Tasks

    Authors: Yining Hua, Hongbin Na, Zehan Li, Fenglin Liu, Xiao Fang, David Clifton, John Torous

    Abstract: Large language models (LLMs) are emerging as promising tools for mental health care, offering scalable support through their ability to generate human-like responses. However, the effectiveness of these models in clinical settings remains unclear. This scoping review aimed to assess the current generative applications of LLMs in mental health care, focusing on studies where these models were teste… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  2. OFL-W3: A One-shot Federated Learning System on Web 3.0

    Authors: Linshan Jiang, Moming Duan, Bingsheng He, Yulin Sun, Peishen Yan, Yang Hua, Tao Song

    Abstract: Federated Learning (FL) addresses the challenges posed by data silos, which arise from privacy, security regulations, and ownership concerns. Despite these barriers, FL enables these isolated data repositories to participate in collaborative learning without compromising privacy or security. Concurrently, the advancement of blockchain technology and decentralized applications (DApps) within Web 3.… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: VLDB 24 demo paper

  3. arXiv:2408.01417  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    Talk Less, Interact Better: Evaluating In-context Conversational Adaptation in Multimodal LLMs

    Authors: Yilun Hua, Yoav Artzi

    Abstract: Humans spontaneously use increasingly efficient language as interactions progress, by adapting and forming ad-hoc conventions. This phenomenon has been studied extensively using reference games, showing properties of human language that go beyond relaying intents. It remains unexplored whether multimodal large language models (MLLMs) similarly increase communication efficiency during interactions,… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: Accepted to COLM 2024

  4. arXiv:2407.15389  [pdf, other

    cs.LG cs.CR cs.DC

    Poisoning with A Pill: Circumventing Detection in Federated Learning

    Authors: Hanxi Guo, Hao Wang, Tao Song, Tianhang Zheng, Yang Hua, Haibing Guan, Xiangyu Zhang

    Abstract: Without direct access to the client's data, federated learning (FL) is well-known for its unique strength in data privacy protection among existing distributed machine learning techniques. However, its distributive and iterative nature makes FL inherently vulnerable to various poisoning attacks. To counteract these threats, extensive defenses have been proposed to filter out malicious clients, usi… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  5. arXiv:2406.18916  [pdf, other

    cs.CL cs.AI

    TrustUQA: A Trustful Framework for Unified Structured Data Question Answering

    Authors: Wen Zhang, Long Jin, Yushan Zhu, Jiaoyan Chen, Zhiwei Huang, Junjie Wang, Yin Hua, Lei Liang, Huajun Chen

    Abstract: Natural language question answering (QA) over structured data sources such as tables and knowledge graphs (KGs) have been widely investigated, for example with Large Language Models (LLMs). The main solutions include question to formal query parsing and retrieval-based answer generation. However, current methods of the former often suffer from weak generalization, failing to dealing with multiple… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  6. arXiv:2406.15490  [pdf, other

    cs.CL cs.AI cs.LG

    Causal Discovery Inspired Unsupervised Domain Adaptation for Emotion-Cause Pair Extraction

    Authors: Yuncheng Hua, Yujin Huang, Shuo Huang, Tao Feng, Lizhen Qu, Chris Bain, Richard Bassed, Gholamreza Haffari

    Abstract: This paper tackles the task of emotion-cause pair extraction in the unsupervised domain adaptation setting. The problem is challenging as the distributions of the events causing emotions in target domains are dramatically different than those in source domains, despite the distributions of emotional expressions between domains are overlapped. Inspired by causal discovery, we propose a novel deep l… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures, 4 tables; Under Review in EMNLP 2024

    ACM Class: I.2.4

  7. arXiv:2406.10882  [pdf, other

    cs.CL

    SCAR: Efficient Instruction-Tuning for Large Language Models via Style Consistency-Aware Response Ranking

    Authors: Zhuang Li, Yuncheng Hua, Thuy-Trang Vu, Haolan Zhan, Lizhen Qu, Gholamreza Haffari

    Abstract: Recent studies have shown that maintaining a consistent response style by human experts and enhancing data quality in training sets can significantly improve the performance of fine-tuned Large Language Models (LLMs) while reducing the number of training examples needed. However, the precise definition of style and the relationship between style, data quality, and LLM performance remains unclear.… ▽ More

    Submitted 10 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: 21 pages

  8. arXiv:2406.10633  [pdf, other

    cs.CV cs.GR

    fNeRF: High Quality Radiance Fields from Practical Cameras

    Authors: Yi Hua, Christoph Lassner, Carsten Stoll, Iain Matthews

    Abstract: In recent years, the development of Neural Radiance Fields has enabled a previously unseen level of photo-realistic 3D reconstruction of scenes and objects from multi-view camera data. However, previous methods use an oversimplified pinhole camera model resulting in defocus blur being `baked' into the reconstructed radiance field. We propose a modification to the ray casting that leverages the opt… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  9. arXiv:2406.09838  [pdf, other

    cs.CV cs.AI

    Vision-Language Models Meet Meteorology: Developing Models for Extreme Weather Events Detection with Heatmaps

    Authors: Jian Chen, Peilin Zhou, Yining Hua, Dading Chong, Meng Cao, Yaowei Li, Zixuan Yuan, Bing Zhu, Junwei Liang

    Abstract: Real-time detection and prediction of extreme weather protect human lives and infrastructure. Traditional methods rely on numerical threshold setting and manual interpretation of weather heatmaps with Geographic Information Systems (GIS), which can be slow and error-prone. Our research redefines Extreme Weather Events Detection (EWED) by framing it as a Visual Question Answering (VQA) problem, the… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  10. arXiv:2405.19931  [pdf, other

    cs.CV cs.AI cs.LG

    Exploring Diffusion Models' Corruption Stage in Few-Shot Fine-tuning and Mitigating with Bayesian Neural Networks

    Authors: Xiaoyu Wu, Jiaru Zhang, Yang Hua, Bohan Lyu, Hao Wang, Tao Song, Haibing Guan

    Abstract: Few-shot fine-tuning of Diffusion Models (DMs) is a key advancement, significantly reducing training costs and enabling personalized AI applications. However, we explore the training dynamics of DMs and observe an unanticipated phenomenon: during the training process, image fidelity initially improves, then unexpectedly deteriorates with the emergence of noisy patterns, only to recover later with… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Preprint. Under review

  11. arXiv:2405.16663  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Private Edge Density Estimation for Random Graphs: Optimal, Efficient and Robust

    Authors: Hongjie Chen, Jingqiu Ding, Yiding Hua, David Steurer

    Abstract: We give the first polynomial-time, differentially node-private, and robust algorithm for estimating the edge density of Erdős-Rényi random graphs and their generalization, inhomogeneous random graphs. We further prove information-theoretical lower bounds, showing that the error rate of our algorithm is optimal up to logarithmic factors. Previous algorithms incur either exponential running time or… ▽ More

    Submitted 3 June, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: fix minor typos; add missing references

  12. arXiv:2405.09980  [pdf, other

    cs.CL cs.AI

    FinTextQA: A Dataset for Long-form Financial Question Answering

    Authors: Jian Chen, Peilin Zhou, Yining Hua, Yingxin Loh, Kehui Chen, Ziyuan Li, Bing Zhu, Junwei Liang

    Abstract: Accurate evaluation of financial question answering (QA) systems necessitates a comprehensive dataset encompassing diverse question types and contexts. However, current financial QA datasets lack scope diversity and question complexity. This work introduces FinTextQA, a novel dataset for long-form question answering (LFQA) in finance. FinTextQA comprises 1,262 high-quality, source-attributed QA pa… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  13. arXiv:2405.00716  [pdf, other

    cs.CL cs.AI

    Large Language Models in the Clinic: A Comprehensive Benchmark

    Authors: Andrew Liu, Hongjian Zhou, Yining Hua, Omid Rohanian, Anshul Thakur, Lei Clifton, David A. Clifton

    Abstract: The adoption of large language models (LLMs) to assist clinicians has attracted remarkable attention. Existing works mainly adopt the close-ended question-answering (QA) task with answer options for evaluation. However, many clinical decisions involve answering open-ended questions without pre-set options. To better understand LLMs in the clinic, we construct a benchmark ClinicBench. We first coll… ▽ More

    Submitted 26 June, 2024; v1 submitted 25 April, 2024; originally announced May 2024.

  14. arXiv:2404.19007  [pdf, other

    cs.CL cs.AI cs.CY

    How Did We Get Here? Summarizing Conversation Dynamics

    Authors: Yilun Hua, Nicholas Chernogor, Yuzhe Gu, Seoyeon Julie Jeong, Miranda Luo, Cristian Danescu-Niculescu-Mizil

    Abstract: Throughout a conversation, the way participants interact with each other is in constant flux: their tones may change, they may resort to different strategies to convey their points, or they might alter their interaction patterns. An understanding of these dynamics can complement that of the actual facts and opinions discussed, offering a more holistic view of the trajectory of the conversation: ho… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: To appear in the Proceedings of NAACL 2024. Data available in ConvoKit https://convokit.cornell.edu/

  15. arXiv:2404.13504  [pdf, other

    cs.CL

    IMO: Greedy Layer-Wise Sparse Representation Learning for Out-of-Distribution Text Classification with Pre-trained Models

    Authors: Tao Feng, Lizhen Qu, Zhuang Li, Haolan Zhan, Yuncheng Hua, Gholamreza Haffari

    Abstract: Machine learning models have made incredible progress, but they still struggle when applied to examples from unseen domains. This study focuses on a specific problem of domain generalization, where a model is trained on one source domain and tested on multiple target domains that are unseen during training. We propose IMO: Invariant features Masks for Out-of-Distribution text classification, to ac… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  16. arXiv:2404.04741  [pdf

    physics.ins-det

    Theory and mitigation of motional eddy current in high-field eddy current shielding

    Authors: Seung-Kyun Lee, Yihe Hua

    Abstract: Eddy current shielding by a Faraday cage is an effective way to shield alternating-current (AC) magnetic fields in scientific instrumentation. In a strong static magnetic field, however, the eddy current in the conductive shield is subject to the Lorentz force which causes the shield to vibrate. In addition to mechanical issues, such vibration induces motional eddy current in the shield that can d… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  17. arXiv:2403.15760  [pdf, other

    cs.AI cs.DC

    An Upload-Efficient Scheme for Transferring Knowledge From a Server-Side Pre-trained Generator to Clients in Heterogeneous Federated Learning

    Authors: Jianqing Zhang, Yang Liu, Yang Hua, Jian Cao

    Abstract: Heterogeneous Federated Learning (HtFL) enables task-specific knowledge sharing among clients with different model architectures while preserving privacy. Despite recent research progress, transferring knowledge in HtFL is still difficult due to data and model heterogeneity. To tackle this, we introduce a public pre-trained generator (e.g., StyleGAN or Stable Diffusion) as the bridge and propose a… ▽ More

    Submitted 19 August, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024. We have incorporated additional analysis for the Stable Diffusion experiments in Appendix A

  18. arXiv:2403.12213  [pdf, ps, other

    cs.DS cs.CC cs.LG stat.ML

    Private graphon estimation via sum-of-squares

    Authors: Hongjie Chen, Jingqiu Ding, Tommaso d'Orsi, Yiding Hua, Chih-Hung Liu, David Steurer

    Abstract: We develop the first pure node-differentially-private algorithms for learning stochastic block models and for graphon estimation with polynomial running time for any constant number of blocks. The statistical utility guarantees match those of the previous best information-theoretic (exponential-time) node-private mechanisms for these problems. The algorithm is based on an exponential mechanism for… ▽ More

    Submitted 18 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 71 pages, accepted to STOC 2024

  19. arXiv:2403.11162  [pdf, other

    cs.CV cs.AI cs.CR cs.CY cs.LG

    CGI-DM: Digital Copyright Authentication for Diffusion Models via Contrasting Gradient Inversion

    Authors: Xiaoyu Wu, Yang Hua, Chumeng Liang, Jiaru Zhang, Hao Wang, Tao Song, Haibing Guan

    Abstract: Diffusion Models (DMs) have evolved into advanced image generation tools, especially for few-shot generation where a pretrained model is fine-tuned on a small set of images to capture a specific style or object. Despite their success, concerns exist about potential copyright violations stemming from the use of unauthorized data in this process. In response, we present Contrasting Gradient Inversio… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  20. arXiv:2403.10051  [pdf, other

    cs.DB

    Accelerating Regular Path Queries over Graph Database with Processing-in-Memory

    Authors: Ruoyan Ma, Shengan Zheng, Guifeng Wang, Jin Pu, Yifan Hua, Wentao Wang, Linpeng Huang

    Abstract: Regular path queries (RPQs) in graph databases are bottlenecked by the memory wall. Emerging processing-in-memory (PIM) technologies offer a promising solution to dispatch and execute path matching tasks in parallel within PIM modules. We present Moctopus, a PIM-based data management system for graph databases that supports efficient batch RPQs and graph updates. Moctopus employs a PIM-friendly dy… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  21. arXiv:2403.06438  [pdf, other

    cs.IT eess.SP

    Unification of Secret Key Generation and Wiretap Channel Transmission

    Authors: Yingbo Hua, Md Saydur Rahman

    Abstract: This paper presents further insights into a recently developed round-trip communication scheme called ``Secret-message Transmission by Echoing Encrypted Probes (STEEP)''. A legitimate wireless channel between a multi-antenna user (Alice) and a single-antenna user (Bob) in the presence of a multi-antenna eavesdropper (Eve) is focused on. STEEP does not require full-duplex, channel reciprocity or Ev… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted for presentation at IEEE ICC 2024

  22. Secret-Key Capacity from MIMO Channel Probing

    Authors: Yingbo Hua, Ahmed Maksud

    Abstract: Revealing expressions of secret-key capacity (SKC) based on data sets from Gaussian MIMO channel probing are presented. It is shown that Maurer's upper and lower bounds on SKC coincide when the used data sets are produced from one-way channel probing. As channel coherence time increases, SKC in bits per probing channel use is always lower bounded by a positive value unless eavesdropper's observati… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in IEEE Wireless Communications Letters

  23. arXiv:2402.11541  [pdf, other

    cs.CL cs.AI

    Large Language Models Can Better Understand Knowledge Graphs Than We Thought

    Authors: Xinbang Dai, Yuncheng Hua, Tongtong Wu, Yang Sheng, Qiu Ji, Guilin Qi

    Abstract: As the parameter scale of large language models (LLMs) grows, jointly training knowledge graph (KG) embeddings with model parameters to enhance LLM capabilities becomes increasingly costly. Consequently, the community has shown interest in developing prompt strategies that effectively integrate KG information into LLMs. However, the format for incorporating KGs into LLMs lacks standardization; for… ▽ More

    Submitted 16 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 15 pages

    ACM Class: I.2.4; I.2.7

  24. arXiv:2402.11178  [pdf, other

    cs.CL

    RENOVI: A Benchmark Towards Remediating Norm Violations in Socio-Cultural Conversations

    Authors: Haolan Zhan, Zhuang Li, Xiaoxi Kang, Tao Feng, Yuncheng Hua, Lizhen Qu, Yi Ying, Mei Rianto Chandra, Kelly Rosalin, Jureynolds Jureynolds, Suraj Sharma, Shilin Qu, Linhao Luo, Lay-Ki Soon, Zhaleh Semnani Azad, Ingrid Zukerman, Gholamreza Haffari

    Abstract: Norm violations occur when individuals fail to conform to culturally accepted behaviors, which may lead to potential conflicts. Remediating norm violations requires social awareness and cultural sensitivity of the nuances at play. To equip interactive AI systems with a remediation ability, we offer ReNoVi - a large-scale corpus of 9,258 multi-turn dialogues annotated with social norms, as well as… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: work in progress. 15 pages, 7 figures

  25. TDViT: Temporal Dilated Video Transformer for Dense Video Tasks

    Authors: Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson

    Abstract: Deep video models, for example, 3D CNNs or video transformers, have achieved promising performance on sparse video tasks, i.e., predicting one result per video. However, challenges arise when adapting existing deep video models to dense video tasks, i.e., predicting one result per frame. Specifically, these models are expensive for deployment, less effective when handling redundant frames, and dif… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  26. Efficient One-stage Video Object Detection by Exploiting Temporal Consistency

    Authors: Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson

    Abstract: Recently, one-stage detectors have achieved competitive accuracy and faster speed compared with traditional two-stage detectors on image data. However, in the field of video object detection (VOD), most existing VOD methods are still based on two-stage detectors. Moreover, directly adapting existing VOD methods to one-stage detectors introduces unaffordable computational costs. In this paper, we f… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  27. Spatio-temporal Prompting Network for Robust Video Feature Extraction

    Authors: Guanxiong Sun, Chi Wang, Zhaoyu Zhang, Jiankang Deng, Stefanos Zafeiriou, Yang Hua

    Abstract: Frame quality deterioration is one of the main challenges in the field of video understanding. To compensate for the information loss caused by deteriorated frames, recent approaches exploit transformer-based integration modules to obtain spatio-temporal information. However, these integration modules are heavy and complex. Furthermore, each integration module is specifically tailored for its targ… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Journal ref: 2023 International Conference on Computer Vision (ICCV) 13541-13551

  28. arXiv:2402.01737  [pdf, other

    cs.CL cs.AI

    Assistive Large Language Model Agents for Socially-Aware Negotiation Dialogues

    Authors: Yuncheng Hua, Lizhen Qu, Gholamreza Haffari

    Abstract: We develop assistive agents based on Large Language Models (LLMs) that aid interlocutors in business negotiations. Specifically, we simulate business negotiations by letting two LLM-based agents engage in role play. A third LLM acts as a remediator agent to rewrite utterances violating norms for improving negotiation outcomes. We introduce a simple tuning-free and label-free In-Context Learning (I… ▽ More

    Submitted 18 June, 2024; v1 submitted 29 January, 2024; originally announced February 2024.

    Comments: 25 pages, 3 figures, 13 tables; Under review in EMNLP 2024

    ACM Class: I.2.7

  29. arXiv:2402.01736  [pdf, other

    cs.CL cs.AI

    SADAS: A Dialogue Assistant System Towards Remediating Norm Violations in Bilingual Socio-Cultural Conversations

    Authors: Yuncheng Hua, Zhuang Li, Linhao Luo, Kadek Ananta Satriadi, Tao Feng, Haolan Zhan, Lizhen Qu, Suraj Sharma, Ingrid Zukerman, Zhaleh Semnani-Azad, Gholamreza Haffari

    Abstract: In today's globalized world, bridging the cultural divide is more critical than ever for forging meaningful connections. The Socially-Aware Dialogue Assistant System (SADAS) is our answer to this global challenge, and it's designed to ensure that conversations between individuals from diverse cultural backgrounds unfold with respect and understanding. Our system's novel architecture includes: (1)… ▽ More

    Submitted 29 January, 2024; originally announced February 2024.

    Comments: 8 pages, 2 figures

    ACM Class: I.2.7

  30. arXiv:2402.01097  [pdf, other

    cs.CL

    Let's Negotiate! A Survey of Negotiation Dialogue Systems

    Authors: Haolan Zhan, Yufei Wang, Tao Feng, Yuncheng Hua, Suraj Sharma, Zhuang Li, Lizhen Qu, Zhaleh Semnani Azad, Ingrid Zukerman, Gholamreza Haffari

    Abstract: Negotiation is a crucial ability in human communication. Recently, there has been a resurgent research interest in negotiation dialogue systems, whose goal is to create intelligent agents that can assist people in resolving conflicts or reaching agreements. Although there have been many explorations into negotiation dialogue systems, a systematic review of this task has not been performed to date.… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted by EACL 2024 (findings). arXiv admin note: substantial text overlap with arXiv:2212.09072

  31. MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection

    Authors: Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson

    Abstract: State-of-the-art video object detection methods maintain a memory structure, either a sliding window or a memory queue, to enhance the current frame using attention mechanisms. However, we argue that these memory structures are not efficient or sufficient because of two implied operations: (1) concatenating all features in memory for enhancement, leading to a heavy computational cost; (2) frame-wi… ▽ More

    Submitted 1 February, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: update code url https://github.com/guanxiongsun/vfe.pytorch

    Journal ref: In Proceedings of the AAAI Conference on Artificial Intelligence 2021 (Vol. 35, No. 3, pp. 2620-2627)

  32. arXiv:2401.03230  [pdf, other

    cs.LG cs.CR cs.DC

    FedTGP: Trainable Global Prototypes with Adaptive-Margin-Enhanced Contrastive Learning for Data and Model Heterogeneity in Federated Learning

    Authors: Jianqing Zhang, Yang Liu, Yang Hua, Jian Cao

    Abstract: Recently, Heterogeneous Federated Learning (HtFL) has attracted attention due to its ability to support heterogeneous models and data. To reduce the high communication cost of transmitting model parameters, a major challenge in HtFL, prototype-based HtFL methods are proposed to solely share class representatives, a.k.a, prototypes, among heterogeneous clients while maintaining the privacy of clien… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI2024

  33. arXiv:2401.02984  [pdf

    cs.CL cs.AI

    Large Language Models in Mental Health Care: a Scoping Review

    Authors: Yining Hua, Fenglin Liu, Kailai Yang, Zehan Li, Hongbin Na, Yi-han Sheu, Peilin Zhou, Lauren V. Moran, Sophia Ananiadou, Andrew Beam, John Torous

    Abstract: The integration of large language models (LLMs) in mental health care is an emerging field. There is a need to systematically review the application outcomes and delineate the advantages and limitations in clinical settings. This review aims to provide a comprehensive overview of the use of LLMs in mental health care, assessing their efficacy, challenges, and potential for future applications. A s… ▽ More

    Submitted 21 August, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

  34. arXiv:2312.12484  [pdf, other

    cs.CR cs.DC cs.LG

    SkyMask: Attack-agnostic Robust Federated Learning with Fine-grained Learnable Masks

    Authors: Peishen Yan, Hao Wang, Tao Song, Yang Hua, Ruhui Ma, Ningxin Hu, Mohammad R. Haghighat, Haibing Guan

    Abstract: Federated Learning (FL) is becoming a popular paradigm for leveraging distributed data and preserving data privacy. However, due to the distributed characteristic, FL systems are vulnerable to Byzantine attacks that compromised clients attack the global model by uploading malicious model updates. With the development of layer-level and parameter-level fine-grained attacks, the attacks' stealthines… ▽ More

    Submitted 18 July, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted by ECCV2024

  35. arXiv:2312.04992  [pdf, ps, other

    cs.LG cs.DC

    PFLlib: Personalized Federated Learning Algorithm Library

    Authors: Jianqing Zhang, Yang Liu, Yang Hua, Hao Wang, Tao Song, Zhengui Xue, Ruhui Ma, Jian Cao

    Abstract: Amid the ongoing advancements in Federated Learning (FL), a machine learning paradigm that allows collaborative learning with data privacy protection, personalized FL (pFL) has gained significant prominence as a research direction within the FL domain. Whereas traditional FL (tFL) focuses on jointly learning a global model, pFL aims to achieve a balance between the global and personalized objectiv… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  36. arXiv:2312.03290  [pdf, other

    cs.AI cs.CL

    Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym

    Authors: Junjie Sheng, Zixiao Huang, Chuyun Shen, Wenhao Li, Yun Hua, Bo Jin, Hongyuan Zha, Xiangfeng Wang

    Abstract: The formidable capacity for zero- or few-shot decision-making in language agents encourages us to pose a compelling question: Can language agents be alternatives to PPO agents in traditional sequential decision-making tasks? To investigate this, we first take environments collected in OpenAI Gym as our testbeds and ground them to textual environments that construct the TextGym simulator. This allo… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  37. arXiv:2311.14975  [pdf, other

    cs.LG cs.DC

    Eliminating Domain Bias for Federated Learning in Representation Space

    Authors: Jianqing Zhang, Yang Hua, Jian Cao, Hao Wang, Tao Song, Zhengui Xue, Ruhui Ma, Haibing Guan

    Abstract: Recently, federated learning (FL) is popular for its privacy-preserving and collaborative learning abilities. However, under statistically heterogeneous scenarios, we observe that biased data domains on clients cause a representation bias phenomenon and further degenerate generic representations during local training, i.e., the representation degeneration phenomenon. To address these issues, we pr… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS 2023, 24 pages

  38. arXiv:2311.10777  [pdf, other

    cs.CL cs.LG

    A Systematic Review of Aspect-based Sentiment Analysis: Domains, Methods, and Trends

    Authors: Yan Cathy Hua, Paul Denny, Katerina Taskova, Jörg Wicker

    Abstract: Aspect-based Sentiment Analysis (ABSA) is a fine-grained type of sentiment analysis that identifies aspects and their associated opinions from a given text. With the surge of digital opinionated text data, ABSA gained increasing popularity for its ability to mine more detailed and targeted insights. Many review papers on ABSA subtasks and solution methodologies exist, however, few focus on trends… ▽ More

    Submitted 26 July, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  39. arXiv:2311.05112  [pdf

    cs.CL cs.AI

    A Survey of Large Language Models in Medicine: Progress, Application, and Challenge

    Authors: Hongjian Zhou, Fenglin Liu, Boyang Gu, Xinyu Zou, Jinfa Huang, Jinge Wu, Yiru Li, Sam S. Chen, Peilin Zhou, Junling Liu, Yining Hua, Chengfeng Mao, Chenyu You, Xian Wu, Yefeng Zheng, Lei Clifton, Zheng Li, Jiebo Luo, David A. Clifton

    Abstract: Large language models (LLMs), such as ChatGPT, have received substantial attention due to their capabilities for understanding and generating human language. While there has been a burgeoning trend in research focusing on the employment of LLMs in supporting different medical tasks (e.g., enhancing clinical diagnostics and providing medical education), a review of these efforts, particularly their… ▽ More

    Submitted 22 July, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: Preprint. Version 6. Update Figures 1-5; Tables 2-3; 31 pages

  40. arXiv:2311.04199  [pdf, other

    cs.IR cs.CL

    Exploring Recommendation Capabilities of GPT-4V(ision): A Preliminary Case Study

    Authors: Peilin Zhou, Meng Cao, You-Liang Huang, Qichen Ye, Peiyan Zhang, Junling Liu, Yueqi Xie, Yining Hua, Jaeboum Kim

    Abstract: Large Multimodal Models (LMMs) have demonstrated impressive performance across various vision and language tasks, yet their potential applications in recommendation tasks with visual assistance remain unexplored. To bridge this gap, we present a preliminary case study investigating the recommendation capabilities of GPT-4V(ison), a recently released LMM by OpenAI. We construct a series of qualitat… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: In Progress

  41. arXiv:2311.00204  [pdf, other

    cs.CL cs.AI

    Continuous Training and Fine-tuning for Domain-Specific Language Models in Medical Question Answering

    Authors: Zhen Guo, Yining Hua

    Abstract: Large language models exhibit promising general capabilities but often lack specialized knowledge for domain-specific tasks. Developing domain experts from a base model enables a range of applications without prohibitive training costs. This work demonstrates a method using continuous training and instruction fine-tuning to rapidly adapt Llama 2 base models to the Chinese medical domain. We first… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

  42. arXiv:2310.17956  [pdf, other

    cs.CV cs.AI cs.CL

    Qilin-Med-VL: Towards Chinese Large Vision-Language Model for General Healthcare

    Authors: Junling Liu, Ziming Wang, Qichen Ye, Dading Chong, Peilin Zhou, Yining Hua

    Abstract: Large Language Models (LLMs) have introduced a new era of proficiency in comprehending complex healthcare and biomedical topics. However, there is a noticeable lack of models in languages other than English and models that can interpret multi-modal input, which is crucial for global healthcare accessibility. In response, this study introduces Qilin-Med-VL, the first Chinese large vision-language m… ▽ More

    Submitted 1 November, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

  43. arXiv:2310.17222   

    physics.app-ph

    Maximum Power and The Corresponding Efficiency for A Carnot-like Thermoelectric Cycle Based on Fluctuation Theorem

    Authors: Yuchao Hua, Lingai Luo, Zeng-Yuan Guo

    Abstract: Here, we investigate the maximum power and corresponding efficiency of thermoelectric generators through devising a set of protocols for the isothermal and adiabatic processes of thermoelectricity to build a Carnot-like thermoelectric cycle, with the analysis based on fluctuation theorem (FT). First of all, the Carnot efficiency can be readily obtained for the quasi-static thermoelectric cycle, wi… ▽ More

    Submitted 25 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: the manuscript should be further revised

  44. arXiv:2310.13028  [pdf, other

    cs.CL cs.AI

    Reliable Academic Conference Question Answering: A Study Based on Large Language Model

    Authors: Zhiwei Huang, Juan Li, Long Jin, Junjie Wang, Mingchen Tu, Yin Hua, Zhiqiang Liu, Jiawei Meng, Wen Zhang

    Abstract: As the development of academic conferences fosters global scholarly communication, researchers consistently need to obtain accurate and up-to-date information about academic conferences. Since the information is scattered, using an intelligent question-answering system to efficiently handle researchers' queries and ensure awareness of the latest advancements is necessary. Recently, Large Language… ▽ More

    Submitted 4 August, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: 12 pages, 3 figures

  45. arXiv:2310.11971  [pdf, other

    cs.LG cs.AI

    Improving Generalization of Alignment with Human Preferences through Group Invariant Learning

    Authors: Rui Zheng, Wei Shen, Yuan Hua, Wenbin Lai, Shihan Dou, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Haoran Huang, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: The success of AI assistants based on language models (LLMs) hinges crucially on Reinforcement Learning from Human Feedback (RLHF), which enables the generation of responses more aligned with human preferences. As universal AI assistants, there's a growing expectation for them to perform consistently across various domains. However, previous work shows that Reinforcement Learning (RL) often exploi… ▽ More

    Submitted 25 December, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

  46. arXiv:2310.10164  [pdf

    physics.optics physics.app-ph

    Performance Enhancement via XPM Suppression in a Linear all-PM NPE Mode-locked Fiber Oscillator

    Authors: Marvin Edelmann, Yi Hua, Mikhail Pergament, Franz X. Kärtner

    Abstract: We demonstrate strong performance enhancement of an all polarization-maintaining fiber oscillator mode-locked using NPE in a linear self-stabilized fiber interferometer via suppression of cross-phase modulation (XPM). Numerical simulations reveal that XPM significantly affects the saturable absorber dynamics resulting in distortions of mode-locked steady-states. In the experiment, we construct an… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  47. arXiv:2310.09089  [pdf, other

    cs.CL

    Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model

    Authors: Qichen Ye, Junling Liu, Dading Chong, Peilin Zhou, Yining Hua, Fenglin Liu, Meng Cao, Ziming Wang, Xuxin Cheng, Zhu Lei, Zhenhua Guo

    Abstract: Integrating large language models (LLMs) into healthcare holds great potential but faces challenges. Pre-training LLMs from scratch for domains like medicine is resource-heavy and often unfeasible. On the other hand, sole reliance on Supervised Fine-tuning (SFT) can result in overconfident predictions and may not tap into domain-specific insights. In response, we present a multi-stage training met… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  48. arXiv:2310.07079  [pdf, other

    cs.CR cs.LG

    Secure Decentralized Learning with Blockchain

    Authors: Xiaoxue Zhang, Yifan Hua, Chen Qian

    Abstract: Federated Learning (FL) is a well-known paradigm of distributed machine learning on mobile and IoT devices, which preserves data privacy and optimizes communication efficiency. To avoid the single point of failure problem in FL, decentralized federated learning (DFL) has been proposed to use peer-to-peer communication for model aggregation, which has been considered an attractive solution for mach… ▽ More

    Submitted 11 March, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    ACM Class: I.2.11; C.2.4

  49. arXiv:2310.00428  [pdf

    cs.OS

    First Principles of Big Memory Systems

    Authors: Yu Hua

    Abstract: In this paper, we comprehensively analyze the vertical and horizontal extensions of existing memory hierarchy. The difference between memory and big memory is well reported. We present the state-of-the-art studies upon the big memory systems, together with design methodology and implementations. Persistence is the first principle of big memory systems. We further show the full-stack and moving per… ▽ More

    Submitted 9 December, 2023; v1 submitted 30 September, 2023; originally announced October 2023.

  50. arXiv:2309.14529  [pdf, other

    cs.IT eess.SP

    Secret-Message Transmission by Echoing Encrypted Probes -- STEEP

    Authors: Yingbo Hua

    Abstract: This paper examines the properties of the lower and upper bounds established by Maurer, Ahlswede and Csiszar (MAC) for secret-key capacity in the case of channel probing over single-input and single-output (SISO) channels. Inspired by the insights into MAC's bounds, a scheme called secret-message transmission by echoing encrypted probes (STEEP) is proposed. STEEP consists of two phases: in phase 1… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.