Author: Hu, Xia : Search

research-article

Free

JUST ACCEPTED

Efficient GNN Explanation via Learning Removal-based Attribution

ACM Transactions on Knowledge Discovery from Data (TKDD), Just Accepted https://doi.org/10.1145/3685678

As Graph Neural Networks (GNNs) have been widely used in real-world applications, model explanations are required not only by users but also by legal regulations. However, simultaneously achieving high fidelity and low computational costs in generating ...

research-article

Free

JUST ACCEPTED

Fair-RGNN: Mitigating Relational Bias on Knowledge Graphs

ACM Transactions on Knowledge Discovery from Data (TKDD), Just Accepted https://doi.org/10.1145/3681792

Knowledge graph data are prevalent in real-world applications, and knowledge graph neural networks (KGNNs) are essential techniques for knowledge graph representation learning. Although KGNN effectively models the structural information from knowledge ...

tutorial

Open Access

Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond

ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 18, Issue 6Article No.: 160, Pages 1–32https://doi.org/10.1145/3649506

This article presents a comprehensive and practical guide for practitioners and end-users working with Large Language Models (LLMs) in their downstream Natural Language Processing (NLP) tasks. We provide discussions and insights into the usage of LLMs ...

research-article

Free

The Science of Detecting LLM-Generated Text

Communications of the ACM (CACM), Volume 67, Issue 4April 2024, Pages 50–59https://doi.org/10.1145/3624725

While many detection methods have been proposed, understanding the challenges is far more daunting.

research-article

SPeC: A Soft Prompt-Based Calibration on Performance Variability of Large Language Model in Clinical Notes Summarization

Journal of Biomedical Informatics (JOBI), Volume 151, Issue CMar 2024https://doi.org/10.1016/j.jbi.2024.104606

Abstract

Electronic health records (EHRs) store an extensive array of patient information, encompassing medical histories, diagnoses, treatments, and test outcomes. These records are crucial for enabling healthcare providers to make well-informed ...

Graphical abstract

Display Omitted

research-article

Open Access

Shortcut Learning of Large Language Models in Natural Language Understanding

Communications of the ACM (CACM), Volume 67, Issue 1January 2024, Pages 110–120https://doi.org/10.1145/3596490

Shortcuts often hinder the robustness of large language models.

research-article

Fair graph distillation

NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsDecember 2023, Article No.: 3535, Pages 80644–80660

As graph neural networks (GNNs) struggle with large-scale graphs due to high computational demands, graph data distillation promises to alleviate this issue by distilling a large real graph into a smaller distilled graph while maintaining comparable ...

research-article

Setting the trap: capturing and defeating backdoors in pretrained language models through honeypots

NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsDecember 2023, Article No.: 3199, Pages 73191–73210

In the field of natural language processing, the prevalent approach involves fine-tuning pretrained language models (PLMs) using local samples. Recent research has exposed the susceptibility of PLMs to backdoor attacks, wherein the adversaries can embed ...

research-article

Chasing fairness under distribution shift: a model weight perturbation approach

NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsDecember 2023, Article No.: 2793, Pages 63931–63944

Fairness in machine learning has attracted increasing attention in recent years. The fairness methods improving algorithmic fairness for in-distribution data may not perform well under distribution shifts. In this paper, we first theoretically ...

research-article

One less reason for filter-pruning: gaining free adversarial robustness with structured grouped kernel pruning

NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsDecember 2023, Article No.: 2712, Pages 62032–62061

Densely structured pruning methods utilizing simple pruning heuristics can deliver immediate compression and acceleration benefits with acceptable benign performances. However, empirical findings indicate such naïvely pruned networks are extremely ...

research-article

Winner-take-all column row sampling for memory efficient adaptation of language model

NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsDecember 2023, Article No.: 150, Pages 3402–3424

As the model size grows rapidly, fine-tuning the large pre-trained language model has become increasingly difficult due to its extensive memory usage. Previous works usually focus on reducing the number of trainable parameters in the network. While the ...

short-paper

Exposing Model Theft: A Robust and Transferable Watermark for Thwarting Model Extraction Attacks

CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge ManagementOctober 2023, Pages 4315–4319https://doi.org/10.1145/3583780.3615211

The increasing prevalence of Deep Neural Networks (DNNs) in cloud-based services has led to their widespread use through various APIs. However, recent studies reveal the susceptibility of these public APIs to model extraction attacks, where adversaries ...

research-article

Tackling Diverse Minorities in Imbalanced Classification

CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge ManagementOctober 2023, Pages 1178–1187https://doi.org/10.1145/3583780.3615071

Imbalanced datasets are commonly observed in various real-world applications, presenting significant challenges in training classifiers. When working with large datasets, the imbalanced issue can be further exacerbated, making it exceptionally difficult ...

short-paper

DiscoverPath: A Knowledge Refinement and Retrieval System for Interdisciplinarity on Biomedical Research

CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge ManagementOctober 2023, Pages 5021–5025https://doi.org/10.1145/3583780.3614739

The exponential growth in scholarly publications necessitates advanced tools for efficient article retrieval, especially in interdisciplinary fields where diverse terminologies are used to describe similar research. Traditional keyword-based search ...

Article

Deep Serial Number: Computational Watermark for DNN Intellectual Property Protection

Machine Learning and Knowledge Discovery in Databases: Applied Data Science and Demo TrackSep 2023, Pages 157–173https://doi.org/10.1007/978-3-031-43427-3_10

Abstract

In this paper, we present DSN (Deep Serial Number), a simple yet effective watermarking algorithm designed specifically for deep neural networks (DNNs). Unlike traditional methods that incorporate identification signals into DNNs, our approach ...

Article

Mitigating Algorithmic Bias with Limited Annotations

Machine Learning and Knowledge Discovery in Databases: Research TrackSep 2023, Pages 241–258https://doi.org/10.1007/978-3-031-43415-0_15

Abstract

Existing work on fairness modeling commonly assumes that sensitive attributes for all instances are fully available, which may not be true in many real-world applications due to the high cost of acquiring sensitive information. When sensitive ...

research-article

Can Attention Be Used to Explain EHR-Based Mortality Prediction Tasks: A Case Study on Hemorrhagic Stroke

BCB '23: Proceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology, and Health InformaticsSeptember 2023, Article No.: 26, Pages 1–6https://doi.org/10.1145/3584371.3613002

Stroke is a significant cause of mortality and morbidity, necessitating early predictive strategies to minimize risks. Traditional methods for evaluating patients, such as Acute Physiology and Chronic Health Evaluation (APACHE II, IV) and Simplified ...

research-article

Probabilistic masked attention networks for explainable sequential recommendation

IJCAI '23: Proceedings of the Thirty-Second International Joint Conference on Artificial IntelligenceAugust 2023, Article No.: 230, Pages 2068–2076https://doi.org/10.24963/ijcai.2023/230

Transformer-based models are powerful for modeling temporal dynamics of user preference in sequential recommendation. Most of the variants adopt the Softmax transformation in the self-attention layers to generate dense attention probabilities. However, ...

research-article

Collaborative Graph Neural Networks for Attributed Network Embedding

IEEE Transactions on Knowledge and Data Engineering (IEEECS_TKDE), Volume 36, Issue 3March 2024, Pages 972–986https://doi.org/10.1109/TKDE.2023.3298002

Graph neural networks (GNNs) have shown prominent performance on attributed network embedding. However, existing efforts mainly focus on exploiting network structures, while the exploitation of node attributes is rather limited as they only serve as node ...

research-article

DIVISION: memory efficient training via dual activation precision

ICML'23: Proceedings of the 40th International Conference on Machine LearningJuly 2023, Article No.: 1496, Pages 36036–36057

Activation compressed training provides a solution towards reducing the memory cost of training deep neural networks (DNNs). However, state-of-the-art work combines a search of quantization bitwidth with the training, which makes the procedure ...

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Caption

Efficient GNN Explanation via Learning Removal-based Attribution

Fair-RGNN: Mitigating Relational Bias on Knowledge Graphs

Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond

The Science of Detecting LLM-Generated Text

SPeC: A Soft Prompt-Based Calibration on Performance Variability of Large Language Model in Clinical Notes Summarization

Upcoming Conferences

Shortcut Learning of Large Language Models in Natural Language Understanding

Fair graph distillation

Setting the trap: capturing and defeating backdoors in pretrained language models through honeypots

Chasing fairness under distribution shift: a model weight perturbation approach

One less reason for filter-pruning: gaining free adversarial robustness with structured grouped kernel pruning

Winner-take-all column row sampling for memory efficient adaptation of language model

Exposing Model Theft: A Robust and Transferable Watermark for Thwarting Model Extraction Attacks

Tackling Diverse Minorities in Imbalanced Classification

DiscoverPath: A Knowledge Refinement and Retrieval System for Interdisciplinarity on Biomedical Research

Deep Serial Number: Computational Watermark for DNN Intellectual Property Protection

Mitigating Algorithmic Bias with Limited Annotations

Can Attention Be Used to Explain EHR-Based Mortality Prediction Tasks: A Case Study on Hemorrhagic Stroke

Probabilistic masked attention networks for explainable sequential recommendation

Collaborative Graph Neural Networks for Attributed Network Embedding

DIVISION: memory efficient training via dual activation precision

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder

Upcoming Conferences