Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 81 results for author: Duan, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.08924  [pdf, other

    cs.CR

    Disassembling Obfuscated Executables with LLM

    Authors: Huanyao Rong, Yue Duan, Hang Zhang, XiaoFeng Wang, Hongbo Chen, Shengchen Duan, Shen Wang

    Abstract: Disassembly is a challenging task, particularly for obfuscated executables containing junk bytes, which is designed to induce disassembly errors. Existing solutions rely on heuristics or leverage machine learning techniques, but only achieve limited successes. Fundamentally, such obfuscation cannot be defeated without in-depth understanding of the binary executable's semantics, which is made possi… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.01183  [pdf, other

    cs.DB

    TCSR-SQL: Towards Table Content-aware Text-to-SQL with Self-retrieval

    Authors: Wenbo Xu, Liang Yan, Peiyi Han, Haifeng Zhu, Chuanyi Liu, Shaoming Duan, Cuiyun Gao, Yingwei Liang

    Abstract: Large Language Model-based (LLM-based) Text-to-SQL methods have achieved important progress in generating SQL queries for real-world applications. When confronted with table content-aware questions in real-world scenarios, ambiguous data content keywords and non-existent database schema column names within the question leads to the poor performance of existing methods. To solve this problem, we pr… ▽ More

    Submitted 12 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2406.14549  [pdf, other

    cs.CV cs.LG q-bio.NC

    Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Frontier AI Models

    Authors: Sunny Duan, Mikail Khona, Abhiram Iyer, Rylan Schaeffer, Ila R Fiete

    Abstract: Frontier AI systems are making transformative impacts across society, but such benefits are not without costs: models trained on web-scale datasets containing personal and private data raise profound concerns about data privacy and security. Language models are trained on extensive corpora including potentially sensitive or proprietary information, and the risk of data leakage - where the model re… ▽ More

    Submitted 25 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.12793  [pdf, other

    cs.CL

    ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

    Authors: Team GLM, :, Aohan Zeng, Bin Xu, Bowen Wang, Chenhui Zhang, Da Yin, Dan Zhang, Diego Rojas, Guanyu Feng, Hanlin Zhao, Hanyu Lai, Hao Yu, Hongning Wang, Jiadai Sun, Jiajie Zhang, Jiale Cheng, Jiayi Gui, Jie Tang, Jing Zhang, Jingyu Sun, Juanzi Li, Lei Zhao, Lindong Wu, Lucen Zhong , et al. (34 additional authors not shown)

    Abstract: We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained… ▽ More

    Submitted 29 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  5. arXiv:2406.07436  [pdf, other

    cs.PL

    McEval: Massively Multilingual Code Evaluation

    Authors: Linzheng Chai, Shukai Liu, Jian Yang, Yuwei Yin, Ke Jin, Jiaheng Liu, Tao Sun, Ge Zhang, Changyu Ren, Hongcheng Guo, Zekun Wang, Boyang Wang, Xianjie Wu, Bing Wang, Tongliang Li, Liqun Yang, Sufeng Duan, Zhoujun Li

    Abstract: Code large language models (LLMs) have shown remarkable advances in code understanding, completion, and generation tasks. Programming benchmarks, comprised of a selection of code challenges and corresponding test cases, serve as a standard to evaluate the capability of different LLMs in such tasks. However, most existing benchmarks primarily focus on Python and are still restricted to a limited nu… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 22 pages

  6. arXiv:2406.07032  [pdf, other

    cs.CV

    RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks

    Authors: Zhechao Wang, Peirui Cheng, Pengju Tian, Yuchao Wang, Mingxin Chen, Shujing Duan, Zhirui Wang, Xinming Li, Xian Sun

    Abstract: Remote sensing lightweight foundation models have achieved notable success in online perception within remote sensing. However, their capabilities are restricted to performing online inference solely based on their own observations and models, thus lacking a comprehensive understanding of large-scale remote sensing scenarios. To overcome this limitation, we propose a Remote Sensing Distributed Fou… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  7. arXiv:2406.06305  [pdf, other

    cs.CV cs.AI

    NeuroMoCo: A Neuromorphic Momentum Contrast Learning Method for Spiking Neural Networks

    Authors: Yuqi Ma, Huamin Wang, Hangchi Shen, Xuemei Chen, Shukai Duan, Shiping Wen

    Abstract: Recently, brain-inspired spiking neural networks (SNNs) have attracted great research attention owing to their inherent bio-interpretability, event-triggered properties and powerful perception of spatiotemporal information, which is beneficial to handling event-based neuromorphic datasets. In contrast to conventional static image datasets, event-based neuromorphic datasets present heightened compl… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 32 pages,4 figures,4 tables

  8. arXiv:2406.02629  [pdf, other

    cs.CR cs.LG

    SSNet: A Lightweight Multi-Party Computation Scheme for Practical Privacy-Preserving Machine Learning Service in the Cloud

    Authors: Shijin Duan, Chenghong Wang, Hongwu Peng, Yukui Luo, Wujie Wen, Caiwen Ding, Xiaolin Xu

    Abstract: As privacy-preserving becomes a pivotal aspect of deep learning (DL) development, multi-party computation (MPC) has gained prominence for its efficiency and strong security. However, the practice of current MPC frameworks is limited, especially when dealing with large neural networks, exemplified by the prolonged execution time of 25.8 seconds for secure inference on ResNet-152. The primary challe… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 16 pages, 9 figures

  9. arXiv:2405.14185  [pdf, other

    cs.LG cs.PF

    A structure-aware framework for learning device placements on computation graphs

    Authors: Shukai Duan, Heng Ping, Nikos Kanakaris, Xiongye Xiao, Peiyu Zhang, Panagiotis Kyriakis, Nesreen K. Ahmed, Guixiang Ma, Mihai Capota, Shahin Nazarian, Theodore L. Willke, Paul Bogdan

    Abstract: Existing approaches for device placement ignore the topological features of computation graphs and rely mostly on heuristic methods for graph partitioning. At the same time, they either follow a grouper-placer or an encoder-placer architecture, which requires understanding the interaction structure between code operations. To bridge the gap between encoder-placer and grouper-placer techniques, we… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  10. arXiv:2405.05542  [pdf, other

    cs.RO cs.MA

    Dynamic Deep Factor Graph for Multi-Agent Reinforcement Learning

    Authors: Yuchen Shi, Shihong Duan, Cheng Xu, Ran Wang, Fangwen Ye, Chau Yuen

    Abstract: This work introduces a novel value decomposition algorithm, termed \textit{Dynamic Deep Factor Graphs} (DDFG). Unlike traditional coordination graphs, DDFG leverages factor graphs to articulate the decomposition of value functions, offering enhanced flexibility and adaptability to complex value function structures. Central to DDFG is a graph structure generation policy that innovatively generates… ▽ More

    Submitted 7 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: submitted to IEEE TPAMI

  11. arXiv:2404.04265  [pdf, other

    cs.IR cs.LG

    Accelerating Matrix Factorization by Dynamic Pruning for Fast Recommendation

    Authors: Yining Wu, Shengyu Duan, Gaole Sai, Chenhong Cao, Guobing Zou

    Abstract: Matrix factorization (MF) is a widely used collaborative filtering (CF) algorithm for recommendation systems (RSs), due to its high prediction accuracy, great flexibility and high efficiency in big data processing. However, with the dramatically increased number of users/items in current RSs, the computational complexity for training a MF model largely increases. Many existing works have accelerat… ▽ More

    Submitted 18 March, 2024; originally announced April 2024.

  12. arXiv:2403.13844  [pdf, other

    cs.LG cs.AI

    Scheduled Knowledge Acquisition on Lightweight Vector Symbolic Architectures for Brain-Computer Interfaces

    Authors: Yejia Liu, Shijin Duan, Xiaolin Xu, Shaolei Ren

    Abstract: Brain-Computer interfaces (BCIs) are typically designed to be lightweight and responsive in real-time to provide users timely feedback. Classical feature engineering is computationally efficient but has low accuracy, whereas the recent neural networks (DNNs) improve accuracy but are computationally expensive and incur high latency. As a promising alternative, the low-dimensional computing (LDC) cl… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted as a full paper by the tinyML Research Symposium 2024

  13. arXiv:2403.06682  [pdf, other

    cs.CL cs.CV cs.CY

    Restoring Ancient Ideograph: A Multimodal Multitask Neural Network Approach

    Authors: Siyu Duan, Jun Wang, Qi Su

    Abstract: Cultural heritage serves as the enduring record of human thought and history. Despite significant efforts dedicated to the preservation of cultural relics, many ancient artefacts have been ravaged irreversibly by natural deterioration and human actions. Deep learning technology has emerged as a valuable tool for restoring various kinds of cultural heritages, including ancient text restoration. Pre… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Accept by Lrec-Coling 2024

  14. arXiv:2403.04204  [pdf, other

    cs.AI cs.CL

    On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models

    Authors: Xinpeng Wang, Shitong Duan, Xiaoyuan Yi, Jing Yao, Shanlin Zhou, Zhihua Wei, Peng Zhang, Dongkuan Xu, Maosong Sun, Xing Xie

    Abstract: Big models have achieved revolutionary breakthroughs in the field of AI, but they might also pose potential concerns. Addressing such concerns, alignment technologies were introduced to make these models conform to human preferences and values. Despite considerable advancements in the past year, various challenges lie in establishing the optimal alignment strategy, such as data cost and scalable o… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 23 pages, 7 figures

  15. arXiv:2403.03419  [pdf, other

    cs.CL cs.AI

    Negating Negatives: Alignment without Human Positive Samples via Distributional Dispreference Optimization

    Authors: Shitong Duan, Xiaoyuan Yi, Peng Zhang, Tun Lu, Xing Xie, Ning Gu

    Abstract: Large language models (LLMs) have revolutionized the role of AI, yet also pose potential risks of propagating unethical content. Alignment technologies have been introduced to steer LLMs towards human preference, gaining increasing attention. Despite notable breakthroughs in this direction, existing methods heavily rely on high-quality positive-negative training pairs, suffering from noisy labels… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  16. arXiv:2402.09725  [pdf, other

    cs.CL cs.AI

    Improving Non-autoregressive Machine Translation with Error Exposure and Consistency Regularization

    Authors: Xinran Chen, Sufeng Duan, Gongshen Liu

    Abstract: Being one of the IR-NAT (Iterative-refinemennt-based NAT) frameworks, the Conditional Masked Language Model (CMLM) adopts the mask-predict paradigm to re-predict the masked low-confidence tokens. However, CMLM suffers from the data distribution discrepancy between training and inference, where the observed tokens are generated differently in the two cases. In this paper, we address this problem wi… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  17. arXiv:2401.00225  [pdf

    eess.AS cs.AI eess.SP

    Enhancing dysarthria speech feature representation with empirical mode decomposition and Walsh-Hadamard transform

    Authors: Ting Zhu, Shufei Duan, Camille Dingam, Huizhi Liang, Wei Zhang

    Abstract: Dysarthria speech contains the pathological characteristics of vocal tract and vocal fold, but so far, they have not yet been included in traditional acoustic feature sets. Moreover, the nonlinearity and non-stationarity of speech have been ignored. In this paper, we propose a feature enhancement algorithm for dysarthria speech called WHFEMD. It combines empirical mode decomposition (EMD) and fast… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

  18. arXiv:2312.08998  [pdf

    eess.AS cs.AI cs.SD eess.SP

    Design, construction and evaluation of emotional multimodal pathological speech database

    Authors: Ting Zhu, Shufei Duan, Huizhi Liang, Wei Zhang

    Abstract: The lack of an available emotion pathology database is one of the key obstacles in studying the emotion expression status of patients with dysarthria. The first Chinese multimodal emotional pathological speech database containing multi-perspective information is constructed in this paper. It includes 29 controls and 39 patients with different degrees of motor dysarthria, expressing happy, sad, ang… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  19. arXiv:2312.05657  [pdf, other

    cs.LG cs.AI cs.PL cs.SE

    Leveraging Reinforcement Learning and Large Language Models for Code Optimization

    Authors: Shukai Duan, Nikos Kanakaris, Xiongye Xiao, Heng Ping, Chenyu Zhou, Nesreen K. Ahmed, Guixiang Ma, Mihai Capota, Theodore L. Willke, Shahin Nazarian, Paul Bogdan

    Abstract: Code optimization is a daunting task that requires a significant level of expertise from experienced programmers. This level of expertise is not sufficient when compared to the rapid development of new hardware architectures. Towards advancing the whole code optimization process, recent approaches rely on machine learning and artificial intelligence techniques. This paper introduces a new framewor… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  20. arXiv:2312.00856  [pdf, other

    cs.CV

    QAFE-Net: Quality Assessment of Facial Expressions with Landmark Heatmaps

    Authors: Shuchao Duan, Amirhossein Dadashzadeh, Alan Whone, Majid Mirmehdi

    Abstract: Facial expression recognition (FER) methods have made great inroads in categorising moods and feelings in humans. Beyond FER, pain estimation methods assess levels of intensity in pain expressions, however assessing the quality of all facial expressions is of critical value in health-related applications. In this work, we address the quality of five different facial expressions in patients affecte… ▽ More

    Submitted 12 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Accepted to ELFA workshop at WACV 2024

  21. arXiv:2311.15179  [pdf, other

    cs.SE

    Estimation of the User Contribution Rate by Leveraging Time Sequence in Pairwise Matching function-point between Users Feedback and App Updating Log

    Authors: Shiqi Duan, Jianxun Liu, Yong Xiao, Xiangping Zhang

    Abstract: Mobile applications have become an inseparable part of people's daily life. Nonetheless, the market competition is extremely fierce, and apps lacking recognition among most users are susceptible to market elimination. To this end, developers must swiftly and accurately apprehend the requirements of the wider user base to effectively strategize and promote their apps' orderly and healthy evolution.… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  22. arXiv:2311.09489  [pdf, other

    cs.CR

    MirrorNet: A TEE-Friendly Framework for Secure On-device DNN Inference

    Authors: Ziyu Liu, Yukui Luo, Shijin Duan, Tong Zhou, Xiaolin Xu

    Abstract: Deep neural network (DNN) models have become prevalent in edge devices for real-time inference. However, they are vulnerable to model extraction attacks and require protection. Existing defense approaches either fail to fully safeguard model confidentiality or result in significant latency issues. To overcome these challenges, this paper presents MirrorNet, which leverages Trusted Execution Enviro… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted by ICCAD 2023

  23. arXiv:2311.07619  [pdf, other

    cs.IR cs.AI

    Modeling User Viewing Flow Using Large Language Models for Article Recommendation

    Authors: Zhenghao Liu, Zulong Chen, Moufeng Zhang, Shaoyang Duan, Hong Wen, Liangyue Li, Nan Li, Yu Gu, Ge Yu

    Abstract: This paper proposes the User Viewing Flow Modeling (SINGLE) method for the article recommendation task, which models the user constant preference and instant interest from user-clicked articles. Specifically, we first employ a user constant viewing flow modeling method to summarize the user's general interest to recommend articles. In this case, we utilize Large Language Models (LLMs) to capture c… ▽ More

    Submitted 7 March, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: Accepted by WebConf 2024

  24. arXiv:2311.07603  [pdf, other

    cs.CV

    PECoP: Parameter Efficient Continual Pretraining for Action Quality Assessment

    Authors: Amirhossein Dadashzadeh, Shuchao Duan, Alan Whone, Majid Mirmehdi

    Abstract: The limited availability of labelled data in Action Quality Assessment (AQA), has forced previous works to fine-tune their models pretrained on large-scale domain-general datasets. This common approach results in weak generalisation, particularly when there is a significant domain shift. We propose a novel, parameter efficient, continual pretraining framework, PECoP, to reduce such domain shift vi… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: Accepted to WACV 2024 (preprint)

  25. arXiv:2311.05608  [pdf, other

    cs.CR cs.AI cs.CL

    FigStep: Jailbreaking Large Vision-language Models via Typographic Visual Prompts

    Authors: Yichen Gong, Delong Ran, Jinyuan Liu, Conglei Wang, Tianshuo Cong, Anyu Wang, Sisi Duan, Xiaoyun Wang

    Abstract: Ensuring the safety of artificial intelligence-generated content (AIGC) is a longstanding topic in the artificial intelligence (AI) community, and the safety concerns associated with Large Language Models (LLMs) have been widely investigated. Recently, large vision-language models (VLMs) represent an unprecedented revolution, as they are built upon LLMs but can incorporate additional modalities (e… ▽ More

    Submitted 13 December, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: Technical Report

  26. Multi-grained Evidence Inference for Multi-choice Reading Comprehension

    Authors: Yilin Zhao, Hai Zhao, Sufeng Duan

    Abstract: Multi-choice Machine Reading Comprehension (MRC) is a major and challenging task for machines to answer questions according to provided options. Answers in multi-choice MRC cannot be directly extracted in the given passages, and essentially require machines capable of reasoning from accurate extracted evidence. However, the critical evidence may be as simple as just one word or phrase, while it is… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted by TASLP 2023, vol. 31, pp. 3896-3907

    Journal ref: in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 3896-3907, 2023

  27. arXiv:2310.11984  [pdf, other

    cs.LG cs.CL

    From Interpolation to Extrapolation: Complete Length Generalization for Arithmetic Transformers

    Authors: Shaoxiong Duan, Yining Shi, Wei Xu

    Abstract: In this paper, we investigate the inherent capabilities of transformer models in learning arithmetic algorithms, such as addition and parity. Through experiments and attention analysis, we identify a number of crucial factors for achieving optimal length generalization. We show that transformer models are able to generalize to long lengths with the help of targeted attention biasing. In particular… ▽ More

    Submitted 10 May, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

  28. arXiv:2310.11053  [pdf, other

    cs.CL cs.AI cs.CY

    Denevil: Towards Deciphering and Navigating the Ethical Values of Large Language Models via Instruction Learning

    Authors: Shitong Duan, Xiaoyuan Yi, Peng Zhang, Tun Lu, Xing Xie, Ning Gu

    Abstract: Large Language Models (LLMs) have made unprecedented breakthroughs, yet their increasing integration into everyday life might raise societal risks due to generated unethical content. Despite extensive study on specific issues like bias, the intrinsic values of LLMs remain largely unexplored from a moral philosophy perspective. This work delves into ethical values utilizing Moral Foundation Theory.… ▽ More

    Submitted 4 March, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024

  29. arXiv:2310.07548  [pdf, other

    cs.CV

    Attribute Localization and Revision Network for Zero-Shot Learning

    Authors: Junzhe Xu, Suling Duan, Chenwei Tang, Zhenan He, Jiancheng Lv

    Abstract: Zero-shot learning enables the model to recognize unseen categories with the aid of auxiliary semantic information such as attributes. Current works proposed to detect attributes from local image regions and align extracted features with class-level semantics. In this paper, we find that the choice between local and global features is not a zero-sum game, global features can also contribute to the… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  30. arXiv:2309.02230  [pdf, other

    cs.CV cs.AI

    DCP-Net: A Distributed Collaborative Perception Network for Remote Sensing Semantic Segmentation

    Authors: Zhechao Wang, Peirui Cheng, Shujing Duan, Kaiqiang Chen, Zhirui Wang, Xinming Li, Xian Sun

    Abstract: Onboard intelligent processing is widely applied in emergency tasks in the field of remote sensing. However, it is predominantly confined to an individual platform with a limited observation range as well as susceptibility to interference, resulting in limited accuracy. Considering the current state of multi-platform collaborative observation, this article innovatively presents a distributed colla… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  31. arXiv:2308.01469  [pdf, other

    cs.LG cs.AI cs.CR

    VertexSerum: Poisoning Graph Neural Networks for Link Inference

    Authors: Ruyi Ding, Shijin Duan, Xiaolin Xu, Yunsi Fei

    Abstract: Graph neural networks (GNNs) have brought superb performance to various applications utilizing graph structural data, such as social analysis and fraud detection. The graph links, e.g., social relationships and transaction history, are sensitive and valuable information, which raises privacy concerns when using GNNs. To exploit these vulnerabilities, we propose VertexSerum, a novel graph poisoning… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  32. arXiv:2307.02751  [pdf, ps, other

    cs.SD cs.CR eess.AS

    DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition

    Authors: Zhifeng Wang, Chunyan Zeng, Surong Duan, Hongjie Ouyang, Hongmin Xu

    Abstract: Speaker recognition is a biometric modality that utilizes the speaker's speech segments to recognize the identity, determining whether the test speaker belongs to one of the enrolled speakers. In order to improve the robustness of the i-vector framework on cross-channel conditions and explore the nova method for applying deep learning to speaker recognition, the Stacked Auto-encoders are used to g… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 12 pages, 3 figures

  33. arXiv:2306.15513  [pdf, other

    cs.CR

    PASNet: Polynomial Architecture Search Framework for Two-party Computation-based Secure Neural Network Deployment

    Authors: Hongwu Peng, Shanglin Zhou, Yukui Luo, Nuo Xu, Shijin Duan, Ran Ran, Jiahui Zhao, Chenghong Wang, Tong Geng, Wujie Wen, Xiaolin Xu, Caiwen Ding

    Abstract: Two-party computation (2PC) is promising to enable privacy-preserving deep learning (DL). However, the 2PC-based privacy-preserving DL implementation comes with high comparison protocol overhead from the non-linear operators. This work presents PASNet, a novel systematic framework that enables low latency, high energy efficiency & accuracy, and security-guaranteed 2PC-DL by integrating the hardwar… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: DAC 2023 accepeted publication, short version was published on AAAI 2023 workshop on DL-Hardware Co-Design for AI Acceleration: RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference

    ACM Class: E.3; I.2; B.0

    Journal ref: DAC 2023

  34. arXiv:2302.12506  [pdf

    cs.IT

    Exploring the Enablers of Digital Transformation in Small and Medium-Sized Enterprise

    Authors: Sachithra Lokuge, Sophia Duan

    Abstract: Recently, digital transformation has caught much attention of both academics and practitioners. With the advent of digital technologies, small-and-medium-sized enterprises (SMEs) have obtained the capacity to initiate digital transformation initiatives in a similar fashion to large-sized organizations. The innate characteristics of digital technologies also favor SMEs in promoting initiation of di… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

  35. arXiv:2302.12347  [pdf, other

    cs.LG

    MetaLDC: Meta Learning of Low-Dimensional Computing Classifiers for Fast On-Device Adaption

    Authors: Yejia Liu, Shijin Duan, Xiaolin Xu, Shaolei Ren

    Abstract: Fast model updates for unseen tasks on intelligent edge devices are crucial but also challenging due to the limited computational power. In this paper,we propose MetaLDC, which meta-trains braininspired ultra-efficient low-dimensional computing classifiers to enable fast adaptation on tiny devices with minimal computational costs. Concretely, during the meta-training stage, MetaLDC meta trains a r… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted as a full paper by the TinyML Research Symposium 2023; 8 pages, 5 figures

  36. arXiv:2302.02292  [pdf, other

    cs.CR cs.LG

    RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference

    Authors: Hongwu Peng, Shanglin Zhou, Yukui Luo, Nuo Xu, Shijin Duan, Ran Ran, Jiahui Zhao, Shaoyi Huang, Xi Xie, Chenghong Wang, Tong Geng, Wujie Wen, Xiaolin Xu, Caiwen Ding

    Abstract: The proliferation of deep learning (DL) has led to the emergence of privacy and security concerns. To address these issues, secure Two-party computation (2PC) has been proposed as a means of enabling privacy-preserving DL computation. However, in practice, 2PC methods often incur high computation and communication overhead, which can impede their use in large-scale systems. To address this challen… ▽ More

    Submitted 22 February, 2023; v1 submitted 4 February, 2023; originally announced February 2023.

    Comments: This is work is a updated version of arXiv:2209.09424, the original version has been withdrawn

    ACM Class: I.2

  37. arXiv:2212.01109  [pdf, other

    cs.LG cs.DC

    Generative Data Augmentation for Non-IID Problem in Decentralized Clinical Machine Learning

    Authors: Zirui Wang, Shaoming Duan, Chengyue Wu, Wenhao Lin, Xinyu Zha, Peiyi Han, Chuanyi Liu

    Abstract: Swarm learning (SL) is an emerging promising decentralized machine learning paradigm and has achieved high performance in clinical applications. SL solves the problem of a central structure in federated learning by combining edge computing and blockchain-based peer-to-peer network. While there are promising results in the assumption of the independent and identically distributed (IID) data across… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

  38. arXiv:2211.13116  [pdf, other

    cs.LG cs.CR stat.ML

    Fed-TDA: Federated Tabular Data Augmentation on Non-IID Data

    Authors: Shaoming Duan, Chuanyi Liu, Peiyi Han, Tianyu He, Yifeng Xu, Qiyuan Deng

    Abstract: Non-independent and identically distributed (non-IID) data is a key challenge in federated learning (FL), which usually hampers the optimization convergence and the performance of FL. Existing data augmentation methods based on federated generative models or raw data sharing strategies for solving the non-IID problem still suffer from low performance, privacy protection concerns, and high communic… ▽ More

    Submitted 12 January, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

  39. arXiv:2211.03925  [pdf, other

    cs.LG physics.ao-ph

    AutoML-based Almond Yield Prediction and Projection in California

    Authors: Shiheng Duan, Shuaiqi Wu, Erwan Monier, Paul Ullrich

    Abstract: Almonds are one of the most lucrative products of California, but are also among the most sensitive to climate change. In order to better understand the relationship between climatic factors and almond yield, an automated machine learning framework is used to build a collection of machine learning models. The prediction skill is assessed using historical records. Future projections are derived usi… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: Submitted to Tackling Climate Change with Machine Learning: workshop at NeurIPS 2022

  40. arXiv:2209.09424   

    cs.CR cs.LG

    PolyMPCNet: Towards ReLU-free Neural Architecture Search in Two-party Computation Based Private Inference

    Authors: Hongwu Peng, Shanglin Zhou, Yukui Luo, Shijin Duan, Nuo Xu, Ran Ran, Shaoyi Huang, Chenghong Wang, Tong Geng, Ang Li, Wujie Wen, Xiaolin Xu, Caiwen Ding

    Abstract: The rapid growth and deployment of deep learning (DL) has witnessed emerging privacy and security concerns. To mitigate these issues, secure multi-party computation (MPC) has been discussed, to enable the privacy-preserving DL computation. In practice, they often come at very high computation and communication overhead, and potentially prohibit their popularity in large scale systems. Two orthogon… ▽ More

    Submitted 22 February, 2023; v1 submitted 19 September, 2022; originally announced September 2022.

    Comments: Uploaded a new version of the paper in another new submission: RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference [arXiv:2302.02292]

    ACM Class: I.2; E.3; C.3

  41. arXiv:2207.07012  [pdf, ps, other

    cs.LG physics.ao-ph

    AutoML-Based Drought Forecast with Meteorological Variables

    Authors: Shiheng Duan, Xiurui Zhang

    Abstract: A precise forecast for droughts is of considerable value to scientific research, agriculture, and water resource management. With emerging developments of data-driven approaches for hydro-climate modeling, this paper investigates an AutoML-based framework to forecast droughts in the U.S. Compared with commonly-used temporal deep learning models, the AutoML model can achieve comparable performance… ▽ More

    Submitted 23 August, 2022; v1 submitted 9 June, 2022; originally announced July 2022.

  42. arXiv:2207.02632  [pdf, other

    cs.CV

    Network Pruning via Feature Shift Minimization

    Authors: Yuanzhi Duan, Yue Zhou, Peng He, Qiang Liu, Shukai Duan, Xiaofang Hu

    Abstract: Channel pruning is widely used to reduce the complexity of deep network models. Recent pruning methods usually identify which parts of the network to discard by proposing a channel importance criterion. However, recent studies have shown that these criteria do not work well in all conditions. In this paper, we propose a novel Feature Shift Minimization (FSM) method to compress CNN models, which ev… ▽ More

    Submitted 3 October, 2022; v1 submitted 6 July, 2022; originally announced July 2022.

  43. arXiv:2205.00069  [pdf, other

    cs.CV

    Birds' Eye View: Measuring Behavior and Posture of Chickens as a Metric for Their Well-Being

    Authors: Kevin Hyekang Joo, Shiyuan Duan, Shawna L. Weimer, Mohammad Nayeem Teli

    Abstract: Chicken well-being is important for ensuring food security and better nutrition for a growing global human population. In this research, we represent behavior and posture as a metric to measure chicken well-being. With the objective of detecting chicken posture and behavior in a pen, we employ two algorithms: Mask R-CNN for instance segmentation and YOLOv4 in combination with ResNet50 for classifi… ▽ More

    Submitted 29 April, 2022; originally announced May 2022.

    Comments: under review at IJCV

  44. arXiv:2203.12046  [pdf, other

    cs.CR cs.AR

    NNReArch: A Tensor Program Scheduling Framework Against Neural Network Architecture Reverse Engineering

    Authors: Yukui Luo, Shijin Duan, Cheng Gongye, Yunsi Fei, Xiaolin Xu

    Abstract: Architecture reverse engineering has become an emerging attack against deep neural network (DNN) implementations. Several prior works have utilized side-channel leakage to recover the model architecture while the target is executing on a hardware acceleration platform. In this work, we target an open-source deep-learning accelerator, Versatile Tensor Accelerator (VTA), and utilize electromagnetic… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: Accepted by FCCM 2022

  45. arXiv:2203.09681  [pdf, other

    cs.CR

    HDLock: Exploiting Privileged Encoding to Protect Hyperdimensional Computing Models against IP Stealing

    Authors: Shijin Duan, Shaolei Ren, Xiaolin Xu

    Abstract: Hyperdimensional Computing (HDC) is facing infringement issues due to straightforward computations. This work, for the first time, raises a critical vulnerability of HDC, an attacker can reverse engineer the entire model, only requiring the unindexed hypervector memory. To mitigate this attack, we propose a defense strategy, namely HDLock, which significantly increases the reasoning cost of encodi… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 7 pages, 9 figures, accepted by and to be presented at DAC 2022

  46. arXiv:2203.09680  [pdf, other

    cs.LG

    LeHDC: Learning-Based Hyperdimensional Computing Classifier

    Authors: Shijin Duan, Yejia Liu, Shaolei Ren, Xiaolin Xu

    Abstract: Thanks to the tiny storage and efficient execution, hyperdimensional Computing (HDC) is emerging as a lightweight learning framework on resource-constrained hardware. Nonetheless, the existing HDC training relies on various heuristic methods, significantly limiting their inference accuracy. In this paper, we propose a new HDC framework, called LeHDC, which leverages a principled learning approach… ▽ More

    Submitted 31 March, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: 7 pages, 6 figures, accepted by and to be presented at DAC 2022

  47. arXiv:2203.04894  [pdf, other

    cs.LG

    A Brain-Inspired Low-Dimensional Computing Classifier for Inference on Tiny Devices

    Authors: Shijin Duan, Xiaolin Xu, Shaolei Ren

    Abstract: By mimicking brain-like cognition and exploiting parallelism, hyperdimensional computing (HDC) classifiers have been emerging as a lightweight framework to achieve efficient on-device inference. Nonetheless, they have two fundamental drawbacks, heuristic training process and ultra-high dimension, which result in sub-optimal inference accuracy and large model sizes beyond the capability of tiny dev… ▽ More

    Submitted 31 March, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: 8 pages, 9 figures, accepted by and presented as a full paper at TinyML Research Symposium 2022

  48. arXiv:2112.05493  [pdf, other

    cs.LG cs.CV eess.IV

    Network Compression via Central Filter

    Authors: Yuanzhi Duan, Xiaofang Hu, Yue Zhou, Qiang Liu, Shukai Duan

    Abstract: Neural network pruning has remarkable performance for reducing the complexity of deep network models. Recent network pruning methods usually focused on removing unimportant or redundant filters in the network. In this paper, by exploring the similarities between feature maps, we propose a novel filter pruning method, Central Filter (CF), which suggests that a filter is approximately equal to a set… ▽ More

    Submitted 13 December, 2021; v1 submitted 10 December, 2021; originally announced December 2021.

  49. arXiv:2111.05989  [pdf

    cs.DL cs.IT

    Towards Understanding Enablers of Digital Transformation in Small and Medium-Sized Enterprises

    Authors: Sachithra Lokuge, Sophia Xiaoxia Duan

    Abstract: Even though, digital transformation has attracted much attention of both academics and practitioners, a very limited number of studies have investigated the digital transformation process in small and medium-sized enterprises (SMEs) and the findings remain fragmented. Given the accessibility and availability of digital technologies to launch digital transformation initiatives and the importance of… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

  50. arXiv:2110.09182  [pdf, other

    cs.LG

    Graph Partner Neural Networks for Semi-Supervised Learning on Graphs

    Authors: Langzhang Liang, Cuiyun Gao, Shiyi Chen, Shishi Duan, Yu pan, Junjin Zheng, Lei Wang, Zenglin Xu

    Abstract: Graph Convolutional Networks (GCNs) are powerful for processing graph-structured data and have achieved state-of-the-art performance in several tasks such as node classification, link prediction, and graph classification. However, it is inevitable for deep GCNs to suffer from an over-smoothing issue that the representations of nodes will tend to be indistinguishable after repeated graph convolutio… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.