Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 52 results for author: Cai, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11116  [pdf

    cs.CL

    Grammaticality Representation in ChatGPT as Compared to Linguists and Laypeople

    Authors: Zhuang Qiu, Xufeng Duan, Zhenguang G. Cai

    Abstract: Large language models (LLMs) have demonstrated exceptional performance across various linguistic tasks. However, it remains uncertain whether LLMs have developed human-like fine-grained grammatical intuition. This preregistered study (https://osf.io/t5nes) presents the first large-scale investigation of ChatGPT's grammatical intuition, building upon a previous study that collected laypeople's gram… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 23 pages

  2. arXiv:2406.07932  [pdf, other

    cs.IR

    Counteracting Duration Bias in Video Recommendation via Counterfactual Watch Time

    Authors: Haiyuan Zhao, Guohao Cai, Jieming Zhu, Zhenhua Dong, Jun Xu, Ji-Rong Wen

    Abstract: In video recommendation, an ongoing effort is to satisfy users' personalized information needs by leveraging their logged watch time. However, watch time prediction suffers from duration bias, hindering its ability to reflect users' interests accurately. Existing label-correction approaches attempt to uncover user interests through grouping and normalizing observed watch time according to video du… ▽ More

    Submitted 13 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by KDD 2024

  3. arXiv:2404.09578  [pdf, other

    cs.IR

    Recall-Augmented Ranking: Enhancing Click-Through Rate Prediction Accuracy with Cross-Stage Data

    Authors: Junjie Huang, Guohao Cai, Jieming Zhu, Zhenhua Dong, Ruiming Tang, Weinan Zhang, Yong Yu

    Abstract: Click-through rate (CTR) prediction plays an indispensable role in online platforms. Numerous models have been proposed to capture users' shifting preferences by leveraging user behavior sequences. However, these historical sequences often suffer from severe homogeneity and scarcity compared to the extensive item pool. Relying solely on such sequences for user representations is inherently restric… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 4 pages, accepted by WWW 2024 Short Track

  4. arXiv:2404.04693  [pdf, other

    cs.CV cs.RO

    OmniColor: A Global Camera Pose Optimization Approach of LiDAR-360Camera Fusion for Colorizing Point Clouds

    Authors: Bonan Liu, Guoyang Zhao, Jianhao Jiao, Guang Cai, Chengyang Li, Handi Yin, Yuyang Wang, Ming Liu, Pan Hui

    Abstract: A Colored point cloud, as a simple and efficient 3D representation, has many advantages in various fields, including robotic navigation and scene reconstruction. This representation is now commonly used in 3D reconstruction tasks relying on cameras and LiDARs. However, fusing data from these two types of sensors is poorly performed in many existing frameworks, leading to unsatisfactory mapping res… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 2024 IEEE International Conference on Robotics and Automation

  5. arXiv:2403.08569  [pdf, other

    cs.LG physics.comp-ph

    A Physics-driven GraphSAGE Method for Physical Process Simulations Described by Partial Differential Equations

    Authors: Hang Hu, Sidi Wu, Guoxiong Cai, Na Liu

    Abstract: Physics-informed neural networks (PINNs) have successfully addressed various computational physics problems based on partial differential equations (PDEs). However, while tackling issues related to irregularities like singularities and oscillations, trained solutions usually suffer low accuracy. In addition, most current works only offer the trained solution for predetermined input parameters. If… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 18 pages,11 figures, 3 tables

    ACM Class: G.1.8

  6. arXiv:2403.05059  [pdf, other

    cs.SE

    Bug Priority Change: An Empirical Study on Apache Projects

    Authors: Zengyang Li, Guangzong Cai, Qinyi Yu, Peng Liang, Ran Mo, Hui Liu

    Abstract: In issue tracking systems, each bug is assigned a priority level (e.g., Blocker, Critical, Major, Minor, or Trivial in JIRA from highest to lowest), which indicates the urgency level of the bug. In this sense, understanding bug priority changes helps to arrange the work schedule of participants reasonably, and facilitates a better analysis and resolution of bugs. According to the data extracted fr… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Preprint accepted for publication in Journal of Systems and Software, 2024

  7. arXiv:2402.13296  [pdf, other

    cs.NE

    Evolutionary Reinforcement Learning: A Systematic Review and Future Directions

    Authors: Yuanguo Lin, Fan Lin, Guorong Cai, Hong Chen, Lixin Zou, Pengcheng Wu

    Abstract: In response to the limitations of reinforcement learning and evolutionary algorithms (EAs) in complex problem-solving, Evolutionary Reinforcement Learning (EvoRL) has emerged as a synergistic solution. EvoRL integrates EAs and reinforcement learning, presenting a promising avenue for training intelligent agents. This systematic review firstly navigates through the technological background of EvoRL… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 18 pages, 2 figures

  8. arXiv:2311.03653  [pdf, ps, other

    cs.IT eess.SP

    On the Performance of LoRa Empowered Communication for Wireless Body Area Networks

    Authors: Minling Zhang, Guofa Cai, Zhiping Xu, Jiguang He, Markku Juntti

    Abstract: To remotely monitor the physiological status of the human body, long range (LoRa) communication has been considered as an eminently suitable candidate for wireless body area networks (WBANs). Typically, a Rayleigh-lognormal fading channel is encountered by the LoRa links of the WBAN. In this context, we characterize the performance of the LoRa system in WBAN scenarios with an emphasis on the physi… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  9. arXiv:2311.02917  [pdf, ps, other

    cs.IT

    RIS-Enabled Anti-Interference in LoRa Systems

    Authors: Zhaokun Liang, Guofa Cai, Jiguang He, Georges Kaddoum, Chongwen Huang, Merouane Debbah

    Abstract: It has been proved that a long-range (LoRa) system can achieve long-distance and low-power. However, the performance of LoRa systems can be severely degraded by fading. In addition, LoRa technology typically adopts an ALOHA-based access mechanism, which inevitably produces interfering signals for the target user. To overcome the effects of fading and interference, we introduce a reconfigurable int… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  10. arXiv:2308.09489  [pdf, ps, other

    cs.IT eess.SP

    STAR-RIS Aided MISO SWIPT-NOMA System with Energy Buffer: Performance Analysis and Optimization

    Authors: Kengyuan Xie, Guofa Cai, Jiguang He, Georges Kaddoum

    Abstract: In this paper, we propose a simultaneous transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) and energy buffer aided multiple-input single-output (MISO) simultaneous wireless information and power transfer (SWIPT) non-orthogonal multiple access (NOMA) system, which consists of a STAR-RIS, an access point (AP), and reflection users and transmission users with energy buffers. I… ▽ More

    Submitted 16 July, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

  11. Uncovering User Interest from Biased and Noised Watch Time in Video Recommendation

    Authors: Haiyuan Zhao, Lei Zhang, Jun Xu, Guohao Cai, Zhenhua Dong, Ji-Rong Wen

    Abstract: In the video recommendation, watch time is commonly adopted as an indicator of user interest. However, watch time is not only influenced by the matching of users' interests but also by other factors, such as duration bias and noisy watching. Duration bias refers to the tendency for users to spend more time on videos with longer durations, regardless of their actual interest level. Noisy watching,… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: Accepted by Recsys'23

  12. arXiv:2306.08808  [pdf, other

    cs.IR

    ReLoop2: Building Self-Adaptive Recommendation Models via Responsive Error Compensation Loop

    Authors: Jieming Zhu, Guohao Cai, Junjie Huang, Zhenhua Dong, Ruiming Tang, Weinan Zhang

    Abstract: Industrial recommender systems face the challenge of operating in non-stationary environments, where data distribution shifts arise from evolving user behaviors over time. To tackle this challenge, a common approach is to periodically re-train or incrementally update deployed deep models with newly observed data, resulting in a continual training process. However, the conventional learning paradig… ▽ More

    Submitted 29 November, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: Accepted by KDD 2023

  13. arXiv:2304.13445  [pdf, other

    cs.CV

    Neural-PBIR Reconstruction of Shape, Material, and Illumination

    Authors: Cheng Sun, Guangyan Cai, Zhengqin Li, Kai Yan, Cheng Zhang, Carl Marshall, Jia-Bin Huang, Shuang Zhao, Zhao Dong

    Abstract: Reconstructing the shape and spatially varying surface appearances of a physical-world object as well as its surrounding illumination based on 2D images (e.g., photographs) of the object has been a long-standing problem in computer vision and graphics. In this paper, we introduce an accurate and highly efficient object reconstruction pipeline combining neural based object reconstruction and physic… ▽ More

    Submitted 1 February, 2024; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: ICCV 2023. Project page at https://neural-pbir.github.io/ Update Stanford-ORB results

  14. arXiv:2304.00902  [pdf, other

    cs.IR

    FinalMLP: An Enhanced Two-Stream MLP Model for CTR Prediction

    Authors: Kelong Mao, Jieming Zhu, Liangcai Su, Guohao Cai, Yuru Li, Zhenhua Dong

    Abstract: Click-through rate (CTR) prediction is one of the fundamental tasks for online advertising and recommendation. While multi-layer perceptron (MLP) serves as a core component in many deep CTR prediction models, it has been widely recognized that applying a vanilla MLP network alone is inefficient in learning multiplicative feature interactions. As such, many two-stream interaction models (e.g., Deep… ▽ More

    Submitted 29 November, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: Accepted by AAAI 2023. Code available at https://reczoo.github.io/FinalMLP

  15. arXiv:2303.08014  [pdf

    cs.CL

    Do large language models resemble humans in language use?

    Authors: Zhenguang G. Cai, Xufeng Duan, David A. Haslett, Shuqi Wang, Martin J. Pickering

    Abstract: Large language models (LLMs) such as ChatGPT and Vicuna have shown remarkable capacities in comprehending and producing language. However, their internal workings remain a black box, and it is unclear whether LLMs and chatbots can develop humanlike characteristics in language use. Cognitive scientists have devised many experiments that probe, and have made great progress in explaining, how people… ▽ More

    Submitted 25 March, 2024; v1 submitted 10 March, 2023; originally announced March 2023.

  16. arXiv:2301.08865  [pdf, ps, other

    cs.IT cs.NI

    Performance Analysis and Resource Allocation of STAR-RIS Aided Wireless-Powered NOMA System

    Authors: Kengyuan Xie, Guofa Cai, Georges Kaddoum, Jiguang He

    Abstract: This paper proposes a simultaneous transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) aided wireless-powered non-orthogonal multiple access (NOMA) system, which includes an access point (AP), a STAR-RIS, and two non-orthogonal users located at both sides of the STAR-RIS. In this system, the users first harvest the radio-frequency energy from the AP in the downlink, then adop… ▽ More

    Submitted 20 January, 2023; originally announced January 2023.

    Comments: 30 pages, 12 figures

  17. arXiv:2212.00844  [pdf, other

    cs.MA cs.RO

    A Comparison of New Swarm Task Allocation Algorithms in Unknown Environments with Varying Task Density

    Authors: Grace Cai, Noble Harasha, Nancy Lynch

    Abstract: Task allocation is an important problem for robot swarms to solve, allowing agents to reduce task completion time by performing tasks in a distributed fashion. Existing task allocation algorithms often assume prior knowledge of task location and demand or fail to consider the effects of the geometric distribution of tasks on the completion time and communication cost of the algorithms. In this pap… ▽ More

    Submitted 9 February, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: 11 pages, 11 figures

  18. Fault Diagnosis for Power Electronics Converters based on Deep Feedforward Network and Wavelet Compression

    Authors: Lei Kou, Chuang Liu, Guowei Cai, Zhe Zhang

    Abstract: A fault diagnosis method for power electronics converters based on deep feedforward network and wavelet compression is proposed in this paper. The transient historical data after wavelet compression are used to realize the training of fault diagnosis classifier. Firstly, the correlation analysis of the voltage or current data running in various fault states is performed to remove the redundant fea… ▽ More

    Submitted 27 October, 2022; originally announced November 2022.

    Comments: Electric Power Systems Research

    MSC Class: 68T07 ACM Class: I.2

  19. Data-driven design of fault diagnosis for three-phase PWM rectifier using random forests technique with transient synthetic features

    Authors: Lei Kou, Chuang Liu, Guo-wei Cai, Jia-ning Zhou, Quan-de Yuan

    Abstract: A three-phase pulse-width modulation (PWM) rectifier can usually maintain operation when open-circuit faults occur in insulated-gate bipolar transistors (IGBTs), which will lead the system to be unstable and unsafe. Aiming at this problem, based on random forests with transient synthetic features, a data-driven online fault diagnosis method is proposed to locate the open-circuit faults of IGBTs ti… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: IET Power Electronics

    MSC Class: 68T99 ACM Class: I.2

  20. Fault diagnosis for open-circuit faults in NPC inverter based on knowledge-driven and data-driven approaches

    Authors: Lei Kou, Chuang Liu, Guo-wei Cai, Jia-ning Zhou, Quan-de Yuan, Si-miao Pang

    Abstract: In this study, the open-circuit faults diagnosis and location issue of the neutral-point-clamped (NPC) inverters are analysed. A novel fault diagnosis approach based on knowledge driven and data driven was presented for the open-circuit faults in insulated-gate bipolar transistors (IGBTs) of NPC inverter, and Concordia transform (knowledge driven) and random forests (RFs) technique (data driven) a… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: IET Power Electronics

    MSC Class: 68T05 ACM Class: I.2

  21. Review for AI-based Open-Circuit Faults Diagnosis Methods in Power Electronics Converters

    Authors: Chuang Liu, Lei Kou, Guowei Cai, Zihan Zhao, Zhe Zhang

    Abstract: Power electronics converters have been widely used in aerospace system, DC transmission, distributed energy, smart grid and so forth, and the reliability of power electronics converters has been a hotspot in academia and industry. It is of great significance to carry out power electronics converters open-circuit faults monitoring and intelligent fault diagnosis to avoid secondary faults, reduce ti… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: Power System Technology

    MSC Class: 68T99 ACM Class: I.2

  22. arXiv:2206.00587  [pdf, other

    cs.MA

    A Geometry-Sensitive Quorum Sensing Algorithm for the Best-of-N Site Selection Problem

    Authors: Grace Cai, Nancy Lynch

    Abstract: The house hunting behavior of the Temnothorax albipennis ant allows the colony to explore several nest choices and agree on the best one. Their behavior serves as the basis for many bio-inspired swarm models to solve the same problem. However, many of the existing site selection models in both insect colony and swarm literature test the model's accuracy and decision time only on setups where all p… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: 17 pages, 4 figures, submitted to ANTS 2022

  23. arXiv:2205.09626  [pdf, other

    cs.IR

    BARS: Towards Open Benchmarking for Recommender Systems

    Authors: Jieming Zhu, Quanyu Dai, Liangcai Su, Rong Ma, Jinyang Liu, Guohao Cai, Xi Xiao, Rui Zhang

    Abstract: The past two decades have witnessed the rapid development of personalized recommendation techniques. Despite significant progress made in both research and practice of recommender systems, to date, there is a lack of a widely-recognized benchmarking standard in this field. Many existing studies perform model evaluations and comparisons in an ad-hoc manner, for example, by employing their own priva… ▽ More

    Submitted 17 July, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted by SIGIR 2022. Note that version v5 is updated to keep consistency with the ACM camera-ready version

  24. Physics-Based Inverse Rendering using Combined Implicit and Explicit Geometries

    Authors: Guangyan Cai, Kai Yan, Zhao Dong, Ioannis Gkioulekas, Shuang Zhao

    Abstract: Mathematically representing the shape of an object is a key ingredient for solving inverse rendering problems. Explicit representations like meshes are efficient to render in a differentiable fashion but have difficulties handling topology changes. Implicit representations like signed-distance functions, on the other hand, offer better support of topology changes but are much more difficult to use… ▽ More

    Submitted 8 July, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

    Journal ref: Computer Graphics Forum, Volume 41 (2022), Number 4

  25. arXiv:2204.11165  [pdf, other

    cs.IR

    ReLoop: A Self-Correction Continual Learning Loop for Recommender Systems

    Authors: Guohao Cai, Jieming Zhu, Quanyu Dai, Zhenhua Dong, Xiuqiang He, Ruiming Tang, Rui Zhang

    Abstract: Deep learning-based recommendation has become a widely adopted technique in various online applications. Typically, a deployed model undergoes frequent re-training to capture users' dynamic behaviors from newly collected interaction logs. However, the current model training process only acquires users' feedbacks as labels, but fail to take into account the errors made in previous recommendations.… ▽ More

    Submitted 23 April, 2022; originally announced April 2022.

    Comments: Accepted by SIGIR 2022

  26. arXiv:2204.00815  [pdf, other

    cs.IR

    Unbiased Top-k Learning to Rank with Causal Likelihood Decomposition

    Authors: Haiyuan Zhao, Jun Xu, Xiao Zhang, Guohao Cai, Zhenhua Dong, Ji-Rong Wen

    Abstract: Unbiased learning to rank has been proposed to alleviate the biases in the search ranking, making it possible to train ranking models with user interaction data. In real applications, search engines are designed to display only the most relevant k documents from the retrieved candidate set. The rest candidates are discarded. As a consequence, position bias and sample selection bias usually occur s… ▽ More

    Submitted 13 June, 2024; v1 submitted 2 April, 2022; originally announced April 2022.

    Comments: Accepted by SIGIR-AP 2023

  27. arXiv:2203.12267  [pdf, other

    cs.IR cs.LG

    PEAR: Personalized Re-ranking with Contextualized Transformer for Recommendation

    Authors: Yi Li, Jieming Zhu, Weiwen Liu, Liangcai Su, Guohao Cai, Qi Zhang, Ruiming Tang, Xi Xiao, Xiuqiang He

    Abstract: The goal of recommender systems is to provide ordered item lists to users that best match their interests. As a critical task in the recommendation pipeline, re-ranking has received increasing attention in recent years. In contrast to conventional ranking models that score each item individually, re-ranking aims to explicitly model the mutual influences among items to further refine the ordering o… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Accepted by WWW 2022

  28. arXiv:2203.11720  [pdf, other

    cs.CL

    Continuous Detection, Rapidly React: Unseen Rumors Detection based on Continual Prompt-Tuning

    Authors: Yuhui Zuo, Wei Zhu, Guoyong Cai

    Abstract: Since open social platforms allow for a large and continuous flow of unverified information, rumors can emerge unexpectedly and spread quickly. However, existing rumor detection (RD) models often assume the same training and testing distributions and can not cope with the continuously changing social network environment. This paper proposed a Continual Prompt-Tuning RD (CPT-RD) framework, which av… ▽ More

    Submitted 9 September, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: final version, accpeted by COLING 2022

  29. arXiv:2203.07720  [pdf, other

    cs.CV

    Revitalize Region Feature for Democratizing Video-Language Pre-training of Retrieval

    Authors: Guanyu Cai, Yixiao Ge, Binjie Zhang, Alex Jinpeng Wang, Rui Yan, Xudong Lin, Ying Shan, Lianghua He, Xiaohu Qie, Jianping Wu, Mike Zheng Shou

    Abstract: Recent dominant methods for video-language pre-training (VLP) learn transferable representations from the raw pixels in an end-to-end manner to achieve advanced performance on downstream video-language retrieval. Despite the impressive results, VLP research becomes extremely expensive with the need for massive data and a long training time, preventing further explorations. In this work, we revital… ▽ More

    Submitted 7 February, 2023; v1 submitted 15 March, 2022; originally announced March 2022.

  30. arXiv:2203.07303  [pdf, other

    cs.CV

    All in One: Exploring Unified Video-Language Pre-training

    Authors: Alex Jinpeng Wang, Yixiao Ge, Rui Yan, Yuying Ge, Xudong Lin, Guanyu Cai, Jianping Wu, Ying Shan, Xiaohu Qie, Mike Zheng Shou

    Abstract: Mainstream Video-Language Pre-training models \cite{actbert,clipbert,violet} consist of three parts, a video encoder, a text encoder, and a video-text fusion Transformer. They pursue better performance via utilizing heavier unimodal encoders or multimodal fusion Transformers, resulting in increased parameters with lower efficiency in downstream tasks. In this work, we for the first time introduce… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: 18 pages. 11 figures. Code: https://github.com/showlab/all-in-one

  31. arXiv:2201.06056  [pdf, other

    cs.IR

    Debiased Recommendation with User Feature Balancing

    Authors: Mengyue Yang, Guohao Cai, Furui Liu, Zhenhua Dong, Xiuqiang He, Jianye Hao, Jun Wang, Xu Chen

    Abstract: Debiased recommendation has recently attracted increasing attention from both industry and academic communities. Traditional models mostly rely on the inverse propensity score (IPS), which can be hard to estimate and may suffer from the high variance issue. To alleviate these problems, in this paper, we propose a novel debiased recommendation framework based on user feature balancing. The general… ▽ More

    Submitted 16 January, 2022; originally announced January 2022.

  32. arXiv:2112.01194  [pdf, other

    cs.CV cs.MM

    Video-Text Pre-training with Learned Regions

    Authors: Rui Yan, Mike Zheng Shou, Yixiao Ge, Alex Jinpeng Wang, Xudong Lin, Guanyu Cai, Jinhui Tang

    Abstract: Video-Text pre-training aims at learning transferable representations from large-scale video-text pairs via aligning the semantics between visual and textual information. State-of-the-art approaches extract visual features from raw pixels in an end-to-end fashion. However, these methods operate at frame-level directly and thus overlook the spatio-temporal structure of objects in video, which yet h… ▽ More

    Submitted 6 December, 2021; v1 submitted 2 December, 2021; originally announced December 2021.

  33. arXiv:2112.00656  [pdf, other

    cs.CV cs.CL

    Object-aware Video-language Pre-training for Retrieval

    Authors: Alex Jinpeng Wang, Yixiao Ge, Guanyu Cai, Rui Yan, Xudong Lin, Ying Shan, Xiaohu Qie, Mike Zheng Shou

    Abstract: Recently, by introducing large-scale dataset and strong transformer network, video-language pre-training has shown great success especially for retrieval. Yet, existing video-language transformer models do not explicitly fine-grained semantic align. In this work, we present Object-aware Transformers, an object-centric approach that extends video-language transformer to incorporate object represent… ▽ More

    Submitted 18 May, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: CVPR2022; Code: https://github.com/FingerRec/OA-Transformer

  34. arXiv:2108.00638  [pdf, ps, other

    cs.IT eess.SP eess.SY

    Performance Analysis of a Two-Hop Relaying LoRa System

    Authors: Wenyang Xu, Guofa Cai, Yi Fang, Guanrong Chen

    Abstract: The conventional LoRa system is not able to sustain long-range communication over fading channels. To resolve the challenging issue, this paper investigates a two-hop opportunistic amplify-and-forward relaying LoRa system. Based on the best relay-selection protocol, the analytical and asymptotic bit error rate (BER), achievable diversity order, coverage probability, and throughput of the proposed… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: 7 pages, 6 figures, conference

  35. arXiv:2106.06867  [pdf, other

    cs.MA

    A Spatially Dependent Probabilistic Model for House Hunting in Ant Colonies

    Authors: Grace Cai, Wendy Wu, Wayne Zhao, Jiajia Zhao, Nancy Lynch

    Abstract: Ant species such as Temnothorax albipennis select a new nest site in a distributed fashion that, if modeled correctly, can serve as useful information for site selection algorithms for robotic swarms and other applications. Studying and replicating the ants' house hunting behavior will also illuminate useful distributed strategies that have evolved in nature. Many of the existing models of househu… ▽ More

    Submitted 12 June, 2021; originally announced June 2021.

  36. arXiv:2105.12939  [pdf, other

    cs.CV

    Unsupervised Adaptive Semantic Segmentation with Local Lipschitz Constraint

    Authors: Guanyu Cai, Lianghua He

    Abstract: Recent advances in unsupervised domain adaptation have seen considerable progress in semantic segmentation. Existing methods either align different domains with adversarial training or involve the self-learning that utilizes pseudo labels to conduct supervised training. The former always suffers from the unstable training caused by adversarial training and only focuses on the inter-domain gap that… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

  37. arXiv:2103.03578  [pdf, other

    cs.IR

    Non-invasive Self-attention for Side Information Fusion in Sequential Recommendation

    Authors: Chang Liu, Xiaoguang Li, Guohao Cai, Zhenhua Dong, Hong Zhu, Lifeng Shang

    Abstract: Sequential recommender systems aim to model users' evolving interests from their historical behaviors, and hence make customized time-relevant recommendations. Compared with traditional models, deep learning approaches such as CNN and RNN have achieved remarkable advancements in recommendation tasks. Recently, the BERT framework also emerges as a promising method, benefited from its self-attention… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

    Comments: Accepted at AAAI 2021

  38. arXiv:2103.01654  [pdf, other

    cs.CV

    Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query

    Authors: Guanyu Cai, Jun Zhang, Xinyang Jiang, Yifei Gong, Lianghua He, Fufu Yu, Pai Peng, Xiaowei Guo, Feiyue Huang, Xing Sun

    Abstract: Text-based image retrieval has seen considerable progress in recent years. However, the performance of existing methods suffers in real life since the user is likely to provide an incomplete description of an image, which often leads to results filled with false positives that fit the incomplete description. In this work, we introduce the partial-query problem and extensively analyze its influence… ▽ More

    Submitted 11 August, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: Accepted by ICCV2021

  39. arXiv:2101.03036  [pdf, other

    cs.CV

    Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search

    Authors: Chenyang Gao, Guanyu Cai, Xinyang Jiang, Feng Zheng, Jun Zhang, Yifei Gong, Pai Peng, Xiaowei Guo, Xing Sun

    Abstract: Text-based person search aims at retrieving target person in an image gallery using a descriptive sentence of that person. It is very challenging since modal gap makes effectively extracting discriminative features more difficult. Moreover, the inter-class variance of both pedestrian images and descriptions is small. So comprehensive information is needed to align visual and textual clues across a… ▽ More

    Submitted 8 January, 2021; originally announced January 2021.

  40. arXiv:2005.03978  [pdf, ps, other

    eess.SP cs.IT

    Design of Link-Selection Strategies for Buffer-Aided DCSK-SWIPT Relay System

    Authors: Mi Qian, Guofa Cai, Yi Fang, Guojun Han

    Abstract: Adaptive link selection for buffer-aided relaying can achieve significant performance gain compared with the conventional relaying with fixed transmission criterion. However, most of the existing link-selection strategies are designed based on perfect channel state information (CSI), which are very complex by requiring channel estimator. To solve this issue, in this paper, we investigate a buffer-… ▽ More

    Submitted 7 April, 2020; originally announced May 2020.

  41. Toronto-3D: A Large-scale Mobile LiDAR Dataset for Semantic Segmentation of Urban Roadways

    Authors: Weikai Tan, Nannan Qin, Lingfei Ma, Ying Li, Jing Du, Guorong Cai, Ke Yang, Jonathan Li

    Abstract: Semantic segmentation of large-scale outdoor point clouds is essential for urban scene understanding in various applications, especially autonomous driving and urban high-definition (HD) mapping. With rapid developments of mobile laser scanning (MLS) systems, massive point clouds are available for scene understanding, but publicly accessible large-scale labeled datasets, which are essential for de… ▽ More

    Submitted 16 April, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: Toronto-3D dataset can be downloaded at https://github.com/WeikaiTan/Toronto-3D

  42. arXiv:2003.07107  [pdf, ps, other

    cs.NI eess.SP

    Design of an MISO-SWIPT-Aided Code-Index Modulated Multi-Carrier M-DCSK System for e-Health IoT

    Authors: Guofa Cai, Yi Fang, Pingping Chen, Guojun Han, Guoen Cai, Yang Song

    Abstract: Code index modulated multi-carrier M-ary differential chaos shift keying (CIM-MC-M-DCSK) system not only inherits low-power and low-complexity advantages of the conventional DCSK system, but also significantly increases the transmission rate. This feature is of particular importance to Internet of Things (IoT) with trillions of low-cost devices. In particular, for e-health IoT applications, an eff… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: 14 pages, 12 figures, accepted by IEEE Journal on Selected Areas in Communications, 2020.03.15

    Journal ref: IEEE Journal on Selected Areas in Communications,2020.03.15

  43. arXiv:2001.00149  [pdf

    eess.IV cs.CV physics.med-ph

    Simulation of Skin Stretching around the Forehead Wrinkles in Rhytidectomy

    Authors: Ping Zhou, Shuo Huang, Qiang Chen, Siyuan He, Guochao Cai

    Abstract: Objective: Skin stretching around the forehead wrinkles is an important method in rhytidectomy. Proper parameters are required to evaluate the surgical effect. In this paper, a simulation method was proposed to obtain the parameters. Methods: Three-dimensional point cloud data with a resolution of 50 μm were employed. First, a smooth supporting contour under the wrinkled forehead was generated via… ▽ More

    Submitted 1 January, 2020; originally announced January 2020.

  44. arXiv:1907.05855  [pdf, other

    cs.LG cs.AI stat.ML

    DisCoRL: Continual Reinforcement Learning via Policy Distillation

    Authors: René Traoré, Hugo Caselles-Dupré, Timothée Lesort, Te Sun, Guanghang Cai, Natalia Díaz-Rodríguez, David Filliat

    Abstract: In multi-task reinforcement learning there are two main challenges: at training time, the ability to learn different policies with a single model; at test time, inferring which of those policies applying without an external signal. In the case of continual reinforcement learning a third challenge arises: learning tasks sequentially without forgetting the previous ones. In this paper, we tackle the… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: arXiv admin note: text overlap with arXiv:1906.04452

  45. arXiv:1905.10748  [pdf, other

    cs.CV

    Learning Smooth Representation for Unsupervised Domain Adaptation

    Authors: Guanyu Cai, Lianghua He, Mengchu Zhou, Hesham Alhumade, Die Hu

    Abstract: Typical adversarial-training-based unsupervised domain adaptation methods are vulnerable when the source and target datasets are highly-complex or exhibit a large discrepancy between their data distributions. Recently, several Lipschitz-constraint-based methods have been explored. The satisfaction of Lipschitz continuity guarantees a remarkable performance on a target domain. However, they lack a… ▽ More

    Submitted 16 August, 2021; v1 submitted 26 May, 2019; originally announced May 2019.

    Comments: Code is available at https://github.com/CuthbertCai/SRDA. Accepted by IEEE Transactions on Neural Networks and Learning Systems

  46. arXiv:1903.01223   

    cs.IT

    Outage-Limit-Approaching Protograph LDPC Codes for Slow-Fading Wireless Communications

    Authors: Yi Fang, Pingping Chen, Guofa Cai, Francis C. M. Lau, Soung Chang Liew, Guojun Han

    Abstract: Block-fading (BF) channel, also known as slow-fading channel, is a type of simple and practical channel model that can characterize the primary feature of a number of wireless-communication applications with low to moderate mobility. Although the BF channel has received significant research attention in the past twenty years, designing low-complexity outage-limit-approaching error-correction codes… ▽ More

    Submitted 20 July, 2021; v1 submitted 4 March, 2019; originally announced March 2019.

    Comments: There are some technical errors in Section II of this paper, need to be corrected

  47. arXiv:1902.04443  [pdf, ps, other

    cs.NI

    QoS-Aware Buffer-Aided Relaying Implant WBAN for Healthcare IoT: Opportunities and Challenges

    Authors: Guofa Cai, Yi Fang, Jinming Wen, Guojun Han, Xiaodong Yang

    Abstract: Internet of Things (IoT) have motivated a paradigm shift in the development of various applications such as mobile health. Wireless body area network (WBAN) comprises many low-power devices in, on, or around the human body, which offers a desirable solution to monitor physiological signals for mobile-health applications. In the implant WBAN, an implant medical device transmits its measured biologi… ▽ More

    Submitted 12 February, 2019; originally announced February 2019.

    Journal ref: IEEE Network Magazine, 2019

  48. arXiv:1901.09822  [pdf, other

    cs.CV

    Virtual Conditional Generative Adversarial Networks

    Authors: Haifeng Shi, Guanyu Cai, Yuqin Wang, Shaohua Shang, Lianghua He

    Abstract: When trained on multimodal image datasets, normal Generative Adversarial Networks (GANs) are usually outperformed by class-conditional GANs and ensemble GANs, but conditional GANs is restricted to labeled datasets and ensemble GANs lack efficiency. We propose a novel GAN variant called virtual conditional GAN (vcGAN) which is not only an ensemble GAN with multiple generative paths while adding alm… ▽ More

    Submitted 25 January, 2019; originally announced January 2019.

  49. Unsupervised Domain Adaptation with Adversarial Residual Transform Networks

    Authors: Guanyu Cai, Yuqin Wang, Mengchu Zhou, Lianghua He

    Abstract: Domain adaptation is widely used in learning problems lacking labels. Recent studies show that deep adversarial domain adaptation models can make markable improvements in performance, which include symmetric and asymmetric architectures. However, the former has poor generalization ability whereas the latter is very hard to train. In this paper, we propose a novel adversarial domain adaptation meth… ▽ More

    Submitted 18 September, 2019; v1 submitted 25 April, 2018; originally announced April 2018.

    Comments: accepted by IEEE Transactions on Neural Networks and Learning Systems

  50. arXiv:1803.05471  [pdf

    cs.CV

    Computer-aided diagnosis of lung carcinoma using deep learning - a pilot study

    Authors: Zhang Li, Zheyu Hu, Jiaolong Xu, Tao Tan, Hui Chen, Zhi Duan, Ping Liu, Jun Tang, Guoping Cai, Quchang Ouyang, Yuling Tang, Geert Litjens, Qiang Li

    Abstract: Aim: Early detection and correct diagnosis of lung cancer are the most important steps in improving patient outcome. This study aims to assess which deep learning models perform best in lung cancer diagnosis. Methods: Non-small cell lung carcinoma and small cell lung carcinoma biopsy specimens were consecutively obtained and stained. The specimen slides were diagnosed by two experienced pathologis… ▽ More

    Submitted 14 March, 2018; originally announced March 2018.