Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 63 results for author: Gong, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.04697  [pdf, other

    cs.CV cs.MM

    VCoME: Verbal Video Composition with Multimodal Editing Effects

    Authors: Weibo Gong, Xiaojie Jin, Xin Li, Dongliang He, Xinglong Wu

    Abstract: Verbal videos, featuring voice-overs or text overlays, provide valuable content but present significant challenges in composition, especially when incorporating editing effects to enhance clarity and visual appeal. In this paper, we introduce the novel task of verbal video composition with editing effects. This task aims to generate coherent and visually appealing verbal videos by integrating mult… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2406.05666  [pdf, other

    cs.LG cs.IR stat.ML

    General Distribution Learning: A theoretical framework for Deep Learning

    Authors: Binchuan Qi, Li Li, Wei Gong

    Abstract: There remain numerous unanswered research questions on deep learning (DL) within the classical learning theory framework. These include the remarkable generalization capabilities of overparametrized neural networks (NNs), the efficient optimization performance despite non-convexity of objectives, the mechanism of flat minima for generalization, and the exceptional performance of deep architectures… ▽ More

    Submitted 26 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2105.04026 by other authors. arXiv admin note: text overlap with arXiv:2105.04026 by other authors

  3. arXiv:2406.04567  [pdf, other

    cs.LG cs.IR

    Error Bounds of Supervised Classification from Information-Theoretic Perspective

    Authors: Binchuan Qi, Wei Gong, Li Li

    Abstract: There remains a list of unanswered research questions on deep learning (DL), including the remarkable generalization power of overparametrized neural networks, the efficient optimization performance despite the non-convexity, and the mechanisms behind flat minima in generalization. In this paper, we adopt an information-theoretic perspective to explore the theoretical foundations of supervised cla… ▽ More

    Submitted 27 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  4. arXiv:2406.00734  [pdf, other

    cs.LG

    GLADformer: A Mixed Perspective for Graph-level Anomaly Detection

    Authors: Fan Xu, Nan Wang, Hao Wu, Xuezhi Wen, Dalin Zhang, Siyang Lu, Binyong Li, Wei Gong, Hai Wan, Xibin Zhao

    Abstract: Graph-Level Anomaly Detection (GLAD) aims to distinguish anomalous graphs within a graph dataset. However, current methods are constrained by their receptive fields, struggling to learn global features within the graphs. Moreover, most contemporary methods are based on spatial domain and lack exploration of spectral characteristics. In this paper, we propose a multi-perspective hybrid graph-level… ▽ More

    Submitted 3 July, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  5. arXiv:2405.00770  [pdf, other

    quant-ph cs.CC cs.LG

    Quantum-Classical Separations in Shallow-Circuit-Based Learning with and without Noises

    Authors: Zhihan Zhang, Weiyuan Gong, Weikang Li, Dong-Ling Deng

    Abstract: We study quantum-classical separations between classical and quantum supervised learning models based on constant depth (i.e., shallow) circuits, in scenarios with and without noises. We construct a classification problem defined by a noiseless shallow quantum circuit and rigorously prove that any classical neural network with bounded connectivity requires logarithmic depth to output correctly wit… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 14 pages, 3 figures

  6. arXiv:2404.19105  [pdf, other

    quant-ph cs.IT

    Optimal tradeoffs for estimating Pauli observables

    Authors: Sitan Chen, Weiyuan Gong, Qi Ye

    Abstract: We revisit the problem of Pauli shadow tomography: given copies of an unknown $n$-qubit quantum state $ρ$, estimate $\text{tr}(Pρ)$ for some set of Pauli operators $P$ to within additive error $ε$. This has been a popular testbed for exploring the advantage of protocols with quantum memory over those without: with enough memory to measure two copies at a time, one can use Bell sampling to estimate… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 59 pages, 1 figure

  7. arXiv:2404.12529  [pdf, other

    cs.NI cs.HC

    A Survey of Bluetooth Indoor Localization

    Authors: Taolei Shi, Wei Gong

    Abstract: Nowadays, indoor localization has received extensive research interest due to more and more applications' needs for location information to provide a more precise and effective service [1], [2]. There are various wireless techniques and mechanisms that have been proposed; some of them have been studied in depth and come into use, such as Wi-Fi, RFID, and sensor networks. In comparison, the develop… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 8 pages, 2 figures

  8. arXiv:2404.09622  [pdf, other

    cs.RO cs.AI

    DIDLM:A Comprehensive Multi-Sensor Dataset with Infrared Cameras, Depth Cameras, LiDAR, and 4D Millimeter-Wave Radar in Challenging Scenarios for 3D Mapping

    Authors: WeiSheng Gong, Chen He, KaiJie Su, QingYong Li

    Abstract: This study presents a comprehensive multi-sensor dataset designed for 3D mapping in challenging indoor and outdoor environments. The dataset comprises data from infrared cameras, depth cameras, LiDAR, and 4D millimeter-wave radar, facilitating exploration of advanced perception and mapping techniques. Integration of diverse sensor data enhances perceptual capabilities in extreme conditions such as… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  9. arXiv:2403.13869  [pdf, other

    cs.LG cs.AI

    Accurately Predicting Probabilities of Safety-Critical Rare Events for Intelligent Systems

    Authors: Ruoxuan Bai, Jingxuan Yang, Weiduo Gong, Yi Zhang, Qiujing Lu, Shuo Feng

    Abstract: Intelligent systems are increasingly integral to our daily lives, yet rare safety-critical events present significant latent threats to their practical deployment. Addressing this challenge hinges on accurately predicting the probability of safety-critical events occurring within a given time step from the current state, a metric we define as 'criticality'. The complexity of predicting criticality… ▽ More

    Submitted 5 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  10. arXiv:2403.01736  [pdf, other

    cs.CV

    Lightweight Object Detection: A Study Based on YOLOv7 Integrated with ShuffleNetv2 and Vision Transformer

    Authors: Wenkai Gong

    Abstract: As mobile computing technology rapidly evolves, deploying efficient object detection algorithms on mobile devices emerges as a pivotal research area in computer vision. This study zeroes in on optimizing the YOLOv7 algorithm to boost its operational efficiency and speed on mobile platforms while ensuring high accuracy. Leveraging a synergy of advanced techniques such as Group Convolution, ShuffleN… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  11. arXiv:2402.12381  [pdf, other

    cs.AI cs.NE

    Constrained Multi-objective Optimization with Deep Reinforcement Learning Assisted Operator Selection

    Authors: Fei Ming, Wenyin Gong, Ling Wang, Yaochu Jin

    Abstract: Solving constrained multi-objective optimization problems with evolutionary algorithms has attracted considerable attention. Various constrained multi-objective optimization evolutionary algorithms (CMOEAs) have been developed with the use of different algorithmic strategies, evolutionary operators, and constraint-handling techniques. The performance of CMOEAs may be heavily dependent on the opera… ▽ More

    Submitted 15 January, 2024; originally announced February 2024.

  12. arXiv:2402.06665  [pdf, other

    cs.AI cs.CL cs.LG cs.RO

    The Essential Role of Causality in Foundation World Models for Embodied AI

    Authors: Tarun Gupta, Wenbo Gong, Chao Ma, Nick Pawlowski, Agrin Hilmkil, Meyer Scetbon, Marc Rigter, Ade Famoti, Ashley Juan Llorens, Jianfeng Gao, Stefan Bauer, Danica Kragic, Bernhard Schölkopf, Cheng Zhang

    Abstract: Recent advances in foundation models, especially in large multi-modal models and conversational agents, have ignited interest in the potential of generally capable embodied agents. Such agents will require the ability to perform new tasks in many different real-world environments. However, current foundation models fail to accurately model physical interactions and are therefore insufficient for E… ▽ More

    Submitted 29 April, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  13. arXiv:2402.00763  [pdf, other

    cs.CV cs.GR

    360-GS: Layout-guided Panoramic Gaussian Splatting For Indoor Roaming

    Authors: Jiayang Bai, Letian Huang, Jie Guo, Wen Gong, Yuanqi Li, Yanwen Guo

    Abstract: 3D Gaussian Splatting (3D-GS) has recently attracted great attention with real-time and photo-realistic renderings. This technique typically takes perspective images as input and optimizes a set of 3D elliptical Gaussians by splatting them onto the image planes, resulting in 2D Gaussians. However, applying 3D-GS to panoramic inputs presents challenges in effectively modeling the projection onto th… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 11 pages, 10 figures

  14. RecDCL: Dual Contrastive Learning for Recommendation

    Authors: Dan Zhang, Yangliao Geng, Wenwen Gong, Zhongang Qi, Zhiyu Chen, Xing Tang, Ying Shan, Yuxiao Dong, Jie Tang

    Abstract: Self-supervised learning (SSL) has recently achieved great success in mining the user-item interactions for collaborative filtering. As a major paradigm, contrastive learning (CL) based SSL helps address data sparsity in Web platforms by contrasting the embeddings between raw and augmented data. However, existing CL-based methods mostly focus on contrasting in a batch-wise way, failing to exploit… ▽ More

    Submitted 18 February, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    Comments: Accepted to WWW 2024

    Journal ref: Proceedings of TheWebConf 2024 (WWW '24), May 13--17, 2024, Singapore

  15. Exploring consumers response to text-based chatbots in e-commerce: The moderating role of task complexity and chatbot disclosure

    Authors: Xusen Cheng, Ying Bao, Alex Zarifis, Wankun Gong, Jian Mou

    Abstract: Artificial intelligence based chatbots have brought unprecedented business potential. This study aims to explore consumers trust and response to a text-based chatbot in ecommerce, involving the moderating effects of task complexity and chatbot identity disclosure. A survey method with 299 useable responses was conducted in this research. This study adopted the ordinary least squares regression to… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: Internet Research (2021)

  16. arXiv:2312.08867  [pdf, other

    quant-ph cs.DS

    Complexity of Digital Quantum Simulation in the Low-Energy Subspace: Applications and a Lower Bound

    Authors: Weiyuan Gong, Shuo Zhou, Tongyang Li

    Abstract: Digital quantum simulation has broad applications in approximating unitary evolution of Hamiltonians. In practice, many simulation tasks for quantum systems focus on quantum states in the low-energy subspace instead of the entire Hilbert space. In this paper, we systematically investigate the complexity of digital quantum simulation based on product formulas in the low-energy subspace. We show tha… ▽ More

    Submitted 1 July, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 34 pages, 4 figures, github repo: https://github.com/Qubit-Fernand/Digital-Quantum-Simulation

  17. arXiv:2312.08134  [pdf, other

    cs.NE

    MToP: A MATLAB Optimization Platform for Evolutionary Multitasking

    Authors: Yanchi Li, Wenyin Gong, Fei Ming, Tingyu Zhang, Shuijia Li, Qiong Gu

    Abstract: Evolutionary multitasking (EMT) has emerged as a popular topic of evolutionary computation over the past years. It aims to concurrently address multiple optimization tasks within limited computing resources, leveraging inter-task knowledge transfer techniques. Despite the abundance of multitask evolutionary algorithms (MTEAs) proposed for multitask optimization (MTO), there remains a comprehensive… ▽ More

    Submitted 9 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  18. Bridge the Present and Future: A Cross-Layer Matching Game in Dynamic Cloud-Aided Mobile Edge Networks

    Authors: Houyi Qi, Minghui Liwang, Xianbin Wang, Li Li, Wei Gong, Jian Jin, Zhenzhen Jiao

    Abstract: Cloud-aided mobile edge networks (CAMENs) allow edge servers (ESs) to purchase resources from remote cloud servers (CSs), while overcoming resource shortage when handling computation-intensive tasks of mobile users (MUs). Conventional trading mechanisms (e.g., onsite trading) confront many challenges, including decision-making overhead (e.g., latency) and potential trading failures. This paper inv… ▽ More

    Submitted 8 June, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Journal ref: IEEE Transactions on Mobile Computing,2024

  19. arXiv:2311.03309  [pdf, other

    cs.LG cs.AI stat.ML

    Neural Structure Learning with Stochastic Differential Equations

    Authors: Benjie Wang, Joel Jennings, Wenbo Gong

    Abstract: Discovering the underlying relationships among variables from temporal observations has been a longstanding challenge in numerous scientific disciplines, including biology, finance, and climate science. The dynamics of such systems are often best described using continuous-time stochastic processes. Unfortunately, most existing structure learning approaches assume that the underlying process evolv… ▽ More

    Submitted 5 May, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: ICLR 2024

  20. arXiv:2309.14326  [pdf, other

    quant-ph cs.CC cs.IT cs.LG math.ST

    Efficient Pauli channel estimation with logarithmic quantum memory

    Authors: Sitan Chen, Weiyuan Gong

    Abstract: Here we revisit one of the prototypical tasks for characterizing the structure of noise in quantum devices: estimating every eigenvalue of an $n$-qubit Pauli noise channel to error $ε$. Prior work (Chen et al., 2022) proved no-go theorems for this task in the practical regime where one has a limited amount of quantum memory, e.g. any protocol with $\le 0.99n$ ancilla qubits of quantum memory must… ▽ More

    Submitted 30 November, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: 57 pages, 3 figures

  21. arXiv:2308.00531  [pdf, ps, other

    cs.NI

    Adaptive Bitrate Video Semantic Communication over Wireless Networks

    Authors: Wentao Gong, Haonan Tong, Sihua Wang, Zhaohui Yang, Xinxin He, Changchuan Yin

    Abstract: This paper investigates the adaptive bitrate (ABR) video semantic communication over wireless networks. In the considered model, video sensing devices must transmit video semantic information to an edge server, to facilitate ubiquitous video sensing services such as road environment monitoring at the edge server in autonomous driving scenario. However, due to the varying wireless network condition… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  22. arXiv:2307.13917  [pdf, other

    cs.LG stat.ME

    BayesDAG: Gradient-Based Posterior Inference for Causal Discovery

    Authors: Yashas Annadani, Nick Pawlowski, Joel Jennings, Stefan Bauer, Cheng Zhang, Wenbo Gong

    Abstract: Bayesian causal discovery aims to infer the posterior distribution over causal models from observed data, quantifying epistemic uncertainty and benefiting downstream tasks. However, computational challenges arise due to joint inference over combinatorial space of Directed Acyclic Graphs (DAGs) and nonlinear functions. Despite recent progress towards efficient posterior inference over DAGs, existin… ▽ More

    Submitted 8 December, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  23. arXiv:2307.13028  [pdf, other

    quant-ph cs.DS

    Improved Digital Quantum Simulation by Non-Unitary Channels

    Authors: W. Gong, Yaroslav Kharkov, Minh C. Tran, Przemyslaw Bienias, Alexey V. Gorshkov

    Abstract: Simulating quantum systems is one of the most promising avenues to harness the computational power of quantum computers. However, hardware errors in noisy near-term devices remain a major obstacle for applications. Ideas based on the randomization of Suzuki-Trotter product formulas have been shown to be a powerful approach to reducing the errors of quantum simulation and lowering the gate count. I… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 24 pages, 9 figures

  24. arXiv:2306.06629  [pdf, other

    cs.CL cs.AI

    GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model

    Authors: Shicheng Tan, Weng Lam Tam, Yuanchun Wang, Wenwen Gong, Yang Yang, Hongyin Tang, Keqing He, Jiahao Liu, Jingang Wang, Shu Zhao, Peng Zhang, Jie Tang

    Abstract: Currently, the reduction in the parameter scale of large-scale pre-trained language models (PLMs) through knowledge distillation has greatly facilitated their widespread deployment on various devices. However, the deployment of knowledge distillation systems faces great challenges in real-world industrial-strength applications, which require the use of complex distillation methods on even larger-s… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: accepted for ACL 2023 industry track

  25. arXiv:2306.06625  [pdf, other

    cs.CL cs.AI

    Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method

    Authors: Shicheng Tan, Weng Lam Tam, Yuanchun Wang, Wenwen Gong, Shu Zhao, Peng Zhang, Jie Tang

    Abstract: The large scale of pre-trained language models poses a challenge for their deployment on various devices, with a growing emphasis on methods to compress these models, particularly knowledge distillation. However, current knowledge distillation methods rely on the model's intermediate layer features and the golden labels (also called hard labels), which usually require aligned model architecture an… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Accepted to Findings of ACL2023

  26. arXiv:2305.09331  [pdf, other

    cs.NI eess.SP

    Energy-Efficient WiFi Backscatter Communication for Green IoTs

    Authors: Yimeng Huang, Lijie Liu, Jihong Yu, Yuguang Fang, Wei Gong

    Abstract: The boom of the Internet of Things has revolutionized people's lives, but it has also resulted in massive resource consumption and environmental pollution. Recently, Green IoT (GIoT) has become a worldwide consensus to address this issue. In this paper, we propose EEWScatter, an energy-efficient WiFi backscatter communication system to pursue the goal of GIoT. Unlike previous backscatter systems t… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  27. arXiv:2305.06563  [pdf, other

    stat.ML cs.LG

    Spatiotemporal Regularized Tucker Decomposition Approach for Traffic Data Imputation

    Authors: Wenwu Gong, Zhejun Huang, Lili Yang

    Abstract: In intelligent transportation systems, traffic data imputation, estimating the missing value from partially observed data is an inevitable and challenging task. Previous studies have not fully considered traffic data's multidimensionality and spatiotemporal correlations, but they are vital to traffic data recovery, especially for high-level missing scenarios. To address this problem, we propose a… ▽ More

    Submitted 30 October, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  28. arXiv:2304.05524  [pdf, other

    cs.LG cs.CL

    Understanding Causality with Large Language Models: Feasibility and Opportunities

    Authors: Cheng Zhang, Stefan Bauer, Paul Bennett, Jiangfeng Gao, Wenbo Gong, Agrin Hilmkil, Joel Jennings, Chao Ma, Tom Minka, Nick Pawlowski, James Vaughan

    Abstract: We assess the ability of large language models (LLMs) to answer causal questions by analyzing their strengths and weaknesses against three types of causal question. We believe that current LLMs can answer causal questions with existing causal knowledge as combined domain experts. However, they are not yet able to provide satisfactory answers for discovering new knowledge or for high-stakes decisio… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  29. arXiv:2303.12363  [pdf

    cs.LG cs.CR

    Distribution-restrained Softmax Loss for the Model Robustness

    Authors: Hao Wang, Chen Li, Jinzhe Jiang, Xin Zhang, Yaqian Zhao, Weifeng Gong

    Abstract: Recently, the robustness of deep learning models has received widespread attention, and various methods for improving model robustness have been proposed, including adversarial training, model architecture modification, design of loss functions, certified defenses, and so on. However, the principle of the robustness to attacks is still not fully understood, also the related research is still not s… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    MSC Class: 68T45

  30. arXiv:2303.08969  [pdf, other

    cs.AI eess.SP

    Relative coordinates are crucial for Ulam's "trick to the train of thought"

    Authors: Weibo Gong, Chirag S. Trasikar, Bradley Zylstra

    Abstract: Spatial signal processing algorithms often use pre-given coordinate systems to label pixel positions. These processing algorithms are thus burdened by an external reference grid, making the acquisition of relative, intrinsic features difficult. This is in contrast to animal vision and cognition: animals recognize features without an external coordinate system. We show that a coordinate system-inde… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 19 pages, 10 figures, conference

    ACM Class: I.2.0

  31. arXiv:2303.05779  [pdf, other

    cs.NI

    CRC-based Reliable WiFi Backscatter Communiation for Supply Chain Management

    Authors: Yun-Hao Liu, Tao Liu, Yimeng Huang, Han Ding, Wei Xi, Wei Gong

    Abstract: Supply chain management is aimed to keep going long-term performance of the supply chain and minimize the costs. Backscatter technology provides a more efficient way of being able to identify items and real-time monitoring. Among the backscatter systems, the ambient backscatter communication (AmBC) system provides a prospect of ultra-low energy consumption and does not require controlled excitatio… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

  32. arXiv:2303.05775  [pdf, other

    cs.CV cs.GR

    Self-NeRF: A Self-Training Pipeline for Few-Shot Neural Radiance Fields

    Authors: Jiayang Bai, Letian Huang, Wen Gong, Jie Guo, Yanwen Guo

    Abstract: Recently, Neural Radiance Fields (NeRF) have emerged as a potent method for synthesizing novel views from a dense set of images. Despite its impressive performance, NeRF is plagued by its necessity for numerous calibrated views and its accuracy diminishes significantly in a few-shot setting. To address this challenge, we propose Self-NeRF, a self-evolved NeRF that iteratively refines the radiance… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: 11 pages, 11 figures

  33. A Visual Representation-guided Framework with Global Affinity for Weakly Supervised Salient Object Detection

    Authors: Binwei Xu, Haoran Liang, Weihua Gong, Ronghua Liang, Peng Chen

    Abstract: Fully supervised salient object detection (SOD) methods have made considerable progress in performance, yet these models rely heavily on expensive pixel-wise labels. Recently, to achieve a trade-off between labeling burden and performance, scribble-based SOD methods have attracted increasing attention. Previous scribble-based models directly implement the SOD task only based on SOD training data w… ▽ More

    Submitted 8 June, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  34. arXiv:2301.07868  [pdf, other

    cs.CV

    MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval

    Authors: Xiaojie Jin, Bowen Zhang, Weibo Gong, Kai Xu, XueQing Deng, Peng Wang, Zhao Zhang, Xiaohui Shen, Jiashi Feng

    Abstract: State-of-the-art video-text retrieval (VTR) methods typically involve fully fine-tuning a pre-trained model (e.g. CLIP) on specific datasets. However, this can result in significant storage costs in practical applications as a separate model per task must be stored. To address this issue, we present our pioneering work that enables parameter-efficient VTR using a pre-trained model, with only a sma… ▽ More

    Submitted 11 April, 2024; v1 submitted 18 January, 2023; originally announced January 2023.

  35. arXiv:2212.02548  [pdf, other

    quant-ph cs.DS

    Robustness of Quantum Algorithms for Nonconvex Optimization

    Authors: Weiyuan Gong, Chenyi Zhang, Tongyang Li

    Abstract: Recent results suggest that quantum computers possess the potential to speed up nonconvex optimization problems. However, a crucial factor for the implementation of quantum optimization algorithms is their robustness against experimental and statistical noises. In this paper, we systematically study quantum algorithms for finding an $ε$-approximate second-order stationary point ($ε$-SOSP) of a… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

  36. arXiv:2212.02531  [pdf, other

    quant-ph cond-mat.dis-nn cs.AI cs.LG

    Enhancing Quantum Adversarial Robustness by Randomized Encodings

    Authors: Weiyuan Gong, Dong Yuan, Weikang Li, Dong-Ling Deng

    Abstract: The interplay between quantum physics and machine learning gives rise to the emergent frontier of quantum machine learning, where advanced quantum learning models may outperform their classical counterparts in solving certain challenging problems. However, quantum learning systems are vulnerable to adversarial attacks: adding tiny carefully-crafted perturbations on legitimate input samples can cau… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

  37. arXiv:2210.14706  [pdf, other

    cs.LG cs.AI stat.ML

    Rhino: Deep Causal Temporal Relationship Learning With History-dependent Noise

    Authors: Wenbo Gong, Joel Jennings, Cheng Zhang, Nick Pawlowski

    Abstract: Discovering causal relationships between different variables from time series data has been a long-standing challenge for many domains such as climate science, finance, and healthcare. Given the complexity of real-world relationships and the nature of observations in discrete time, causal discovery methods need to consider non-linear relations between variables, instantaneous effects and history-d… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: 28 pages, 8 figures, 5 tables

  38. arXiv:2210.04410  [pdf, other

    cs.DC

    Accelerating the Delivery of Data Services over Uncertain Mobile Crowdsensing Networks

    Authors: Minghui Liwang, Zhipeng Cheng, Wei Gong, Li Li, Yuhan Su, Zhenzhen Jiao, Seyyedali Hosseinalipour, Xianbin Wang, Huaiyu Dai

    Abstract: The challenge of exchanging and processing of big data over mobile crowdsensing (MCS) networks calls for designing seamless data service provisioning mechanisms to enable utilization of resources of mobile devices/users for crowdsensing tasks. Although conventional onsite spot trading of resources based on real-time network conditions can facilitate data sharing, it often suffers from prohibitivel… ▽ More

    Submitted 8 April, 2024; v1 submitted 9 October, 2022; originally announced October 2022.

  39. arXiv:2209.03007  [pdf, ps, other

    quant-ph cs.CC

    Learning Distributions over Quantum Measurement Outcomes

    Authors: Weiyuan Gong, Scott Aaronson

    Abstract: Shadow tomography for quantum states provides a sample efficient approach for predicting the properties of quantum systems when the properties are restricted to expectation values of $2$-outcome POVMs. However, these shadow tomography procedures yield poor bounds if there are more than 2 outcomes per measurement. In this paper, we consider a general problem of learning properties from unknown quan… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: 25 pages

  40. arXiv:2208.12610  [pdf, ps, other

    cs.CY cs.AI cs.LG

    NeurIPS Competition Instructions and Guide: Causal Insights for Learning Paths in Education

    Authors: Wenbo Gong, Digory Smith, Zichao Wang, Craig Barton, Simon Woodhead, Nick Pawlowski, Joel Jennings, Cheng Zhang

    Abstract: In this competition, participants will address two fundamental causal challenges in machine learning in the context of education using time-series data. The first is to identify the causal relationships between different constructs, where a construct is defined as the smallest element of learning. The second challenge is to predict the impact of learning one construct on the ability to answer ques… ▽ More

    Submitted 31 August, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

    Comments: 19 pages, NeurIPS 2022 Competition Track

  41. arXiv:2205.10034  [pdf, other

    cs.DC cs.AI

    SE-MoE: A Scalable and Efficient Mixture-of-Experts Distributed Training and Inference System

    Authors: Liang Shen, Zhihua Wu, WeiBao Gong, Hongxiang Hao, Yangfan Bai, HuaChao Wu, Xinxuan Wu, Jiang Bian, Haoyi Xiong, Dianhai Yu, Yanjun Ma

    Abstract: With the increasing diversity of ML infrastructures nowadays, distributed training over heterogeneous computing systems is desired to facilitate the production of big models. Mixture-of-Experts (MoE) models have been proposed to lower the cost of training subject to the overall size of models/data through gating and parallelism in a divide-and-conquer fashion. While DeepSpeed has made efforts in c… ▽ More

    Submitted 12 June, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

  42. arXiv:2205.09470  [pdf, other

    cs.LG cs.AI cs.DC

    Nebula-I: A General Framework for Collaboratively Training Deep Learning Models on Low-Bandwidth Cloud Clusters

    Authors: Yang Xiang, Zhihua Wu, Weibao Gong, Siyu Ding, Xianjie Mo, Yuang Liu, Shuohuan Wang, Peng Liu, Yongshuai Hou, Long Li, Bin Wang, Shaohuai Shi, Yaqian Han, Yue Yu, Ge Li, Yu Sun, Yanjun Ma, Dianhai Yu

    Abstract: The ever-growing model size and scale of compute have attracted increasing interests in training deep learning models over multiple nodes. However, when it comes to training on cloud clusters, especially across remote clusters, huge challenges are faced. In this work, we introduce a general framework, Nebula-I, for collaboratively training deep learning models over remote heterogeneous clusters, t… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 20 pages, 10 figures, technical report

  43. arXiv:2204.02008  [pdf, other

    cs.CV

    Learning Video Salient Object Detection Progressively from Unlabeled Videos

    Authors: Binwei Xu, Haoran Liang, Wentian Ni, Weihua Gong, Ronghua Liang, Peng Chen

    Abstract: Recent deep learning-based video salient object detection (VSOD) has achieved some breakthrough, but these methods rely on expensive annotated videos with pixel-wise annotations, weak annotations, or part of the pixel-wise annotations. In this paper, based on the similarities and the differences between VSOD and image salient object detection (SOD), we propose a novel VSOD method via a progressive… ▽ More

    Submitted 5 April, 2022; originally announced April 2022.

  44. arXiv:2202.02195  [pdf, other

    stat.ML cs.LG

    Deep End-to-end Causal Inference

    Authors: Tomas Geffner, Javier Antoran, Adam Foster, Wenbo Gong, Chao Ma, Emre Kiciman, Amit Sharma, Angus Lamb, Martin Kukla, Nick Pawlowski, Miltiadis Allamanis, Cheng Zhang

    Abstract: Causal inference is essential for data-driven decision making across domains such as business engagement, medical treatment and policy making. However, research on causal discovery has evolved separately from inference methods, preventing straight-forward combination of methods from both fields. In this work, we develop Deep End-to-end Causal Inference (DECI), a single flow-based non-linear additi… ▽ More

    Submitted 20 June, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

  45. arXiv:2112.12731  [pdf, other

    cs.CL

    ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation

    Authors: Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, Dianhai Yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu , et al. (4 additional authors not shown)

    Abstract: Pre-trained language models have achieved state-of-the-art results in various Natural Language Processing (NLP) tasks. GPT-3 has shown that scaling up pre-trained language models can further exploit their enormous potential. A unified framework named ERNIE 3.0 was recently proposed for pre-training large-scale knowledge enhanced models and trained a model with 10 billion parameters. ERNIE 3.0 outp… ▽ More

    Submitted 23 December, 2021; originally announced December 2021.

    Comments: arXiv admin note: text overlap with arXiv:2107.02137

  46. arXiv:2112.02752  [pdf, other

    cs.DC cs.AI cs.LG

    End-to-end Adaptive Distributed Training on PaddlePaddle

    Authors: Yulong Ao, Zhihua Wu, Dianhai Yu, Weibao Gong, Zhiqing Kui, Minxu Zhang, Zilingfeng Ye, Liang Shen, Yanjun Ma, Tian Wu, Haifeng Wang, Wei Zeng, Chao Yang

    Abstract: Distributed training has become a pervasive and effective approach for training a large neural network (NN) model with processing massive data. However, it is very challenging to satisfy requirements from various NN models, diverse computing resources, and their dynamic changes during a training job. In this study, we design our distributed training framework in a systematic end-to-end view to pro… ▽ More

    Submitted 5 December, 2021; originally announced December 2021.

    Comments: 16 pages, 10 figures, 4 tables

  47. arXiv:2111.02426  [pdf, other

    quant-ph cond-mat.dis-nn cs.LG

    Weighted Quantum Channel Compiling through Proximal Policy Optimization

    Authors: Weiyuan Gong, Si Jiang, Dong-Ling Deng

    Abstract: We propose a general and systematic strategy to compile arbitrary quantum channels without using ancillary qubits, based on proximal policy optimization -- a powerful deep reinforcement learning algorithm. We rigorously prove that, in sharp contrast to the case of compiling unitary gates, it is impossible to compile an arbitrary channel to arbitrary precision with any given finite elementary chann… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 14 pages, 4 figures

    Journal ref: Phys. Rev. Research 5, 013060 (2023)

  48. arXiv:2110.08223  [pdf, other

    cs.LG

    Simultaneous Missing Value Imputation and Structure Learning with Groups

    Authors: Pablo Morales-Alvarez, Wenbo Gong, Angus Lamb, Simon Woodhead, Simon Peyton Jones, Nick Pawlowski, Miltiadis Allamanis, Cheng Zhang

    Abstract: Learning structures between groups of variables from data with missing values is an important task in the real world, yet difficult to solve. One typical scenario is discovering the structure among topics in the education domain to identify learning pathways. Here, the observations are student performances for questions under each topic which contain missing values. However, most existing methods… ▽ More

    Submitted 24 February, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  49. arXiv:2107.10538  [pdf, other

    cs.SI

    Diversified and Compatible Web APIs Recommendation in IoT

    Authors: Wenwen Gong, Huiping Wu, Xiaokang Wang, Xuyun Zhang, Yawei Wang, Yifei Chen, Mohammad R. Khosravi

    Abstract: With the ever-increasing popularity of Service-oriented Architecture (SoA) and Internet of Things (IoT), a considerable number of enterprises or organizations are attempting to encapsulate their provided complex business services into various lightweight and accessible web APIs (application programming interfaces) with diverse functions. In this situation, a software developer can select a group o… ▽ More

    Submitted 11 August, 2021; v1 submitted 22 July, 2021; originally announced July 2021.

    Comments: 15 pages, 11 figures

  50. arXiv:2107.10072  [pdf, other

    cs.LG stat.ML

    Interpreting diffusion score matching using normalizing flow

    Authors: Wenbo Gong, Yingzhen Li

    Abstract: Scoring matching (SM), and its related counterpart, Stein discrepancy (SD) have achieved great success in model training and evaluations. However, recent research shows their limitations when dealing with certain types of distributions. One possible fix is incorporating the original score matching (or Stein discrepancy) with a diffusion matrix, which is called diffusion score matching (DSM) (or di… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: 8 pages, International Conference on Machine Learning (ICML) INNF+ 2021 Workshop Spotlight