Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 151–200 of 502 results for author: He, D

.
  1. arXiv:2207.00239  [pdf, other

    math.GT

    hyperbolic fibered slice knots with right-veering monodromy

    Authors: Dongtai He

    Abstract: We construct a hyperbolic fibered slice knot with right-veering monodromy, giving a negative answer to the question posed by Hubbard-Kawamuro-Kose-Martin-Plamenevskaya-Raoux-Truong-Turner.

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: 6 pages, 6 figures

    MSC Class: 57K10 (primary) 57K20; 57K33 (secondary)

  2. arXiv:2206.13953  [pdf, other

    cs.LG

    RAW-GNN: RAndom Walk Aggregation based Graph Neural Network

    Authors: Di Jin, Rui Wang, Meng Ge, Dongxiao He, Xiang Li, Wei Lin, Weixiong Zhang

    Abstract: Graph-Convolution-based methods have been successfully applied to representation learning on homophily graphs where nodes with the same label or similar attributes tend to connect with one another. Due to the homophily assumption of Graph Convolutional Networks (GCNs) that these methods use, they are not suitable for heterophily graphs where nodes with different labels or dissimilar attributes ten… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

  3. Pair production of neutral Higgs particles in the B-LSSM

    Authors: Dan He, Tai-Fu Feng, Jin-Lei Yang, Guo-Zhu Ning, Hai-Bin Zhang, Xing-Xing Dong

    Abstract: Higgs pair production provides a unique handle for measuring the strength of Higgs self interaction and constraining the shape of the Higgs potential. Including radiative corrections to the trilinear couplings of $CP$-even Higgs, we investigate the cross section of the lightest neutral Higgs pair production in gluon fusion at the Large Hadron Collider in the supersymmetric extensions of the standa… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: 37 pages, 13 figures, accepted for publication in Journal of Physics G: Nuclear and Particle Physics

    Journal ref: J. Phys. G 49, 085002 (2022)

  4. arXiv:2206.04316  [pdf, other

    cs.LG cs.AI stat.ML

    Adversarial Noises Are Linearly Separable for (Nearly) Random Neural Networks

    Authors: Huishuai Zhang, Da Yu, Yiping Lu, Di He

    Abstract: Adversarial examples, which are usually generated for specific inputs with a specific model, are ubiquitous for neural networks. In this paper we unveil a surprising property of adversarial noises when they are put together, i.e., adversarial noises crafted by one-step gradient methods are linearly separable if equipped with the corresponding labels. We theoretically prove this property for a two-… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: 13 pages

  5. arXiv:2206.02016  [pdf, other

    cs.LG math.NA

    Is $L^2$ Physics-Informed Loss Always Suitable for Training Physics-Informed Neural Network?

    Authors: Chuwei Wang, Shanda Li, Di He, Liwei Wang

    Abstract: The Physics-Informed Neural Network (PINN) approach is a new and promising way to solve partial differential equations using deep learning. The $L^2$ Physics-Informed Loss is the de-facto standard in training Physics-Informed Neural Networks. In this paper, we challenge this common practice by investigating the relationship between the loss function and the approximation quality of the learned sol… ▽ More

    Submitted 30 December, 2022; v1 submitted 4 June, 2022; originally announced June 2022.

  6. arXiv:2205.14501  [pdf, other

    eess.IV

    PO-ELIC: Perception-Oriented Efficient Learned Image Coding

    Authors: Dailan He, Ziming Yang, Hongjiu Yu, Tongda Xu, Jixiang Luo, Yuan Chen, Chenjian Gao, Xinjie Shi, Hongwei Qin, Yan Wang

    Abstract: In the past years, learned image compression (LIC) has achieved remarkable performance. The recent LIC methods outperform VVC in both PSNR and MS-SSIM. However, the low bit-rate reconstructions of LIC suffer from artifacts such as blurring, color drifting and texture missing. Moreover, those varied artifacts make image quality metrics correlate badly with human perceptual quality. In this paper, w… ▽ More

    Submitted 28 May, 2022; originally announced May 2022.

    Comments: CVPR2022 Workshop, 5-th CLIC Image Compression Track

  7. arXiv:2205.13401  [pdf, other

    cs.LG cs.CL stat.ML

    Your Transformer May Not be as Powerful as You Expect

    Authors: Shengjie Luo, Shanda Li, Shuxin Zheng, Tie-Yan Liu, Liwei Wang, Di He

    Abstract: Relative Positional Encoding (RPE), which encodes the relative distance between any pair of tokens, is one of the most successful modifications to the original Transformer. As far as we know, theoretical understanding of the RPE-based Transformers is largely unexplored. In this work, we mathematically analyze the power of RPE-based Transformers regarding whether the model is capable of approximati… ▽ More

    Submitted 28 October, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: 22 pages; NeurIPS 2022, Camera Ready Version

  8. arXiv:2205.12784  [pdf, ps, other

    cs.LG

    TrustGNN: Graph Neural Network based Trust Evaluation via Learnable Propagative and Composable Nature

    Authors: Cuiying Huo, Di Jin, Chundong Liang, Dongxiao He, Tie Qiu, Lingfei Wu

    Abstract: Trust evaluation is critical for many applications such as cyber security, social communication and recommender systems. Users and trust relationships among them can be seen as a graph. Graph neural networks (GNNs) show their powerful ability for analyzing graph-structural data. Very recently, existing work attempted to introduce the attributes and asymmetry of edges into GNNs for trust evaluation… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

  9. arXiv:2205.08055  [pdf

    q-bio.BM cs.AI cs.LG q-bio.QM

    HelixADMET: a robust and endpoint extensible ADMET system incorporating self-supervised knowledge transfer

    Authors: Shanzhuo Zhang, Zhiyuan Yan, Yueyang Huang, Lihang Liu, Donglong He, Wei Wang, Xiaomin Fang, Xiaonan Zhang, Fan Wang, Hua Wu, Haifeng Wang

    Abstract: Accurate ADMET (an abbreviation for "absorption, distribution, metabolism, excretion, and toxicity") predictions can efficiently screen out undesirable drug candidates in the early stage of drug discovery. In recent years, multiple comprehensive ADMET systems that adopt advanced machine learning models have been developed, providing services to estimate multiple endpoints. However, those ADMET sys… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Journal ref: Bioinformatics, 2022

  10. arXiv:2205.00256  [pdf, other

    cs.LG

    Heterogeneous Graph Neural Networks using Self-supervised Reciprocally Contrastive Learning

    Authors: Cuiying Huo, Dongxiao He, Yawen Li, Di Jin, Jianwu Dang, Weixiong Zhang, Witold Pedrycz, Lingfei Wu

    Abstract: Heterogeneous graph neural network (HGNN) is a very popular technique for the modeling and analysis of heterogeneous graphs. Most existing HGNN-based approaches are supervised or semi-supervised learning methods requiring graphs to be annotated, which is costly and time-consuming. Self-supervised contrastive learning has been proposed to address the problem of requiring annotated data by mining in… ▽ More

    Submitted 16 November, 2023; v1 submitted 30 April, 2022; originally announced May 2022.

  11. arXiv:2204.06644  [pdf, other

    cs.LG cs.AI cs.CL

    METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals

    Authors: Payal Bajaj, Chenyan Xiong, Guolin Ke, Xiaodong Liu, Di He, Saurabh Tiwary, Tie-Yan Liu, Paul Bennett, Xia Song, Jianfeng Gao

    Abstract: We present an efficient method of pretraining large-scale autoencoding language models using training signals generated by an auxiliary model. Originated in ELECTRA, this training strategy has demonstrated sample-efficiency to pretrain models at the scale of hundreds of millions of parameters. In this work, we conduct a comprehensive empirical study, and propose a recipe, namely "Model generated d… ▽ More

    Submitted 16 April, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: Update details in scaled initialization and add acknowledgement

  12. The strong coupling $g_{X J/ψφ}$ of $X(4700) \to J/ψφ$ in the light-cone sum rules

    Authors: Yiling Xie, Dazhuang He, Xuan Luo, Hao Sun

    Abstract: We assign the scalar tetraquark and the D-wave tetraquark state for $X(4700)$ and calculate the width of the decay $X(4700)$ $\to J/ψφ$ within the framework of light-cone sum rules. The strong coupling $g_{X J/ψφ}$ is obtained by considering the technique of soft-meson approximation. We also investigate the mass and the decay constant of $X(4700)$ in the framework of SVZ sum rules. Our prediction… ▽ More

    Submitted 28 September, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Journal ref: Nuclear Physics B Volume 987, February 2023, 116113

  13. arXiv:2203.16357  [pdf, other

    eess.IV cs.CV

    Practical Learned Lossless JPEG Recompression with Multi-Level Cross-Channel Entropy Model in the DCT Domain

    Authors: Lina Guo, Xinjie Shi, Dailan He, Yuanyuan Wang, Rui Ma, Hongwei Qin, Yan Wang

    Abstract: JPEG is a popular image compression method widely used by individuals, data center, cloud storage and network filesystems. However, most recent progress on image compression mainly focuses on uncompressed images while ignoring trillions of already-existing JPEG images. To compress these JPEG images adequately and restore them back to JPEG format losslessly when needed, we propose a deep learning b… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: CVPR 2022

  14. arXiv:2203.10886  [pdf, other

    cs.CV eess.IV

    ELIC: Efficient Learned Image Compression with Unevenly Grouped Space-Channel Contextual Adaptive Coding

    Authors: Dailan He, Ziming Yang, Weikun Peng, Rui Ma, Hongwei Qin, Yan Wang

    Abstract: Recently, learned image compression techniques have achieved remarkable performance, even surpassing the best manually designed lossy image coders. They are promising to be large-scale adopted. For the sake of practicality, a thorough investigation of the architecture design of learned image compression, regarding both compression performance and running speed, is essential. In this paper, we firs… ▽ More

    Submitted 29 March, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: accepted by CVPR 2022 (oral)

  15. arXiv:2203.06123   

    physics.chem-ph cs.CE cs.LG

    An Empirical Study of Graphormer on Large-Scale Molecular Modeling Datasets

    Authors: Yu Shi, Shuxin Zheng, Guolin Ke, Yifei Shen, Jiacheng You, Jiyan He, Shengjie Luo, Chang Liu, Di He, Tie-Yan Liu

    Abstract: This technical note describes the recent updates of Graphormer, including architecture design modifications, and the adaption to 3D molecular dynamics simulation. The "Graphormer-V2" could attain better results on large-scale molecular modeling datasets than the vanilla one, and the performance gain could be consistently obtained on downstream tasks. In addition, we show that with a global recepti… ▽ More

    Submitted 14 March, 2022; v1 submitted 28 February, 2022; originally announced March 2022.

    Comments: Wrong dual-submission (arXiv:2203.04810) with negligently

  16. arXiv:2203.04810  [pdf, ps, other

    cs.LG

    Benchmarking Graphormer on Large-Scale Molecular Modeling Datasets

    Authors: Yu Shi, Shuxin Zheng, Guolin Ke, Yifei Shen, Jiacheng You, Jiyan He, Shengjie Luo, Chang Liu, Di He, Tie-Yan Liu

    Abstract: This technical note describes the recent updates of Graphormer, including architecture design modifications, and the adaption to 3D molecular dynamics simulation. With these simple modifications, Graphormer could attain better results on large-scale molecular modeling datasets than the vanilla one, and the performance gain could be consistently obtained on 2D and 3D molecular graph modeling tasks.… ▽ More

    Submitted 7 January, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

  17. arXiv:2203.02792  [pdf, other

    cs.CV

    Adversarial Dual-Student with Differentiable Spatial Warping for Semi-Supervised Semantic Segmentation

    Authors: Cong Cao, Tianwei Lin, Dongliang He, Fu Li, Huanjing Yue, Jingyu Yang, Errui Ding

    Abstract: A common challenge posed to robust semantic segmentation is the expensive data annotation cost. Existing semi-supervised solutions show great potential for solving this problem. Their key idea is constructing consistency regularization with unsupervised data augmentation from unlabeled data for model training. The perturbations for unlabeled data enable the consistency training loss, which benefit… ▽ More

    Submitted 27 September, 2022; v1 submitted 5 March, 2022; originally announced March 2022.

    Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

  18. arXiv:2203.01877  [pdf, other

    cs.DB cs.AI cs.LG

    Query Processing on Tensor Computation Runtimes

    Authors: Dong He, Supun Nakandala, Dalitso Banda, Rathijit Sen, Karla Saur, Kwanghyun Park, Carlo Curino, Jesús Camacho-Rodríguez, Konstantinos Karanasos, Matteo Interlandi

    Abstract: The huge demand for computation in artificial intelligence (AI) is driving unparalleled investments in hardware and software systems for AI. This leads to an explosion in the number of specialized hardware devices, which are now offered by major cloud vendors. By hiding the low-level complexity through a tensor-based interface, tensor computation runtimes (TCRs) such as PyTorch allow data scientis… ▽ More

    Submitted 9 February, 2023; v1 submitted 3 March, 2022; originally announced March 2022.

    Journal ref: Proceedings of the VLDB Endowment, 15(11): 2811 - 2825, 2022

  19. arXiv:2203.00911  [pdf, other

    eess.IV cs.CV

    Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence

    Authors: Zhihong Pan, Baopu Li, Dongliang He, Mingde Yao, Wenhao Wu, Tianwei Lin, Xin Li, Errui Ding

    Abstract: Deep learning based single image super-resolution models have been widely studied and superb results are achieved in upscaling low-resolution images with fixed scale factor and downscaling degradation kernel. To improve real world applicability of such models, there are growing interests to develop models optimized for arbitrary upscaling factors. Our proposed method is the first to treat arbitrar… ▽ More

    Submitted 7 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: To appear at CVPR 2022

  20. arXiv:2202.12080  [pdf, other

    quant-ph cond-mat.mes-hall cond-mat.supr-con

    Resonance Fluorescence from a two-level artificial atom strongly coupled to a single-mode cavity

    Authors: Z. H. Peng, D. He, Y. Zhou, J. H. Ding, J. Lu, L. Zhou, J. Q. Liao, L. M. Kuang, Yu-xi Liu, Oleg V. Astafiev, J. S. Tsai

    Abstract: We experimentally demonstrate the resonance fluorescence of a two-level artificial atom strongly coupled to a single-mode cavity field. The effect was theoretically predicted thirty years ago by Savage [Phys. Rev. Lett. 63, 1376 (1989)]. The system consists of a superconducting qubit circuit and a one-dimensional transmission line resonator. In addition, a one-dimensional transmission line strongl… ▽ More

    Submitted 12 April, 2023; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: 6 pages, 4 figures

  21. arXiv:2202.10593  [pdf, other

    eess.AS cs.SD

    VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition

    Authors: Jinhan Wang, Xiaosu Tong, Jinxi Guo, Di He, Roland Maas

    Abstract: While end-to-end models have shown great success on the Automatic Speech Recognition task, performance degrades severely when target sentences are long-form. The previous proposed methods, (partial) overlapping inference are shown to be effective on long-form decoding. For both methods, word error rate (WER) decreases monotonically when overlapping percentage decreases. Setting aside computational… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

  22. arXiv:2202.09340  [pdf, other

    cs.LG

    Learning Physics-Informed Neural Networks without Stacked Back-propagation

    Authors: Di He, Shanda Li, Wenlei Shi, Xiaotian Gao, Jia Zhang, Jiang Bian, Liwei Wang, Tie-Yan Liu

    Abstract: Physics-Informed Neural Network (PINN) has become a commonly used machine learning approach to solve partial differential equations (PDE). But, facing high-dimensional secondorder PDE problems, PINN will suffer from severe scalability issues since its loss includes second-order derivatives, the computational cost of which will grow along with the dimension during stacked back-propagation. In this… ▽ More

    Submitted 24 February, 2023; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: AISTATS 2023

  23. arXiv:2202.07919  [pdf, other

    cs.AI

    HousE: Knowledge Graph Embedding with Householder Parameterization

    Authors: Rui Li, Jianan Zhao, Chaozhuo Li, Di He, Yiqi Wang, Yuming Liu, Hao Sun, Senzhang Wang, Weiwei Deng, Yanming Shen, Xing Xie, Qi Zhang

    Abstract: The effectiveness of knowledge graph embedding (KGE) largely depends on the ability to model intrinsic relation patterns and mapping properties. However, existing approaches can only capture some of them with insufficient modeling capacity. In this work, we propose a more powerful KGE framework named HousE, which involves a novel parameterization based on two kinds of Householder transformations:… ▽ More

    Submitted 19 June, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Accepted by ICML 2022

  24. arXiv:2202.07513  [pdf, other

    eess.IV cs.CV

    Post-Training Quantization for Cross-Platform Learned Image Compression

    Authors: Dailan He, Ziming Yang, Yuan Chen, Qi Zhang, Hongwei Qin, Yan Wang

    Abstract: It has been witnessed that learned image compression has outperformed conventional image coding techniques and tends to be practical in industrial applications. One of the most critical issues that need to be considered is the non-deterministic calculation, which makes the probability prediction cross-platform inconsistent and frustrates successful decoding. We propose to solve this problem by int… ▽ More

    Submitted 30 November, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  25. arXiv:2201.04306  [pdf, ps, other

    cond-mat.mtrl-sci

    Weyl-type nodal chains in X2MnO4 (X= Li, Na)

    Authors: R. R. Kang, S. D. He, P. Zhou, L. Z. Sun

    Abstract: Recently, magnetic topological semimetals have received a lot of attention due to their potential applications in the field of spintronics. By using first-principles calculations, we propose that two ferromagnetic spinel materials of X2MnO4 (X=Li, Na) have Weyl-type nodal chains around the Fermi level. Their stabilities are validated by cohesive energies, phonon dispersions, and elastic constants.… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

    Comments: 7 pages, 6 figures

  26. arXiv:2201.04302  [pdf, other

    eess.IV cs.LG

    De-Noising of Photoacoustic Microscopy Images by Deep Learning

    Authors: Da He, Jiasheng Zhou, Xiaoyu Shang, Jiajia Luo, Sung-Liang Chen

    Abstract: As a hybrid imaging technology, photoacoustic microscopy (PAM) imaging suffers from noise due to the maximum permissible exposure of laser intensity, attenuation of ultrasound in the tissue, and the inherent noise of the transducer. De-noising is a post-processing method to reduce noise, and PAM image quality can be recovered. However, previous de-noising techniques usually heavily rely on mathema… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

    Comments: 12 pages, 8 figures

  27. arXiv:2201.02834  [pdf, other

    eess.SP cs.LG

    Reconfigurable Intelligent Surface Enabled Spatial Multiplexing with Fully Convolutional Network

    Authors: Bile Peng, Jan-Aike Termöhlen, Cong Sun, Danping He, Ke Guan, Tim Fingscheidt, Eduard A. Jorswieck

    Abstract: Reconfigurable intelligent surface (RIS) is an emerging technology for future wireless communication systems. In this work, we consider downlink spatial multiplexing enabled by the RIS for weighted sum-rate (WSR) maximization. In the literature, most solutions use alternating gradient-based optimization, which has moderate performance, high complexity, and limited scalability. We propose to apply… ▽ More

    Submitted 21 September, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

  28. arXiv:2201.00517  [pdf, ps, other

    hep-ph

    The study of lepton EDMs in $U(1)_X$ SSM

    Authors: Lu-Hao Su, Dan He, Xing-Xing Dong, Tai-Fu Feng, Shu-Min Zhao

    Abstract: The minimal supersymmetric extension of the standard model (MSSM) is extended to the $U(1)_X$SSM, whose local gauge group is $SU(3)_C \times SU(2)_L \times U(1)_Y \times U(1)_X$. To obtain the $U(1)_X$SSM, we add the new superfields to the MSSM, namely: three Higgs singlets $\hatη,~\hat{\barη},~\hat{S}$ and right-handed neutrinos $\hatν_i$. The CP violating effects are considered to study the lept… ▽ More

    Submitted 9 May, 2022; v1 submitted 3 January, 2022; originally announced January 2022.

    Comments: 23 pages,12 figures

  29. Distributed Evolution Strategies Using TPUs for Meta-Learning

    Authors: Alex Sheng, Derek He

    Abstract: Meta-learning traditionally relies on backpropagation through entire tasks to iteratively improve a model's learning dynamics. However, this approach is computationally intractable when scaled to complex tasks. We propose a distributed evolutionary meta-learning strategy using Tensor Processing Units (TPUs) that is highly parallel and scalable to arbitrarily long tasks with no increase in memory c… ▽ More

    Submitted 31 December, 2021; originally announced January 2022.

    Comments: Published in Proceedings of the 2020 IEEE Symposium Series on Computational Intelligence (SSCI)

    Journal ref: 2020 IEEE Symposium Series on Computational Intelligence (SSCI), 2020, pp. 721-728

  30. arXiv:2112.13562  [pdf, other

    cs.LG

    Powerful Graph Convolutioal Networks with Adaptive Propagation Mechanism for Homophily and Heterophily

    Authors: Tao Wang, Rui Wang, Di Jin, Dongxiao He, Yuxiao Huang

    Abstract: Graph Convolutional Networks (GCNs) have been widely applied in various fields due to their significant power on processing graph-structured data. Typical GCN and its variants work under a homophily assumption (i.e., nodes with same class are prone to connect to each other), while ignoring the heterophily which exists in many real-world networks (i.e., nodes with different classes tend to form edg… ▽ More

    Submitted 27 December, 2021; originally announced December 2021.

  31. arXiv:2112.13507  [pdf, other

    cs.LG cs.SI

    Block Modeling-Guided Graph Convolutional Neural Networks

    Authors: Dongxiao He, Chundong Liang, Huixin Liu, Mingxiang Wen, Pengfei Jiao, Zhiyong Feng

    Abstract: Graph Convolutional Network (GCN) has shown remarkable potential of exploring graph representation. However, the GCN aggregating mechanism fails to generalize to networks with heterophily where most nodes have neighbors from different classes, which commonly exists in real-world networks. In order to make the propagation and aggregation mechanism of GCN suitable for both homophily and heterophily… ▽ More

    Submitted 27 December, 2021; v1 submitted 26 December, 2021; originally announced December 2021.

    Comments: Accepted by Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)

  32. arXiv:2112.12441  [pdf, other

    cs.CL

    TOD-DA: Towards Boosting the Robustness of Task-oriented Dialogue Modeling on Spoken Conversations

    Authors: Xin Tian, Xinxian Huang, Dongfeng He, Yingzhan Lin, Siqi Bao, Huang He, Liankai Huang, Qiang Ju, Xiyuan Zhang, Jian Xie, Shuqi Sun, Fan Wang, Hua Wu, Haifeng Wang

    Abstract: Task-oriented dialogue systems have been plagued by the difficulties of obtaining large-scale and high-quality annotated conversations. Furthermore, most of the publicly available datasets only include written conversations, which are insufficient to reflect actual human behaviors in practical spoken dialogue systems. In this paper, we propose Task-oriented Dialogue Data Augmentation (TOD-DA), a n… ▽ More

    Submitted 23 December, 2021; originally announced December 2021.

    Comments: Accepted to the AAAI-22 DSTC10 Workshop. First three authors contributed equally to this work

  33. arXiv:2111.14283  [pdf, other

    q-bio.QM cs.AI cs.LG

    Exploration of Dark Chemical Genomics Space via Portal Learning: Applied to Targeting the Undruggable Genome and COVID-19 Anti-Infective Polypharmacology

    Authors: Tian Cai, Li Xie, Muge Chen, Yang Liu, Di He, Shuo Zhang, Cameron Mura, Philip E. Bourne, Lei Xie

    Abstract: Advances in biomedicine are largely fueled by exploring uncharted territories of human biology. Machine learning can both enable and accelerate discovery, but faces a fundamental hurdle when applied to unseen data with distributions that differ from previously observed ones -- a common dilemma in scientific inquiry. We have developed a new deep learning framework, called {\textit{Portal Learning}}… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: 18 pages, 6 figures

    MSC Class: 68T07

  34. arXiv:2111.13333  [pdf, other

    cs.CV

    Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model

    Authors: Zipeng Xu, Tianwei Lin, Hao Tang, Fu Li, Dongliang He, Nicu Sebe, Radu Timofte, Luc Van Gool, Errui Ding

    Abstract: To achieve disentangled image manipulation, previous works depend heavily on manual annotation. Meanwhile, the available manipulations are limited to a pre-defined set the models were trained for. We propose a novel framework, i.e., Predict, Prevent, and Evaluate (PPE), for disentangled text-driven image manipulation that requires little manual annotation while being applicable to a wide variety o… ▽ More

    Submitted 24 March, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: To appear in CVPR 2022

  35. arXiv:2111.04257  [pdf, other

    quant-ph physics.optics

    Transverse mode-encoded quantum gate on a silicon photonic chip

    Authors: Lan-Tian Feng, Ming Zhang, Xiao Xiong, Di Liu, Yu-Jie Cheng, Fang-Ming Jing, Xiao-Zhuo Qi, Yang Chen, De-Yong He, Guo-Ping Guo, Guang-Can Guo, Dao-Xin Dai, Xi-Feng Ren

    Abstract: As an important degree of freedom (DoF) in integrated photonic circuits, the orthogonal transverse mode provides a promising and flexible way to increasing communication capability, for both classical and quantum information processing. To construct large-scale on-chip multimode multi-DoF quantum systems, a transverse mode-encoded controlled-NOT (CNOT) gate is necessary. Here, through design and i… ▽ More

    Submitted 7 November, 2021; originally announced November 2021.

  36. arXiv:2111.01353  [pdf, other

    cs.CV cs.LG

    Can Vision Transformers Perform Convolution?

    Authors: Shanda Li, Xiangning Chen, Di He, Cho-Jui Hsieh

    Abstract: Several recent studies have demonstrated that attention-based networks, such as Vision Transformer (ViT), can outperform Convolutional Neural Networks (CNNs) on several computer vision tasks without using convolutional layers. This naturally leads to the following questions: Can a self-attention layer of ViT express any convolution operation? In this work, we prove that a single ViT layer with ima… ▽ More

    Submitted 2 November, 2021; v1 submitted 1 November, 2021; originally announced November 2021.

  37. arXiv:2110.06850  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Boosting the Certified Robustness of L-infinity Distance Nets

    Authors: Bohang Zhang, Du Jiang, Di He, Liwei Wang

    Abstract: Recently, Zhang et al. (2021) developed a new neural network architecture based on $\ell_\infty$-distance functions, which naturally possesses certified $\ell_\infty$ robustness by its construction. Despite the novel design and theoretical foundation, so far the model only achieved comparable performance to conventional networks. In this paper, we make the following two contributions:… ▽ More

    Submitted 15 March, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Accepted for ICLR 2022; 21 pages

  38. arXiv:2109.14863  [pdf, other

    cs.CV eess.IV

    HLIC: Harmonizing Optimization Metrics in Learned Image Compression by Reinforcement Learning

    Authors: Baocheng Sun, Meng Gu, Dailan He, Tongda Xu, Yan Wang, Hongwei Qin

    Abstract: Learned image compression is making good progress in recent years. Peak signal-to-noise ratio (PSNR) and multi-scale structural similarity (MS-SSIM) are the two most popular evaluation metrics. As different metrics only reflect certain aspects of human perception, works in this field normally optimize two models using PSNR and MS-SSIM as loss function separately, which is suboptimal and makes it d… ▽ More

    Submitted 30 September, 2021; originally announced September 2021.

    Comments: working paper

  39. Quantum key distribution over scattering channel

    Authors: Qi-Hang Lu, Fang-Xiang Wang, Kun Huang, Xin Wu, Shuang Wang, De-Yong He, Zhen-Qiang Yin, Guang-Can Guo, Wei Chen, Zheng-Fu Han

    Abstract: Scattering of light by cloud, haze, and fog decreases the transmission efficiency of communication channels in quantum key distribution (QKD), reduces the system's practical security, and thus constrains the deployment of free-space QKD. Here, we employ the wavefront shaping technology to compensate distorted optical signals in high-loss scattering quantum channels and fulfill a polarization-encod… ▽ More

    Submitted 27 September, 2021; v1 submitted 25 September, 2021; originally announced September 2021.

    Comments: 8 Pages, 5 Figures and comments are welcome

    Journal ref: Physical Review Applied 17, 034045 (2022)

  40. Triangle mechanism in the decay process $J/ψ\to K^- K^+ a_1(1260)$

    Authors: Xuan Luo, Dazhuang He, Yiling Xie, Hao Sun

    Abstract: The role of triangle mechanism in the decay process $J/ψ\to K^- K^+ a_1(1260)$ is probed. In this mechanism, a close-up resonance with mass $1823$ MeV and width $122$ MeV decays into $K^* φ, K^* \to K π$ and then $K^* \bar{K}$ fuses into the $a_1(1260)$ resonance. We find that this mechanism leads to a triangle singularity around $M_{\rm inv}(K^- a_1(1260))\approx 1920$ MeV, where the axial-vector… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: 5figures, submit to prd

  41. arXiv:2109.08846  [pdf, other

    math.OC

    Decomposition approach for Stackelberg P-median problem with user preferences

    Authors: Qingyun Tian, Yun Hui Lin, Dongdong He

    Abstract: The P-median facility location problem with user preferences (PUP) studies an operator that locates P facilities to serve customers/users in a cost-efficient manner, upon anticipating customer preferences and choices. The problem can be visualized as a leader-follower game in which the operator is the leader that opens facilities, whereas the customer is the follower who observes the operator's lo… ▽ More

    Submitted 18 September, 2021; originally announced September 2021.

  42. arXiv:2109.04192  [pdf, ps, other

    eess.SP cs.IT

    Detection of Abrupt Change in Channel Covariance Matrix for Multi-Antenna Communication

    Authors: Runnan Liu, Liang Liu, Dazhi He, Wenjun Zhang, Erik G. Larsson

    Abstract: The knowledge of channel covariance matrices is of paramount importance to the estimation of instantaneous channels and the design of beamforming vectors in multi-antenna systems. In practice, an abrupt change in channel covariance matrices may occur due to the change in the environment and the user location. Although several works have proposed efficient algorithms to estimate the channel covaria… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: accepted by Globecom 2021

  43. arXiv:2109.01294  [pdf, other

    quant-ph cs.CR physics.optics

    Measurement-device-independent quantum key distribution for nonstandalone networks

    Authors: Guan-Jie Fan-Yuan, Feng-Yu Lu, Shuang Wang, Zhen-Qiang Yin, De-Yong He, Zheng Zhou, Jun Teng, Wei Chen, Guang-Can Guo, Zheng-Fu Han

    Abstract: Untrusted node networks initially implemented by measurement-device-independent quantum key distribution (MDI-QKD) protocol are a crucial step on the roadmap of the quantum Internet. Considering extensive QKD implementations of trusted node networks, a workable upgrading tactic of existing networks toward MDI networks needs to be explicit. Here, referring to the nonstandalone (NSA) network of 5G,… ▽ More

    Submitted 6 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

    Journal ref: Photon. Res. 9, 1881-1891 (2021)

  44. Unbalanced-basis-misalignment tolerant measurement-device-independent quantum key distribution

    Authors: Feng-Yu Lu, Ze-Hao Wang, Zhen-Qiang Yin, Shuang Wang, Rong Wang, Guan-Jie Fan-Yuan, Xiao-Juan Huang, De-Yong He, Wei Chen, Zheng Zhou, Guang-Can Guo, Zheng-Fu Han

    Abstract: Measurement-device-independent quantum key distribution (MDIQKD) is a revolutionary protocol since it is physically immune to all attacks on the detection side. However, the protocol still keeps the strict assumptions on the source side that the four BB84-states must be perfectly prepared to ensure security. Some protocols release part of the assumptions in the encoding system to keep the practica… ▽ More

    Submitted 3 August, 2022; v1 submitted 26 August, 2021; originally announced August 2021.

    Comments: 22 pages, 9 figures

    Journal ref: Optica 9, 886-893 (2022)

  45. arXiv:2108.03798  [pdf, other

    cs.CV

    Paint Transformer: Feed Forward Neural Painting with Stroke Prediction

    Authors: Songhua Liu, Tianwei Lin, Dongliang He, Fu Li, Ruifeng Deng, Xin Li, Errui Ding, Hao Wang

    Abstract: Neural painting refers to the procedure of producing a series of strokes for a given image and non-photo-realistically recreating it using neural networks. While reinforcement learning (RL) based agents can generate a stroke sequence step by step for this task, it is not easy to train a stable RL agent. On the other hand, stroke optimization methods search for a set of stroke parameters iterativel… ▽ More

    Submitted 11 August, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: Accepted by ICCV 2021 (oral). Codes will be released on https://github.com/wzmsltw/PaintTransformer

  46. arXiv:2108.03647  [pdf, other

    cs.CV

    AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer

    Authors: Songhua Liu, Tianwei Lin, Dongliang He, Fu Li, Meiling Wang, Xin Li, Zhengxing Sun, Qian Li, Errui Ding

    Abstract: Fast arbitrary neural style transfer has attracted widespread attention from academic, industrial and art communities due to its flexibility in enabling various applications. Existing solutions either attentively fuse deep style feature into deep content feature without considering feature distributions, or adaptively normalize deep content feature according to the style such that their global sta… ▽ More

    Submitted 11 August, 2021; v1 submitted 8 August, 2021; originally announced August 2021.

    Comments: Accepted by ICCV 2021. Codes will be released on https://github.com/wzmsltw/AdaAttN

  47. arXiv:2108.02927  [pdf, other

    cs.CV

    DOLG: Single-Stage Image Retrieval with Deep Orthogonal Fusion of Local and Global Features

    Authors: Min Yang, Dongliang He, Miao Fan, Baorong Shi, Xuetong Xue, Fu Li, Errui Ding, Jizhou Huang

    Abstract: Image Retrieval is a fundamental task of obtaining images similar to the query one from a database. A common image retrieval practice is to firstly retrieve candidate images via similarity search using global image features and then re-rank the candidates by leveraging their local features. Previous learning-based studies mainly focus on either global or local image representation learning to tack… ▽ More

    Submitted 11 August, 2021; v1 submitted 5 August, 2021; originally announced August 2021.

    Comments: ICCV2021

  48. TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding

    Authors: Dailan He, Yusheng Zhao, Junyu Luo, Tianrui Hui, Shaofei Huang, Aixi Zhang, Si Liu

    Abstract: Recently proposed fine-grained 3D visual grounding is an essential and challenging task, whose goal is to identify the 3D object referred by a natural language sentence from other distractive objects of the same category. Existing works usually adopt dynamic graph networks to indirectly model the intra/inter-modal interactions, making the model difficult to distinguish the referred object from dis… ▽ More

    Submitted 11 August, 2021; v1 submitted 5 August, 2021; originally announced August 2021.

    Comments: ACM MM2021

  49. arXiv:2107.06829  [pdf, other

    cs.RO

    FAST-LIO2: Fast Direct LiDAR-inertial Odometry

    Authors: Wei Xu, Yixi Cai, Dongjiao He, Jiarong Lin, Fu Zhang

    Abstract: This paper presents FAST-LIO2: a fast, robust, and versatile LiDAR-inertial odometry framework. Building on a highly efficient tightly-coupled iterated Kalman filter, FAST-LIO2 has two key novelties that allow fast, robust, and accurate LiDAR navigation (and mapping). The first one is directly registering raw points to the map (and subsequently update the map, i.e., mapping) without extracting fea… ▽ More

    Submitted 14 July, 2021; originally announced July 2021.

  50. arXiv:2107.05163  [pdf, ps, other

    q-fin.GN econ.GN

    Recursive Utility with Investment Gains and Losses: Existence, Uniqueness, and Convergence

    Authors: Jing Guo, Xue Dong He

    Abstract: We consider a generalization of the recursive utility model by adding a new component that represents utility of investment gains and losses. We also study the utility process in this generalized model with constant elasticity of intertemporal substitution and relative risk aversion degree, and with infinite time horizon. In a specific, finite-state Markovian setting, we prove that the utility pro… ▽ More

    Submitted 11 July, 2021; originally announced July 2021.