Showing 101–150 of 282 results for author: Geng, X

  1. arXiv:2212.10192  [pdf, other]

    cs.CL

    Adam: Dense Retrieval Distillation with Adaptive Dark Examples

    Authors: Chongyang Tao, Chang Liu, Tao Shen, Can Xu, Xiubo Geng, Binxing Jiao, Daxin Jiang

    Abstract: To improve the performance of the dual-encoder retriever, one effective approach is knowledge distillation from the cross-encoder ranker. Existing works construct the candidate passages following the supervised learning setting where a query is paired with a positive passage and a batch of negatives. However, through empirical observation, we find that even the hard negatives from advanced methods…

    Submitted 6 June, 2024; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: 13 pages, 3 figures

  2. arXiv:2212.04318  [pdf, other]

    cs.NI cs.LG

    Power Consumption Modeling of 5G Multi-Carrier Base Stations: A Machine Learning Approach

    Authors: Nicola Piovesan, David Lopez-Perez, Antonio De Domenico, Xinli Geng, Harvey Bao

    Abstract: The fifth generation of the Radio Access Network (RAN) has brought new services, technologies, and paradigms with the corresponding societal benefits. However, the energy consumption of 5G networks is today a concern. In recent years, the design of new methods for decreasing the RAN power consumption has attracted interest from both the research community and standardization bodies, and many energ…

    Submitted 8 December, 2022; originally announced December 2022.

  3. arXiv:2212.01611  [pdf, other]

    cs.CL

    CoP: Factual Inconsistency Detection by Controlling the Preference

    Authors: Shuaijie She, Xiang Geng, Shujian Huang, Jiajun Chen

    Abstract: Abstractive summarization is the process of generating a summary given a document as input. Although significant progress has been made, the factual inconsistency between the document and the generated summary still limits its practical applications. Previous work found that the probabilities assigned by the generation model reflect its preferences for the generated summary, including the preferen…

    Submitted 30 March, 2023; v1 submitted 3 December, 2022; originally announced December 2022.

    Comments: Accepted to AAAI2023 regular paper

  4. arXiv:2211.15144  [pdf, other]

    cs.LG

    Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes

    Authors: Aviral Kumar, Rishabh Agarwal, Xinyang Geng, George Tucker, Sergey Levine

    Abstract: The potential of offline reinforcement learning (RL) is that high-capacity models trained on large, heterogeneous datasets can lead to agents that generalize broadly, analogously to similar advances in vision and NLP. However, recent works argue that offline RL methods encounter unique challenges to scaling up model capacity. Drawing on the learnings from these works, we re-examine previous design…

    Submitted 17 April, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted at ICLR 2023. Project website: https://sites.google.com/view/scaling-offlinerl/home

  5. arXiv:2211.07477  [pdf, other]

    hep-ex hep-ph physics.ins-det

    Search for boosted keV-MeV light dark matter particles from evaporating primordial black holes at the CDEX-10 experiment

    Authors: Z. H. Zhang, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, X. Y. Guo, L. He, S. M. He, J. W. Hu, H. X. Huang, T. C. Huang, H. T. Jia, X. Jiang, S. Karmakar, et al. (59 additional authors not shown)

    Abstract: We present novel constraints on boosted light dark matter particles (denoted as ``$χ$'') from evaporating primordial black holes (PBHs) using 205.4 kg$\cdot$day data from the China Jinping Underground Laboratory's CDEX-10 p-type point contact germanium detector with a 160 eVee analysis threshold. $χ$ from PBHs with masses ranging from 1$\times$10$^{15}$ g to 7$\times$10$^{16}$ g are searched in th…

    Submitted 7 September, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: 8 pages, 6 figures. Version updated to match PRD version

    Journal ref: Phys. Rev. D 109, 052006 (2023)

  6. arXiv:2211.02404  [pdf, other]

    cs.CV

    Tensor Robust PCA with Nonconvex and Nonlocal Regularization

    Authors: Xiaoyu Geng, Qiang Guo, Shuaixiong Hui, Ming Yang, Caiming Zhang

    Abstract: Tensor robust principal component analysis (TRPCA) is a classical way for low-rank tensor recovery, which minimizes the convex surrogate of tensor rank by shrinking each tensor singular value equally. However, for real-world visual data, large singular values represent more significant information than small singular values. In this paper, we propose a nonconvex TRPCA (N-TRPCA) model based on the…

    Submitted 7 July, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: 15 pages, 7 figures. Submitted to CVIU

  7. arXiv:2210.13432  [pdf, other]

    cs.CL

    Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models

    Authors: Hao Liu, Xinyang Geng, Lisa Lee, Igor Mordatch, Sergey Levine, Sharan Narang, Pieter Abbeel

    Abstract: Large language models (LLM) trained using the next-token-prediction objective, such as GPT3 and PaLM, have revolutionized natural language processing in recent years by showing impressive zero-shot and few-shot capabilities across a wide range of tasks. In this work, we propose a simple technique that significantly boosts the performance of LLMs without adding computational cost. Our key observati…

    Submitted 31 January, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Added T-FCM and better FCM results

  8. arXiv:2210.05955  [pdf, other]

    stat.ML cs.LG

    Identifiability and Asymptotics in Learning Homogeneous Linear ODE Systems from Discrete Observations

    Authors: Yuanyuan Wang, Wei Huang, Mingming Gong, Xi Geng, Tongliang Liu, Kun Zhang, Dacheng Tao

    Abstract: Ordinary Differential Equations (ODEs) have recently gained a lot of attention in machine learning. However, the theoretical aspects, e.g., identifiability and asymptotic properties of statistical estimation are still obscure. This paper derives a sufficient condition for the identifiability of homogeneous linear ODE systems from a sequence of equally-spaced error-free observations sampled from a…

    Submitted 2 June, 2024; v1 submitted 12 October, 2022; originally announced October 2022.

    Journal ref: Journal of Machine Learning Research 25 (2024) 1-50

  9. arXiv:2210.01604  [pdf, other]

    hep-ex hep-ph physics.ins-det

    Search for exotic interactions of solar neutrinos in the CDEX-10 experiment

    Authors: X. P. Geng, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, H. Gong, Q. J. Guo, X. Y. Guo, L. He, S. M. He, J. W. Hu, H. X. Huang, T. C. Huang, H. T. Jia, X. Jiang, S. Karmakar, H. B. Li, et al. (60 additional authors not shown)

    Abstract: We investigate exotic neutrino interactions using the 205.4 kg$\cdot$day dataset from the CDEX-10 experiment at the China Jinping Underground Laboratory. New constraints on the mass and couplings of new gauge bosons are presented. Two nonstandard neutrino interactions are considered: a $U(1)_{B-L}$ gauge-boson-induced interaction between an active neutrino and electron/nucleus, and a dark-photon-i…

    Submitted 2 June, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 6 pages, 4 figures. Version updated to match PRD version

    Journal ref: Phys. Rev. D 107, 112002 (2023)

  10. arXiv:2209.13947  [pdf, ps, other]

    nucl-ex physics.plasm-ph

    $^{197}$Au($γ,\,xn;\,x\,=\,1\thicksim9$) Reaction Cross Section Measurements using Laser-Driven Ultra-Intense $γ$-Ray Source

    Authors: D. Wu, H. Y. Lan, J. Y. Zhang, J. X. Liu, H. G. Lu, J. F. Lv, X. Z. Wu, H. Zhang, J. Cai, Q. Y. Ma, Y. H. Xia, Z. N. Wang, M. Z. Wang, Z. Y. Yang, X. L. Xu, Y. X. Geng, Y. Y. Zhao, C. Lin, W. J. Ma, J. Q. Yu, H. R. Wang, F. L. Liu, C. Y. He, B. Guo, P. Zhu, et al. (4 additional authors not shown)

    Abstract: We present a new method for the measurements of photonuclear reaction flux-weighted average cross sections and isomeric ratios using a laser-driven bremsstrahlung $γ$-ray source. An ultra-bright ultra-fast 60$\,\thicksim\,$250 MeV bremsstrahlung $γ$-ray source was established using the 200 TW laser facility in the Compact Laser Plasma Accelerator Laboratory, Peking University, which could cover th…

    Submitted 23 November, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

  11. Machine Learning and Analytical Power Consumption Models for 5G Base Stations

    Authors: Nicola Piovesan, David Lopez-Perez, Antonio De Domenico, Xinli Geng, Harvey Bao, Merouane Debbah

    Abstract: The energy consumption of the fifth generation (5G) of mobile networks is one of the major concerns of the telecom industry. However, there is not currently an accurate and tractable approach to evaluate 5G base stations (BSs) power consumption. In this article, we propose a novel model for a realistic characterisation of the power consumption of 5G multi-carrier BSs, which builds on a large data c…

    Submitted 23 September, 2022; originally announced September 2022.

    Comments: Accepted by IEEE Communications Magazine

  12. Exotic Dark Matter Search with CDEX-10 Experiment at China's Jinping Underground Laboratory

    Authors: W. H. Dai, L. P. Jia, H. Ma, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, X. Y. Guo, L. He, S. M. He, J. W. Hu, H. X. Huang, T. C. Huang, H. T. Jia, X. Jiang, S. Karmakar, et al. (59 additional authors not shown)

    Abstract: A search for exotic dark matter (DM) in the sub-GeV mass range has been conducted using 205 kg$\cdot$day data taken from a p-type point contact germanium detector of CDEX-10 experiment at China Jinping underground laboratory. New low-mass dark matter searching channels, neutral current fermionic DM absorption ($χ+A\rightarrow ν+A$) and DM-nucleus 3$\rightarrow$2 scattering ($χ+χ+A\rightarrow φ+A$)…

    Submitted 23 November, 2022; v1 submitted 2 September, 2022; originally announced September 2022.

    Comments: 5 pages, 7 figures

    Journal ref: Phys. Rev. Lett. 129, 221802, 2022

  13. arXiv:2208.14754  [pdf, other]

    cs.IR

    LexMAE: Lexicon-Bottlenecked Pretraining for Large-Scale Retrieval

    Authors: Tao Shen, Xiubo Geng, Chongyang Tao, Can Xu, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang

    Abstract: In large-scale retrieval, the lexicon-weighting paradigm, learning weighted sparse representations in vocabulary space, has shown promising results with high quality and low latency. Despite it deeply exploiting the lexicon-representing capability of pre-trained language models, a crucial gap remains between language modeling and lexicon-weighting retrieval -- the former preferring certain or low-…

    Submitted 4 June, 2023; v1 submitted 31 August, 2022; originally announced August 2022.

    Comments: Appeared at ICLR 2023

  14. arXiv:2208.13661  [pdf, other]

    cs.CL

    LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval

    Authors: Kai Zhang, Chongyang Tao, Tao Shen, Can Xu, Xiubo Geng, Binxing Jiao, Daxin Jiang

    Abstract: Retrieval models based on dense representations in semantic space have become an indispensable branch for first-stage retrieval. These retrievers benefit from surging advances in representation learning towards compressive global sequence-level embeddings. However, they are prone to overlook local salient phrases and entity mentions in texts, which usually play pivot roles in first-stage retrieval…

    Submitted 2 March, 2023; v1 submitted 29 August, 2022; originally announced August 2022.

    Comments: 14 pages, 6 tables, 4 figures. WWW 2023

  15. arXiv:2208.05853  [pdf, other]

    cs.CV

    MultiMatch: Multi-task Learning for Semi-supervised Domain Generalization

    Authors: Lei Qi, Hongpeng Yang, Yinghuan Shi, Xin Geng

    Abstract: Domain generalization (DG) aims at learning a model on source domains to well generalize on the unseen target domain. Although it has achieved great success, most of existing methods require the label information for all training samples in source domains, which is time-consuming and expensive in the real-world application. In this paper, we resort to solving the semi-supervised domain generalizat…

    Submitted 29 April, 2024; v1 submitted 11 August, 2022; originally announced August 2022.

    Comments: Accepted by ACM TOMM

  16. arXiv:2208.05617  [pdf, other]

    cs.CV

    Language-Guided Face Animation by Recurrent StyleGAN-based Generator

    Authors: Tiankai Hang, Huan Yang, Bei Liu, Jianlong Fu, Xin Geng, Baining Guo

    Abstract: Recent works on language-guided image manipulation have shown great power of language in providing rich semantics, especially for face images. However, the other natural information, motions, in language is less explored. In this paper, we leverage the motion information and study a novel task, language-guided face animation, that aims to animate a static face image with the help of languages. To…

    Submitted 3 July, 2024; v1 submitted 10 August, 2022; originally announced August 2022.

  17. arXiv:2206.12161  [pdf, ps, other]

    math.PR math.CA

    On the Lack of Gaussian Tail for Rough Line Integrals along Fractional Brownian Paths

    Authors: Horatio Boedihardjo, Xi Geng

    Abstract: We show that the tail probability of the rough line integral $\int_{0}^{1}φ(X_{t})dY_{t}$, where $(X,Y)$ is a 2D fractional Brownian motion with Hurst parameter $H\in(1/4,1/2)$ and $φ$ is a $C_{b}^{\infty}$-function satisfying a mild non-degeneracy condition on its derivative, cannot decay faster than a $γ$-Weibull tail with any exponent $γ>2H+1$. In particular, this produces a simple class of exa…

    Submitted 3 November, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: 30 pages

  18. arXiv:2206.10265  [pdf, other]

    cs.CL

    KnowDA: All-in-One Knowledge Mixture Model for Data Augmentation in Low-Resource NLP

    Authors: Yufei Wang, Jiayi Zheng, Can Xu, Xiubo Geng, Tao Shen, Chongyang Tao, Daxin Jiang

    Abstract: This paper focuses on the data augmentation for low-resource NLP tasks where the training set is limited. The existing solutions either leverage task-independent heuristic rules (e.g., Synonym Replacement) or fine-tune general-purpose pre-trained language models (e.g., GPT2) using the limited training instances to produce new synthetic data. Consequently, they have trivial task-specific knowledge…

    Submitted 27 January, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: Accepted by ICLR 2023 main track at https://openreview.net/forum?id=2nocgE1m0A

  19. arXiv:2206.08063  [pdf, other]

    cs.IR cs.CL

    Towards Robust Ranker for Text Retrieval

    Authors: Yucheng Zhou, Tao Shen, Xiubo Geng, Chongyang Tao, Can Xu, Guodong Long, Binxing Jiao, Daxin Jiang

    Abstract: A ranker plays an indispensable role in the de facto 'retrieval & rerank' pipeline, but its training still lags behind -- learning from moderate negatives or/and serving as an auxiliary module for a retriever. In this work, we first identify two major barriers to a robust ranker, i.e., inherent label noises caused by a well-trained retriever and non-ideal negatives sampled for a high-capable ranke…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: 11 pages of main content, 4 tables, 3 figures

  20. arXiv:2206.04128  [pdf, other]

    hep-ex hep-ph physics.ins-det

    Constraints on Sub-GeV Dark Matter--Electron Scattering from the CDEX-10 Experiment

    Authors: Z. Y. Zhang, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, M. Agartioglu, H. P. An, J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, X. Y. Guo, L. He, S. M. He, J. W. Hu, H. X. Huang, T. C. Huang, H. T. Jia, X. Jiang, H. B. Li, et al. (60 additional authors not shown)

    Abstract: We present improved germanium-based constraints on sub-GeV dark matter via dark matter--electron ($χ$-$e$) scattering using the 205.4 kg$\cdot$day dataset from the CDEX-10 experiment. Using a novel calculation technique, we attain predicted $χ$-$e$ scattering spectra observable in high-purity germanium detectors. In the heavy mediator scenario, our results achieve 3 orders of magnitude of improvem…

    Submitted 21 November, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: 6 pages, 3 figures. Version updated to match PRL version

    Journal ref: Phys. Rev. Lett. 129, 221301 (2022)

  21. arXiv:2206.02359  [pdf, ps, other]

    math.NA

    An inverse random source problem for the Helium production-diffusion equation driven by a fractional Brownian motion

    Authors: Jing Li, Hao Cheng, Xiaoxiao Geng

    Abstract: In this paper, we consider the prediction of the helium concentrations as function of a spatially variable source term perturbed by fractional Brownian motion. For the direct problem, we show that it is well-posed and has a unique mild solution under some conditions. For the inverse problem, the uniqueness and the instability are given. In the meanwhile, we determine the statistical properties of…

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: text overlap with arXiv:2101.04744 by other authors

  22. arXiv:2206.00830  [pdf, other]

    cs.LG

    Progressive Purification for Instance-Dependent Partial Label Learning

    Authors: Ning Xu, Biao Liu, Jiaqi Lv, Congyu Qiao, Xin Geng

    Abstract: Partial label learning (PLL) aims to train multiclass classifiers from the examples each annotated with a set of candidate labels where a fixed but unknown candidate label is correct. In the last few years, the instance-independent generation process of candidate labels has been extensively studied, on the basis of which many theoretical advances have been made in PLL. Nevertheless, the candidate…

    Submitted 9 May, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: Accepted to International Conference on Machine Learning 2023 (ICML 2023)

  23. arXiv:2206.00517  [pdf, other]

    cs.LG

    One Positive Label is Sufficient: Single-Positive Multi-Label Learning with Label Enhancement

    Authors: Ning Xu, Congyu Qiao, Jiaqi Lv, Xin Geng, Min-Ling Zhang

    Abstract: Multi-label learning (MLL) learns from the examples each associated with multiple labels simultaneously, where the high cost of annotating all relevant labels for each training example is challenging for real-world applications. To cope with the challenge, we investigate single-positive multi-label learning (SPMLL) where each example is annotated with only one relevant label, and show that one can…

    Submitted 11 October, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: Accepted to NeurIPS 2022

  24. arXiv:2205.14204  [pdf, other]

    cs.CV

    Multimodal Masked Autoencoders Learn Transferable Representations

    Authors: Xinyang Geng, Hao Liu, Lisa Lee, Dale Schuurmans, Sergey Levine, Pieter Abbeel

    Abstract: Building scalable models to learn from diverse, multimodal data remains an open challenge. For vision-language data, the dominant approaches are based on contrastive learning objectives that train a separate encoder for each modality. While effective, contrastive learning approaches introduce sampling bias depending on the data augmentations used, which can degrade performance on downstream tasks.…

    Submitted 21 October, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

  25. arXiv:2205.11194  [pdf, other]

    cs.IR cs.CL

    UnifieR: A Unified Retriever for Large-Scale Retrieval

    Authors: Tao Shen, Xiubo Geng, Chongyang Tao, Can Xu, Guodong Long, Kai Zhang, Daxin Jiang

    Abstract: Large-scale retrieval is to recall relevant documents from a huge collection given a query. It relies on representation learning to embed documents and queries into a common semantic encoding space. According to the encoding space, recent retrieval methods based on pre-trained language models (PLM) can be coarsely categorized into either dense-vector or lexicon-based paradigms. These two paradigms…

    Submitted 4 June, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: To appear at KDD ADS 2023

  26. arXiv:2205.10718  [pdf, other]

    nucl-ex hep-ex physics.ins-det

    Search for Neutrinoless Double-Beta Decay of $^{76}$Ge with a Natural Broad Energy Germanium Detector

    Authors: CDEX collaboration, W. H. Dai, H. Ma, Q. Yue, Z. She, K. J. Kang, Y. J. Li, M. Agartioglu, H. P. An, J. P. Chang, Y. H. Chen, J. P. Cheng, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, X. Y. Guo, L. He, S. M. He, J. W. Hu, H. X. Huang, T. C. Huang, H. T. Jia, X. Jiang, et al. (61 additional authors not shown)

    Abstract: A natural broad energy germanium (BEGe) detector is operated in the China Jinping Underground Laboratory (CJPL) for a feasibility study of building the next generation experiment of the neutrinoless double-beta (0{$νββ$}) decay of $^{76}$Ge. The setup of the prototype facility, characteristics of the BEGe detector, background reduction methods, and data analysis are described in this paper. A back…

    Submitted 5 August, 2022; v1 submitted 21 May, 2022; originally announced May 2022.

    Comments: 10 pages, 15 figures

    Journal ref: Physical Review D 106, 032012 (2022)

  27. arXiv:2205.01620  [pdf, other]

    cs.CL

    Unifying the Convergences in Multilingual Neural Machine Translation

    Authors: Yichong Huang, Xiaocheng Feng, Xinwei Geng, Bing Qin

    Abstract: Although all-in-one-model multilingual neural machine translation (multilingual NMT) has achieved remarkable progress, the convergence inconsistency in the joint training is ignored, i.e., different language pairs reaching convergence in different epochs. This leads to the trained MNMT model over-fitting low-resource language translations while under-fitting high-resource ones. In this paper, we p…

    Submitted 19 October, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

    Comments: EMNLP2022

  28. arXiv:2204.05903  [pdf, other]

    cs.CV

    Label Distribution Learning for Generalizable Multi-source Person Re-identification

    Authors: Lei Qi, Jiaying Shen, Jiaqi Liu, Yinghuan Shi, Xin Geng

    Abstract: Person re-identification (Re-ID) is a critical technique in the video surveillance system, which has achieved significant success in the supervised setting. However, it is difficult to directly apply the supervised model to arbitrary unseen domains due to the domain gap between the available source domains and unseen target domains. In this paper, we propose a novel label distribution learning (LD…

    Submitted 24 August, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

    Comments: Accepted by IEEE Transactions on Information Forensics and Security (TIFS). arXiv admin note: text overlap with arXiv:2201.09846

  29. arXiv:2204.05610  [pdf, other]

    cs.CL cs.AI cs.LG

    Stylized Knowledge-Grounded Dialogue Generation via Disentangled Template Rewriting

    Authors: Qingfeng Sun, Can Xu, Huang Hu, Yujing Wang, Jian Miao, Xiubo Geng, Yining Chen, Fei Xu, Daxin Jiang

    Abstract: Current Knowledge-Grounded Dialogue Generation (KDG) models specialize in producing rational and factual responses. However, to establish long-term relationships with users, the KDG model needs the capability to generate responses in a desired style or attribute. Thus, we study a new problem: Stylized Knowledge-Grounded Dialogue Generation (SKDG). It presents two challenges: (1) How to train a SKD…

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: Accepted to NAACL 2022 Main Conference

  30. arXiv:2204.05104  [pdf, other]

    cs.LG

    Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation

    Authors: Jin Yuan, Feng Hou, Yangzhou Du, Zhongchao Shi, Xin Geng, Jianping Fan, Yong Rui

    Abstract: Domain adaptation (DA) tries to tackle the scenarios when the test data does not fully follow the same distribution of the training data, and multi-source domain adaptation (MSDA) is very attractive for real world applications. By learning from large-scale unlabeled samples, self-supervised learning has now become a new trend in deep learning. It is worth noting that both self-supervised learning…

    Submitted 15 January, 2024; v1 submitted 7 April, 2022; originally announced April 2022.

  31. arXiv:2204.03845  [pdf, other]

    cs.LG

    Decompositional Generation Process for Instance-Dependent Partial Label Learning

    Authors: Congyu Qiao, Ning Xu, Xin Geng

    Abstract: Partial label learning (PLL) is a typical weakly supervised learning problem, where each training example is associated with a set of candidate labels among which only one is true. Most existing PLL approaches assume that the incorrect labels in each training example are randomly picked as the candidate labels and model the generation process of the candidate labels in a simple way. However, these…

    Submitted 1 February, 2023; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: ICLR 2023 Spotlight

  32. arXiv:2204.00753  [pdf, ps, other]

    math.OC

    Achieving Social Optimum in Non-convex Cooperative Aggregative Games: A Distributed Stochastic Annealing Approach

    Authors: Yinghui Wang, Xiaoxue Geng, Guanpu Chen, Wenxiao Zhao

    Abstract: This paper designs a distributed stochastic annealing algorithm for non-convex cooperative aggregative games, whose agents' cost functions not only depend on agents' own decision variables but also rely on the sum of agents' decision variables. To seek the social optimum of cooperative aggregative games, a distributed stochastic annealing algorithm is proposed, where the local cost functions a…

    Submitted 1 April, 2022; originally announced April 2022.

  33. arXiv:2203.16896  [pdf, other]

    cs.CV

    CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow

    Authors: Xiuchao Sui, Shaohua Li, Xue Geng, Yan Wu, Xinxing Xu, Yong Liu, Rick Goh, Hongyuan Zhu

    Abstract: Optical flow estimation aims to find the 2D motion field by identifying corresponding pixels between two images. Despite the tremendous progress of deep learning-based optical flow methods, it remains a challenge to accurately estimate large displacements with motion blur. This is mainly because the correlation volume, the basis of pixel matching, is computed as the dot product of the convolutiona…

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: CVPR 2022 camera ready

  34. arXiv:2203.11089  [pdf, other]

    cs.CV

    PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark

    Authors: Li Chen, Chonghao Sima, Yang Li, Zehan Zheng, Jiajie Xu, Xiangwei Geng, Hongyang Li, Conghui He, Jianping Shi, Yu Qiao, Junchi Yan

    Abstract: Methods for 3D lane detection have been recently proposed to address the issue of inaccurate lane layouts in many autonomous driving scenarios (uphill/downhill, bump, etc.). Previous work struggled in complex cases due to their simple designs of the spatial transformation between front view and bird's eye view (BEV) and the lack of a realistic dataset. Towards these issues, we present PersFormer:…

    Submitted 19 July, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted by ECCV 2022 (Oral). Project page: https://github.com/OpenPerceptionX/PersFormer_3DLane | OpenLane dataset: https://github.com/OpenPerceptionX/OpenLane

  35. arXiv:2203.08517  [pdf, other]

    cs.CL cs.AI

    TegTok: Augmenting Text Generation via Task-specific and Open-world Knowledge

    Authors: Chao-Hong Tan, Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling, Can Xu, Huang Hu, Xiubo Geng, Daxin Jiang

    Abstract: Generating natural and informative texts has been a long-standing problem in NLP. Much effort has been dedicated into incorporating pre-trained language models (PLMs) with various open-world knowledge, such as knowledge graphs or wiki pages. However, their ability to access and manipulate the task-specific knowledge is still limited on downstream tasks, as this type of knowledge is usually not wel…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted by Findings of ACL 2022

  36. arXiv:2203.08500  [pdf, other]

    cs.CL

    HeterMPC: A Heterogeneous Graph Neural Network for Response Generation in Multi-Party Conversations

    Authors: Jia-Chen Gu, Chao-Hong Tan, Chongyang Tao, Zhen-Hua Ling, Huang Hu, Xiubo Geng, Daxin Jiang

    Abstract: Recently, various response generation models for two-party conversations have achieved impressive improvements, but less effort has been paid to multi-party conversations (MPCs) which are more practical and complicated. Compared with a two-party conversation where a dialogue context is a sequence of utterances, building a response generation model for MPCs is more challenging, since there exist co…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted by ACL 2022

  37. arXiv:2203.04049  [pdf, other]

    cs.CV

    Graph Attention Transformer Network for Multi-Label Image Classification

    Authors: Jin Yuan, Shikai Chen, Yao Zhang, Zhongchao Shi, Xin Geng, Jianping Fan, Yong Rui

    Abstract: Multi-label classification aims to recognize multiple objects or attributes from images. However, it is challenging to learn from proper label graphs to effectively characterize such inter-label correlations or dependencies. Current methods often use the co-occurrence probability of labels based on the training set as the adjacency matrix to model this correlation, which is greatly limited by the…

    Submitted 15 January, 2024; v1 submitted 8 March, 2022; originally announced March 2022.

  38. arXiv:2203.02225  [pdf, other]

    cs.CL

    ClarET: Pre-training a Correlation-Aware Context-To-Event Transformer for Event-Centric Generation and Classification

    Authors: Yucheng Zhou, Tao Shen, Xiubo Geng, Guodong Long, Daxin Jiang

    Abstract: Generating new events given context with correlated ones plays a crucial role in many event-centric reasoning tasks. Existing works either limit their scope to specific scenarios or overlook event-level correlations. In this paper, we propose to pre-train a general Correlation-aware context-to-Event Transformer (ClarET) for event-centric reasoning. To achieve this, we propose three novel event-cen…

    Submitted 9 March, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: ACL 2022 camera-ready version

  39. Dual-Branched Spatio-temporal Fusion Network for Multi-horizon Tropical Cyclone Track Forecast

    Authors: Zili Liu, Kun Hao, Xiaoyi Geng, Zhenwei Shi

    Abstract: Tropical cyclone (TC) is an extreme tropical weather system and its trajectory can be described by a variety of spatio-temporal data. Effective mining of these data is the key to accurate TCs track forecasting. However, existing methods face the problem that the model complexity is too high or it is difficult to efficiently extract features from multi-modal data. In this paper, we propose the Dual…

    Submitted 27 February, 2022; originally announced February 2022.

  40. arXiv:2202.12499  [pdf, other]

    cs.CL

    PromDA: Prompt-based Data Augmentation for Low-Resource NLU Tasks

    Authors: Yufei Wang, Can Xu, Qingfeng Sun, Huang Hu, Chongyang Tao, Xiubo Geng, Daxin Jiang

    Abstract: This paper focuses on the Data Augmentation for low-resource Natural Language Understanding (NLU) tasks. We propose Prompt-based Data Augmentation model (PromDA) which only trains small-scale Soft Prompt (i.e., a set of trainable vectors) in the frozen Pre-trained Language Models (PLMs). This avoids human effort in collecting unlabeled in-domain data and maintains the quality of generated synthet… ▽ More

    Submitted 17 March, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

    Comments: Accepted to ACL 2022 Main Conference, Camera-Ready Version

  41. arXiv:2202.08450  [pdf, other]

    cs.LG

    Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization

    Authors: Brandon Trabucco, Xinyang Geng, Aviral Kumar, Sergey Levine

    Abstract: Black-box model-based optimization (MBO) problems, where the goal is to find a design input that maximizes an unknown objective function, are ubiquitous in a wide range of domains, such as the design of proteins, DNA sequences, aircraft, and robots. Solving model-based optimization problems typically requires actively querying the unknown objective function on design proposals, which means physica… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  42. arXiv:2202.07225  [pdf]

    physics.comp-ph physics.plasm-ph

    Improving the accuracy of hard photon emission by Sigmoid sampling of the QED-table in particle-in-cell-Monte-Carlo simulations

    Authors: Yinlong Guo, Xuesong Geng, Liangliang Ji, Baifei Shen, Ruxin Li

    Abstract: Research on laser-plasma interaction in the quantum-electrodynamic (QED) regime has been greatly advanced by particle-in-cell & Monte-Carlo simulations (PIC-MC). While these simulations are widely used, we find that noticeable numerical error arises due to inappropriate implementation of the quantum process accounting for hard photon emission and pair production in the PIC-MC codes. The error stem… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: 21 pages, 9 figures

  43. arXiv:2202.07222  [pdf]

    physics.optics physics.acc-ph physics.plasm-ph

    Quasi-monochromatic bright gamma-ray generation from synchronized Compton scattering via azimuthal spatial-temporal coupling

    Authors: Xuesong Geng, Liangliang Ji, Baifei Shen

    Abstract: High energy photons can be generated via inverse Compton scattering (ICS) in the collision between energetic electrons and intense laser pulse. The development of laser plasma accelerators promises compact and all-optical gamma-ray sources by colliding the electrons from laser wakefield accelerators to its high-power driving pulse reflected by a plasma mirror. However, the law of optical focusing… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: 14 pages, 7 figures

  44. arXiv:2202.02858  [pdf, ps, other]

    math.PR

    Non-degeneracy of Stochastic Line Integrals

    Authors: Xi Geng, Sheng Wang

    Abstract: We derive quantitative criteria for the existence of density for stochastic line integrals and iterated line integrals along solutions of hypoelliptic differential equations driven by fractional Brownian motion. As an application, we also study the signature uniqueness problem for these rough differential equations.

    Submitted 6 February, 2022; originally announced February 2022.

    Comments: 37 pages

  45. Effective Boundary Conditions Arising from the Heat Equation with Three-dimensional Interior Inclusion

    Authors: Xingri Geng

    Abstract: We study the initial boundary value problem for a heat equation in a domain containing a thin layer. The thermal conductivity of the layer is drastically different from that of the bulk of the domain; moreover, the layer is anisotropic and "optimally aligned" in the sense that the normal direction in the layer is always an eigenvector of the thermal tensor. To reveal the effects of the layer, we… ▽ More

    Submitted 24 January, 2023; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: 25 pages, 2 figures, some typos are corrected

    MSC Class: 35K05; 35B40; 35B45; 74K35

    Journal ref: Commun. Pure Appl. Anal. 22 (2023)

  46. arXiv:2201.12093  [pdf, other]

    cs.CL

    PCL: Peer-Contrastive Learning with Diverse Augmentations for Unsupervised Sentence Embeddings

    Authors: Qiyu Wu, Chongyang Tao, Tao Shen, Can Xu, Xiubo Geng, Daxin Jiang

    Abstract: Learning sentence embeddings in an unsupervised manner is fundamental in natural language processing. Recent common practice is to couple pre-trained language models with unsupervised contrastive learning, whose success relies on augmenting a sentence with a semantically-close positive instance to construct contrastive pairs. Nonetheless, existing approaches usually depend on a mono-augmenting str… ▽ More

    Submitted 19 October, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: To appear at EMNLP 2022

  47. arXiv:2201.09846  [pdf, other]

    cs.CV

    A Novel Mix-normalization Method for Generalizable Multi-source Person Re-identification

    Authors: Lei Qi, Lei Wang, Yinghuan Shi, Xin Geng

    Abstract: Person re-identification (Re-ID) has achieved great success in the supervised scenario. However, it is difficult to directly transfer the supervised model to arbitrary unseen domains due to the model overfitting to the seen source domains. In this paper, we aim to tackle the generalizable multi-source person Re-ID task (i.e., there are multiple available source domains, and the testing domain is u… ▽ More

    Submitted 12 June, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

    Comments: Accepted by IEEE Transactions on Multimedia (TMM)

  48. arXiv:2201.05730  [pdf, other]

    cs.CV

    Learning Hierarchical Graph Representation for Image Manipulation Detection

    Authors: Wenyan Pan, Zhili Zhou, Miaogen Ling, Xin Geng, Q. M. Jonathan Wu

    Abstract: The objective of image manipulation detection is to identify and locate the manipulated regions in the images. Recent approaches mostly adopt the sophisticated Convolutional Neural Networks (CNNs) to capture the tampering artifacts left in the images to locate the manipulated regions. However, these approaches ignore the feature correlations, i.e., feature inconsistencies, between manipulated regi… ▽ More

    Submitted 14 January, 2022; originally announced January 2022.

  49. Ultrahigh-energy Gamma-Ray Radiation from the Crab Pulsar Wind Nebula

    Authors: Lin Nie, Yang Liu, Zejun Jiang, Xiongfei Geng

    Abstract: It has been long debated whether the high-energy gamma-ray radiation from the Crab nebula stems from leptonic or hadronic processes. In this work, we investigate the multi-band non-thermal radiation from the Crab pulsar wind nebula with the leptonic and leptonic-hadronic hybrid models, respectively. Then we use the Markov Chain Monte Carlo (MCMC) sampling technology and method of sampling trace to… ▽ More

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: Accepted for publication in ApJ

    Journal ref: 2022, ApJ, 924, 42

  50. arXiv:2201.01704  [pdf, other]

    hep-ex hep-ph physics.ins-det

    Constraints on sub-GeV dark matter boosted by cosmic rays from the CDEX-10 experiment at the China Jinping Underground Laboratory

    Authors: R. Xu, L. T. Yang, Q. Yue, K. J. Kang, Y. J. Li, M. Agartioglu, H. P. An, J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, X. Y. Guo, Q. J. Guo, L. He, S. M. He, J. W. Hu, H. X. Huang, T. C. Huang, H. T. Jia, X. Jiang, H. B. Li, et al. (60 additional authors not shown)

    Abstract: We present new constraints on light dark matter boosted by cosmic rays (CRDM) using the 205.4 kg day data of the CDEX-10 experiment conducted at the China Jinping Underground Laboratory. The Monte Carlo simulation package CJPL_ESS was employed to evaluate the Earth shielding effect. Several key factors have been introduced and discussed in our CRDM analysis, including the contributions from heavi… ▽ More

    Submitted 16 September, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

    Comments: 9 pages, 7 figures. Version updated to match PRD version

    Journal ref: Phys. Rev. D 106, 052008 (2022)