
Showing 151–200 of 282 results for author: Geng, X

  1. arXiv:2112.04069  [pdf, ps, other]

    math.SP

    The Z-eigenpairs of orthogonally diagonalizable symmetric tensors

    Authors: Lei Wang, Xiurui Geng

    Abstract: In this paper, we focus on a special class of symmetric tensors, which can be orthogonally diagonalizable, and investigate their Z-eigenpairs problem. We show that the eigenpairs can be uniformly expressed using several basic eigenpairs, and the number of all the eigenpairs is uniquely determined by the order and rank of the symmetric tensor. In addition, we exploit the local optimality of each ei…

    Submitted 7 December, 2021; originally announced December 2021.

  2. arXiv:2111.15077  [pdf, other]

    cs.CV

    Unsupervised Domain Generalization for Person Re-identification: A Domain-specific Adaptive Framework

    Authors: Lei Qi, Jiaqi Liu, Lei Wang, Yinghuan Shi, Xin Geng

    Abstract: Domain generalization (DG) has attracted much attention in person re-identification (ReID) recently. It aims to make a model trained on multiple source domains generalize to an unseen target domain. Although achieving promising progress, existing methods usually need the source domains to be labeled, which could be a significant burden for practical ReID tasks. In this paper, we turn to investigat…

    Submitted 23 March, 2023; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: Accepted to Pattern Recognition (PR)

  3. arXiv:2111.11243  [pdf, other]

    hep-ex hep-ph physics.ins-det

    Studies of the Earth shielding effect to direct dark matter searches at the China Jinping Underground Laboratory

    Authors: Z. Z. Liu, L. T. Yang, Q. Yue, C. H. Yeh, K. J. Kang, Y. J. Li, M. Agartioglu, H. P. An, J. P. Chang, J. H. Chen, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, X. Y. Guo, Q. J. Guo, L. He, S. M. He, J. W. Hu, H. X. Huang, T. C. Huang, H. T. Jia , et al. (58 additional authors not shown)

    Abstract: Dark matter direct detection experiments mostly operate at deep underground laboratories. It is necessary to consider shielding effect of the Earth, especially for dark matter particles interacting with a large cross section. We analyzed and simulated the Earth shielding effect for dark matter at the China Jinping Underground Laboratory (CJPL) with a simulation package, CJPL Earth Shielding Simula…

    Submitted 9 March, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: 8 pages, 8 figures, 2 tables. Version updated to match PRD version

    Journal ref: Phys. Rev. D 105, 052005 (2022)

  4. arXiv:2111.11029  [pdf, other]

    cs.CV

    Auto-Encoding Score Distribution Regression for Action Quality Assessment

    Authors: Boyu Zhang, Jiayuan Chen, Yinfei Xu, Hui Zhang, Xu Yang, Xin Geng

    Abstract: The action quality assessment (AQA) of videos is a challenging vision task since the relation between videos and action scores is difficult to model. Thus, AQA has been widely studied in the literature. Traditionally, AQA is treated as a regression problem to learn the underlying mappings between videos and action scores. But previous methods ignored data uncertainty in AQA dataset. To address ale…

    Submitted 31 August, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

  5. arXiv:2110.12911  [pdf, other]

    cs.LG

    Instance-Dependent Partial Label Learning

    Authors: Ning Xu, Congyu Qiao, Xin Geng, Min-Ling Zhang

    Abstract: Partial label learning (PLL) is a typical weakly supervised learning problem, where each training example is associated with a set of candidate labels among which only one is true. Most existing PLL approaches assume that the incorrect labels in each training example are randomly picked as the candidate labels. However, this assumption is not realistic since the candidate labels are always instanc…

    Submitted 25 October, 2021; v1 submitted 25 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021 Spotlight

  6. arXiv:2110.08515  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG cs.MM

    Multimodal Dialogue Response Generation

    Authors: Qingfeng Sun, Yujing Wang, Can Xu, Kai Zheng, Yaming Yang, Huang Hu, Fei Xu, Jessica Zhang, Xiubo Geng, Daxin Jiang

    Abstract: Responsing with image has been recognized as an important capability for an intelligent conversational agent. Yet existing works only focus on exploring the multimodal dialogue models which depend on retrieval-based methods, but neglecting generation methods. To fill in the gaps, we first present a multimodal dialogue generation model, which takes the dialogue history as input, then generates a te…

    Submitted 29 March, 2022; v1 submitted 16 October, 2021; originally announced October 2021.

    Comments: Accepted to ACL 2022 Main Conference

  7. arXiv:2110.06533  [pdf, other]

    cs.CL

    EventBERT: A Pre-Trained Model for Event Correlation Reasoning

    Authors: Yucheng Zhou, Xiubo Geng, Tao Shen, Guodong Long, Daxin Jiang

    Abstract: Event correlation reasoning infers whether a natural language paragraph containing multiple events conforms to human common sense. For example, "Andrew was very drowsy, so he took a long nap, and now he is very alert" is sound and reasonable. In contrast, "Andrew was very drowsy, so he stayed up a long time, now he is very alert" does not comply with human common sense. Such reasoning capability i…

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: 12 pages, 6 figures

  8. arXiv:2110.00159  [pdf, other]

    cs.CL

    Building an Efficient and Effective Retrieval-based Dialogue System via Mutual Learning

    Authors: Chongyang Tao, Jiazhan Feng, Chang Liu, Juntao Li, Xiubo Geng, Daxin Jiang

    Abstract: Establishing retrieval-based dialogue systems that can select appropriate responses from the pre-built index has gained increasing attention from researchers. For this task, the adoption of pre-trained language models (such as BERT) has led to remarkable progress in a number of benchmarks. There exist two common approaches, including cross-encoders which perform full attention over the inputs, and…

    Submitted 30 September, 2021; originally announced October 2021.

    Comments: 9 pages, 4 figures

  9. arXiv:2109.12302  [pdf, other]

    cs.CL cs.AI

    Learning Neural Templates for Recommender Dialogue System

    Authors: Zujie Liang, Huang Hu, Can Xu, Jian Miao, Yingying He, Yining Chen, Xiubo Geng, Fan Liang, Daxin Jiang

    Abstract: Though recent end-to-end neural models have shown promising progress on Conversational Recommender System (CRS), two key challenges still remain. First, the recommended items cannot be always incorporated into the generated replies precisely and appropriately. Second, only the items mentioned in the training corpus have a chance to be recommended in the conversation. To tackle these challenges, we…

    Submitted 25 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 long paper, code link: https://github.com/jokieleung/NTRD

  10. arXiv:2109.12273  [pdf, ps, other]

    cs.LG cs.DC

    FedProc: Prototypical Contrastive Federated Learning on Non-IID data

    Authors: Xutong Mu, Yulong Shen, Ke Cheng, Xueli Geng, Jiaxuan Fu, Tao Zhang, Zhiwei Zhang

    Abstract: Federated learning allows multiple clients to collaborate to train high-performance deep learning models while keeping the training data locally. However, when the local data of all clients are not independent and identically distributed (i.e., non-IID), it is challenging to implement this form of efficient collaborative learning. Although significant efforts have been dedicated to addressing this…

    Submitted 25 September, 2021; originally announced September 2021.

  11. arXiv:2109.07582  [pdf, other]

    cs.LG

    Pareto-wise Ranking Classifier for Multi-objective Evolutionary Neural Architecture Search

    Authors: Lianbo Ma, Nan Li, Guo Yu, Xiaoyu Geng, Min Huang, Xingwei Wang

    Abstract: In the deployment of deep neural models, how to effectively and automatically find feasible deep models under diverse design objectives is fundamental. Most existing neural architecture search (NAS) methods utilize surrogates to predict the detailed performance (e.g., accuracy and model size) of a candidate architecture during the search, which however is complicated and inefficient. In contrast,…

    Submitted 8 March, 2024; v1 submitted 14 September, 2021; originally announced September 2021.

  12. arXiv:2108.13405  [pdf, other]

    math.OC eess.SY

    Stochastic Uncertainty Propagation in Power System Dynamics using Measure-valued Proximal Recursions

    Authors: Abhishek Halder, Kenneth F. Caluya, Pegah Ojaghi, Xinbo Geng

    Abstract: We present a proximal algorithm that performs a variational recursion on the space of joint probability measures to propagate the stochastic uncertainties in power system dynamics over high dimensional state space. The proposed algorithm takes advantage of the exact nonlinearity structures in the trajectory-level dynamics of the networked power systems, and is nonparametric. Lifting the dynamics t…

    Submitted 24 August, 2022; v1 submitted 30 August, 2021; originally announced August 2021.

  13. arXiv:2107.06882  [pdf, other]

    cs.LG

    Conservative Objective Models for Effective Offline Model-Based Optimization

    Authors: Brandon Trabucco, Aviral Kumar, Xinyang Geng, Sergey Levine

    Abstract: Computational design problems arise in a number of settings, from synthetic biology to computer architectures. In this paper, we aim to solve data-driven model-based optimization (MBO) problems, where the goal is to find a design input that maximizes an unknown objective function provided access to only a static dataset of prior experiments. Such data-driven optimization procedures are the only pr…

    Submitted 14 July, 2021; originally announced July 2021.

    Comments: ICML 2021. First two authors contributed equally. Code at: https://github.com/brandontrabucco/design-baselines/blob/c65a53fe1e6567b740f0adf60c5db9921c1f2330/design_baselines/coms_cleaned/__init__.py

  14. arXiv:2107.01189   

    cs.CV cs.LG

    NTIRE 2021 Multi-modal Aerial View Object Classification Challenge

    Authors: Jerrick Liu, Nathan Inkawhich, Oliver Nina, Radu Timofte, Sahil Jain, Bob Lee, Yuru Duan, Wei Wei, Lei Zhang, Songzheng Xu, Yuxuan Sun, Jiaqi Tang, Xueli Geng, Mengru Ma, Gongzhe Li, Xueli Geng, Huanqia Cai, Chengxue Cai, Sol Cummings, Casian Miron, Alexandru Pasarica, Cheng-Yen Yang, Hung-Min Hsu, Jiarui Cai, Jie Mei , et al. (9 additional authors not shown)

    Abstract: In this paper, we introduce the first Challenge on Multi-modal Aerial View Object Classification (MAVOC) in conjunction with the NTIRE 2021 workshop at CVPR. This challenge is composed of two different tracks using EO and SAR imagery. Both EO and SAR sensors possess different advantages and drawbacks. The purpose of this competition is to analyze how to use both sets of sensory information in compl…

    Submitted 6 April, 2022; v1 submitted 2 July, 2021; originally announced July 2021.

    Comments: The paper needs to be withdrawn since it did not properly go through the public release process. We will soon release a new version to replace this one

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2021, 588-595

  15. arXiv:2106.15802  [pdf, other]

    cs.AI

    CityNet: A Comprehensive Multi-Modal Urban Dataset for Advanced Research in Urban Computing

    Authors: Zhengfei Zheng, Xu Geng, Hai Yang

    Abstract: Data-driven approaches have emerged as a popular tool for addressing challenges in urban computing. However, current research efforts have primarily focused on limited data sources, which fail to capture the complexity of urban data arising from multiple entities and their interconnections. Therefore, a comprehensive and multifaceted dataset is required to enable more extensive studies in urban co…

    Submitted 10 April, 2024; v1 submitted 30 June, 2021; originally announced June 2021.

  16. arXiv:2106.06788  [pdf, other]

    cs.LG cs.AI

    Learngene: From Open-World to Your Learning Task

    Authors: Qiufeng Wang, Xin Geng, Shuxia Lin, Shiyu Xia, Lei Qi, Ning Xu

    Abstract: Although deep learning has made significant progress on fixed large-scale datasets, it typically encounters challenges regarding improperly detecting unknown/unseen classes in the open-world scenario, over-parametrized, and overfitting small samples. Since biological systems can overcome the above difficulties very well, individuals inherit an innate gene from collective creatures that have evolve…

    Submitted 17 June, 2022; v1 submitted 12 June, 2021; originally announced June 2021.

    Comments: To be appeared in AAAI-22

  17. arXiv:2106.06152  [pdf, other]

    cs.LG

    On the Robustness of Average Losses for Partial-Label Learning

    Authors: Jiaqi Lv, Biao Liu, Lei Feng, Ning Xu, Miao Xu, Bo An, Gang Niu, Xin Geng, Masashi Sugiyama

    Abstract: Partial-label learning (PLL) utilizes instances with PLs, where a PL includes several candidate labels but only one is the true label (TL). In PLL, identification-based strategy (IBS) purifies each PL on the fly to select the (most likely) TL for training; average-based strategy (ABS) treats all candidate labels equally for training and let trained models be able to predict TL. Although PLL resear…

    Submitted 24 November, 2022; v1 submitted 10 June, 2021; originally announced June 2021.

  18. arXiv:2106.01541  [pdf, other]

    cs.CL

    MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding

    Authors: Jia-Chen Gu, Chongyang Tao, Zhen-Hua Ling, Can Xu, Xiubo Geng, Daxin Jiang

    Abstract: Recently, various neural models for multi-party conversation (MPC) have achieved impressive improvements on a variety of tasks such as addressee recognition, speaker identification and response prediction. However, these existing methods on MPC usually represent interlocutors and utterances individually and ignore the inherent complicated structure in MPC which may provide crucial interlocutor and…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted by ACL 2021

  19. arXiv:2105.13073  [pdf, other]

    cs.CL cs.AI

    Maria: A Visual Experience Powered Conversational Agent

    Authors: Zujie Liang, Huang Hu, Can Xu, Chongyang Tao, Xiubo Geng, Yining Chen, Fan Liang, Daxin Jiang

    Abstract: Arguably, the visual perception of conversational agents to the physical world is a key way for them to exhibit the human-like intelligence. Image-grounded conversation is thus proposed to address this challenge. Existing works focus on exploring the multimodal dialog models that ground the conversation on a given image. In this paper, we take a step further to study image-grounded conversation un…

    Submitted 23 June, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: Accepted by ACL 2021 main conference

  20. arXiv:2105.11631  [pdf, other]

    cond-mat.supr-con physics.optics

    Optical trapping of nanoparticles in superfluid helium

    Authors: Yosuke Minowa, Xi Geng, Keisuke Kokado, Kentaro Sato, Tatsuya Kameyama, Tsukasa Torimoto, Masaaki Ashida

    Abstract: Optical tweezers, the three-dimensional confinement of a nanoparticle by a strongly focused beam of light, have been widely employed in investigating biomaterial nanomechanics, nanoscopic fluid properties, and ultrasensitive detections in various environments such as inside living cells, at gigapascal pressure, and under high vacuum. However, the cryogenic operation of solid-state-particle optical…

    Submitted 24 May, 2021; originally announced May 2021.

    Journal ref: Optica 9, 139-144 (2022)

  21. arXiv:2105.07149  [pdf, other]

    cs.CL

    DirectQE: Direct Pretraining for Machine Translation Quality Estimation

    Authors: Qu Cui, Shujian Huang, Jiahuan Li, Xiang Geng, Zaixiang Zheng, Guoping Huang, Jiajun Chen

    Abstract: Machine Translation Quality Estimation (QE) is a task of predicting the quality of machine translations without relying on any reference. Recently, the predictor-estimator framework trains the predictor as a feature extractor, which leverages the extra parallel corpora without QE labels, achieving promising QE performance. However, we argue that there are gaps between the predictor and the estimat…

    Submitted 15 May, 2021; originally announced May 2021.

  22. arXiv:2104.13597  [pdf, other]

    physics.ins-det hep-ex

    SAGE : A Monte Carlo Simulation Framework for Experiments with Germanium Detectors

    Authors: Ze She, Hao Ma, Weihe Zeng, Wenhan Dai, Xinping Geng, Ofoq Normahmedov, Jingzhe Yang, Zhi Zeng, Qian Yue, Jianping Cheng, Junli Li

    Abstract: A Geant4-based simulation framework for rare event searching experiments with germanium detectors named SAGE is presented with details. It is designed for simulating, assessing background distribution, and investigating the response of the germanium detectors. The SAGE framework incorporates its experiment-specific geometries and custom attributes, including the event generators, physics lists and…

    Submitted 27 September, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

    Comments: 12 pages, 5 figures

  23. arXiv:2104.02570  [pdf, other]

    cs.LG

    Learning from Noisy Labels via Dynamic Loss Thresholding

    Authors: Hao Yang, Youzhi Jin, Ziyin Li, Deng-Bao Wang, Lei Miao, Xin Geng, Min-Ling Zhang

    Abstract: Numerous researches have proved that deep neural networks (DNNs) can fit everything in the end even given data with noisy labels, and result in poor generalization performance. However, recent studies suggest that DNNs tend to gradually memorize the data, moving from correct data to mislabeled data. Inspired by this finding, we propose a novel method named Dynamic Loss Thresholding (DLT). During t…

    Submitted 1 April, 2021; originally announced April 2021.

  24. arXiv:2103.16424  [pdf, other]

    eess.SY

    Two-stage Robust Energy Storage Planning with Probabilistic Guarantees: A Data-driven Approach

    Authors: Chao Yan, Xinbo Geng, Zhaohong Bie, Le Xie

    Abstract: This paper addresses a central challenge of jointly considering shorter-term (e.g. hourly) and longer-term (e.g. yearly) uncertainties in power system planning with increasing penetration of renewable and storage resources. In conventional planning decision making, shorter-term (e.g., hourly) variations are not explicitly accounted for. However, given the deepening penetration of variable resource…

    Submitted 10 September, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

  25. arXiv:2102.09026  [pdf, other]

    cs.LG

    Optimizing Large-Scale Hyperparameters via Automated Learning Algorithm

    Authors: Bin Gu, Guodong Liu, Yanfu Zhang, Xiang Geng, Heng Huang

    Abstract: Modern machine learning algorithms usually involve tuning multiple (from one to thousands) hyperparameters which play a pivotal role in terms of model generalizability. Black-box optimization and gradient-based algorithms are two dominant approaches to hyperparameter optimization while they have totally distinct advantages. How to design a new hyperparameter optimization technique inheriting all b…

    Submitted 17 February, 2021; originally announced February 2021.

  26. Deep Learning Framework for Multi-Round Service Bundle Recommendation in Iterative Mashup Development

    Authors: Yutao Ma, Xiao Geng, Jian Wang, Keqing He, Dionysis Athanasopoulos

    Abstract: Recent years have witnessed the rapid development of service-oriented computing technologies. The boom of Web services increases software developers' selection burden in developing new service-based systems such as mashups. Timely recommending appropriate component services for developers to build new mashups has become a fundamental problem in service-oriented software engineering. Existing servi…

    Submitted 6 September, 2022; v1 submitted 7 January, 2021; originally announced January 2021.

    Comments: 15 pages, 6 figures, and 3 tables

    ACM Class: D.2.10

    Journal ref: CAAI Transactions on Intelligence Technology, 2022

  27. arXiv:2012.14624  [pdf, other]

    math.OC eess.SY

    Deferrable Load Scheduling under Demand Charge: A Block Model-Predictive Control Approach

    Authors: Lei Yang, Xinbo Geng, Xiaohong Guan, Lang Tong

    Abstract: Optimal scheduling of deferrable electrical loads can reshape the aggregated load profile to achieve higher operational efficiency and reliability. This paper studies deferrable load scheduling under demand charge that imposes a penalty on the peak consumption over a billing period. Such a terminal cost poses challenges in real-time dispatch when demand forecasts are inaccurate. A block model-pred…

    Submitted 11 January, 2021; v1 submitted 29 December, 2020; originally announced December 2020.

    Comments: 10 pages, 4 plots

  28. arXiv:2012.07769  [pdf, other]

    cs.LG cs.AI

    Variable-Shot Adaptation for Online Meta-Learning

    Authors: Tianhe Yu, Xinyang Geng, Chelsea Finn, Sergey Levine

    Abstract: Few-shot meta-learning methods consider the problem of learning new tasks from a small, fixed number of examples, by meta-learning across static data from a set of previous tasks. However, in many real world settings, it is more natural to view the problem as one of minimizing the total amount of supervision --- both the number of examples needed to learn a new task and the amount of data needed f…

    Submitted 14 December, 2020; originally announced December 2020.

    Comments: First two authors contribute equally

  29. arXiv:2012.03502  [pdf, other]

    cs.CL

    Dialogue Discourse-Aware Graph Model and Data Augmentation for Meeting Summarization

    Authors: Xiachong Feng, Xiaocheng Feng, Bing Qin, Xinwei Geng

    Abstract: Meeting summarization is a challenging task due to its dynamic interaction nature among multiple speakers and lack of sufficient training data. Existing methods view the meeting as a linear sequence of utterances while ignoring the diverse relations between each utterance. Besides, the limited labeled data further hinders the ability of data-hungry neural models. In this paper, we try to mitigate…

    Submitted 19 May, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: IJCAI 2021

  30. arXiv:2011.05193  [pdf, other]

    eess.SY

    Probabilistic Hosting Capacity Analysis via Bayesian Optimization

    Authors: Xinbo Geng, Lang Tong, Anirban Bhattacharya, Bani Mallick, Le Xie

    Abstract: This paper studies the probabilistic hosting capacity analysis (PHCA) problem in distribution networks considering uncertainties from distributed energy resources (DERs) and residential loads. PHCA aims to compute the hosting capacity, which is defined as the maximal level of DERs that can be securely integrated into a distribution network while satisfying operational constraints with high probabi…

    Submitted 10 November, 2020; originally announced November 2020.

  31. When Is the Conway-Maxwell-Poisson Distribution Infinitely Divisible?

    Authors: Xi Geng, Aihua Xia

    Abstract: An essential character for a distribution to play a central role in the limit theory is infinite divisibility. In this note, we prove that the Conway-Maxwell-Poisson (CMP) distribution is infinitely divisible iff it is the Poisson or geometric distribution. This explains that, despite its applications in a wide range of fields, there is no theoretical foundation for the CMP distribution to be a na…

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: 11 pages

    MSC Class: Primary 60F05; Secondary 60E05; 60E07

    Journal ref: Statistics & Probability Letters, Elsevier, vol. 181, 2022

  32. arXiv:2010.01272  [pdf, other]

    cs.CL

    Towards Interpretable Reasoning over Paragraph Effects in Situation

    Authors: Mucheng Ren, Xiubo Geng, Tao Qin, Heyan Huang, Daxin Jiang

    Abstract: We focus on the task of reasoning over paragraph effects in situation, which requires a model to understand the cause and effect described in a background paragraph, and apply the knowledge to a novel situation. Existing works ignore the complicated reasoning process and solve it with a one-step "black box" model. Inspired by human cognitive processes, in this paper we propose a sequential approac…

    Submitted 3 October, 2020; originally announced October 2020.

    Comments: 14 pages. Accepted as EMNLP2020 Long paper

  33. Knowledge-Aware Procedural Text Understanding with Multi-Stage Training

    Authors: Zhihan Zhang, Xiubo Geng, Tao Qin, Yunfang Wu, Daxin Jiang

    Abstract: Procedural text describes dynamic state changes during a step-by-step natural process (e.g., photosynthesis). In this work, we focus on the task of procedural text understanding, which aims to comprehend such documents and track entities' states and locations during a process. Although recent approaches have achieved substantial progress, their results are far behind human performance. Two challen…

    Submitted 13 February, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: Published as full paper in Proceedings of the Web Conference 2021 (WWW'21)

  34. arXiv:2009.13084  [pdf, ps, other]

    math.CA

    Lipschitz-stability of Controlled Rough Paths and Rough Differential Equations

    Authors: Horatio Boedihardjo, Xi Geng

    Abstract: We provide an account for the existence and uniqueness of solutions to rough differential equations under the framework of controlled rough paths. The case when the driving path is $β$-Hölder continuous, for $β>1/3$, is widely available in the literature. In its extension to the case when $β\leqslant1/3,$ a main challenge and missing ingredient is to show that controlled roughs paths are closed un…

    Submitted 28 September, 2020; originally announced September 2020.

    Comments: 33 pages

  35. arXiv:2009.13082  [pdf, other]

    math.CA

    SL_2(R)-developments and Signature Asymptotics for Planar Paths with Bounded Variation

    Authors: Horatio Boedihardjo, Xi Geng

    Abstract: The signature transform, defined by the formal tensor series of global iterated path integrals, is a homomorphism between the path space and the tensor algebra that has been studied in geometry, control theory, number theory as well as stochastic analysis. An elegant isometry conjecture states that the length of a bounded variation path $γ$ can be recovered from the asymptotics of its normalised s…

    Submitted 7 November, 2022; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: 45 pages, 3 figures

  36. arXiv:2009.08607  [pdf, ps, other]

    cs.LG stat.ML

    Compact Learning for Multi-Label Classification

    Authors: Jiaqi Lv, Tianran Wu, Chenglun Peng, Yunpeng Liu, Ning Xu, Xin Geng

    Abstract: Multi-label classification (MLC) studies the problem where each instance is associated with multiple relevant labels, which leads to the exponential growth of output space. MLC encourages a popular framework named label compression (LC) for capturing label dependency with dimension reduction. Nevertheless, most existing LC methods failed to consider the influence of the feature space or misguided…

    Submitted 17 September, 2020; originally announced September 2020.

  37. arXiv:2008.01229  [pdf, ps, other]

    math.PR

    Precise Local Estimates for Differential Equations driven by Fractional Brownian Motion: Hypoelliptic Case

    Authors: Xi Geng, Cheng Ouyang, Samy Tindel

    Abstract: This article is concerned with stochastic differential equations driven by a $d$ dimensional fractional Brownian motion with Hurst parameter $H>1/4$, understood in the rough paths sense. Whenever the coefficients of the equation satisfy a uniform hypoellipticity condition, we establish a sharp local estimate on the associated control distance function and a sharp local lower estimate on the densit…

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: This preprint is the result of splitting our original submission arXiv:1907.00171, which was slightly too long. The current preprint contains the hypoelliptic part of our analysis. Part of the presentation (and arguments) in the current preprint is different from the original submission

    MSC Class: 60H10; 60H07; 60G15

  38. arXiv:2007.16178  [pdf, ps, other]

    math.PR

    Precise Local Estimates for Differential Equations driven by Fractional Brownian Motion: Elliptic Case

    Authors: Xi Geng, Cheng Ouyang, Samy Tindel

    Abstract: This article is concerned with stochastic differential equations driven by a $d$ dimensional fractional Brownian motion with Hurst parameter $H>1/4$, understood in the rough paths sense. Whenever the coefficients of the equation satisfy a uniform ellipticity condition, we establish a sharp local estimate on the associated control distance function and a sharp local lower estimate on the density of…

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: This preprint is the result of splitting our original submission arXiv:1907.00171, which was slightly too long. The current preprint contains the elliptic part of our analysis

    MSC Class: 60H10; 60H07; 60G15

  39. arXiv:2007.15555  [pdf, other]

    hep-ex hep-ph physics.ins-det

    First experimental constraints on WIMP couplings in the effective field theory framework from CDEX

    Authors: Y. Wang, Z. Zeng, Q. Yue, L. T. Yang, K. J. Kang, Y. J. Li, M. Agartioglu, H. P. An, J. P. Chang, J. H. Chen, Y. H. Chen, J. P. Cheng, C. Y. Chiang, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, X. Y. Guo, H. J. He, L. He, S. M. He, J. W. Hu, T. C. Huang , et al. (63 additional authors not shown)

    Abstract: We present weakly interacting massive particles (WIMPs) search results performed using two approaches of effective field theory from the China Dark Matter Experiment (CDEX), based on the data from both CDEX-1B and CDEX-10 stages. In the nonrelativistic effective field theory approach, both time-integrated and annual modulation analyses were used to set new limits for the coupling of WIMP-nucleon e…

    Submitted 26 April, 2021; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: version accepted by Science China-PMA, 8 pages, 8 figures

    Journal ref: Sci. China-Phys. Mech. Astron. 64, 281011 (2021)

  40. arXiv:2007.08929  [pdf, other]

    cs.LG stat.ML

    Provably Consistent Partial-Label Learning

    Authors: Lei Feng, Jiaqi Lv, Bo Han, Miao Xu, Gang Niu, Xin Geng, Bo An, Masashi Sugiyama

    Abstract: Partial-label learning (PLL) is a multi-class classification problem, where each training example is associated with a set of candidate labels. Even though many practical PLL methods have been proposed in the last two decades, there lacks a theoretical understanding of the consistency of those methods-none of the PLL methods hitherto possesses a generation process of candidate label sets, and then…

    Submitted 23 October, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: NeurIPS 2020 camera-ready version

  41. arXiv:2007.01771  [pdf, other]

    cs.CV

    Learning Expectation of Label Distribution for Facial Age and Attractiveness Estimation

    Authors: Bin-Bin Gao, Xin-Xin Liu, Hong-Yu Zhou, Jianxin Wu, Xin Geng

    Abstract: Facial attributes (e.g., age and attractiveness) estimation performance has been greatly improved by using convolutional neural networks. However, existing methods have an inconsistency between the training objectives and the evaluation metric, so they may be suboptimal. In addition, these methods always adopt image classification or face recognition models with a large amount of parameters, which…

    Submitted 31 December, 2021; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: submitted to Pattern Recognition

  42. arXiv:2006.07178  [pdf, other]

    cs.LG stat.ML

    Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling

    Authors: Russell Mendonca, Xinyang Geng, Chelsea Finn, Sergey Levine

    Abstract: Reinforcement learning algorithms can acquire policies for complex tasks autonomously. However, the number of samples required to learn a diverse set of skills can be prohibitively large. While meta-reinforcement learning methods have enabled agents to leverage prior experience to adapt quickly to new tasks, their performance depends crucially on how close the new task is to the previously experie…

    Submitted 15 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

  43. arXiv:2005.00979  [pdf, other]

    cs.CL cs.LG

    How Does Selective Mechanism Improve Self-Attention Networks?

    Authors: Xinwei Geng, Longyue Wang, Xing Wang, Bing Qin, Ting Liu, Zhaopeng Tu

    Abstract: Self-attention networks (SANs) with selective mechanism has produced substantial improvements in various NLP tasks by concentrating on a subset of input words. However, the underlying reasons for their strong performance have not been well explained. In this paper, we bridge the gap by assessing the strengths of selective SANs (SSANs), which are implemented with a flexible and universal Gumbel-Sof…

    Submitted 3 May, 2020; originally announced May 2020.

    Comments: ACL 2020

  44. arXiv:2004.14164  [pdf, other]

    cs.CL cs.LG stat.ML

    MICK: A Meta-Learning Framework for Few-shot Relation Classification with Small Training Data

    Authors: Xiaoqing Geng, Xiwen Chen, Kenny Q. Zhu, Libin Shen, Yinggong Zhao

    Abstract: Few-shot relation classification seeks to classify incoming query instances after meeting only few support instances. This ability is gained by training with large amount of in-domain annotated data. In this paper, we tackle an even harder problem by further limiting the amount of data available at training time. We propose a few-shot learning framework for relation classification, which is partic…

    Submitted 14 December, 2020; v1 submitted 26 April, 2020; originally announced April 2020.

    Journal ref: CIKM 2020: The 29th ACM International Conference on Information and Knowledge Management

  45. arXiv:2004.08861  [pdf, other]

    cs.LG cs.NE stat.ML

    Role-Wise Data Augmentation for Knowledge Distillation

    Authors: Jie Fu, Xue Geng, Zhijian Duan, Bohan Zhuang, Xingdi Yuan, Adam Trischler, Jie Lin, Chris Pal, Hao Dong

    Abstract: Knowledge Distillation (KD) is a common method for transferring the "knowledge" learned by one machine learning model (the teacher) into another model (the student), where typically, the teacher has a greater capacity (e.g., more parameters or higher bit-widths). To our knowledge, existing methods overlook the fact that although the student absorbs extra knowledge from the teac…

    Submitted 19 April, 2020; originally announced April 2020.

  46. arXiv:2004.02717  [pdf, other]

    cs.NI

    Joint Routing and Scheduling for Large-Scale Deterministic IP Networks

    Authors: Jonatan Krolikowski, Sebastien Martin, Paolo Medagliani, Jeremie Leguay, Shuang Chen, Xiaodong Chang, Xuesong Geng

    Abstract: With the advent of 5G and the evolution of Internet protocols, industrial applications are moving from vertical solutions to general purpose IP-based infrastructures that need to meet deterministic Quality of Service (QoS) requirements. The IETF DetNet working group aims at providing an answer to this need with support for (i) deterministic worst-case latency and jitter, and (ii) zero packet loss…

    Submitted 28 October, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: To appear in Elsevier Computer Communications

  47. arXiv:2004.02616  [pdf]

    physics.plasm-ph

    A spin-filter for polarized electron acceleration in plasma wakefields

    Authors: Yitong Wu, Liangliang Ji, Xuesong Geng, Johannes Thomas, Markus Büscher, Alexander Pukhov, Anna Hützen, Lingang Zhang, Baifei Shen, Ruxin Li

    Abstract: We propose a filter method to generate electron beams of high polarization from bubble and blow-out wakefield accelerators. The mechanism is based on the idea to identify all electron-beam subsets with low-polarization and to filter them out by an X-shaped slit placed right behind the plasma accelerator. To find these subsets we investigate the dependence between the initial azimuthal angle and th…

    Submitted 6 April, 2020; originally announced April 2020.

  48. arXiv:2002.12591  [pdf, other]

    cs.CL

    DC-BERT: Decoupling Question and Document for Efficient Contextual Encoding

    Authors: Yuyu Zhang, Ping Nie, Xiubo Geng, Arun Ramamurthy, Le Song, Daxin Jiang

    Abstract: Recent studies on open-domain question answering have achieved prominent performance improvement using pre-trained language models such as BERT. State-of-the-art approaches typically follow the "retrieve and read" pipeline and employ BERT-based reranker to filter retrieved documents before feeding them into the reader module. The BERT retriever takes as input the concatenation of question and each…

    Submitted 28 February, 2020; originally announced February 2020.

  49. arXiv:2002.11089  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement

    Authors: Benjamin Eysenbach, Xinyang Geng, Sergey Levine, Ruslan Salakhutdinov

    Abstract: Multi-task reinforcement learning (RL) aims to simultaneously learn policies for solving many tasks. Several prior works have found that relabeling past experience with different reward functions can improve sample efficiency. Relabeling methods typically ask: if, in hindsight, we assume that our experience was optimal for some task, for what task was it optimal? In this paper, we show that hindsi…

    Submitted 25 February, 2020; originally announced February 2020.

  50. arXiv:2002.08053  [pdf, other]

    cs.LG stat.ML

    Progressive Identification of True Labels for Partial-Label Learning

    Authors: Jiaqi Lv, Miao Xu, Lei Feng, Gang Niu, Xin Geng, Masashi Sugiyama

    Abstract: Partial-label learning (PLL) is a typical weakly supervised learning problem, where each training instance is equipped with a set of candidate labels among which only one is the true label. Most existing methods elaborately designed learning objectives as constrained optimizations that must be solved in specific manners, making their computational complexity a bottleneck for scaling up to big data…

    Submitted 5 September, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: In Proceedings of the 37th International Conference on Machine Learning (ICML 2020)