
Showing 1–50 of 104 results for author: Xiong, F

  1. arXiv:2407.20155  [pdf, other]

    math.NA

    GsPINN: A novel fast Green kernel solver based on symmetric Physics-Informed neural networks

    Authors: Xiaopei Jiao, Fansheng Xiong

    Abstract: Ever since deep learning was introduced for solving partial differential equations (PDEs), there has been considerable interest in the real-time response of systems in which the kernel function plays an important role. Physics-informed neural networks (PINNs), a popular tool in recent years, were proposed to perform mesh-free, semi-supervised learning with high flexibility. This paper explores the…

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: 17 pages, 5 figures, 2 tables

  2. arXiv:2407.14507  [pdf, other]

    cs.CL

    Internal Consistency and Self-Feedback in Large Language Models: A Survey

    Authors: Xun Liang, Shichao Song, Zifan Zheng, Hanyu Wang, Qingchen Yu, Xunkai Li, Rong-Hua Li, Feiyu Xiong, Zhiyu Li

    Abstract: Large language models (LLMs) are expected to respond accurately but often exhibit deficient reasoning or generate hallucinatory content. To address these, studies prefixed with "Self-" such as Self-Consistency, Self-Improve, and Self-Refine have been initiated. They share a commonality: involving LLMs evaluating and updating themselves to mitigate the issues. Nonetheless, these efforts lack a unifie…

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: 27 pages, 9 figures, 10 tables, 14 equations

  3. arXiv:2407.01654  [pdf, other]

    physics.flu-dyn

    A thermodynamically consistent phase-field lattice Boltzmann method for two-phase electrohydrodynamic flows

    Authors: Fang Xiong, Lei Wang, Jiangxu Huang, Kang Luo

    Abstract: In this work, we aim to develop a phase-field based lattice Boltzmann (LB) method for simulating two-phase electrohydrodynamics (EHD) flows, which allows for different properties (densities, viscosities, conductivity and permittivity) of each phase while maintaining thermodynamic consistency. To this end, we first present a theoretical analysis on the two-phase EHD flows by using the Onsager's var…

    Submitted 1 July, 2024; originally announced July 2024.

  4. arXiv:2407.01178  [pdf, other]

    cs.CL cs.AI cs.LG

    $\text{Memory}^3$: Language Modeling with Explicit Memory

    Authors: Hongkang Yang, Zehao Lin, Wenjin Wang, Hao Wu, Zhiyu Li, Bo Tang, Wenqiang Wei, Jinbo Wang, Zeyun Tang, Shichao Song, Chenyang Xi, Yu Yu, Kai Chen, Feiyu Xiong, Linpeng Tang, Weinan E

    Abstract: The training and inference of large language models (LLMs) are together a costly process that transports knowledge from raw data to meaningful computation. Inspired by the memory hierarchy of the human brain, we reduce this cost by equipping LLMs with explicit memory, a memory format cheaper than model parameters and text retrieval-augmented generation (RAG). Conceptually, with most of its knowled…

    Submitted 1 July, 2024; originally announced July 2024.

    MSC Class: 68T50; ACM Class: I.2.7

  5. arXiv:2407.00668  [pdf, other]

    cs.CL

    HRDE: Retrieval-Augmented Large Language Models for Chinese Health Rumor Detection and Explainability

    Authors: Yanfang Chen, Ding Chen, Shichao Song, Simin Niu, Hanyu Wang, Zeyun Tang, Feiyu Xiong, Zhiyu Li

    Abstract: As people increasingly prioritize their health, the speed and breadth of health information dissemination on the internet have also grown. At the same time, the presence of false health information (health rumors) intermingled with genuine content poses a significant potential threat to public health. However, current research on Chinese health rumors still lacks a large-scale, public, and open-so…

    Submitted 3 July, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

  6. arXiv:2406.16069  [pdf, other]

    cs.CL cs.AI

    FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models

    Authors: Junyi Zhu, Shuochen Liu, Yu Yu, Bo Tang, Yibo Yan, Zhiyu Li, Feiyu Xiong, Tong Xu, Matthew B. Blaschko

    Abstract: Large language models (LLMs) excel in generating coherent text, but they often struggle with context awareness, leading to inaccuracies in tasks requiring faithful adherence to provided information. We introduce FastMem, a novel method designed to enhance instruction fine-tuned LLMs' context awareness through fast memorization of the prompt. FastMem maximizes the likelihood of the prompt before in…

    Submitted 23 June, 2024; originally announced June 2024.

  7. arXiv:2405.20763  [pdf, other]

    cs.LG math.OC stat.ML

    Improving Generalization and Convergence by Enhancing Implicit Regularization

    Authors: Mingze Wang, Haotian He, Jinbo Wang, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, Lei Wu

    Abstract: In this work, we propose an Implicit Regularization Enhancement (IRE) framework to accelerate the discovery of flat solutions in deep learning, thereby improving generalization and convergence. Specifically, IRE decouples the dynamics of flat and sharp directions, which boosts the sharpness reduction along flat directions while maintaining the training stability in sharp directions. We show that I…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 35 pages

  8. arXiv:2405.16933  [pdf, other]

    cs.CL cs.IR

    Empowering Large Language Models to Set up a Knowledge Retrieval Indexer via Self-Learning

    Authors: Xun Liang, Simin Niu, Zhiyu Li, Sensen Zhang, Shichao Song, Hanyu Wang, Jiawei Yang, Feiyu Xiong, Bo Tang, Chenyang Xi

    Abstract: Retrieval-Augmented Generation (RAG) offers a cost-effective approach to injecting real-time knowledge into large language models (LLMs). Nevertheless, constructing and validating high-quality knowledge repositories require considerable effort. We propose a pre-retrieval framework named Pseudo-Graph Retrieval-Augmented Generation (PG-RAG), which conceptualizes LLMs as students by providing them wi…

    Submitted 27 May, 2024; originally announced May 2024.

  9. arXiv:2405.11874  [pdf, other]

    cs.CL

    xFinder: Robust and Pinpoint Answer Extraction for Large Language Models

    Authors: Qingchen Yu, Zifan Zheng, Shichao Song, Zhiyu Li, Feiyu Xiong, Bo Tang, Ding Chen

    Abstract: The continuous advancement of large language models (LLMs) has brought increasing attention to the critical issue of developing fair and reliable methods for evaluating their performance. Particularly, the emergence of subjective or non-subjective cheating phenomena, such as test set leakage and prompt format overfitting, poses significant challenges to the reliable evaluation of LLMs. Since evalu…

    Submitted 23 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 37 Pages

  10. arXiv:2405.01726  [pdf, ps, other]

    eess.IV cs.CV cs.LG

    SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising

    Authors: Guanyiman Fu, Fengchao Xiong, Jianfeng Lu, Jun Zhou

    Abstract: Denoising is a crucial preprocessing step for hyperspectral images (HSIs) due to noise arising from intraimaging mechanisms and environmental factors. Long-range spatial-spectral correlation modeling is beneficial for HSI denoising but often comes with high computational complexity. Based on the state space model (SSM), Mamba is known for its remarkable long-range dependency modeling capabilities…

    Submitted 20 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  11. arXiv:2404.06926  [pdf, other]

    cs.RO

    Gaussian-LIC: Photo-realistic LiDAR-Inertial-Camera SLAM with 3D Gaussian Splatting

    Authors: Xiaolei Lang, Laijian Li, Hang Zhang, Feng Xiong, Mu Xu, Yong Liu, Xingxing Zuo, Jiajun Lv

    Abstract: We present a real-time LiDAR-Inertial-Camera SLAM system with 3D Gaussian Splatting as the mapping backend. Leveraging robust pose estimates from our LiDAR-Inertial-Camera odometry, Coco-LIC, an incremental photo-realistic mapping system is proposed in this paper. We initialize 3D Gaussians from colorized LiDAR points and optimize them using differentiable rendering powered by 3D Gaussian Splattin…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Submitted to IROS 2024

  12. arXiv:2403.12839  [pdf, other]

    cs.CV

    Global-guided Focal Neural Radiance Field for Large-scale Scene Rendering

    Authors: Mingqi Shao, Feng Xiong, Hang Zhang, Shuang Yang, Mu Xu, Wei Bian, Xueqian Wang

    Abstract: Neural radiance fields (NeRF) have recently been applied to render large-scale scenes. However, their limited model capacity typically results in blurred rendering results. Existing large-scale NeRFs primarily address this limitation by partitioning the scene into blocks, which are subsequently handled by separate sub-NeRFs. These sub-NeRFs, trained from scratch and processed independently, lead t…

    Submitted 19 March, 2024; originally announced March 2024.

  13. arXiv:2403.04283  [pdf, other]

    cs.CL cs.AI cs.LG

    Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy

    Authors: Yu Zhu, Chuxiong Sun, Wenfei Yang, Wenqiang Wei, Bo Tang, Tianzhu Zhang, Zhiyu Li, Shifeng Zhang, Feiyu Xiong, Jie Hu, Mingchuan Yang

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is the prevailing approach to ensure Large Language Models (LLMs) align with human values. However, existing RLHF methods require a high computational cost, one main reason being that RLHF assigns both the generation and alignment tasks to the LLM simultaneously. In this paper, we introduce Proxy-RLHF, which decouples the generation and alignment p…

    Submitted 7 March, 2024; originally announced March 2024.

  14. arXiv:2403.00862  [pdf, other]

    cs.CL cs.AI

    NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism

    Authors: Miao Li, Ming-Bin Chen, Bo Tang, Shengbin Hou, Pengyu Wang, Haiying Deng, Zhiyu Li, Feiyu Xiong, Keming Mao, Peng Cheng, Yi Luo

    Abstract: We present NewsBench, a novel evaluation framework to systematically assess the editorial capabilities of Large Language Models (LLMs) in Chinese journalism. Our constructed benchmark dataset is focused on four facets of writing proficiency and six facets of safety adherence, and it comprises 1,267 manually and carefully designed test samples in the types of multiple choice questi…

    Submitted 4 June, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: Long paper, ACL 2024 Main

  15. arXiv:2402.11218  [pdf, other]

    cs.CL

    Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs

    Authors: Xun Liang, Hanyu Wang, Shichao Song, Mengting Hu, Xunzhi Wang, Zhiyu Li, Feiyu Xiong, Bo Tang

    Abstract: Controlled Text Generation (CTG) aims to produce texts that exhibit specific desired attributes. In this study, we introduce a pluggable CTG framework for Large Language Models (LLMs) named Dynamic Attribute Graphs-based controlled text generation (DATG). This framework utilizes an attribute scorer to evaluate the attributes of sentences generated by LLMs and constructs dynamic attribute graphs. D…

    Submitted 24 May, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: 18 Pages, Accepted by ACL 2024 Findings

  16. arXiv:2402.07744  [pdf, other]

    cs.AI cs.CL cs.LG

    Towards Unified Alignment Between Agents, Humans, and Environment

    Authors: Zonghan Yang, An Liu, Zijun Liu, Kaiming Liu, Fangzhou Xiong, Yile Wang, Zeyuan Yang, Qingyuan Hu, Xinrui Chen, Zhenhe Zhang, Fuwen Luo, Zhicheng Guo, Peng Li, Yang Liu

    Abstract: The rapid progress of foundation models has led to the prosperity of autonomous agents, which leverage the universal capabilities of foundation models to conduct reasoning, decision-making, and environmental interaction. However, the efficacy of agents remains limited when operating in intricate, realistic environments. In this work, we introduce the principles of $\mathbf{U}$nified $\mathbf{A}$li…

    Submitted 14 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Project webpage: https://agent-force.github.io/unified-alignment-for-agents.html

  17. arXiv:2401.17043  [pdf, other]

    cs.CL

    CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models

    Authors: Yuanjie Lyu, Zhiyu Li, Simin Niu, Feiyu Xiong, Bo Tang, Wenjin Wang, Hao Wu, Huanyong Liu, Tong Xu, Enhong Chen

    Abstract: Retrieval-Augmented Generation (RAG) is a technique that enhances the capabilities of large language models (LLMs) by incorporating external knowledge sources. This method addresses common LLM limitations, including outdated information and the tendency to produce inaccurate "hallucinated" content. However, the evaluation of RAG systems is challenging, as existing benchmarks are limited in scope a…

    Submitted 15 July, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: 40 Pages

  18. arXiv:2401.12326  [pdf, other]

    cs.CL cs.AI

    Fine-tuning Large Language Models for Multigenerator, Multidomain, and Multilingual Machine-Generated Text Detection

    Authors: Feng Xiong, Thanet Markchom, Ziwei Zheng, Subin Jung, Varun Ojha, Huizhi Liang

    Abstract: SemEval-2024 Task 8 introduces the challenge of identifying machine-generated texts from diverse Large Language Models (LLMs) in various languages and domains. The task comprises three subtasks: binary classification in monolingual and multilingual settings (Subtask A), multi-class classification (Subtask B), and mixed text detection (Subtask C). This paper focuses on Subtasks A and B. Each subtask is support…

    Submitted 22 January, 2024; originally announced January 2024.

  19. Hyperspectral Image Denoising via Spatial-Spectral Recurrent Transformer

    Authors: Guanyiman Fu, Fengchao Xiong, Jianfeng Lu, Jun Zhou, Jiantao Zhou, Yuntao Qian

    Abstract: Hyperspectral images (HSIs) often suffer from noise arising from both intra-imaging mechanisms and environmental factors. Leveraging domain knowledge specific to HSIs, such as global spectral correlation (GSC) and non-local spatial self-similarity (NSS), is crucial for effective denoising. Existing methods tend to independently utilize each of these knowledge components with multiple blocks, overl…

    Submitted 8 January, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

  20. arXiv:2401.03385  [pdf, other]

    cs.CL

    Grimoire is All You Need for Enhancing Large Language Models

    Authors: Ding Chen, Shichao Song, Qingchen Yu, Zhiyu Li, Wenjin Wang, Feiyu Xiong, Bo Tang

    Abstract: In-context Learning (ICL) is one of the key methods for enhancing the performance of large language models on specific tasks by providing a set of few-shot examples. However, the ICL capability of different types of models shows significant variation due to factors such as model architecture, volume of learning data, and the size of parameters. Generally, the larger the model's parameter size and…

    Submitted 10 January, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: 9 pages

  21. arXiv:2311.15296  [pdf, other]

    cs.CL

    UHGEval: Benchmarking the Hallucination of Chinese Large Language Models via Unconstrained Generation

    Authors: Xun Liang, Shichao Song, Simin Niu, Zhiyu Li, Feiyu Xiong, Bo Tang, Yezhaohui Wang, Dawei He, Peng Cheng, Zhonghao Wang, Haiying Deng

    Abstract: Large language models (LLMs) have emerged as pivotal contributors in contemporary natural language processing and are increasingly being applied across a diverse range of industries. However, these large-scale probabilistic statistical models cannot currently ensure the requisite quality in professional content generation. These models often produce hallucinated text, compromising their practical…

    Submitted 23 May, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: Accepted by ACL 2024

  22. arXiv:2305.01422  [pdf, other]

    cond-mat.mes-hall cond-mat.str-el

    Distinct quasiparticle interference patterns for surface impurity scattering on various Weyl semimetals

    Authors: Feng Xiong, Chaocheng He, Yong Liu, Annica M. Black-Schaffer, Tanay Nag

    Abstract: We examine the response of the Fermi arc in the context of quasi-particle interference (QPI) with regard to a localized surface impurity on various three-dimensional Weyl semimetals (WSMs). Our study also reveals the variation of the local density of states (LDOS), obtained by Fourier transforming the QPI profile, on the two-dimensional surface. We use the $T$-matrix formalism to numerically (anal…

    Submitted 28 March, 2024; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 19 pages, 6 figures

    Journal ref: Phys. Rev. B 109, 054201 (2024)

  23. arXiv:2304.09048  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG cs.SE

    CodeKGC: Code Language Model for Generative Knowledge Graph Construction

    Authors: Zhen Bi, Jing Chen, Yinuo Jiang, Feiyu Xiong, Wei Guo, Huajun Chen, Ningyu Zhang

    Abstract: Current generative knowledge graph construction approaches usually fail to capture structural knowledge by simply flattening natural language into serialized texts or a specification language. However, large generative language models trained on structured data such as code have demonstrated impressive capability in understanding natural language for structural prediction and reasoning tasks. Intuit…

    Submitted 18 January, 2024; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: ACM Transactions on Asian and Low-Resource Language Information Processing

  24. arXiv:2304.07423  [pdf, other]

    cond-mat.quant-gas physics.atom-ph

    Instability and Momentum Bifurcation of molecular BEC in Exotic Dispersion with Shaken Lattice

    Authors: Kaiyue Wang, Feng Xiong, Yun Long, Yun Ma, Colin V. Parker

    Abstract: We place a molecular Bose-Einstein condensate in a 1D shaken lattice with a Floquet-engineered dispersion, and observe the dynamics in both position and momentum space. At the initial condition of zero momentum, our engineered dispersion is inverted, and therefore unstable. We observe that the condensate is destabilized by the lattice shaking as expected, but rather than decaying incoherently or p…

    Submitted 24 August, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

  25. arXiv:2304.06925   

    cs.CV cs.AI

    YOLO-Drone:Airborne real-time detection of dense small objects from high-altitude perspective

    Authors: Li Zhu, Jiahui Xiong, Feng Xiong, Hanzheng Hu, Zhengnan Jiang

    Abstract: Unmanned Aerial Vehicles (UAVs), specifically drones equipped with remote sensing object detection technology, have rapidly gained a broad spectrum of applications and emerged as one of the primary research focuses in the field of computer vision. Although UAV remote sensing systems have the ability to detect various objects, small-scale objects can be challenging to detect reliably due to factors…

    Submitted 10 October, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: Some contributing authors are not signed

  26. arXiv:2303.02959  [pdf, other]

    cs.CV cs.MM eess.IV

    Butterfly: Multiple Reference Frames Feature Propagation Mechanism for Neural Video Compression

    Authors: Feng Wang, Haihang Ruan, Fei Xiong, Jiayu Yang, Litian Li, Ronggang Wang

    Abstract: Using more reference frames can significantly improve the compression efficiency in neural video compression. However, in low-latency scenarios, most existing neural video compression frameworks use only the previous frame as reference, and the few frameworks that use multiple previous frames as reference adopt only a simple multi-reference frames propagation mechanism. In this paper, we…

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted by DCC 2023

  27. arXiv:2211.07504  [pdf, other]

    cs.CL cs.AI cs.CV cs.IR cs.LG

    On Analyzing the Role of Image for Visual-enhanced Relation Extraction

    Authors: Lei Li, Xiang Chen, Shuofei Qiao, Feiyu Xiong, Huajun Chen, Ningyu Zhang

    Abstract: Multimodal relation extraction is an essential task for knowledge graph construction. In this paper, we conduct an in-depth empirical analysis, which indicates that inaccurate information in the visual scene graph leads to poor modal alignment weights, further degrading performance. Moreover, the visual shuffle experiments illustrate that the current approaches may not take full advantage of visual info…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: Accepted by AAAI 2023 (Student Abstract)

  28. arXiv:2210.08142  [pdf]

    physics.optics physics.app-ph

    Time-resolved temperature mapping leveraging the strong thermo-optic effect in phase-change devices

    Authors: Nicholas A. Nobile, John R. Erickson, Carlos Ríos, Yifei Zhang, Juejun Hu, Steven A. Vitale, Feng Xiong, Nathan Youngblood

    Abstract: Optical phase-change materials are highly promising for emerging applications such as tunable metasurfaces, reconfigurable photonic circuits, and non-von Neumann computing. However, these materials typically require both high melting temperatures and fast quenching rates to reversibly switch between their crystalline and amorphous phases, a significant challenge for large-scale integration. Here,…

    Submitted 14 October, 2022; originally announced October 2022.

  29. arXiv:2209.15214  [pdf, other]

    cs.AI cs.CL cs.IR cs.LG

    Construction and Applications of Billion-Scale Pre-Trained Multimodal Business Knowledge Graph

    Authors: Shumin Deng, Chengming Wang, Zhoubo Li, Ningyu Zhang, Zelin Dai, Hehong Chen, Feiyu Xiong, Ming Yan, Qiang Chen, Mosha Chen, Jiaoyan Chen, Jeff Z. Pan, Bryan Hooi, Huajun Chen

    Abstract: Business Knowledge Graphs (KGs) are important to many enterprises today, providing factual knowledge and structured data that steer many products and make them more intelligent. Despite their promising benefits, building business KG necessitates solving prohibitive issues of deficient structure and multiple modalities. In this paper, we advance the understanding of the practical challenges related…

    Submitted 19 March, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: OpenBG. Accepted by ICDE 2023. The project is released at https://github.com/OpenBGBenchmark/OpenBG . Website: https://kg.alibaba.com/ , Leaderboard: https://tianchi.aliyun.com/dataset/dataDetail?dataId=122271

  30. arXiv:2207.07790  [pdf, other]

    cs.LG cs.IR

    BCRLSP: An Offline Reinforcement Learning Framework for Sequential Targeted Promotion

    Authors: Fanglin Chen, Xiao Liu, Bo Tang, Feiyu Xiong, Serim Hwang, Guomian Zhuang

    Abstract: We utilize an offline reinforcement learning (RL) model for sequential targeted promotion in the presence of budget constraints in a real-world business environment. In our application, the mobile app aims to boost customer retention by sending cash bonuses to customers and control the costs of such cash bonuses during each time period. To achieve the multi-task goal, we propose the Budget Constra…

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: 8 pages, DRL4IR@SIGIR

  31. arXiv:2206.03864  [pdf, other]

    physics.flu-dyn math.NA

    Discontinuity Computing using Physics-Informed Neural Network

    Authors: Li Liu, Shengping Liu, Hui Xie, Fansheng Xiong, Tengchao Yu, Mengjuan Xiao, Lufeng Liu, Heng Yong

    Abstract: Simulating discontinuities is a long-standing problem, especially for shock waves with strongly nonlinear features. Despite being a promising method, the recently developed physics-informed neural network (PINN) is still weak for calculating discontinuities compared with traditional shock-capturing methods. In this paper, we intend to improve the shock-capturing ability of the PINN. The primary strate…

    Submitted 6 August, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

  32. arXiv:2206.03739  [pdf, other]

    cs.AI cs.CV cs.LG

    Disentangled Ontology Embedding for Zero-shot Learning

    Authors: Yuxia Geng, Jiaoyan Chen, Wen Zhang, Yajing Xu, Zhuo Chen, Jeff Z. Pan, Yufeng Huang, Feiyu Xiong, Huajun Chen

    Abstract: Knowledge Graph (KG) and its variant of ontology have been widely used for knowledge representation, and have been shown to be quite effective in augmenting Zero-shot Learning (ZSL). However, existing ZSL methods that utilize KGs all neglect the intrinsic complexity of inter-class relationships represented in KGs. One typical feature is that a class is often related to other classes in different semant…

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted by KDD'22

  33. arXiv:2205.10852  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Relphormer: Relational Graph Transformer for Knowledge Graph Representations

    Authors: Zhen Bi, Siyuan Cheng, Jing Chen, Xiaozhuan Liang, Feiyu Xiong, Ningyu Zhang

    Abstract: Transformers have achieved remarkable performance in widespread fields, including natural language processing, computer vision and graph mining. However, vanilla Transformer architectures have not yielded promising improvements in the Knowledge Graph (KG) representations, where the translational distance paradigm dominates this area. Note that vanilla Transformer architectures struggle to capture…

    Submitted 21 November, 2023; v1 submitted 22 May, 2022; originally announced May 2022.

    Comments: Neurocomputing 2023

  34. arXiv:2205.10362  [pdf, ps, other]

    cs.LG cs.AI

    FIND:Explainable Framework for Meta-learning

    Authors: Xinyue Shao, Hongzhi Wang, Xiao Zhu, Feng Xiong

    Abstract: Meta-learning is used to efficiently enable the automatic selection of machine learning models by combining data and prior knowledge. Since traditional meta-learning techniques lack explainability and have shortcomings in terms of transparency and fairness, achieving explainability for meta-learning is crucial. This paper proposes FIND, an interpretable meta-learning framework that not only…

    Submitted 12 June, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

  35. arXiv:2205.05889  [pdf, other]

    cs.CL cs.AI cs.DB

    Bridging the Gap between Reality and Ideality of Entity Matching: A Revisiting and Benchmark Re-Construction

    Authors: Tianshu Wang, Hongyu Lin, Cheng Fu, Xianpei Han, Le Sun, Feiyu Xiong, Hui Chen, Minlong Lu, Xiuwen Zhu

    Abstract: Entity matching (EM) is the most critical step for entity resolution (ER). While current deep learning-based methods achieve very impressive performance on standard EM benchmarks, their real-world application performance is much more frustrating. In this paper, we highlight that this gap between reality and ideality stems from the unreasonable benchmark construction process, which is inconsistent wit…

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: Accepted to IJCAI2022

  36. arXiv:2202.12571  [pdf, other]

    cs.LG cs.AI cs.CL

    NeuralKG: An Open Source Library for Diverse Representation Learning of Knowledge Graphs

    Authors: Wen Zhang, Xiangnan Chen, Zhen Yao, Mingyang Chen, Yushan Zhu, Hongtao Yu, Yufeng Huang, Zezhong Xu, Yajing Xu, Ningyu Zhang, Zonggang Yuan, Feiyu Xiong, Huajun Chen

    Abstract: NeuralKG is an open-source Python-based library for diverse representation learning of knowledge graphs. It implements three different series of Knowledge Graph Embedding (KGE) methods, including conventional KGEs, GNN-based KGEs, and Rule-based KGEs. With a unified framework, NeuralKG successfully reproduces link prediction results of these methods on benchmarks, freeing users from the laborious…

    Submitted 25 February, 2022; originally announced February 2022.

    Comments: work in progress

  37. arXiv:2202.08610  [pdf, other]

    cond-mat.mes-hall cond-mat.str-el

    Understanding the three-dimensional quantum Hall effect in generic multi-Weyl semimetals

    Authors: Feng Xiong, Carsten Honerkamp, Dante M. Kennes, Tanay Nag

    Abstract: The quantum Hall effect in three-dimensional Weyl semimetal (WSM) receives significant attention for the emergence of the Fermi loop where the underlying two-dimensional Hall conductivity, namely, sheet Hall conductivity, shows quantized plateaus. Considering the tilted lattice models for multi-Weyl semimetals (mWSMs), we systematically study the Landau levels (LLs) and magneto-Hall conductivity i…

    Submitted 31 July, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: 19 pages and 9 figures

    Journal ref: Phys. Rev. B 106, 045424 (2022)

  38. arXiv:2202.02113  [pdf, other]

    cs.CL cs.AI cs.DB cs.IR cs.LG

    From Discrimination to Generation: Knowledge Graph Completion with Generative Transformer

    Authors: Xin Xie, Ningyu Zhang, Zhoubo Li, Shumin Deng, Hui Chen, Feiyu Xiong, Mosha Chen, Huajun Chen

    Abstract: Knowledge graph completion aims to address the problem of extending a KG with missing triples. In this paper, we present an approach, GenKGC, which converts knowledge graph completion into a sequence-to-sequence generation task with the pre-trained language model. We further introduce relation-guided demonstration and entity-aware hierarchical decoding for better representation learning and fast infere…

    Submitted 14 March, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: Accepted by WWW 2022 Poster

  39. Building time-surfaces by exploiting the complex volatility of an ECRAM memristor

    Authors: Marco Rasetto, Qingzhou Wan, Himanshu Akolkar, Feng Xiong, Bertram Shi, Ryad Benosman

    Abstract: Memristors have emerged as a promising technology for efficient neuromorphic architectures owing to their ability to act as programmable synapses, combining processing and memory into a single device. Although they are most commonly used for static encoding of synaptic weights, recent work has begun to investigate the use of their dynamical properties, such as Short Term Plasticity (STP), to integ…

    Submitted 15 April, 2024; v1 submitted 29 January, 2022; originally announced January 2022.

  40. arXiv:2201.11332  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Ontology-enhanced Prompt-tuning for Few-shot Learning

    Authors: Hongbin Ye, Ningyu Zhang, Shumin Deng, Xiang Chen, Hui Chen, Feiyu Xiong, Xi Chen, Huajun Chen

    Abstract: Few-shot Learning (FSL) aims to make predictions based on a limited number of samples. Structured data such as knowledge graphs and ontology libraries has been leveraged to benefit the few-shot setting in various tasks. However, the priors adopted by the existing methods suffer from the challenges of missing knowledge, knowledge noise, and knowledge heterogeneity, which hinder the performance for fe…

    Submitted 27 January, 2022; originally announced January 2022.

    Comments: Accepted by WWW2022

  41. arXiv:2201.06206  [pdf, other]

    cs.CL cs.SI

    SQUIRE: A Sequence-to-sequence Framework for Multi-hop Knowledge Graph Reasoning

    Authors: Yushi Bai, Xin Lv, Juanzi Li, Lei Hou, Yincen Qu, Zelin Dai, Feiyu Xiong

    Abstract: Multi-hop knowledge graph (KG) reasoning has been widely studied in recent years to provide interpretable predictions on missing links with evidential paths. Most previous works use reinforcement learning (RL) based methods that learn to navigate the path towards the target entity. However, these methods suffer from slow and poor convergence, and they may fail to infer a certain path when there is…

    Submitted 31 October, 2022; v1 submitted 16 January, 2022; originally announced January 2022.

    Comments: EMNLP 2022. Code is available at https://github.com/bys0318/SQUIRE

  42. arXiv:2201.03335  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

    Authors: Ningyu Zhang, Xin Xu, Liankuan Tao, Haiyang Yu, Hongbin Ye, Shuofei Qiao, Xin Xie, Xiang Chen, Zhoubo Li, Lei Li, Xiaozhuan Liang, Yunzhi Yao, Shumin Deng, Peng Wang, Wen Zhang, Zhenru Zhang, Chuanqi Tan, Qiang Chen, Feiyu Xiong, Fei Huang, Guozhou Zheng, Huajun Chen

    Abstract: We present an open-source and extensible knowledge extraction toolkit, DeepKE, supporting complicated low-resource, document-level and multimodal scenarios in knowledge base population. DeepKE implements various information extraction tasks, including named entity recognition, relation extraction and attribute extraction. With a unified framework, DeepKE allows developers and researchers to cus…

    Submitted 18 September, 2023; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: Accepted by EMNLP 2022 System Demonstrations; the project website is http://deepke.zjukg.cn/

  43. arXiv:2112.08589  [pdf, other]

    cs.AI

    Knowledge Graph Embedding in E-commerce Applications: Attentive Reasoning, Explanations, and Transferable Rules

    Authors: Wen Zhang, Shumin Deng, Mingyang Chen, Liang Wang, Qiang Chen, Feiyu Xiong, Xiangwen Liu, Huajun Chen

    Abstract: Knowledge Graphs (KGs), representing facts as triples, have been widely adopted in many applications. Reasoning tasks such as link prediction and rule induction are important for the development of KGs. Knowledge Graph Embeddings (KGEs), which embed entities and relations of a KG into continuous vector spaces, have been proposed for these reasoning tasks and proven to be efficient and robust. But the…

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: Accepted at IJCKG2021

  44. arXiv:2108.03989  [pdf, other]

    cs.AI

    Spatial-Temporal Deep Intention Destination Networks for Online Travel Planning

    Authors: Yu Li, Fei Xiong, Ziyi Wang, Zulong Chen, Chuanfei Xu, Yuyu Yin, Li Zhou

    Abstract: Nowadays, artificial neural networks are widely used for users' online travel planning. Personalized travel planning has many real applications and is affected by various factors, such as transportation type, intention destination estimation, budget limit and crowdedness prediction. Among those factors, users' intention destination prediction is an essential task in online travel platforms. The reas…

    Submitted 9 August, 2021; originally announced August 2021.

  45. Anomaly Detection in Dynamic Graphs via Transformer

    Authors: Yixin Liu, Shirui Pan, Yu Guang Wang, Fei Xiong, Liang Wang, Qingfeng Chen, Vincent CS Lee

    Abstract: Detecting anomalies for dynamic graphs has drawn increasing attention due to their wide applications in social networks, e-commerce, and cybersecurity. Recent deep learning-based approaches have shown promising results over shallow methods. However, they fail to address two core challenges of anomaly detection in dynamic graphs: the lack of informative encoding for unattributed nodes and the diffi…

    Submitted 27 October, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: 13 pages, 5 figures

  46. arXiv:2106.03446  [pdf, other]

    quant-ph

    Controlling the dynamics of open quantum systems with periodic driving field

    Authors: Fei-Lei Xiong, Wei-Min Zhang

    Abstract: In this paper, we extend the study of the exact dynamics of open quantum systems to the case with a periodic driving field. It is shown that, unlike the static adjustment of the system on-site energy, which can either generate or destroy the dissipationless localized bound states, periodic driving can either preserve the existing localized bound states or destroy some of them but cannot generate new local…

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: 7 pages, 5 figures

  47. arXiv:2105.13555  [pdf, other]

    quant-ph physics.optics

    Lens-free Optical Detection of Thermal Motion of a Sub-millimeter Sphere Diamagnetically Levitated in High Vacuum

    Authors: Fang Xiong, Peiran Yin, Tong Wu, Han Xie, Rui Li, Yingchun Leng, Yanan Li, Changkui Duan, Xi Kong, Pu Huang, Jiangfeng Du

    Abstract: Levitated oscillators with millimeter or sub-millimeter size are particularly attractive due to their potential role in studying various fundamental problems and practical applications. One of the crucial issues towards these goals is achieving efficient measurement of oscillator motion, which remains a challenge. Here we theoretically propose a lens-free optical detection scheme, which can…

    Submitted 27 May, 2021; originally announced May 2021.

    Comments: Physical Review Applied (to be published)

    Journal ref: Phys. Rev. Applied 16, 011003 (2021)

  48. arXiv:2105.05473  [pdf, other]

    cs.LG cs.AI

    Interpretable performance analysis towards offline reinforcement learning: A dataset perspective

    Authors: Chenyang Xi, Bo Tang, Jiajun Shen, Xinfu Liu, Feiyu Xiong, Xueying Li

    Abstract: Offline reinforcement learning (RL) has increasingly become a focus of artificial intelligence research due to its wide real-world applications where the collection of data may be difficult, time-consuming, or costly. In this paper, we first propose a two-fold taxonomy for existing offline RL algorithms from the perspective of exploration and exploitation tendency. Secondly, we derive the exp…

    Submitted 12 May, 2021; originally announced May 2021.

  49. Spin susceptibilities in magnetic type-I and type-II Weyl semimetals

    Authors: Feng Xiong, Xingjie Han, Carsten Honerkamp

    Abstract: We investigate interacting spin susceptibilities in lattice models for $\mathcal{T}$-reversal symmetry-broken Weyl semimetals. We employ a random phase approximation (RPA) method for the spin-SU(2)-symmetry-broken case that includes mixtures of ladder and bubble diagrams, beyond a SU(2)-symmetric case. Within this approach, the relations between the tendency towards magnetic order and the band str…

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: 10 pages, 8 figures

    Journal ref: Phys. Rev. B 104, 115151 (2021)

  50. arXiv:2102.10586  [pdf, other]

    cond-mat.mes-hall quant-ph

    Generating Majorana qubit coherence in Majorana Aharonov-Bohm interferometer

    Authors: Fei-Lei Xiong, Hon-Lam Lai, Wei-Min Zhang

    Abstract: We propose an Aharonov-Bohm interferometer consisting of two topological superconducting chains (TSCs) to generate coherence of Majorana qubits, where each qubit is made of two Majorana zero modes (MZMs) with definite fermion parity. We obtain the generalized exact master equation as well as its solution and study the real-time dynamics of the MZM qubit states under various operations. We demonstrate…

    Submitted 21 February, 2021; originally announced February 2021.

    Comments: 8 pages, 5 figures