Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 72 results for author: Ji, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.04801  [pdf, other

    cs.CL cs.AI

    Revisiting Structured Sentiment Analysis as Latent Dependency Graph Parsing

    Authors: Chengjie Zhou, Bobo Li, Hao Fei, Fei Li, Chong Teng, Donghong Ji

    Abstract: Structured Sentiment Analysis (SSA) was cast as a problem of bi-lexical dependency graph parsing by prior studies. Multiple formulations have been proposed to construct the graph, which share several intrinsic drawbacks: (1) The internal structures of spans are neglected, thus only the boundary tokens of spans are used for relation prediction and span recognition, thus hindering the model's expres… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2407.01530  [pdf, other

    eess.IV cs.CV

    xLSTM-UNet can be an Effective 2D & 3D Medical Image Segmentation Backbone with Vision-LSTM (ViL) better than its Mamba Counterpart

    Authors: Tianrun Chen, Chaotao Ding, Lanyun Zhu, Tao Xu, Deyi Ji, Yan Wang, Ying Zang, Zejian Li

    Abstract: Convolutional Neural Networks (CNNs) and Vision Transformers (ViT) have been pivotal in biomedical image segmentation, yet their ability to manage long-range dependencies remains constrained by inherent locality and computational overhead. To overcome these challenges, in this technical report, we first propose xLSTM-UNet, a UNet structured deep learning neural network that leverages Vision-LSTM (… ▽ More

    Submitted 2 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2406.19632  [pdf, other

    cs.CV

    PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation

    Authors: Deyi Ji, Wenwei Jin, Hongtao Lu, Feng Zhao

    Abstract: The ascension of Unmanned Aerial Vehicles (UAVs) in various fields necessitates effective UAV image segmentation, which faces challenges due to the dynamic perspectives of UAV-captured images. Traditional segmentation algorithms falter as they cannot accurately mimic the complexity of UAV perspectives, and the cost of obtaining multi-perspective labeled datasets is prohibitive. To address these is… ▽ More

    Submitted 11 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: IJCAI 2024

  4. arXiv:2406.16021  [pdf, other

    cs.CL cs.AI

    Harvesting Events from Multiple Sources: Towards a Cross-Document Event Extraction Paradigm

    Authors: Qiang Gao, Zixiang Meng, Bobo Li, Jun Zhou, Fei Li, Chong Teng, Donghong Ji

    Abstract: Document-level event extraction aims to extract structured event information from unstructured text. However, a single document often contains limited event information and the roles of different event arguments may be biased due to the influence of the information source. This paper addresses the limitations of traditional document-level event extraction by proposing the task of cross-document ev… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: ACL2024(Findings)

  5. arXiv:2406.15990  [pdf, other

    cs.CL cs.AI

    Enhancing Cross-Document Event Coreference Resolution by Discourse Structure and Semantic Information

    Authors: Qiang Gao, Bobo Li, Zixiang Meng, Yunlong Li, Jun Zhou, Fei Li, Chong Teng, Donghong Ji

    Abstract: Existing cross-document event coreference resolution models, which either compute mention similarity directly or enhance mention representation by extracting event arguments (such as location, time, agent, and patient), lacking the ability to utilize document-level information. As a result, they struggle to capture long-distance dependencies. This shortcoming leads to their underwhelming performan… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Report number: https://aclanthology.org/2024.lrec-main.523/

    Journal ref: LREC|COLING,Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation,2024,5907-5921

  6. arXiv:2406.10475  [pdf, other

    cs.CV

    Discrete Latent Perspective Learning for Segmentation and Detection

    Authors: Deyi Ji, Feng Zhao, Lanyun Zhu, Wenwei Jin, Hongtao Lu, Jieping Ye

    Abstract: In this paper, we address the challenge of Perspective-Invariant Learning in machine learning and computer vision, which involves enabling a network to understand images from varying perspectives to achieve consistent semantic interpretation. While standard approaches rely on the labor-intensive collection of multi-view images or limited data augmentation techniques, we propose a novel framework,… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: ICML 2024 Spotlight

  7. arXiv:2405.19326  [pdf, other

    cs.CV cs.GR cs.HC

    Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models

    Authors: Tianrun Chen, Chunan Yu, Jing Li, Jianqi Zhang, Lanyun Zhu, Deyi Ji, Yong Zhang, Ying Zang, Zejian Li, Lingyun Sun

    Abstract: In this paper, we introduce a new task: Zero-Shot 3D Reasoning Segmentation for parts searching and localization for objects, which is a new paradigm to 3D segmentation that transcends limitations for previous category-specific 3D semantic segmentation, 3D instance segmentation, and open-vocabulary 3D segmentation. We design a simple baseline method, Reasoning3D, with the capability to understand… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  8. arXiv:2405.08816  [pdf, other

    cs.CV cs.RO

    The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition

    Authors: Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Yaru Niu, Wei Tsang Ooi, Benoit R. Cottereau, Lai Xing Ng, Yuexin Ma, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Weichao Qiu, Wei Zhang, Xu Cao, Hao Lu, Ying-Cong Chen, Caixin Kang, Xinning Zhou, Chengyang Ying, Wentao Shang, Xingxing Wei, Yinpeng Dong, Bo Yang, Shengyin Jiang , et al. (66 additional authors not shown)

    Abstract: In the realm of autonomous driving, robust perception under out-of-distribution conditions is paramount for the safe deployment of vehicles. Challenges such as adverse weather, sensor malfunctions, and environmental unpredictability can severely impact the performance of autonomous systems. The 2024 RoboDrive Challenge was crafted to propel the development of driving perception technologies that c… ▽ More

    Submitted 29 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: ICRA 2024; 32 pages, 24 figures, 5 tables; Code at https://robodrive-24.github.io/

  9. arXiv:2405.04434  [pdf, other

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding , et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference… ▽ More

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  10. arXiv:2404.14728  [pdf

    cs.LG cs.CY

    Novel Topological Machine Learning Methodology for Stream-of-Quality Modeling in Smart Manufacturing

    Authors: Jay Lee, Dai-Yan Ji, Yuan-Ming Hsu

    Abstract: This paper presents a topological analytics approach within the 5-level Cyber-Physical Systems (CPS) architecture for the Stream-of-Quality assessment in smart manufacturing. The proposed methodology not only enables real-time quality monitoring and predictive analytics but also discovers the hidden relationships between quality features and process parameters across different manufacturing proces… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: The paper has been submitted to Manufacturing Letters (Under Review)

  11. arXiv:2403.15776  [pdf, other

    cs.CL cs.AI

    Modeling Unified Semantic Discourse Structure for High-quality Headline Generation

    Authors: Minghui Xu, Hao Fei, Fei Li, Shengqiong Wu, Rui Sun, Chong Teng, Donghong Ji

    Abstract: Headline generation aims to summarize a long document with a short, catchy title that reflects the main idea. This requires accurately capturing the core document semantics, which is challenging due to the lengthy and background information-rich na ture of the texts. In this work, We propose using a unified semantic discourse structure (S3) to represent document semantics, achieved by combining do… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  12. arXiv:2403.10830  [pdf, other

    cs.CV

    View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV

    Authors: Deyi Ji, Siqi Gao, Lanyun Zhu, Qi Zhu, Yiru Zhao, Peng Xu, Hongtao Lu, Feng Zhao, Jieping Ye

    Abstract: In this paper, we address the challenge of multi-object tracking (MOT) in moving Unmanned Aerial Vehicle (UAV) scenarios, where irregular flight trajectories, such as hovering, turning left/right, and moving up/down, lead to significantly greater complexity compared to fixed-camera MOT. Specifically, changes in the scene background not only render traditional frame-to-frame object IOU association… ▽ More

    Submitted 14 May, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

  13. arXiv:2403.03721  [pdf, other

    cs.CV

    CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object Detection

    Authors: Gyusam Chang, Wonseok Roh, Sujin Jang, Dongwook Lee, Daehyun Ji, Gyeongrok Oh, Jinsun Park, Jinkyu Kim, Sangpil Kim

    Abstract: Recent LiDAR-based 3D Object Detection (3DOD) methods show promising results, but they often do not generalize well to target domains outside the source (or training) data distribution. To reduce such domain gaps and thus to make 3DOD models more generalizable, we introduce a novel unsupervised domain adaptation (UDA) method, called CMDA, which (i) leverages visual semantic cues from an image moda… ▽ More

    Submitted 6 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted by AAAI 2024

  14. arXiv:2402.18476  [pdf, other

    cs.CV

    IBD: Alleviating Hallucinations in Large Vision-Language Models via Image-Biased Decoding

    Authors: Lanyun Zhu, Deyi Ji, Tianrun Chen, Peng Xu, Jieping Ye, Jun Liu

    Abstract: Despite achieving rapid developments and with widespread applications, Large Vision-Language Models (LVLMs) confront a serious challenge of being prone to generating hallucinations. An over-reliance on linguistic priors has been identified as a key factor leading to these hallucinations. In this paper, we propose to alleviate this problem by introducing a novel image-biased decoding (IBD) techniqu… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  15. arXiv:2402.13693  [pdf, other

    cs.CL

    CMNER: A Chinese Multimodal NER Dataset based on Social Media

    Authors: Yuanze Ji, Bobo Li, Jun Zhou, Fei Li, Chong Teng, Donghong Ji

    Abstract: Multimodal Named Entity Recognition (MNER) is a pivotal task designed to extract named entities from text with the support of pertinent images. Nonetheless, a notable paucity of data for Chinese MNER has considerably impeded the progress of this natural language processing task within the Chinese domain. Consequently, in this study, we compile a Chinese Multimodal NER dataset (CMNER) utilizing dat… ▽ More

    Submitted 1 March, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  16. arXiv:2402.07218  [pdf, other

    cs.RO

    Sensor Misalignment-tolerant AUV Navigation with Passive DoA and Doppler Measurements

    Authors: Bingbing Zhang, Shuo Liu, Shanmin Zhou, Daxiong Ji, Tao Wang, Tian Xia, Wen Xu

    Abstract: We present a sensor misalignment-tolerant AUV navigation method that leverages measurements from an acoustic array and dead reckoned information. Recent studies have demonstrated the potential use of passive acoustic Direction of Arrival (DoA) measurements for AUV navigation without requiring ranging measurements. However, the sensor misalignment between the acoustic array and the attitude sensor… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  17. arXiv:2312.17428  [pdf, other

    cs.CV

    ChangeNet: Multi-Temporal Asymmetric Change Detection Dataset

    Authors: Deyi Ji, Siqi Gao, Mingyuan Tao, Hongtao Lu, Feng Zhao

    Abstract: Change Detection (CD) has been attracting extensive interests with the availability of bi-temporal datasets. However, due to the huge cost of multi-temporal images acquisition and labeling, existing change detection datasets are small in quantity, short in temporal, and low in practicability. Therefore, a large-scale practical-oriented dataset covering wide temporal phases is urgently needed to fa… ▽ More

    Submitted 11 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: Accepted to ICASSP 2024 Oral/Lecture

  18. arXiv:2312.15291  [pdf, other

    cs.CL

    Reverse Multi-Choice Dialogue Commonsense Inference with Graph-of-Thought

    Authors: Li Zheng, Hao Fei, Fei Li, Bobo Li, Lizi Liao, Donghong Ji, Chong Teng

    Abstract: With the proliferation of dialogic data across the Internet, the Dialogue Commonsense Multi-choice Question Answering (DC-MCQ) task has emerged as a response to the challenge of comprehending user queries and intentions. Although prevailing methodologies exhibit effectiveness in addressing single-choice questions, they encounter difficulties in handling multi-choice queries due to the heightened i… ▽ More

    Submitted 26 December, 2023; v1 submitted 23 December, 2023; originally announced December 2023.

    Comments: This paper has been accepted by the 38th Annual AAAI Conference on Artificial Intelligence (AAAI'24, FEBRUARY 20-27, 2024, VANCOUVER, CANADA)

  19. arXiv:2312.11276  [pdf, other

    cs.CL

    Compositional Generalization for Multi-label Text Classification: A Data-Augmentation Approach

    Authors: Yuyang Chai, Zhuang Li, Jiahui Liu, Lei Chen, Fei Li, Donghong Ji, Chong Teng

    Abstract: Despite significant advancements in multi-label text classification, the ability of existing models to generalize to novel and seldom-encountered complex concepts, which are compositions of elementary ones, remains underexplored. This research addresses this gap. By creating unique data splits across three benchmarks, we assess the compositional generalization ability of existing multi-label text… ▽ More

    Submitted 20 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI'24

  20. arXiv:2311.16926  [pdf, other

    cs.CV

    LLaFS: When Large Language Models Meet Few-Shot Segmentation

    Authors: Lanyun Zhu, Tianrun Chen, Deyi Ji, Jieping Ye, Jun Liu

    Abstract: This paper proposes LLaFS, the first attempt to leverage large language models (LLMs) in few-shot segmentation. In contrast to the conventional few-shot segmentation methods that only rely on the limited and biased information from the annotated support images, LLaFS leverages the vast prior knowledge gained by LLM as an effective supplement and directly uses the LLM to segment images in a few-sho… ▽ More

    Submitted 3 April, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted to CVPR2024

  21. arXiv:2310.02031  [pdf, other

    cs.CL cs.AI cs.CE cs.LG cs.RO

    OceanGPT: A Large Language Model for Ocean Science Tasks

    Authors: Zhen Bi, Ningyu Zhang, Yida Xue, Yixin Ou, Daxiong Ji, Guozhou Zheng, Huajun Chen

    Abstract: Ocean science, which delves into the oceans that are reservoirs of life and biodiversity, is of great significance given that oceans cover over 70% of our planet's surface. Recently, advances in Large Language Models (LLMs) have transformed the paradigm in science. Despite the success in other domains, current LLMs often fall short in catering to the needs of domain experts like oceanographers, an… ▽ More

    Submitted 23 May, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ACL2024. Project Website: https://oceangpt.zjukg.cn/

  22. arXiv:2308.04502  [pdf, other

    cs.CL

    Revisiting Disentanglement and Fusion on Modality and Context in Conversational Multimodal Emotion Recognition

    Authors: Bobo Li, Hao Fei, Lizi Liao, Yu Zhao, Chong Teng, Tat-Seng Chua, Donghong Ji, Fei Li

    Abstract: It has been a hot research topic to enable machines to understand human emotions in multimodal contexts under dialogue scenarios, which is tasked with multimodal emotion analysis in conversation (MM-ERC). MM-ERC has received consistent attention in recent years, where a diverse range of methods has been proposed for securing better task performance. Most existing works treat MM-ERC as a standard m… ▽ More

    Submitted 12 August, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted by ACM MM 2023

  23. arXiv:2308.04498  [pdf, other

    cs.CL

    DialogRE^C+: An Extension of DialogRE to Investigate How Much Coreference Helps Relation Extraction in Dialogs

    Authors: Yiyun Xiong, Mengwei Dai, Fei Li, Hao Fei, Bobo Li, Shengqiong Wu, Donghong Ji, Chong Teng

    Abstract: Dialogue relation extraction (DRE) that identifies the relations between argument pairs in dialogue text, suffers much from the frequent occurrence of personal pronouns, or entity and speaker coreference. This work introduces a new benchmark dataset DialogRE^C+, introducing coreference resolution into the DRE scenario. With the aid of high-quality coreference knowledge, the reasoning of argument r… ▽ More

    Submitted 12 August, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted by NLPCC 2023

  24. arXiv:2308.04424  [pdf, other

    cs.CL

    A Bi-directional Multi-hop Inference Model for Joint Dialog Sentiment Classification and Act Recognition

    Authors: Li Zheng, Fei Li, Yuyang Chai, Chong Teng, Donghong Ji

    Abstract: The joint task of Dialog Sentiment Classification (DSC) and Act Recognition (DAR) aims to predict the sentiment label and act label for each utterance in a dialog simultaneously. However, current methods encode the dialog context in only one direction, which limits their ability to thoroughly comprehend the context. Moreover, these methods overlook the explicit correlations between sentiment and a… ▽ More

    Submitted 12 August, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted by NLPCC 2023

  25. arXiv:2307.00711  [pdf, other

    cs.CV

    Guided Patch-Grouping Wavelet Transformer with Spatial Congruence for Ultra-High Resolution Segmentation

    Authors: Deyi Ji, Feng Zhao, Hongtao Lu

    Abstract: Most existing ultra-high resolution (UHR) segmentation methods always struggle in the dilemma of balancing memory cost and local characterization accuracy, which are both taken into account in our proposed Guided Patch-Grouping Wavelet Transformer (GPWFormer) that achieves impressive performances. In this work, GPWFormer is a Transformer ($\mathcal{T}$)-CNN ($\mathcal{C}$) mutual leaning framework… ▽ More

    Submitted 5 July, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

    Comments: Accepted to IJCAI 2023

  26. arXiv:2306.03975  [pdf, other

    cs.CL

    Revisiting Conversation Discourse for Dialogue Disentanglement

    Authors: Bobo Li, Hao Fei, Fei Li, Shengqiong Wu, Lizi Liao, Yinwei Wei, Tat-Seng Chua, Donghong Ji

    Abstract: Dialogue disentanglement aims to detach the chronologically ordered utterances into several independent sessions. Conversation utterances are essentially organized and described by the underlying discourse, and thus dialogue disentanglement requires the full understanding and harnessing of the intrinsic discourse attribute. In this paper, we propose enhancing dialogue disentanglement by taking ful… ▽ More

    Submitted 10 June, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: under review

  27. arXiv:2306.03974  [pdf, other

    cs.CL

    TKDP: Threefold Knowledge-enriched Deep Prompt Tuning for Few-shot Named Entity Recognition

    Authors: Jiang Liu, Hao Fei, Fei Li, Jingye Li, Bobo Li, Liang Zhao, Chong Teng, Donghong Ji

    Abstract: Few-shot named entity recognition (NER) exploits limited annotated instances to identify named mentions. Effectively transferring the internal or external resources thus becomes the key to few-shot NER. While the existing prompt tuning methods have shown remarkable few-shot performances, they still fail to make full use of knowledge. In this work, we investigate the integration of rich knowledge t… ▽ More

    Submitted 10 June, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: under review

  28. arXiv:2306.03969  [pdf, other

    cs.CL

    ECQED: Emotion-Cause Quadruple Extraction in Dialogs

    Authors: Li Zheng, Donghong Ji, Fei Li, Hao Fei, Shengqiong Wu, Jingye Li, Bobo Li, Chong Teng

    Abstract: The existing emotion-cause pair extraction (ECPE) task, unfortunately, ignores extracting the emotion type and cause type, while these fine-grained meta-information can be practically useful in real-world applications, i.e., chat robots and empathic dialog generation. Also the current ECPE is limited to the scenario of single text piece, while neglecting the studies at dialog level that should hav… ▽ More

    Submitted 10 June, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: under review

  29. arXiv:2305.17497  [pdf, other

    cs.CL

    FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing

    Authors: Zhuang Li, Yuyang Chai, Terry Yue Zhuo, Lizhen Qu, Gholamreza Haffari, Fei Li, Donghong Ji, Quan Hung Tran

    Abstract: Textual scene graph parsing has become increasingly important in various vision-language applications, including image caption evaluation and image retrieval. However, existing scene graph parsers that convert image captions into scene graphs often suffer from two types of errors. First, the generated scene graphs fail to capture the true semantics of the captions or the corresponding images, resu… ▽ More

    Submitted 1 June, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: 9 pages, ACL 2023 (findings)

  30. arXiv:2305.10899  [pdf, other

    cs.CV

    Ultra-High Resolution Segmentation with Ultra-Rich Context: A Novel Benchmark

    Authors: Deyi Ji, Feng Zhao, Hongtao Lu, Mingyuan Tao, Jieping Ye

    Abstract: With the increasing interest and rapid development of methods for Ultra-High Resolution (UHR) segmentation, a large-scale benchmark covering a wide range of scenes with full fine-grained dense annotations is urgently needed to facilitate the field. To this end, the URUR dataset is introduced, in the meaning of Ultra-High Resolution dataset with Ultra-Rich Context. As the name suggests, URUR contai… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted to CVPR 2023

  31. arXiv:2305.03944  [pdf, other

    cs.CV

    Structural and Statistical Texture Knowledge Distillation for Semantic Segmentation

    Authors: Deyi Ji, Haoran Wang, Mingyuan Tao, Jianqiang Huang, Xian-Sheng Hua, Hongtao Lu

    Abstract: Existing knowledge distillation works for semantic segmentation mainly focus on transferring high-level contextual knowledge from teacher to student. However, low-level texture knowledge is also of vital importance for characterizing the local structural pattern and global statistical property, such as boundary, smoothness, regularity and color contrast, which may not be well addressed by high-lev… ▽ More

    Submitted 5 July, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: Accepted to CVPR 2022

  32. On the Robustness of Aspect-based Sentiment Analysis: Rethinking Model, Data, and Training

    Authors: Hao Fei, Tat-Seng Chua, Chenliang Li, Donghong Ji, Meishan Zhang, Yafeng Ren

    Abstract: Aspect-based sentiment analysis (ABSA) aims at automatically inferring the specific sentiment polarities toward certain aspects of products or services behind the social media texts or reviews, which has been a fundamental application to the real-world society. Since the early 2010s, ABSA has achieved extraordinarily high accuracy with various deep neural models. However, existing ABSA models with… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: Accepted in ACM Transactions on Information Systems

    Journal ref: [J]. ACM Transactions on Information Systems, 2022, 41(2): 1-32

  33. arXiv:2211.05705  [pdf, other

    cs.CL

    DiaASQ : A Benchmark of Conversational Aspect-based Sentiment Quadruple Analysis

    Authors: Bobo Li, Hao Fei, Fei Li, Yuhan Wu, Jinsong Zhang, Shengqiong Wu, Jingye Li, Yijiang Liu, Lizi Liao, Tat-Seng Chua, Donghong Ji

    Abstract: The rapid development of aspect-based sentiment analysis (ABSA) within recent decades shows great potential for real-world society. The current ABSA works, however, are mostly limited to the scenario of a single text piece, leaving the study in dialogue contexts unexplored. To bridge the gap between fine-grained sentiment analysis and conversational opinion mining, in this work, we introduce a nov… ▽ More

    Submitted 22 May, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: Accepted to Findings of ACL 2023

  34. arXiv:2211.00684  [pdf, other

    cs.CL cs.AI

    TOE: A Grid-Tagging Discontinuous NER Model Enhanced by Embedding Tag/Word Relations and More Fine-Grained Tags

    Authors: Jiang Liu, Donghong Ji, Jingye Li, Dongdong Xie, Chong Teng, Liang Zhao, Fei Li

    Abstract: So far, discontinuous named entity recognition (NER) has received increasing research attention and many related methods have surged such as hypergraph-based methods, span-based methods, and sequence-to-sequence (Seq2Seq) methods, etc. However, these methods more or less suffer from some problems such as decoding ambiguity and efficiency, which limit their performance. Recently, grid-tagging metho… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

  35. arXiv:2210.16541  [pdf, other

    cs.CL

    Entity-centered Cross-document Relation Extraction

    Authors: Fengqi Wang, Fei Li, Hao Fei, Jingye Li, Shengqiong Wu, Fangfang Su, Wenxuan Shi, Donghong Ji, Bo Cai

    Abstract: Relation Extraction (RE) is a fundamental task of information extraction, which has attracted a large amount of research attention. Previous studies focus on extracting the relations within a sentence or document, while currently researchers begin to explore cross-document RE. However, current cross-document RE methods directly utilize text snippets surrounding target entities in multiple given do… ▽ More

    Submitted 29 October, 2022; originally announced October 2022.

    Comments: This paper was accepted by EMNLP 2022 conference

  36. arXiv:2210.07506  [pdf, other

    cs.CV

    Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation

    Authors: Peihao Chen, Dongyu Ji, Kunyang Lin, Runhao Zeng, Thomas H. Li, Mingkui Tan, Chuang Gan

    Abstract: We address a practical yet challenging problem of training robot agents to navigate in an environment following a path described by some language instructions. The instructions often contain descriptions of objects in the environment. To achieve accurate and efficient navigation, it is critical to build a map that accurately represents both spatial location and the semantic information of the envi… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted by NeurIPS 2022

  37. arXiv:2210.07505  [pdf, other

    cs.CV cs.RO

    Learning Active Camera for Multi-Object Navigation

    Authors: Peihao Chen, Dongyu Ji, Kunyang Lin, Weiwen Hu, Wenbing Huang, Thomas H. Li, Mingkui Tan, Chuang Gan

    Abstract: Getting robots to navigate to multiple objects autonomously is essential yet difficult in robot applications. One of the key challenges is how to explore environments efficiently with camera sensors only. Existing navigation methods mainly focus on fixed cameras and few attempts have been made to navigate with active cameras. As a result, the agent may take a very long time to perceive the environ… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted by NeurIPS 2022

  38. arXiv:2210.03037  [pdf, other

    cs.CL cs.AI

    Conversational Semantic Role Labeling with Predicate-Oriented Latent Graph

    Authors: Hao Fei, Shengqiong Wu, Meishan Zhang, Yafeng Ren, Donghong Ji

    Abstract: Conversational semantic role labeling (CSRL) is a newly proposed task that uncovers the shallow semantic structures in a dialogue text. Unfortunately several important characteristics of the CSRL task have been overlooked by the existing works, such as the structural information integration, near-neighbor influence. In this work, we investigate the integration of a latent graph for CSRL. We propos… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

  39. arXiv:2209.04112  [pdf, other

    cs.CL cs.AI

    Joint Alignment of Multi-Task Feature and Label Spaces for Emotion Cause Pair Extraction

    Authors: Shunjie Chen, Xiaochuan Shi, Jingye Li, Shengqiong Wu, Hao Fei, Fei Li, Donghong Ji

    Abstract: Emotion cause pair extraction (ECPE), as one of the derived subtasks of emotion cause analysis (ECA), shares rich inter-related features with emotion extraction (EE) and cause extraction (CE). Therefore EE and CE are frequently utilized as auxiliary tasks for better feature learning, modeled via multi-task learning (MTL) framework by prior works to achieve state-of-the-art (SoTA) ECPE results. How… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: Accepted by Coling 2022

  40. arXiv:2209.02693  [pdf, other

    cs.CL

    OneEE: A One-Stage Framework for Fast Overlapping and Nested Event Extraction

    Authors: Hu Cao, Jingye Li, Fangfang Su, Fei Li, Hao Fei, Shengqiong Wu, Bobo Li, Liang Zhao, Donghong Ji

    Abstract: Event extraction (EE) is an essential task of information extraction, which aims to extract structured event information from unstructured text. Most prior work focuses on extracting flat events while neglecting overlapped or nested ones. A few models for overlapped and nested EE includes several successive stages to extract event triggers and arguments,which suffer from error propagation. Therefo… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

    Comments: Accepted by COLING'22

  41. arXiv:2204.12148  [pdf, other

    cs.SE

    Morest: Model-based RESTful API Testing with Execution Feedback

    Authors: Yi Liu, Yuekang Li, Gelei Deng, Yang Liu, Ruiyuan Wan, Runchao Wu, Dandan Ji, Shiheng Xu, Minli Bao

    Abstract: RESTful APIs are arguably the most popular endpoints for accessing Web services. Blackbox testing is one of the emerging techniques for ensuring the reliability of RESTful APIs. The major challenge in testing RESTful APIs is the need for correct sequences of API operation calls for in-depth testing. To build meaningful operation call sequences, researchers have proposed techniques to learn and uti… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Journal ref: 44th International Conference on Software Engineering (ICSE 2022)

  42. arXiv:2203.10796  [pdf, other

    cs.CL

    Effective Token Graph Modeling using a Novel Labeling Strategy for Structured Sentiment Analysis

    Authors: Wenxuan Shi, Fei Li, Jingye Li, Hao Fei, Donghong Ji

    Abstract: The state-of-the-art model for structured sentiment analysis casts the task as a dependency parsing problem, which has some limitations: (1) The label proportions for span prediction and span relation prediction are imbalanced. (2) The span lengths of sentiment tuple components may be very large in this task, which will further exacerbate the imbalance problem. (3) Two nodes in a dependency graph… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: to appear at the ACL 2022 Main conference

  43. arXiv:2112.10070  [pdf, other

    cs.CL

    Unified Named Entity Recognition as Word-Word Relation Classification

    Authors: Jingye Li, Hao Fei, Jiang Liu, Shengqiong Wu, Meishan Zhang, Chong Teng, Donghong Ji, Fei Li

    Abstract: So far, named entity recognition (NER) has been involved with three major types, including flat, overlapped (aka. nested), and discontinuous NER, which have mostly been studied individually. Recently, a growing interest has been built for unified NER, tackling the above three jobs concurrently with one single model. Current best-performing methods mainly include span-based and sequence-to-sequence… ▽ More

    Submitted 19 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI'22

  44. arXiv:2110.02001  [pdf, other

    cs.CL

    Mastering the Explicit Opinion-role Interaction: Syntax-aided Neural Transition System for Unified Opinion Role Labeling

    Authors: Shengqiong Wu, Hao Fei, Fei Li, Donghong Ji, Meishan Zhang, Yijiang Liu, Chong Teng

    Abstract: Unified opinion role labeling (ORL) aims to detect all possible opinion structures of 'opinion-holder-target' in one shot, given a text. The existing transition-based unified method, unfortunately, is subject to longer opinion terms and fails to solve the term overlap issue. Current top performance has been achieved by employing the span-based graph model, which however still suffers from both hig… ▽ More

    Submitted 13 December, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: AAAI2022

  45. arXiv:2106.14373  [pdf, other

    cs.CL

    A Span-Based Model for Joint Overlapped and Discontinuous Named Entity Recognition

    Authors: Fei Li, Zhichao Lin, Meishan Zhang, Donghong Ji

    Abstract: Research on overlapped and discontinuous named entity recognition (NER) has received increasing attention. The majority of previous work focuses on either overlapped or discontinuous entities. In this paper, we propose a novel span-based model that can recognize both overlapped and discontinuous entities jointly. The model includes two major steps. First, entity fragments are recognized by travers… ▽ More

    Submitted 27 June, 2021; originally announced June 2021.

    Comments: Accepted in the main conference of ACL 2021

  46. arXiv:2105.02520  [pdf, other

    cs.CL

    Learn from Syntax: Improving Pair-wise Aspect and Opinion Terms Extractionwith Rich Syntactic Knowledge

    Authors: Shengqiong Wu, Hao Fei, Yafeng Ren, Donghong Ji, Jingye Li

    Abstract: In this paper, we propose to enhance the pair-wise aspect and opinion terms extraction (PAOTE) task by incorporating rich syntactic knowledge. We first build a syntax fusion encoder for encoding syntactic features, including a label-aware graph convolutional network (LAGCN) for modeling the dependency edges and labels, as well as the POS tags unifiedly, and a local-attention module encoding POS ta… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: IJCAI2021

  47. arXiv:2105.00991  [pdf, ps, other

    cs.IR cs.CL stat.CO

    Context-aware Ensemble of Multifaceted Factorization Models for Recommendation Prediction in Social Networks

    Authors: Yunwen Chen, Zuotao Liu, Daqi Ji, Yingwei Xin, Wenguang Wang, Lu Yao, Yi Zou

    Abstract: This paper describes the solution of Shanda Innovations team to Task 1 of KDD-Cup 2012. A novel approach called Multifaceted Factorization Models is proposed to incorporate a great variety of features in social networks. Social relationships and actions between users are integrated as implicit feedbacks to improve the recommendation accuracy. Keywords, tags, profiles, time and some other features… ▽ More

    Submitted 3 May, 2021; originally announced May 2021.

    Comments: KDD 2012

  48. arXiv:2103.04133  [pdf, other

    cs.CV

    Learning Statistical Texture for Semantic Segmentation

    Authors: Lanyun Zhu, Deyi Ji, Shiping Zhu, Weihao Gan, Wei Wu, Junjie Yan

    Abstract: Existing semantic segmentation works mainly focus on learning the contextual information in high-level semantic features with CNNs. In order to maintain a precise boundary, low-level texture features are directly skip-connected into the deeper layers. Nevertheless, texture features are not only about local structure, but also include global statistical knowledge of the input image. In this paper,… ▽ More

    Submitted 6 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR 2021

  49. Joint Radar and Communication: A Survey

    Authors: Zhiyong Feng, Zixi Fang, Zhiqing Wei, Xu Chen, Zhi Quan, Danna Ji

    Abstract: Joint radar and communication (JRC) technology has become important for civil and military applications for decades. This paper introduces the concepts, characteristics and advantages of JRC technology, presenting the typical applications that have benefited from JRC technology currently and in the future. This paper explores the state-of-the-art of JRC in the levels of coexistence, cooperation, c… ▽ More

    Submitted 28 February, 2021; originally announced March 2021.

    Journal ref: in China Communications, vol. 17, no. 1, pp. 1-27, Jan. 2020

  50. arXiv:2102.00667  [pdf, ps, other

    cs.LG stat.ML

    Probabilistic Learning Vector Quantization on Manifold of Symmetric Positive Definite Matrices

    Authors: Fengzhen Tang, Haifeng Feng, Peter Tino, Bailu Si, Daxiong Ji

    Abstract: In this paper, we develop a new classification method for manifold-valued data in the framework of probabilistic learning vector quantization. In many classification scenarios, the data can be naturally represented by symmetric positive definite matrices, which are inherently points that live on a curved Riemannian manifold. Due to the non-Euclidean geometry of Riemannian manifolds, traditional Eu… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: 15 pages, 7 figures