Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–13 of 13 results for author: Zang, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.15273  [pdf, other

    cs.CL

    Event Temporal Relation Extraction based on Retrieval-Augmented on LLMs

    Authors: Xiaobin Zhang, Liangjun Zang, Qianwen Liu, Shuchong Wei, Songlin Hu

    Abstract: Event temporal relation (TempRel) is a primary subject of the event relation extraction task. However, the inherent ambiguity of TempRel increases the difficulty of the task. With the rise of prompt engineering, it is important to design effective prompt templates and verbalizers to extract relevant knowledge. The traditional manually designed templates struggle to extract precise temporal knowled… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 8 pages,6 figures.Accepted to the International Joint Conference on Neural Networks (IJCNN2024)

  2. arXiv:2401.13569  [pdf, other

    cs.NI cs.DC

    SPARC-LoRa: A Scalable, Power-efficient, Affordable, Reliable, and Cloud Service-enabled LoRa Networking System for Agriculture Applications

    Authors: Xi Wang, Bryan Hatasaka, Zhengyan Liu, Sayali Tope, Mohit Karkhanis, Seungbeom Noh, Farhan Sium, Ravi V. Mural, Hanseup Kim, Carlos Mastrangelo, Ling Zang, James Schnable, Mingyue Ji

    Abstract: With the rapid development of cloud and edge computing, Internet of Things (IoT) applications have been deployed in various aspects of human life. In this paper, we design and implement a holistic LoRa-based IoT system with LoRa communication capabilities, named SPARC-LoRa, which consists of field sensor nodes and a gateway connected to the Internet. SPARC-LoRa has the following important features… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 6 pages, 8 figures, submitted for publication

  3. arXiv:2401.03512  [pdf, other

    cs.CL cs.AI cs.LG

    CharPoet: A Chinese Classical Poetry Generation System Based on Token-free LLM

    Authors: Chengyue Yu, Lei Zang, Jiaotuan Wang, Chenyi Zhuang, Jinjie Gu

    Abstract: Automatic Chinese classical poetry generation has attracted much research interest, but achieving effective control over format and content simultaneously remains challenging. Traditional systems usually accept keywords as user inputs, resulting in limited control over content. Large language models (LLMs) improve content control by allowing unrestricted user instructions, but the token-by-token g… ▽ More

    Submitted 20 March, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  4. arXiv:2202.13840  [pdf, other

    cs.CL

    Text Smoothing: Enhance Various Data Augmentation Methods on Text Classification Tasks

    Authors: Xing Wu, Chaochen Gao, Meng Lin, Liangjun Zang, Zhongyuan Wang, Songlin Hu

    Abstract: Before entering the neural network, a token is generally converted to the corresponding one-hot representation, which is a discrete distribution of the vocabulary. Smoothed representation is the probability of candidate tokens obtained from a pre-trained masked language model, which can be seen as a more informative substitution to the one-hot representation. We propose an efficient data augmentat… ▽ More

    Submitted 28 February, 2022; originally announced February 2022.

    Comments: ACL 2022 Main Conference Accepted

  5. arXiv:2112.05638  [pdf, other

    cs.AI

    DistilCSE: Effective Knowledge Distillation For Contrastive Sentence Embeddings

    Authors: Chaochen Gao, Xing Wu, Peng Wang, Jue Wang, Liangjun Zang, Zhongyuan Wang, Songlin Hu

    Abstract: Large-scale contrastive learning models can learn very informative sentence embeddings, but are hard to serve online due to the huge model size. Therefore, they often play the role of "teacher", transferring abilities to small "student" models through knowledge distillation. However, knowledge distillation inevitably brings some drop in embedding effect. To tackle that, we propose an effective kno… ▽ More

    Submitted 30 January, 2023; v1 submitted 10 December, 2021; originally announced December 2021.

    Comments: Work in progress

  6. arXiv:2109.04380  [pdf, other

    cs.CL cs.AI

    ESimCSE: Enhanced Sample Building Method for Contrastive Learning of Unsupervised Sentence Embedding

    Authors: Xing Wu, Chaochen Gao, Liangjun Zang, Jizhong Han, Zhongyuan Wang, Songlin Hu

    Abstract: Contrastive learning has been attracting much attention for learning unsupervised sentence embeddings. The current state-of-the-art unsupervised method is the unsupervised SimCSE (unsup-SimCSE). Unsup-SimCSE takes dropout as a minimal data augmentation method, and passes the same input sentence to a pre-trained Transformer encoder (with dropout turned on) twice to obtain the two corresponding embe… ▽ More

    Submitted 11 September, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: COLING 2022

  7. arXiv:2005.11888  [pdf, other

    cs.IR cs.CL

    AutoSUM: Automating Feature Extraction and Multi-user Preference Simulation for Entity Summarization

    Authors: Dongjun Wei, Yaxin Liu, Fuqing Zhu, Liangjun Zang, Wei Zhou, Yijun Lu, Songlin Hu

    Abstract: Withthegrowthofknowledgegraphs, entity descriptions are becoming extremely lengthy. Entity summarization task, aiming to generate diverse, comprehensive, and representative summaries for entities, has received increasing interest recently. In most previous methods, features are usually extracted by the handcrafted templates. Then the feature selection and multi-user preference simulation take plac… ▽ More

    Submitted 24 May, 2020; originally announced May 2020.

    Comments: 11 pages, accepted in PAKDD'2020

  8. arXiv:2002.09634  [pdf, other

    cs.CL

    Data Augmentation for Copy-Mechanism in Dialogue State Tracking

    Authors: Xiaohui Song, Liangjun Zang, Yipeng Su, Xing Wu, Jizhong Han, Songlin Hu

    Abstract: While several state-of-the-art approaches to dialogue state tracking (DST) have shown promising performances on several benchmarks, there is still a significant performance gap between seen slot values (i.e., values that occur in both training set and test set) and unseen ones (values that occur in training set but not in test set). Recently, the copy-mechanism has been widely used in DST models t… ▽ More

    Submitted 22 February, 2020; originally announced February 2020.

  9. arXiv:1909.05364  [pdf, other

    cs.CL cs.AI

    TransSent: Towards Generation of Structured Sentences with Discourse Marker

    Authors: Xing Wu, Dongjun Wei, Liangjun Zang, Jizhong Han, Songlin Hu

    Abstract: Structured sentences are important expressions in human writings and dialogues. Previous works on neural text generation fused semantic and structural information by encoding the entire sentence into a mixed hidden representation. However, when a generated sentence becomes complicated, the structure is difficult to be properly maintained. To alleviate this problem, we explicitly separate the model… ▽ More

    Submitted 8 May, 2020; v1 submitted 5 September, 2019; originally announced September 2019.

    Comments: 5 figures

  10. arXiv:1908.08039  [pdf, other

    cs.CL

    "Mask and Infill" : Applying Masked Language Model to Sentiment Transfer

    Authors: Xing Wu, Tao Zhang, Liangjun Zang, Jizhong Han, Songlin Hu

    Abstract: This paper focuses on the task of sentiment transfer on non-parallel text, which modifies sentiment attributes (e.g., positive or negative) of sentences while preserving their attribute-independent content. Due to the limited capability of RNNbased encoder-decoder structure to capture deep and long-range dependencies among words, previous works can hardly generate satisfactory sentences from scrat… ▽ More

    Submitted 21 August, 2019; originally announced August 2019.

    Comments: IJCAI 2019

  11. arXiv:1905.10625  [pdf, other

    cs.CL cs.AI

    ESA: Entity Summarization with Attention

    Authors: Dongjun Wei, Yaxin Liu, Fuqing Zhu, Liangjun Zang, Wei Zhou, Jizhong Han, Songlin Hu

    Abstract: Entity summarization aims at creating brief but informative descriptions of entities from knowledge graphs. While previous work mostly focused on traditional techniques such as clustering algorithms and graph models, we ask how to apply deep learning methods into this task. In this paper we propose ESA, a neural network with supervised attention mechanisms for entity summarization. Specifically, w… ▽ More

    Submitted 25 May, 2020; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: 12pages, accepted in EYRE@CIKM'2019

  12. arXiv:1812.06705  [pdf, other

    cs.CL cs.AI cs.LG

    Conditional BERT Contextual Augmentation

    Authors: Xing Wu, Shangwen Lv, Liangjun Zang, Jizhong Han, Songlin Hu

    Abstract: We propose a novel data augmentation method for labeled sentences called conditional BERT contextual augmentation. Data augmentation methods are often applied to prevent overfitting and improve generalization of deep neural network models. Recently proposed contextual augmentation augments labeled sentences by randomly replacing words with more varied substitutions predicted by language model. BER… ▽ More

    Submitted 17 December, 2018; originally announced December 2018.

    Comments: 9 pages, 1 figure

  13. arXiv:1802.07608  [pdf, ps, other

    cs.SE

    Learning to Synthesize

    Authors: Yingfei Xiong, Bo Wang, Guirong Fu, Linfei Zang

    Abstract: In many scenarios we need to find the most likely program under a local context, where the local context can be an incomplete program, a partial specification, natural language description, etc. We call such problem program estimation. In this paper we propose an abstract framework, learning to synthesis, or L2S in short, to address this problem. L2S combines four tools to achieve this: syntax is… ▽ More

    Submitted 21 February, 2018; originally announced February 2018.