- research-article, July 2024, JUST ACCEPTED
Adaptive Semantic Information Extraction of Tibetan Opera Mask with Recall Loss
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Just Accepted. https://doi.org/10.1145/3666041
With the development of artificial intelligence, natural language processing enables us to better understand and utilize semantic information. However, traditional object detection algorithms cannot get an effective performance, when dealed with Tibetan ...
- research-article, July 2024, JUST ACCEPTED
Exploring the Correlation between Emojis and Mood Expression in Thai Twitter Discourse
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Just Accepted. https://doi.org/10.1145/3680543
Mood, a long-lasting affective state detached from specific stimuli, plays an important role in behavior. Although sentiment analysis and emotion classification have garnered attention, research on mood classification remains in its early stages. This ...
- research-article, July 2024
Automatic Algerian Sarcasm Detection from Texts and Images
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 23, Issue 7, Article No.: 108, Pages 1–25. https://doi.org/10.1145/3670403
In recent years, the number of Algerian Internet users has significantly increased, providing a valuable opportunity for collecting and utilizing opinions and sentiments expressed online. They now post not just texts but also images. However, to benefit ...
- research-article, July 2024
Enhancing Chinese Event Extraction with Event Trigger Structures
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 23, Issue 7, Article No.: 106, Pages 1–18. https://doi.org/10.1145/3663567
The dependency syntactic structure is widely used in event extraction. However, the dependency structure reflecting syntactic features is essentially different from the event structure that reflects semantic features, leading to the performance ...
- research-article, July 2024, JUST ACCEPTED
Analyzing the Effects of Transcription Errors on Summary Generation of Bengali Spoken Documents
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Just Accepted. https://doi.org/10.1145/3678005
Automatic speech recognition (ASR) has become an indispensable part of the AI domain, with various speech technologies reliant on it. The quality of speech recognition depends on the amount of annotated data used to train an ASR system, among other ...
- research-article, July 2024, JUST ACCEPTED
CoMix: Confronting with Noisy Label Learning with Co-training Strategies on Textual Mislabeling
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Just Accepted. https://doi.org/10.1145/3678175
The existence of noisy labels is inevitable in real-world large-scale corpora. As deep neural networks are notably vulnerable to overfitting on noisy samples, this highlights the importance of the ability of language models to resist noise for efficient ...
- research-article, July 2024
KannadaLex: A lexical database with psycholinguistic information
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 23, Issue 7, Article No.: 104, Pages 1–21. https://doi.org/10.1145/3670688
Databases containing lexical properties are of primary importance to psycholinguistic research and speech-language therapy. Several lexical databases for different languages have been developed in the recent past, but Kannada, a language spoken by 50.8 ...
- research-article, July 2024
Share What You Already Know: Cross-Language-Script Transfer and Alignment for Sentiment Detection in Code-Mixed Data
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 23, Issue 7, Article No.: 103, Pages 1–15. https://doi.org/10.1145/3661307
Code-switching entails mixing multiple languages. It is an increasingly occurring phenomenon in social media texts. Usually, code-mixed texts are written in a single script, even though the languages involved have different scripts. Pre-trained ...
- research-article, July 2024, JUST ACCEPTED
Vi-AbSQA: Multi-task Prompt Instruction Tuning Model for Vietnamese Aspect-based Sentiment Quadruple Analysis
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Just Accepted. https://doi.org/10.1145/3676886
Aspect-based sentiment analysis (ABSA) has recently received considerable attention within the Natural Language Processing (NLP) community, especially for complex tasks like triplet extraction or quadruplet prediction. However, most existing studies focus ...
- research-article, June 2024, JUST ACCEPTED
Towards Vietnamese Question and Answer Generation: An Empirical Study
- Quoc-Hung Pham,
- Huu-Loi Le,
- Minh Dang Nhat,
- Khang Tran T.,
- Manh Tran-Tien,
- Viet-Hung Dang,
- Huy-The Vu,
- Minh-Tien Nguyen,
- Xuan-Hieu Phan
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Just Accepted. https://doi.org/10.1145/3675781
Question-answer generation (QAG) is a challenging task that generates both questions and answers from a given input paragraph context. The QAG task has recently achieved promising results thanks to the appearance of large pre-trained language models, yet, ...
- short-paper, June 2024, JUST ACCEPTED
Optimizing Uyghur Speech Synthesis by Combining Pretrained Cross-Lingual Model
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Just Accepted. https://doi.org/10.1145/3675397
End-to-end speech synthesis methodologies have exhibited considerable advancements for languages with abundant corpus resources. Nevertheless, such achievements are yet to be realized for languages constrained by limited corpora. This manuscript ...
- research-article, June 2024, JUST ACCEPTED
Travel Agency Task Dialogue Corpus: A Multimodal Dataset with Age-Diverse Speakers
- Michimasa Inaba,
- Yuya Chiba,
- Zhiyang Qi,
- Ryuichiro Higashinaka,
- Kazunori Komatani,
- Yusuke Miyao,
- Takayuki Nagai
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Just Accepted. https://doi.org/10.1145/3675166
When individuals communicate, they use different vocabularies, speaking speeds, facial expressions, and gestural languages, depending on those with whom they are speaking. This study focuses on the age of the speaker as a factor that affects the style of ...
- research-article, June 2024
X-Phishing-Writer: A Framework for Cross-lingual Phishing E-mail Generation
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 23, Issue 7, Article No.: 102, Pages 1–34. https://doi.org/10.1145/3670402
Cybercrime is projected to cause annual business losses of $10.5 trillion by 2025, a significant concern given that a majority of security breaches are due to human errors, especially through phishing attacks. The rapid increase in daily identified ...
- research-article, June 2024
SCBG: Semantic-Constrained Bidirectional Generation for Emotional Support Conversation
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 23, Issue 7, Article No.: 101, Pages 1–17. https://doi.org/10.1145/3666090
The Emotional Support Conversation (ESC) task aims to deliver consolation, encouragement, and advice to individuals undergoing emotional distress, thereby assisting them in overcoming difficulties. In the context of emotional support dialogue systems, it ...
- research-article, June 2024
Document-Level Relation Extraction Based on Machine Reading Comprehension and Hybrid Pointer-sequence Labeling
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 23, Issue 7, Article No.: 100, Pages 1–16. https://doi.org/10.1145/3666042
Document-level relational extraction requires reading, memorization, and reasoning to discover relevant factual information in multiple sentences. It is difficult for the current hierarchical network and graph network methods to fully capture the ...
- research-article, June 2024
MizBERT: A Mizo BERT Model
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 23, Issue 7, Article No.: 99, Pages 1–14. https://doi.org/10.1145/3666003
This research investigates the utilization of pre-trained BERT transformers within the context of the Mizo language. BERT, an abbreviation for Bidirectional Encoder Representations from Transformers, symbolizes Google’s forefront neural network approach ...
- research-article, June 2024
Towards Better Quantity Representations for Solving Math Word Problems
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 23, Issue 7, Article No.: 96, Pages 1–18. https://doi.org/10.1145/3665644
Solving a math word problem requires selecting quantities in it and performing appropriate arithmetic operations to obtain the answer. For deep learning-based methods, it is vital to obtain good quantity representations, i.e., to selectively and ...
- research-article, June 2024
Neural Machine Translation for Low-Resource Languages from a Chinese-centric Perspective: A Survey
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 23, Issue 6, Article No.: 80, Pages 1–60. https://doi.org/10.1145/3665244
Machine translation–the automatic transformation of one natural language (source language) into another (target language) through computational means–occupies a central role in computational linguistics and stands as a cornerstone of research within the ...
- research-article, June 2024
Learning Domain Specific Sub-layer Latent Variable for Multi-Domain Adaptation Neural Machine Translation
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 23, Issue 6, Article No.: 78, Pages 1–15. https://doi.org/10.1145/3661305
Domain adaptation proves to be an effective solution for addressing inadequate translation performance within specific domains. However, the straightforward approach of mixing data from multiple domains to obtain the multi-domain neural machine ...