Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
Volume 23, Issue 5May 2024
Editor:
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
ISSN:2375-4699
EISSN:2375-4702
Recommend ACM DL
ALREADY A SUBSCRIBER?SIGN IN
Reflects downloads up to 05 Mar 2025Bibliometrics
Skip Table Of Content Section
research-article
Open Access
More Than Syntaxes: Investigating Semantics to Zero-shot Cross-lingual Relation Extraction and Event Argument Role Labelling
Article No.: 61, Pages 1–21https://doi.org/10.1145/3582261

Syntactic dependency structures are commonly utilized as language-agnostic features to solve the word order difference issues in zero-shot cross-lingual relation and event extraction tasks. However, while sentences in multiple forms can be employed to ...

research-article
A Research on University Students’ Behavioral Intention to Use New-generation Information Technology in Intelligent Foreign Language Learning
Article No.: 62, Pages 1–15https://doi.org/10.1145/3563774

A better understanding of how advancement in science and technology affect students’ learning behavior in an academic setting can help all educators in higher education. With the advancement of science and technology, the new-generation information ...

research-article
Open Access
Knowledge-based Data Processing for Multilingual Natural Language Analysis
Article No.: 63, Pages 1–16https://doi.org/10.1145/3583686

Natural Language Processing (NLP) aids the empowerment of intelligent machines by enhancing human language understanding for linguistic-based human-computer communication. Recent developments in processing power, as well as the availability of large ...

research-article
Automatically Temporal Labeled Data Generation Using Positional Lexicon Expansion for Focus Time Estimation of News Articles
Article No.: 64, Pages 1–20https://doi.org/10.1145/3568164

Many facts change over time, which is a fundamental aspect of our physical environment. In the case of pandemic articles, the user is not interested in the creation date of the document but in the facts and the cause of the last pandemic. Fake news can be ...

research-article
Open Access
Multilingual Neural Machine Translation for Indic to Indic Languages
Article No.: 65, Pages 1–32https://doi.org/10.1145/3652026

The method of translation from one language to another without human intervention is known as Machine Translation (MT). Multilingual neural machine translation (MNMT) is a technique for MT that builds a single model for multiple languages. It is preferred ...

research-article
A Novel Pretrained General-purpose Vision Language Model for the Vietnamese Language
Article No.: 66, Pages 1–16https://doi.org/10.1145/3654796

Lying in the cross-section of computer vision and natural language processing, vision language models are capable of processing images and text at once. These models are helpful in various tasks: text generation from image and vice versa, image-text ...

research-article
Open Access
Crossing Linguistic Barriers: Authorship Attribution in Sinhala Texts
Article No.: 67, Pages 1–14https://doi.org/10.1145/3655620

Authorship attribution involves determining the original author of an anonymous text from a pool of potential authors. The author attribution task has applications in several domains, such as plagiarism detection, digital text forensics, and information ...

research-article
Fast Recurrent Neural Network with Bi-LSTM for Handwritten Tamil Text Segmentation in NLP
Article No.: 68, Pages 1–20https://doi.org/10.1145/3643808

Tamil text segmentation is a long-standing test in language comprehension that entails separating a record into adjacent pieces based on its semantic design. Each segment is important in its own way. The segments are organised according to the purpose of ...

research-article
Multization: Multi-Modal Summarization Enhanced by Multi-Contextually Relevant and Irrelevant Attention Alignment
Article No.: 69, Pages 1–29https://doi.org/10.1145/3651983

This article focuses on the task of Multi-Modal Summarization with Multi-Modal Output for China JD.COM e-commerce product description containing both source text and source images. In the context learning of multi-modal (text and image) input, there ...

research-article
Part-of-speech Tagging for Low-resource Languages: Activation Function for Deep Learning Network to Work with Minimal Training Data
Article No.: 70, Pages 1–31https://doi.org/10.1145/3655023

Numerous natural language processing (NLP) applications exist today, especially for the most commonly spoken languages such as English, Chinese, and Spanish. Popular traditional methods such as Rule based methods, Naive Bayes classifiers, Hidden Markov ...

research-article
Performance of Binarization Algorithms on Tamizhi Inscription Images: An Analysis
Article No.: 71, Pages 1–29https://doi.org/10.1145/3656583

Binarization of Tamizhi (Tamil-Brahmi) inscription images are highly challenging, as it is captured from very old stone inscriptions that exists around 3rd century BCE in India. The difficulty is due to the degradation of these inscriptions by ...

research-article
Knowledge-Enriched Prompt for Low-Resource Named Entity Recognition
Article No.: 72, Pages 1–15https://doi.org/10.1145/3659948

Named Entity Recognition (NER) in low-resource settings aims to identify and categorize entities in a sentence with limited labeled data. Although prompt-based methods have succeeded in low-resource perspectives, challenges persist in effectively ...

short-paper
Cleansing Jewel: A Neural Spelling Correction Model Built On Google OCR-ed Tibetan Manuscripts
Article No.: 73, Pages 1–11https://doi.org/10.1145/3654811

Scholars in the humanities heavily rely on ancient manuscripts to study history, religion, and socio-political structures of the past. Significant efforts have been devoted to digitizing these precious manuscripts using OCR technology. However, most ...

short-paper
Supervised Contrast Learning Text Classification Model Based on Data Quality Augmentation
Article No.: 74, Pages 1–12https://doi.org/10.1145/3653300

Token-level data augmentation generates text samples by modifying the words of the sentences. However, data that are not easily classified can negatively affect the model. In particular, not considering the role of keywords when performing random ...

short-paper
MRMI-TTS: Multi-Reference Audios and Mutual Information Driven Zero-Shot Voice Cloning
Article No.: 75, Pages 1–14https://doi.org/10.1145/3649501

Voice cloning in text-to-speech (TTS) is the process of replicating the voice of a target speaker with limited data. Among various voice cloning techniques, this article focuses on zero-shot voice cloning. Although existing TTS models can generate high-...

Comments

Subjects