Showing 1–50 of 3,767 results for author: Nguyen, T

  1. arXiv:2407.19560  [pdf, ps, other]

    cs.IT eess.SP

    Beamforming Design for Max-Min Fairness Performance Balancing in ISAC Systems

    Authors: Tianyu Fang, Nhan Thanh Nguyen, Markku Juntti

    Abstract: Integrated sensing and communications (ISAC) is envisioned as a key technology for future wireless communications. In this paper, we consider a downlink monostatic ISAC system wherein the base station serves multiple communications users and sensing targets at the same time in the presence of clutter. We aim at both guaranteeing fairness among the communications users while simultaneously balancin…

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: 5 pages, 3 figures

  2. arXiv:2407.19463  [pdf, other]

    cs.RO

    HD-maps as Prior Information for Globally Consistent Mapping in GPS-denied Environments

    Authors: Waqas Ali, Patric Jensfelt, Thien-Minh Nguyen

    Abstract: In recent years, prior maps have become a mainstream tool in autonomous navigation. However, commonly available prior maps are still tailored to control-and-decision tasks, and the use of these maps for localization remains largely unexplored. To bridge this gap, we propose a lidar-based localization and mapping (LOAM) system that can exploit the common HD-maps in autonomous driving scenarios. Spe…

    Submitted 28 July, 2024; originally announced July 2024.

  3. arXiv:2407.19299  [pdf, other]

    cs.CL eess.SP

    The Impact of LoRA Adapters for LLMs on Clinical NLP Classification Under Data Limitations

    Authors: Thanh-Dung Le, Ti Ti Nguyen, Vu Nguyen Ha

    Abstract: Fine-tuning Large Language Models (LLMs) for clinical Natural Language Processing (NLP) poses significant challenges due to the domain gap and limited data availability. This study investigates the effectiveness of various adapter techniques, equivalent to Low-Rank Adaptation (LoRA), for fine-tuning LLMs in a resource-constrained hospital environment. We experimented with four structures-Adapter,…

    Submitted 27 July, 2024; originally announced July 2024.

    Comments: Under revisions

  4. arXiv:2407.18892  [pdf, other]

    cs.RO cs.AI eess.SY

    SHANGUS: Deep Reinforcement Learning Meets Heuristic Optimization for Speedy Frontier-Based Exploration of Autonomous Vehicles in Unknown Spaces

    Authors: Seunghyeop Nam, Tuan Anh Nguyen, Eunmi Choi, Dugki Min

    Abstract: This paper introduces SHANGUS, an advanced framework combining Deep Reinforcement Learning (DRL) with heuristic optimization to improve frontier-based exploration efficiency in unknown environments, particularly for intelligent vehicles in autonomous air services, search and rescue operations, and space exploration robotics. SHANGUS harnesses DRL's adaptability and heuristic prioritization, marked…

    Submitted 26 July, 2024; originally announced July 2024.

  5. arXiv:2407.18467  [pdf]

    cs.LG

    Machine Unlearning using a Multi-GAN based Model

    Authors: Amartya Hatua, Trung T. Nguyen, Andrew H. Sung

    Abstract: This article presents a new machine unlearning approach that utilizes multiple Generative Adversarial Network (GAN) based models. The proposed method comprises two phases: i) data reorganization in which synthetic data using the GAN model is introduced with inverted class labels of the forget datasets, and ii) fine-tuning the pre-trained model. The GAN models consist of two pairs of generators and…

    Submitted 25 July, 2024; originally announced July 2024.

  6. arXiv:2407.18266  [pdf, ps, other]

    math.NT

    Average orders of Goldbach Estimates in Arithmetic Progressions

    Authors: Thi Thu Nguyen

    Abstract: We obtain asymptotic results on the average numbers of Goldbach representations of an integer as the sum of two primes in different arithmetic progressions. We also prove an omega-result showing that the asymptotic result is essentially the best possible.

    Submitted 17 July, 2024; originally announced July 2024.

    MSC Class: Primary 11P32; Secondary 11P55

  7. arXiv:2407.17801  [pdf, other]

    cs.LG cs.AI cs.HC

    EEG-SSM: Leveraging State-Space Model for Dementia Detection

    Authors: Xuan-The Tran, Linh Le, Quoc Toan Nguyen, Thomas Do, Chin-Teng Lin

    Abstract: State-space models (SSMs) have garnered attention for effectively processing long data sequences, reducing the need to segment time series into shorter intervals for model training and inference. Traditionally, SSMs capture only the temporal dynamics of time series data, omitting the equally critical spectral features. This study introduces EEG-SSM, a novel state-space model-based approach for dem…

    Submitted 25 July, 2024; originally announced July 2024.

  8. arXiv:2407.17790  [pdf, other]

    cs.LG cs.AR

    Exploring the Limitations of Kolmogorov-Arnold Networks in Classification: Insights to Software Training and Hardware Implementation

    Authors: Van Duy Tran, Tran Xuan Hieu Le, Thi Diem Tran, Hoai Luan Pham, Vu Trung Duong Le, Tuan Hai Vu, Van Tinh Nguyen, Yasuhiko Nakashima

    Abstract: Kolmogorov-Arnold Networks (KANs), a novel type of neural network, have recently gained popularity and attention due to the ability to substitute multi-layer perceptrons (MLPs) in artificial intelligence (AI) with higher accuracy and interoperability. However, KAN assessment is still limited and cannot provide an in-depth analysis of a specific domain. Furthermore, no study has been conducted on t…

    Submitted 25 July, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures, 2 tables

  9. arXiv:2407.17053  [pdf, other]

    cs.SE

    Automated Code-centric Software Vulnerability Assessment: How Far Are We? An Empirical Study in C/C++

    Authors: Anh The Nguyen, Triet Huynh Minh Le, M. Ali Babar

    Abstract: Background: The C and C++ languages hold significant importance in Software Engineering research because of their widespread use in practice. Numerous studies have utilized Machine Learning (ML) and Deep Learning (DL) techniques to detect software vulnerabilities (SVs) in the source code written in these languages. However, the application of these techniques in function-level SV assessment has be…

    Submitted 28 July, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

    Comments: Accepted as a full paper in the technical track at The International Symposium on Empirical Software Engineering and Measurement (ESEM) 2024

  10. arXiv:2407.16946  [pdf, other]

    cs.SE

    Automatic Categorization of GitHub Actions with Transformers and Few-shot Learning

    Authors: Phuong T. Nguyen, Juri Di Rocco, Claudio Di Sipio, Mudita Shakya, Davide Di Ruscio, Massimiliano Di Penta

    Abstract: In the GitHub ecosystem, workflows are used as an effective means to automate development tasks and to set up a Continuous Integration and Delivery (CI/CD pipeline). GitHub Actions (GHA) have been conceived to provide developers with a practical tool to create and maintain workflows, avoiding reinventing the wheel and cluttering the workflow with shell commands. Properly leveraging the power of Gi…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: The paper has been peer-reviewed and accepted for publication in the Proceedings of the 18th International Symposium on Empirical Software Engineering and Measurement (ESEM 2024)

  11. arXiv:2407.15812  [pdf, ps, other]

    math.AP

    On the stability of blowup solutions to the complex Ginzburg-Landau equation in R^d

    Authors: Jiajie Chen, Thomas Y. Hou, Van Tien Nguyen, Yixuan Wang

    Abstract: Building upon the idea in \cite{HNWarXiv24}, we establish stability of the type-I blowup with log correction for the complex Ginzburg-Landau equation. In the amplitude-phase representation, a generalized dynamic rescaling formulation is introduced, with modulation parameters capturing the spatial translation and rotation symmetries of the equation and novel additional modulation parameters perturb…

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 38 pages

    MSC Class: 35Q56

  12. arXiv:2407.15468  [pdf, ps, other]

    math.ST

    Efficient influence functions for Sobol' indices under two designs of experiments

    Authors: Thierry Klein, Agnès Lagnoux, Paul Rochet, Thi Mong Ngoc Nguyen

    Abstract: In this note, we are interested in the asymptotic efficiency of Sobol' indices estimators. After recalling the basis of asymptotic efficiency, we compute the efficient influence functions for Sobol' indices in two different contexts: the Pick-Freeze and the given-data settings.

    Submitted 22 July, 2024; originally announced July 2024.

  13. arXiv:2407.13904  [pdf, other]

    stat.ME

    In defense of MAR over latent ignorability (or latent MAR) for outcome missingness in studying principal causal effects: a causal graph view

    Authors: Trang Quynh Nguyen

    Abstract: This paper concerns outcome missingness in principal stratification analysis. We revisit a common assumption known as latent ignorability or latent missing-at-random (LMAR), often considered a relaxation of missing-at-random (MAR). LMAR posits that the outcome is independent of its missingness if one conditions on principal stratum (which is partially unobservable) in addition to observed variable…

    Submitted 18 July, 2024; originally announced July 2024.

  14. arXiv:2407.13842  [pdf, other]

    cs.RO cs.CV

    Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance

    Authors: Toan Nguyen, Minh Nhat Vu, Baoru Huang, An Vuong, Quan Vuong, Ngan Le, Thieu Vo, Anh Nguyen

    Abstract: 6-DoF grasp detection has been a fundamental and challenging problem in robotic vision. While previous works have focused on ensuring grasp stability, they often do not consider human intention conveyed through natural language, hindering effective collaboration between robots and users in complex 3D environments. In this paper, we present a new approach for language-driven 6-DoF grasp detection i…

    Submitted 25 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV 2024

  15. arXiv:2407.12094  [pdf, other]

    cs.CL

    Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models

    Authors: Minh Nguyen, Franck Dernoncourt, Seunghyun Yoon, Hanieh Deilamsalehy, Hao Tan, Ryan Rossi, Quan Hung Tran, Trung Bui, Thien Huu Nguyen

    Abstract: We introduce an approach to identifying speaker names in dialogue transcripts, a crucial task for enhancing content accessibility and searchability in digital media archives. Despite the advancements in speech recognition, the task of text-based speaker identification (SpeakerID) has received limited attention, lacking large-scale, diverse datasets for effective model training. Addressing these ga…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: accepted to INTERSPEECH 2024

  16. arXiv:2407.12064  [pdf, other]

    eess.IV cs.CL cs.CV cs.LG cs.MM

    LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task

    Authors: Khai Le-Duc, Ryan Zhang, Ngoc Son Nguyen, Tan-Hanh Pham, Anh Dao, Ba Hung Ngo, Anh Totti Nguyen, Truong-Son Hy

    Abstract: Vision-language models have been extensively explored across a wide range of tasks, achieving satisfactory performance; however, their application in medical imaging remains underexplored. In this work, we propose a unified framework - LiteGPT - for the medical imaging. We leverage multiple pre-trained visual encoders to enrich information and enhance the performance of vision-language models. To…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Preprint, 19 pages

  17. arXiv:2407.12034  [pdf, other]

    cs.CL cs.AI cs.LG

    Understanding Transformers via N-gram Statistics

    Authors: Timothy Nguyen

    Abstract: Transformer based large-language models (LLMs) display extreme proficiency with language yet a precise understanding of how they work remains elusive. One way of demystifying transformer predictions would be to describe how they depend on their context in terms of simple template functions. This paper takes a first step in this direction by considering families of functions (i.e. rules) formed out…

    Submitted 30 June, 2024; originally announced July 2024.

  18. arXiv:2407.11771  [pdf, other]

    cs.CV cs.AI cs.LG

    XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach

    Authors: Truong Thanh Hung Nguyen, Phuc Truong Loc Nguyen, Hung Cao

    Abstract: Recent advancements in deep learning have significantly improved visual quality inspection and predictive maintenance within industrial settings. However, deploying these technologies on low-resource edge devices poses substantial challenges due to their high computational demands and the inherent complexity of Explainable AI (XAI) methods. This paper addresses these challenges by introducing a no…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 28 pages, preprint submitted to Information Fusion journal

  19. arXiv:2407.11525  [pdf, ps, other]

    math.NT

    On a Theorem of Nathanson on Diophantine Approximation

    Authors: Jaroslav Hančl, Tho Phuoc Nguyen

    Abstract: In 1974, M. B. Nathanson proved that every irrational number $α$ represented by a simple continued fraction with infinitely many elements greater than or equal to $k$ is approximable by an infinite number of rational numbers $p/q$ satisfying $|α-p/q|<1/(\sqrt{k^2+4}q^2)$. In this paper we refine this result.

    Submitted 16 July, 2024; originally announced July 2024.

    MSC Class: 11J82; 11A55

  20. arXiv:2407.11194  [pdf, other]

    astro-ph.IM astro-ph.EP astro-ph.GA astro-ph.SR cs.AI cs.CL

    AstroMLab 1: Who Wins Astronomy Jeopardy!?

    Authors: Yuan-Sen Ting, Tuan Dung Nguyen, Tirthankar Ghosal, Rui Pan, Hardik Arora, Zechang Sun, Tijmen de Haan, Nesar Ramachandra, Azton Wells, Sandeep Madireddy, Alberto Accomazzi

    Abstract: We present a comprehensive evaluation of proprietary and open-weights large language models using the first astronomy-specific benchmarking dataset. This dataset comprises 4,425 multiple-choice questions curated from the Annual Review of Astronomy and Astrophysics, covering a broad range of astrophysical topics. Our analysis examines model performance across various astronomical subfields and asse…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 45 pages, 12 figures, 7 tables. Submitted to ApJ. Comments welcome. AstroMLab homepage: https://astromlab.org/

  21. arXiv:2407.11166  [pdf, ps, other]

    math.NT

    On a Theorem of Legendre on Diophantine Approximation

    Authors: Jaroslav Hančl, Tho Phuoc Nguyen

    Abstract: Legendre's theorem states that every irreducible fraction $\frac{p}{q}$ which satisfies the inequality $\left |α-\frac{p}{q} \right | < \frac{1}{2q^2}$ is a convergent of $α$. Later Barbolosi and Jager improved this theorem. In this paper we refine these results.

    Submitted 15 July, 2024; originally announced July 2024.

    MSC Class: 11J82; 11A55

  22. arXiv:2407.11078  [pdf, other]

    cs.LG cs.AI cs.CV

    Overcoming Catastrophic Forgetting in Federated Class-Incremental Learning via Federated Global Twin Generator

    Authors: Thinh Nguyen, Khoa D Doan, Binh T. Nguyen, Danh Le-Phuoc, Kok-Seng Wong

    Abstract: Federated Class-Incremental Learning (FCIL) increasingly becomes important in the decentralized setting, where it enables multiple participants to collaboratively train a global model to perform well on a sequence of tasks without sharing their private data. In FCIL, conventional Federated Learning algorithms such as FedAVG often suffer from catastrophic forgetting, resulting in significant perfor…

    Submitted 13 July, 2024; originally announced July 2024.

    MSC Class: 68T07 (Primary); 68T45 (Secondary)

  23. KAT: Dependency-aware Automated API Testing with Large Language Models

    Authors: Tri Le, Thien Tran, Duy Cao, Vy Le, Tien Nguyen, Vu Nguyen

    Abstract: API testing has increasing demands for software companies. Prior API testing tools were aware of certain types of dependencies that needed to be concise between operations and parameters. However, their approaches, which are mostly done manually or using heuristic-based algorithms, have limitations due to the complexity of these dependencies. In this paper, we present KAT (Katalon API Testing), a…

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: ICST 2024

  24. arXiv:2407.09740  [pdf, ps, other]

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.supr-con

    Ferroelectric AlBN Films by Molecular Beam Epitaxy

    Authors: Chandrashekhar Savant, Ved Gund, Kazuki Nomoto, Takuya Maeda, Shubham Jadhav, Joongwon Lee, Madhav Ramesh, Eungkyun Kim, Thai-Son Nguyen, Yu-Hsin Chen, Joseph Casamento, Farhan Rana, Amit Lal, Huili Xing, Debdeep Jena

    Abstract: We report the properties of molecular beam epitaxy deposited AlBN thin films on a recently developed epitaxial nitride metal electrode Nb2N. While a control AlN thin film exhibits standard capacitive behavior, distinct ferroelectric switching is observed in the AlBN films with increasing Boron mole fraction. The measured remnant polarization Pr of 15 uC/cm2 and coercive field Ec of 1.45 MV/cm in t…

    Submitted 17 July, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: DOI: 10.1063/5.0181217

  25. arXiv:2407.09281  [pdf, other]

    cs.AI

    Predicting and Understanding Human Action Decisions: Insights from Large Language Models and Cognitive Instance-Based Learning

    Authors: Thuy Ngoc Nguyen, Kasturi Jamale, Cleotilde Gonzalez

    Abstract: Large Language Models (LLMs) have demonstrated their capabilities across various tasks, from language translation to complex reasoning. Understanding and predicting human behavior and biases are crucial for artificial intelligence (AI) assisted systems to provide useful assistance, yet it remains an open question whether these models can achieve this. This paper addresses this gap by leveraging th…

    Submitted 12 July, 2024; originally announced July 2024.

  26. arXiv:2407.09035  [pdf, other]

    eess.IV cs.CV

    GPC: Generative and General Pathology Image Classifier

    Authors: Anh Tien Nguyen, Jin Tae Kwak

    Abstract: Deep learning has been increasingly incorporated into various computational pathology applications to improve its efficiency, accuracy, and robustness. Although successful, most previous approaches for image classification have crucial drawbacks. There exist numerous tasks in pathology, but one needs to build a model per task, i.e., a task-specific model, thereby increasing the number of models, t…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: MICCAI-MedAGI 2023 (Best Paper Honorable Mention)

  27. arXiv:2407.09030  [pdf, other]

    eess.IV cs.CV

    CAMP: Continuous and Adaptive Learning Model in Pathology

    Authors: Anh Tien Nguyen, Keunho Byeon, Kyungeun Kim, Boram Song, Seoung Wan Chae, Jin Tae Kwak

    Abstract: There exist numerous diagnostic tasks in pathology. Conventional computational pathology formulates and tackles them as independent and individual image classification problems, thereby resulting in computational inefficiency and high costs. To address the challenges, we propose a generic, unified, and universal framework, called a continuous and adaptive learning model in pathology (CAMP), for pa…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Under review

  28. arXiv:2407.08872  [pdf, other]

    cs.CV

    Visual Multi-Object Tracking with Re-Identification and Occlusion Handling using Labeled Random Finite Sets

    Authors: Linh Van Ma, Tran Thien Dat Nguyen, Changbeom Shim, Du Yong Kim, Namkoo Ha, Moongu Jeon

    Abstract: This paper proposes an online visual multi-object tracking (MOT) algorithm that resolves object appearance-reappearance and occlusion. Our solution is based on the labeled random finite set (LRFS) filtering approach, which in principle, addresses disappearance, appearance, reappearance, and occlusion via a single Bayesian recursion. However, in practice, existing numerical approximations cause rea…

    Submitted 11 July, 2024; originally announced July 2024.

  29. arXiv:2407.08470  [pdf, other]

    cs.CV cs.AI

    Brain Tumor Segmentation in MRI Images with 3D U-Net and Contextual Transformer

    Authors: Thien-Qua T. Nguyen, Hieu-Nghia Nguyen, Thanh-Hieu Bui, Thien B. Nguyen-Tat, Vuong M. Ngo

    Abstract: This research presents an enhanced approach for precise segmentation of brain tumor masses in magnetic resonance imaging (MRI) using an advanced 3D-UNet model combined with a Context Transformer (CoT). By architectural expansion CoT, the proposed model extends its architecture to a 3D format, integrates it smoothly with the base model to utilize the complex contextual information found in MRI scan…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 6 pages, 7 figures

  30. arXiv:2407.07917  [pdf, other]

    cs.CR cs.AI cs.CV cs.LG

    Non-Cooperative Backdoor Attacks in Federated Learning: A New Threat Landscape

    Authors: Tuan Nguyen, Dung Thuy Nguyen, Khoa D Doan, Kok-Seng Wong

    Abstract: Despite the promise of Federated Learning (FL) for privacy-preserving model training on distributed data, it remains susceptible to backdoor attacks. These attacks manipulate models by embedding triggers (specific input patterns) in the training data, forcing misclassification as predefined classes during deployment. Traditional single-trigger attacks and recent work on cooperative multiple-trigge…

    Submitted 5 July, 2024; originally announced July 2024.

  31. arXiv:2407.07472  [pdf, other]

    cs.SE cs.AI

    Rectifier: Code Translation with Corrector via LLMs

    Authors: Xin Yin, Chao Ni, Tien N. Nguyen, Shaohua Wang, Xiaohu Yang

    Abstract: Software migration is garnering increasing attention with the evolution of software and society. Early studies mainly relied on handcrafted translation rules to translate between two languages, the translation process is error-prone and time-consuming. In recent years, researchers have begun to explore the use of pre-trained large language models (LLMs) in code translation. However, code translati…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2308.03109, arXiv:2302.03908 by other authors

  32. arXiv:2407.07421  [pdf, other]

    cs.LG cs.AI cs.CR cs.DC

    Federated PCA on Grassmann Manifold for IoT Anomaly Detection

    Authors: Tung-Anh Nguyen, Long Tan Le, Tuan Dung Nguyen, Wei Bao, Suranga Seneviratne, Choong Seon Hong, Nguyen H. Tran

    Abstract: With the proliferation of the Internet of Things (IoT) and the rising interconnectedness of devices, network security faces significant challenges, especially from anomalous activities. While traditional machine learning-based intrusion detection systems (ML-IDS) effectively employ supervised learning methods, they possess limitations such as the requirement for labeled data and challenges with hi…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted for publication at IEEE/ACM Transactions on Networking

    Journal ref: IEEE/ACM Transactions on Networking On page(s): 1-16 Print ISSN: 1063-6692 Online ISSN: 1558-2566 Digital Object Identifier: 10.1109/TNET.2024.3423780

  33. arXiv:2407.07369  [pdf, ps, other]

    math.ST math.AP math.PR

    Viscosity estimation for 2D pipe flows I. Construction, consistency, asymptotic normality

    Authors: Thi Hien Nguyen, Armen Shirikyan

    Abstract: We consider the motion of incompressible viscous fluid in a rectangle, imposing the periodicity condition in one direction and the no-slip boundary condition in the other. Assuming that the flow is subject to an external random force, white in time and regular in space, we construct an estimator for the viscosity using only observations of the enstrophy. The goal of the paper is to prove that the…

    Submitted 10 July, 2024; originally announced July 2024.

    MSC Class: 35Q30; 37L55; 62M05; 76D06

  34. arXiv:2407.07360  [pdf, other]

    cs.CV cs.LG

    Towards a text-based quantitative and explainable histopathology image analysis

    Authors: Anh Tien Nguyen, Trinh Thi Le Vuong, Jin Tae Kwak

    Abstract: Recently, vision-language pre-trained models have emerged in computational pathology. Previous works generally focused on the alignment of image-text pairs via the contrastive pre-training paradigm. Such pre-trained models have been applied to pathology image classification in zero-shot learning or transfer learning fashion. Herein, we hypothesize that the pre-trained vision-language models can be…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: MICCAI 2024 - Early acceptance (Top 11%)

  35. arXiv:2407.06826  [pdf, other]

    cs.AI

    VRDSynth: Synthesizing Programs for Multilingual Visually Rich Document Information Extraction

    Authors: Thanh-Dat Nguyen, Tung Do-Viet, Hung Nguyen-Duy, Tuan-Hai Luu, Hung Le, Bach Le, Patanamon Thongtanunam

    Abstract: Businesses need to query visually rich documents (VRDs) like receipts, medical records, and insurance forms to make decisions. Existing techniques for extracting entities from VRDs struggle with new layouts or require extensive pre-training data. We introduce VRDSynth, a program synthesis method to automatically extract entity relations from multilingual VRDs without pre-training data. To capture…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted in ISSTA'24

  36. arXiv:2407.06581  [pdf, other]

    cs.AI cs.CV

    Vision language models are blind

    Authors: Pooyan Rahmanzadehgervi, Logan Bolton, Mohammad Reza Taesiri, Anh Totti Nguyen

    Abstract: While large language models with vision capabilities (VLMs), e.g., GPT-4o and Gemini 1.5 Pro, are powering various image-text applications and scoring high on many vision-understanding benchmarks, we find that they are surprisingly still struggling with low-level vision tasks that are easy to humans. Specifically, on BlindTest, our suite of 7 very simple tasks such as identifying (a) whether two c…

    Submitted 25 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

  37. arXiv:2407.06142  [pdf, ps, other]

    cs.NI eess.SY math.OC

    Delay-Aware Robust Edge Network Hardening Under Decision-Dependent Uncertainty

    Authors: Jiaming Cheng, Duong Thuy Anh Nguyen, Ni Trieu, Duong Tung Nguyen

    Abstract: Edge computing promises to offer low-latency and ubiquitous computation to numerous devices at the network edge. For delay-sensitive applications, link delays can have a direct impact on service quality. These delays can fluctuate drastically over time due to various factors such as network congestion, changing traffic conditions, cyberattacks, component failures, and natural disasters. Thus, it i…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 14 pages, 18 figures

  38. arXiv:2407.06045  [pdf, other]

    cs.CV

    OpenCIL: Benchmarking Out-of-Distribution Detection in Class-Incremental Learning

    Authors: Wenjun Miao, Guansong Pang, Trong-Tung Nguyen, Ruohang Fang, Jin Zheng, Xiao Bai

    Abstract: Class incremental learning (CIL) aims to learn a model that can not only incrementally accommodate new classes, but also maintain the learned knowledge of old classes. Out-of-distribution (OOD) detection in CIL is to retain this incremental learning ability, while being able to reject unknown samples that are drawn from different distributions of the learned classes. This capability is crucial to…

    Submitted 9 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  39. arXiv:2407.05469  [pdf, other]

    cs.CV

    Smart Camera Parking System With Auto Parking Spot Detection

    Authors: Tuan T. Nguyen, Mina Sartipi

    Abstract: Given the rising urban population and the consequential rise in traffic congestion, the implementation of smart parking systems has emerged as a critical matter of concern. Smart parking solutions use cameras, sensors, and algorithms like computer vision to find available parking spaces. This method improves parking place recognition, reduces traffic and pollution, and optimizes travel time. In re…

    Submitted 7 July, 2024; originally announced July 2024.

  40. arXiv:2407.05452  [pdf, other]

    cs.CV

    Semantic Segmentation for Real-World and Synthetic Vehicle's Forward-Facing Camera Images

    Authors: Tuan T. Nguyen, Phan Le, Yasir Hassan, Mina Sartipi

    Abstract: In this paper, we present the submission to the 5th Annual Smoky Mountains Computational Sciences Data Challenge, Challenge 3. This is the solution for semantic segmentation problem in both real-world and synthetic images from a vehicle's forward-facing camera. We concentrate on building a robust model which performs well across various domains of different outdoor situations such as sunny, snowy,…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 13 pages

  41. arXiv:2407.05205  [pdf, other]

    cs.CY cs.AI cs.LG

    The AI Companion in Education: Analyzing the Pedagogical Potential of ChatGPT in Computer Science and Engineering

    Authors: Zhangying He, Thomas Nguyen, Tahereh Miari, Mehrdad Aliasgari, Setareh Rafatirad, Hossein Sayadi

    Abstract: Artificial Intelligence (AI), with ChatGPT as a prominent example, has recently taken center stage in various domains including higher education, particularly in Computer Science and Engineering (CSE). The AI revolution brings both convenience and controversy, offering substantial benefits while lacking formal guidance on their application. The primary objective of this work is to comprehensively…

    Submitted 23 April, 2024; originally announced July 2024.

    Comments: conference, 13 pages

  42. arXiv:2407.04992  [pdf, other]

    cs.LG cs.AI stat.ME

    Scalable Variational Causal Discovery Unconstrained by Acyclicity

    Authors: Nu Hoang, Bao Duong, Thin Nguyen

    Abstract: Bayesian causal discovery offers the power to quantify epistemic uncertainties among a broad range of structurally diverse causal theories potentially explaining the data, represented in forms of directed acyclic graphs (DAGs). However, existing methods struggle with efficient DAG sampling due to the complex acyclicity constraint. In this study, we propose a scalable Bayesian approach to effective…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted at ECAI 2024

  43. arXiv:2407.04980  [pdf, other]

    cs.LG cs.AI stat.ME

    Enabling Causal Discovery in Post-Nonlinear Models with Normalizing Flows

    Authors: Nu Hoang, Bao Duong, Thin Nguyen

    Abstract: Post-nonlinear (PNL) causal models stand out as a versatile and adaptable framework for modeling intricate causal relationships. However, accurately capturing the invertibility constraint required in PNL models remains challenging in existing studies. To address this problem, we introduce CAF-PoNo (Causal discovery via Normalizing Flows for Post-Nonlinear models), harnessing the power of the norma…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted at ECAI 2024

  44. arXiv:2407.04489  [pdf, other]

    cs.CV

    Dude: Dual Distribution-Aware Context Prompt Learning For Large Vision-Language Model

    Authors: Duy M. H. Nguyen, An T. Le, Trung Q. Nguyen, Nghiem T. Diep, Tai Nguyen, Duy Duong-Tran, Jan Peters, Li Shen, Mathias Niepert, Daniel Sonntag

    Abstract: Prompt learning methods are gaining increasing attention due to their ability to customize large vision-language models to new domains using pre-trained contextual knowledge and minimal training data. However, existing works typically rely on optimizing unified prompt inputs, often struggling with fine-grained classification tasks due to insufficient discriminative attributes. To tackle this, we c…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Version 1

  45. arXiv:2407.04408  [pdf, ps, other]

    eess.SP

    Hybrid Receiver Design for Massive MIMO-OFDM with Low-Resolution ADCs and Oversampling

    Authors: Mengyuan Ma, Nhan Thanh Nguyen, Italo Atzeni, Markku Juntti

    Abstract: Low-resolution analog-to-digital converters (ADCs) and hybrid beamforming have emerged as efficient solutions to reduce power consumption with satisfactory spectral efficiency (SE) in massive multiple-input multiple-output (MIMO) systems. In this paper, we investigate the performance of a hybrid receiver in uplink massive MIMO orthogonal frequency-division multiplexing (OFDM) systems with low-reso…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 6 pages, 4 figures, submitted to GlobeCom 2024

  46. arXiv:2407.03796  [pdf, ps, other]

    eess.SP

    Joint Beamforming Design and Bit Allocation in Massive MIMO with Resolution-Adaptive ADCs

    Authors: Mengyuan Ma, Nhan Thanh Nguyen, Italo Atzeni, Markku Juntti

    Abstract: Low-resolution analog-to-digital converters (ADCs) have emerged as a promising technology for reducing power consumption and complexity in massive multiple-input multiple-output (MIMO) systems while maintaining satisfactory spectral and energy efficiencies (SE/EE). In this work, we first identify the essential properties of optimal quantization and leverage them to derive a closed-form approximati…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 13 pages, 14 figures

  47. arXiv:2407.03788  [pdf, other]

    cs.CV cs.CL

    Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

    Authors: Thong Nguyen, Yi Bin, Xiaobao Wu, Xinshuai Dong, Zhiyuan Hu, Khoi Le, Cong-Duy Nguyen, See-Kiong Ng, Luu Anh Tuan

    Abstract: Data quality stands at the forefront of deciding the effectiveness of video-language representation learning. However, video-text pairs in previous data typically do not align perfectly with each other, which might lead to video-language representations that do not accurately reflect cross-modal semantics. Moreover, previous data also possess an uneven distribution of concepts, thereby hampering t…

    Submitted 19 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  48. arXiv:2407.03665  [pdf, other]

    cs.IR cs.AI cs.LG cs.SI stat.ML

    Heterogeneous Hypergraph Embedding for Recommendation Systems

    Authors: Darnbi Sakong, Viet Hung Vu, Thanh Trung Huynh, Phi Le Nguyen, Hongzhi Yin, Quoc Viet Hung Nguyen, Thanh Tam Nguyen

    Abstract: Recent advancements in recommender systems have focused on integrating knowledge graphs (KGs) to leverage their auxiliary information. The core idea of KG-enhanced recommenders is to incorporate rich semantic information for more accurate recommendations. However, two main challenges persist: i) Neglecting complex higher-order interactions in the KG-based user-item network, potentially leading to…

    Submitted 4 July, 2024; originally announced July 2024.

  49. arXiv:2407.03611  [pdf, other]

    cs.SE cs.AI

    An Empirical Study on Capability of Large Language Models in Understanding Code Semantics

    Authors: Thu-Trang Nguyen, Thanh Trong Vu, Hieu Dinh Vo, Son Nguyen

    Abstract: Large Language Models for Code (code LLMs) have demonstrated remarkable performance across various software engineering (SE) tasks, increasing the application of code LLMs in software development. Despite the success of code LLMs, there remain significant concerns about the actual capabilities and reliability of these models, "whether these models really learn the semantics of code from the traini…

    Submitted 3 July, 2024; originally announced July 2024.

  50. arXiv:2407.03144  [pdf, other]

    cs.CV

    Venomancer: Towards Imperceptible and Target-on-Demand Backdoor Attacks in Federated Learning

    Authors: Son Nguyen, Thinh Nguyen, Khoa D Doan, Kok-Seng Wong

    Abstract: Federated Learning (FL) is a distributed machine learning approach that maintains data privacy by training on decentralized data sources. Similar to centralized machine learning, FL is also susceptible to backdoor attacks, where an attacker can compromise some clients by injecting a backdoor trigger into local models of those clients, leading to the global model's behavior being manipulated as des…

    Submitted 11 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.