
Showing 1–50 of 220 results for author: Nguyen, T H

  1. arXiv:2407.12094  [pdf, other]

    cs.CL

    Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models

    Authors: Minh Nguyen, Franck Dernoncourt, Seunghyun Yoon, Hanieh Deilamsalehy, Hao Tan, Ryan Rossi, Quan Hung Tran, Trung Bui, Thien Huu Nguyen

    Abstract: We introduce an approach to identifying speaker names in dialogue transcripts, a crucial task for enhancing content accessibility and searchability in digital media archives. Despite the advancements in speech recognition, the task of text-based speaker identification (SpeakerID) has received limited attention, lacking large-scale, diverse datasets for effective model training. Addressing these ga…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: accepted to INTERSPEECH 2024

  2. arXiv:2407.11771  [pdf, other]

    cs.CV cs.AI cs.LG

    XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach

    Authors: Truong Thanh Hung Nguyen, Phuc Truong Loc Nguyen, Hung Cao

    Abstract: Recent advancements in deep learning have significantly improved visual quality inspection and predictive maintenance within industrial settings. However, deploying these technologies on low-resource edge devices poses substantial challenges due to their high computational demands and the inherent complexity of Explainable AI (XAI) methods. This paper addresses these challenges by introducing a no…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 28 pages, preprint submitted to Information Fusion journal

  3. arXiv:2407.07369  [pdf, ps, other]

    math.ST math.AP math.PR

    Viscosity estimation for 2D pipe flows I. Construction, consistency, asymptotic normality

    Authors: Thi Hien Nguyen, Armen Shirikyan

    Abstract: We consider the motion of incompressible viscous fluid in a rectangle, imposing the periodicity condition in one direction and the no-slip boundary condition in the other. Assuming that the flow is subject to an external random force, white in time and regular in space, we construct an estimator for the viscosity using only observations of the enstrophy. The goal of the paper is to prove that the…

    Submitted 10 July, 2024; originally announced July 2024.

    MSC Class: 35Q30; 37L55; 62M05; 76D06

  4. arXiv:2406.15749  [pdf, ps, other]

    hep-ph

    Decay of CP-even Higgs $H\rightarrow h γγ$ in Two Higgs Doublet Model: (I) one-loop analytic results, ward identity checks

    Authors: Khiem Hong Phan, Dzung Tri Tran, Thanh Huy Nguyen

    Abstract: We present the first analytical expressions for one-loop induced contributions for the decay channels of CP-even Higgs $H\rightarrow h γγ$ with $h$ being standard model-like Higgs boson within the framework of Two Higgs Doublet Model in this paper. One-loop form factors for the decay processes are written in terms of the scalar Passarino-Veltman functions following the general notations of the pac…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 39 pages, 8 Figures, 9 Tables

    Report number: DTU_2024-03

  5. arXiv:2406.14835  [pdf, other]

    cs.CL cs.LG

    ToVo: Toxicity Taxonomy via Voting

    Authors: Tinh Son Luong, Thanh-Thien Le, Thang Viet Doan, Linh Ngo Van, Thien Huu Nguyen, Diep Thi-Ngoc Nguyen

    Abstract: Existing toxic detection models face significant limitations, such as lack of transparency, customization, and reproducibility. These challenges stem from the closed-source nature of their training data and the paucity of explanations for their evaluation mechanism. To address these issues, we propose a dataset creation mechanism that integrates voting and chain-of-thought processes, producing a h…

    Submitted 20 June, 2024; originally announced June 2024.

  6. arXiv:2405.16623  [pdf, other]

    cs.LG cs.AR cs.PF

    Graph neural networks with configuration cross-attention for tensor compilers

    Authors: Dmitrii Khizbullin, Eduardo Rocha de Andrade, Thanh Hau Nguyen, Matheus Pedroza Ferreira, David R. Pugh

    Abstract: With the recent popularity of neural networks comes the need for efficient serving of inference workloads. A neural network inference workload can be represented as a computational graph with nodes as operators transforming multidimensional tensors. The tensors can be transposed and/or tiled in a combinatorially large number of ways, some configurations leading to accelerated inference. We propose…

    Submitted 26 May, 2024; originally announced May 2024.

  7. arXiv:2405.10659  [pdf, other]

    cs.CL cs.AI

    Realistic Evaluation of Toxicity in Large Language Models

    Authors: Tinh Son Luong, Thanh-Thien Le, Linh Ngo Van, Thien Huu Nguyen

    Abstract: Large language models (LLMs) have become integral to our professional workflows and daily lives. Nevertheless, these machine companions of ours have a critical flaw: the huge amount of data which endows them with vast and diverse knowledge, also exposes them to the inevitable toxicity and bias. While most LLMs incorporate defense mechanisms to prevent the generation of harmful content, these safeg…

    Submitted 20 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: Findings of ACL 2024

  8. Q-learning-based Opportunistic Communication for Real-time Mobile Air Quality Monitoring Systems

    Authors: Trung Thanh Nguyen, Truong Thao Nguyen, Dinh Tuan Anh Nguyen, Thanh Hung Nguyen, Phi Le Nguyen

    Abstract: We focus on real-time air quality monitoring systems that rely on devices installed on automobiles in this research. We investigate an opportunistic communication model in which devices can send the measured data directly to the air quality server through a 4G communication channel or via Wi-Fi to adjacent devices or the so-called Road Side Units deployed along the road. We aim to reduce 4G costs…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 2021 IEEE International Conference on Performance, Computing and Communications (IPCCC). arXiv admin note: substantial text overlap with arXiv:2405.01057

  9. Fuzzy Q-Learning-Based Opportunistic Communication for MEC-Enhanced Vehicular Crowdsensing

    Authors: Trung Thanh Nguyen, Truong Thao Nguyen, Thanh Hung Nguyen, Phi Le Nguyen

    Abstract: This study focuses on MEC-enhanced, vehicle-based crowdsensing systems that rely on devices installed on automobiles. We investigate an opportunistic communication paradigm in which devices can transmit measured data directly to a crowdsensing server over a 4G communication channel or to nearby devices or so-called Road Side Units positioned along the road via Wi-Fi. We tackle a new problem that i…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: IEEE Transactions on Network and Service Management

  10. arXiv:2405.00567  [pdf, other]

    eess.IV

    Remote Sensing Data Assimilation with a Chained Hydrologic-hydraulic Model for Flood Forecasting

    Authors: Thanh Huy Nguyen, Andrea Piacentini, Sophie Ricci, Ludovic Cassan, Simon Munier, Quentin Bonassies, Raquel Rodriguez-Suquet

    Abstract: A chained hydrologic-hydraulic model is implemented using predicted runoff from a large-scale hydrologic model (namely ISBA-CTRIP) as inputs to local hydrodynamic models (TELEMAC-2D) to issue forecasts of water level and flood extent. The uncertainties in the hydrological forcing and in friction parameters are reduced by an Ensemble Kalman Filter that jointly assimilates in-situ water levels and f…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 13 pages, 14 figures. Submitted to the IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

  11. arXiv:2404.13417  [pdf, other]

    cs.CV cs.AI

    Efficient and Concise Explanations for Object Detection with Gaussian-Class Activation Mapping Explainer

    Authors: Quoc Khanh Nguyen, Truong Thanh Hung Nguyen, Vo Thanh Khang Nguyen, Van Binh Truong, Tuong Phan, Hung Cao

    Abstract: To address the challenges of providing quick and plausible explanations in Explainable AI (XAI) for object detection models, we introduce the Gaussian Class Activation Mapping Explainer (G-CAME). Our method efficiently generates concise saliency maps by utilizing activation maps from selected layers and applying a Gaussian kernel to emphasize critical image regions for the predicted object. Compar…

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: Canadian AI 2024

  12. arXiv:2404.02417  [pdf, ps, other]

    hep-ph

    One-loop contributions for $A^0 \rightarrow \ell \bar{\ell} V$ with $\ell \equiv e, μ$ and $V\equiv γ, Z$ in Higgs Extensions of the Standard Model

    Authors: Khiem Hong Phan, Dzung Tri Tran, Thanh Huy Nguyen

    Abstract: We present one-loop formulas for the decay of CP-odd Higgs $A^0 \rightarrow \ell \bar{\ell} V$ with $\ell \equiv e, μ$ and $V\equiv γ, Z$ in Higgs Extensions of the Standard Model, considering two higgs doublet model with a complex (and real) scalar, two higgs doublet model as well as triplet higgs model. Analytic results for one-loop amplitudes are expressed in terms of Passarino-Veltman function…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 35 pages

    Report number: DTU_2024-01

  13. arXiv:2404.02246  [pdf, ps, other]

    math.CA

    Matrix-weighted estimates beyond Calderón-Zygmund theory

    Authors: Spyridon Kakaroumpas, Thu Hien Nguyen, Dimitris Vardakis

    Abstract: We investigate matrix-weighted bounds for the sublinear non-kernel operators considered by F. Bernicot, D. Frey, and S. Petermichl. We extend their result to sublinear operators acting upon vector-valued functions. First, we dominate these operators by bilinear convex body sparse forms, adapting a recent general principle due to T. Hytönen. Then we use this domination to derive matrix-weighted bou…

    Submitted 25 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 59 pages

  14. arXiv:2403.14918  [pdf, other]

    cs.LG

    Deep learning-based method for weather forecasting: A case study in Itoshima

    Authors: Yuzhong Cheng, Linh Thi Hoai Nguyen, Akinori Ozaki, Ton Viet Ta

    Abstract: Accurate weather forecasting is of paramount importance for a wide range of practical applications, drawing substantial scientific and societal interest. However, the intricacies of weather systems pose substantial challenges to accurate predictions. This research introduces a multilayer perceptron model tailored for weather forecasting in Itoshima, Kyushu, Japan. Our meticulously designed archite…

    Submitted 21 March, 2024; originally announced March 2024.

  15. arXiv:2403.14395  [pdf, other]

    eess.IV physics.ao-ph

    Early Flood Warning Using Satellite-Derived Convective System and Precipitation Data -- A Retrospective Case Study of Central Vietnam

    Authors: Tran-Vu La, Thanh Huy Nguyen, Patrick Matgen, Marco Chini

    Abstract: This paper addresses the challenges of an early flood warning caused by complex convective systems (CSs), by using Low-Earth Orbit and Geostationary satellite data. We focus on a sequence of extreme events that took place in central Vietnam during October 2020, with a specific emphasis on the events leading up to the floods, i.e., those occurring before October 10th, 2020. In this critical phase,…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in IEEE 2024 International Geoscience & Remote Sensing Symposium (IGARSS 2024)

  16. arXiv:2403.14394  [pdf, other]

    eess.IV

    Assimilation of SWOT Altimetry and Sentinel-1 Flood Extent Observations for Flood Reanalysis -- A Proof-of-Concept

    Authors: Thanh Huy Nguyen, Sophie Ricci, Andrea Piacentini, Charlotte Emery, Raquel Rodriguez Suquet, Santiago Peña Luque

    Abstract: In spite of astonishing advances and developments in remote sensing technologies, meeting the spatio-temporal requirements for flood hydrodynamic modeling remains a great challenge for Earth Observation. The assimilation of multi-source remote sensing data in 2D hydrodynamic models helps to overcome such a challenge. The recently launched Surface Water and Ocean Topography (SWOT) wide-swath…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in IEEE 2024 International Geoscience & Remote Sensing Symposium (IGARSS 2024)

  17. arXiv:2403.11496  [pdf, other]

    cs.RO cs.AI

    MCD: Diverse Large-Scale Multi-Campus Dataset for Robot Perception

    Authors: Thien-Minh Nguyen, Shenghai Yuan, Thien Hoang Nguyen, Pengyu Yin, Haozhi Cao, Lihua Xie, Maciej Wozniak, Patric Jensfelt, Marko Thiel, Justin Ziegenbein, Noel Blunder

    Abstract: Perception plays a crucial role in various robot applications. However, existing well-annotated datasets are biased towards autonomous driving scenarios, while unlabelled SLAM datasets are quickly over-fitted, and often lack environment and domain variations. To expand the frontier of these fields, we introduce a comprehensive dataset named MCD (Multi-Campus Dataset), featuring a wide range of sen…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024

  18. arXiv:2403.01225  [pdf, other]

    cs.RO

    A Cost-Effective Cooperative Exploration and Inspection Strategy for Heterogeneous Aerial System

    Authors: Xinhang Xu, Muqing Cao, Shenghai Yuan, Thien Hoang Nguyen, Thien-Minh Nguyen, Lihua Xie

    Abstract: In this paper, we propose a cost-effective strategy for heterogeneous UAV swarm systems for cooperative aerial inspection. Unlike previous swarm inspection works, the proposed method does not rely on precise prior knowledge of the environment and can complete full 3D surface coverage of objects in any shape. In this work, agents are partitioned into teams, with each drone assigned a different task,…

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: Baseline method of CARIC at CDC 2023, Singapore

  19. arXiv:2402.12525  [pdf, other]

    cs.CV cs.AI

    LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks

    Authors: Truong Thanh Hung Nguyen, Tobias Clement, Phuc Truong Loc Nguyen, Nils Kemmerzell, Van Binh Truong, Vo Thanh Khang Nguyen, Mohamed Abdelaal, Hung Cao

    Abstract: LangXAI is a framework that integrates Explainable Artificial Intelligence (XAI) with advanced vision models to generate textual explanations for visual recognition tasks. Despite XAI advancements, an understanding gap persists for end-users with limited domain knowledge in artificial intelligence and computer vision. LangXAI addresses this by furnishing text-based explanations for classification,…

    Submitted 19 February, 2024; originally announced February 2024.

  20. arXiv:2402.12179  [pdf, other]

    cs.CV cs.AI cs.CY

    Examining Monitoring System: Detecting Abnormal Behavior In Online Examinations

    Authors: Dinh An Ngo, Thanh Dat Nguyen, Thi Le Chi Dang, Huy Hoan Le, Ton Bao Ho, Vo Thanh Khang Nguyen, Truong Thanh Hung Nguyen

    Abstract: Cheating in online exams has become a prevalent issue over the past decade, especially during the COVID-19 pandemic. To address this issue of academic dishonesty, our "Exam Monitoring System: Detecting Abnormal Behavior in Online Examinations" is designed to assist proctors in identifying unusual student behavior. Our system demonstrates high accuracy and speed in detecting cheating in real-time s…

    Submitted 19 February, 2024; originally announced February 2024.

  21. arXiv:2402.03706  [pdf, other]

    cs.RO cs.AI cs.CV

    MMAUD: A Comprehensive Multi-Modal Anti-UAV Dataset for Modern Miniature Drone Threats

    Authors: Shenghai Yuan, Yizhuo Yang, Thien Hoang Nguyen, Thien-Minh Nguyen, Jianfei Yang, Fen Liu, Jianping Li, Han Wang, Lihua Xie

    Abstract: In response to the evolving challenges posed by small unmanned aerial vehicles (UAVs), which possess the potential to transport harmful payloads or independently cause damage, we introduce MMAUD: a comprehensive Multi-Modal Anti-UAV Dataset. MMAUD addresses a critical gap in contemporary threat detection methodologies by focusing on drone detection, UAV-type classification, and trajectory estimati…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted by ICRA 2024

  22. arXiv:2401.09900  [pdf, other]

    cs.CV cs.AI

    XAI-Enhanced Semantic Segmentation Models for Visual Quality Inspection

    Authors: Tobias Clement, Truong Thanh Hung Nguyen, Mohamed Abdelaal, Hung Cao

    Abstract: Visual quality inspection systems, crucial in sectors like manufacturing and logistics, employ computer vision and machine learning for precise, rapid defect detection. However, their unexplained nature can hinder trust, error identification, and system improvement. This paper presents a framework to bolster visual quality inspection by using CAM-based explanations to refine semantic segmentation…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: IEEE ICCE 2024

  23. arXiv:2401.09852  [pdf, other]

    cs.CV cs.AI

    Enhancing the Fairness and Performance of Edge Cameras with Explainable AI

    Authors: Truong Thanh Hung Nguyen, Vo Thanh Khang Nguyen, Quoc Hung Cao, Van Binh Truong, Quoc Khanh Nguyen, Hung Cao

    Abstract: The rising use of Artificial Intelligence (AI) in human detection on Edge camera systems has led to accurate but complex models, challenging to interpret and debug. Our research presents a diagnostic method using Explainable AI (XAI) for model debugging, with expert-driven problem identification and solution creation. Validated on the Bytetrack model in a real-world office Edge network, we found t…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: IEEE ICCE 2024

  24. arXiv:2312.11825  [pdf, other]

    cs.SD eess.AS

    MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation

    Authors: Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Jiaqi Yip, Dianwen Ng, Bin Ma

    Abstract: Our previously proposed MossFormer has achieved promising performance in monaural speech separation. However, it predominantly adopts a self-attention-based MossFormer module, which tends to emphasize longer-range, coarser-scale dependencies, with a deficiency in effectively modelling finer-scale recurrent patterns. In this paper, we introduce a novel hybrid model that provides the capabilities to…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 5 pages, 3 figures, accepted by ICASSP 2024

  25. arXiv:2312.05239  [pdf, other]

    cs.CV

    SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation

    Authors: Thuan Hoang Nguyen, Anh Tran

    Abstract: Despite their ability to generate high-resolution and diverse images from text prompts, text-to-image diffusion models often suffer from slow iterative sampling processes. Model distillation is one of the most effective directions to accelerate these models. However, previous distillation methods fail to retain the generation quality while requiring a significant amount of images for training, eit…

    Submitted 15 July, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: Accepted to CVPR 2024; Github: https://github.com/VinAIResearch/SwiftBrush

  26. arXiv:2311.15341  [pdf, other]

    cs.LG

    Generative Modelling of Stochastic Actions with Arbitrary Constraints in Reinforcement Learning

    Authors: Changyu Chen, Ramesha Karunasena, Thanh Hong Nguyen, Arunesh Sinha, Pradeep Varakantham

    Abstract: Many problems in Reinforcement Learning (RL) seek an optimal policy with large discrete multidimensional yet unordered action spaces; these include problems in randomized allocation of resources such as placements of multiple security resources and emergency response units, etc. A challenge in this setting is that the underlying action space is categorical (discrete and unordered) and large, for w…

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: Accepted in NeurIPS 2023. Website: https://cameron-chen.github.io/flow-iar/

  27. arXiv:2311.14747  [pdf, other]

    cs.CV

    HOMOE: A Memory-Based and Composition-Aware Framework for Zero-Shot Learning with Hopfield Network and Soft Mixture of Experts

    Authors: Do Huu Dat, Po Yuan Mao, Tien Hoang Nguyen, Wray Buntine, Mohammed Bennamoun

    Abstract: Compositional Zero-Shot Learning (CZSL) has emerged as an essential paradigm in machine learning, aiming to overcome the constraints of traditional zero-shot learning by incorporating compositional thinking into its methodology. Conventional zero-shot learning has difficulty managing unfamiliar combinations of seen and unseen classes because it depends on pre-defined class embeddings. In contrast,…

    Submitted 23 November, 2023; originally announced November 2023.

  28. arXiv:2311.11730  [pdf, ps, other]

    math.ST

    Mixing properties for multivariate Hawkes processes

    Authors: Ousmane Boly, Felix Cheysson, Thi Hien Nguyen

    Abstract: Properties of strong mixing have been established for the stationary linear Hawkes process in the univariate case, and can serve as a basis for statistical applications. In this paper, we provide the technical arguments needed to extend the proof to the multivariate case. We illustrate these properties by establishing a functional central limit theorem for multivariate Hawkes processes.

    Submitted 20 November, 2023; originally announced November 2023.

  29. arXiv:2311.02998  [pdf, ps, other]

    hep-ph

    One-loop contributions for $h\rightarrow \ell \bar{\ell}γ$ and $e^-e^+\rightarrow hγ$ in $U(1)_{B-L}$ extension of the standard model

    Authors: Dzung Tri Tran, Thanh Huy Nguyen, Khiem Hong Phan

    Abstract: We present one-loop contributions for $h\rightarrow \ell \bar{\ell}γ$ with $\ell =ν_{e,μ, τ}, e, μ$ and $e^-e^+\rightarrow hγ$ in $U(1)_{B-L}$ extension of the standard models. In phenomenological results, the signal strengths for $h\rightarrow \ell \bar{\ell}γ$ at Large Hadron Collider and for $e^-e^+\rightarrow hγ$ at future Lepton Colliders are analyzed in physical parameter space for both vecto…

    Submitted 30 January, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: 41 pages, to be published in Chinese Physics C

    Report number: DTU2023-03

  30. arXiv:2310.16242  [pdf, other]

    cs.LG cs.CL

    ZzzGPT: An Interactive GPT Approach to Enhance Sleep Quality

    Authors: Yonchanok Khaokaew, Kaixin Ji, Thuc Hanh Nguyen, Hiruni Kegalle, Marwah Alaofi, Hao Xue, Flora D. Salim

    Abstract: This paper explores the intersection of technology and sleep pattern comprehension, presenting a cutting-edge two-stage framework that harnesses the power of Large Language Models (LLMs). The primary objective is to deliver precise sleep predictions paired with actionable feedback, addressing the limitations of existing solutions. This innovative approach involves leveraging the GLOBEM dataset alo…

    Submitted 6 May, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

  31. arXiv:2310.09811  [pdf, other]

    math-ph math.SP quant-ph

    Spacing distribution for quantum Rabi models

    Authors: Daniel Braak, Linh Thi Hoai Nguyen, Cid Reyes-Bustos, Masato Wakayama

    Abstract: The asymmetric quantum Rabi model (AQRM) is a fundamental model in quantum optics describing the interaction of light and matter. Besides its immediate physical interest, the AQRM possesses an intriguing mathematical structure which is far from being completely understood. In this paper, we focus on the distribution of the level spacing, the difference between consecutive eigenvalues of the AQRM i…

    Submitted 9 February, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: 28 pages. 15 figures. The conjecture in Section 4.4 (Theorem 4.5 in the current version) was proved using results published after the previous version. The rest of the manuscript was modified slightly according to this change

    MSC Class: 47B06 (Primary) 81V73; 81R40 (Secondary)

  32. arXiv:2310.06801  [pdf, other]

    cs.LG cs.MA

    Inverse Factorized Q-Learning for Cooperative Multi-agent Imitation Learning

    Authors: The Viet Bui, Tien Mai, Thanh Hong Nguyen

    Abstract: This paper concerns imitation learning (IL) (i.e., the problem of learning to mimic expert behaviors from demonstrations) in cooperative multi-agent systems. The learning problem under consideration poses several challenges, characterized by high-dimensional state and action spaces and intricate inter-agent dependencies. In a single-agent setting, IL has proven to be done efficiently through an inv…

    Submitted 10 October, 2023; originally announced October 2023.

  33. arXiv:2309.12608  [pdf, other]

    eess.AS cs.SD

    SPGM: Prioritizing Local Features for enhanced speech separation performance

    Authors: Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma

    Abstract: Dual-path is a popular architecture for speech separation models (e.g. Sepformer) which splits long sequences into overlapping chunks for its intra- and inter-blocks that separately model intra-chunk local features and inter-chunk global relationships. However, it has been found that inter-blocks, which comprise half a dual-path model's parameters, contribute minimally to performance. Thus, we pro…

    Submitted 10 March, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: This paper was accepted by ICASSP 2024

  34. arXiv:2309.09413  [pdf, other]

    cs.SD eess.AS

    Are Soft Prompts Good Zero-shot Learners for Speech Recognition?

    Authors: Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter-Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma

    Abstract: Large self-supervised pre-trained speech models require computationally expensive fine-tuning for downstream tasks. Soft prompt tuning offers a simple parameter-efficient alternative by utilizing minimal soft prompt guidance, enhancing portability while also maintaining competitive performance. However, not many people understand how and why this is so. In this study, we aim to deepen our understa…

    Submitted 17 September, 2023; originally announced September 2023.

  35. arXiv:2309.09400  [pdf, other]

    cs.CL cs.AI

    CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

    Authors: Thuat Nguyen, Chien Van Nguyen, Viet Dac Lai, Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Ryan A. Rossi, Thien Huu Nguyen

    Abstract: The driving factors behind the development of large language models (LLMs) with impressive learning capabilities are their colossal model sizes and extensive training datasets. Along with the progress in natural language processing, LLMs have been frequently made accessible to the public to foster deeper investigation and applications. However, when it comes to training datasets for these LLMs, es…

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: Ongoing Work

  36. arXiv:2308.10188  [pdf, other]

    cs.LG cs.MA

    Mimicking To Dominate: Imitation Learning Strategies for Success in Multiagent Competitive Games

    Authors: The Viet Bui, Tien Mai, Thanh Hong Nguyen

    Abstract: Training agents in multi-agent competitive games presents significant challenges due to their intricate nature. These challenges are exacerbated by dynamics influenced not only by the environment but also by opponents' strategies. Existing methods often struggle with slow convergence and instability. To address this, we harness the potential of imitation learning to comprehend and anticipate oppon…

    Submitted 20 August, 2023; originally announced August 2023.

  37. arXiv:2307.16039  [pdf, other]

    cs.CL cs.LG

    Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

    Authors: Viet Dac Lai, Chien Van Nguyen, Nghia Trung Ngo, Thuat Nguyen, Franck Dernoncourt, Ryan A. Rossi, Thien Huu Nguyen

    Abstract: A key technology for the development of large language models (LLMs) involves instruction tuning that helps align the models' responses with human expectations to realize impressive learning abilities. Two major approaches for instruction tuning characterize supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF), which are currently applied to produce the best commercia…

    Submitted 1 August, 2023; v1 submitted 29 July, 2023; originally announced July 2023.

  38. arXiv:2307.12949  [pdf, ps, other]

    cs.CL

    Boosting Punctuation Restoration with Data Generation and Reinforcement Learning

    Authors: Viet Dac Lai, Abel Salinas, Hao Tan, Trung Bui, Quan Tran, Seunghyun Yoon, Hanieh Deilamsalehy, Franck Dernoncourt, Thien Huu Nguyen

    Abstract: Punctuation restoration is an important task in automatic speech recognition (ASR) which aims to restore the syntactic structure of generated ASR texts to improve readability. While punctuated texts are abundant from written documents, the discrepancy between written punctuated texts and ASR texts limits the usability of written texts in training punctuation restoration systems for ASR texts. This…

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted at INTERSPEECH 2023, 6 pages

  39. arXiv:2307.09069  [pdf, other]

    cs.CR

    Mitigating Intersection Attacks in Anonymous Microblogging

    Authors: Sarah Abdelwahab Gaballah, Thanh Hoang Long Nguyen, Lamya Abdullah, Ephraim Zimmer, Max Mühlhäuser

    Abstract: Anonymous microblogging systems are known to be vulnerable to intersection attacks due to network churn. An adversary that monitors all communications can leverage the churn to learn who is publishing what with increasing confidence over time. In this paper, we propose a protocol for mitigating intersection attacks in anonymous microblogging systems by grouping users into anonymity sets based on s…

    Submitted 18 July, 2023; originally announced July 2023.

  40. arXiv:2307.04137  [pdf, other]

    cs.CV cs.AI

    A Novel Explainable Artificial Intelligence Model in Image Classification problem

    Authors: Quoc Hung Cao, Truong Thanh Hung Nguyen, Vo Thanh Khang Nguyen, Xuan Phong Nguyen

    Abstract: In recent years, artificial intelligence is increasingly being applied widely in many different fields and has a profound and direct impact on human life. Following this is the need to understand the principles of the model making predictions. Since most of the current high-precision models are black boxes, neither the AI scientist nor the end-user deeply understands what's going on inside these m…

    Submitted 9 July, 2023; originally announced July 2023.

    Comments: Published in the Proceedings of FAIC 2021

  41. Reducing Uncertainties of a Chained Hydrologic-hydraulic Model to Improve Flood Forecasting Using Multi-source Earth Observation Data

    Authors: Thanh Huy Nguyen, Sophie Ricci, Andrea Piacentini, Quentin Bonassies, Raquel Rodriguez Suquet, Santiago Peña Luque, Kevin Marlis, Cédric David

    Abstract: The challenges in operational flood forecasting lie in producing reliable forecasts given constrained computational resources and within processing times that are compatible with near-real-time forecasting. Flood hydrodynamic models exploit observed data from gauge networks, e.g. water surface elevation (WSE) and/or discharge that describe the forcing time-series at the upstream and lateral bounda…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: Copyright 2023 IEEE. Published in the IEEE 2023 International Geoscience & Remote Sensing Symposium (IGARSS 2023), scheduled for July 16 - 21, 2023 in Pasadena, California, USA

    Journal ref: IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, Pasadena, CA, USA, 2023, pp. 1525-1528

  42. arXiv:2306.08798  [pdf, other

    cs.CL stat.ML

    MPSA-DenseNet: A novel deep learning model for English accent classification

    Authors: Tianyu Song, Linh Thi Hoai Nguyen, Ton Viet Ta

    Abstract: This paper presents three innovative deep learning models for English accent classification: Multi-DenseNet, PSA-DenseNet, and MPSA-DenseNet, that combine multi-task learning and the PSA module attention mechanism with DenseNet. We applied these models to data collected from six dialects of English across native English-speaking regions (Britain, the United States, Scotland) and nonnative English…

    Submitted 14 June, 2023; originally announced June 2023.

  43. Dealing With Non-Gaussianity of SAR-derived Wet Surface Ratio for Flood Extent Representation Improvement

    Authors: Thanh Huy Nguyen, Sophie Ricci, Andrea Piacentini, Ehouarn Simon, Raquel Rodriguez Suquet, Santiago Peña Luque

    Abstract: Owing to advances in data assimilation, notably the Ensemble Kalman Filter (EnKF), flood simulation and forecast capabilities have greatly improved in recent years. The motivation of this research work is to comprehensively reduce the uncertainties in the model parameters, forcing and hydraulic state, and consequently improve the overall flood reanalysis and forecast capability, especially in the flood…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: Copyright 2023 IEEE. Published in the IEEE 2023 International Geoscience & Remote Sensing Symposium (IGARSS 2023), scheduled for July 16 - 21, 2023 in Pasadena, California, USA. arXiv admin note: text overlap with arXiv:2304.01058

    Journal ref: IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium, Pasadena, CA, USA, 2023, pp. 1595-1598

  44. arXiv:2306.04527  [pdf, other

    eess.IV cs.CV cs.LG

    ContriMix: Scalable stain color augmentation for domain generalization without domain labels in digital pathology

    Authors: Tan H. Nguyen, Dinkar Juyal, Jin Li, Aaditya Prakash, Shima Nofallah, Chintan Shah, Sai Chowdary Gullapally, Limin Yu, Michael Griffin, Anand Sampat, John Abel, Justin Lee, Amaro Taylor-Weiner

    Abstract: Differences in staining and imaging procedures can cause significant color variations in histopathology images, leading to poor generalization when deploying deep-learning models trained from a different data source. Various color augmentation methods have been proposed to generate synthetic images during training to make models more robust, eliminating the need for stain normalization during test…

    Submitted 8 March, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

  45. arXiv:2306.03400  [pdf, other

    cs.CV cs.AI cs.LG

    G-CAME: Gaussian-Class Activation Mapping Explainer for Object Detectors

    Authors: Quoc Khanh Nguyen, Truong Thanh Hung Nguyen, Vo Thanh Khang Nguyen, Van Binh Truong, Quoc Hung Cao

    Abstract: Nowadays, deep neural networks for object detection in images are very prevalent. However, due to the complexity of these networks, users find it hard to understand why these objects are detected by models. We propose the Gaussian Class Activation Mapping Explainer (G-CAME), which generates a saliency map as the explanation for object detection models. G-CAME can be considered a CAM-based method that…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: 10 figures

  46. arXiv:2306.02744  [pdf, other

    cs.CV cs.AI cs.LG

    Towards Better Explanations for Object Detection

    Authors: Van Binh Truong, Truong Thanh Hung Nguyen, Vo Thanh Khang Nguyen, Quoc Khanh Nguyen, Quoc Hung Cao

    Abstract: Recent advances in Artificial Intelligence (AI) technology have promoted its use in almost every field. The growing complexity of deep neural networks (DNNs) makes it increasingly difficult and important to explain the inner workings and decisions of the network. However, most current techniques for explaining DNNs focus mainly on interpreting classification tasks. This paper proposes a method t…

    Submitted 6 June, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 9 pages, 10 figures

  47. arXiv:2306.02196  [pdf, other

    cs.CL

    Question-Context Alignment and Answer-Context Dependencies for Effective Answer Sentence Selection

    Authors: Minh Van Nguyen, Kishan KC, Toan Nguyen, Thien Huu Nguyen, Ankit Chadha, Thuy Vu

    Abstract: Answer sentence selection (AS2) in open-domain question answering finds the answer to a question by ranking candidate sentences extracted from web documents. Recent work exploits answer context, i.e., sentences around a candidate, by incorporating them as an additional input string to the Transformer models to improve the correctness scoring. In this paper, we propose to improve the candidate scoring by…

    Submitted 3 June, 2023; originally announced June 2023.

    Comments: final copy for INTERSPEECH 2023

  48. arXiv:2305.18458  [pdf, other

    cs.LG

    Conditional Support Alignment for Domain Adaptation with Label Shift

    Authors: Anh T Nguyen, Lam Tran, Anh Tong, Tuan-Duy H. Nguyen, Toan Tran

    Abstract: Unsupervised domain adaptation (UDA) refers to a domain adaptation framework in which a learning model is trained based on the labeled samples on the source domain and unlabelled ones in the target domain. The dominant existing methods in the field that rely on the classical covariate shift assumption to learn domain-invariant feature representation have yielded suboptimal performance under the la…

    Submitted 29 May, 2023; originally announced May 2023.

  49. arXiv:2305.12121  [pdf, other

    cs.SD cs.LG eess.AS

    ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention

    Authors: Jia Qi Yip, Tuan Truong, Dianwen Ng, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma

    Abstract: In this paper, we propose ACA-Net, a lightweight, global context-aware speaker embedding extractor for Speaker Verification (SV) that improves upon existing work by using Asymmetric Cross Attention (ACA) to replace temporal pooling. ACA is able to distill large, variable-length sequences into small, fixed-sized latents by attending a small query to large key and value matrices. In ACA-Net, we buil…

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: Accepted to INTERSPEECH 2023

  50. arXiv:2305.08217  [pdf, other

    nucl-th hep-ex hep-ph nucl-ex

    Neural Network predictions of inclusive electron-nucleus cross sections

    Authors: O. Al Hammal, M. Martini, J. Frontera-Pons, T. H. Nguyen, R. Perez-Ramos

    Abstract: We investigate whether a neural network approach can reproduce and predict the electron-nucleus cross sections in the kinematical domain of present and future accelerator-based neutrino oscillation experiments. For this purpose, we consider the large amount of data available to the community via the web-page ``Quasielastic Electron Nucleus scattering archive'', and use a residual, fully connected…

    Submitted 14 May, 2023; originally announced May 2023.