
Showing 1–50 of 67 results for author: Do, K

  1. arXiv:2407.20249  [pdf, other]

    cs.LG eess.SP

    Revisiting the Disequilibrium Issues in Tackling Heart Disease Classification Tasks

    Authors: Thao Hoang, Linh Nguyen, Khoi Do, Duong Nguyen, Viet Dung Nguyen

    Abstract: In the field of heart disease classification, two primary obstacles arise. Firstly, existing Electrocardiogram (ECG) datasets consistently demonstrate imbalances and biases across various modalities. Secondly, these time-series data consist of diverse lead signals, causing Convolutional Neural Networks (CNNs) to become overfitting to the one with higher power, hence diminishing the performance of…

    Submitted 19 July, 2024; originally announced July 2024.

  2. arXiv:2407.20247  [pdf, other]

    eess.SP cs.AI cs.LG

    How Homogenizing the Channel-wise Magnitude Can Enhance EEG Classification Model?

    Authors: Huyen Ngo, Khoi Do, Duong Nguyen, Viet Dung Nguyen, Lan Dang

    Abstract: A significant challenge in the electroencephalogram EEG lies in the fact that current data representations involve multiple electrode signals, resulting in data redundancy and dominant lead information. However extensive research conducted on EEG classification focuses on designing model architectures without tackling the underlying issues. Otherwise, there has been a notable gap in addressing dat…

    Submitted 19 July, 2024; originally announced July 2024.

  3. arXiv:2407.18839  [pdf, other]

    cs.CV

    Scalable Group Choreography via Variational Phase Manifold Learning

    Authors: Nhat Le, Khoa Do, Xuan Bui, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen

    Abstract: Generating group dance motion from the music is a challenging task with several industrial applications. Although several methods have been proposed to tackle this problem, most of them prioritize optimizing the fidelity in dancing movement, constrained by predetermined dancer counts in datasets. This limitation impedes adaptability to real-world applications. Our study addresses the scalability p…

    Submitted 31 July, 2024; v1 submitted 26 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV 2024

  4. arXiv:2406.07124  [pdf, other]

    cs.AI cs.LG

    CHARME: A chain-based reinforcement learning approach for the minor embedding problem

    Authors: Hoang M. Ngo, Nguyen H K. Do, Minh N. Vu, Tamer Kahveci, My T. Thai

    Abstract: Quantum Annealing (QA) holds great potential for solving combinatorial optimization problems efficiently. However, the effectiveness of QA algorithms heavily relies on the embedding of problem instances, represented as logical graphs, into the quantum unit processing (QPU) whose topology is in form of a limited connectivity graph, known as the minor embedding Problem. Existing methods for the mino…

    Submitted 11 June, 2024; originally announced June 2024.

  5. arXiv:2406.01557  [pdf, other]

    stat.ME stat.AP

    Bayesian compositional regression with flexible microbiome feature aggregation and selection

    Authors: Satabdi Saha, Liangliang Zhang, Kim-Anh Do, Christine B. Peterson

    Abstract: Ongoing advances in microbiome profiling have allowed unprecedented insights into the molecular activities of microbial communities. This has fueled a strong scientific interest in understanding the critical role the microbiome plays in governing human health, by identifying microbial features associated with clinical outcomes of interest. Several aspects of microbiome data limit the applicability…

    Submitted 3 June, 2024; originally announced June 2024.

  6. arXiv:2405.16388  [pdf, other]

    cs.CL cs.LG

    Multi-Reference Preference Optimization for Large Language Models

    Authors: Hung Le, Quan Tran, Dung Nguyen, Kien Do, Saloni Mittal, Kelechi Ogueji, Svetha Venkatesh

    Abstract: How can Large Language Models (LLMs) be aligned with human intentions and values? A typical solution is to gather human preference on model outputs and finetune the LLMs accordingly while ensuring that updates do not deviate too far from a reference model. Recent approaches, such as direct preference optimization (DPO), have eliminated the need for unstable and sluggish reinforcement learning opti…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 20 pages

  7. arXiv:2404.11870  [pdf, ps, other]

    cs.LG cs.CL

    Enhancing Length Extrapolation in Sequential Models with Pointer-Augmented Neural Memory

    Authors: Hung Le, Dung Nguyen, Kien Do, Svetha Venkatesh, Truyen Tran

    Abstract: We propose Pointer-Augmented Neural Memory (PANM) to help neural networks understand and apply symbol processing to new, longer sequences of data. PANM integrates an external neural memory that uses novel physical addresses and pointer manipulation techniques to mimic human and computer symbol processing abilities. PANM facilitates pointer assignment, dereference, and arithmetic by explicitly usin…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Preprint

  8. arXiv:2404.05393  [pdf, other]

    cs.CV cs.AI

    PAT: Pixel-wise Adaptive Training for Long-tailed Segmentation

    Authors: Khoi Do, Duong Nguyen, Nguyen H. Tran, Viet Dung Nguyen

    Abstract: Beyond class frequency, we recognize the impact of class-wise relationships among various class-specific predictions and the imbalance in label masks on long-tailed segmentation learning. To address these challenges, we propose an innovative Pixel-wise Adaptive Training (PAT) technique tailored for long-tailed segmentation. PAT has two key features: 1) class-wise gradient magnitude homogenization,…

    Submitted 10 July, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  9. arXiv:2403.09986  [pdf, other]

    cs.CY cs.HC cs.SI

    Designing Sousveillance Tools for Gig Workers

    Authors: Maya De Los Santos, Kimberly Do, Michael Muller, Saiph Savage

    Abstract: As independently-contracted employees, gig workers disproportionately suffer the consequences of workplace surveillance, which include increased pressures to work, breaches of privacy, and decreased digital autonomy. Despite the negative impacts of workplace surveillance, gig workers lack the tools, strategies, and workplace social support to protect themselves against these harms. Meanwhile, some…

    Submitted 23 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: Published as a conference paper at the ACM Conference on Human Factors in Computing Systems, CHI 2024, 3 figures, 30 pages

  10. arXiv:2403.09875  [pdf, other]

    cs.RO cs.CV

    Touch-GS: Visual-Tactile Supervised 3D Gaussian Splatting

    Authors: Aiden Swann, Matthew Strong, Won Kyung Do, Gadiel Sznaier Camps, Mac Schwager, Monroe Kennedy III

    Abstract: In this work, we propose a novel method to supervise 3D Gaussian Splatting (3DGS) scenes using optical tactile sensors. Optical tactile sensors have become widespread in their use in robotics for manipulation and object representation; however, raw optical tactile sensor data is unsuitable to directly supervise a 3DGS scene. Our representation leverages a Gaussian Process Implicit Surface to impli…

    Submitted 18 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 8 pages, 7 figures

  11. arXiv:2403.08997  [pdf, other]

    cs.CV cs.RO

    Caltech Aerial RGB-Thermal Dataset in the Wild

    Authors: Connor Lee, Matthew Anderson, Nikhil Raganathan, Xingxing Zuo, Kevin Do, Georgia Gkioxari, Soon-Jo Chung

    Abstract: We present the first publicly-available RGB-thermal dataset designed for aerial robotics operating in natural environments. Our dataset captures a variety of terrain across the United States, including rivers, lakes, coastlines, deserts, and forests, and consists of synchronized RGB, thermal, global positioning, and inertial data. We provide semantic segmentation annotations for 10 classes commonl…

    Submitted 31 July, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted to ECCV 2024

  12. arXiv:2402.03577  [pdf, other]

    cs.LG

    Revisiting the Dataset Bias Problem from a Statistical Perspective

    Authors: Kien Do, Dung Nguyen, Hung Le, Thao Le, Dang Nguyen, Haripriya Harikumar, Truyen Tran, Santu Rana, Svetha Venkatesh

    Abstract: In this paper, we study the "dataset bias" problem from a statistical standpoint, and identify the main cause of the problem as the strong correlation between a class attribute u and a non-class attribute b in the input x, represented by p(u|b) differing significantly from p(u). Since p(u|b) appears as part of the sampling distributions in the standard maximum log-likelihood (MLL) objective, a mod…

    Submitted 5 February, 2024; originally announced February 2024.

  13. arXiv:2402.02977  [pdf, other]

    cs.LG cs.AI

    Variational Flow Models: Flowing in Your Style

    Authors: Kien Do, Duc Kieu, Toan Nguyen, Dang Nguyen, Hung Le, Dung Nguyen, Thin Nguyen

    Abstract: We introduce "posterior flows" - generalizations of "probability flows" to a broader class of stochastic processes not necessarily diffusion processes - and propose a systematic training-free method to transform the posterior flow of a "linear" stochastic process characterized by the equation Xt = at * X0 + st * X1 into a straight constant-speed (SC) flow, reminiscent of Rectified Flow. This trans…

    Submitted 29 March, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  14. arXiv:2310.18986  [pdf, other]

    cs.CV

    Controllable Group Choreography using Contrastive Diffusion

    Authors: Nhat Le, Tuong Do, Khoa Do, Hien Nguyen, Erman Tjiputra, Quang D. Tran, Anh Nguyen

    Abstract: Music-driven group choreography poses a considerable challenge but holds significant potential for a wide range of industrial applications. The ability to generate synchronized and visually appealing group dance motions that are aligned with music opens up opportunities in many fields such as entertainment, advertising, and virtual performances. However, most of the recent works are not able to ge…

    Submitted 3 November, 2023; v1 submitted 29 October, 2023; originally announced October 2023.

  15. arXiv:2310.18598  [pdf, other]

    cs.LG cs.CV

    Domain Generalisation via Risk Distribution Matching

    Authors: Toan Nguyen, Kien Do, Bao Duong, Thin Nguyen

    Abstract: We propose a novel approach for domain generalisation (DG) leveraging risk distributions to characterise domains, thereby achieving domain invariance. In our findings, risk distributions effectively highlight differences between training domains and reveal their inherent complexities. In testing, we may observe similar, or potentially intensifying in magnitude, divergences between risk distributio…

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted at 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

  16. Parameter-Efficient Methods for Metastases Detection from Clinical Notes

    Authors: Maede Ashofteh Barabadi, Xiaodan Zhu, Wai Yip Chan, Amber L. Simpson, Richard K. G. Do

    Abstract: Understanding the progression of cancer is crucial for defining treatments for patients. The objective of this study is to automate the detection of metastatic liver disease from free-style computed tomography (CT) radiology reports. Our research demonstrates that transferring knowledge using three approaches can improve model performance. First, we utilize generic language models (LMs), pretraine…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 6 pages, 1 figure, The 36th Canadian Conference on Artificial Intelligence

    Journal ref: Barabadi, M. A., Zhu, X., Chan, W. Y., Simpson, A. L., & Do, R. K. G. (2023). Parameter-Efficient Methods for Metastases Detection from Clinical Notes. Proceedings of the Canadian Conference on Artificial Intelligence

  17. arXiv:2309.14053  [pdf, other]

    cs.LG cs.AI

    Revisiting LARS for Large Batch Training Generalization of Neural Networks

    Authors: Khoi Do, Duong Nguyen, Hoa Nguyen, Long Tran-Thanh, Nguyen-Hoang Tran, Quoc-Viet Pham

    Abstract: This paper explores Large Batch Training techniques using layer-wise adaptive scaling ratio (LARS) across diverse settings, uncovering insights. LARS algorithms with warm-up tend to be trapped in sharp minimizers early on due to redundant ratio scaling. Additionally, a fixed steep decline in the latter phase restricts deep neural networks from effectively navigating early-phase sharp minimizers. B…

    Submitted 15 February, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

  18. arXiv:2309.08860  [pdf, other]

    cs.RO

    DenseTact-Mini: An Optical Tactile Sensor for Grasping Multi-Scale Objects From Flat Surfaces

    Authors: Won Kyung Do, Ankush Kundan Dhawan, Mathilda Kitzmann, Monroe Kennedy III

    Abstract: Dexterous manipulation, especially of small daily objects, continues to pose complex challenges in robotics. This paper introduces the DenseTact-Mini, an optical tactile sensor with a soft, rounded, smooth gel surface and compact design equipped with a synthetic fingernail. We propose three distinct grasping strategies: tap grasping using adhesion forces such as electrostatic and van der Waals, fi…

    Submitted 15 September, 2023; originally announced September 2023.

  19. arXiv:2309.08109  [pdf, other]

    stat.ME

    CAT: a conditional association test for microbiome data using a leave-out approach

    Authors: Yushu Shi, Liangliang Zhang, Kim-Anh Do, Robert R. Jenq, Christine B. Peterson

    Abstract: In microbiome analysis, researchers often seek to identify taxonomic features associated with an outcome of interest. However, microbiome features are intercorrelated and linked by phylogenetic relationships, making it challenging to assess the association between an individual feature and an outcome. Researchers have developed global tests for the association of microbiome profiles with outcomes…

    Submitted 14 September, 2023; originally announced September 2023.

  20. Towards Optimal Patch Size in Vision Transformers for Tumor Segmentation

    Authors: Ramtin Mojtahedi, Mohammad Hamghalam, Richard K. G. Do, Amber L. Simpson

    Abstract: Detection of tumors in metastatic colorectal cancer (mCRC) plays an essential role in the early diagnosis and treatment of liver cancer. Deep learning models backboned by fully convolutional neural networks (FCNNs) have become the dominant model for segmenting 3D computerized tomography (CT) scans. However, since their convolution layers suffer from limited kernel size, they are not able to captur…

    Submitted 31 August, 2023; originally announced August 2023.

    Journal ref: Multiscale Multimodal Medical Imaging. MMMI 2022. Lecture Notes in Computer Science, vol 13594. Springer, Cham

  21. arXiv:2308.16480  [pdf, other]

    cs.RO

    Inter-finger Small Object Manipulation with DenseTact Optical Tactile Sensor

    Authors: Won Kyung Do, Bianca Aumann, Camille Chungyoun, Monroe Kennedy III

    Abstract: The ability to grasp and manipulate small objects in cluttered environments remains a significant challenge. This paper introduces a novel approach that utilizes a tactile sensor-equipped gripper with eight degrees of freedom to overcome these limitations. We employ DenseTact 2.0 for the gripper, enabling precise control and improved grasp success rates, particularly for small objects ranging from…

    Submitted 31 August, 2023; originally announced August 2023.

  22. arXiv:2308.15932  [pdf, other]

    eess.IV cs.CV

    Attention-based CT Scan Interpolation for Lesion Segmentation of Colorectal Liver Metastases

    Authors: Mohammad Hamghalam, Richard K. G. Do, Amber L. Simpson

    Abstract: Small liver lesions common to colorectal liver metastases (CRLMs) are challenging for convolutional neural network (CNN) segmentation models, especially when we have a wide range of slice thicknesses in the computed tomography (CT) scans. Slice thickness of CT images may vary by clinical indication. For example, thinner slices are used for presurgical planning when fine anatomic details of small v…

    Submitted 30 August, 2023; originally announced August 2023.

    Journal ref: Proc. SPIE 12468, Medical Imaging 2023: Biomedical Applications in Molecular, Structural, and Functional Imaging, 124680U (10 April 2023)

  23. arXiv:2308.13737  [pdf, other]

    stat.AP

    survivalContour: Visualizing predicted survival via colored contour plots

    Authors: Yushu Shi, Liangliang Zhang, Kim-Anh Do, Robert R. Jenq, Christine B. Peterson

    Abstract: Advances in survival analysis have facilitated unprecedented flexibility in data modeling, yet there remains a lack of tools for graphically illustrating the influence of continuous covariates on predicted survival outcomes. We propose the utilization of a colored contour plot to depict the predicted survival probabilities over time, and provide a Shiny app and R package as implementations of this…

    Submitted 12 January, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

  24. Embedded Object Detection and Mapping in Soft Materials Using Optical Tactile Sensing

    Authors: Jose A. Solano-Castellanos, Won Kyung Do, Monroe Kennedy III

    Abstract: In this paper, we present a methodology that uses an optical tactile sensor for efficient tactile exploration of embedded objects within soft materials. The methodology consists of an exploration phase, where a probabilistic estimate of the location of the embedded objects is built using a Bayesian approach. The exploration phase is then followed by a mapping phase which exploits the probabilistic…

    Submitted 21 August, 2023; originally announced August 2023.

    Journal ref: Springer Nature Computer Science, Vol 5, Article 372, 2024

  25. arXiv:2308.04836  [pdf, other]

    cs.LG

    Beyond Surprise: Improving Exploration Through Surprise Novelty

    Authors: Hung Le, Kien Do, Dung Nguyen, Svetha Venkatesh

    Abstract: We present a new computing model for intrinsic rewards in reinforcement learning that addresses the limitations of existing surprise-driven explorations. The reward is the novelty of the surprise rather than the surprise norm. We estimate the surprise novelty as retrieval errors of a memory network wherein the memory stores and reconstructs surprises. Our surprise memory (SM) augments the capabili…

    Submitted 30 January, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: 17 pages including Appendix

  26. arXiv:2307.16044  [pdf, other]

    q-bio.PE physics.bio-ph

    A Schrödinger Equation for Evolutionary Dynamics

    Authors: Vi D. Ao, Duy V. Tran, Kien T. Pham, Duc M. Nguyen, Huy D. Tran, Tuan K. Do, Van H. Do, Trung V. Phan

    Abstract: We establish an analogy between the Fokker-Planck equation describing evolutionary landscape dynamics and the Schrödinger equation which characterizes quantum mechanical particles, showing how a population with multiple genetic traits evolves analogously to a wavefunction under a multi-dimensional energy potential in imaginary time. Furthermore, we discover within this analogy that the stationary…

    Submitted 31 August, 2023; v1 submitted 29 July, 2023; originally announced July 2023.

    Journal ref: Quantum Rep. 2023, 5(4), 659-682

  27. arXiv:2304.08329  [pdf, ps, other]

    math.NT math.AG

    Computing the Weil representation of a superelliptic curve

    Authors: Irene I. Bouw, Duc Khoi Do, Stefan Wewers

    Abstract: We study the Weil representation $ρ$ of a curve over a $p$-adic field with potential reduction of compact type. We show that $ρ$ can be reconstructed from its stable reduction. For superelliptic curves of the form $y^n=f(x)$ at primes $p$ whose residue characteristic is prime to the exponent $n$ we make this explicit.

    Submitted 30 October, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    MSC Class: 11F80 (primary); 14H25; 11G20; 11S40 (secondary)

  28. "That's important, but...": How Computer Science Researchers Anticipate Unintended Consequences of Their Research Innovations

    Authors: Kimberly Do, Rock Yuren Pang, Jiachen Jiang, Katharina Reinecke

    Abstract: Computer science research has led to many breakthrough innovations but has also been scrutinized for enabling technology that has negative, unintended consequences for society. Given the increasing discussions of ethics in the news and among researchers, we interviewed 20 researchers in various CS sub-disciplines to identify whether and how they consider potential unintended consequences of their…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Corresponding author: Rock Yuren Pang, email provided below. Kimberly Do and Rock Yuren Pang contributed equally to this research. The author order is listed alphabetically. To appear in CHI Conference on Human Factors in Computing Systems (CHI '23), April 23-April 28, 2023, Hamburg, Germany. ACM, New York, NY, USA, 16 pages

  29. arXiv:2303.09084  [pdf, other]

    q-bio.PE physics.bio-ph

    Stress-Induced Mutagenesis Can Further Boost Population Success in Static Ecology

    Authors: Kien T. Pham, Duc M. Nguyen, Duy V. Tran, Vi D. Ao, Huy D. Tran, Tuan K. Do, Trung V. Phan

    Abstract: We have developed a mathematical model that captures stress-induced mutagenesis, a fundamental aspect of pathogenic and neoplastic evolutionary dynamics, on the fitness landscape with multiple relevant genetic traits as a high-dimensional Euclidean space. In this framework, stress-induced mutagenesis manifests as a heterogeneous diffusion process. We show how increasing mutations, and thus reducin…

    Submitted 16 March, 2023; originally announced March 2023.

  30. arXiv:2303.06751  [pdf, ps, other]

    math.NT

    Diagonal cycles and anticyclotomic Iwasawa theory of modular forms

    Authors: Francesc Castella, Kim Tuan Do

    Abstract: We construct a new anticyclotomic Euler system (in the sense of Jetchev-Nekovar-Skinner) for the Galois representation $V_{f,χ}$ attached to a newform $f$ of weight $k\geq 2$ twisted by an anticyclotomic Hecke character $χ$. We then show some arithmetic applications of the constructed Euler system, including new results on the Bloch-Kato conjecture in ranks zero and one, and a divisibility towards…

    Submitted 12 March, 2023; originally announced March 2023.

    Comments: 50 pages

  31. arXiv:2301.06926  [pdf, ps, other]

    cs.AI cs.LG

    Memory-Augmented Theory of Mind Network

    Authors: Dung Nguyen, Phuoc Nguyen, Hung Le, Kien Do, Svetha Venkatesh, Truyen Tran

    Abstract: Social reasoning necessitates the capacity of theory of mind (ToM), the ability to contextualise and attribute mental states to others without having access to their internal cognitive structure. Recent machine learning approaches to ToM have demonstrated that we can train the observer to read the past and present behaviours of other agents and infer their beliefs (including false beliefs about th…

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: Accepted for publication at AAAI 2023

  32. Causal Inference via Style Transfer for Out-of-distribution Generalisation

    Authors: Toan Nguyen, Kien Do, Duc Thanh Nguyen, Bao Duong, Thin Nguyen

    Abstract: Out-of-distribution (OOD) generalisation aims to build a model that can generalise well on an unseen target domain using knowledge from multiple source domains. To this end, the model should seek the causal dependence between inputs and labels, which may be determined by the semantics of inputs and remain invariant across domains. However, statistical or non-causal methods often cannot capture thi…

    Submitted 10 June, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Journal ref: In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 23), August 6-10, 2023, Long Beach, CA, USA. ACM, New York, NY, USA, 19 pages

  33. arXiv:2211.10812  [pdf, other]

    cs.CV cs.AI

    Face Swapping as A Simple Arithmetic Operation

    Authors: Truong Vu, Kien Do, Khang Nguyen, Khoat Than

    Abstract: We propose a novel high-fidelity face swapping method called "Arithmetic Face Swapping" (AFS) that explicitly disentangles the intermediate latent space W+ of a pretrained StyleGAN into the "identity" and "style" subspaces so that a latent code in W+ is the sum of an "identity" code and a "style" code in the corresponding subspaces. Via our disentanglement, face swapping (FS) can be regarded as a…

    Submitted 3 February, 2023; v1 submitted 19 November, 2022; originally announced November 2022.

  34. arXiv:2209.10359  [pdf, other]

    cs.CV cs.AI

    Momentum Adversarial Distillation: Handling Large Distribution Shifts in Data-Free Knowledge Distillation

    Authors: Kien Do, Hung Le, Dung Nguyen, Dang Nguyen, Haripriya Harikumar, Truyen Tran, Santu Rana, Svetha Venkatesh

    Abstract: Data-free Knowledge Distillation (DFKD) has attracted attention recently thanks to its appealing capability of transferring knowledge from a teacher network to a student network without using training data. The main idea is to use a generator to synthesize data for training the student. As the generator gets updated, the distribution of synthetic data will change. Such distribution shift could be…

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: Accepted to NeurIPS 2022

  35. arXiv:2209.10122  [pdf, other]

    cs.RO

    DenseTact 2.0: Optical Tactile Sensor for Shape and Force Reconstruction

    Authors: Won Kyung Do, Bianca Jurewicz, Monroe Kennedy III

    Abstract: Collaborative robots stand to have an immense impact on both human welfare in domestic service applications and industrial superiority in advanced manufacturing with dexterous assembly. The outstanding challenge is providing robotic fingertips with a physical design that makes them adept at performing dexterous tasks that require high-resolution, calibrated shape reconstruction and force sensing.…

    Submitted 4 March, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

  36. Multiple Instance Neuroimage Transformer

    Authors: Ayush Singla, Qingyu Zhao, Daniel K. Do, Yuyin Zhou, Kilian M. Pohl, Ehsan Adeli

    Abstract: For the first time, we propose using a multiple instance learning based convolution-free transformer model, called Multiple Instance Neuroimage Transformer (MINiT), for the classification of T1weighted (T1w) MRIs. We first present several variants of transformer models adopted for neuroimages. These models extract non-overlapping 3D blocks from the input volume and perform multi-headed self-attent…

    Submitted 19 August, 2022; originally announced August 2022.

  37. arXiv:2207.14753  [pdf, other]

    stat.ME

    Estimating Causal Effects with Hidden Confounding using Instrumental Variables and Environments

    Authors: James P. Long, Hongxu Zhu, Kim-Anh Do, Min Jin Ha

    Abstract: Recent works have proposed regression models which are invariant across data collection environments. These estimators often have a causal interpretation under conditions on the environments and type of invariance imposed. One recent example, the Causal Dantzig (CD), is consistent under hidden confounding and represents an alternative to classical instrumental variable estimators such as Two Stage…

    Submitted 9 November, 2023; v1 submitted 29 July, 2022; originally announced July 2022.

    Comments: 32 pages, 7 figures, 4 tables

  38. arXiv:2207.12106  [pdf, other]

    cs.CV cs.AI cs.LG

    Black-box Few-shot Knowledge Distillation

    Authors: Dang Nguyen, Sunil Gupta, Kien Do, Svetha Venkatesh

    Abstract: Knowledge distillation (KD) is an efficient approach to transfer the knowledge from a large "teacher" network to a smaller "student" network. Traditional KD methods require lots of labeled training samples and a white-box teacher (parameters are accessible) to train a good student. However, these resources are not always available in real-world applications. The distillation process often happens…

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: To appear at ECCV 2022

  39. arXiv:2207.09991  [pdf, other]

    stat.AP q-bio.QM

    Causal Models, Prediction, and Extrapolation in Cell Line Perturbation Experiments

    Authors: James P. Long, Yumeng Yang, Kim-Anh Do

    Abstract: In cell line perturbation experiments, a collection of cells is perturbed with external agents (e.g. drugs) and responses such as protein expression measured. Due to cost constraints, only a small fraction of all possible perturbations can be tested in vitro. This has led to the development of computational (in silico) models which can predict cellular responses to perturbations. Perturbations wit…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: 13 pages, 4 figures

  40. arXiv:2207.03895  [pdf, other]

    cs.CV

    Defense Against Multi-target Trojan Attacks

    Authors: Haripriya Harikumar, Santu Rana, Kien Do, Sunil Gupta, Wei Zong, Willy Susilo, Svetha Venkastesh

    Abstract: Adversarial attacks on deep learning-based models pose a significant threat to the current AI infrastructure. Among them, Trojan attacks are the hardest to defend against. In this paper, we first introduce a variation of the Badnet kind of attacks that introduces Trojan backdoors to multiple target classes and allows triggers to be placed anywhere in the image. The former makes it more potent and…

    Submitted 8 July, 2022; originally announced July 2022.

  41. arXiv:2204.09315  [pdf, ps, other]

    cs.LG

    Learning to Constrain Policy Optimization with Virtual Trust Region

    Authors: Hung Le, Thommen Karimpanal George, Majid Abdolshah, Dung Nguyen, Kien Do, Sunil Gupta, Svetha Venkatesh

    Abstract: We introduce a constrained optimization method for policy gradient reinforcement learning, which uses a virtual trust region to regulate each policy update. In addition to using the proximity of one single old policy as the normal trust region, we propose forming a second trust region through another virtual policy representing a wide range of past policies. We then enforce the new policy to stay…

    Submitted 15 September, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: Preprint, 22 pages

  42. arXiv:2204.09047  [pdf, ps, other]

    cs.LG cs.AI

    Learning Theory of Mind via Dynamic Traits Attribution

    Authors: Dung Nguyen, Phuoc Nguyen, Hung Le, Kien Do, Svetha Venkatesh, Truyen Tran

    Abstract: Machine learning of Theory of Mind (ToM) is essential to build social agents that co-live with humans and other agents. This capacity, once acquired, will help machines infer the mental states of others from observed contextual action trajectories, enabling future prediction of goals, intention, actions and successor representations. The underlying mechanism for such a prediction remains unclear,…

    Submitted 17 April, 2022; originally announced April 2022.

    Comments: Accepted for publication at AAMAS 2022

  43. arXiv:2202.12154  [pdf, other]

    cs.CR cs.AI cs.CV cs.LG

    Towards Effective and Robust Neural Trojan Defenses via Input Filtering

    Authors: Kien Do, Haripriya Harikumar, Hung Le, Dung Nguyen, Truyen Tran, Santu Rana, Dang Nguyen, Willy Susilo, Svetha Venkatesh

    Abstract: Trojan attacks on deep neural networks are both dangerous and surreptitious. Over the past few years, Trojan attacks have advanced from using only a single input-agnostic trigger and targeting only one class to using multiple, input-specific triggers and targeting multiple classes. However, Trojan defenses have not caught up with this development. Most defense methods still make inadequate assumpt…

    Submitted 14 February, 2023; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: Accepted to ECCV 2022

  44. arXiv:2201.01367  [pdf, other]

    cs.RO cs.CV

    DenseTact: Optical Tactile Sensor for Dense Shape Reconstruction

    Authors: Won Kyung Do, Monroe Kennedy III

    Abstract: Increasing the performance of tactile sensing in robots enables versatile, in-hand manipulation. Vision-based tactile sensors have been widely used as rich tactile feedback has been shown to be correlated with increased performance in manipulation tasks. Existing tactile sensor solutions with high resolution have limitations that include low accuracy, expensive components, or lack of scalability.…

    Submitted 8 March, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

  45. arXiv:2112.01853  [pdf, other]

    cs.LG cs.MA

    Episodic Policy Gradient Training

    Authors: Hung Le, Majid Abdolshah, Thommen K. George, Kien Do, Dung Nguyen, Svetha Venkatesh

    Abstract: We introduce a novel training procedure for policy gradient methods wherein episodic memory is used to optimize the hyperparameters of reinforcement learning algorithms on-the-fly. Unlike other hyperparameter searches, we formulate hyperparameter scheduling as a standard Markov Decision Process and use episodic memory to store the outcome of used hyperparameters and their training contexts. At any…

    Submitted 3 December, 2021; originally announced December 2021.

    Comments: 19 pages

  46. arXiv:2110.13414  [pdf, ps, other]

    cs.CV cs.CR

    Semantic Host-free Trojan Attack

    Authors: Haripriya Harikumar, Kien Do, Santu Rana, Sunil Gupta, Svetha Venkatesh

    Abstract: In this paper, we propose a novel host-free Trojan attack with triggers that are fixed in the semantic space but not necessarily in the pixel space. In contrast to existing Trojan attacks which use clean input images as hosts to carry small, meaningless trigger patterns, our attack considers triggers as full-sized images belonging to a semantically meaningful object class. Since in our attack, the…

    Submitted 26 October, 2021; originally announced October 2021.

  47. arXiv:2107.11635  [pdf, other]

    cs.CV cs.AI

    Clustering by Maximizing Mutual Information Across Views

    Authors: Kien Do, Truyen Tran, Svetha Venkatesh

    Abstract: We propose a novel framework for image clustering that incorporates joint representation learning and clustering. Our method consists of two heads that share the same backbone network - a "representation learning" head and a "clustering" head. The "representation learning" head captures fine-grained patterns of objects at the instance level which serve as clues for the "clustering" head to extract…

    Submitted 24 July, 2021; originally announced July 2021.

    Comments: Accepted at ICCV 2021

  48. arXiv:2106.05735  [pdf, other]

    eess.IV cs.CV cs.LG

    The Medical Segmentation Decathlon

    Authors: Michela Antonelli, Annika Reinke, Spyridon Bakas, Keyvan Farahani, Annette Kopp-Schneider, Bennett A. Landman, Geert Litjens, Bjoern Menze, Olaf Ronneberger, Ronald M. Summers, Bram van Ginneken, Michel Bilello, Patrick Bilic, Patrick F. Christ, Richard K. G. Do, Marc J. Gollub, Stephan H. Heckers, Henkjan Huisman, William R. Jarnagin, Maureen K. McHugo, Sandy Napel, Jennifer S. Goli Pernicka, Kawal Rhode, Catalina Tobon-Gomez, Eugene Vorontsov, et al. (34 additional authors not shown)

    Abstract: International challenges have become the de facto standard for comparative assessment of image analysis algorithms given a specific task. Segmentation is so far the most widely investigated medical image processing task, but the various segmentation challenges have typically been organized in isolation, such that algorithm development was driven by the need to tackle a single specific clinical pro…

    Submitted 10 June, 2021; originally announced June 2021.

    MSC Class: 68T07

  49. arXiv:2105.04722  [pdf, other]

    physics.class-ph

    On the Electrostatic Interaction between Point Charges due to Dielectrical Shielding

    Authors: Long T. Nguyen, Kim Tuan Do, Duy V. Nguyen, Trung Phan

    Abstract: How will the electrostatic interaction between two point charges change if they are shielded from each other by a dielectrical slab? While the physical setting of this electromagnetic problem is relatively simple, it is easy to get wrong and the correct solution is surprisingly complicated. Here we will show a general answer using the method of images, in which the electrical field are not found b…

    Submitted 31 October, 2022; v1 submitted 10 May, 2021; originally announced May 2021.

    Journal ref: Progress In Electromagnetics Research Letters, Vol. 107, 111-118, 2022

  50. arXiv:2103.01554  [pdf]

    physics.optics

    Sharp spectral variations of the ultrafast transient light extinction by bimetallic nanoparticles in the near-UV

    Authors: Tadele Otomalo, Lorenzo Di Mario, Cyrille Hamon, Doru Constantin, Khanh-Van Do, Patrick O'Keeffe, Daniele Catone, Alessandra Paladini, Bruno Palpant

    Abstract: Noble metal nanoparticles exhibit localized plasmon resonance modes that span the visible and near-infrared spectral ranges and have many applications. Modifying the size, shape, and composition of the nanoparticles changes the number of modes and their properties. The characteristics of these modes are transiently affected when illuminating the nano-objects with ultrashort laser pulses. Here, we…

    Submitted 2 March, 2021; originally announced March 2021.

    Journal ref: Advanced Optical Materials, Wiley, 2021, pp.2001778