
Showing 1–50 of 52 results for author: Tong, A

  1. arXiv:2406.14794  [pdf, other]

    eess.IV cs.CV cs.LG

    ImageFlowNet: Forecasting Multiscale Trajectories of Disease Progression with Irregularly-Sampled Longitudinal Medical Images

    Authors: Chen Liu, Ke Xu, Liangbo L. Shen, Guillaume Huguet, Zilong Wang, Alexander Tong, Danilo Bzdok, Jay Stewart, Jay C. Wang, Lucian V. Del Priore, Smita Krishnaswamy

    Abstract: The forecasting of disease progression from images is a holy grail for clinical decision making. However, this task is complicated by the inherent high dimensionality, temporal sparsity and sampling irregularity in longitudinal image acquisitions. Existing methods often rely on extracting hand-crafted features and performing time-series analysis in this vector space, leading to a loss of rich spat…

    Submitted 2 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Included reference to codebase. Added acknowledgements

  2. arXiv:2405.20313  [pdf, other]

    cs.LG q-bio.BM

    Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation

    Authors: Guillaume Huguet, James Vuckovic, Kilian Fatras, Eric Thibodeau-Laufer, Pablo Lemos, Riashat Islam, Cheng-Hao Liu, Jarrid Rector-Brooks, Tara Akhound-Sadegh, Michael Bronstein, Alexander Tong, Avishek Joey Bose

    Abstract: Proteins are essential for almost all biological processes and derive their diverse functions from complex 3D structures, which are in turn determined by their amino acid sequences. In this paper, we exploit the rich biological inductive bias of amino acid sequences and introduce FoldFlow-2, a novel sequence-conditioned SE(3)-equivariant flow matching model for protein structure generation. FoldFl…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: preprint

  3. arXiv:2405.14780  [pdf, other]

    cs.LG stat.ML

    Metric Flow Matching for Smooth Interpolations on the Data Manifold

    Authors: Kacper Kapusniak, Peter Potaptchik, Teodora Reu, Leo Zhang, Alexander Tong, Michael Bronstein, Avishek Joey Bose, Francesco Di Giovanni

    Abstract: Matching objectives underpin the success of modern generative models and rely on constructing conditional paths that transform a source distribution into a target distribution. Despite being a fundamental building block, conditional paths have been designed principally under the assumption of Euclidean geometry, resulting in straight interpolations. However, this can be particularly restrictive fo…

    Submitted 23 May, 2024; originally announced May 2024.

  4. arXiv:2403.09493  [pdf, other]

    cs.CV

    Anomaly Detection by Adapting a pre-trained Vision Language Model

    Authors: Yuxuan Cai, Xinwei He, Dingkang Liang, Ao Tong, Xiang Bai

    Abstract: Recently, large vision and language models have shown their success when adapting them to many downstream tasks. In this paper, we present a unified framework named CLIP-ADA for Anomaly Detection by Adapting a pre-trained CLIP model. To this end, we make two important improvements: 1) To acquire unified anomaly detection across industrial images of multiple categories, we introduce the learnable p…

    Submitted 14 March, 2024; originally announced March 2024.

  5. arXiv:2402.06121  [pdf, other]

    cs.LG stat.ML

    Iterated Denoising Energy Matching for Sampling from Boltzmann Densities

    Authors: Tara Akhound-Sadegh, Jarrid Rector-Brooks, Avishek Joey Bose, Sarthak Mittal, Pablo Lemos, Cheng-Hao Liu, Marcin Sendera, Siamak Ravanbakhsh, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Alexander Tong

    Abstract: Efficiently generating statistically independent samples from an unnormalized probability distribution, such as equilibrium samples of many-body systems, is a foundational problem in science. In this paper, we propose Iterated Denoising Energy Matching (iDEM), an iterative algorithm that uses a novel stochastic score matching objective leveraging solely the energy function and its gradient -- and…

    Submitted 26 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Published at ICML 2024. Code for iDEM is available at https://github.com/jarridrb/dem

  6. arXiv:2402.05247  [pdf, other]

    physics.flu-dyn

    A Geometric VOF Method for Interface Flow Simulations

    Authors: Dezhi Dai, Haomin Yuan, Albert Y. Tong, Adrian Tentner

    Abstract: A novel numerical technique designed for interface flow simulations using the Volume of Fluid (VOF) method on arbitrary unstructured meshes has been introduced. The method is called SimPLIC, which seamlessly integrates Piecewise Linear Interface Calculation (PLIC) and Simpson's rule. The main focus of the proposed method is to compute the volume of the primary phase that moves across a mesh face w…

    Submitted 7 February, 2024; originally announced February 2024.

  7. arXiv:2312.04823  [pdf, other]

    cs.CV cs.AI cs.IT cs.LG

    Assessing Neural Network Representations During Training Using Noise-Resilient Diffusion Spectral Entropy

    Authors: Danqi Liao, Chen Liu, Benjamin W. Christensen, Alexander Tong, Guillaume Huguet, Guy Wolf, Maximilian Nickel, Ian Adelstein, Smita Krishnaswamy

    Abstract: Entropy and mutual information in neural networks provide rich information on the learning process, but they have proven difficult to compute reliably in high dimensions. Indeed, in noisy and high-dimensional data, traditional estimates in ambient dimensions approach a fixed entropy and are prohibitively hard to compute. To address these issues, we leverage data geometry to access the underlying m…

    Submitted 3 December, 2023; originally announced December 2023.

    Journal ref: ICML 2023 Workshop on Topology, Algebra, and Geometry in Machine Learning

  8. SigFormer: Signature Transformers for Deep Hedging

    Authors: Anh Tong, Thanh Nguyen-Tang, Dongeun Lee, Toan Tran, Jaesik Choi

    Abstract: Deep hedging is a promising direction in quantitative finance, incorporating models and techniques from deep learning research. While giving excellent hedging strategies, models inherently require careful treatment in designing architectures for neural networks. To mitigate such difficulties, we introduce SigFormer, a novel deep learning model that combines the power of path signatures and transf…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: ICAIF 2023

  9. arXiv:2310.10649  [pdf, other]

    cs.LG math.OC stat.ML

    A Computational Framework for Solving Wasserstein Lagrangian Flows

    Authors: Kirill Neklyudov, Rob Brekelmans, Alexander Tong, Lazar Atanackovic, Qiang Liu, Alireza Makhzani

    Abstract: The dynamical formulation of the optimal transport can be extended through various choices of the underlying geometry (kinetic energy), and the regularization of density paths (potential energy). These combinations yield different variational problems (Lagrangians), encompassing many variations of the optimal transport problem such as the Schrödinger bridge, unbalanced optimal transport, and optim…

    Submitted 3 July, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  10. arXiv:2310.03579  [pdf, other]

    cs.AI q-bio.MN

    Causal Inference in Gene Regulatory Networks with GFlowNet: Towards Scalability in Large Systems

    Authors: Trang Nguyen, Alexander Tong, Kanika Madan, Yoshua Bengio, Dianbo Liu

    Abstract: Understanding causal relationships within Gene Regulatory Networks (GRNs) is essential for unraveling the gene interactions in cellular processes. However, causal discovery in GRNs is a challenging problem for multiple reasons including the existence of cyclic feedback loops and uncertainty that yields diverse possible causal structures. Previous works in this area either ignore cyclic dynamics (a…

    Submitted 5 October, 2023; originally announced October 2023.

  11. arXiv:2310.02391  [pdf, other]

    cs.LG cs.AI

    SE(3)-Stochastic Flow Matching for Protein Backbone Generation

    Authors: Avishek Joey Bose, Tara Akhound-Sadegh, Guillaume Huguet, Kilian Fatras, Jarrid Rector-Brooks, Cheng-Hao Liu, Andrei Cristian Nica, Maksym Korablyov, Michael Bronstein, Alexander Tong

    Abstract: The computational design of novel protein structures has the potential to impact numerous scientific disciplines greatly. Toward this goal, we introduce FoldFlow, a series of novel generative models of increasing modeling power based on the flow-matching paradigm over $3\mathrm{D}$ rigid motions -- i.e. the group $\text{SE}(3)$ -- enabling accurate modeling of protein backbones. We first introduce…

    Submitted 11 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Spotlight

  12. arXiv:2307.03672  [pdf, other]

    cs.LG

    Simulation-free Schrödinger bridges via score and flow matching

    Authors: Alexander Tong, Nikolay Malkin, Kilian Fatras, Lazar Atanackovic, Yanlei Zhang, Guillaume Huguet, Guy Wolf, Yoshua Bengio

    Abstract: We present simulation-free score and flow matching ([SF]$^2$M), a simulation-free objective for inferring stochastic dynamics given unpaired samples drawn from arbitrary source and target distributions. Our method generalizes both the score-matching loss used in the training of diffusion models and the recently proposed flow matching loss used in the training of continuous normalizing flows. [SF]…

    Submitted 11 March, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: AISTATS 2024. Code: https://github.com/atong01/conditional-flow-matching

  13. arXiv:2306.06062  [pdf, other]

    cs.CV cs.LG

    Neural FIM for learning Fisher Information Metrics from point cloud data

    Authors: Oluwadamilola Fasina, Guillaume Huguet, Alexander Tong, Yanlei Zhang, Guy Wolf, Maximilian Nickel, Ian Adelstein, Smita Krishnaswamy

    Abstract: Although data diffusion embeddings are ubiquitous in unsupervised learning and have proven to be a viable technique for uncovering the underlying intrinsic geometry of data, diffusion embeddings are inherently limited due to their discrete nature. To this end, we propose neural FIM, a method for computing the Fisher information metric (FIM) from point cloud data - allowing for a continuous manifol…

    Submitted 11 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 13 pages, 11 figures, 1 table

  14. arXiv:2306.02508  [pdf, other]

    cs.LG stat.ML

    Graph Fourier MMD for Signals on Graphs

    Authors: Samuel Leone, Aarthi Venkat, Guillaume Huguet, Alexander Tong, Guy Wolf, Smita Krishnaswamy

    Abstract: While numerous methods have been proposed for computing distances between probability distributions in Euclidean space, relatively little attention has been given to computing such distances for distributions on graphs. However, there has been a marked increase in data that either lies on a graph (such as protein interaction networks) or can be modeled as a graph (single cell data), particularly in…

    Submitted 4 June, 2023; originally announced June 2023.

  15. arXiv:2305.19043  [pdf, other]

    cs.LG q-bio.GN q-bio.QM stat.ML

    A Heat Diffusion Perspective on Geodesic Preserving Dimensionality Reduction

    Authors: Guillaume Huguet, Alexander Tong, Edward De Brouwer, Yanlei Zhang, Guy Wolf, Ian Adelstein, Smita Krishnaswamy

    Abstract: Diffusion-based manifold learning methods have proven useful in representation learning and dimensionality reduction of modern high dimensional, high throughput, noisy datasets. Such datasets are especially present in fields like biology and physics. While it is thought that these methods preserve underlying manifold structure of data by learning a proxy for geodesic distances, no specific theoret…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 31 pages, 13 figures, 10 tables

  16. arXiv:2305.18458  [pdf, other]

    cs.LG

    Conditional Support Alignment for Domain Adaptation with Label Shift

    Authors: Anh T Nguyen, Lam Tran, Anh Tong, Tuan-Duy H. Nguyen, Toan Tran

    Abstract: Unsupervised domain adaptation (UDA) refers to a domain adaptation framework in which a learning model is trained based on the labeled samples on the source domain and unlabelled ones in the target domain. The dominant existing methods in the field that rely on the classical covariate shift assumption to learn domain-invariant feature representation have yielded suboptimal performance under the la…

    Submitted 29 May, 2023; originally announced May 2023.

  17. arXiv:2305.03631  [pdf]

    physics.optics physics.app-ph

    Capping Layer Effects on $Sb_{2}S_{3}$-based Reconfigurable Photonic Devices

    Authors: Ting Yu Teo, Nanxi Li, Landobasa Y. M. Tobing, Amy S. K. Tong, Doris K. T. Ng, Zhihao Ren, Chengkuo Lee, Lennon Y. T. Lee, Robert Edward Simpson

    Abstract: Capping layers are essential for protecting phase change materials (PCMs) used in non-volatile photonics technologies. This work demonstrates how $(ZnS)_{0.8}-(SiO_2)_{0.2}$ caps radically influence the performance of $Sb_{2}S_{3}$ and Ag-doped $Sb_{2}S_{3}$ integrated photonic devices. We found that at least 30 nm of capping material is necessary to protect the material from Sulfur loss. However,…

    Submitted 5 May, 2023; originally announced May 2023.

  18. arXiv:2304.09254  [pdf]

    physics.med-ph cs.LG eess.IV

    FastMRI Prostate: A Publicly Available, Biparametric MRI Dataset to Advance Machine Learning for Prostate Cancer Imaging

    Authors: Radhika Tibrewala, Tarun Dutt, Angela Tong, Luke Ginocchio, Mahesh B Keerthivasan, Steven H Baete, Sumit Chopra, Yvonne W Lui, Daniel K Sodickson, Hersh Chandarana, Patricia M Johnson

    Abstract: The fastMRI brain and knee dataset has enabled significant advances in exploring reconstruction methods for improving speed and image quality for Magnetic Resonance Imaging (MRI) via novel, clinically relevant reconstruction approaches. In this study, we describe the April 2023 expansion of the fastMRI dataset to include biparametric prostate MRI data acquired on a clinical population. The dataset…

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 4 pages, 1 figure

  19. arXiv:2302.12325  [pdf, other]

    astro-ph.CO

    The Intrinsic Alignment of Red Galaxies in DES Y1 redMaPPer Galaxy Clusters

    Authors: C. Zhou, A. Tong, M. A. Troxel, J. Blazek, C. Lin, D. Bacon, L. Bleem, A. Carnero Rosell, C. Chang, M. Costanzi, J. DeRose, J. P. Dietrich, A. Drlica-Wagner, D. Gruen, R. A. Gruendl, B. Hoyle, M. Jarvis, N. MacCrann, B. Mawdsley, T. McClintock, P. Melchior, J. Prat, A. Pujol, E. Rozo, E. S. Rykoff, et al. (57 additional authors not shown)

    Abstract: Clusters of galaxies are sensitive to the most nonlinear peaks in the cosmic density field. The weak gravitational lensing of background galaxies by clusters can allow us to infer their masses. However, galaxies associated with the local environment of the cluster can also be intrinsically aligned due to the local tidal gradient, contaminating any cosmology derived from the lensing signal. We meas…

    Submitted 5 September, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: 14 pages, 13 figures. Accepted to MNRAS

  20. arXiv:2302.04178  [pdf, other]

    cs.LG cs.AI

    DynGFN: Towards Bayesian Inference of Gene Regulatory Networks with GFlowNets

    Authors: Lazar Atanackovic, Alexander Tong, Bo Wang, Leo J. Lee, Yoshua Bengio, Jason Hartford

    Abstract: One of the grand challenges of cell biology is inferring the gene regulatory network (GRN) which describes interactions between genes and their products that control gene expression and cellular function. We can treat this as a causal discovery problem but with two non-standard challenges: (1) regulatory networks are inherently cyclic so we should not model a GRN as a directed acyclic graph (DAG),…

    Submitted 22 December, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

  21. arXiv:2302.00482  [pdf, other]

    cs.LG

    Improving and generalizing flow-based generative models with minibatch optimal transport

    Authors: Alexander Tong, Kilian Fatras, Nikolay Malkin, Guillaume Huguet, Yanlei Zhang, Jarrid Rector-Brooks, Guy Wolf, Yoshua Bengio

    Abstract: Continuous normalizing flows (CNFs) are an attractive generative modeling technique, but they have been held back by limitations in their simulation-based maximum likelihood training. We introduce the generalized conditional flow matching (CFM) technique, a family of simulation-free training objectives for CNFs. CFM features a stable regression objective like that used to train the stochastic flow…

    Submitted 11 March, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: TMLR. Code: https://github.com/atong01/conditional-flow-matching

  22. arXiv:2301.11962  [pdf, other]

    cs.LG

    On the Feasibility of Machine Learning Augmented Magnetic Resonance for Point-of-Care Identification of Disease

    Authors: Raghav Singhal, Mukund Sudarshan, Anish Mahishi, Sri Kaushik, Luke Ginocchio, Angela Tong, Hersh Chandarana, Daniel K. Sodickson, Rajesh Ranganath, Sumit Chopra

    Abstract: Early detection of many life-threatening diseases (e.g., prostate and breast cancer) within at-risk population can improve clinical outcomes and reduce cost of care. While numerous disease-specific "screening" tests that are closer to Point-of-Care (POC) are in use for this task, their low specificity results in unnecessary biopsies, leading to avoidable patient trauma and wasteful healthcare spen…

    Submitted 2 February, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  23. arXiv:2211.00805  [pdf, other]

    cs.LG q-bio.QM

    Geodesic Sinkhorn for Fast and Accurate Optimal Transport on Manifolds

    Authors: Guillaume Huguet, Alexander Tong, María Ramos Zapatero, Christopher J. Tape, Guy Wolf, Smita Krishnaswamy

    Abstract: Efficient computation of optimal transport distance between distributions is of growing importance in data science. Sinkhorn-based methods are currently the state-of-the-art for such computations, but require $O(n^2)$ computations. In addition, Sinkhorn-based methods commonly use a Euclidean ground distance between datapoints. However, with the prevalence of manifold structured scientific data, i…

    Submitted 26 September, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: A shorter version without the appendix appeared in the IEEE International Workshop on Machine Learning for Signal Processing (2023)

  24. arXiv:2208.07458  [pdf, other]

    cs.LG

    Learnable Filters for Geometric Scattering Modules

    Authors: Alexander Tong, Frederik Wenkel, Dhananjay Bhaskar, Kincaid Macdonald, Jackson Grady, Michael Perlmutter, Smita Krishnaswamy, Guy Wolf

    Abstract: We propose a new graph neural network (GNN) module, based on relaxations of recently proposed geometric scattering transforms, which consist of a cascade of graph wavelet filters. Our learnable geometric scattering (LEGS) module enables adaptive tuning of the wavelets to encourage band-pass features to emerge in learned representations. The incorporation of our LEGS-module in GNNs enables the lear…

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: 14 pages, 3 figures, 10 tables. arXiv admin note: substantial text overlap with arXiv:2010.02415

  25. arXiv:2206.14928  [pdf, other]

    cs.LG

    Manifold Interpolating Optimal-Transport Flows for Trajectory Inference

    Authors: Guillaume Huguet, D. S. Magruder, Alexander Tong, Oluwadamilola Fasina, Manik Kuchroo, Guy Wolf, Smita Krishnaswamy

    Abstract: We present a method called Manifold Interpolating Optimal-Transport Flow (MIOFlow) that learns stochastic, continuous population dynamics from static snapshot samples taken at sporadic timepoints. MIOFlow combines dynamic models, manifold learning, and optimal transport by training neural ordinary differential equations (Neural ODE) to interpolate between static population snapshots as penalized b…

    Submitted 3 November, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: Presented at NeurIPS 2022, 24 pages, 7 tables, 14 figures

  26. Optically enhanced discharge excitation and trapping of $^{39}Ar$

    Authors: Y. -Q. Chu, Z. -F. Wan, F. Ritterbusch, W. -K. Hu, J. -Q. Gu, S. -M. Hu, Z. -H. Jia, W. Jiang, Z. -T. Lu, L. -T. Sun, A. -M. Tong, J. S. Wang, G. -M. Yang

    Abstract: We report on a two-fold increase of the $^{39}Ar$ loading rate in an atom trap by enhancing the generation of metastable atoms in a discharge source. Additional atoms in the metastable $1s_5$ level (Paschen notation) are obtained via optically pumping both the $1s_4$ - $2p_6$ transition at 801 nm and the $1s_2$ - $2p_6$ transition at 923 nm. By solving the master equation for the corresponding six…

    Submitted 24 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: text overlap with arXiv:2005.11056

    Journal ref: Phys. Rev. A 105, 063108(2022)

  27. arXiv:2203.14860  [pdf, other]

    cs.LG stat.ML

    Time-inhomogeneous diffusion geometry and topology

    Authors: Guillaume Huguet, Alexander Tong, Bastian Rieck, Jessie Huang, Manik Kuchroo, Matthew Hirn, Guy Wolf, Smita Krishnaswamy

    Abstract: Diffusion condensation is a dynamic process that yields a sequence of multiscale data representations that aim to encode meaningful abstractions. It has proven effective for manifold learning, denoising, clustering, and visualization of high-dimensional data. Diffusion condensation is constructed as a time-inhomogeneous process where each step first computes and then applies a diffusion operator t…

    Submitted 5 January, 2023; v1 submitted 28 March, 2022; originally announced March 2022.

  28. arXiv:2111.10452  [pdf, other]

    cs.LG cs.AI

    MURAL: An Unsupervised Random Forest-Based Embedding for Electronic Health Record Data

    Authors: Michal Gerasimiuk, Dennis Shung, Alexander Tong, Adrian Stanley, Michael Schultz, Jeffrey Ngu, Loren Laine, Guy Wolf, Smita Krishnaswamy

    Abstract: A major challenge in embedding or visualizing clinical patient data is the heterogeneity of variable types including continuous lab values, categorical diagnostic codes, as well as missing or incomplete data. In particular, in EHR data, some variables are {\em missing not at random (MNAR)} but deliberately not collected and thus are a source of information. For example, lab tests may be deemed nec…

    Submitted 19 November, 2021; originally announced November 2021.

  29. arXiv:2107.12334  [pdf, other]

    cs.LG eess.SP

    Embedding Signals on Knowledge Graphs with Unbalanced Diffusion Earth Mover's Distance

    Authors: Alexander Tong, Guillaume Huguet, Dennis Shung, Amine Natik, Manik Kuchroo, Guillaume Lajoie, Guy Wolf, Smita Krishnaswamy

    Abstract: In modern relational machine learning it is common to encounter large graphs that arise via interactions or similarities between observations in many domains. Further, in many cases the target entities for analysis are actually signals on such graphs. We propose to compare and organize such datasets of graph signals by using an earth mover's distance (EMD) with a geodesic cost over the underlying…

    Submitted 28 March, 2022; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: 5 pages, 5 figures, ICASSP 2022

  30. arXiv:2102.12833  [pdf, other]

    cs.LG

    Diffusion Earth Mover's Distance and Distribution Embeddings

    Authors: Alexander Tong, Guillaume Huguet, Amine Natik, Kincaid MacDonald, Manik Kuchroo, Ronald Coifman, Guy Wolf, Smita Krishnaswamy

    Abstract: We propose a new fast method of measuring distances between large numbers of related high dimensional datasets called the Diffusion Earth Mover's Distance (EMD). We model the datasets as distributions supported on a common data graph that is derived from the affinity matrix computed on the combined data. In such cases where the graph is a discretization of an underlying Riemannian closed manifold, w…

    Submitted 27 July, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: Presented at ICML 2021

  31. COVID-19 Heterogeneity in Islands Chain Environment

    Authors: Monique Chyba, Alice Koniges, Prateek Kunwar, Winnie Lau, Yuriy Mileyko, Alan Tong

    Abstract: As 2021 dawns, the COVID-19 pandemic is still raging strongly as vaccines finally appear and hopes for a return to normalcy start to materialize. There is much to be learned from the pandemic's first year data that will likely remain applicable to future epidemics and possible pandemics. With only minor variants in virus strain, countries across the globe have suffered roughly the same pandemic by…

    Submitted 12 February, 2021; originally announced February 2021.

  32. arXiv:2102.06757  [pdf, other]

    cs.LG cs.HC

    Multimodal Data Visualization and Denoising with Integrated Diffusion

    Authors: Manik Kuchroo, Abhinav Godavarthi, Alexander Tong, Guy Wolf, Smita Krishnaswamy

    Abstract: We propose a method called integrated diffusion for combining multimodal datasets, or data gathered via several different measurements on the same system, to create a joint data diffusion operator. As real world data suffers from both local and global noise, we introduce mechanisms to optimally calculate a diffusion operator that reflects the combined information from both modalities. We show the…

    Submitted 3 March, 2022; v1 submitted 12 February, 2021; originally announced February 2021.

  33. arXiv:2012.12824  [pdf, other]

    astro-ph.CO astro-ph.IM

    Dark Energy Survey Year 3 Results: Deep Field Optical + Near-Infrared Images and Catalogue

    Authors: W. G. Hartley, A. Choi, A. Amon, R. A. Gruendl, E. Sheldon, I. Harrison, G. M. Bernstein, I. Sevilla-Noarbe, B. Yanny, K. Eckert, H. T. Diehl, A. Alarcon, M. Banerji, K. Bechtol, R. Buchs, S. Cantu, C. Conselice, J. Cordero, C. Davis, T. M. Davis, S. Dodelson, A. Drlica-Wagner, S. Everett, A. Ferté, D. Gruen, et al. (93 additional authors not shown)

    Abstract: We describe the Dark Energy Survey (DES) Deep Fields, a set of images and associated multi-wavelength catalogue ($ugrizJHKs$) built from Dark Energy Camera (DECam) and Visible and Infrared Survey Telescope for Astronomy (VISTA) data. The DES Deep Fields comprise 11 fields (10 DES supernova fields plus COSMOS), with a total area of $\sim30~$ square degrees in $ugriz$ bands and reaching a maximum…

    Submitted 16 February, 2022; v1 submitted 23 December, 2020; originally announced December 2020.

    Comments: 32 pages, 23 figures, version accepted by MNRAS. See https://www.darkenergysurvey.org/des-year-3-cosmology-results-papers/ for the full DES Y3 cosmology release

    Report number: FERMILAB-PUB-20-670-AE

    Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 509, Issue 3, pp.3547-3579, 2022

  34. arXiv:2012.11339  [pdf, other]

    cs.LG stat.ML

    Learning Compositional Sparse Gaussian Processes with a Shrinkage Prior

    Authors: Anh Tong, Toan Tran, Hung Bui, Jaesik Choi

    Abstract: Choosing a proper set of kernel functions is an important problem in learning Gaussian Process (GP) models since each kernel structure has different model complexity and data fitness. Recently, automatic kernel composition methods provide not only accurate prediction but also attractive interpretability through search-based methods. However, existing methods suffer from slow kernel composition lea…

    Submitted 24 February, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: AAAI 2021

  35. arXiv:2010.09301  [pdf, other]

    cs.LG stat.ML

    Characterizing Deep Gaussian Processes via Nonlinear Recurrence Systems

    Authors: Anh Tong, Jaesik Choi

    Abstract: Recent advances in Deep Gaussian Processes (DGPs) show the potential to have more expressive representation than that of traditional Gaussian Processes (GPs). However, there exists a pathology of deep Gaussian processes that their learning capacities reduce significantly when the number of layers increases. In this paper, we present a new analysis in DGPs by studying its corresponding nonlinear dy…

    Submitted 21 December, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: AAAI 2021

  36. arXiv:2010.02415  [pdf, other]

    cs.LG stat.ML

    Data-Driven Learning of Geometric Scattering Networks

    Authors: Alexander Tong, Frederik Wenkel, Kincaid MacDonald, Smita Krishnaswamy, Guy Wolf

    Abstract: We propose a new graph neural network (GNN) module, based on relaxations of recently proposed geometric scattering transforms, which consist of a cascade of graph wavelet filters. Our learnable geometric scattering (LEGS) module enables adaptive tuning of the wavelets to encourage band-pass features to emerge in learned representations. The incorporation of our LEGS-module in GNNs enables the lear…

    Submitted 28 March, 2022; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: 6 pages, 2 figures, 3 tables, Presented at IEEE MLSP 2021

  37. arXiv:2006.06885  [pdf, other]

    cs.LG stat.ML

    Uncovering the Folding Landscape of RNA Secondary Structure with Deep Graph Embeddings

    Authors: Egbert Castro, Andrew Benz, Alexander Tong, Guy Wolf, Smita Krishnaswamy

    Abstract: Biomolecular graph analysis has recently gained much attention in the emerging field of geometric deep learning. Here we focus on organizing biomolecular graphs in ways that expose meaningful relations and variations between them. We propose a geometric scattering autoencoder (GSAE) network for learning such graph embeddings. Our embedding network first extracts rich graph features using the recen…

    Submitted 28 March, 2022; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 10 pages, 10 figures, 4 tables, Presented at IEEE Big Data 2020

  38. arXiv:2004.10746  [pdf, other]

    cs.LG cs.AI

    Chip Placement with Deep Reinforcement Learning

    Authors: Azalia Mirhoseini, Anna Goldie, Mustafa Yazgan, Joe Jiang, Ebrahim Songhori, Shen Wang, Young-Joon Lee, Eric Johnson, Omkar Pathak, Sungmin Bae, Azade Nazi, Jiwoo Pak, Andy Tong, Kavya Srinivasa, William Hang, Emre Tuncer, Anand Babu, Quoc V. Le, James Laudon, Richard Ho, Roger Carpenter, Jeff Dean

    Abstract: In this work, we present a learning-based approach to chip placement, one of the most complex and time-consuming stages of the chip design process. Unlike prior methods, our approach has the ability to learn from past experience and improve over time. In particular, as we train over a greater number of chip blocks, our method becomes better at rapidly generating optimized placements for previously…

    Submitted 22 April, 2020; originally announced April 2020.

  39. DISIR: Deep Image Segmentation with Interactive Refinement

    Authors: Gaston Lenczner, Bertrand Le Saux, Nicola Luminari, Adrien Chan Hon Tong, Guy Le Besnerais

    Abstract: This paper presents an interactive approach for multi-class segmentation of aerial images. Precisely, it is based on a deep neural network which exploits both RGB images and annotations. Starting from an initial output based on the image only, our network then interactively refines this segmentation map using a concatenation of the image and user annotations. Importantly, user annotations modify t…

    Submitted 20 August, 2020; v1 submitted 31 March, 2020; originally announced March 2020.

    Comments: 8 pages, 12 figures. Published in the ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences

    Journal ref: XXIV ISPRS Congress, Commission II (Volume V-2-2020)

  40. arXiv:2002.04461  [pdf, other]

    stat.ML cs.CV cs.LG q-bio.QM

    TrajectoryNet: A Dynamic Optimal Transport Network for Modeling Cellular Dynamics

    Authors: Alexander Tong, Jessie Huang, Guy Wolf, David van Dijk, Smita Krishnaswamy

    Abstract: It is increasingly common to encounter data from dynamic processes captured by static cross-sectional measurements over time, particularly in biomedical settings. Recent attempts to model individual trajectories from this data use optimal transport to create pairwise matchings between time points. However, these methods cannot model continuous dynamics and non-linear paths that entities can take i…

    Submitted 26 July, 2020; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: Presented at ICML 2020

  41. arXiv:1911.06253  [pdf, other]

    stat.ML cs.LG

    Understanding Graph Neural Networks with Generalized Geometric Scattering Transforms

    Authors: Michael Perlmutter, Alexander Tong, Feng Gao, Guy Wolf, Matthew Hirn

    Abstract: The scattering transform is a multilayered wavelet-based deep learning architecture that acts as a model of convolutional neural networks. Recently, several works have introduced generalizations of the scattering transform for non-Euclidean settings such as graphs. Our work builds upon these constructions by introducing windowed and non-windowed geometric scattering transforms for graphs based upo…

    Submitted 28 June, 2023; v1 submitted 14 November, 2019; originally announced November 2019.

  42. arXiv:1905.13168  [pdf, other]

    cs.LG stat.ML

    Confirmatory Bayesian Online Change Point Detection in the Covariance Structure of Gaussian Processes

    Authors: Jiyeon Han, Kyowoon Lee, Anh Tong, Jaesik Choi

    Abstract: In the analysis of sequential data, the detection of abrupt changes is important in predicting future changes. In this paper, we propose statistical hypothesis tests for detecting covariance structure changes in locally smooth time series modeled by Gaussian Processes (GPs). We provide theoretically justified thresholds for the tests, and use them to improve Bayesian Online Change Point Detection…

    Submitted 7 February, 2020; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: IJCAI 2019; 12 pages, LaTeX; Revised conditions of Theorems in section 4, results unchanged

  43. arXiv:1905.10710  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Fixing Bias in Reconstruction-based Anomaly Detection with Lipschitz Discriminators

    Authors: Alexander Tong, Guy Wolf, Smita Krishnaswamy

    Abstract: Anomaly detection is of great interest in fields where abnormalities need to be identified and corrected (e.g., medicine and finance). Deep learning methods for this task often rely on autoencoder reconstruction error, sometimes in conjunction with other errors. We show that this approach exhibits intrinsic biases that lead to undesirable results. Reconstruction-based methods are sensitive to trai…

    Submitted 26 July, 2020; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: 6 pages, 4 figures, 2 tables, presented at IEEE MLSP

  44. arXiv:1901.09078  [pdf, other]

    cs.LG stat.ML

    Finding Archetypal Spaces Using Neural Networks

    Authors: David van Dijk, Daniel Burkhardt, Matthew Amodio, Alex Tong, Guy Wolf, Smita Krishnaswamy

    Abstract: Archetypal analysis is a data decomposition method that describes each observation in a dataset as a convex combination of "pure types" or archetypes. These archetypes represent extrema of a data space in which there is a trade-off between features, such as in biology where different combinations of traits provide optimal fitness for different environments. Existing methods for archetypal analysis…

    Submitted 13 November, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: 9 pages, 10 figures, to be presented at IEEE Big Data 2019

  45. arXiv:1810.00424  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Interpretable Neuron Structuring with Graph Spectral Regularization

    Authors: Alexander Tong, David van Dijk, Jay S. Stanley III, Matthew Amodio, Kristina Yim, Rebecca Muhle, James Noonan, Guy Wolf, Smita Krishnaswamy

    Abstract: While neural networks are powerful approximators used to classify or embed data into lower dimensional spaces, they are often regarded as black boxes with uninterpretable features. Here we propose Graph Spectral Regularization for making hidden layers more interpretable without significantly impacting performance on the primary task. Taking inspiration from spatial organization and localization of…

    Submitted 14 February, 2020; v1 submitted 30 September, 2018; originally announced October 2018.

    Comments: 12 pages, 6 figures, presented at IDA 2020

  46. arXiv:1711.07412  [pdf, other]

    cs.SI physics.soc-ph

    Distributed Rumor Blocking with Multiple Positive Cascades

    Authors: Guangmo Amo Tong, Weili Wu, Ding-Zhu Du

    Abstract: Misinformation and rumor can spread rapidly and widely through online social networks and therefore rumor controlling has become a critical issue. It is often assumed that there is a single authority whose goal is to minimize the spread of rumor by generating a positive cascade. In this paper, we study a more realistic scenario when there are multiple positive cascades generated by different agent…

    Submitted 1 December, 2017; v1 submitted 20 November, 2017; originally announced November 2017.

    Comments: under review

  47. Cell Sequence and Mitosis Affect Fibroblast Directional Decision-Making during Chemotaxis in Tissue-Mimicking Microfluidic Mazes

    Authors: Q. L. Pham, D. Chege, T. Dijamco, J. Brito, E. Stein, N. A. N. Tong, S. Basuray, R. S. Voronov

    Abstract: Directed fibroblast migration is central to highly proliferative processes in regenerative medicine and developmental biology, such as wound healing and embryogenesis. However, the mechanisms by which single fibroblasts affect each other's directional decisions, while chemotaxing in microscopic tissue pores, are not well understood. Therefore, we explored the effects of two types of relevant socia…

    Submitted 18 September, 2017; originally announced September 2017.

    Comments: see last page for supplemental materials

    Journal ref: Pham, Q.L., Rodrigues, L.N., Maximov, M.A. et al. Cel. Mol. Bioeng. (2018)

  48. arXiv:1703.09528  [pdf, other]

    stat.ML

    Discovering Latent Covariance Structures for Multiple Time Series

    Authors: Anh Tong, Jaesik Choi

    Abstract: Analyzing multivariate time series data is important to predict future events and changes of complex systems in finance, manufacturing, and administrative decisions. The expressiveness power of Gaussian Process (GP) regression methods has been significantly improved by compositional covariance structures. In this paper, we present a new GP model which naturally handles multiple time series by plac…

    Submitted 22 May, 2019; v1 submitted 28 March, 2017; originally announced March 2017.

    Comments: ICML2019, 13 pages

  49. arXiv:1607.00710  [pdf, other]

    stat.ML

    Automatic Generation of Probabilistic Programming from Time Series Data

    Authors: Anh Tong, Jaesik Choi

    Abstract: Probabilistic programming languages represent complex data with intermingled models in a few lines of code. Efficient inference algorithms in probabilistic programming languages make possible to build unified frameworks to compute interesting probabilities of various large, real-world problems. When the structure of model is given, constructing a probabilistic program is rather straightforward. Th…

    Submitted 13 July, 2016; v1 submitted 3 July, 2016; originally announced July 2016.

  50. arXiv:1603.03703  [pdf, ps, other]

    cs.LG

    Searching for Topological Symmetry in Data Haystack

    Authors: Kallol Roy, Anh Tong, Jaesik Choi

    Abstract: Finding interesting symmetrical topological structures in high-dimensional systems is an important problem in statistical machine learning. Limited amount of available high-dimensional data and its sensitivity to noise pose computational challenges to find symmetry. Our paper presents a new method to find local symmetries in a low-dimensional 2-D grid structure which is embedded in high-dimensiona…

    Submitted 11 March, 2016; originally announced March 2016.