-
Tractography with T1-weighted MRI and associated anatomical constraints on clinical quality diffusion MRI
Authors:
Tian Yu,
Yunhe Li,
Michael E. Kim,
Chenyu Gao,
Qi Yang,
Leon Y. Cai,
Susane M. Resnick,
Lori L. Beason-Held,
Daniel C. Moyer,
Kurt G. Schilling,
Bennett A. Landman
Abstract:
Diffusion MRI (dMRI) streamline tractography, the gold standard for in vivo estimation of brain white matter (WM) pathways, has long been considered indicative of macroscopic relationships with WM microstructure. However, recent advances in tractography demonstrated that convolutional recurrent neural networks (CoRNN) trained with a teacher-student framework have the ability to learn and propagate…
▽ More
Diffusion MRI (dMRI) streamline tractography, the gold standard for in vivo estimation of brain white matter (WM) pathways, has long been considered indicative of macroscopic relationships with WM microstructure. However, recent advances in tractography demonstrated that convolutional recurrent neural networks (CoRNN) trained with a teacher-student framework have the ability to learn and propagate streamlines directly from T1 and anatomical contexts. Training for this network has previously relied on high-resolution dMRI. In this paper, we generalize the training mechanism to traditional clinical resolution data, which allows generalizability across sensitive and susceptible study populations. We train CoRNN on a small subset of the Baltimore Longitudinal Study of Aging (BLSA), which better resembles clinical protocols. Then, we define a metric, termed the epsilon ball seeding method, to compare T1 tractography and traditional diffusion tractography at the streamline level. Under this metric, T1 tractography generated by CoRNN reproduces diffusion tractography with approximately two millimeters of error.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
2RV+HRV and Testing for Strong VS Full Dependence
Authors:
Tiandong Wang,
Sidney I. Resnick
Abstract:
Preferential attachment models of network growth are bivariate heavy tailed models for in- and out-degree with limit measures which either concentrate on a ray of positive slope from the origin or on all of the positive quadrant depending on whether the model includes reciprocity or not. Concentration on the ray is called full dependence. If there were a reliable way to distinguish full dependence…
▽ More
Preferential attachment models of network growth are bivariate heavy tailed models for in- and out-degree with limit measures which either concentrate on a ray of positive slope from the origin or on all of the positive quadrant depending on whether the model includes reciprocity or not. Concentration on the ray is called full dependence. If there were a reliable way to distinguish full dependence from not-full, we would have guidance about which model to choose. This motivates investigating tests that distinguish between (i) full dependence; (ii) strong dependence (support of the limit measure is a proper subcone of the positive quadrant); (iii) weak dependence (limit measure concentrates on positive quadrant). We give two test statistics, analyze their asymptotically normal behavior under full and not-full dependence, and discuss applicability using bootstrap methods applied to simulated and real data.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
Predicting Age from White Matter Diffusivity with Residual Learning
Authors:
Chenyu Gao,
Michael E. Kim,
Ho Hin Lee,
Qi Yang,
Nazirah Mohd Khairi,
Praitayini Kanakaraj,
Nancy R. Newlin,
Derek B. Archer,
Angela L. Jefferson,
Warren D. Taylor,
Brian D. Boyd,
Lori L. Beason-Held,
Susan M. Resnick,
The BIOCARD Study Team,
Yuankai Huo,
Katherine D. Van Schaik,
Kurt G. Schilling,
Daniel Moyer,
Ivana Išgum,
Bennett A. Landman
Abstract:
Imaging findings inconsistent with those expected at specific chronological age ranges may serve as early indicators of neurological disorders and increased mortality risk. Estimation of chronological age, and deviations from expected results, from structural MRI data has become an important task for developing biomarkers that are sensitive to such deviations. Complementary to structural analysis,…
▽ More
Imaging findings inconsistent with those expected at specific chronological age ranges may serve as early indicators of neurological disorders and increased mortality risk. Estimation of chronological age, and deviations from expected results, from structural MRI data has become an important task for developing biomarkers that are sensitive to such deviations. Complementary to structural analysis, diffusion tensor imaging (DTI) has proven effective in identifying age-related microstructural changes within the brain white matter, thereby presenting itself as a promising additional modality for brain age prediction. Although early studies have sought to harness DTI's advantages for age estimation, there is no evidence that the success of this prediction is owed to the unique microstructural and diffusivity features that DTI provides, rather than the macrostructural features that are also available in DTI data. Therefore, we seek to develop white-matter-specific age estimation to capture deviations from normal white matter aging. Specifically, we deliberately disregard the macrostructural information when predicting age from DTI scalar images, using two distinct methods. The first method relies on extracting only microstructural features from regions of interest. The second applies 3D residual neural networks (ResNets) to learn features directly from the images, which are non-linearly registered and warped to a template to minimize macrostructural variations. When tested on unseen data, the first method yields mean absolute error (MAE) of 6.11 years for cognitively normal participants and MAE of 6.62 years for cognitively impaired participants, while the second method achieves MAE of 4.69 years for cognitively normal participants and MAE of 4.96 years for cognitively impaired participants. We find that the ResNet model captures subtler, non-macrostructural features for brain age prediction.
△ Less
Submitted 21 January, 2024; v1 submitted 6 November, 2023;
originally announced November 2023.
-
Robust Fiber ODF Estimation Using Deep Constrained Spherical Deconvolution for Diffusion MRI
Authors:
Tianyuan Yao,
Francois Rheault,
Leon Y Cai,
Vishwesh nath,
Zuhayr Asad,
Nancy Newlin,
Can Cui,
Ruining Deng,
Karthik Ramadass,
Andrea Shafer,
Susan Resnick,
Kurt Schilling,
Bennett A. Landman,
Yuankai Huo
Abstract:
Diffusion-weighted magnetic resonance imaging (DW-MRI) is a critical imaging method for capturing and modeling tissue microarchitecture at a millimeter scale. A common practice to model the measured DW-MRI signal is via fiber orientation distribution function (fODF). This function is the essential first step for the downstream tractography and connectivity analyses. With recent advantages in data…
▽ More
Diffusion-weighted magnetic resonance imaging (DW-MRI) is a critical imaging method for capturing and modeling tissue microarchitecture at a millimeter scale. A common practice to model the measured DW-MRI signal is via fiber orientation distribution function (fODF). This function is the essential first step for the downstream tractography and connectivity analyses. With recent advantages in data sharing, large-scale multi-site DW-MRI datasets are being made available for multi-site studies. However, measurement variabilities (e.g., inter- and intra-site variability, hardware performance, and sequence design) are inevitable during the acquisition of DW-MRI. Most existing model-based methods (e.g., constrained spherical deconvolution (CSD)) and learning based methods (e.g., deep learning (DL)) do not explicitly consider such variabilities in fODF modeling, which consequently leads to inferior performance on multi-site and/or longitudinal diffusion studies. In this paper, we propose a novel data-driven deep constrained spherical deconvolution method to explicitly constrain the scan-rescan variabilities for a more reproducible and robust estimation of brain microstructure from repeated DW-MRI scans. Specifically, the proposed method introduces a new 3D volumetric scanner-invariant regularization scheme during the fODF estimation. We study the Human Connectome Project (HCP) young adults test-retest group as well as the MASiVar dataset (with inter- and intra-site scan/rescan data). The Baltimore Longitudinal Study of Aging (BLSA) dataset is employed for external validation. From the experimental results, the proposed data-driven framework outperforms the existing benchmarks in repeated fODF estimation. The proposed method is assessing the downstream connectivity analysis and shows increased performance in distinguishing subjects with different biomarkers.
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
Rapid Brain Meninges Surface Reconstruction with Layer Topology Guarantee
Authors:
Peiyu Duan,
Yuan Xue,
Shuo Han,
Lianrui Zuo,
Aaron Carass,
Caitlyn Bernhard,
Savannah Hays,
Peter A. Calabresi,
Susan M. Resnick,
James S. Duncan,
Jerry L. Prince
Abstract:
The meninges, located between the skull and brain, are composed of three membrane layers: the pia, the arachnoid, and the dura. Reconstruction of these layers can aid in studying volume differences between patients with neurodegenerative diseases and normal aging subjects. In this work, we use convolutional neural networks (CNNs) to reconstruct surfaces representing meningeal layer boundaries from…
▽ More
The meninges, located between the skull and brain, are composed of three membrane layers: the pia, the arachnoid, and the dura. Reconstruction of these layers can aid in studying volume differences between patients with neurodegenerative diseases and normal aging subjects. In this work, we use convolutional neural networks (CNNs) to reconstruct surfaces representing meningeal layer boundaries from magnetic resonance (MR) images. We first use the CNNs to predict the signed distance functions (SDFs) representing these surfaces while preserving their anatomical ordering. The marching cubes algorithm is then used to generate continuous surface representations; both the subarachnoid space (SAS) and the intracranial volume (ICV) are computed from these surfaces. The proposed method is compared to a state-of-the-art deformable model-based reconstruction method, and we show that our method can reconstruct smoother and more accurate surfaces using less computation time. Finally, we conduct experiments with volumetric analysis on both subjects with multiple sclerosis and healthy controls. For healthy and MS subjects, ICVs and SAS volumes are found to be significantly correlated to sex (p<0.01) and age (p<0.03) changes, respectively.
△ Less
Submitted 12 April, 2023;
originally announced April 2023.
-
Gene-SGAN: a method for discovering disease subtypes with imaging and genetic signatures via multi-view weakly-supervised deep clustering
Authors:
Zhijian Yang,
Junhao Wen,
Ahmed Abdulkadir,
Yuhan Cui,
Guray Erus,
Elizabeth Mamourian,
Randa Melhem,
Dhivya Srinivasan,
Sindhuja T. Govindarajan,
Jiong Chen,
Mohamad Habes,
Colin L. Masters,
Paul Maruff,
Jurgen Fripp,
Luigi Ferrucci,
Marilyn S. Albert,
Sterling C. Johnson,
John C. Morris,
Pamela LaMontagne,
Daniel S. Marcus,
Tammie L. S. Benzinger,
David A. Wolk,
Li Shen,
Jingxuan Bao,
Susan M. Resnick
, et al. (3 additional authors not shown)
Abstract:
Disease heterogeneity has been a critical challenge for precision diagnosis and treatment, especially in neurologic and neuropsychiatric diseases. Many diseases can display multiple distinct brain phenotypes across individuals, potentially reflecting disease subtypes that can be captured using MRI and machine learning methods. However, biological interpretability and treatment relevance are limite…
▽ More
Disease heterogeneity has been a critical challenge for precision diagnosis and treatment, especially in neurologic and neuropsychiatric diseases. Many diseases can display multiple distinct brain phenotypes across individuals, potentially reflecting disease subtypes that can be captured using MRI and machine learning methods. However, biological interpretability and treatment relevance are limited if the derived subtypes are not associated with genetic drivers or susceptibility factors. Herein, we describe Gene-SGAN - a multi-view, weakly-supervised deep clustering method - which dissects disease heterogeneity by jointly considering phenotypic and genetic data, thereby conferring genetic correlations to the disease subtypes and associated endophenotypic signatures. We first validate the generalizability, interpretability, and robustness of Gene-SGAN in semi-synthetic experiments. We then demonstrate its application to real multi-site datasets from 28,858 individuals, deriving subtypes of Alzheimer's disease and brain endophenotypes associated with hypertension, from MRI and SNP data. Derived brain phenotypes displayed significant differences in neuroanatomical patterns, genetic determinants, biological and clinical biomarkers, indicating potentially distinct underlying neuropathologic processes, genetic drivers, and susceptibility factors. Overall, Gene-SGAN is broadly applicable to disease subtyping and endophenotype discovery, and is herein tested on disease-related, genetically-driven neuroimaging phenotypes.
△ Less
Submitted 25 January, 2023;
originally announced January 2023.
-
HACA3: A Unified Approach for Multi-site MR Image Harmonization
Authors:
Lianrui Zuo,
Yihao Liu,
Yuan Xue,
Blake E. Dewey,
Samuel W. Remedios,
Savannah P. Hays,
Murat Bilgel,
Ellen M. Mowry,
Scott D. Newsome,
Peter A. Calabresi,
Susan M. Resnick,
Jerry L. Prince,
Aaron Carass
Abstract:
The lack of standardization is a prominent issue in magnetic resonance (MR) imaging. This often causes undesired contrast variations in the acquired images due to differences in hardware and acquisition parameters. In recent years, image synthesis-based MR harmonization with disentanglement has been proposed to compensate for the undesired contrast variations. Despite the success of existing metho…
▽ More
The lack of standardization is a prominent issue in magnetic resonance (MR) imaging. This often causes undesired contrast variations in the acquired images due to differences in hardware and acquisition parameters. In recent years, image synthesis-based MR harmonization with disentanglement has been proposed to compensate for the undesired contrast variations. Despite the success of existing methods, we argue that three major improvements can be made. First, most existing methods are built upon the assumption that multi-contrast MR images of the same subject share the same anatomy. This assumption is questionable, since different MR contrasts are specialized to highlight different anatomical features. Second, these methods often require a fixed set of MR contrasts for training (e.g., both T1-weighted and T2-weighted images), limiting their applicability. Lastly, existing methods are generally sensitive to imaging artifacts. In this paper, we present Harmonization with Attention-based Contrast, Anatomy, and Artifact Awareness (HACA3), a novel approach to address these three issues. HACA3 incorporates an anatomy fusion module that accounts for the inherent anatomical differences between MR contrasts. Furthermore, HACA3 is also robust to imaging artifacts and can be trained and applied to any set of MR contrasts. HACA3 is developed and evaluated on diverse MR datasets acquired from 21 sites with varying field strengths, scanner platforms, and acquisition protocols. Experiments show that HACA3 achieves state-of-the-art performance under multiple image quality metrics. We also demonstrate the applicability and versatility of HACA3 on downstream tasks including white matter lesion segmentation and longitudinal volumetric analyses.
△ Less
Submitted 25 April, 2023; v1 submitted 12 December, 2022;
originally announced December 2022.
-
Random Networks with Heterogeneous Reciprocity
Authors:
Tiandong Wang,
Sidney Resnick
Abstract:
Users of social networks display diversified behavior and online habits. For instance, a user's tendency to reply to a post can depend on the user and the person posting. For convenience, we group users into aggregated behavioral patterns, focusing here on the tendency to reply to or reciprocate messages. The reciprocity feature in social networks reflects the information exchange among users. We…
▽ More
Users of social networks display diversified behavior and online habits. For instance, a user's tendency to reply to a post can depend on the user and the person posting. For convenience, we group users into aggregated behavioral patterns, focusing here on the tendency to reply to or reciprocate messages. The reciprocity feature in social networks reflects the information exchange among users. We study the properties of a preferential attachment model with heterogeneous reciprocity levels, give the growth rate of model edge counts, and prove convergence of empirical degree frequencies to a limiting distribution. This limiting distribution is not only multivariate regularly varying, but also has the property of hidden regular variation.
△ Less
Submitted 30 July, 2022;
originally announced August 2022.
-
Disentangling A Single MR Modality
Authors:
Lianrui Zuo,
Yihao Liu,
Yuan Xue,
Shuo Han,
Murat Bilgel,
Susan M. Resnick,
Jerry L. Prince,
Aaron Carass
Abstract:
Disentangling anatomical and contrast information from medical images has gained attention recently, demonstrating benefits for various image analysis tasks. Current methods learn disentangled representations using either paired multi-modal images with the same underlying anatomy or auxiliary labels (e.g., manual delineations) to provide inductive bias for disentanglement. However, these requireme…
▽ More
Disentangling anatomical and contrast information from medical images has gained attention recently, demonstrating benefits for various image analysis tasks. Current methods learn disentangled representations using either paired multi-modal images with the same underlying anatomy or auxiliary labels (e.g., manual delineations) to provide inductive bias for disentanglement. However, these requirements could significantly increase the time and cost in data collection and limit the applicability of these methods when such data are not available. Moreover, these methods generally do not guarantee disentanglement. In this paper, we present a novel framework that learns theoretically and practically superior disentanglement from single modality magnetic resonance images. Moreover, we propose a new information-based metric to quantitatively evaluate disentanglement. Comparisons over existing disentangling methods demonstrate that the proposed method achieves superior performance in both disentanglement and cross-domain image-to-image translation tasks.
△ Less
Submitted 10 May, 2022;
originally announced May 2022.
-
Preferential Attachment with Reciprocity: Properties and Estimation
Authors:
Daniel Cirkovic,
Tiandong Wang,
Sidney Resnick
Abstract:
Reciprocity in social networks helps understand information exchange between two individuals, and indicates interaction patterns between pairs of users. A recent study indicates the reciprocity coefficient of a classical directed preferential attachment (PA) model does not match empirical evidence. In this paper, we extend the classical 3-scenario directed PA model by adding an additional paramete…
▽ More
Reciprocity in social networks helps understand information exchange between two individuals, and indicates interaction patterns between pairs of users. A recent study indicates the reciprocity coefficient of a classical directed preferential attachment (PA) model does not match empirical evidence. In this paper, we extend the classical 3-scenario directed PA model by adding an additional parameter that controls the probability of creating a reciprocal edge. Our proposed model also allows edge creation between two existing nodes, making it a more realistic choice for fitting to real datasets. In addition to analysis of the theoretical properties of this PA model with reciprocity, we provide and compare two estimation procedures for the fitting of the extended model to both simulated and real datasets. The fitted models provide a good match with the empirical tail distributions of both in- and out-degrees. Other mismatched diagnostics suggest that further generalization of the model is warranted.
△ Less
Submitted 10 January, 2022;
originally announced January 2022.
-
Multidimensional representations in late-life depression: convergence in neuroimaging, cognition, clinical symptomatology and genetics
Authors:
Junhao Wen,
Cynthia H. Y. Fu,
Duygu Tosun,
Yogasudha Veturi,
Zhijian Yang,
Ahmed Abdulkadir,
Elizabeth Mamourian,
Dhivya Srinivasan,
Jingxuan Bao,
Guray Erus,
Haochang Shou,
Mohamad Habes,
Jimit Doshi,
Erdem Varol,
Scott R Mackin,
Aristeidis Sotiras,
Yong Fan,
Andrew J. Saykin,
Yvette I. Sheline,
Li Shen,
Marylyn D. Ritchie,
David A. Wolk,
Marilyn Albert,
Susan M. Resnick,
Christos Davatzikos
Abstract:
Late-life depression (LLD) is characterized by considerable heterogeneity in clinical manifestation. Unraveling such heterogeneity would aid in elucidating etiological mechanisms and pave the road to precision and individualized medicine. We sought to delineate, cross-sectionally and longitudinally, disease-related heterogeneity in LLD linked to neuroanatomy, cognitive functioning, clinical sympto…
▽ More
Late-life depression (LLD) is characterized by considerable heterogeneity in clinical manifestation. Unraveling such heterogeneity would aid in elucidating etiological mechanisms and pave the road to precision and individualized medicine. We sought to delineate, cross-sectionally and longitudinally, disease-related heterogeneity in LLD linked to neuroanatomy, cognitive functioning, clinical symptomatology, and genetic profiles. Multimodal data from a multicentre sample (N=996) were analyzed. A semi-supervised clustering method (HYDRA) was applied to regional grey matter (GM) brain volumes to derive dimensional representations. Two dimensions were identified, which accounted for the LLD-related heterogeneity in voxel-wise GM maps, white matter (WM) fractional anisotropy (FA), neurocognitive functioning, clinical phenotype, and genetics. Dimension one (Dim1) demonstrated relatively preserved brain anatomy without WM disruptions relative to healthy controls. In contrast, dimension two (Dim2) showed widespread brain atrophy and WM integrity disruptions, along with cognitive impairment and higher depression severity. Moreover, one de novo independent genetic variant (rs13120336) was significantly associated with Dim 1 but not with Dim 2. Notably, the two dimensions demonstrated significant SNP-based heritability of 18-27% within the general population (N=12,518 in UKBB). Lastly, in a subset of individuals having longitudinal measurements, Dim2 demonstrated a more rapid longitudinal decrease in GM and brain age, and was more likely to progress to Alzheimers disease, compared to Dim1 (N=1,413 participants and 7,225 scans from ADNI, BLSA, and BIOCARD datasets).
△ Less
Submitted 25 October, 2021; v1 submitted 20 October, 2021;
originally announced October 2021.
-
Disentangling Alzheimer's disease neurodegeneration from typical brain aging using machine learning
Authors:
Gyujoon Hwang,
Ahmed Abdulkadir,
Guray Erus,
Mohamad Habes,
Raymond Pomponio,
Haochang Shou,
Jimit Doshi,
Elizabeth Mamourian,
Tanweer Rashid,
Murat Bilgel,
Yong Fan,
Aristeidis Sotiras,
Dhivya Srinivasan,
John C. Morris,
Daniel Marcus,
Marilyn S. Albert,
Nick R. Bryan,
Susan M. Resnick,
Ilya M. Nasrallah,
Christos Davatzikos,
David A. Wolk
Abstract:
Neuroimaging biomarkers that distinguish between typical brain aging and Alzheimer's disease (AD) are valuable for determining how much each contributes to cognitive decline. Machine learning models can derive multi-variate brain change patterns related to the two processes, including the SPARE-AD (Spatial Patterns of Atrophy for Recognition of Alzheimer's Disease) and SPARE-BA (of Brain Aging) in…
▽ More
Neuroimaging biomarkers that distinguish between typical brain aging and Alzheimer's disease (AD) are valuable for determining how much each contributes to cognitive decline. Machine learning models can derive multi-variate brain change patterns related to the two processes, including the SPARE-AD (Spatial Patterns of Atrophy for Recognition of Alzheimer's Disease) and SPARE-BA (of Brain Aging) investigated herein. However, substantial overlap between brain regions affected in the two processes confounds measuring them independently. We present a methodology toward disentangling the two. T1-weighted MRI images of 4,054 participants (48-95 years) with AD, mild cognitive impairment (MCI), or cognitively normal (CN) diagnoses from the iSTAGING (Imaging-based coordinate SysTem for AGIng and NeurodeGenerative diseases) consortium were analyzed. First, a subset of AD patients and CN adults were selected based purely on clinical diagnoses to train SPARE-BA1 (regression of age using CN individuals) and SPARE-AD1 (classification of CN versus AD). Second, analogous groups were selected based on clinical and molecular markers to train SPARE-BA2 and SPARE-AD2: amyloid-positive (A+) AD continuum group (consisting of A+AD, A+MCI, and A+ and tau-positive CN individuals) and amyloid-negative (A-) CN group. Finally, the combined group of the AD continuum and A-/CN individuals was used to train SPARE-BA3, with the intention to estimate brain age regardless of AD-related brain changes. Disentangled SPARE models derived brain patterns that were more specific to the two types of the brain changes. Correlation between the SPARE-BA and SPARE-AD was significantly reduced. Correlation of disentangled SPARE-AD was non-inferior to the molecular measurements and to the number of APOE4 alleles, but was less to AD-related psychometric test scores, suggesting contribution of advanced brain aging to these scores.
△ Less
Submitted 8 September, 2021;
originally announced September 2021.
-
Exact and Asymptotic Tests for Sufficient Followup in Censored Survival Data
Authors:
Ross Maller,
Sidney Resnick,
Soudabeh Shemehsavar
Abstract:
The existence of immune or cured individuals in a population and whether there is sufficient followup in a sample of censored observations on their lifetimes to be confident of their presence are questions of major importance in medical survival analysis. So far only a few candidates have been put forward as possible test statistics for the existence of sufficient followup in a sample. Here we inv…
▽ More
The existence of immune or cured individuals in a population and whether there is sufficient followup in a sample of censored observations on their lifetimes to be confident of their presence are questions of major importance in medical survival analysis. So far only a few candidates have been put forward as possible test statistics for the existence of sufficient followup in a sample. Here we investigate one such statistic and give a detailed analysis, obtaining an exact finite sample as well as asymptotic distributions for it, and use these to calculate the power of the test as a function of the followup in the sample.
△ Less
Submitted 19 June, 2022; v1 submitted 6 September, 2021;
originally announced September 2021.
-
Asymptotic Dependence of In- and Out-Degrees in a Preferential Attachment Model with Reciprocity
Authors:
Tiandong Wang,
Sidney I. Resnick
Abstract:
Reciprocity characterizes the information exchange between users in a network, and some empirical studies have revealed that social networks have a high proportion of reciprocal edges. Classical directed preferential attachment (PA) models, though generating scale-free networks, may give networks with low reciprocity. This points out one potential problem of fitting a classical PA model to a given…
▽ More
Reciprocity characterizes the information exchange between users in a network, and some empirical studies have revealed that social networks have a high proportion of reciprocal edges. Classical directed preferential attachment (PA) models, though generating scale-free networks, may give networks with low reciprocity. This points out one potential problem of fitting a classical PA model to a given network dataset with high reciprocity, and indicates alternative models need to be considered. We give one possible modification of the classical PA model by including another parameter which controls the probability of adding a reciprocated edge at each step. Asymptotic analyses suggest that large in- and out-degrees become fully dependent in this modified model, as a result of the additional reciprocated edges.
△ Less
Submitted 6 August, 2021;
originally announced August 2021.
-
Measuring Reciprocity in a Directed Preferential Attachment Network
Authors:
Tiandong Wang,
Sidney Resnick
Abstract:
Empirical studies show that online social networks have not only in- and out-degree distributions with Pareto-like tails but also a high proportion of reciprocal edges. A classical directed preferential attachment (PA) model generates in- and out-degree distribution with power-law tails, but theoretical properties of the reciprocity feature in this model have not yet been studied. We derive the as…
▽ More
Empirical studies show that online social networks have not only in- and out-degree distributions with Pareto-like tails but also a high proportion of reciprocal edges. A classical directed preferential attachment (PA) model generates in- and out-degree distribution with power-law tails, but theoretical properties of the reciprocity feature in this model have not yet been studied. We derive the asymptotic results on the number of reciprocal edges between two fixed nodes, as well as the proportion of reciprocal edges in the entire PA network. We see that with certain choices of parameters, the proportion of reciprocal edges in a directed PA network is close to 0, which differs from the empirical observation. This points out one potential problem of fitting a classical PA model to a given network dataset with high reciprocity and indicates alternative models need to be considered.
△ Less
Submitted 12 March, 2021;
originally announced March 2021.
-
Splitting the Sample at the Largest Uncensored Observation
Authors:
Ross Maller,
Sidney Resnick,
Soudabeh Shemehsavar
Abstract:
We calculate finite sample and asymptotic distributions for the largest censored and uncensored survival times, and some related statistics, from a sample of survival data generated according to an iid censoring model. These statistics are important for assessing whether there is sufficient followup in the sample to be confident of the presence of immune or cured individuals in the population. A k…
▽ More
We calculate finite sample and asymptotic distributions for the largest censored and uncensored survival times, and some related statistics, from a sample of survival data generated according to an iid censoring model. These statistics are important for assessing whether there is sufficient followup in the sample to be confident of the presence of immune or cured individuals in the population. A key structural result obtained is that, conditional on the value of the largest uncensored survival time, and knowing the number of censored observations exceeding this time, the sample partitions into two independent subsamples, each subsample having the distribution of an iid sample of censored survival times, of reduced size, from truncated random variables. This result provides valuable insight into the construction of censored survival data, and facilitates the calculation of explicit finite sample formulae. We illustrate for distributions of statistics useful for testing for sufficient followup in a sample, and apply extreme value methods to derive asymptotic distributions for some of those.
△ Less
Submitted 12 September, 2021; v1 submitted 1 March, 2021;
originally announced March 2021.
-
Disentangling brain heterogeneity via semi-supervised deep-learning and MRI: dimensional representations of Alzheimer's Disease
Authors:
Zhijian Yang,
Ilya M. Nasrallah,
Haochang Shou,
Junhao Wen,
Jimit Doshi,
Mohamad Habes,
Guray Erus,
Ahmed Abdulkadir,
Susan M. Resnick,
David Wolk,
Christos Davatzikos
Abstract:
Heterogeneity of brain diseases is a challenge for precision diagnosis/prognosis. We describe and validate Smile-GAN (SeMI-supervised cLustEring-Generative Adversarial Network), a novel semi-supervised deep-clustering method, which dissects neuroanatomical heterogeneity, enabling identification of disease subtypes via their imaging signatures relative to controls. When applied to MRIs (2 studies;…
▽ More
Heterogeneity of brain diseases is a challenge for precision diagnosis/prognosis. We describe and validate Smile-GAN (SeMI-supervised cLustEring-Generative Adversarial Network), a novel semi-supervised deep-clustering method, which dissects neuroanatomical heterogeneity, enabling identification of disease subtypes via their imaging signatures relative to controls. When applied to MRIs (2 studies; 2,832 participants; 8,146 scans) including cognitively normal individuals and those with cognitive impairment and dementia, Smile-GAN identified 4 neurodegenerative patterns/axes: P1, normal anatomy and highest cognitive performance; P2, mild/diffuse atrophy and more prominent executive dysfunction; P3, focal medial temporal atrophy and relatively greater memory impairment; P4, advanced neurodegeneration. Further application to longitudinal data revealed two distinct progression pathways: P1$\rightarrow$P2$\rightarrow$P4 and P1$\rightarrow$P3$\rightarrow$P4. Baseline expression of these patterns predicted the pathway and rate of future neurodegeneration. Pattern expression offered better yet complementary performance in predicting clinical progression, compared to amyloid/tau. These deep-learning derived biomarkers offer promise for precision diagnostics and targeted clinical trial recruitment.
△ Less
Submitted 24 February, 2021;
originally announced February 2021.
-
Medical Image Harmonization Using Deep Learning Based Canonical Mapping: Toward Robust and Generalizable Learning in Imaging
Authors:
Vishnu M. Bashyam,
Jimit Doshi,
Guray Erus,
Dhivya Srinivasan,
Ahmed Abdulkadir,
Mohamad Habes,
Yong Fan,
Colin L. Masters,
Paul Maruff,
Chuanjun Zhuo,
Henry Völzke,
Sterling C. Johnson,
Jurgen Fripp,
Nikolaos Koutsouleris,
Theodore D. Satterthwaite,
Daniel H. Wolf,
Raquel E. Gur,
Ruben C. Gur,
John C. Morris,
Marilyn S. Albert,
Hans J. Grabe,
Susan M. Resnick,
R. Nick Bryan,
David A. Wolk,
Haochang Shou
, et al. (2 additional authors not shown)
Abstract:
Conventional and deep learning-based methods have shown great potential in the medical imaging domain, as means for deriving diagnostic, prognostic, and predictive biomarkers, and by contributing to precision medicine. However, these methods have yet to see widespread clinical adoption, in part due to limited generalization performance across various imaging devices, acquisition protocols, and pat…
▽ More
Conventional and deep learning-based methods have shown great potential in the medical imaging domain, as means for deriving diagnostic, prognostic, and predictive biomarkers, and by contributing to precision medicine. However, these methods have yet to see widespread clinical adoption, in part due to limited generalization performance across various imaging devices, acquisition protocols, and patient populations. In this work, we propose a new paradigm in which data from a diverse range of acquisition conditions are "harmonized" to a common reference domain, where accurate model learning and prediction can take place. By learning an unsupervised image to image canonical mapping from diverse datasets to a reference domain using generative deep learning models, we aim to reduce confounding data variation while preserving semantic information, thereby rendering the learning task easier in the reference domain. We test this approach on two example problems, namely MRI-based brain age prediction and classification of schizophrenia, leveraging pooled cohorts of neuroimaging MRI data spanning 9 sites and 9701 subjects. Our results indicate a substantial improvement in these tasks in out-of-sample data, even when training is restricted to a single site.
△ Less
Submitted 11 October, 2020;
originally announced October 2020.
-
A Directed Preferential Attachment Model with Poisson Measurement
Authors:
Tiandong Wang,
Sidney I. Resnick
Abstract:
When modeling a directed social network, one choice is to use the traditional preferential attachment model, which generates power-law tail distributions. In a traditional directed preferential attachment, every new edge is added sequentially into the network. However, for real datasets, it is common to only have coarse timestamps available, which means several new edges are created at the same ti…
▽ More
When modeling a directed social network, one choice is to use the traditional preferential attachment model, which generates power-law tail distributions. In a traditional directed preferential attachment, every new edge is added sequentially into the network. However, for real datasets, it is common to only have coarse timestamps available, which means several new edges are created at the same timestamp. Previous analyses on the evolution of social networks reveal that after reaching a stable phase, the growth of edge counts in a network follows a non-homogeneous Poisson process with a constant rate across the day but varying rates from day to day. Taking such empirical observations into account, we propose a modified preferential attachment model with Poisson measurement, and study its asymptotic behavior. This modified model is then fitted to real datasets, and we see it provides a better fit than the traditional one.
△ Less
Submitted 16 August, 2020;
originally announced August 2020.
-
Extremes of Censored and Uncensored Lifetimes in Survival Data
Authors:
Ross A. Maller,
Sidney I. Resnick
Abstract:
The i.i.d. censoring model for survival analysis assumes two independent sequences of i.i.d. positive random variables, $(T_i^*)_{1\le i\le n}$ and $(U_i)_{1\le i\le n}$. The data consists of observations on the random sequence $\big(T_i=\min(T_i^*,U_i)$ together with accompanying censor indicators. Values of $T_i$ with $T_i^*\le U_i$ are said to be uncensored, those with $T_i^*> U_i$ are censored…
▽ More
The i.i.d. censoring model for survival analysis assumes two independent sequences of i.i.d. positive random variables, $(T_i^*)_{1\le i\le n}$ and $(U_i)_{1\le i\le n}$. The data consists of observations on the random sequence $\big(T_i=\min(T_i^*,U_i)$ together with accompanying censor indicators. Values of $T_i$ with $T_i^*\le U_i$ are said to be uncensored, those with $T_i^*> U_i$ are censored. We assume that the distributions of the $T_i^*$ and $U_i$ are in the domain of attraction of the Gumbel distribution and obtain the asymptotic distributions, as sample size $n\to\infty$, of the maximum values of the censored and uncensored lifetimes in the data, and of statistics related to them. These enable us to examine questions concerning the possible existence of cured individuals in the population.
△ Less
Submitted 25 February, 2020;
originally announced February 2020.
-
Common Growth Patterns for Regional Social Networks: a Point Process Approach
Authors:
Tiandong Wang,
Sidney I. Resnick
Abstract:
Although recent research on social networks emphasizes microscopic dynamics such as retweets and social connectivity of an individual user, we focus on macroscopic growth dynamics of social network link formation. Rather than focusing on one particular dataset, we find invariant behavior in regional social networks that are geographically concentrated. Empirical findings suggest that the startup p…
▽ More
Although recent research on social networks emphasizes microscopic dynamics such as retweets and social connectivity of an individual user, we focus on macroscopic growth dynamics of social network link formation. Rather than focusing on one particular dataset, we find invariant behavior in regional social networks that are geographically concentrated. Empirical findings suggest that the startup phase of a regional network can be modeled by a self-exciting point process. After the startup phase ends, the growth of the links can be modeled by a non-homogeneous Poisson process with constant rate across the day but varying rates from day to day, plus a nightly inactive period when local users are expected to be asleep. Conclusions are drawn based on analyzing four different datasets, three of which are regional and a non-regional one is included for contrast.
△ Less
Submitted 18 November, 2019;
originally announced November 2019.
-
Asymptotic independence and support detection techniques for heavy-tailed multivariate data
Authors:
Jaakko Lehtomaa,
Sidney Resnick
Abstract:
One of the central objectives of modern risk management is to find a set of risks where the probability of multiple simultaneous catastrophic events is negligible. That is, risks are taken only when their joint behavior seems sufficiently independent. This paper aims to help to identify asymptotically independent risks by providing additional tools for describing dependence structures of multiple…
▽ More
One of the central objectives of modern risk management is to find a set of risks where the probability of multiple simultaneous catastrophic events is negligible. That is, risks are taken only when their joint behavior seems sufficiently independent. This paper aims to help to identify asymptotically independent risks by providing additional tools for describing dependence structures of multiple risks when the individual risks can obtain very large values.
The study is performed in the setting of multivariate regular variation. We show how asymptotic independence is connected to properties of the support of the angular measure and present an asymptotically consistent estimator of the support. The estimator generalizes to any dimension $N\geq 2$ and requires no prior knowledge of the support. The validity of the support estimate can be rigorously tested under mild assumptions by an asymptotically normal test statistic.
△ Less
Submitted 1 April, 2019;
originally announced April 2019.
-
3D Whole Brain Segmentation using Spatially Localized Atlas Network Tiles
Authors:
Yuankai Huo,
Zhoubing Xu,
Yunxi Xiong,
Katherine Aboud,
Prasanna Parvathaneni,
Shunxing Bao,
Camilo Bermudez,
Susan M. Resnick,
Laurie E. Cutting,
Bennett A. Landman
Abstract:
Detailed whole brain segmentation is an essential quantitative technique, which provides a non-invasive way of measuring brain regions from a structural magnetic resonance imaging (MRI). Recently, deep convolution neural network (CNN) has been applied to whole brain segmentation. However, restricted by current GPU memory, 2D based methods, downsampling based 3D CNN methods, and patch-based high-re…
▽ More
Detailed whole brain segmentation is an essential quantitative technique, which provides a non-invasive way of measuring brain regions from a structural magnetic resonance imaging (MRI). Recently, deep convolution neural network (CNN) has been applied to whole brain segmentation. However, restricted by current GPU memory, 2D based methods, downsampling based 3D CNN methods, and patch-based high-resolution 3D CNN methods have been the de facto standard solutions. 3D patch-based high resolution methods typically yield superior performance among CNN approaches on detailed whole brain segmentation (>100 labels), however, whose performance are still commonly inferior compared with multi-atlas segmentation methods (MAS) due to the following challenges: (1) a single network is typically used to learn both spatial and contextual information for the patches, (2) limited manually traced whole brain volumes are available (typically less than 50) for training a network. In this work, we propose the spatially localized atlas network tiles (SLANT) method to distribute multiple independent 3D fully convolutional networks (FCN) for high-resolution whole brain segmentation. To address the first challenge, multiple spatially distributed networks were used in the SLANT method, in which each network learned contextual information for a fixed spatial location. To address the second challenge, auxiliary labels on 5111 initially unlabeled scans were created by multi-atlas segmentation for training. Since the method integrated multiple traditional medical image processing methods with deep learning, we developed a containerized pipeline to deploy the end-to-end solution. From the results, the proposed method achieved superior performance compared with multi-atlas segmentation methods, while reducing the computational time from >30 hours to 15 minutes (https://github.com/MASILab/SLANTbrainSeg).
△ Less
Submitted 28 March, 2019;
originally announced March 2019.
-
On a minimum distance procedure for threshold selection in tail analysis
Authors:
Holger Drees,
Anja Janßen,
Sidney I. Resnick,
Tiandong Wang
Abstract:
Power-law distributions have been widely observed in different areas of scientific research. Practical estimation issues include how to select a threshold above which observations follow a power-law distribution and then how to estimate the power-law tail index. A minimum distance selection procedure (MDSP) is proposed in Clauset et al. (2009) and has been widely adopted in practice, especially in…
▽ More
Power-law distributions have been widely observed in different areas of scientific research. Practical estimation issues include how to select a threshold above which observations follow a power-law distribution and then how to estimate the power-law tail index. A minimum distance selection procedure (MDSP) is proposed in Clauset et al. (2009) and has been widely adopted in practice, especially in the analyses of social networks. However, theoretical justifications for this selection procedure remain scant. In this paper, we study the asymptotic behavior of the selected threshold and the corresponding power-law index given by the MDSP. We find that the MDSP tends to choose too high a threshold level and leads to Hill estimates with large variances and root mean squared errors for simulated data with Pareto-like tails.
△ Less
Submitted 12 February, 2020; v1 submitted 15 November, 2018;
originally announced November 2018.
-
Degree Growth Rates and Index Estimation in a Directed Preferential Attachment Model
Authors:
Tiandong Wang,
Sidney I. Resnick
Abstract:
Preferential attachment is widely used to model power-law behavior of degree distributions in both directed and undirected networks. In a directed preferential attachment model, despite the well-known marginal power-law degree distributions, not much investigation has been done on the joint behavior of the in- and out-degree growth. Also, statistical estimates of the marginal tail exponent of the…
▽ More
Preferential attachment is widely used to model power-law behavior of degree distributions in both directed and undirected networks. In a directed preferential attachment model, despite the well-known marginal power-law degree distributions, not much investigation has been done on the joint behavior of the in- and out-degree growth. Also, statistical estimates of the marginal tail exponent of the power-law degree distribution often use the Hill estimator as one of the key summary statistics, even though no theoretical justification has been given. This paper focuses on convergence of the joint empirical measure for in- and out-degrees and proves the consistency of the Hill estimator. To do this, we first derive the asymptotic behavior of the joint degree sequences by embedding the in- and out-degrees of a fixed node into a pair of switched birth processes with immigration and then establish the convergence of the joint tail empirical measure. From these steps, the consistency of the Hill estimators is obtained.
△ Less
Submitted 5 August, 2018;
originally announced August 2018.
-
Data-driven Probabilistic Atlases Capture Whole-brain Individual Variation
Authors:
Yuankai Huo,
Katherine Swett,
Susan M. Resnick,
Laurie E. Cutting,
Bennett A. Landman
Abstract:
Probabilistic atlases provide essential spatial contextual information for image interpretation, Bayesian modeling, and algorithmic processing. Such atlases are typically constructed by grouping subjects with similar demographic information. Importantly, use of the same scanner minimizes inter-group variability. However, generalizability and spatial specificity of such approaches is more limited t…
▽ More
Probabilistic atlases provide essential spatial contextual information for image interpretation, Bayesian modeling, and algorithmic processing. Such atlases are typically constructed by grouping subjects with similar demographic information. Importantly, use of the same scanner minimizes inter-group variability. However, generalizability and spatial specificity of such approaches is more limited than one might like. Inspired by Commowick "Frankenstein's creature paradigm" which builds a personal specific anatomical atlas, we propose a data-driven framework to build a personal specific probabilistic atlas under the large-scale data scheme. The data-driven framework clusters regions with similar features using a point distribution model to learn different anatomical phenotypes. Regional structural atlases and corresponding regional probabilistic atlases are used as indices and targets in the dictionary. By indexing the dictionary, the whole brain probabilistic atlases adapt to each new subject quickly and can be used as spatial priors for visualization and processing. The novelties of this approach are (1) it provides a new perspective of generating personal specific whole brain probabilistic atlases (132 regions) under data-driven scheme across sites. (2) The framework employs the large amount of heterogeneous data (2349 images). (3) The proposed framework achieves low computational cost since only one affine registration and Pearson correlation operation are required for a new subject. Our method matches individual regions better with higher Dice similarity value when testing the probabilistic atlases. Importantly, the advantage the large-scale scheme is demonstrated by the better performance of using large-scale training data (1888 images) than smaller training set (720 images).
△ Less
Submitted 6 June, 2018;
originally announced June 2018.
-
Spatially Localized Atlas Network Tiles Enables 3D Whole Brain Segmentation from Limited Data
Authors:
Yuankai Huo,
Zhoubing Xu,
Katherine Aboud,
Prasanna Parvathaneni,
Shunxing Bao,
Camilo Bermudez,
Susan M. Resnick,
Laurie E. Cutting,
Bennett A. Landman
Abstract:
Whole brain segmentation on a structural magnetic resonance imaging (MRI) is essential in non-invasive investigation for neuroanatomy. Historically, multi-atlas segmentation (MAS) has been regarded as the de facto standard method for whole brain segmentation. Recently, deep neural network approaches have been applied to whole brain segmentation by learning random patches or 2D slices. Yet, few pre…
▽ More
Whole brain segmentation on a structural magnetic resonance imaging (MRI) is essential in non-invasive investigation for neuroanatomy. Historically, multi-atlas segmentation (MAS) has been regarded as the de facto standard method for whole brain segmentation. Recently, deep neural network approaches have been applied to whole brain segmentation by learning random patches or 2D slices. Yet, few previous efforts have been made on detailed whole brain segmentation using 3D networks due to the following challenges: (1) fitting entire whole brain volume into 3D networks is restricted by the current GPU memory, and (2) the large number of targeting labels (e.g., > 100 labels) with limited number of training 3D volumes (e.g., < 50 scans). In this paper, we propose the spatially localized atlas network tiles (SLANT) method to distribute multiple independent 3D fully convolutional networks to cover overlapped sub-spaces in a standard atlas space. This strategy simplifies the whole brain learning task to localized sub-tasks, which was enabled by combing canonical registration and label fusion techniques with deep learning. To address the second challenge, auxiliary labels on 5111 initially unlabeled scans were created by MAS for pre-training. From empirical validation, the state-of-the-art MAS method achieved mean Dice value of 0.76, 0.71, and 0.68, while the proposed method achieved 0.78, 0.73, and 0.71 on three validation cohorts. Moreover, the computational time reduced from > 30 hours using MAS to ~15 minutes using the proposed method. The source code is available online https://github.com/MASILab/SLANTbrainSeg
△ Less
Submitted 5 June, 2018; v1 submitted 1 June, 2018;
originally announced June 2018.
-
Trimmed Lévy Processes and their Extremal Components
Authors:
Yuguang Ipsen,
Ross Maller,
Sidney Resnick
Abstract:
We analyse a trimmed stochastic process of the form ${}^{(r)}X_t= X_t - \sum_{i=1}^r Δ_t^{(i)}$, where $(X_t)_{t \geq 0}$ is a driftless subordinator on $\mathbb{R}$ with its jumps on $[0,t]$ ordered as $ Δ_t^{(1)}\ge Δ_t^{(2)} \cdots$. When $r\to\infty$, both ${}^{(r)}X_t \to 0$ and $Δ_t^{(r)} \to 0$ a.s. for each $t>0$, and it is interesting to study the weak limiting behaviour of…
▽ More
We analyse a trimmed stochastic process of the form ${}^{(r)}X_t= X_t - \sum_{i=1}^r Δ_t^{(i)}$, where $(X_t)_{t \geq 0}$ is a driftless subordinator on $\mathbb{R}$ with its jumps on $[0,t]$ ordered as $ Δ_t^{(1)}\ge Δ_t^{(2)} \cdots$. When $r\to\infty$, both ${}^{(r)}X_t \to 0$ and $Δ_t^{(r)} \to 0$ a.s. for each $t>0$, and it is interesting to study the weak limiting behaviour of $\bigl({}^{(r)}X_t, Δ_t^{(r)}\bigr)$ in this case. We term this "large-trimming" behaviour. Concentrating on the case $t=1$, we study joint convergence of $\bigl({}^{(r)}X_1, Δ_1^{(r)}\bigr)$ under linear normalization, assuming extreme value-related conditions on the Lévy measure of $X$ which guarantee that $Δ_1^{(r)}$ has a limit distribution with linear normalization. Allowing ${}^{(r)}X_1$ to have random centering and scaling in a natural way, we show that $\bigl({}^{(r)}X_1, Δ_1^{(r)}\bigr)$ has a bivariate normal limiting distribution, as $r\to\infty$; but replacing the random normalizations with natural deterministic ones produces non-normal limits which we can specify.
△ Less
Submitted 27 February, 2018;
originally announced February 2018.
-
Learning Implicit Brain MRI Manifolds with Deep Learning
Authors:
Camilo Bermudez,
Andrew J. Plassard,
Larry T. Davis,
Allen T. Newton,
Susan M Resnick,
Bennett A. Landman
Abstract:
An important task in image processing and neuroimaging is to extract quantitative information from the acquired images in order to make observations about the presence of disease or markers of development in populations. Having a lowdimensional manifold of an image allows for easier statistical comparisons between groups and the synthesis of group representatives. Previous studies have sought to i…
▽ More
An important task in image processing and neuroimaging is to extract quantitative information from the acquired images in order to make observations about the presence of disease or markers of development in populations. Having a lowdimensional manifold of an image allows for easier statistical comparisons between groups and the synthesis of group representatives. Previous studies have sought to identify the best mapping of brain MRI to a low-dimensional manifold, but have been limited by assumptions of explicit similarity measures. In this work, we use deep learning techniques to investigate implicit manifolds of normal brains and generate new, high-quality images. We explore implicit manifolds by addressing the problems of image synthesis and image denoising as important tools in manifold learning. First, we propose the unsupervised synthesis of T1-weighted brain MRI using a Generative Adversarial Network (GAN) by learning from 528 examples of 2D axial slices of brain MRI. Synthesized images were first shown to be unique by performing a crosscorrelation with the training set. Real and synthesized images were then assessed in a blinded manner by two imaging experts providing an image quality score of 1-5. The quality score of the synthetic image showed substantial overlap with that of the real images. Moreover, we use an autoencoder with skip connections for image denoising, showing that the proposed method results in higher PSNR than FSL SUSAN after denoising. This work shows the power of artificial networks to synthesize realistic imaging data, which can be used to improve image processing techniques and provide a quantitative framework to structural changes in the brain.
△ Less
Submitted 5 January, 2018;
originally announced January 2018.
-
Are Extreme Value Estimation Methods Useful for Network Data?
Authors:
Phyllis Wan,
Tiandong Wang,
Richard A. Davis,
Sidney I. Resnick
Abstract:
Preferential attachment is an appealing edge generating mechanism for modeling social networks. It provides both an intuitive description of network growth and an explanation for the observed power laws in degree distributions. However, there are often limitations in fitting parametric network models to data due to the complex nature of real-world networks. In this paper, we consider a semi-parame…
▽ More
Preferential attachment is an appealing edge generating mechanism for modeling social networks. It provides both an intuitive description of network growth and an explanation for the observed power laws in degree distributions. However, there are often limitations in fitting parametric network models to data due to the complex nature of real-world networks. In this paper, we consider a semi-parametric estimation approach by looking at only the nodes with large in- or out-degrees of the network. This method examines the tail behavior of both the marginal and joint degree distributions and is based on extreme value theory. We compare it with the existing parametric approaches and demonstrate how it can provide more robust estimates of parameters associated with the network when the data are corrupted or when the model is misspecified.
△ Less
Submitted 19 December, 2017;
originally announced December 2017.
-
Consistency of Hill Estimators in a Linear Preferential Attachment Model
Authors:
Tiandong Wang,
Sidney Resnick
Abstract:
Preferential attachment is widely used to model power-law behavior of degree distributions in both directed and undirected networks. Practical analyses on the tail exponent of the power-law degree distribution use the Hill estimator as one of the key summary statistics, whose consistency is justified mostly for iid data. The major goal in this paper is to answer the question whether the Hill estim…
▽ More
Preferential attachment is widely used to model power-law behavior of degree distributions in both directed and undirected networks. Practical analyses on the tail exponent of the power-law degree distribution use the Hill estimator as one of the key summary statistics, whose consistency is justified mostly for iid data. The major goal in this paper is to answer the question whether the Hill estimator is still consistent when applied to non-iid network data. To do this, we first derive the asymptotic behavior of the degree sequence via embedding the degree growth of a fixed node into a birth immigration process. We also need to show the convergence of the tail empirical measure, from which the consistency of Hill estimators is obtained. This step requires checking the concentration of degree counts. We give a proof for a particular linear preferential attachment model and use simulation results as an illustration in other choices of models.
△ Less
Submitted 15 November, 2017;
originally announced November 2017.
-
4D Multi-atlas Label Fusion using Longitudinal Images
Authors:
Yuankai Huo,
Susan M. Resnick,
Bennett A. Landman
Abstract:
Longitudinal reproducibility is an essential concern in automated medical image segmentation, yet has proven to be an elusive objective as manual brain structure tracings have shown more than 10% variability. To improve reproducibility, lon-gitudinal segmentation (4D) approaches have been investigated to reconcile tem-poral variations with traditional 3D approaches. In the past decade, multi-atlas…
▽ More
Longitudinal reproducibility is an essential concern in automated medical image segmentation, yet has proven to be an elusive objective as manual brain structure tracings have shown more than 10% variability. To improve reproducibility, lon-gitudinal segmentation (4D) approaches have been investigated to reconcile tem-poral variations with traditional 3D approaches. In the past decade, multi-atlas la-bel fusion has become a state-of-the-art segmentation technique for 3D image and many efforts have been made to adapt it to a 4D longitudinal fashion. However, the previous methods were either limited by using application specified energy function (e.g., surface fusion and multi model fusion) or only considered tem-poral smoothness on two consecutive time points (t and t+1) under sparsity as-sumption. Therefore, a 4D multi-atlas label fusion theory for general label fusion purpose and simultaneously considering temporal consistency on all time points is appealing. Herein, we propose a novel longitudinal label fusion algorithm, called 4D joint label fusion (4DJLF), to incorporate the temporal consistency modeling via non-local patch-intensity covariance models. The advantages of 4DJLF include: (1) 4DJLF is under the general label fusion framework by simul-taneously incorporating the spatial and temporal covariance on all longitudinal time points. (2) The proposed algorithm is a longitudinal generalization of a lead-ing joint label fusion method (JLF) that has proven adaptable to a wide variety of applications. (3) The spatial temporal consistency of atlases is modeled in a prob-abilistic model inspired from both voting based and statistical fusion. The pro-posed approach improves the consistency of the longitudinal segmentation while retaining sensitivity compared with original JLF approach using the same set of atlases. The method is available online in open-source.
△ Less
Submitted 29 August, 2017;
originally announced August 2017.
-
Ratios of Ordered Points of Point Processes with Regularly Varying Intensity Measures
Authors:
Yuguang Ipsen,
Ross Maller,
Sidney Resnick
Abstract:
We study limiting properties of ratios of ordered points of point processes whose intensity measures have regularly varying tails, giving a systematic treatment which points the way to "large-trimming" properties of extremal processes and a variety of applications. Our point process approach facilitates a connection with the negative binomial process of Gregoire (1984) and consequently to certain…
▽ More
We study limiting properties of ratios of ordered points of point processes whose intensity measures have regularly varying tails, giving a systematic treatment which points the way to "large-trimming" properties of extremal processes and a variety of applications. Our point process approach facilitates a connection with the negative binomial process of Gregoire (1984) and consequently to certain generalised versions of the Poisson-Dirichlet distribution.
△ Less
Submitted 30 July, 2017;
originally announced July 2017.
-
Fitting the Linear Preferential Attachment Model
Authors:
Phyllis Wan,
Tiandong Wang,
Richard A. Davis,
Sidney I. Resnick
Abstract:
Preferential attachment is an appealing mechanism for modeling power-law behavior of the degree distributions in directed social networks. In this paper, we consider methods for fitting a 5-parameter linear preferential model to network data under two data scenarios. In the case where full history of the network formation is given, we derive the maximum likelihood estimator of the parameters and s…
▽ More
Preferential attachment is an appealing mechanism for modeling power-law behavior of the degree distributions in directed social networks. In this paper, we consider methods for fitting a 5-parameter linear preferential model to network data under two data scenarios. In the case where full history of the network formation is given, we derive the maximum likelihood estimator of the parameters and show that it is strongly consistent and asymptotically normal. In the case where only a single-time snapshot of the network is available, we propose an estimation method which combines method of moments with an approximation to the likelihood. The resulting estimator is also strongly consistent and performs quite well compared to the MLE estimator. We illustrate both estimation procedures through simulated data, and explore the usage of this model in a real data example.
△ Less
Submitted 27 August, 2017; v1 submitted 8 March, 2017;
originally announced March 2017.
-
Processes of rth Largest
Authors:
Boris Buchmann,
Ross Maller,
Sidney Resnick
Abstract:
For integers $n\geq r$, we treat the $r$th largest of a sample of size $n$ as an $\mathbb{R}^\infty$-valued stochastic process in $r$ which we denote $\mathbf{M}^{(r)}$. We show that the sequence regarded in this way satisfies the Markov property. We go on to study the asymptotic behaviour of $\mathbf{M}^{(r)}$ as $r\to\infty$, and, borrowing from classical extreme value theory, show that left-tai…
▽ More
For integers $n\geq r$, we treat the $r$th largest of a sample of size $n$ as an $\mathbb{R}^\infty$-valued stochastic process in $r$ which we denote $\mathbf{M}^{(r)}$. We show that the sequence regarded in this way satisfies the Markov property. We go on to study the asymptotic behaviour of $\mathbf{M}^{(r)}$ as $r\to\infty$, and, borrowing from classical extreme value theory, show that left-tail domain of attraction conditions on the underlying distribution of the sample guarantee weak limits for both the range of $\mathbf{M}^{(r)}$ and $\mathbf{M}^{(r)}$ itself, after norming and centering. In continuous time, an analogous process $\mathbf{Y}^{(r)}r$ based on a two-dimensional Poisson process on $\mathbb{R}_+\times \mathbb{R}$ is treated similarly, but we find that the continuous time problems have a distinctive additional feature: there are always infinitely many points below the $r$th highest point up to time $t$ for any $t>0$. This necessitates a different approach to the asymptotics in this case.
△ Less
Submitted 28 July, 2016;
originally announced July 2016.
-
A multivariate nonlinear mixed effects model for longitudinal image analysis: Application to amyloid imaging
Authors:
Murat Bilgel,
Jerry L. Prince,
Dean F. Wong,
Susan M. Resnick,
Bruno M. Jedynak
Abstract:
It is important to characterize the temporal trajectories of disease-related biomarkers in order to monitor progression and identify potential points of intervention. This is especially important for neurodegenerative diseases, as therapeutic intervention is most likely to be effective in the preclinical disease stages prior to significant neuronal damage. Longitudinal neuroimaging allows for the…
▽ More
It is important to characterize the temporal trajectories of disease-related biomarkers in order to monitor progression and identify potential points of intervention. This is especially important for neurodegenerative diseases, as therapeutic intervention is most likely to be effective in the preclinical disease stages prior to significant neuronal damage. Longitudinal neuroimaging allows for the measurement of structural, functional, and metabolic integrity of the brain over time at the level of voxels. However, commonly used longitudinal analysis approaches, such as linear mixed effects models, do not account for the fact that individuals enter a study at various disease stages and progress at different rates, and generally consider each voxelwise measure independently. We propose a multivariate nonlinear mixed effects model for estimating the trajectories of voxelwise neuroimaging biomarkers from longitudinal data that accounts for such differences across individuals. The method involves the prediction of a progression score for each visit based on a collective analysis of voxelwise biomarker data within an expectation-maximization framework that efficiently handles large amounts of measurements and variable number of visits per individual, and accounts for spatial correlations among voxels. This score allows individuals with similar progressions to be aligned and analyzed together, which enables the construction of a trajectory of brain changes as a function of an underlying progression or disease stage. Application of our method to studying images of beta-amyloid deposition, a hallmark of preclinical Alzheimer's disease, suggests that precuneus is the earliest cortical region to accumulate amyloid. The proposed method can be applied to other types of longitudinal imaging data, including metabolism, blood flow, tau, and structural imaging-derived measures.
△ Less
Submitted 4 April, 2016;
originally announced April 2016.
-
Hidden Regular Variation under Full and Strong Asymptotic Dependence
Authors:
Bikramjit Das,
Sidney I. Resnick
Abstract:
Data exhibiting heavy-tails in one or more dimensions is often studied using the framework of regular variation. In a multivariate setting this requires identifying specific forms of dependence in the data; this means identifying that the data tends to concentrate along particular directions and does not cover the full space. This is observed in various data sets from finance, insurance, network t…
▽ More
Data exhibiting heavy-tails in one or more dimensions is often studied using the framework of regular variation. In a multivariate setting this requires identifying specific forms of dependence in the data; this means identifying that the data tends to concentrate along particular directions and does not cover the full space. This is observed in various data sets from finance, insurance, network traffic, social networks, etc. In this paper we discuss the notions of full and strong asymptotic dependence for bivariate data along with the idea of hidden regular variation in these cases. In a risk analysis setting, this leads to improved risk estimation accuracy when regular methods provide a zero estimate of risk. Analyses of both real and simulated data sets illustrate concepts of generation and detection of such models.
△ Less
Submitted 31 January, 2017; v1 submitted 3 February, 2016;
originally announced February 2016.
-
Multivariate Regular Variation of Discrete Mass Functions with Applications to Preferential Attachment Networks
Authors:
Tiandong Wang,
Sidney I. Resnick
Abstract:
Regular variation of a multivariate measure with a Lebesgue density implies the regular variation of its density provided the density satisfies some regularity conditions. Unlike the univariate case, the converse also requires regularity conditions. We extend these arguments to discrete mass functions and their associated measures using the concept that the the mass function can be embedded in a c…
▽ More
Regular variation of a multivariate measure with a Lebesgue density implies the regular variation of its density provided the density satisfies some regularity conditions. Unlike the univariate case, the converse also requires regularity conditions. We extend these arguments to discrete mass functions and their associated measures using the concept that the the mass function can be embedded in a continuous density function. We give two different conditions, monotonicity and convergence on the unit sphere, both of which can make the discrete function embeddable. Our results are then applied to the preferential attachment network model, and we conclude that the joint mass function of in- and out-degree is embeddable and thus regularly varying.
△ Less
Submitted 10 January, 2016;
originally announced January 2016.
-
Asymptotic Normality of In- and Out-Degree Counts in a Preferential Attachment Model
Authors:
Tiandong Wang,
Sidney I. Resnick
Abstract:
Preferential attachment in a directed scale-free graph is widely used to model the evolution of social networks. Statistical analyses of social networks often relies on node based data rather than conventional repeated sampling. For our directed edge model with preferential attachment, we prove asymptotic normality of node counts based on a martingale construction and a martingale central limit th…
▽ More
Preferential attachment in a directed scale-free graph is widely used to model the evolution of social networks. Statistical analyses of social networks often relies on node based data rather than conventional repeated sampling. For our directed edge model with preferential attachment, we prove asymptotic normality of node counts based on a martingale construction and a martingale central limit theorem. This helps justify estimation methods based on the statistics of node counts which have specified in-degree and out-degree.
△ Less
Submitted 1 October, 2015;
originally announced October 2015.
-
Asymptotic Normality of Degree Counts in a Preferential Attachment Model
Authors:
Sidney Resnick,
Gennady Samorodnitsky
Abstract:
Preferential attachment is a widely adopted paradigm for understanding the dynamics of social networks. Formal statistical inference,for instance GLM techniques, and model verification methods will require knowing test statistics are asymptotically normal even though node or count based network data is nothing like classical data from independently replicated experiments. We therefore study asympt…
▽ More
Preferential attachment is a widely adopted paradigm for understanding the dynamics of social networks. Formal statistical inference,for instance GLM techniques, and model verification methods will require knowing test statistics are asymptotically normal even though node or count based network data is nothing like classical data from independently replicated experiments. We therefore study asymptotic normality of degree counts for a sequence of growing simple undirected preferential attachment graphs. The methods of proof rely on identifying martingales and then exploiting the martingale central limit theorems.
△ Less
Submitted 27 April, 2015;
originally announced April 2015.
-
Tauberian Theory for Multivariate Regularly Varying Distributions with Application to Preferential Attachment Networks
Authors:
Sidney Resnick,
Gennady Samorodnitsky
Abstract:
Abel-Tauberian theorems relate power law behavior of distributions and their transforms. We formulate and prove a multivariate version for non-standard regularly varying measures on $\mathbb{R}_+^p$ and then apply it to prove that the joint distribution of in- and out-degree in a directed edge preferential attachement model has jointly regularly varying tails.
Abel-Tauberian theorems relate power law behavior of distributions and their transforms. We formulate and prove a multivariate version for non-standard regularly varying measures on $\mathbb{R}_+^p$ and then apply it to prove that the joint distribution of in- and out-degree in a directed edge preferential attachement model has jointly regularly varying tails.
△ Less
Submitted 24 June, 2014;
originally announced June 2014.
-
Nonstandard regular variation of in-degree and out-degree in the preferential attachment model
Authors:
Gennady Samorodnitsky,
Sidney Resnick,
Don Towsley,
Richard Davis,
Amy Willis,
Phyllis Wan
Abstract:
For the directed edge preferential attachment network growth model studied by Bollobas et al. (2003) and Krapivsky and Redner (2001), we prove that the joint distribution of in-degree and out-degree has jointly regularly varying tails. Typically the marginal tails of the in-degree distribution and the out-degree distribution have different regular variation indices and so the joint regular variati…
▽ More
For the directed edge preferential attachment network growth model studied by Bollobas et al. (2003) and Krapivsky and Redner (2001), we prove that the joint distribution of in-degree and out-degree has jointly regularly varying tails. Typically the marginal tails of the in-degree distribution and the out-degree distribution have different regular variation indices and so the joint regular variation is non-standard. Only marginal regular variation has been previously established for this distribution in the cases where the marginal tail indices are different.
△ Less
Submitted 19 May, 2014;
originally announced May 2014.
-
Generation and Detection of Multivariate Regular Variation and Hidden Regular Variation
Authors:
Bikramjit Das,
Sidney Resnick
Abstract:
We review definitions of multivariate regular variation (MRV) and hidden regular variation (HRV) for distributions of random vectors and then summarize methods for generating models exhibiting both properties. We also discuss diagnostic techniques that detect these properties in multivariate data and indicate when models exhibiting both MRV and HRV are plausible fits for the data. We illustrate ou…
▽ More
We review definitions of multivariate regular variation (MRV) and hidden regular variation (HRV) for distributions of random vectors and then summarize methods for generating models exhibiting both properties. We also discuss diagnostic techniques that detect these properties in multivariate data and indicate when models exhibiting both MRV and HRV are plausible fits for the data. We illustrate our techniques on simulated data and also two real Internet data sets.
△ Less
Submitted 23 March, 2014;
originally announced March 2014.
-
Hidden regular variation of moving average processes with heavy-tailed innovations
Authors:
Sideny I. Resnick,
Joyjit Roy
Abstract:
We look at joint regular variation properties of MA($\infty$) processes of the form $\mathbf{X} = (X_k, k \in \mathbb{Z})$ where $X_k = \sum_{j=0}^{\infty} ψ_j Z_{k-j}$ and the sequence of random variables $(Z_i, i \in \mathbb{Z})$ are i.i.d. with regularly varying tails. We use the setup of $\mathbb{M}_{\mathbb{O}}$-convergence and obtain hidden regular variation properties for $\mathbf{X}$ under…
▽ More
We look at joint regular variation properties of MA($\infty$) processes of the form $\mathbf{X} = (X_k, k \in \mathbb{Z})$ where $X_k = \sum_{j=0}^{\infty} ψ_j Z_{k-j}$ and the sequence of random variables $(Z_i, i \in \mathbb{Z})$ are i.i.d. with regularly varying tails. We use the setup of $\mathbb{M}_{\mathbb{O}}$-convergence and obtain hidden regular variation properties for $\mathbf{X}$ under suitable summability conditions on the constant coefficients $(ψ_j : j \geq 0)$. Our approach emphasizes continuity properties of mappings and produces regular variation in sequence space.
△ Less
Submitted 30 September, 2013;
originally announced September 2013.
-
Regularly Varying Measures on Metric Spaces: Hidden Regular Variation and Hidden Jumps
Authors:
Filip Lindskog,
Sidney I. Resnick,
Joyjit Roy
Abstract:
We develop a framework for regularly varying measures on complete separable metric spaces $\mathbb{S}$ with a closed cone $\mathbb{C}$ removed, extending material in Hult & Lindskog (2006), Das, Mitra & Resnick (2013). Our framework provides a flexible way to consider hidden regular variation and allows simultaneous regular variation properties to exist at different scales and provides potential f…
▽ More
We develop a framework for regularly varying measures on complete separable metric spaces $\mathbb{S}$ with a closed cone $\mathbb{C}$ removed, extending material in Hult & Lindskog (2006), Das, Mitra & Resnick (2013). Our framework provides a flexible way to consider hidden regular variation and allows simultaneous regular variation properties to exist at different scales and provides potential for more accurate estimation of probabilities of risk regions. We apply our framework to iid random variables in $\mathbb{R}_+^\infty$ with marginal distributions having regularly varying tails and to càdlàg Lévy processes whose Lévy measures have regularly varying tails. In both cases, an infinite number of regular variation properties coexist distinguished by different scaling functions and state spaces.
△ Less
Submitted 22 July, 2013;
originally announced July 2013.
-
Markov Kernels and the Conditional Extreme Value Model
Authors:
Sidney Resnick,
David Zeber
Abstract:
The classical approach to multivariate extreme value modelling assumes that the joint distribution belongs to a multivariate domain of attraction. This requires each marginal distribution be individually attracted to a univariate extreme value distribution. An apparently more flexible extremal model for multivariate data was proposed by Heffernan and Tawn under which not all the components are req…
▽ More
The classical approach to multivariate extreme value modelling assumes that the joint distribution belongs to a multivariate domain of attraction. This requires each marginal distribution be individually attracted to a univariate extreme value distribution. An apparently more flexible extremal model for multivariate data was proposed by Heffernan and Tawn under which not all the components are required to belong to an extremal domain of attraction but assumes instead the existence of an asymptotic approximation to the conditional distribution of the random vector given one of the components is extreme. Combined with the knowledge that the conditioning component belongs to a univariate domain of attraction, this leads to an approximation of the probability of certain risk regions. The original focus on conditional distributions had technical drawbacks but is natural in several contexts. We place this approach in the context of the more general approach using convergence of measures and multivariate regular variation on cones.
△ Less
Submitted 10 October, 2012;
originally announced October 2012.
-
Clustering of Markov chain exceedances
Authors:
Sidney I. Resnick,
David Zeber
Abstract:
The tail chain of a Markov chain can be used to model the dependence between extreme observations. For a positive recurrent Markov chain, the tail chain aids in describing the limit of a sequence of point processes $\{N_n,n\geq1\}$, consisting of normalized observations plotted against scaled time points. Under fairly general conditions on extremal behaviour, $\{N_n\}$ converges to a cluster Poiss…
▽ More
The tail chain of a Markov chain can be used to model the dependence between extreme observations. For a positive recurrent Markov chain, the tail chain aids in describing the limit of a sequence of point processes $\{N_n,n\geq1\}$, consisting of normalized observations plotted against scaled time points. Under fairly general conditions on extremal behaviour, $\{N_n\}$ converges to a cluster Poisson process. Our technique decomposes the sample path of the chain into i.i.d. regenerative cycles rather than using blocking argument typically employed in the context of stationarity with mixing.
△ Less
Submitted 30 September, 2013; v1 submitted 8 October, 2012;
originally announced October 2012.
-
Asymptotics of Markov Kernels and the Tail Chain
Authors:
Sidney I. Resnick,
David Zeber
Abstract:
An asymptotic model for extreme behavior of certain Markov chains is the "tail chain". Generally taking the form of a multiplicative random walk, it is useful in deriving extremal characteristics such as point process limits. We place this model in a more general context, formulated in terms of extreme value theory for transition kernels, and extend it by formalizing the distinction between extrem…
▽ More
An asymptotic model for extreme behavior of certain Markov chains is the "tail chain". Generally taking the form of a multiplicative random walk, it is useful in deriving extremal characteristics such as point process limits. We place this model in a more general context, formulated in terms of extreme value theory for transition kernels, and extend it by formalizing the distinction between extreme and non-extreme states. We make the link between the update function and transition kernel forms considered in previous work, and we show that the tail chain model leads to a multivariate regular variation property of the finite-dimensional distributions under assumptions on the marginal tails alone.
△ Less
Submitted 24 December, 2011;
originally announced December 2011.
-
Modeling Multiple Risks: Hidden Domain of Attraction
Authors:
Abhimanyu Mitra,
Sidney I. Resnick
Abstract:
Hidden regular variation is a sub-model of multivariate regular variation and facilitates accurate estimation of joint tail probabilities. We generalize the model of hidden regular variation to what we call hidden domain of attraction. We exhibit examples that illustrate the need for a more general model and discuss detection and estimation techniques.
Hidden regular variation is a sub-model of multivariate regular variation and facilitates accurate estimation of joint tail probabilities. We generalize the model of hidden regular variation to what we call hidden domain of attraction. We exhibit examples that illustrate the need for a more general model and discuss detection and estimation techniques.
△ Less
Submitted 3 October, 2011;
originally announced October 2011.
-
Living on the multi-dimensional edge: seeking hidden risks using regular variation
Authors:
Bikramjit Das,
Abhimanyu Mitra,
Sidney Resnick
Abstract:
Multivariate regular variation plays a role assessing tail risk in diverse applications such as finance, telecommunications, insurance and environmental science. The classical theory, being based on an asymptotic model, sometimes leads to inaccurate and useless estimates of probabilities of joint tail regions. This problem can be partly ameliorated by using hidden regular variation [Resnick, 2002,…
▽ More
Multivariate regular variation plays a role assessing tail risk in diverse applications such as finance, telecommunications, insurance and environmental science. The classical theory, being based on an asymptotic model, sometimes leads to inaccurate and useless estimates of probabilities of joint tail regions. This problem can be partly ameliorated by using hidden regular variation [Resnick, 2002, Mitra and Resnick, 2010]. We offer a more flexible definition of hidden regular variation that provides improved risk estimates for a larger class of risk tail regions.
△ Less
Submitted 29 August, 2011;
originally announced August 2011.