-
Subthalamic Nucleus segmentation in high-field Magnetic Resonance data. Is space normalization by template co-registration necessary?
Authors:
Tomás Lima,
Igor Varga,
Eduard Bakštein,
Daniel Novák,
Victor Alves
Abstract:
Deep Brain Stimulation (DBS) is one of the most successful methods to diminish late-stage Parkinson's Disease (PD) symptoms. It is a delicate surgical procedure which requires detailed pre-surgical patient's study. High-field Magnetic Resonance Imaging (MRI) has proven its improved capacity of capturing the Subthalamic Nucleus (STN) - the main target of DBS in PD - in greater detail than lower fie…
▽ More
Deep Brain Stimulation (DBS) is one of the most successful methods to diminish late-stage Parkinson's Disease (PD) symptoms. It is a delicate surgical procedure which requires detailed pre-surgical patient's study. High-field Magnetic Resonance Imaging (MRI) has proven its improved capacity of capturing the Subthalamic Nucleus (STN) - the main target of DBS in PD - in greater detail than lower field images. Here, we present a comparison between the performance of two different Deep Learning (DL) automatic segmentation architectures, one based in the registration to a brain template and the other performing the segmentation in in the MRI acquisition native space. The study was based on publicly available high-field 7 Tesla (T) brain MRI datasets of T1-weighted and T2-weighted sequences. nnUNet was used on the segmentation step of both architectures, while the data pre and post-processing pipelines diverged. The evaluation metrics showed that the performance of the segmentation directly in the native space yielded better results for the STN segmentation, despite not showing any advantage over the template-based method for the to other analysed structures: the Red Nucleus (RN) and the Substantia Nigra (SN).
△ Less
Submitted 22 July, 2024;
originally announced July 2024.
-
Deep Dive into MRI: Exploring Deep Learning Applications in 0.55T and 7T MRI
Authors:
Ana Carolina Alves,
André Ferreira,
Behrus Puladi,
Jan Egger,
Victor Alves
Abstract:
The development of magnetic resonance imaging (MRI) for medical imaging has provided a leap forward in diagnosis, providing a safe, non-invasive alternative to techniques involving ionising radiation exposure for diagnostic purposes. It was described by Block and Purcel in 1946, and it was not until 1980 that the first clinical application of MRI became available. Since that time the MRI has gone…
▽ More
The development of magnetic resonance imaging (MRI) for medical imaging has provided a leap forward in diagnosis, providing a safe, non-invasive alternative to techniques involving ionising radiation exposure for diagnostic purposes. It was described by Block and Purcel in 1946, and it was not until 1980 that the first clinical application of MRI became available. Since that time the MRI has gone through many advances and has altered the way diagnosing procedures are performed. Due to its ability to improve constantly, MRI has become a commonly used practice among several specialisations in medicine. Particularly starting 0.55T and 7T MRI technologies have pointed out enhanced preservation of image detail and advanced tissue characterisation. This review examines the integration of deep learning (DL) techniques into these MRI modalities, disseminating and exploring the study applications. It highlights how DL contributes to 0.55T and 7T MRI data, showcasing the potential of DL in improving and refining these technologies. The review ends with a brief overview of how MRI technology will evolve in the coming years.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
The influence of ionized gas kinematics on HII galaxies. The cases of Tol 1004-296 and Tol 0957-278
Authors:
Henri Plana,
Vitor G. Alves,
Maiara S. Carvalho
Abstract:
Blue Compact Galaxies (BCGs), also known as \HII\ galaxies, are dwarf, star-forming objects with relatively simple dynamics, which allows for the investigation of star formation mechanisms in a cleaner manner compared to late-type objects. In this study, we have examined various characteristics of the interstellar medium, in connection with the kinematics and dynamics of ionized gas, in Tol 1004-2…
▽ More
Blue Compact Galaxies (BCGs), also known as \HII\ galaxies, are dwarf, star-forming objects with relatively simple dynamics, which allows for the investigation of star formation mechanisms in a cleaner manner compared to late-type objects. In this study, we have examined various characteristics of the interstellar medium, in connection with the kinematics and dynamics of ionized gas, in Tol 1004-296 and Tol 0957-278. These two objects were observed using the SOAR Integral Field Spectrometer (SIFS) attached to the Southern Observatory for Astrophysical Research (SOAR). Both galaxies were observed with two gratings: one with medium resolution for monochromatic and abundance maps, and another with high resolution for kinematics and profile analysis. Additionally, we conducted an analysis on the velocity and velocity dispersion maps using intensity-velocity dispersion (I - $σ$) and velocity-velocity dispersion (Vr - $σ$) diagrams. Neither object exhibits a rotation pattern, and only Tol 1004-296 shows a velocity gradient between the two principal knots. However, the study reveals the significant role played by velocity dispersion in the star formation process. Specifically, we identified a relationship between monochromatic intensity, metallicity, and velocity dispersion, where high emission corresponds to low metallicity and low velocity dispersion. Tol 1004-296, in particular, exhibits a distinctive linear high velocity dispersion pattern between the two main knots, suggesting that both star formation sites are pushing the gas in opposite directions.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
A Nonhydrostatic and Mass Conserving Ground-to-Thermosphere Dynamical Core based on Specific Internal Energy
Authors:
James F. Kelly,
Felipe A. V. Alves,
John T. Emmert,
Stephen D. Eckermann,
Francis X. Giraldo,
P. Alex Reinecke
Abstract:
This paper presents the development of a deep-atmosphere, nonhydrostatic dynamical core (DyCore) targeted towards ground-thermosphere atmospheric prediction. This DyCore is based on a novel formulation of the specific internal energy equation (SIEE), which, unlike standard potential temperature formulations, is valid for variable composition atmospheres. Two versions of a SIEE are derived from bas…
▽ More
This paper presents the development of a deep-atmosphere, nonhydrostatic dynamical core (DyCore) targeted towards ground-thermosphere atmospheric prediction. This DyCore is based on a novel formulation of the specific internal energy equation (SIEE), which, unlike standard potential temperature formulations, is valid for variable composition atmospheres. Two versions of a SIEE are derived from basic principles. The first version, which uses a product-rule (PR) continuity equation, contains an additional compressible term and does not conserve mass. The second version, which does not use the product-rule (No-PR) in the continuity equation, contains two compressible terms and conserves mass to machine precision regardless of time truncation error. The pressure gradient and gravitational forces in the momentum balance equation are formulated in a manner appropriate for HA applications and the spectral element method (SEM) is used with Implicit-Explicit (IMEX) and Horizontally Explicit Vertically Implicit (HEVI) time-integration. These new equation sets were implemented in two U. S. Navy atmospheric models: the Nonhydrostatic Unified Model of the Atmosphere (NUMA) and the Navy Environmental Prediction sysTem Using a Nonhydrostatic Core (NEPTUNE). Numerical results using a nonhydrostatic and hydrostatic baroclinic instability, a balanced zonal flow, and HA mountain wave experiments are shown. These results are compared to existing deep-atmosphere dynamical cores, indicating that the proposed discretized IEE equation sets are viable next-generation ground-to-thermosphere DyCores.
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
Cost-Sensitive Learning to Defer to Multiple Experts with Workload Constraints
Authors:
Jean V. Alves,
Diogo Leitão,
Sérgio Jesus,
Marco O. P. Sampaio,
Javier Liébana,
Pedro Saleiro,
Mário A. T. Figueiredo,
Pedro Bizarro
Abstract:
Learning to defer (L2D) aims to improve human-AI collaboration systems by learning how to defer decisions to humans when they are more likely to be correct than an ML classifier. Existing research in L2D overlooks key aspects of real-world systems that impede its practical adoption, namely: i) neglecting cost-sensitive scenarios, where type 1 and type 2 errors have different costs; ii) requiring c…
▽ More
Learning to defer (L2D) aims to improve human-AI collaboration systems by learning how to defer decisions to humans when they are more likely to be correct than an ML classifier. Existing research in L2D overlooks key aspects of real-world systems that impede its practical adoption, namely: i) neglecting cost-sensitive scenarios, where type 1 and type 2 errors have different costs; ii) requiring concurrent human predictions for every instance of the training dataset and iii) not dealing with human work capacity constraints. To address these issues, we propose the deferral under cost and capacity constraints framework (DeCCaF). DeCCaF is a novel L2D approach, employing supervised learning to model the probability of human error under less restrictive data requirements (only one expert prediction per instance) and using constraint programming to globally minimize the error cost subject to workload limitations. We test DeCCaF in a series of cost-sensitive fraud detection scenarios with different teams of 9 synthetic fraud analysts, with individual work capacity constraints. The results demonstrate that our approach performs significantly better than the baselines in a wide array of scenarios, achieving an average 8.4% reduction in the misclassification cost.
△ Less
Submitted 21 March, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
-
The Pólya-Tchebotarev problem with semiclassical external fields
Authors:
Victor Alves,
Guilherme Silva
Abstract:
The classical Pólya-Tchebotarev problem, commonly stated as a max-min logarithmic energy problem, asks for finding a compact of minimal capacity in the complex plane which connects a prescribed collection of fixed points. Variants of this problem have found ramifications and applications in the theory of non-hermitian orthogonal polynomials, random matrices, approximation theory, among others. Her…
▽ More
The classical Pólya-Tchebotarev problem, commonly stated as a max-min logarithmic energy problem, asks for finding a compact of minimal capacity in the complex plane which connects a prescribed collection of fixed points. Variants of this problem have found ramifications and applications in the theory of non-hermitian orthogonal polynomials, random matrices, approximation theory, among others. Here we consider an extension of this classical problem, including a semiclassical external field, and enforcing finitely many prescribed collections of points to be connected, possibly also to infinity. Our method is based on Rakhmanov's approach to max-min problems in logarithmic potential theory, utilizes the developed machinery by Martínez-Finkelshtein and Rakhmanov on critical measures, and extends the development of Kuijlaars and the second named author from the context of polynomial external fields to the semiclassical case considered here.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
How we won BraTS 2023 Adult Glioma challenge? Just faking it! Enhanced Synthetic Data Augmentation and Model Ensemble for brain tumour segmentation
Authors:
André Ferreira,
Naida Solak,
Jianning Li,
Philipp Dammann,
Jens Kleesiek,
Victor Alves,
Jan Egger
Abstract:
Deep Learning is the state-of-the-art technology for segmenting brain tumours. However, this requires a lot of high-quality data, which is difficult to obtain, especially in the medical field. Therefore, our solutions address this problem by using unconventional mechanisms for data augmentation. Generative adversarial networks and registration are used to massively increase the amount of available…
▽ More
Deep Learning is the state-of-the-art technology for segmenting brain tumours. However, this requires a lot of high-quality data, which is difficult to obtain, especially in the medical field. Therefore, our solutions address this problem by using unconventional mechanisms for data augmentation. Generative adversarial networks and registration are used to massively increase the amount of available samples for training three different deep learning models for brain tumour segmentation, the first task of the BraTS2023 challenge. The first model is the standard nnU-Net, the second is the Swin UNETR and the third is the winning solution of the BraTS 2021 Challenge. The entire pipeline is built on the nnU-Net implementation, except for the generation of the synthetic data. The use of convolutional algorithms and transformers is able to fill each other's knowledge gaps. Using the new metric, our best solution achieves the dice results 0.9005, 0.8673, 0.8509 and HD95 14.940, 14.467, 17.699 (whole tumour, tumour core and enhancing tumour) in the validation set.
△ Less
Submitted 17 July, 2024; v1 submitted 27 February, 2024;
originally announced February 2024.
-
Deep PCCT: Photon Counting Computed Tomography Deep Learning Applications Review
Authors:
Ana Carolina Alves,
André Ferreira,
Gijs Luijten,
Jens Kleesiek,
Behrus Puladi,
Jan Egger,
Victor Alves
Abstract:
Medical imaging faces challenges such as limited spatial resolution, interference from electronic noise and poor contrast-to-noise ratios. Photon Counting Computed Tomography (PCCT) has emerged as a solution, addressing these issues with its innovative technology. This review delves into the recent developments and applications of PCCT in pre-clinical research, emphasizing its potential to overcom…
▽ More
Medical imaging faces challenges such as limited spatial resolution, interference from electronic noise and poor contrast-to-noise ratios. Photon Counting Computed Tomography (PCCT) has emerged as a solution, addressing these issues with its innovative technology. This review delves into the recent developments and applications of PCCT in pre-clinical research, emphasizing its potential to overcome traditional imaging limitations. For example PCCT has demonstrated remarkable efficacy in improving the detection of subtle abnormalities in breast, providing a level of detail previously unattainable. Examining the current literature on PCCT, it presents a comprehensive analysis of the technology, highlighting the main features of scanners and their varied applications. In addition, it explores the integration of deep learning into PCCT, along with the study of radiomic features, presenting successful applications in data processing. While acknowledging these advances, it also discusses the existing challenges in this field, paving the way for future research and improvements in medical imaging technologies. Despite the limited number of articles on this subject, due to the recent integration of PCCT at a clinical level, its potential benefits extend to various diagnostic applications.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Quantitative assessment of dosimetric effect of using alternative OAR delineations in treatment planning as functions of delineations, setup uncertainty and planning techniques using an alternative truth assessment method
Authors:
M. N. H. Rashad,
Abishek Karki,
Jason Czak,
Victor Gabriel Alves,
Hamidreza Nourzadeh,
Wookjin Choi,
Jeffrey V Siebers
Abstract:
Purpose: This study aims to quantify the variation in dose-volume histogram (DVH) and normal tissue complication probability(NTCP) metrics for head-and-neck (HN) cancer patients when alternative organ-at-risk(OAR) delineations are used for treatment planning and for treatment plan evaluation. We particularly focus on the effects of daily patient positioning/setup variations(SV) in relation to trea…
▽ More
Purpose: This study aims to quantify the variation in dose-volume histogram (DVH) and normal tissue complication probability(NTCP) metrics for head-and-neck (HN) cancer patients when alternative organ-at-risk(OAR) delineations are used for treatment planning and for treatment plan evaluation. We particularly focus on the effects of daily patient positioning/setup variations(SV) in relation to treatment technique and delineation variability.
Materials and Methods: We generated two-arc VMAT, 5-beam IMRT, and 9-beam IMRT treatment plans for a cohort of 209 HN patients. These plans incorporated five different OAR delineation sets, including manual and four automated algorithms. Each treatment plan was assessed under various simulated per-fraction patient setup uncertainties, evaluating the potential clinical impacts through DVH and NTCP metrics.
Results: The study demonstrates that increasing SV generally reduces differences in DVH metrics between alternative delineations. However, in contrast, differences in NTCP metrics tend to increase with higher setup variability. This pattern is observed consistently across different treatment plans and delineator combinations, illustrating the intricate relationship between SV and delineation accuracy. Additionally, the need for delineation accuracy in treatment planning is shown to be case-specific and dependent on factors beyond geometric variations.
Conclusions: The findings highlight the necessity for comprehensive Quality Assurance programs in radiotherapy, incorporating both dosimetric impact analysis and geometric variation assessment to ensure optimal delineation quality. The study emphasizes the complex dynamics of treatment planning in radiotherapy, advocating for personalized, case-specific strategies in clinical practice to enhance patient care quality and efficacy in the face of varying SV and delineation accuracies.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
FiFAR: A Fraud Detection Dataset for Learning to Defer
Authors:
Jean V. Alves,
Diogo Leitão,
Sérgio Jesus,
Marco O. P. Sampaio,
Pedro Saleiro,
Mário A. T. Figueiredo,
Pedro Bizarro
Abstract:
Public dataset limitations have significantly hindered the development and benchmarking of learning to defer (L2D) algorithms, which aim to optimally combine human and AI capabilities in hybrid decision-making systems. In such systems, human availability and domain-specific concerns introduce difficulties, while obtaining human predictions for training and evaluation is costly. Financial fraud det…
▽ More
Public dataset limitations have significantly hindered the development and benchmarking of learning to defer (L2D) algorithms, which aim to optimally combine human and AI capabilities in hybrid decision-making systems. In such systems, human availability and domain-specific concerns introduce difficulties, while obtaining human predictions for training and evaluation is costly. Financial fraud detection is a high-stakes setting where algorithms and human experts often work in tandem; however, there are no publicly available datasets for L2D concerning this important application of human-AI teaming. To fill this gap in L2D research, we introduce the Financial Fraud Alert Review Dataset (FiFAR), a synthetic bank account fraud detection dataset, containing the predictions of a team of 50 highly complex and varied synthetic fraud analysts, with varied bias and feature dependence. We also provide a realistic definition of human work capacity constraints, an aspect of L2D systems that is often overlooked, allowing for extensive testing of assignment systems under real-world conditions. We use our dataset to develop a capacity-aware L2D method and rejection learning approach under realistic data availability conditions, and benchmark these baselines under an array of 300 distinct testing scenarios. We believe that this dataset will serve as a pivotal instrument in facilitating a systematic, rigorous, reproducible, and transparent evaluation and comparison of L2D methods, thereby fostering the development of more synergistic human-AI collaboration in decision-making systems. The public dataset and detailed synthetic expert information are available at: https://github.com/feedzai/fifar-dataset
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
A Performance Study of Horizontally Explicit Vertically Implicit (HEVI) Time-Integrators for Non-Hydrostatic Atmospheric Models
Authors:
Francis X. Giraldo,
Felipe Augusto Ventura de Braganca Alves,
James F. Kelly,
Soonpil Kang,
P. Alex Reiencke
Abstract:
We conduct a thorough study of different forms of horizontally explicit and vertically implicit (HEVI) time-integration strategies for the compressible Euler equations on spherical domains typical of nonhydrostatic global atmospheric applications. We compare the computational time and complexity of two nonlinear variants (NHEVI-GMRES and NHEVI-LU) and a linear variant (LHEVI). We report on the per…
▽ More
We conduct a thorough study of different forms of horizontally explicit and vertically implicit (HEVI) time-integration strategies for the compressible Euler equations on spherical domains typical of nonhydrostatic global atmospheric applications. We compare the computational time and complexity of two nonlinear variants (NHEVI-GMRES and NHEVI-LU) and a linear variant (LHEVI). We report on the performance of these three variants for a number of additive Runge-Kutta Methods ranging in order of accuracy from second through fifth, and confirm the expected order of accuracy of the HEVI methods for each time-integrator. To gauge the maximum usable time-step of each HEVI method, we run simulations of a nonhydrostatic baroclinic instability for 100 days and then use this time-step to compare the time-to-solution of each method. The results show that NHEVI-LU is 2x faster than NHEVI-GMRES, and LHEVI is 5x faster than NHEVI-LU, for the idealized cases tested. The baroclinic instability and inertia-gravity wave simulations indicate that the optimal choice of time-integrator is LHEVI with either second or third order schemes, as both schemes yield similar time to solution and relative L2 error at their maximum usable time-steps. In the future, we will report on whether these results hold for more complex problems using, e.g., real atmospheric data and/or a higher model top typical of space weather applications.
△ Less
Submitted 19 November, 2023;
originally announced November 2023.
-
Multilingual Natural Language Processing Model for Radiology Reports -- The Summary is all you need!
Authors:
Mariana Lindo,
Ana Sofia Santos,
André Ferreira,
Jianning Li,
Gijs Luijten,
Gustavo Correia,
Moon Kim,
Benedikt Michael Schaarschmidt,
Cornelius Deuschl,
Johannes Haubold,
Jens Kleesiek,
Jan Egger,
Victor Alves
Abstract:
The impression section of a radiology report summarizes important radiology findings and plays a critical role in communicating these findings to physicians. However, the preparation of these summaries is time-consuming and error-prone for radiologists. Recently, numerous models for radiology report summarization have been developed. Nevertheless, there is currently no model that can summarize the…
▽ More
The impression section of a radiology report summarizes important radiology findings and plays a critical role in communicating these findings to physicians. However, the preparation of these summaries is time-consuming and error-prone for radiologists. Recently, numerous models for radiology report summarization have been developed. Nevertheless, there is currently no model that can summarize these reports in multiple languages. Such a model could greatly improve future research and the development of Deep Learning models that incorporate data from patients with different ethnic backgrounds. In this study, the generation of radiology impressions in different languages was automated by fine-tuning a model, publicly available, based on a multilingual text-to-text Transformer to summarize findings available in English, Portuguese, and German radiology reports. In a blind test, two board-certified radiologists indicated that for at least 70% of the system-generated summaries, the quality matched or exceeded the corresponding human-written summaries, suggesting substantial clinical reliability. Furthermore, this study showed that the multilingual model outperformed other models that specialized in summarizing radiology reports in only one language, as well as models that were not specifically designed for summarizing radiology reports, such as ChatGPT.
△ Less
Submitted 13 January, 2024; v1 submitted 29 September, 2023;
originally announced October 2023.
-
MedShapeNet -- A Large-Scale Dataset of 3D Medical Shapes for Computer Vision
Authors:
Jianning Li,
Zongwei Zhou,
Jiancheng Yang,
Antonio Pepe,
Christina Gsaxner,
Gijs Luijten,
Chongyu Qu,
Tiezheng Zhang,
Xiaoxi Chen,
Wenxuan Li,
Marek Wodzinski,
Paul Friedrich,
Kangxian Xie,
Yuan Jin,
Narmada Ambigapathy,
Enrico Nasca,
Naida Solak,
Gian Marco Melito,
Viet Duc Vu,
Afaque R. Memon,
Christopher Schlachta,
Sandrine De Ribaupierre,
Rajnikant Patel,
Roy Eagleson,
Xiaojun Chen
, et al. (132 additional authors not shown)
Abstract:
Prior to the deep learning era, shape was commonly used to describe the objects. Nowadays, state-of-the-art (SOTA) algorithms in medical imaging are predominantly diverging from computer vision, where voxel grids, meshes, point clouds, and implicit surface models are used. This is seen from numerous shape-related publications in premier vision conferences as well as the growing popularity of Shape…
▽ More
Prior to the deep learning era, shape was commonly used to describe the objects. Nowadays, state-of-the-art (SOTA) algorithms in medical imaging are predominantly diverging from computer vision, where voxel grids, meshes, point clouds, and implicit surface models are used. This is seen from numerous shape-related publications in premier vision conferences as well as the growing popularity of ShapeNet (about 51,300 models) and Princeton ModelNet (127,915 models). For the medical domain, we present a large collection of anatomical shapes (e.g., bones, organs, vessels) and 3D models of surgical instrument, called MedShapeNet, created to facilitate the translation of data-driven vision algorithms to medical applications and to adapt SOTA vision algorithms to medical problems. As a unique feature, we directly model the majority of shapes on the imaging data of real patients. As of today, MedShapeNet includes 23 dataset with more than 100,000 shapes that are paired with annotations (ground truth). Our data is freely accessible via a web interface and a Python application programming interface (API) and can be used for discriminative, reconstructive, and variational benchmarks as well as various applications in virtual, augmented, or mixed reality, and 3D printing. Exemplary, we present use cases in the fields of classification of brain tumors, facial and skull reconstructions, multi-class anatomy completion, education, and 3D printing. In future, we will extend the data and improve the interfaces. The project pages are: https://medshapenet.ikim.nrw/ and https://github.com/Jianningli/medshapenet-feedback
△ Less
Submitted 12 December, 2023; v1 submitted 30 August, 2023;
originally announced August 2023.
-
Treatment And Follow-Up Guidelines For Multiple Brain Metastases: A Systematic Review
Authors:
Ana Sofia Santos,
Matheus Silva,
Crystian Saraiva,
José Soares,
Victor Alves
Abstract:
Brain metastases are a complication of primary cancer, representing the most common type of brain tumor in adults. The management of multiple brain metastases represents a clinical challenge worldwide in finding the optimal treatment for patients considering various individual aspects. Managing multiple metastases with stereotactic radiosurgery (SRS) is being increasingly used because of quality o…
▽ More
Brain metastases are a complication of primary cancer, representing the most common type of brain tumor in adults. The management of multiple brain metastases represents a clinical challenge worldwide in finding the optimal treatment for patients considering various individual aspects. Managing multiple metastases with stereotactic radiosurgery (SRS) is being increasingly used because of quality of life and neurocognitive preservation, which do not present such good outcomes when dealt with whole brain radiation therapy (WBRT). After treatment, analyzing the progression of the disease still represents a clinical issue, since it is difficult to determine a standard schedule for image acquisition. A solution could be the applying artificial intelligence, namely predictive models to forecast the incidence of new metastases in post-treatment images. Although there aren't many works on this subject, this could potentially bennefit medical professionals in early decision of the best treatment approaches.
△ Less
Submitted 27 March, 2024; v1 submitted 20 July, 2023;
originally announced July 2023.
-
Renormalization of the band gap in 2D materials near an interface between two dielectrics
Authors:
Alessandra N. Braga,
Wagner P. Pires,
Jeferson Danilo L. Silva,
Danilo T. Alves,
Van Sérgio Alves
Abstract:
We investigate how the renormalization of the band gap in a planar 2D material is affected by the consideration of two nondispersive semi-infinite dielectrics, with dielectric constants $ε_1$ and $ε_2$, separated by a planar interface. Using the pseudo quantum electrodynamics to model the Coulomb interaction between electrons, we show how the renormalization of the band gap depends on $ε_1$ and…
▽ More
We investigate how the renormalization of the band gap in a planar 2D material is affected by the consideration of two nondispersive semi-infinite dielectrics, with dielectric constants $ε_1$ and $ε_2$, separated by a planar interface. Using the pseudo quantum electrodynamics to model the Coulomb interaction between electrons, we show how the renormalization of the band gap depends on $ε_1$ and $ε_2$, and also of the distance between the 2D material and the interface between the two dielectrics. In the appropriate limits, our results reproduce those found in the literature for the band gap renormalization when a single dielectric medium is considered.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Imprints of the nuclear symmetry energy slope in gravitational wave signals emanating from neutron stars
Authors:
Luiz L. Lopes,
Victor B. T. Alves,
César O. V. Flores,
German Lugones
Abstract:
We investigate possible traces of the nuclear symmetry energy slope ($L$) in the gravitational wave emission of neutron stars. For fixed stellar mass values, we examine how the slope influences the stellar radius, compactness, the tidal deformability, the frequency of the quadrupole fundamental fluid mode, and the damping time of the mode due to the gravitational wave emission. We demonstrate that…
▽ More
We investigate possible traces of the nuclear symmetry energy slope ($L$) in the gravitational wave emission of neutron stars. For fixed stellar mass values, we examine how the slope influences the stellar radius, compactness, the tidal deformability, the frequency of the quadrupole fundamental fluid mode, and the damping time of the mode due to the gravitational wave emission. We demonstrate that all these physical quantities are sensitive to the slope and could potentially impose significant constraints on it.
△ Less
Submitted 31 October, 2023; v1 submitted 24 May, 2023;
originally announced May 2023.
-
Effects of the two-dimensional Coulomb interaction in both Fermi velocity and energy gap for Dirac-like electrons at finite temperature
Authors:
Nilberto Bezerra,
Van Sérgio Alves,
Leandro O. Nascimento,
Luis Fernandez
Abstract:
We describe both the Fermi velocity and the mass renormalization due to the two-dimensional Coulomb interaction in the presence of a thermal bath. To achieve this, we consider an anisotropic version of pseudo quantum electrodynamics (PQED), within a perturbative approach in the fine-structure constant $α$. Thereafter, we use the so-called imaginary-time formalism for including the thermal bath. In…
▽ More
We describe both the Fermi velocity and the mass renormalization due to the two-dimensional Coulomb interaction in the presence of a thermal bath. To achieve this, we consider an anisotropic version of pseudo quantum electrodynamics (PQED), within a perturbative approach in the fine-structure constant $α$. Thereafter, we use the so-called imaginary-time formalism for including the thermal bath. In the limit $T\rightarrow 0$, we calculate the renormalized mass $m^R(p)$ and compare this result with the experimental findings for the energy band gap in monolayers of transition metal dichalcogenides, namely, WSe$_2$ and MoS$_2$. In these materials, the quasi-particle excitations behave as a massive Dirac-like particles in the low-energy limit, hence, its mass is related to the energy band gap of the material. In the low-temperature limit $T\ll v_F p $, where $v_F p$ is taken as the Fermi energy, we show that $m^R(p)$ decreases linearly on the temperature, i.e, $m^R(p,T)-m^R(p,T\rightarrow 0)\approx -A_αT +O(T^3)$, where $A_α$ is a positive constant. On the other hand, for the renormalized Fermi velocity, we find that $v^R_F(p,T)-v^R_F(p,T\rightarrow 0)\approx -B_αT^3 +O(T^5)$, where $B_α$ is a positive constant. We also perform numerical tests which confirm our analytical results.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
IMplicit-EXplicit Formulations for Discontinuous Galerkin Non-Hydrostatic Atmospheric Models
Authors:
Sohail Reddy,
Maciej Waruszewski,
Felipe A. V. de Braganca Alves,
Francis X. Giraldo
Abstract:
This work presents IMplicit-EXplicit (IMEX) formulations for discontinuous Galerkin (DG) discretizations of the compressible Euler equations governing non-hydrostatic atmospheric flows. In particular, we show two different IMEX formulations that not only treat the stiffness due to the governing dynamics but also the domain discretization. We present these formulations for two different equation se…
▽ More
This work presents IMplicit-EXplicit (IMEX) formulations for discontinuous Galerkin (DG) discretizations of the compressible Euler equations governing non-hydrostatic atmospheric flows. In particular, we show two different IMEX formulations that not only treat the stiffness due to the governing dynamics but also the domain discretization. We present these formulations for two different equation sets typically employed in atmospheric modeling. For both equation sets, efficient Schur complements are derived and the challenges and remedies for deriving them are discussed. The performance of these IMEX formulations of different orders are investigated on both 2D (box) and 3D (sphere) test problems and shown to achieve their theoretical rates of convergence and their efficiency with respect to both mesoscale and global applications are presented.
△ Less
Submitted 12 December, 2022;
originally announced December 2022.
-
Open-Source Skull Reconstruction with MONAI
Authors:
Jianning Li,
André Ferreira,
Behrus Puladi,
Victor Alves,
Michael Kamp,
Moon-Sung Kim,
Felix Nensa,
Jens Kleesiek,
Seyed-Ahmad Ahmadi,
Jan Egger
Abstract:
We present a deep learning-based approach for skull reconstruction for MONAI, which has been pre-trained on the MUG500+ skull dataset. The implementation follows the MONAI contribution guidelines, hence, it can be easily tried out and used, and extended by MONAI users. The primary goal of this paper lies in the investigation of open-sourcing codes and pre-trained deep learning models under the MON…
▽ More
We present a deep learning-based approach for skull reconstruction for MONAI, which has been pre-trained on the MUG500+ skull dataset. The implementation follows the MONAI contribution guidelines, hence, it can be easily tried out and used, and extended by MONAI users. The primary goal of this paper lies in the investigation of open-sourcing codes and pre-trained deep learning models under the MONAI framework. Nowadays, open-sourcing software, especially (pre-trained) deep learning models, has become increasingly important. Over the years, medical image analysis experienced a tremendous transformation. Over a decade ago, algorithms had to be implemented and optimized with low-level programming languages, like C or C++, to run in a reasonable time on a desktop PC, which was not as powerful as today's computers. Nowadays, users have high-level scripting languages like Python, and frameworks like PyTorch and TensorFlow, along with a sea of public code repositories at hand. As a result, implementations that had thousands of lines of C or C++ code in the past, can now be scripted with a few lines and in addition executed in a fraction of the time. To put this even on a higher level, the Medical Open Network for Artificial Intelligence (MONAI) framework tailors medical imaging research to an even more convenient process, which can boost and push the whole field. The MONAI framework is a freely available, community-supported, open-source and PyTorch-based framework, that also enables to provide research contributions with pre-trained models to others. Codes and pre-trained weights for skull reconstruction are publicly available at: https://github.com/Project-MONAI/research-contributions/tree/master/SkullRec
△ Less
Submitted 15 June, 2023; v1 submitted 25 November, 2022;
originally announced November 2022.
-
Towards a Digital Highway Code using Formal Modelling and Verification of Timed Automata
Authors:
Gleifer Vaz Alves,
Maike Schwammberger
Abstract:
One of the challenges in designing safe, reliable and trustworthy Autonomous Vehicles (AVs) is to ensure that the AVs abide by traffic rules. For this, the AVs need to be able to understand and reason about traffic rules. In previous work, we introduce the spatial traffic logic USL-TR to allow for the unambiguous, machine-readable, formalisation of traffic rules. This is only the first step toward…
▽ More
One of the challenges in designing safe, reliable and trustworthy Autonomous Vehicles (AVs) is to ensure that the AVs abide by traffic rules. For this, the AVs need to be able to understand and reason about traffic rules. In previous work, we introduce the spatial traffic logic USL-TR to allow for the unambiguous, machine-readable, formalisation of traffic rules. This is only the first step towards autonomous traffic agents that verifiably follow traffic rules. In this research preview, we focus on two further steps: a) retrieving behaviour diagrams directly from traffic rules and b) converting the behaviour diagrams into timed automata that are using formulae of USL-TR in guards and invariants. With this, we have a formal representation for traffic rules and can move towards the establishment of a Digital Highway Code. We briefly envision further steps which include adding environment and agent models to the timed automata to finally implement and verify these traffic rule models using a selection of formal verification tools.
△ Less
Submitted 28 September, 2022;
originally announced September 2022.
-
On the supersymmetric pseudo-QED
Authors:
Van Sérgio Alves,
M. Gomes,
A. Yu. Petrov,
A. J. da Silva
Abstract:
Within the superfield approach, we discuss the three-dimensional supersymmetric (SUSY) pseudo-QED. We prove that it is all-loop renormalizable. We demonstrate that the SUSY pseudo-QED action can be generated as a quantum correction from the coupling of a spinor gauge superfield to a set of $N$ massless complex scalar superfields. Afterwards, we calculate the two-point function of the scalar superf…
▽ More
Within the superfield approach, we discuss the three-dimensional supersymmetric (SUSY) pseudo-QED. We prove that it is all-loop renormalizable. We demonstrate that the SUSY pseudo-QED action can be generated as a quantum correction from the coupling of a spinor gauge superfield to a set of $N$ massless complex scalar superfields. Afterwards, we calculate the two-point function of the scalar superfields in the pseudo-QED which displays a divergence vanishing in a certain gauge.
△ Less
Submitted 21 March, 2023; v1 submitted 15 September, 2022;
originally announced September 2022.
-
AutoPET Challenge: Combining nn-Unet with Swin UNETR Augmented by Maximum Intensity Projection Classifier
Authors:
Lars Heiliger,
Zdravko Marinov,
Max Hasin,
André Ferreira,
Jana Fragemann,
Kelsey Pomykala,
Jacob Murray,
David Kersting,
Victor Alves,
Rainer Stiefelhagen,
Jan Egger,
Jens Kleesiek
Abstract:
Tumor volume and changes in tumor characteristics over time are important biomarkers for cancer therapy. In this context, FDG-PET/CT scans are routinely used for staging and re-staging of cancer, as the radiolabeled fluorodeoxyglucose is taken up in regions of high metabolism. Unfortunately, these regions with high metabolism are not specific to tumors and can also represent physiological uptake b…
▽ More
Tumor volume and changes in tumor characteristics over time are important biomarkers for cancer therapy. In this context, FDG-PET/CT scans are routinely used for staging and re-staging of cancer, as the radiolabeled fluorodeoxyglucose is taken up in regions of high metabolism. Unfortunately, these regions with high metabolism are not specific to tumors and can also represent physiological uptake by normal functioning organs, inflammation, or infection, making detailed and reliable tumor segmentation in these scans a demanding task. This gap in research is addressed by the AutoPET challenge, which provides a public data set with FDG-PET/CT scans from 900 patients to encourage further improvement in this field. Our contribution to this challenge is an ensemble of two state-of-the-art segmentation models, the nn-Unet and the Swin UNETR, augmented by a maximum intensity projection classifier that acts like a gating mechanism. If it predicts the existence of lesions, both segmentations are combined by a late fusion approach. Our solution achieves a Dice score of 72.12\% on patients diagnosed with lung cancer, melanoma, and lymphoma in our cross-validation. Code: https://github.com/heiligerl/autopet_submission
△ Less
Submitted 14 October, 2022; v1 submitted 2 September, 2022;
originally announced September 2022.
-
Integrating Formal Verification and Simulation-based Assertion Checking in a Corroborative V&V Process
Authors:
Maike Schwammberger,
Christopher Harper,
Gleifer Vaz Alves,
Greg Chance,
Tony Pipe,
Kerstin Eder
Abstract:
Automated Vehicles (AVs) are rapidly maturing in the transportation domain. However, the complexity of the AV design problem is such that no single technique is sufficient to provide adequate validation of key properties such as safety, reliability or trustworthiness. In this vision paper, a combination of a spatial traffic logic and agent-based verification methods with a validation method that u…
▽ More
Automated Vehicles (AVs) are rapidly maturing in the transportation domain. However, the complexity of the AV design problem is such that no single technique is sufficient to provide adequate validation of key properties such as safety, reliability or trustworthiness. In this vision paper, a combination of a spatial traffic logic and agent-based verification methods with a validation method that uses assertion checking of simulations is proposed. We sketch how to integrate the respective approaches within a methodological framework called Corroborative Verification and Validation (V&V).The Corroborative V&V framework identifies three different verification and validation levels for AVs (formal verification, simulation-based testing, real-world experiments) and specifies connections and evidence between these levels. We define specifications for the formal relationships that must be established between processes, system models and requirements models for the evidence from formal design verification and simulation-based testing to corroborate each other and enhance assurance confidence from verification and validation.
△ Less
Submitted 10 August, 2022;
originally announced August 2022.
-
A non column based fully unstructured implementation of Kessler s microphysics with warm rain using continuous and discontinuous spectral elements
Authors:
Yassine Tissaoui,
Simone Marras,
Annalisa Quaini,
Felipe A. V. De Braganca Alves,
Francix X. Giraldo
Abstract:
Numerical weather prediction is pushing the envelope of grid resolution at local and global scales alike. Aiming to model topography with higher precision, a handful of articles introduced unstructured vertical grids and tested them for dry atmospheres. The next step towards effective high-resolution unstructured grids for atmospheric modeling requires that also microphysics is independent of any…
▽ More
Numerical weather prediction is pushing the envelope of grid resolution at local and global scales alike. Aiming to model topography with higher precision, a handful of articles introduced unstructured vertical grids and tested them for dry atmospheres. The next step towards effective high-resolution unstructured grids for atmospheric modeling requires that also microphysics is independent of any vertical columns, in contrast to what is ubiquitous across operational and research models. In this paper, we present a non-column based continuous and discontinuous spectral element implementation of Kessler's microphysics with warm rain as a first step towards fully unstructured atmospheric models. We test the proposed algorithm against standard three-dimensional benchmarks for precipitating clouds and show that the results are comparable with those presented in the literature across all of the tested effective resolutions. While presented for both continuous and discontinuous spectral elements in this paper, the method that we propose can very easily be adapted to any numerical method utilized in other research and legacy codes.
△ Less
Submitted 6 July, 2022; v1 submitted 5 July, 2022;
originally announced July 2022.
-
FakeNews: GAN-based generation of realistic 3D volumetric data -- A systematic review and taxonomy
Authors:
André Ferreira,
Jianning Li,
Kelsey L. Pomykala,
Jens Kleesiek,
Victor Alves,
Jan Egger
Abstract:
With the massive proliferation of data-driven algorithms, such as deep learning-based approaches, the availability of high-quality data is of great interest. Volumetric data is very important in medicine, as it ranges from disease diagnoses to therapy monitoring. When the dataset is sufficient, models can be trained to help doctors with these tasks. Unfortunately, there are scenarios where large a…
▽ More
With the massive proliferation of data-driven algorithms, such as deep learning-based approaches, the availability of high-quality data is of great interest. Volumetric data is very important in medicine, as it ranges from disease diagnoses to therapy monitoring. When the dataset is sufficient, models can be trained to help doctors with these tasks. Unfortunately, there are scenarios where large amounts of data is unavailable. For example, rare diseases and privacy issues can lead to restricted data availability. In non-medical fields, the high cost of obtaining enough high-quality data can also be a concern. A solution to these problems can be the generation of realistic synthetic data using Generative Adversarial Networks (GANs). The existence of these mechanisms is a good asset, especially in healthcare, as the data must be of good quality, realistic, and without privacy issues. Therefore, most of the publications on volumetric GANs are within the medical domain. In this review, we provide a summary of works that generate realistic volumetric synthetic data using GANs. We therefore outline GAN-based methods in these areas with common architectures, loss functions and evaluation metrics, including their advantages and disadvantages. We present a novel taxonomy, evaluations, challenges, and research opportunities to provide a holistic overview of the current state of volumetric GANs.
△ Less
Submitted 14 February, 2024; v1 submitted 4 July, 2022;
originally announced July 2022.
-
Comparison of Sub-Grid Scale Models for Large-Eddy Simulation using a High-Order Spectral Element Approximation of the Compressible Navier-Stokes Equations at Low Mach Number
Authors:
Sohail Reddy,
Yassine Tissaoui,
Felipe A. V. de Braganca Alves,
Simone Marras,
Francis X. Giraldo
Abstract:
This study aims to identify the properties, advantages, and drawbacks of some common (and some less common) sub-grid scale (SGS) models for large eddy simulation of low Mach compressible flows using high order spectral elements. The models investigated are the classical constant coefficient Smagorinsky-Lilly, the model by Vreman and two variants of a dynamic SGS (DSGS) model designed to stabilize…
▽ More
This study aims to identify the properties, advantages, and drawbacks of some common (and some less common) sub-grid scale (SGS) models for large eddy simulation of low Mach compressible flows using high order spectral elements. The models investigated are the classical constant coefficient Smagorinsky-Lilly, the model by Vreman and two variants of a dynamic SGS (DSGS) model designed to stabilize finite and spectral elements for transport dominated problems. In particular, we compare one variant of DSGS that is based on a time-dependent residual version (R-DSGS) in contrast to a time-independent residual based scheme (T-DSGS). The SGS models are compared against the reference model by Smagorinsky and Lilly for their ability to: (i) stabilize the numerical solution, (ii) minimize undershoots and overshoots, (iii) capture/preserve discontinuities, and (iv) transfer energy across different length scales. These abilities are investigated on problems for: (1) passively advected tracers, (2) coupled, nonlinear system of equations exhibiting discontinuities, (3) gravity-driven flows in a stratified atmosphere, and (4) homogenous, isotropic turbulence. All models were able to preserve sharp discontinuities. Vreman and the R-DSGS models also reduce the undershoots and overshoots in the solution of linear and non-linear advection with sharp gradients. Our analysis shows that the R-DSGS and T-DSGS models are more robust than Vreman and Smagorinsky-Lilly for numerical stabilization of high-order spectral methods. The Smagorinsky and Vreman models are better able to resolve the finer flow structures in shear flows, while the nodal R-DSGS model shows better energy conservation. Overall, the nodal implementation of R-DSGS (in contrast to its element-based counterpart) is shown to outperform the other SGS models in most metrics listed above, and on par with respect to the remaining ones.
△ Less
Submitted 6 April, 2022;
originally announced April 2022.
-
OpenKBP-Opt: An international and reproducible evaluation of 76 knowledge-based planning pipelines
Authors:
Aaron Babier,
Rafid Mahmood,
Binghao Zhang,
Victor G. L. Alves,
Ana Maria Barragán-Montero,
Joel Beaudry,
Carlos E. Cardenas,
Yankui Chang,
Zijie Chen,
Jaehee Chun,
Kelly Diaz,
Harold David Eraso,
Erik Faustmann,
Sibaji Gaj,
Skylar Gay,
Mary Gronberg,
Bingqi Guo,
Junjun He,
Gerd Heilemann,
Sanchit Hira,
Yuliang Huang,
Fuxin Ji,
Dashan Jiang,
Jean Carlo Jimenez Giraldo,
Hoyeon Lee
, et al. (34 additional authors not shown)
Abstract:
We establish an open framework for developing plan optimization models for knowledge-based planning (KBP) in radiotherapy. Our framework includes reference plans for 100 patients with head-and-neck cancer and high-quality dose predictions from 19 KBP models that were developed by different research groups during the OpenKBP Grand Challenge. The dose predictions were input to four optimization mode…
▽ More
We establish an open framework for developing plan optimization models for knowledge-based planning (KBP) in radiotherapy. Our framework includes reference plans for 100 patients with head-and-neck cancer and high-quality dose predictions from 19 KBP models that were developed by different research groups during the OpenKBP Grand Challenge. The dose predictions were input to four optimization models to form 76 unique KBP pipelines that generated 7600 plans. The predictions and plans were compared to the reference plans via: dose score, which is the average mean absolute voxel-by-voxel difference in dose a model achieved; the deviation in dose-volume histogram (DVH) criterion; and the frequency of clinical planning criteria satisfaction. We also performed a theoretical investigation to justify our dose mimicking models. The range in rank order correlation of the dose score between predictions and their KBP pipelines was 0.50 to 0.62, which indicates that the quality of the predictions is generally positively correlated with the quality of the plans. Additionally, compared to the input predictions, the KBP-generated plans performed significantly better (P<0.05; one-sided Wilcoxon test) on 18 of 23 DVH criteria. Similarly, each optimization model generated plans that satisfied a higher percentage of criteria than the reference plans. Lastly, our theoretical investigation demonstrated that the dose mimicking models generated plans that are also optimal for a conventional planning model. This was the largest international effort to date for evaluating the combination of KBP prediction and optimization models. In the interest of reproducibility, our data and code is freely available at https://github.com/ababier/open-kbp-opt.
△ Less
Submitted 16 February, 2022;
originally announced February 2022.
-
Generation of Synthetic Rat Brain MRI scans with a 3D Enhanced Alpha-GAN
Authors:
André Ferreira,
Ricardo Magalhães,
Sébastien Mériaux,
Victor Alves
Abstract:
Translational brain research using Magnetic Resonance Imaging (MRI) is becoming increasingly popular as animal models are an essential part of scientific studies and more ultra-high-field scanners are becoming available. Some disadvantages of MRI are the availability of MRI scanners and the time required for a full scanning session (it usually takes over 30 minutes). Privacy laws and the 3Rs ethic…
▽ More
Translational brain research using Magnetic Resonance Imaging (MRI) is becoming increasingly popular as animal models are an essential part of scientific studies and more ultra-high-field scanners are becoming available. Some disadvantages of MRI are the availability of MRI scanners and the time required for a full scanning session (it usually takes over 30 minutes). Privacy laws and the 3Rs ethics rule also make it difficult to create large datasets for training deep learning models. Generative Adversarial Networks (GANs) can perform data augmentation with higher quality than other techniques. In this work, the alpha-GAN architecture is used to test its ability to produce realistic 3D MRI scans of the rat brain. As far as the authors are aware, this is the first time that a GAN-based approach has been used for data augmentation in preclinical data. The generated scans are evaluated using various qualitative and quantitative metrics. A Turing test conducted by 4 experts has shown that the generated scans can trick almost any expert. The generated scans were also used to evaluate their impact on the performance of an existing deep learning model developed for segmenting the rat brain into white matter, grey matter and cerebrospinal fluid. The models were compared using the Dice score. The best results for whole brain and white matter segmentation were obtained when 174 real scans and 348 synthetic scans were used, with improvements of 0.0172 and 0.0129, respectively. Using 174 real scans and 87 synthetic scans resulted in improvements of 0.0038 and 0.0764 for grey matter and CSF segmentation, respectively. Thus, by using the proposed new normalisation layer and loss functions, it was possible to improve the realism of the generated rat MRI scans and it was shown that using the generated data improved the segmentation model more than using the conventional data augmentation.
△ Less
Submitted 4 January, 2022; v1 submitted 27 December, 2021;
originally announced December 2021.
-
In-storage Processing of I/O Intensive Applications on Computational Storage Drives
Authors:
Ali HeydariGorji,
Mahdi Torabzadehkashi,
Siavash Rezaei,
Hossein Bobarshad,
Vladimir Alves,
Pai H. Chou
Abstract:
Computational storage drives (CSD) are solid-state drives (SSD) empowered by general-purpose processors that can perform in-storage processing. They have the potential to improve both performance and energy significantly for big-data analytics by bringing compute to data, thereby eliminating costly data transfer while offering better privacy. In this work, we introduce Solana, the first-ever high-…
▽ More
Computational storage drives (CSD) are solid-state drives (SSD) empowered by general-purpose processors that can perform in-storage processing. They have the potential to improve both performance and energy significantly for big-data analytics by bringing compute to data, thereby eliminating costly data transfer while offering better privacy. In this work, we introduce Solana, the first-ever high-capacity(12-TB) CSD in E1.S form factor, and present an actual prototype for evaluation. To demonstrate the benefits of in-storage processing on CSD, we deploy several natural language processing (NLP) applications on datacenter-grade storage servers comprised of clusters of the Solana. Experimental results show up to 3.1x speedup in processing while reducing the energy consumption and data transfer by 67% and 68%, respectively, compared to regular enterprise SSDs.
△ Less
Submitted 23 December, 2021;
originally announced December 2021.
-
Gross patient error detection via cine transmission dosimetry
Authors:
Nguyen Phuong Dang,
Victor Gabriel Leandro Alves,
Mahmoud Ahmed,
Jeffrey Siebers
Abstract:
$\textbf{Purpose:}$ To quantify the effectiveness of EPID-based cine transmission dosimetry to detect gross patient anatomic errors. $\textbf{Method and Materials:}…
▽ More
$\textbf{Purpose:}$ To quantify the effectiveness of EPID-based cine transmission dosimetry to detect gross patient anatomic errors. $\textbf{Method and Materials:}$ EPID image frames resulting from fluence transmitted through multiple patients anatomies are simulated for 100 msec delivery intervals for hypothetical 6 MV VMAT deliveries. Frames simulated through 10 head-and-neck CTs and 19 prostate CTs with and without 1-3 mm shift and 1-3 degree rotations were used to quantify expected in-tolerance clinical setup variations. Per-frame analysis methods to determine if simulated gross errors of (a) 10-20 mm patient miss alignment offsets and (b) 15-20 degree patient rotations could be reliably distinguished from the above baseline variations. For the prostate image sets, frames simulated through the reference CT are intercompared with (c) frames through 8-13 different CT's for the same patient to quantify expected inter-treatment frame variation. ROC analysis of per-frame error discrimination based upon (i) frame image differences, (ii) frame histogram comparisons, (iii) image feature matching, and (iv) image distance were used to quantify error detectability. $\textbf{Results:}$ Each error detection method was able to distinguish gross patient miss-alignment and gross rotations from in-tolerance levels for both H&N and prostate datasets. The image distance algorithm is the best method based on AUC. $\textbf{Conclusion:}$ In-field gross error detection was possible for gross patient miss-alignments and incorrect patients. For prostate cases, the methods used were able to distinguish different patients from daily patient variations.
△ Less
Submitted 11 November, 2021;
originally announced November 2021.
-
DeepDoseNet: A Deep Learning model for 3D Dose Prediction in Radiation Therapy
Authors:
Mumtaz Hussain Soomro,
Victor Gabriel Leandro Alves,
Hamidreza Nourzadeh,
Jeffrey V. Siebers
Abstract:
The DeepDoseNet 3D dose prediction model based on ResNet and Dilated DenseNet is proposed. The 340 head-and-neck datasets from the 2020 AAPM OpenKBP challenge were utilized, with 200 for training, 40 for validation, and 100 for testing. Structures include 56Gy, 63Gy, 70Gy PTVs, and brainstem, spinal cord, right parotid, left parotid, larynx, esophagus, and mandible OARs. Mean squared error (MSE) l…
▽ More
The DeepDoseNet 3D dose prediction model based on ResNet and Dilated DenseNet is proposed. The 340 head-and-neck datasets from the 2020 AAPM OpenKBP challenge were utilized, with 200 for training, 40 for validation, and 100 for testing. Structures include 56Gy, 63Gy, 70Gy PTVs, and brainstem, spinal cord, right parotid, left parotid, larynx, esophagus, and mandible OARs. Mean squared error (MSE) loss, mean absolute error (MAE) loss, and MAE plus dose-volume histogram (DVH) based loss functions were investigated. Each model's performance was compared using a 3D dose score, $\bar{S_{D}}$, (mean absolute difference between ground truth and predicted 3D dose distributions) and a DVH score, $\bar{S_{DVH}}$ (mean absolute difference between ground truth and predicted dose-volume metrics).Furthermore, DVH metrics Mean[Gy] and D0.1cc [Gy] for OARs and D99%, D95%, D1% for PTVs were computed. DeepDoseNet with the MAE plus DVH-based loss function had the best dose score performance of the OpenKBP entries. MAE+DVH model had the lowest prediction error (P<0.0001, Wilcoxon test) on validation and test datasets (validation: $\bar{S_{D}}$=2.3Gy, $\bar{S_{DVH}}$=1.9Gy; test: $\bar{S_{D}}$=2.0Gy, $\bar{S_{DVH}}$=1.6Gy) followed by the MAE model (validation: $\bar{S_{D}}$=3.6Gy, $\bar{S_{DVH}}$=2.4Gy; test: $\bar{S_{D}}$=3.5Gy, $\bar{S_{DVH}}$=2.3Gy). The MSE model had the highest prediction error (validation: $\bar{S_{D}}$=3.7Gy, $\bar{S_{DVH}}$=3.2Gy; test: $\bar{S_{D}}$=3.6Gy, $\bar{S_{DVH}}$=3.0Gy). No significant difference was found among models in terms of Mean [Gy], but the MAE+DVH model significantly outperformed the MAE and MSE models in terms of D0.1cc[Gy], particularly for mandible and parotids on both validation (P<0.01) and test (P<0.0001) datasets. MAE+DVH outperformed (P<0.0001) in terms of D99%, D95%, D1% for targets. MAE+DVH reduced $\bar{S_{D}}$ by ~60% and $\bar{S_{DVH}}$ by ~70%.
△ Less
Submitted 29 October, 2021;
originally announced November 2021.
-
Extending Urban Multi-Lane Spatial Logic to Formalise Road Junction Rules
Authors:
Maike Schwammberger,
Gleifer Vaz Alves
Abstract:
During the design of autonomous vehicles (AVs), several stages should include a verification process to guarantee that the AV is driving safely on the roads. One of these stages is to assure the AVs abide by the road traffic rules. To include road traffic rules in the design of an AV, a precise and unambiguous formalisation of these rules is needed. However, only recently this has been pointed out…
▽ More
During the design of autonomous vehicles (AVs), several stages should include a verification process to guarantee that the AV is driving safely on the roads. One of these stages is to assure the AVs abide by the road traffic rules. To include road traffic rules in the design of an AV, a precise and unambiguous formalisation of these rules is needed. However, only recently this has been pointed out as an issue for the design of AVs and the few works on this only capture the temporal aspects of the rules, leaving behind the spatial aspects. Here, we extend the spatial traffic logic, Urban Multi-lane Spatial Logic, to formalise a subset of the UK road junction rules, where both temporal and spatial aspects of the rules are captured. Our approach has an abstraction level for urban road junctions that could easily promote the formalisation of the whole set of road junction rules and we exemplarily formalise three of the UK road junction rules. Once we have the whole set formalised, we will model, implement, and formally verify the behaviour of an AV against road traffic rules so that guidelines for the creation of a Digital Highway Code for AVs can be established.
△ Less
Submitted 24 October, 2021;
originally announced October 2021.
-
Effects of the Pseudo-Chern-Simons action for strongly correlated electrons in the plane
Authors:
Rodrigo F. Ozela,
Van Sérgio Alves,
Gabriel C. Magalhães,
Leandro O. Nascimento
Abstract:
Chiral symmetry breaking comes from the mass dynamically generated through interaction of Dirac fermions for both quantum electrodynamics in (2+1)D (QED3) and (3+1)D (QED4). In QED3, the presence of a Chern-Simons (CS) parameter affects the critical structure of the theory, favoring the symmetric phase where the electron remains massless. Here, we calculate the main effects of a Pseudo-Chern-Simon…
▽ More
Chiral symmetry breaking comes from the mass dynamically generated through interaction of Dirac fermions for both quantum electrodynamics in (2+1)D (QED3) and (3+1)D (QED4). In QED3, the presence of a Chern-Simons (CS) parameter affects the critical structure of the theory, favoring the symmetric phase where the electron remains massless. Here, we calculate the main effects of a Pseudo-Chern-Simons (PCS) parameter $θ$ into the dynamical mass generation of Pseudo quantum electrodynamics (PQED). The $θ$-parameter provides a mass scale for PQED at classical level and appears as the pole of the gauge-field propagator. After calculating the full electron propagator with the Schwinger-Dyson equation at quenched-rainbow and large-$N$ approximations, we conclude that $θ$ affects the critical parameters related to the fine-structure constant, $α_c(θ)$, and to the number of copies of the matter field, $N_c(θ)$, by favoring the symmetric phase. In the continuum limit ($Λ\to \infty$), nevertheless, the $θ$-parameter do not affect the critical parameters. We also compare our analytical results with numerical findings of the integral equation for the mass function of the electron.
△ Less
Submitted 3 October, 2021;
originally announced October 2021.
-
OARnet: Automated organs-at-risk delineation in Head and Neck CT images
Authors:
Mumtaz Hussain Soomro,
Hamidreza Nourzadeh,
Victor Gabriel Leandro Alves,
Wookjin Choi,
Jeffrey V. Siebers
Abstract:
A 3D deep learning model (OARnet) is developed and used to delineate 28 H&N OARs on CT images. OARnet utilizes a densely connected network to detect the OAR bounding-box, then delineates the OAR within the box. It reuses information from any layer to subsequent layers and uses skip connections to combine information from different dense block levels to progressively improve delineation accuracy. T…
▽ More
A 3D deep learning model (OARnet) is developed and used to delineate 28 H&N OARs on CT images. OARnet utilizes a densely connected network to detect the OAR bounding-box, then delineates the OAR within the box. It reuses information from any layer to subsequent layers and uses skip connections to combine information from different dense block levels to progressively improve delineation accuracy. Training uses up to 28 expert manual delineated (MD) OARs from 165 CTs. Dice similarity coefficient (DSC) and the 95th percentile Hausdorff distance (HD95) with respect to MD is assessed for 70 other CTs. Mean, maximum, and root-mean-square dose differences with respect to MD are assessed for 56 of the 70 CTs. OARnet is compared with UaNet, AnatomyNet, and Multi-Atlas Segmentation (MAS). Wilcoxon signed-rank tests using 95% confidence intervals are used to assess significance. Wilcoxon signed ranked tests show that, compared with UaNet, OARnet improves (p<0.05) the DSC (23/28 OARs) and HD95 (17/28). OARnet outperforms both AnatomyNet and MAS for DSC (28/28) and HD95 (27/28). Compared with UaNet, OARnet improves median DSC up to 0.05 and HD95 up to 1.5mm. Compared with AnatomyNet and MAS, OARnet improves median (DSC, HD95) by up to (0.08, 2.7mm) and (0.17, 6.3mm). Dosimetrically, OARnet outperforms UaNet (Dmax 7/28; Dmean 10/28), AnatomyNet (Dmax 21/28; Dmean 24/28), and MAS (Dmax 22/28; Dmean 21/28). The DenseNet architecture is optimized using a hybrid approach that performs OAR-specific bounding box detection followed by feature recognition. Compared with other auto-delineation methods, OARnet is better than or equal to UaNet for all but one geometric (Temporal Lobe L, HD95) and one dosimetric (Eye L, mean dose) endpoint for the 28 H&N OARs, and is better than or equal to both AnatomyNet and MAS for all OARs.
△ Less
Submitted 23 November, 2021; v1 submitted 31 August, 2021;
originally announced August 2021.
-
Bosonization, mass generation, and the pseudo Chern-Simons action
Authors:
Gabriel C. Magalhães,
Van S. Alves,
Leandro O. Nascimento,
Eduardo C. Marino
Abstract:
We discuss several aspects of a generalization of the Chern-Simons action containing the pseudo-differential operator$\sqrt{-\Box}$, which we shall call pseudo Chern-Simons (PCS). Firstly, we derive the PCS from the bosonization of free massive Dirac particles in (2+1)D in the limit when $m^2\ll p^2$, where $m$ is the fermion mass and $p$ is its momentum. In this regime, the whole bosonized action…
▽ More
We discuss several aspects of a generalization of the Chern-Simons action containing the pseudo-differential operator$\sqrt{-\Box}$, which we shall call pseudo Chern-Simons (PCS). Firstly, we derive the PCS from the bosonization of free massive Dirac particles in (2+1)D in the limit when $m^2\ll p^2$, where $m$ is the fermion mass and $p$ is its momentum. In this regime, the whole bosonized action also has a modified Maxwell term, involving the same pseudo-differential operator. Furthermore, the large-mass $m^2\gg p^2$ regime is also considered. We also investigate the main effects of the PCS term into the Pseudo quantum electrodynamics (PQED), which describes the electromagnetic interactions between charged particles in (2+1)D. We show that the massless gauge field of PQED becomes massive in the presence of a PCS term, without the need of a Higgs mechanism. In the nonrelativistic limit, we show that the static potential has a repulsive term (given by the Coulomb potential) and an attractive part (given by a sum of special functions), whose competition generates bound states of particles with the same charge. Having in mind two-dimensional materials, we also conclude that the presence of a PCS term does not affect the renormalization either of the Fermi velocity and of the band gap in a Dirac-like material.
△ Less
Submitted 1 June, 2021; v1 submitted 3 April, 2021;
originally announced April 2021.
-
Influence of the four-fermion interactions in (2+1)D massive electrons system
Authors:
Luis Fernández,
Van Sérgio Alves,
M. Gomes,
Leandro O. Nascimento,
Francisco Peña
Abstract:
The description of the electromagnetic interaction in two-dimensional Dirac materials, such as graphene and transition-metal dichalcogenides, in which electrons move in the plane and interact via virtual photons in 3d, leads naturally to the emergence of a projected non-local theory, called pseudo-quantum electrodynamics (PQED), as an effective model suitable for describing electromagnetic interac…
▽ More
The description of the electromagnetic interaction in two-dimensional Dirac materials, such as graphene and transition-metal dichalcogenides, in which electrons move in the plane and interact via virtual photons in 3d, leads naturally to the emergence of a projected non-local theory, called pseudo-quantum electrodynamics (PQED), as an effective model suitable for describing electromagnetic interaction in these systems. In this work, we investigate the role of a complete set of four-fermion interactions in the renormalization group functions when we coupled it with the anisotropic version of massive PQED, where we take into account the fact that the Fermi velocity is not equal to the light velocity. We calculate the electron self-energy in the dominant order in the $1/N$ expansion in the regime where $m ^ 2 \ll p ^ 2$. We show that the Fermi velocity renormalization is insensitive to the presence of quartic fermionic interactions, whereas the renormalized mass may have two different asymptotic behaviors at the high-density limit, which means a high-energy scale.
△ Less
Submitted 14 March, 2021;
originally announced March 2021.
-
Chaotic Logistic Map Forecast using Fuzzy Time Series
Authors:
Lucas Vinícius Ribeiro Alves
Abstract:
This paper deals with the problem of forecast the Logistic Chaotic Map using Fuzzy Times Series (FTS). Chaotic Systems are very sensible to changes in its parameters and in the initial conditions, turning them into hard systems to model and forecast. In this case, we relay in the robustness of Fuzzy Time Series to model and forecast the logistic map. We use the Akaike Information Criterion (AIC) a…
▽ More
This paper deals with the problem of forecast the Logistic Chaotic Map using Fuzzy Times Series (FTS). Chaotic Systems are very sensible to changes in its parameters and in the initial conditions, turning them into hard systems to model and forecast. In this case, we relay in the robustness of Fuzzy Time Series to model and forecast the logistic map. We use the Akaike Information Criterion (AIC) as an index to determine the number of sub intervals for the definition of the fuzzy set.
△ Less
Submitted 10 March, 2021;
originally announced March 2021.
-
New Coalescences for the Painlevé Equations
Authors:
V. C. C. Alves
Abstract:
The Painlevé equations are here connected to other classes of equations with the Painlevé Property (Ince's equations) by the same degeneracy procedure that connects the Painlevé equations (coalescence). These Ince's equations here are also connected among themselves like in the traditional Painlevé's coalescence cascade. Such degeneracy is considered also for the special equations, symmetric equat…
▽ More
The Painlevé equations are here connected to other classes of equations with the Painlevé Property (Ince's equations) by the same degeneracy procedure that connects the Painlevé equations (coalescence). These Ince's equations here are also connected among themselves like in the traditional Painlevé's coalescence cascade. Such degeneracy is considered also for the special equations, symmetric equations and Bäcklund transformations.
△ Less
Submitted 3 March, 2021;
originally announced March 2021.
-
Gauge Symmetry Origin of Bäcklund Transformations for Painlevé Equations
Authors:
V. C. C. Alves,
H. Aratyn,
J. F. Gomes,
A. H. Zimerman
Abstract:
We identify the self-similarity limit of the second flow of $sl(N)$ mKdV hierarchy with the periodic dressing chain thus establishing % a connection to $A^{(1)}_{N-1}$ invariant
Painlevé equations. The $A^{(1)}_{N-1}$ Bäcklund symmetries of dressing equations and Painlevé equations are obtained in the self-similarity limit of gauge transformations of the mKdV hierarchy realized as zero-curvature…
▽ More
We identify the self-similarity limit of the second flow of $sl(N)$ mKdV hierarchy with the periodic dressing chain thus establishing % a connection to $A^{(1)}_{N-1}$ invariant
Painlevé equations. The $A^{(1)}_{N-1}$ Bäcklund symmetries of dressing equations and Painlevé equations are obtained in the self-similarity limit of gauge transformations of the mKdV hierarchy realized as zero-curvature equations on the loop algebra $\widehat{sl}(N)$ endowed with a principal gradation.
△ Less
Submitted 14 January, 2021;
originally announced January 2021.
-
Combining unsupervised and supervised learning for predicting the final stroke lesion
Authors:
Adriano Pinto,
Sérgio Pereira,
Raphael Meier,
Roland Wiest,
Victor Alves,
Mauricio Reyes,
Carlos A. Silva
Abstract:
Predicting the final ischaemic stroke lesion provides crucial information regarding the volume of salvageable hypoperfused tissue, which helps physicians in the difficult decision-making process of treatment planning and intervention. Treatment selection is influenced by clinical diagnosis, which requires delineating the stroke lesion, as well as characterising cerebral blood flow dynamics using n…
▽ More
Predicting the final ischaemic stroke lesion provides crucial information regarding the volume of salvageable hypoperfused tissue, which helps physicians in the difficult decision-making process of treatment planning and intervention. Treatment selection is influenced by clinical diagnosis, which requires delineating the stroke lesion, as well as characterising cerebral blood flow dynamics using neuroimaging acquisitions. Nonetheless, predicting the final stroke lesion is an intricate task, due to the variability in lesion size, shape, location and the underlying cerebral haemodynamic processes that occur after the ischaemic stroke takes place. Moreover, since elapsed time between stroke and treatment is related to the loss of brain tissue, assessing and predicting the final stroke lesion needs to be performed in a short period of time, which makes the task even more complex. Therefore, there is a need for automatic methods that predict the final stroke lesion and support physicians in the treatment decision process. We propose a fully automatic deep learning method based on unsupervised and supervised learning to predict the final stroke lesion after 90 days. Our aim is to predict the final stroke lesion location and extent, taking into account the underlying cerebral blood flow dynamics that can influence the prediction. To achieve this, we propose a two-branch Restricted Boltzmann Machine, which provides specialized data-driven features from different sets of standard parametric Magnetic Resonance Imaging maps. These data-driven feature maps are then combined with the parametric Magnetic Resonance Imaging maps, and fed to a Convolutional and Recurrent Neural Network architecture. We evaluated our proposal on the publicly available ISLES 2017 testing dataset, reaching a Dice score of 0.38, Hausdorff Distance of 29.21 mm, and Average Symmetric Surface Distance of 5.52 mm.
△ Less
Submitted 2 January, 2021;
originally announced January 2021.
-
Dynamical Mass Generation in Pseudo Quantum Electrodynamics with Gross-Neveu Interaction at finite temperature
Authors:
Luis Fernández,
Reginaldo O. Corrêa Jr.,
Van Sérgio Alves,
Leandro O. Nascimento,
Francisco Peña
Abstract:
We study the dynamical mass generation in Pseudo Quantum Electrodynamics (PQED) coupled to the Gross-Neveu (GN) interaction, in (2+1) dimensions, at both zero and finite temperatures. We start with a gapless model and show that, under particular conditions, a dynamically generated mass emerges. In order to do so, we use a truncated Schwinger-Dyson equation, at the large-N approximation, in the ima…
▽ More
We study the dynamical mass generation in Pseudo Quantum Electrodynamics (PQED) coupled to the Gross-Neveu (GN) interaction, in (2+1) dimensions, at both zero and finite temperatures. We start with a gapless model and show that, under particular conditions, a dynamically generated mass emerges. In order to do so, we use a truncated Schwinger-Dyson equation, at the large-N approximation, in the imaginary-time formalism. In the instantaneous-exchange approximation (the static regime), we obtain two critical parameters, namely, the critical number of fermions $N_c(T)$ and the critical coupling constant $α_c(T)$ as a function of temperature and of the cutoff $Λ$, which must be provided by experiments. In the dynamical regime, we find an analytical solution for the mass function $Σ(p,T)$ as well as a zero-external momentum solution for $p=0$. We compare our analytical results with numerical tests and a good agreement is found.
△ Less
Submitted 18 September, 2020;
originally announced September 2020.
-
Coalescence, Deformation and Bäcklund Symmetries of Painlevé IV and II Equations
Authors:
V. C. C. Alves,
H. Aratyn,
J. F. Gomes,
A. H. Zimerman
Abstract:
We extend Painlevé IV model by adding quadratic terms to its Hamiltonian obtaining two classes of models (coalescence and deformation) that interpolate between Painlevé IV and II equations for special limits of the underlying parameters. We derive the underlying Bäcklund transformations, symmetry structure and requirements to satisfy Painlevé property.
We extend Painlevé IV model by adding quadratic terms to its Hamiltonian obtaining two classes of models (coalescence and deformation) that interpolate between Painlevé IV and II equations for special limits of the underlying parameters. We derive the underlying Bäcklund transformations, symmetry structure and requirements to satisfy Painlevé property.
△ Less
Submitted 10 September, 2020;
originally announced September 2020.
-
The influence of a conducting surface on the conductivity of graphene
Authors:
D. C. Pedrelli,
D. T. Alves,
V. S. Alves
Abstract:
In the present paper, using Pseudo-Quantum Electrodynamics to describe the interaction between electrons in graphene, we investigate the longitudinal and optical conductivities of a neutral graphene sheet near a grounded perfectly conducting surface, with calculations up to 2-loop perturbation order. We show that the longitudinal conductivity increases as we bring the conducting surface closer to…
▽ More
In the present paper, using Pseudo-Quantum Electrodynamics to describe the interaction between electrons in graphene, we investigate the longitudinal and optical conductivities of a neutral graphene sheet near a grounded perfectly conducting surface, with calculations up to 2-loop perturbation order. We show that the longitudinal conductivity increases as we bring the conducting surface closer to the graphene sheet. On the other hand, although the optical conductivity initially increases with the proximity of the plate, it reaches a maximum value, tending, afterwards, to the minimal conductivity in the ideal limit of no separation between graphene and the conducting surface. We recover the correspondent results in the literature when the distance to the plate tends to infinity. Our results may be useful as an alternative way to control the longitudinal and optical conductivities of graphene.
△ Less
Submitted 2 September, 2020;
originally announced September 2020.
-
Secure Recovery Procedure for Manufacturing Systems using Synchronizing Automata and Supervisory Control Theory
Authors:
Lucas V. R. Alves,
Patricia N. Pena
Abstract:
Manufacturing systems may be subject to external attacks and failures, so it is important to deal with the recovery of the system after these situations. This paper deals with the problem of recovering a manufacturing system, modeled as a Discrete Event System (DES) using the Supervisory Control Theory (SCT), when the control structure, called supervisor, desynchronizes from the physical plant. Th…
▽ More
Manufacturing systems may be subject to external attacks and failures, so it is important to deal with the recovery of the system after these situations. This paper deals with the problem of recovering a manufacturing system, modeled as a Discrete Event System (DES) using the Supervisory Control Theory (SCT), when the control structure, called supervisor, desynchronizes from the physical plant. The desynchronization may be seen as plant and supervisor being in uncorresponding states. The recovery of the system may be attained if there is a word, the synchronizing word, that regardless the state of each one of them, brings the system and supervisor back to a known state. The concepts of synchronizing automata are used to do so. In this paper we show under what conditions a set of synchronizing plants and specifications leads to a synchronizing supervisor obtained by the Supervisory Control Theory. The problem is extended to cope with multiple supervisors, proposing a local recovery when possible. We also present a simple way to model problems, composed of machines and buffers, as synchronizing automata such that it is always possible do restore synchronization between the control (supervisor) and the plant.
△ Less
Submitted 29 August, 2020;
originally announced August 2020.
-
HyperTune: Dynamic Hyperparameter Tuning For Efficient Distribution of DNN Training Over Heterogeneous Systems
Authors:
Ali HeydariGorji,
Siavash Rezaei,
Mahdi Torabzadehkashi,
Hossein Bobarshad,
Vladimir Alves,
Pai H. Chou
Abstract:
Distributed training is a novel approach to accelerate Deep Neural Networks (DNN) training, but common training libraries fall short of addressing the distributed cases with heterogeneous processors or the cases where the processing nodes get interrupted by other workloads. This paper describes distributed training of DNN on computational storage devices (CSD), which are NAND flash-based, high cap…
▽ More
Distributed training is a novel approach to accelerate Deep Neural Networks (DNN) training, but common training libraries fall short of addressing the distributed cases with heterogeneous processors or the cases where the processing nodes get interrupted by other workloads. This paper describes distributed training of DNN on computational storage devices (CSD), which are NAND flash-based, high capacity data storage with internal processing engines. A CSD-based distributed architecture incorporates the advantages of federated learning in terms of performance scalability, resiliency, and data privacy by eliminating the unnecessary data movement between the storage device and the host processor. The paper also describes Stannis, a DNN training framework that improves on the shortcomings of existing distributed training frameworks by dynamically tuning the training hyperparameters in heterogeneous systems to maintain the maximum overall processing speed in term of processed images per second and energy efficiency. Experimental results on image classification training benchmarks show up to 3.1x improvement in performance and 2.45x reduction in energy consumption when using Stannis plus CSD compare to the generic systems.
△ Less
Submitted 15 July, 2020;
originally announced July 2020.
-
Adaptive feature recombination and recalibration for semantic segmentation with Fully Convolutional Networks
Authors:
Sergio Pereira,
Adriano Pinto,
Joana Amorim,
Alexandrine Ribeiro,
Victor Alves,
Carlos A. Silva
Abstract:
Fully Convolutional Networks have been achieving remarkable results in image semantic segmentation, while being efficient. Such efficiency results from the capability of segmenting several voxels in a single forward pass. So, there is a direct spatial correspondence between a unit in a feature map and the voxel in the same location. In a convolutional layer, the kernel spans over all channels and…
▽ More
Fully Convolutional Networks have been achieving remarkable results in image semantic segmentation, while being efficient. Such efficiency results from the capability of segmenting several voxels in a single forward pass. So, there is a direct spatial correspondence between a unit in a feature map and the voxel in the same location. In a convolutional layer, the kernel spans over all channels and extracts information from them. We observe that linear recombination of feature maps by increasing the number of channels followed by compression may enhance their discriminative power. Moreover, not all feature maps have the same relevance for the classes being predicted. In order to learn the inter-channel relationships and recalibrate the channels to suppress the less relevant ones, Squeeze and Excitation blocks were proposed in the context of image classification with Convolutional Neural Networks. However, this is not well adapted for segmentation with Fully Convolutional Networks since they segment several objects simultaneously, hence a feature map may contain relevant information only in some locations. In this paper, we propose recombination of features and a spatially adaptive recalibration block that is adapted for semantic segmentation with Fully Convolutional Networks - the SegSE block. Feature maps are recalibrated by considering the cross-channel information together with spatial relevance. Experimental results indicate that Recombination and Recalibration improve the results of a competitive baseline, and generalize across three different problems: brain tumor segmentation, stroke penumbra estimation, and ischemic stroke lesion outcome prediction. The obtained results are competitive or outperform the state of the art in the three applications.
△ Less
Submitted 19 June, 2020;
originally announced June 2020.
-
Design-controlled Synthesis of IrO$_2$ sub-monolayers on Au Nanodendrites: Marrying Plasmonic and Electrocatalytic Properties
Authors:
Isabel C. de Freitas,
Luanna S. Parreira,
Eduardo C. M. Barbosa,
Barbara A. Novaes,
Tong Mou,
Tiago. V. Alves,
Jhon Quiroz,
Yi-Chi Wang,
Thomas J Slater,
Andrew Thomas,
Bin Wang,
Sarah J Haigh,
Pedro H. C. Camargoa
Abstract:
We develop herein plasmonic-catalytic Au-IrO$_2$ nanostructures with a morphology optimized for efficient light harvesting and catalytic surface area; the nanoparticles have a dendritic morphology, with closely spaced Au branches all partially covered by an ultrathin (1 nm) IrO$_2$ shell. This nanoparticle architecture optimizes optical features due to the interactions of closely spaced plasmonic…
▽ More
We develop herein plasmonic-catalytic Au-IrO$_2$ nanostructures with a morphology optimized for efficient light harvesting and catalytic surface area; the nanoparticles have a dendritic morphology, with closely spaced Au branches all partially covered by an ultrathin (1 nm) IrO$_2$ shell. This nanoparticle architecture optimizes optical features due to the interactions of closely spaced plasmonic branches forming electromagnetic hot spots, and the ultra-thin IrO$_2$ layer maximizes efficient use of this expensive catalyst. This concept was evaluated towards the enhancement of the electrocatalytic performances towards the oxygen evolution reaction (OER) as a model transformation. The OER can play a central role in meeting future energy demands but the performance of conventional electrocatalysts in this reaction is limited by the sluggish OER kinetics. We demonstrate an improvement of the OER performance for one of the most active OER catalysts, IrO$_2$, by harvesting plasmonic effects from visible light illumination in multimetallic nanoparticles. We find that the OER activity for the Au-IrO$_2$ nanodendrites can be improved under LSPR excitation, matching best properties reported in the literature. Our simulations and electrocatalytic data demonstrate that the enhancement in OER activities can be attributed to an electronic interaction between Au and IrO$_2$ and to the activation of Ir-O bonds by LSPR excited hot holes, leading to a change in the reaction mechanism (rate-determinant step) under visible light illumination.
△ Less
Submitted 20 April, 2020;
originally announced April 2020.
-
Renormalization of the band gap in 2D materials through the competition between electromagnetic and four-fermion interactions
Authors:
Luis Fernández,
Van Sérgio Alves,
Leandro O. Nascimento,
Francisco Peña,
M. Gomes,
E. C. Marino
Abstract:
Recently the renormalization of the band gap $m$, in both WSe$_2$ and MoS$_2$, has been experimentally measured as a function of the carrier concentration $n$. The main result establishes a decreasing of hundreds of meV, in comparison with the bare band gap, as the carrier concentration increases. These materials are known as transition metal dichalcogenides and their low-energy excitations are, a…
▽ More
Recently the renormalization of the band gap $m$, in both WSe$_2$ and MoS$_2$, has been experimentally measured as a function of the carrier concentration $n$. The main result establishes a decreasing of hundreds of meV, in comparison with the bare band gap, as the carrier concentration increases. These materials are known as transition metal dichalcogenides and their low-energy excitations are, approximately, described by the massive Dirac equation. Using Pseudo Quantum Electrodynamics (PQED) to describe the electromagnetic interaction between these quasiparticles and from renormalization group analysis, we obtain that the renormalized mass describes the band gap renormalization with a function given by $m(n)/m_0=(n/n_0)^{C_λ/2}$, where $m_0=m(n_0)$ and $C_λ$ is a function of the coupling constant $λ$. We compare our theoretical results with the experimental findings for WSe$_2$ and MoS$_2$, and we conclude that our approach is in agreement with these experimental results for reasonable values of $λ$. In addition we introduced a Gross-Neveu (GN) interaction which could simulate an disorder/impurity-like microscopic interaction. In this case, we show that there exists a critical coupling constant, namely, $λ_c \approx 0,66$ in which the beta function of the mass vanishes, providing a stable fixed point in the ultraviolet limit. For $λ>λ_c$, the renormalized mass decreases while for $λ<λ_c$ it increases with the carrier concentration.
△ Less
Submitted 2 March, 2020; v1 submitted 23 February, 2020;
originally announced February 2020.
-
STANNIS: Low-Power Acceleration of Deep Neural Network Training Using Computational Storage
Authors:
Ali HeydariGorji,
Mahdi Torabzadehkashi,
Siavash Rezaei,
Hossein Bobarshad,
Vladimir Alves,
Pai H. Chou
Abstract:
This paper proposes a framework for distributed, in-storage training of neural networks on clusters of computational storage devices. Such devices not only contain hardware accelerators but also eliminate data movement between the host and storage, resulting in both improved performance and power savings. More importantly, this in-storage processing style of training ensures that private data neve…
▽ More
This paper proposes a framework for distributed, in-storage training of neural networks on clusters of computational storage devices. Such devices not only contain hardware accelerators but also eliminate data movement between the host and storage, resulting in both improved performance and power savings. More importantly, this in-storage processing style of training ensures that private data never leaves the storage while fully controlling the sharing of public data. Experimental results show up to 2.7x speedup and 69% reduction in energy consumption and no significant loss in accuracy.
△ Less
Submitted 19 February, 2020; v1 submitted 17 February, 2020;
originally announced February 2020.
-
Pseudo Quantum Electrodynamics and Chern-Simons theory Coupled to Two-dimensional Electrons
Authors:
Gabriel C. Magalhães,
Van S. Alves,
Eduardo C. Marino,
Leandro O. Nascimento
Abstract:
We study a nonlocal theory that combines both the Pseudo quantum electrodynamics (PQED) and Chern-Simons actions among two-dimensional electrons. In the static limit, we conclude that the competition of these two interactions yields a Coulomb potential with a screened electric charge given by $e^2/(1+θ^2)$, where $θ$ is the dimensionless Chern-Simons parameter. This could be useful for describing…
▽ More
We study a nonlocal theory that combines both the Pseudo quantum electrodynamics (PQED) and Chern-Simons actions among two-dimensional electrons. In the static limit, we conclude that the competition of these two interactions yields a Coulomb potential with a screened electric charge given by $e^2/(1+θ^2)$, where $θ$ is the dimensionless Chern-Simons parameter. This could be useful for describing the substrate interaction with two-dimensional materials and the doping dependence of the dielectric constant in graphene. In the dynamical limit, we calculate the effective current-current action of the model considering Dirac electrons. We show that this resembles the electromagnetic and statistical interactions, but with two different overall constants, given by $e^2/(1+θ^2)$ and $e^2θ/(1+θ^2)$. Therefore, the $θ$-parameter does not provide a topological mass for the Gauge field in PQED, which is a relevant difference in comparison with quantum electrodynamics. Thereafter, we apply the one-loop perturbation theory in our model. Within this approach, we calculate the electron self-energy, the electron renormalized mass, the corrected gauge-field propagator, and the renormalized Fermi velocity for both high- and low-speed limits, using the renormalization group. In particular, we obtain a maximum value of the renormalized mass for $θ\approx 0.36$. This behavior is an important signature of the model and relations with doping control of band gap size are also discussed in the conclusions.
△ Less
Submitted 13 May, 2020; v1 submitted 15 January, 2020;
originally announced January 2020.