Search | arXiv e-print repository

Utilizing Large Language Models to Identify Reddit Users Considering Vaping Cessation for Digital Interventions

Authors: Sai Krishna Revanth Vuruma, Dezhi Wu, Saborny Sen Gupta, Lucas Aust, Valerie Lookingbill, Caleb Henry, Yang Ren, Erin Kasson, Li-Shiun Chen, Patricia Cavazos-Rehg, Dian Hu, Ming Huang

Abstract: The widespread adoption of social media platforms globally not only enhances users' connectivity and communication but also emerges as a vital channel for the dissemination of health-related information, thereby establishing social media data as an invaluable organic data resource for public health research. The surge in popularity of vaping or e-cigarette use in the United States and other countries has caused an outbreak of e-cigarette and vaping use-associated lung injury (EVALI), leading to hospitalizations and fatalities in 2019, highlighting the urgency to comprehend vaping behaviors and develop effective strategies for cessation. In this study, we extracted a sample dataset from one vaping sub-community on Reddit to analyze users' quit vaping intentions. Leveraging large language models including both the latest GPT-4 and traditional BERT-based language models for sentence-level quit-vaping intention prediction tasks, this study compares the outcomes of these models against human annotations. Notably, when compared to human evaluators, the GPT-4 model demonstrates superior consistency in adhering to annotation guidelines and processes, showcasing advanced capabilities to detect nuanced user quit-vaping intentions that human evaluators might overlook. These preliminary findings emphasize the potential of GPT-4 in enhancing the accuracy and reliability of social media data analysis, especially in identifying subtle user intentions that may elude human detection.

Submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.03092 [pdf, other]

Unsupervised, Bottom-up Category Discovery for Symbol Grounding with a Curious Robot

Authors: Catherine Henry, Casey Kennington

Abstract: Towards addressing the Symbol Grounding Problem and motivated by early childhood language development, we leverage a robot which has been equipped with an approximate model of curiosity with particular focus on bottom-up building of unsupervised categories grounded in the physical world. That is, rather than starting with a top-down symbol (e.g., a word referring to an object) and providing meaning through the application of predetermined samples, the robot autonomously and gradually breaks up its exploration space into a series of increasingly specific unlabeled categories at which point an external expert may optionally provide a symbol association. We extend prior work by using a robot that can observe the visual world, introducing a higher dimensional sensory space, and using a more generalizable method of category building. Our experiments show that the robot learns categories based on actions and what it visually observes, and that those categories can be symbolically grounded.

Submitted 3 April, 2024; originally announced April 2024.

Comments: 10 pages

arXiv:2401.16600 [pdf, other]

Depth Anything in Medical Images: A Comparative Study

Authors: John J. Han, Ayberk Acar, Callahan Henry, Jie Ying Wu

Abstract: Monocular depth estimation (MDE) is a critical component of many medical tracking and mapping algorithms, particularly from endoscopic or laparoscopic video. However, because ground truth depth maps cannot be acquired from real patient data, supervised learning is not a viable approach to predict depth maps for medical scenes. Although self-supervised learning for MDE has recently gained attention, the outputs are difficult to evaluate reliably and each MDE's generalizability to other patients and anatomies is limited. This work evaluates the zero-shot performance of the newly released Depth Anything Model on medical endoscopic and laparoscopic scenes. We compare the accuracy and inference speeds of Depth Anything with other MDE models trained on general scenes as well as in-domain models trained on endoscopic data. Our findings show that although the zero-shot capability of Depth Anything is quite impressive, it is not necessarily better than other models in both speed and performance. We hope that this study can spark further research in employing foundation models for MDE in medical scenes.

Submitted 29 January, 2024; originally announced January 2024.

Comments: 10 pages, 2 figures, 3 tables

arXiv:2311.11437 [pdf]

Decoding the Molecular Universe -- Workshop Report

Authors: Thomas O. Metz, Joshua N. Adkins, Peter B. Armentrout, Patrick Chain, Fanny Chu, Courtney D Corley, John R. Cort, Elizabeth Denis, Daniel Drell, Katherine R. Duncan, Robert G. Ewing, Facundo M. Fernandez, Oliver Fiehn, Neha Garg, Stefan Grimme, Christopher Henry, Robert L. Hettich, Tobias Kind, Roger G. Linington, Gary W. Miller, Trent Northen, Kirsten Overdahl, Ari Patrinos, Daniel Raftery, Paul Rigor, et al. (8 additional authors not shown)

Abstract: On August 9-10, 2023, a workshop was convened at the Pacific Northwest National Laboratory (PNNL) in Richland, WA that brought together a group of internationally recognized experts in metabolomics, natural products discovery, chemical ecology, chemical and biological threat assessment, cheminformatics, computational chemistry, cloud computing, artificial intelligence, and novel technology development. These experts were invited to assess the value and feasibility of a grand-scale project to create new technologies that would allow the identification and quantification of all small molecules, or to decode the molecular universe. The Decoding the Molecular Universe project would extend and complement the success of the Human Genome Project by developing new capabilities and technologies to measure small molecules (defined as non-protein, non-polymer molecules less than 1500 Daltons) of any origin and generated in biological systems or produced abiotically.
Workshop attendees 1) explored what new understanding of biological and environmental systems could be revealed through the lens of small molecules; 2) characterized the similarities in current needs and technical challenges between each science or mission area for unambiguous and comprehensive determination of the composition and quantities of small molecules of any sample; 3) determined the extent to which technologies or methods currently exist for unambiguously and comprehensively determining the small molecule composition of any sample and in a reasonable time; and 4) identified the attributes of the ideal technology or approach for universal small molecule measurement and identification. The workshop concluded with a discussion of how a project of this scale could be undertaken, possible thrusts for the project, early proof-of-principle applications, and similar efforts upon which the project could be modeled.

Submitted 19 November, 2023; originally announced November 2023.

arXiv:2311.01921 [pdf, other]

The dynamics of discrete particles in turbulent flows: open issues and current challenges in statistical modeling

Authors: Jean-Pierre Minier, Christophe Henry

Abstract: This article is an invitation. It is, first, an invitation to consider as a subject worthy of attention the wide range of situations where small discrete elements, either bubbles, droplets or solid particles, are embedded in turbulent flows. Occurring often at a human scale and in our daily environments, these turbulent dispersed two-phase flows display complex behavior due to the interplay of two fundamental interactions, the fluid-particle and particle-particle interactions, compounded by the turbulence of the carrier flow. This is not a domain where the basic laws are unknown but where the huge number of degrees of freedom involved calls for reduced, or coarse-grained, statistical descriptions to be developed. Since we are considering transport and collision phenomena or relaxation processes, it would seem that they can be handled by kinetic theory. In the general case of non-fully resolved turbulent flows, we are however dealing with particles influenced by random media with non-zero time and space correlations. The second invitation is therefore to recognize the limitations of kinetic-based descriptions and to address the challenges driving us to extend the classical framework, for fluid-particle as well as particle-particle interactions. Taking the standpoint provided by the modern formulation of stochastic processes and focusing on the description of the particle phase, this review proposes a step-by-step pedagogical presentation of current models while pointing out new directions and remaining uncharted territories.
This is done to provide answers to the question `why?' as much as `how?' and to try to kindle interest in these open and fascinating issues.

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2310.15644 [pdf, other]

Linear-in-Complexity Computational Strategies for Modeling and Dosimetry at TeraHertz

Authors: Viviana Giunzioni, Giuseppe Ciacco, Clément Henry, Adrien Merlini, Francesco P. Andriulli

Abstract: This work presents a fast direct solver strategy allowing full-wave modeling and dosimetry at terahertz (THz) frequencies. The novel scheme leverages a preconditioned combined field integral equation together with a regularizer for its elliptic spectrum to enable its compression into a non-hierarchical skeleton, invertible in quasi-linear complexity. Numerical results will show the effectiveness of the new scheme in a realistic skin modeling scenario.

Submitted 24 October, 2023; originally announced October 2023.

arXiv:2310.08470 [pdf, other]

Strategies and impact of learning curve estimation for CNN-based image classification

Authors: Laura Didyk, Brayden Yarish, Michael A. Beck, Christopher P. Bidinosti, Christopher J. Henry

Abstract: Learning curves are a measure for how the performance of machine learning models improves given a certain volume of training data. Over a wide variety of applications and models it was observed that learning curves follow -- to a large extent -- a power law behavior. This makes the performance of different models for a given task somewhat predictable and opens the opportunity to reduce the training time for practitioners, who are exploring the space of possible models and hyperparameters for the problem at hand. By estimating the learning curve of a model from training on small subsets of data, only the best models need to be considered for training on the full dataset. How to choose subset sizes and how often to sample models on these to obtain estimates has, however, not been researched. Given that the goal is to reduce overall training time, strategies are needed that sample the performance in a time-efficient way and yet lead to accurate learning curve estimates. In this paper we formulate the framework for these strategies and propose several strategies. Further, we evaluate the strategies for simulated learning curves and in experiments with popular datasets and models for image classification tasks.

Submitted 12 October, 2023; originally announced October 2023.
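
Illustrative note on the entry above: the abstract describes estimating a power-law learning curve from a few small training subsets and extrapolating to the full dataset. The sketch below is not the authors' method; it assumes a pure power law error(n) = a * n^(-b) with no offset term, fits it by least squares in log-log space, and uses synthetic numbers. The names fit_power_law and predict_error are hypothetical.

```python
import numpy as np


def fit_power_law(sizes, errors):
    """Fit error ~ a * n**(-b) by linear least squares in log-log space.

    Assumption: a pure power law with no irreducible-error offset.
    """
    log_n = np.log(np.asarray(sizes, dtype=float))
    log_e = np.log(np.asarray(errors, dtype=float))
    # log(error) = log(a) - b * log(n), so the slope is -b.
    slope, intercept = np.polyfit(log_n, log_e, 1)
    return float(np.exp(intercept)), float(-slope)


def predict_error(a, b, n):
    """Extrapolate the fitted curve to a training-set size n."""
    return a * n ** (-b)


# Synthetic measurements generated from a known curve (not real results).
subset_sizes = [100, 200, 400, 800]
observed_errors = [0.5 * n ** -0.3 for n in subset_sizes]

a, b = fit_power_law(subset_sizes, observed_errors)
full_dataset_error = predict_error(a, b, 50_000)
```

Models could then be ranked by their extrapolated error before any of them is trained on the full dataset; how many subsets to use and how large to make them is the question the paper itself studies.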

arXiv:2308.05074 [pdf, other]

Drones4Good: Supporting Disaster Relief Through Remote Sensing and AI

Authors: Nina Merkle, Reza Bahmanyar, Corentin Henry, Seyed Majid Azimi, Xiangtian Yuan, Simon Schopferer, Veronika Gstaiger, Stefan Auer, Anne Schneibel, Marc Wieland, Thomas Kraft

Abstract: In order to respond effectively in the aftermath of a disaster, emergency services and relief organizations rely on timely and accurate information about the affected areas. Remote sensing has the potential to significantly reduce the time and effort required to collect such information by enabling a rapid survey of large areas. To achieve this, the main challenge is the automatic extraction of relevant information from remotely sensed data. In this work, we show how the combination of drone-based data with deep learning methods enables automated and large-scale situation assessment. In addition, we demonstrate the integration of onboard image processing techniques for the deployment of autonomous drone-based aid delivery. The results show the feasibility of a rapid and large-scale image analysis in the field, and that onboard image processing can increase the safety of drone-based aid deliveries.

Submitted 9 August, 2023; originally announced August 2023.

arXiv:2307.12238 [pdf, other]

doi 10.1007/s12036-023-09973-5

Dust Scattered Radiation in the Galactic Poles

Authors: Jayant Murthy, Richard C. Henry, James Overduin

Abstract: We have modeled the diffuse background at the Galactic Poles in the far-ultraviolet (FUV: 1536 Å) and the near-ultraviolet (NUV: 2316 Å). The background is well-fit using a single-scattering dust model with an offset representing the extragalactic light plus any other contribution to the diffuse background. We have found a dust albedo of 0.35 -- 0.40 (FUV) and 0.11 -- 0.19 (NUV) in the NGP ($b > 70^{\circ}$) and 0.46 -- 0.56 (FUV) and 0.31 -- 0.33 (NUV) in the SGP ($b < -70^{\circ}$). The differences in the albedo may reflect changes in the dust-to-gas ratio over the sky or in the dust distribution. We find offsets at zero-reddening of 273 -- 286 and 553 -- 581 photons cm$^{-2}$ s$^{-1}$ sr$^{-1}$ Å$^{-1}$ in the FUV and NUV, respectively, in the NGP with similar values in the SGP.

Submitted 3 September, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

Comments: 5 pages, 7 figures, Accepted in Journal of Astronomy and Astrophysics

Journal ref: J. Astrophys. Astr. 2023, 44: 82

arXiv:2306.09418 [pdf, other]

doi 10.1016/j.atech.2023.100316

A comprehensive review of 3D convolutional neural network-based classification techniques of diseased and defective crops using non-UAV-based hyperspectral images

Authors: Nooshin Noshiri, Michael A. Beck, Christopher P. Bidinosti, Christopher J. Henry

Abstract: Hyperspectral imaging (HSI) is a non-destructive and contactless technology that provides valuable information about the structure and composition of an object. It can capture detailed information about the chemical and physical properties of agricultural crops. Due to its wide spectral range, compared with multispectral- or RGB-based imaging methods, HSI can be a more effective tool for monitoring crop health and productivity. With the advent of this imaging tool in agrotechnology, researchers can more accurately address issues related to the detection of diseased and defective crops in the agriculture industry. This makes it possible to implement the most suitable and accurate farming solutions, such as irrigation and fertilization, before crops enter a damaged and difficult-to-recover phase of growth in the field. While HSI provides valuable insights into the object under investigation, the limited number of HSI datasets for crop evaluation presently poses a bottleneck. Dealing with the curse of dimensionality presents another challenge due to the abundance of spectral and spatial information in each hyperspectral cube. State-of-the-art methods based on 1D- and 2D-CNNs struggle to efficiently extract spectral and spatial information. On the other hand, 3D-CNN-based models have shown significant promise in achieving better classification and detection results by leveraging spectral and spatial features simultaneously. Despite the apparent benefits of 3D-CNN-based models, their usage for classification purposes in this area of research has remained limited.
This paper seeks to address this gap by reviewing 3D-CNN-based architectures and the typical deep learning pipeline, including preprocessing and visualization of results, for the classification of hyperspectral images of diseased and defective crops. Furthermore, we discuss open research areas and challenges when utilizing 3D-CNNs with HSI data.

Submitted 15 June, 2023; originally announced June 2023.

Journal ref: Smart Agricultural Technology 5 (2023) 100316

arXiv:2304.09838 [pdf, other]

Enhanced transport of flexible fibers by pole vaulting in turbulent wall-bounded flow

Authors: Jeremie Bec, Christophe Brouzet, Christophe Henry

Abstract: Long, flexible fibers transported by a turbulent channel flow sample non-linear variations of the fluid velocity along their length. As the fibers tumble and collide with the boundaries, they bounce off with an impulse that propels them toward the center of the flow, similar to pole vaulting. As a result, the fibers migrate away from the walls, leading to depleted regions near the boundaries and more concentrated regions in the bulk. These higher concentrations in the center of the channel result in a greater net flux of fibers than what was initially imposed by the fluid. This effect becomes more pronounced as fiber length increases, especially when it approaches the channel height.

Submitted 19 April, 2023; originally announced April 2023.

Comments: 5 pages, 5 figures

arXiv:2304.09079 [pdf, other]

A time-step-robust algorithm to compute particle trajectories in 3-D unstructured meshes for Lagrangian stochastic methods

Authors: Guilhem Balvet, Jean-Pierre Minier, Christophe Henry, Yelva Roustan, Martin Ferrand

Abstract: The purpose of this paper is to propose a time-step-robust cell-to-cell integration of particle trajectories in 3-D unstructured meshes in particle/mesh Lagrangian stochastic methods. The main idea is to dynamically update the mean fields used in the time integration by splitting, for each particle, the time step into sub-steps such that each of these sub-steps corresponds to particle cell residence times. This reduces the spatial discretization error. Given the stochastic nature of the models, a key aspect is to derive estimations of the residence times that do not anticipate the future of the Wiener process. To that effect, the new algorithm relies on a virtual particle, attached to each stochastic one, whose mean conditional behavior provides free-of-statistical-bias predictions of residence times. After consistency checks, this new algorithm is validated on two representative test cases: particle dispersion in a statistically uniform flow and particle dynamics in a non-uniform flow.

Submitted 17 April, 2023; originally announced April 2023.
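
Illustrative note on the entry above: the core idea of splitting a time step into sub-steps that match cell residence times can be shown in a heavily simplified setting. The sketch below is not the paper's algorithm: it is deterministic (no Wiener process, no virtual particle), one-dimensional, and assumes a strictly positive, piecewise-constant mean velocity per cell. The function name advance_particle and all numbers are hypothetical.

```python
def advance_particle(x, dt, edges, cell_velocity):
    """Advance a particle over dt, switching to each cell's velocity
    as soon as the particle enters that cell.

    Assumptions: 1-D mesh with sorted cell edges, one positive velocity
    per cell, deterministic motion.
    """
    n_cells = len(cell_velocity)
    # Locate the cell that currently contains the particle.
    cell = None
    for i in range(n_cells):
        if edges[i] <= x < edges[i + 1]:
            cell = i
            break
    if cell is None:
        raise ValueError("particle is outside the mesh")

    remaining = dt
    while remaining > 0.0:
        v = cell_velocity[cell]
        if v <= 0.0:
            raise ValueError("this sketch only handles positive velocities")
        # Time the particle needs to reach the downstream face of its cell.
        residence = (edges[cell + 1] - x) / v
        if residence >= remaining:
            # The particle stays in this cell for the rest of the step.
            x += v * remaining
            break
        # Sub-step: move exactly to the face, then use the next cell's field.
        x = edges[cell + 1]
        remaining -= residence
        cell += 1
        if cell == n_cells:
            break  # the particle has reached the end of the mesh
    return x


edges = [0.0, 1.0, 2.0, 3.0]
cell_velocity = [1.0, 2.0, 4.0]
x_substepped = advance_particle(0.5, 0.75, edges, cell_velocity)
x_single_step = 0.5 + cell_velocity[0] * 0.75  # naive: starting cell's field only
```

In this example the sub-stepped position differs from the naive single-step position because the particle spends part of the step in a faster cell; the naive update is the kind of spatial discretization error the abstract refers to.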

arXiv:2304.05999 [pdf, other]

doi 10.1093/mnras/stad1125

The Loneliest Galaxies in the Universe: A GAMA and GalaxyZoo Study on Void Galaxy Morphology

Authors: Lori E. Porter, Benne W. Holwerda, Sandor Kruk, Maritza Lara-López, Kevin Pimbblet, Christopher Henry, Sarah Casura, Lee Kelvin

Abstract: The large-scale structure (LSS) of the Universe is comprised of galaxy filaments, tendrils, and voids. The majority of the Universe's volume is taken up by these voids, which exist as underdense, but not empty, regions. The galaxies found inside these voids are expected to be some of the most isolated objects in the Universe. This study, using the Galaxy and Mass Assembly (GAMA) and Galaxy Zoo surveys, aims to investigate basic physical properties and morphology of void galaxies versus field (filament and tendril) galaxies. We use void galaxies with stellar masses of $9.35 < \log(M/M_\odot) < 11.25$, and this sample is split by identifying two redshift-limited regions, $0 < z < 0.075$ and $0.075 < z < 0.15$. To find comparable objects in the sample of field galaxies from GAMA and Galaxy Zoo, we identify "twins" of void galaxies as field galaxies within $\pm$0.05 dex and $\pm$0.15 dex of M and specific star formation rate. We determine the statistical significance of our results using the Kolmogorov-Smirnov (KS) test. We see that void galaxies, in contrast with field galaxies, seem to be disk-dominated and have predominantly round bulges (with > 50 percent of the Galaxy Zoo citizen scientists agreeing that bulges are present).

Submitted 12 April, 2023; originally announced April 2023.

Comments: 13 pages, 16 figures, 3 tables, accepted by MNRAS
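
Illustrative note on the entry above: the abstract uses the Kolmogorov-Smirnov (KS) test to compare void and field galaxy samples. The sketch below computes only the two-sample KS statistic, the largest gap between the two empirical cumulative distribution functions; it does not compute a p-value, and it is not the authors' pipeline. The function name ks_statistic and the sample values are hypothetical.

```python
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Two-sample KS statistic: max |F_a(x) - F_b(x)| over all x.

    Both empirical CDFs are step functions, so it is enough to
    evaluate them at the pooled data points.
    """
    a = sorted(sample_a)
    b = sorted(sample_b)
    if not a or not b:
        raise ValueError("both samples must be non-empty")
    d = 0.0
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)  # fraction of a that is <= x
        cdf_b = bisect_right(b, x) / len(b)  # fraction of b that is <= x
        d = max(d, abs(cdf_a - cdf_b))
    return d


# Made-up values standing in for some property of two galaxy samples.
void_sample = [1.0, 2.0, 3.0, 4.0]
field_sample = [3.0, 4.0, 5.0, 6.0]
d_stat = ks_statistic(void_sample, field_sample)
```

A statistic near 0 means the two distributions look alike and a value near 1 means they barely overlap; turning the statistic into a significance level additionally depends on the two sample sizes.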

arXiv:2303.05634 [pdf, other]

Fusarium head blight detection, spikelet estimation, and severity assessment in wheat using 3D convolutional neural networks

Authors: Oumaima Hamila, Christopher J. Henry, Oscar I. Molina, Christopher P. Bidinosti, Maria Antonia Henriquez

Abstract: Fusarium head blight (FHB) is one of the most significant diseases affecting wheat and other small grain cereals worldwide. The development of resistant varieties requires the laborious task of field and greenhouse phenotyping. The applications considered in this work are the automated detection of FHB disease symptoms expressed on a wheat plant, the automated estimation of the total number of spikelets and the total number of infected spikelets on a wheat head, and the automated assessment of the FHB severity in infected wheat. The data used to generate the results are 3-dimensional (3D) multispectral point clouds (PC), which are 3D collections of points - each associated with a red, green, blue (RGB), and near-infrared (NIR) measurement. Over 300 wheat plant images were collected using a multispectral 3D scanner, and the labelled UW-MRDC 3D wheat dataset was created. The data was used to develop novel and efficient 3D convolutional neural network (CNN) models for FHB detection, which achieved 100% accuracy. The influence of the multispectral information on performance was evaluated, and our results showed the dominance of the RGB channels over both the NIR and the NIR plus RGB channels combined. Furthermore, novel and efficient 3D CNNs were created to estimate the total number of spikelets and the total number of infected spikelets on a wheat head, and our best models achieved mean absolute errors (MAE) of 1.13 and 1.56, respectively. Moreover, 3D CNN models for FHB severity estimation were created, and our best model achieved 8.6 MAE.
A linear regression analysis between the visual FHB severity assessment and the FHB severity predicted by our 3D CNN was performed, and the results showed a significant correlation between the two variables with a 0.0001 P-value and 0.94 R-squared.

Submitted 9 March, 2023; originally announced March 2023.

arXiv:2212.12056 [pdf, other]

Semantically-consistent Landsat 8 image to Sentinel-2 image translation for alpine areas

Authors: M. Sokolov, J. L. Storie, C. J. Henry, C. D. Storie, J. Cameron, R. S. Ødegård, V. Zubinaite, S. Stikbakke

Abstract: The availability of frequent and cost-free satellite images is in growing demand in the research world. Such satellite constellations as Landsat 8 and Sentinel-2 provide a massive amount of valuable data daily. However, the discrepancy in the sensors' characteristics of these satellites makes it pointless to apply a segmentation model trained on one dataset to the other, which is why domain adaptation techniques have recently become an active research area in remote sensing. In this paper, an experiment of domain adaptation through style-transferring is conducted using the HRSemI2I model to narrow the sensor discrepancy between Landsat 8 and Sentinel-2. This paper's main contribution is analyzing the expediency of that approach by comparing the results of segmentation using domain-adapted images with those without adaptation. The HRSemI2I model, adjusted to work with 6-band imagery, shows significant intersection-over-union performance improvement for both mean and per class metrics. A second contribution is providing different schemes of generalization between two label schemes - NALCMS 2015 and CORINE. The first scheme is standardization through higher-level land cover classes, and the second is through harmonization validation in the field.

Submitted 22 December, 2022; originally announced December 2022.

Comments: 13 pages, 6 figures
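
Illustrative note on the entry above: the abstract reports intersection-over-union (IoU) improvements for both mean and per-class metrics. The sketch below shows how those two quantities are commonly computed from integer label maps; it is not taken from the paper, and the function names and the tiny label arrays are hypothetical.

```python
import numpy as np


def per_class_iou(pred, target, num_classes):
    """IoU for each class: |pred AND target| / |pred OR target|.

    Classes absent from both maps get NaN so they can be skipped
    when averaging.
    """
    pred = np.asarray(pred).ravel()
    target = np.asarray(target).ravel()
    ious = []
    for c in range(num_classes):
        p = pred == c
        t = target == c
        union = np.logical_or(p, t).sum()
        if union == 0:
            ious.append(float("nan"))
        else:
            ious.append(float(np.logical_and(p, t).sum() / union))
    return ious


def mean_iou(ious):
    """Average over the classes that actually occur."""
    return float(np.nanmean(ious))


# Made-up six-pixel label maps with three land-cover classes.
predicted = [0, 0, 1, 1, 2, 2]
reference = [0, 1, 1, 1, 2, 0]
ious = per_class_iou(predicted, reference, num_classes=3)
miou = mean_iou(ious)
```

Comparing these numbers for a segmentation model run on adapted versus non-adapted images is the kind of evaluation the abstract describes.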

arXiv:2212.10151 [pdf, other]

doi 10.1016/j.physrep.2022.12.005

Particle resuspension: challenges and perspectives for future models

Authors: Christophe Henry, Jean-Pierre Minier, Sara Brambilla

Abstract: The purpose of this review is to analyze the physics at play in particle resuspension in order to bring insights into the rich complexity of this common but challenging concern. Following the more-is-different vision, this is performed by starting from a range of practical observations and experimental data. We then work our way through the investigation of the key mechanisms which play a role in the overall process. In turn, these mechanisms reveal an array of fundamental interactions, such as particle-fluid, particle-particle and particle-surface, whose combined effects create the tapestry of current applications. At the core of this analysis are descriptions of these physical phenomena and the different ways through which they are intertwined to build up various models used to provide quantitative assessment of particle resuspension. The physics of particle resuspension requires holding together processes occurring at extremely different space and time scales, and models are key in providing a single vehicle to lead us through such multiscale journeys. This raises questions on what makes up a model and one objective of the present work is to clarify the essence of a modeling approach. In spite of its ubiquitous nature, particle resuspension is still at the early stages of development. Many extensions need to be worked out and revisiting the art of modeling is not a moot point.
The need to consider more complex objects than small and spherical particles and, moreover, to come up with unified descriptions of mono- and multilayer resuspension put the emphasis on solid model foundations if we are to go beyond current limits. This is very much modeling in the making and new ideas are proposed to stimulate interest in this everyday but challenging issue in physics.

Submitted 20 December, 2022; originally announced December 2022.

Comments: 123 pages, 65 figures, submitted to Physics Reports

arXiv:2211.10211 [pdf, other]

doi 10.1016/j.compfluid.2023.105870

Lagrangian stochastic model for the orientation of inertialess spheroidal particles in turbulent flows: an efficient numerical method for CFD approach

Authors: Lorenzo Campana, Mireille Bossy, Christophe Henry

Abstract: In this work, we propose a model for the orientation of inertialess spheroidal particles suspended in turbulent flows. This model consists in a stochastic version of the Jeffery equation that can be included in a statistical Lagrangian description of particles suspended in a flow. It is compatible and coherent with turbulence models that are widely used in CFD codes for the simulation of the flow field in practical large-scale applications. In this context, we propose and analyze a numerical scheme based on a splitting scheme algorithm that decouples the orientation dynamics into its main contributions: stretching and rotation. We detail its implementation in an open-source CFD software. We analyze the weak and strong convergence of both the global scheme and of each sub-part. Subsequently, the splitting technique yields to a highly efficient hybrid algorithm coupling pure probabilistic and deterministic numerical schemes. Various numerical experiments were implemented and the results were compared with analytical predictions of the model to assess the algorithm efficiency and accuracy.

Submitted 7 March, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

arXiv:2211.08355 [pdf, other]

Galaxy And Mass Assembly: Galaxy Morphology in the Green Valley, Prominent rings and looser Spiral Arms

Authors: Dominic Smith, Lutz Haberzettl, L. E. Porter, Ren Porter-Temple, Christopher P. A. Henry, Benne Holwerda, A. R. Lopez-Sanchez, Steven Phillipps, Alister W. Graham, Sarah Brough, Kevin A. Pimbblet, Jochen Liske, Lee S. Kelvin, Clayton D. Robertson, Wade Roemer, Michael Walmsley, David O'Ryan, Tobias Geron

Abstract: Galaxies broadly fall into two categories: star-forming (blue) galaxies and quiescent (red) galaxies. In between, one finds the less populated "green valley". Some of these galaxies are suspected to be in the process of ceasing their star-formation through a gradual exhaustion of gas supply or already dead and are experiencing a rejuvenation of star-formation through fuel injection. We use the Galaxy And Mass Assembly database and the Galaxy Zoo citizen science morphological estimates to compare the morphology of galaxies in the green valley against those in the red sequence and blue cloud. Our goal is to examine the structural differences within galaxies that fall in the green valley, and what brings them there. Previous results found disc features such as rings and lenses are more prominently represented in the green valley population. We revisit this with a similar sized data set of galaxies with morphology labels provided by the Galaxy Zoo for the GAMA fields based on new KiDS images. Our aim is to compare qualitatively the results from expert classification to that of citizen science. We observe that ring structures are indeed found more commonly in green valley galaxies compared to their red and blue counterparts. We suggest that ring structures are a consequence of disc galaxies in the green valley actively exhibiting characteristics of fading discs and evolving disc morphology of galaxies. We note that the progression from blue to red correlates with loosening spiral arm structure.

Submitted 15 November, 2022; originally announced November 2022.

Comments: 11 pages, 21 figures, accepted to MNRAS

arXiv:2211.07704 [pdf, other]

Laplacian Filtered Loop-Star Decompositions and Quasi-Helmholtz Laplacian Filters: Definitions, Analysis, and Efficient Algorithms

Authors: Adrien Merlini, Clément Henry, Davide Consoli, Lyes Rahmouni, Alexandre Dély, Francesco P. Andriulli

Abstract: Quasi-Helmholtz decompositions are fundamental tools in integral equation modeling of electromagnetic problems because of their ability of rescaling solenoidal and non-solenoidal components of solutions, operator matrices, and radiated fields. These tools are however incapable, per se, of modifying the refinement-dependent spectral behavior of the different operators and often need to be combined with other preconditioning strategies. This paper introduces the new concept of filtered quasi-Helmholtz decompositions proposing them in two incarnations: the filtered Loop-Star functions and the quasi-Helmholtz Laplacian filters. Because they are capable of manipulating large parts of the operators' spectra, new families of preconditioners and fast solvers can be derived from these new tools. A first application to the case of the frequency and h-refinement preconditioning of the electric field integral equation is presented together with numerical results showing the practical effectiveness of the newly proposed decompositions.

Submitted 14 November, 2022; originally announced November 2022.

arXiv:2211.03854 [pdf, other]

doi 10.1117/1.JRS.17.018505

Exploration of Convolutional Neural Network Architectures for Large Region Map Automation

Authors: R. M. Tsenov, C. J. Henry, J. L. Storie, C. D. Storie, B. Murray, M. Sokolov

Abstract: Deep learning semantic segmentation algorithms have provided improved frameworks for the automated production of Land-Use and Land-Cover (LULC) maps, which significantly increases the frequency of map generation as well as consistency of production quality. In this research, a total of 28 different model variations were examined to improve the accuracy of LULC maps. The experiments were carried out using Landsat 5/7 or Landsat 8 satellite images with the North American Land Change Monitoring System labels. The performance of various CNNs and extension combinations was assessed, where VGGNet with an output stride of 4, and modified U-Net architecture provided the best results. Additional expanded analysis of the generated LULC maps was also provided. Using a deep neural network, this work achieved 92.4% accuracy for 13 LULC classes within southern Manitoba representing a 15.8% improvement over published results for the NALCMS. Based on the large regions of interest, higher radiometric resolution of Landsat 8 data resulted in better overall accuracies (88.04%) compared to Landsat 5/7 (80.66%) for 16 LULC classes. This represents an 11.44% and 4.06% increase in overall accuracy compared to previously published NALCMS results, including larger land area and higher number of LULC classes incorporated into the models compared to other published LULC map automation methods.

Submitted 7 November, 2022; originally announced November 2022.

arXiv:2211.02972 [pdf, other]

doi 10.3389/frai.2023.1200977

Inside Out: Transforming Images of Lab-Grown Plants for Machine Learning Applications in Agriculture

Authors: A. E. Krosney, P. Sotoodeh, C. J. Henry, M. A. Beck, C. P. Bidinosti

Abstract: Machine learning tasks often require a significant amount of training data for the resultant network to perform suitably for a given problem in any domain. In agriculture, dataset sizes are further limited by phenotypical differences between two plants of the same genotype, often as a result of differing growing conditions. Synthetically-augmented datasets have shown promise in improving existing models when real data is not available. In this paper, we employ a contrastive unpaired translation (CUT) generative adversarial network (GAN) and simple image processing techniques to translate indoor plant images to appear as field images. While we train our network to translate an image containing only a single plant, we show that our method is easily extendable to produce multiple-plant field images. Furthermore, we use our synthetic multi-plant images to train several YoloV5 nano object detection models to perform the task of plant detection and measure the accuracy of the model on real field data images. Including training data generated by the CUT-GAN leads to better plant detection performance compared to a network trained solely on real data.

Submitted 5 November, 2022; originally announced November 2022.

Comments: 35 pages, 23 figures

arXiv:2210.08580 [pdf, other]

Fast Direct Solvers for Integral Equations at Low-Frequency Based on Operator Filtering

Authors: Clément Henry, Davide Consoli, Alexandre Dély, Lyes Rahmouni, Adrien Merlini, Francesco P. Andriulli

Abstract: This paper focuses on fast direct solvers for integral equations in the low-to-moderate-frequency regime obtained by leveraging preconditioned first kind or second kind operators regularized with Laplacian filters. The spectral errors arising from boundary element discretizations are properly handled by filtering that, in addition, allows for the use of low-rank representations for the compact perturbations of all operators involved. Numerical results show the effectiveness of the approaches and their effectiveness in the direct solution of integral equations.

Submitted 16 October, 2022; originally announced October 2022.

arXiv:2209.06264 [pdf, other]

doi 10.1109/JSTARS.2022.3226705

High-resolution semantically-consistent image-to-image translation

Authors: Mikhail Sokolov, Christopher Henry, Joni Storie, Christopher Storie, Victor Alhassan, Mathieu Turgeon-Pelchat

Abstract: Deep learning has become one of remote sensing scientists' most efficient computer vision tools in recent years. However, the lack of training labels for the remote sensing datasets means that scientists need to solve the domain adaptation problem to narrow the discrepancy between satellite image datasets. As a result, image segmentation models that are then trained, could better generalize and use an existing set of labels instead of requiring new ones. This work proposes an unsupervised domain adaptation model that preserves semantic consistency and per-pixel quality for the images during the style-transferring phase. This paper's major contribution is proposing the improved architecture of the SemI2I model, which significantly boosts the proposed model's performance and makes it competitive with the state-of-the-art CyCADA model. A second contribution is testing the CyCADA model on the remote sensing multi-band datasets such as WorldView-2 and SPOT-6. The proposed model preserves semantic consistency and per-pixel quality for the images during the style-transferring phase. Thus, the semantic segmentation model, trained on the adapted images, shows substantial performance gain compared to the SemI2I model and reaches similar results as the state-of-the-art CyCADA model. The future development of the proposed method could include ecological domain transfer, a priori evaluation of dataset quality in terms of data distribution, or exploration of the inner architecture of the domain adaptation model.

Submitted 13 September, 2022; originally announced September 2022.

Comments: 25 pages, 7 figures

arXiv:2208.05036 [pdf, other]

doi 10.1093/mnras/stac1936

Galaxy And Mass Assembly: Galaxy Zoo spiral arms and star formation rates

Authors: R. Porter-Temple, B. W. Holwerda, A. M. Hopkins, L. E. Porter, C. Henry, T. Geron, B. Simmons, K. Masters, S. Kruk

Abstract: Understanding the effect spiral structure has on star formation properties of galaxies is important to completing our picture of spiral structure evolution. Previous studies have investigated connections between spiral arm properties with star formation, but the effect that the number of spiral arms has on this process is unclear. Here we use the Galaxy and Mass Assembly (GAMA) survey paired with the citizen science visual classifications from the Galaxy Zoo project to explore galaxies' spiral arm number and how it connects to the star formation process. We use the votes from the GAMA-KiDS GalaxyZoo classification to investigate the link between spiral arm number with stellar mass, star formation rate, and specific star formation rate. We find that galaxies with fewer spiral arms have lower stellar masses and higher sSFRs, while those with more spiral arms tend toward higher stellar masses and lower sSFRs, and conclude that galaxies are less efficient at forming stars if they have more spiral arms. We note how previous studies' findings may indicate a cause for this connection in spiral arm strength or opacity.

Submitted 9 August, 2022; originally announced August 2022.

Comments: 8 figures, 1 table, 8 pages, accepted for publication by MNRAS

arXiv:2205.10955 [pdf, other]

Investigating classification learning curves for automatically generated and labelled plant images

Authors: Michael A. Beck, Christopher P. Bidinosti, Christopher J. Henry, Manisha Ajmani

Abstract: In the context of supervised machine learning a learning curve describes how a model's performance on unseen data relates to the amount of samples used to train the model. In this paper we present a dataset of plant images with representatives of crops and weeds common to the Manitoba prairies at different growth stages. We determine the learning curve for a classification task on this data with the ResNet architecture. Our results are in accordance with previous studies and add to the evidence that learning curves are governed by power-law relationships over large scales, applications, and models. We further investigate how label noise and the reduction of trainable parameters impacts the learning curve on this dataset. Both effects lead to the model requiring disproportionally larger training sets to achieve the same classification performance as observed without these effects.

Submitted 30 June, 2022; v1 submitted 22 May, 2022; originally announced May 2022.

arXiv:2204.11491 [pdf, other]

doi 10.1109/AP-S/USNC-URSI47032.2022.9886398

On a Fast Solution Strategy for a Surface-Wire Integral Formulation of the Anisotropic Forward Problem in Electroencephalography

Authors: Carlo Baronio, Giulio Cosentino, Paolo Ricci, Clément Henry, Maxime Y. Monin, Adrien Merlini, Francesco P. Andriulli

Abstract: This work focuses on a quasi-linear-in-complexity strategy for a hybrid surface-wire integral equation solver for the electroencephalography forward problem. The scheme exploits a block diagonally dominant structure of the wire self block -- that models the neuronal fibers self interactions -- and of the surface self block -- modeling interface potentials. This structure leads to two Neumann iteration schemes further accelerated with adaptive integral methods. The resulting algorithm is linear up to logarithmic factors. Numerical results confirm the performance of the method in biomedically relevant scenarios.

Submitted 18 May, 2023; v1 submitted 25 April, 2022; originally announced April 2022.

Journal ref: 2022 IEEE International Symposium on Antennas and Propagation and USNC-URSI Radio Science Meeting (AP-S/URSI), Denver, CO, USA, 2022, pp. 1-2

arXiv:2203.15611 [pdf, other]

doi 10.1093/mnras/stac889

Galaxy And Mass Assembly (GAMA): Self-Organizing Map Application on Nearby Galaxies

Authors: B. W. Holwerda, Dominic Smith, Lori Porter, Chris Henry, Ren Porter-Temple, Kyle Cook, Kevin A. Pimbblet, Andrew M. Hopkins, Maciej Bilicki, Sebastian Turner, Viviana Acquaviva, Lingyu Wang, Angus H. Wright, Lee S. Kelvin, Meiert W. Grootes

Abstract: Galaxy populations show bimodality in a variety of properties: stellar mass, colour, specific star-formation rate, size, and Sérsic index. These parameters are our feature space. We use an existing sample of 7556 galaxies from the Galaxy and Mass Assembly (GAMA) survey, represented using five features and the K-means clustering technique, showed that the bimodalities are the manifestation of a more complex population structure, represented by between 2 and 6 clusters. Here we use Self Organizing Maps (SOM), an unsupervised learning technique which can be used to visualize similarity in a higher dimensional space using a 2D representation, to map these five-dimensional clusters in the feature space onto two-dimensional projections. To further analyze these clusters, using the SOM information, we agree with previous results that the sub-populations found in the feature space can be reasonably mapped onto three or five clusters. We explore where the "green valley" galaxies are mapped onto the SOM, indicating multiple interstitial populations within the green valley population. Finally, we use the projection of the SOM to verify whether morphological information provided by GalaxyZoo users, for example, if features are visible, can be mapped onto the SOM-generated map. Voting on whether galaxies are smooth, likely ellipticals, or "featured" can reasonably be separated but smaller morphological features (bar, spiral arms) can not.
SOMs promise to be a useful tool to map and identify instructive sub-populations in multidimensional galaxy survey feature space, provided they are large enough.

Submitted 29 March, 2022; originally announced March 2022.

Comments: 14 pages, 14 figures, accepted by MNRAS

arXiv:2203.13691 [pdf, other]

The TerraByte Client: providing access to terabytes of plant data

Authors: Michael A. Beck, Christopher P. Bidinosti, Christopher J. Henry, Manisha Ajmani

Abstract: In this paper we demonstrate the TerraByte Client, a software to download user-defined plant datasets from a data portal hosted at Compute Canada. To that end the client offers two key functionalities: (1) It allows the user to get an overview on what data is available and a quick way to visually check samples of that data. For this the client receives the results of queries to a database and displays the number of images that fulfill the search criteria. Furthermore, a sample can be downloaded within seconds to confirm that the data suits the user's needs. (2) The user can then download the specified data to their own drive. This data is prepared into chunks server-side and sent to the user's end-system, where it is automatically extracted into individual files. The first chunks of data are available for inspection after a brief waiting period of a minute or less depending on available bandwidth and type of data. The TerraByte Client has a full graphical user interface for easy usage and uses end-to-end encryption. The user interface is built on top of a low-level client. This architecture in combination of offering the client program open-source makes it possible for the user to develop their own user interface or use the client's functionality directly. An example for direct usage could be to download specific data on demand within a larger application, such as training machine learning models.

Submitted 25 March, 2022; originally announced March 2022.

arXiv:2203.13283 [pdf, other]

On the Fast Direct Solution of a Preconditioned Electromagnetic Integral Equation

Authors: Davide Consoli, Clément Henry, Alexandre Dély, Lyes Rahmouni, John Erik Ortiz Guzman, Tiffany L. Chhim, Simon B. Adrian, Adrien Merlini, Francesco P. Andriulli

Abstract: This work presents a fast direct solver strategy for electromagnetic integral equations in the high-frequency regime. The new scheme relies on a suitably preconditioned combined field formulation and results in a single skeleton form plus identity equation. This is obtained after a regularization of the elliptic spectrum through the extraction of a suitably chosen equivalent circulant problem. The inverse of the system matrix is then obtained by leveraging the Woodbury matrix identity, the low-rank representation of the extracted part of the operator, and fast circulant algebra yielding a scheme with a favorable complexity and suitable for the solution of multiple right-hand sides. Theoretical considerations are accompanied by numerical results both of which are confirming and showing the practical relevance of the newly developed scheme.

Submitted 4 April, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

arXiv:2203.08603 [pdf, other]

Laplacian Filters for Integral Equations: Further Developments and Fast Algorithms

Authors: Adrien Merlini, Clément Henry, Davide Consoli, Lyes Rahmouni, Francesco P. Andriulli

Abstract: This paper extends the concept of Laplacian filtered quasi-Helmholtz decompositions we have recently introduced, to the basis-free projector-based setting. This extension allows the discrete analyses of electromagnetic integral operators spectra without passing via an explicit Loop-Star decomposition as previously done. We also present a fast scheme for the evaluation of the filters in quasi linear complexity in the total number of unknowns. Together with the fact that only a logarithmic number of these filters are required for solving the h-refinement breakdown of electric field integral equation, this results in an effective preconditioner that rivals Calderón strategies in performance without relying on barycentric refinements. Numerical results confirm the theoretically predicted behavior and the effectiveness of the approach.

Submitted 1 March, 2022; originally announced March 2022.

arXiv:2203.02611 [pdf, other]

Plant Species Recognition with Optimized 3D Polynomial Neural Networks and Variably Overlapping Time-Coherent Sliding Window

Authors: Habib Ben Abdallah, Christopher J. Henry, Sheela Ramanna

Abstract: Recently, the EAGL-I system was developed to rapidly create massive labeled datasets of plants intended to be commonly used by farmers and researchers to create AI-driven solutions in agriculture. As a result, a publicly available plant species recognition dataset composed of 40,000 images with different sizes consisting of 8 plant species was created with the system in order to demonstrate its capabilities. This paper proposes a novel method, called Variably Overlapping Time-Coherent Sliding Window (VOTCSW), that transforms a dataset composed of images with variable size to a 3D representation with fixed size that is suitable for convolutional neural networks, and demonstrates that this representation is more informative than resizing the images of the dataset to a given size. We theoretically formalized the use cases of the method as well as its inherent properties and we proved that it has an oversampling and a regularization effect on the data. By combining the VOTCSW method with the 3D extension of a recently proposed machine learning model called 1-Dimensional Polynomial Neural Networks, we were able to create a model that achieved a state-of-the-art accuracy of 99.9% on the dataset created by the EAGL-I system, surpassing well-known architectures such as ResNet and Inception. In addition, we created a heuristic algorithm that enables the degree reduction of any pre-trained N-Dimensional Polynomial Neural Network and which compresses it without altering its performance, thus making the model faster and lighter.
Furthermore, we established that the currently available dataset could not be used for machine learning in its present form, due to a substantial class imbalance between the training set and the test set. Hence, we created a specific preprocessing and a model development framework that enabled us to improve the accuracy from 49.23% to 99.9%.

Submitted 29 August, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

arXiv:2112.05208 [pdf, other]

doi 10.3390/atoms10010034

Modeling atom interferometry experiments with Bose-Einstein condensates in power-law potentials

Authors: S. Thomas, C. Sapp, C. Henry, A. Smith, C. A. Sackett, C. W. Clark, M. Edwards

Abstract: Recent atom interferometry (AI) experiments involving Bose-Einstein condensates (BECs) have been conducted under extreme conditions of volume and interrogation time. Numerical solution of the standard mean-field theory applied to these experiments presents a nearly intractable challenge. We present an approximate variational model that provides rapid approximate solutions of the rotating-frame Gross-Pitaevskii equation for a power-law potential. This model is well-suited to the design and analysis of AI experiments involving BECs that are split and later recombined to form an interference pattern. We derive the equations of motion of the variational parameters for this model and illustrate how the model can be applied to the sequence of steps in a recent AI experiment where BECs were used to implement a dual-Sagnac atom interferometer rotation sensor. We use this model to investigate the impact of finite-size and interaction effects on the single-Sagnac-interferometer phase shift.

Submitted 9 December, 2021; originally announced December 2021.

Comments: 22 pages, 3 figures

arXiv:2110.13993 [pdf, other]

doi 10.1088/1538-3873/ac32b1

Planetary Nebulae: Sources of Enlightenment

Authors: Karen B. Kwitter, R. B. C. Henry

Abstract: In this review/tutorial we explore planetary nebulae as a stage in the evolution of low-to-intermediate-mass stars, as major contributors to the mass and chemical enrichment of the interstellar medium, and as astrophysical laboratories. We discuss many observed properties of planetary nebulae, placing particular emphasis on element abundance determinations and comparisons with theoretical predictions. Dust and molecules associated with planetary nebulae are considered as well. We then examine distances, binarity, and planetary nebula morphology and evolution. We end with mention of some of the advances that will be enabled by future observing capabilities.

Submitted 26 October, 2021; originally announced October 2021.

Comments: Invited review. Accepted for publication in PASP; 67 pages, 10 tables, 30 figures

Journal ref: PASP, 134, Number 022001, 2022

arXiv:2110.05227 [pdf]

doi 10.1021/acs.jpcc.1c02060

Activity of Pd$_{\rm n}$ (n = 1-5) Clusters on Alumina Film on Ni$_3$Al(111) for CO Oxidation: A Molecular Beam Study

Authors: Georges Sitja, Claude Henry

Abstract: Single atom catalyst (SAC) is a vivid new area of research in catalysis. However, the activity in CO oxidation of isolated Pt or Pd atoms, generally supported on an oxide powder, is still controversial. Furthermore, the steady state activity of few atoms clusters is still not yet quantitatively known. In this work we study, by molecular beam reactive scattering (MBRS), the activity of Pd$_{\rm n}$ ($n$ = 1-5) clusters, grown on a nanostructured alumina film on a Ni$_3$Al(111) surface. It is shown that the single atoms are not active at 473 K but they diffuse and coalesce at 533 K forming larger clusters. The activity of a cluster is proportional to the number of atoms it contains (n=2-5). At 533 K, the activity per (surface-) atom, which is the turnover frequency (TOF), is constant. Its value is close to those obtained for large clusters of 181$\pm$13 atoms and Pd (111) extended surfaces, in the same experimental conditions.

Submitted 11 October, 2021; originally announced October 2021.

Journal ref: Journal of Physical Chemistry C, American Chemical Society, 2021, 125 (24), pp.13247-13253

arXiv:2108.10690 [pdf, other]

On a Low-Frequency and Contrast Stabilized Full-Wave Volume Integral Equation Solver for Lossy Media

Authors: Clément Henry, Adrien Merlini, Lyes Rahmouni, Francesco P. Andriulli

Abstract: In this paper we present a new regularized electric flux volume integral equation (D-VIE) for modeling high-contrast conductive dielectric objects in a broad frequency range. This new formulation is particularly suitable for modeling biological tissues at low frequencies, as it is required by brain epileptogenic area imaging, but also at higher ones, as it is required by several applications including, but not limited to, transcranial magnetic and deep brain stimulation (TMS and DBS, respectively). When modeling inhomogeneous objects with high complex permittivities at low frequencies, the traditional D-VIE is ill-conditioned and suffers from numerical instabilities that result in slower convergence and in less accurate solutions. In this work we address these shortcomings by leveraging a new set of volume quasi-Helmholtz projectors. Their scaling by the material permittivity matrix allows for the re-balancing of the equation when applied to inhomogeneous scatterers and thereby makes the proposed method accurate and stable even for high complex permittivity objects until arbitrarily low frequencies. Numerical results, canonical and realistic, corroborate the theory and confirm the stability and the accuracy of this new method both in the quasi-static regime and at higher frequencies.

Submitted 15 August, 2021; originally announced August 2021.

arXiv:2108.05789 [pdf, other]

Presenting an extensive lab- and field-image dataset of crops and weeds for computer vision tasks in agriculture

Authors: Michael A. Beck, Chen-Yi Liu, Christopher P. Bidinosti, Christopher J. Henry, Cara M. Godee, Manisha Ajmani

Abstract: We present two large datasets of labelled plant-images that are suited towards the training of machine learning and computer vision models. The first dataset encompasses, as of the day of writing, over 1.2 million images of indoor-grown crops and weeds common to the Canadian Prairies and many US states. The second dataset consists of over 540,000 images of plants imaged in farmland. All indoor plant images are labelled by species and we provide rich metadata on the level of individual images. This comprehensive database allows to filter the datasets under user-defined specifications such as for example the crop-type or the age of the plant. Furthermore, the indoor dataset contains images of plants taken from a wide variety of angles, including profile shots, top-down shots, and angled perspectives. The images taken from plants in fields are all from a top-down perspective and contain usually multiple plants per image. For these images metadata is also available. In this paper we describe both datasets' characteristics with respect to plant variety, plant age, and number of images. We further introduce an open-access sample of the indoor-dataset that contains 1,000 images of each species covered in our dataset. These, in total 14,000 images, had been selected, such that they form a representative sample with respect to plant age and individual plants per species. This sample serves as a quick entry point for new users to the dataset, allowing them to explore the data on a small scale and find the parameters of data most useful for their application without having to deal with hundreds of thousands of individual images.

Submitted 12 August, 2021; originally announced August 2021.

arXiv:2105.07258 [pdf, other]

doi 10.1016/j.rinam.2021.100185

Polynomial degree reduction in the $\mathcal{L}^2$-norm on a symmetric interval for the canonical basis

Authors: Habib Ben Abdallah, Christopher J. Henry, Sheela Ramanna

Abstract: In this paper, we develop a direct formula for determining the coefficients in the canonical basis of the best polynomial of degree $M$ that approximates a polynomial of degree $N>M$ on a symmetric interval for the $\mathcal{L}^2$-norm. We also formally prove that using the formula is more computationally efficient than using a classical matrix multiplication approach and we provide an example to illustrate that it is more numerically stable than the classical approach.

Submitted 15 May, 2021; originally announced May 2021.

arXiv:2103.14734 [pdf, other]

doi 10.1007/s11042-021-11579-4

Fully Automated 2D and 3D Convolutional Neural Networks Pipeline for Video Segmentation and Myocardial Infarction Detection in Echocardiography

Authors: Oumaima Hamila, Sheela Ramanna, Christopher J. Henry, Serkan Kiranyaz, Ridha Hamila, Rashid Mazhar, Tahir Hamid

Abstract: Cardiac imaging known as echocardiography is a non-invasive tool utilized to produce data including images and videos, which cardiologists use to diagnose cardiac abnormalities in general and myocardial infarction (MI) in particular. Echocardiography machines can deliver abundant amounts of data that need to be quickly analyzed by cardiologists to help them make a diagnosis and treat cardiac conditions. However, the acquired data quality varies depending on the acquisition conditions and the patient's responsiveness to the setup instructions. These constraints are challenging to doctors especially when patients are facing MI and their lives are at stake. In this paper, we propose an innovative real-time end-to-end fully automated model based on convolutional neural networks (CNN) to detect MI depending on regional wall motion abnormalities (RWMA) of the left ventricle (LV) from videos produced by echocardiography. Our model is implemented as a pipeline consisting of a 2D CNN that performs data preprocessing by segmenting the LV chamber from the apical four-chamber (A4C) view, followed by a 3D CNN that performs a binary classification to detect if the segmented echocardiography shows signs of MI. We trained both CNNs on a dataset composed of 165 echocardiography videos each acquired from a distinct patient. The 2D CNN achieved an accuracy of 97.18% on data segmentation while the 3D CNN achieved 90.9% of accuracy, 100% of precision and 95% of recall on MI detection. Our results demonstrate that creating a fully automated system for MI detection is feasible and propitious.

Submitted 3 August, 2022; v1 submitted 26 March, 2021; originally announced March 2021.

Comments: Multimed Tools Appl (2022)

arXiv:2011.00987 [pdf]

doi 10.1021/acs.jpcc.8b07350

Molecular Beam Study of the CO Adsorption on a Regular Array of PdAu Clusters on Alumina

Authors: Georges Sitja, Claude R Henry

Abstract: The adsorption kinetics of CO on PdAu bimetallic clusters, containing 140 $\pm$ 12 atoms and a composition varying between 0% and 55% of Pd atoms, is investigated by a pulsed molecular beam method (MBRS). The clusters are grown on a nanostructured ultrathin film of alumina on Ni3Al (111) playing the role of a template which gives a hexagonal array of bimetallic clusters having a sharp size distribution and a uniform composition. The surface concentration calculated, assuming segregation of gold to the surface, varies between 0 and 90% of Au atoms on the surface. From the adsorption-desorption kinetics of CO, the lifetime of CO is measured at various temperatures. At low coverage, plotting the CO lifetime in an Arrhenius diagram one obtains the adsorption energy of CO. When the surface concentration of Au increases, the adsorption energy of CO on the PdAu clusters decreases. This evolution of the adsorption energy is discussed, from previous studies, in terms of ligand and ensemble effects. We find that the ensemble effect plays a dominant role in the observed decrease of the adsorption energy of CO.

Submitted 2 November, 2020; originally announced November 2020.

Journal ref: Journal of Physical Chemistry C, American Chemical Society, 2019, 123 (13), pp.7961-7967

arXiv:2010.14147 [pdf]

doi 10.1063/1.5125572

Particle size effect on the Langmuir-Hinshelwood barrier for CO oxidation on regular arrays of Pd clusters supported on ultrathin alumina films

Authors: Georges Sitja, Héloïse Tissot, Claude Henry

Abstract: The Langmuir-Hinshelwood barrier (ELH) and the pre-exponential factor ($ν$LH) for CO oxidation have been measured at high temperature on hexagonal arrays of Pd clusters supported on an ultrathin alumina film on Ni3Al(111). The Pd clusters have a sharp size distribution and the mean sizes are: 174$\pm$13, 360$\pm$19 and 768$\pm$28 atoms. ELH and $ν$LH are determined from the initial reaction rate of a CO molecular beam with a saturation layer of adsorbed oxygen on the Pd clusters, measured at different temperatures (493$\le$T (K) $\le$613). The largest particles (3.5 nm) give values of ELH and $ν$LH similar to those measured on Pd (111) [2]. However, smaller particles (2.7 and 2.1 nm) show very different behavior. The origin of this size effect is discussed in terms of variation of the electronic structure and of the atomic structure of the Pd clusters.

Submitted 27 October, 2020; originally announced October 2020.

Journal ref: Journal of Chemical Physics, American Institute of Physics, 2019, 151 (17), pp.174703

arXiv:2010.12357 [pdf]

doi 10.1021/acs.jpcc.9b05109

Regular Arrays of Pt Clusters on Alumina: A New Superstructure on Al$_2$O$_3$/Ni$_3$Al (111)

Authors: Georges Sitja, Aude Bailly, Maurizio de Santis, Vasile Heresanu, Claude Henry

Abstract: Alumina ultrathin films obtained by high temperature oxidation of a Ni$_3$Al (111) surface are a good template to grow regular arrays of metal clusters. Up to now two hexagonal organizations called 'dot' and 'network' structures have been observed with distances between clusters of 4.1 and 2.4 nm, respectively. In the present article we report on an investigation by in situ Grazing Incidence Small Angle X-ray Scattering (GISAXS), showing that Pt deposited at room temperature (RT) and for a low coverage forms a new hexagonal structure with a distance between clusters of 1.38 nm. For the first time, an assembly of tiny Pt clusters (1-6 atoms) with a very high density (5.85x10$^{13}$ cm$^{-2}$) and presenting a good organization on an alumina surface, is obtained. This system could be used to investigate by surface science techniques the new emerging field of Single Atom Catalysis (SAC). By deposition at 573 K small Pt clusters are organized on the network structure. By deposition of Pt at 573 K on pre-formed Pd seeds, large Pt (Pd) clusters containing a hundred of atoms are organized on the dot structure and they remain organized up to 733 K. We show that the three structures are interrelated. The different organizations of the Pt clusters on the alumina surface are explained by the presence of 3 types of sites corresponding to different adsorption energy for Pt atoms.

Submitted 23 October, 2020; originally announced October 2020.

Journal ref: Journal of Physical Chemistry C, American Chemical Society, 2019, 123 (40), pp.24487-24494

arXiv:2009.04077 [pdf, other]

doi 10.1016/j.knosys.2022.108174

1-Dimensional polynomial neural networks for audio signal related problems

Authors: Habib Ben Abdallah, Christopher J. Henry, Sheela Ramanna

Abstract: In addition to being extremely non-linear, modern problems require millions if not billions of parameters to solve or at least to get a good approximation of the solution, and neural networks are known to assimilate that complexity by deepening and widening their topology in order to increase the level of non-linearity needed for a better approximation. However, compact topologies are always preferred to deeper ones as they offer the advantage of using less computational units and less parameters. This compacity comes at the price of reduced non-linearity and thus, of limited solution search space. We propose the 1-Dimensional Polynomial Neural Network (1DPNN) model that uses automatic polynomial kernel estimation for 1-Dimensional Convolutional Neural Networks (1DCNNs) and that introduces a high degree of non-linearity from the first layer which can compensate the need for deep and/or wide topologies. We show that this non-linearity enables the model to yield better results with less computational and spatial complexity than a regular 1DCNN on various classification and regression problems related to audio signals, even though it introduces more computational and spatial complexity on a neuronal level. The experiments were conducted on three publicly available datasets and demonstrate that, on the problems that were tackled, the proposed model can extract more relevant information from the data than a 1DCNN in less time and with less memory.

Submitted 12 January, 2022; v1 submitted 8 September, 2020; originally announced September 2020.

arXiv:2007.06124 [pdf, other]

EAGLE: Large-scale Vehicle Detection Dataset in Real-World Scenarios using Aerial Imagery

Authors: Seyed Majid Azimi, Reza Bahmanyar, Corentin Henry, Franz Kurz

Abstract: Multi-class vehicle detection from airborne imagery with orientation estimation is an important task in the near and remote vision domains with applications in traffic monitoring and disaster management. In the last decade, we have witnessed significant progress in object detection in ground imagery, but it is still in its infancy in airborne imagery, mostly due to the scarcity of diverse and large-scale datasets. Despite being a useful tool for different applications, current airborne datasets only partially reflect the challenges of real-world scenarios. To address this issue, we introduce EAGLE (oriEnted vehicle detection using Aerial imaGery in real-worLd scEnarios), a large-scale dataset for multi-class vehicle detection with object orientation information in aerial imagery. It features high-resolution aerial images composed of different real-world situations with a wide variety of camera sensor, resolution, flight altitude, weather, illumination, haze, shadow, time, city, country, occlusion, and camera angle. The annotation was done by airborne imagery experts with small- and large-vehicle classes. EAGLE contains 215,986 instances annotated with oriented bounding boxes defined by four points and orientation, making it by far the largest dataset to date in this task. It also supports researches on the haze and shadow removal as well as super-resolution and in-painting applications. We define three tasks: detection by (1) horizontal bounding boxes, (2) rotated bounding boxes, and (3) oriented bounding boxes. We carried out several experiments to evaluate several state-of-the-art methods in object detection on our dataset to form a baseline. Experiments show that the EAGLE dataset accurately reflects real-world situations and correspondingly challenging applications.

Submitted 23 November, 2020; v1 submitted 12 July, 2020; originally announced July 2020.

Comments: Accepted in ICPR 2020

arXiv:2007.06102 [pdf, other]

SkyScapes -- Fine-Grained Semantic Understanding of Aerial Scenes

Authors: Seyed Majid Azimi, Corentin Henry, Lars Sommer, Arne Schumann, Eleonora Vig

Abstract: Understanding the complex urban infrastructure with centimeter-level accuracy is essential for many applications from autonomous driving to mapping, infrastructure monitoring, and urban management. Aerial images provide valuable information over a large area instantaneously; nevertheless, no current dataset captures the complexity of aerial scenes at the level of granularity required by real-world applications. To address this, we introduce SkyScapes, an aerial image dataset with highly-accurate, fine-grained annotations for pixel-level semantic labeling. SkyScapes provides annotations for 31 semantic categories ranging from large structures, such as buildings, roads and vegetation, to fine details, such as 12 (sub-)categories of lane markings. We have defined two main tasks on this dataset: dense semantic segmentation and multi-class lane-marking prediction. We carry out extensive experiments to evaluate state-of-the-art segmentation methods on SkyScapes. Existing methods struggle to deal with the wide range of classes, object sizes, scales, and fine details present. We therefore propose a novel multi-task model, which incorporates semantic edge detection and is better tuned for feature extraction from a wide range of scales. This model achieves notable improvements over the baselines in region outlines and level of detail on both tasks.

Submitted 12 July, 2020; originally announced July 2020.

Comments: Accepted in IEEE ICCV19

arXiv:2006.01228 [pdf, other]

doi 10.1371/journal.pone.0243923

An embedded system for the automated generation of labeled plant images to enable machine learning applications in agriculture

Authors: Michael A. Beck, Chen-Yi Liu, Christopher P. Bidinosti, Christopher J. Henry, Cara M. Godee, Manisha Ajmani

Abstract: A lack of sufficient training data, both in terms of variety and quantity, is often the bottleneck in the development of machine learning (ML) applications in any domain. For agricultural applications, ML-based models designed to perform tasks such as autonomous plant classification will typically be coupled to just one or perhaps a few plant species. As a consequence, each crop-specific task is very likely to require its own specialized training data, and the question of how to serve this need for data now often overshadows the more routine exercise of actually training such models. To tackle this problem, we have developed an embedded robotic system to automatically generate and label large datasets of plant images for ML applications in agriculture. The system can image plants from virtually any angle, thereby ensuring a wide variety of data; and with an imaging rate of up to one image per second, it can produce labeled datasets on the scale of thousands to tens of thousands of images per day. As such, this system offers an important alternative to time- and cost-intensive methods of manual generation and labeling. Furthermore, the use of a uniform background made of blue keying fabric enables additional image processing techniques such as background replacement and plant segmentation. It also helps in the training process, essentially forcing the model to focus on the plant features and eliminating random correlations. To demonstrate the capabilities of our system, we generated a dataset of over 34,000 labeled images, with which we trained an ML-model to distinguish grasses from non-grasses in test data from a variety of sources. We now plan to generate much larger datasets of Canadian crop plants and weeds that will be made publicly available in the hope of further enabling ML applications in the agriculture sector.

Submitted 1 April, 2021; v1 submitted 1 June, 2020; originally announced June 2020.

Comments: 35 pages, 8 figures, Preprint submitted to PLoS One

arXiv:2005.10671 [pdf]

Physics and the Pythagorean Theorem

Authors: James Overduin, Richard Conn Henry

Abstract: Pythagoras' theorem lies at the heart of physics as well as mathematics, yet its historical origins are obscure. We highlight a purely pictorial, gestalt-like proof that may have originated during the Zhou Dynasty. Generalizations of the Pythagorean theorem to three, four and more dimensions undergird fundamental laws including the energy-momentum relation of particle physics and the field equations of general relativity, and may hint at future unified theories. The "pre-mathematical" nature of this theorem lends support to the Eddingtonian view that "the stuff of the world is mind-stuff."

Submitted 9 June, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

Comments: To appear in the inaugural issue of Minkowski Institute Magazine

arXiv:2004.05347 [pdf, other]

The Diffuse Ultraviolet Background Close to the Galactic Plane

Authors: Jayant Murthy, R. C. Henry, James Overduin

Abstract: We have used Voyager and Galex observations to map the diffuse Galactic light near the Galactic equator. We find that most of the observations are relatively faint with surface brightnesses of less than 5,000 photon units. This is important because many ultraviolet telescopes have not observed at low Galactic latitudes because of the fear of a bright diffuse emission. Our data are consistent with emission from interstellar dust grains with albedo ($a$) of 0.2 -- 0.3 and phase function ($g$) $ < 0.7$ at 1100 Å; $0.2 < a < 0.5; g < 0.8$ at 1500 Å; and $0.4 < a < 0.6; g < 0.4$ at 2300 Å.

Submitted 11 April, 2020; originally announced April 2020.

Comments: Submitted to MNRAS

arXiv:1912.06716 [pdf, other]

Dynamics and fragmentation of small inextensible fibers in turbulence

Authors: Sofía Allende, Christophe Henry, Jérémie Bec

Abstract: The fragmentation of small, brittle, flexible, inextensible fibers is investigated in a fully-developed, homogeneous, isotropic turbulent flow. Such small fibers spend most of their time fully stretched and their dynamics follows that of stiff rods. They can then break through tensile failure, i.e. when the tension is higher than a given threshold. Fibers bend when experiencing a strong compression. During these rare and intermittent buckling events, they can break under flexural failure, i.e. when the curvature exceeds a threshold. Fine-scale massive simulations of both the fluid flow and the fiber dynamics are performed to provide statistics on these two fragmentation processes. This gives ingredients for the development of accurate macroscopic models, namely the fragmentation rate and daughter-size distributions, which can be used to predict the time evolution of the fiber size distribution. Evidence is provided for the generic nature of turbulent fragmentation and of the resulting population dynamics. It is indeed shown that the statistics of breakup is fully determined by the probability distribution of Lagrangian fluid velocity gradients. This approach singles out that the only relevant dimensionless parameter is a local flexibility which balances flow stretching to the fiber elastic forces.

Submitted 2 December, 2019; originally announced December 2019.

Comments: 19 pages, 9 figures

arXiv:1911.05486 [pdf, other]

doi 10.1145/3450703

ELRUNA: Elimination Rule-based Network Alignment

Authors: Zirou Qiu, Ruslan Shaydulin, Xiaoyuan Liu, Yuri Alexeev, Christopher S. Henry, Ilya Safro

Abstract: Networks model a variety of complex phenomena across different domains. In many applications, one of the most essential tasks is to align two or more networks to infer the similarities between cross-network vertices and discover potential node-level correspondence. In this paper, we propose ELRUNA (Elimination rule-based network alignment), a novel network alignment algorithm that relies exclusively on the underlying graph structure. Under the guidance of the elimination rules that we defined, ELRUNA computes the similarity between a pair of cross-network vertices iteratively by accumulating the similarities between their selected neighbors. The resulting cross-network similarity matrix is then used to infer a permutation matrix that encodes the final alignment of cross-network vertices. In addition to the novel alignment algorithm, we also improve the performance of local search, a commonly used post-processing step for solving the network alignment problem, by introducing a novel selection method RAWSEM (Randomwalk based selection method) based on the propagation of the levels of mismatching (defined in the paper) of vertices across the networks. The key idea is to pass on the initial levels of mismatching of vertices throughout the entire network in a random-walk fashion. Through extensive numerical experiments on real networks, we demonstrate that ELRUNA significantly outperforms the state-of-the-art alignment methods in terms of alignment accuracy under lower or comparable running time. Moreover, ELRUNA is robust to network perturbations such that it can maintain a close to optimal objective value under a high level of noise added to the original networks. Finally, the proposed RAWSEM can further improve the alignment quality with fewer iterations compared with the naive local search method.

Submitted 23 February, 2021; v1 submitted 29 October, 2019; originally announced November 2019.

Journal ref: ACM J. Exp. Algorithmics 26, 1, Article 1.7 (2021)

arXiv:1908.02260 [pdf, other]

doi 10.1093/mnras/stz2186

Components of the Diffuse Ultraviolet Radiation at High Latitudes

Authors: M. S. Akshaya, Jayant Murthy, S. Ravichandran, R. C. Henry, James Overduin

Abstract: We have used data from the Galaxy Evolution Explorer to study the different components of the diffuse ultraviolet background in the region between the Galactic latitudes 70-80 degrees. We find an offset at zero dust column density (E(B - V) = 0) of $240 \pm 18$ photon units in the FUV (1539A) and $394 \pm 37$ photon units in the NUV (2316A). This is approximately half of the total observed radiation with the remainder divided between an extragalactic component of $114 \pm 18$ photon units in the FUV and $194 \pm 37$ photon units in the NUV and starlight scattered by Galactic dust at high latitudes. The optical constants of the dust grains were found to be a=0.4$\pm$0.1 and g=0.8$\pm$0.1 (FUV) and a=0.4$\pm$0.1 and g=0.5$\pm$0.1 (NUV). We cannot differentiate between a Galactic or extragalactic origin for the zero-offset but can affirm that it is not from any known source.

Submitted 6 August, 2019; originally announced August 2019.

Comments: Accepted for publication in MNRAS 7 pages, 6 figures

Showing 1–50 of 132 results for author: Henry, C