Search | arXiv e-print repository

Active Learning with Fully Bayesian Neural Networks for Discontinuous and Nonstationary Data

Abstract: Active learning optimizes the exploration of large parameter spaces by strategically selecting which experiments or simulations to conduct, thus reducing resource consumption and potentially accelerating scientific discovery. A key component of this approach is a probabilistic surrogate model, typically a Gaussian Process (GP), which approximates an unknown functional relationship between control parameters and a target property. However, conventional GPs often struggle when applied to systems with discontinuities and non-stationarities, prompting the exploration of alternative models. This limitation becomes particularly relevant in physical science problems, which are often characterized by abrupt transitions between different system states and rapid changes in physical property behavior. Fully Bayesian Neural Networks (FBNNs) serve as a promising substitute, treating all neural network weights probabilistically and leveraging advanced Markov Chain Monte Carlo techniques for direct sampling from the posterior distribution. This approach enables FBNNs to provide reliable predictive distributions, crucial for making informed decisions under uncertainty in the active learning setting. Although FBNNs are traditionally considered too computationally expensive for 'big data' applications, many physical science problems involve small amounts of data in relatively low-dimensional parameter spaces. Here, we assess the suitability and performance of FBNNs with the No-U-Turn Sampler for active learning tasks in the 'small data' regime, highlighting their potential to enhance predictive accuracy and reliability on test functions relevant to problems in physical sciences.

Submitted 17 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

Comments: Fixed PGM in Figure 2 and update caption
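
As a rough illustration of the active learning loop summarized in the abstract above, the sketch below uses a plain Gaussian Process surrogate (the conventional baseline the abstract mentions, not the paper's FBNN with the No-U-Turn Sampler) and places each new measurement where the predictive variance is largest. The step-shaped test function, the RBF kernel, and all names and parameter values (`rbf_kernel`, `gp_posterior`, `length_scale`, `noise`) are assumptions chosen for illustration and are not taken from the paper.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=0.1, variance=1.0):
    """Squared-exponential kernel between two 1D arrays of inputs."""
    d2 = (a[:, None] - b[None, :]) ** 2
    return variance * np.exp(-0.5 * d2 / length_scale**2)


def gp_posterior(x_train, y_train, x_query, noise=1e-4):
    """Zero-mean GP posterior mean and variance at x_query."""
    k_tt = rbf_kernel(x_train, x_train) + noise * np.eye(len(x_train))
    k_tq = rbf_kernel(x_train, x_query)
    chol = np.linalg.cholesky(k_tt)
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, y_train))
    mean = k_tq.T @ alpha
    v = np.linalg.solve(chol, k_tq)
    prior_var = 1.0  # rbf_kernel(x, x) with the default variance
    var = np.clip(prior_var - np.sum(v**2, axis=0), 0.0, None)
    return mean, var


def step_function(x):
    """Hypothetical discontinuous target: jumps from 0 to 1 at x = 0.5."""
    return np.where(x < 0.5, 0.0, 1.0)


def active_learning(n_steps=10):
    """Start from the two endpoints, then add the most uncertain grid point each step."""
    grid = np.linspace(0.0, 1.0, 101)
    measured = [0, 100]  # indices of grid points measured so far
    for _ in range(n_steps):
        x_train = grid[measured]
        _, var = gp_posterior(x_train, step_function(x_train), grid)
        var[measured] = -1.0  # never re-select an already measured point
        measured.append(int(np.argmax(var)))
    return grid[measured]


points = active_learning()
print(np.sort(points))
```

Swapping in a different probabilistic surrogate would only require replacing `gp_posterior` with any function that returns a predictive mean and variance on the grid.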

arXiv:2405.08773 [pdf]

Evolution of ferroelectric properties in SmxBi1-xFeO3 via automated Piezoresponse Force Microscopy across combinatorial spread libraries

Authors: Aditya Raghavan, Rohit Pant, Ichiro Takeuchi, Eugene A. Eliseev, Marti Checa, Anna N. Morozovska, Maxim Ziatdinov, Sergei V. Kalinin, Yongtao Liu

Abstract: Combinatorial spread libraries offer a unique approach to explore the evolution of materials properties over broad concentration, temperature, and growth parameter spaces. However, the traditional limitation of this approach is the requirement for the read-out of functional properties across the library. Here we demonstrate the application of automated Piezoresponse Force Microscopy (PFM) for the exploration of the physics in the SmxBi1-xFeO3 system with the ferroelectric-antiferroelectric morphotropic phase boundary. This approach relies on the synergy of the quantitative nature of PFM and the implementation of automated experiments that allows PFM-based grid sampling over macroscopic samples. The concentration dependence of pertinent ferroelectric parameters has been determined and used to develop the mathematical framework based on Ginzburg-Landau theory describing the evolution of these properties across the concentration space. We pose that the combination of automated scanning probe microscopy and the combinatorial spread library approach will emerge as an efficient research paradigm to close the characterization gap in high-throughput materials discovery. We make the data sets open to the community and hope that this will stimulate other efforts to interpret and understand the physics of these systems.

Submitted 14 May, 2024; originally announced May 2024.

Comments: 19 pages; 5 figures

arXiv:2404.12899 [pdf]

Bayesian Co-navigation: Dynamic Designing of the Materials Digital Twins via Active Learning

Authors: Boris N. Slautin, Yongtao Liu, Hiroshi Funakubo, Rama K. Vasudevan, Maxim A. Ziatdinov, Sergei V. Kalinin

Abstract: Scientific advancement is universally based on the dynamic interplay between theoretical insights, modelling, and experimental discoveries. However, this feedback loop is often slow, including delayed community interactions and the gradual integration of experimental data into theoretical frameworks. This challenge is particularly exacerbated in domains dealing with high-dimensional object spaces, such as molecules and complex microstructures. Hence, the integration of theory within automated and autonomous experimental setups, or theory in the loop automated experiment, is emerging as a crucial objective for accelerating scientific research. The critical aspect is not only to use theory but also to update it on the fly during the experiment. Here, we introduce a method for integrating theory into the loop through Bayesian co-navigation of theoretical model space and experimentation. Our approach leverages the concurrent development of surrogate models for both simulation and experimental domains at the rates determined by latencies and costs of experiments and computation, alongside the adjustment of control parameters within theoretical models to minimize epistemic uncertainty over the experimental object spaces. This methodology facilitates the creation of digital twins of material structures, encompassing both the surrogate model of behavior that includes the correlative part and the theoretical model itself. While demonstrated here within the context of functional responses in ferroelectric materials, our approach holds promise for broader applications, including the exploration of optical properties in nanoclusters, microstructure-dependent properties in complex materials, and properties of molecular systems. The analysis code that supports the findings is publicly available at https://github.com/Slautin/2024_Co-navigation/tree/main

Submitted 19 April, 2024; originally announced April 2024.

Comments: 23 pages, 10 figures

arXiv:2404.07381 [pdf]

Building Workflows for Interactive Human in the Loop Automated Experiment (hAE) in STEM-EELS

Authors: Utkarsh Pratiush, Kevin M. Roccapriore, Yongtao Liu, Gerd Duscher, Maxim Ziatdinov, Sergei V. Kalinin

Abstract: Exploring the structural, chemical, and physical properties of matter on the nano- and atomic scales has become possible with the recent advances in aberration-corrected electron energy-loss spectroscopy (EELS) in scanning transmission electron microscopy (STEM). However, the current paradigm of STEM-EELS relies on the classical rectangular grid sampling, in which all surface regions are assumed to be of equal a priori interest. This is typically not the case for real-world scenarios, where phenomena of interest are concentrated in a small number of spatial locations. One of the foundational problems is the discovery of nanometer- or atomic-scale structures having specific signatures in EELS spectra. Here we systematically explore the hyperparameters controlling deep kernel learning (DKL) discovery workflows for STEM-EELS and identify the role of the local structural descriptors and acquisition functions in the experiment progression. In agreement with actual experiment, we observe that for certain parameter combinations the experiment path can be trapped in local minima. We demonstrate approaches for monitoring the automated experiment in the real and feature space of the system and for monitoring knowledge acquisition of the DKL model. Based on these, we construct intervention strategies, thus defining the human-in-the-loop automated experiment (hAE). This approach can be further extended to other techniques including 4D STEM and other forms of spectroscopic imaging.

Submitted 10 April, 2024; originally announced April 2024.

arXiv:2404.07074 [pdf]

Multiscale structure-property discovery via active learning in scanning tunneling microscopy

Authors: Ganesh Narasimha, Dejia Kong, Paras Regmi, Rongying Jin, Zheng Gai, Rama Vasudevan, Maxim Ziatdinov

Abstract: Atomic arrangements and local sub-structures fundamentally influence emergent material functionalities. The local structures are conventionally probed using spatially resolved studies and the property correlations are usually deciphered by a researcher based on sequential explorations and auxiliary information, thus limiting the throughput efficiency. Here we demonstrate a Bayesian deep learning based framework that automatically correlates material structure with its electronic properties using scanning tunneling microscopy (STM) measurements in real time. Its predictions are used to autonomously direct exploration toward regions of the sample that optimize a given material property. This autonomous method is deployed on a low-temperature ultra-high vacuum STM to understand the structure-property relationship in a europium-based semimetal, EuZn2As2, one of the promising candidates for studying magnetism-driven topological properties. The framework employs a sparse sampling approach to efficiently construct the scalar-property space using a minimal number of measurements, about 1-10% of the data required in standard hyperspectral imaging methods. We further demonstrate target-property-guided active learning of structures within a multiscale framework. This is implemented across length scales in a hierarchical fashion for the autonomous discovery of structural origins for an observed material property. This framework offers the choice to select and derive a suitable scalar property from the spectroscopic data to steer exploration across the sample space. Our findings reveal correlations of the electronic properties unique to surface terminations, local defect density, and point defects.

Submitted 10 April, 2024; originally announced April 2024.

arXiv:2402.13402 [pdf]

Towards accelerating physical discovery via non-interactive and interactive multi-fidelity Bayesian Optimization: Current challenges and future opportunities

Authors: Arpan Biswas, Sai Mani Prudhvi Valleti, Rama Vasudevan, Maxim Ziatdinov, Sergei V. Kalinin

Abstract: Both computational and experimental material discovery bring forth the challenge of exploring multidimensional and often non-differentiable parameter spaces, such as phase diagrams of Hamiltonians with multiple interactions, composition spaces of combinatorial libraries, processing spaces, and molecular embedding spaces. Often a single instance of these systems is expensive or time-consuming to evaluate, and hence classical approaches based on exhaustive grid or random search are too data intensive. This has resulted in strong interest in active learning methods such as Bayesian optimization (BO), where the adaptive exploration occurs based on a human learning (discovery) objective. However, classical BO is based on a predefined optimization target, and policies balancing exploration and exploitation are purely data driven. In practical settings, the domain expert can pose prior knowledge on the system in the form of partially known physics laws and often varies exploration policies during the experiment. Here, we explore interactive workflows building on multi-fidelity BO (MFBO), starting with classical (data-driven) MFBO, then structured (physics-driven) sMFBO, and extending it to allow human in the loop interactive iMFBO workflows for adaptive and domain expert aligned exploration. These approaches are demonstrated over highly non-smooth multi-fidelity simulation data generated from an Ising model, considering spin-spin interaction as parameter space, lattice sizes as fidelity spaces, and the objective as maximizing heat capacity. Detailed analysis and comparison show the impact of physics knowledge injection and on-the-fly human decisions for improved exploration, current challenges, and potential opportunities for algorithm development by combining data, physics, and real-time human decisions.

Submitted 20 February, 2024; originally announced February 2024.

Comments: Main text includes 29 pages and 10 figures, Supplementary mat. includes 4 pages and 4 figures

arXiv:2402.02198 [pdf]

Co-orchestration of Multiple Instruments to Uncover Structure-Property Relationships in Combinatorial Libraries

Authors: Boris N. Slautin, Utkarsh Pratiush, Ilia N. Ivanov, Yongtao Liu, Rohit Pant, Xiaohang Zhang, Ichiro Takeuchi, Maxim A. Ziatdinov, Sergei V. Kalinin

Abstract: The rapid growth of automated and autonomous instrumentations brings forth an opportunity for the co-orchestration of multimodal tools, equipped with multiple sequential detection methods, or several characterization tools to explore identical samples. This can be exemplified by the combinatorial libraries that can be explored in multiple locations by multiple tools simultaneously, or downstream characterization in automated synthesis systems. In the co-orchestration approaches, information gained in one modality should accelerate the discovery of other modalities. Correspondingly, the orchestrating agent should select the measurement modality based on the anticipated knowledge gain and measurement cost. Here, we propose and implement a co-orchestration approach for conducting measurements with complex observables such as spectra or images. The method relies on combining dimensionality reduction by variational autoencoders with representation learning for control over the latent space structure, integrated into an iterative workflow via multi-task Gaussian Processes (GP). This approach further allows for the native incorporation of the system's physics via a probabilistic model as a mean function of the GP. We illustrate this method for different modalities of piezoresponse force microscopy and micro-Raman on a combinatorial $Sm-BiFeO_3$ library. However, the proposed framework is general and can be extended to multiple measurement modalities and arbitrary dimensionality of measured signals. The analysis code that supports the findings is publicly available at https://github.com/Slautin/2024_Co-orchestration.

Submitted 17 March, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

Comments: 22 pages, 9 figures

arXiv:2310.17765 [pdf]

doi 10.1063/5.0185362

Autonomous convergence of STM control parameters using Bayesian Optimization

Authors: Ganesh Narasimha, Saban Hus, Arpan Biswas, Rama Vasudevan, Maxim Ziatdinov

Abstract: Scanning tunneling microscopy (STM) is a widely used tool for atomic imaging of novel materials and their surface energetics. However, the optimization of the imaging conditions is a tedious process due to the extremely sensitive tip-surface interaction, and thus limits the throughput efficiency. Here we deploy a machine learning (ML) based framework to achieve optimal atomically resolved imaging conditions in real time. The experimental workflow leverages the Bayesian optimization (BO) method to rapidly improve the image quality, defined by the peak intensity in the Fourier space. The outcome of the BO prediction is incorporated into the microscope controls, i.e., the current setpoint and the tip bias, to dynamically improve the STM scan conditions. We present strategies to either selectively explore or exploit across the parameter space. As a result, suitable policies are developed for autonomous convergence of the control parameters. The ML-based framework serves as a general workflow methodology across a wide range of materials.

Submitted 26 October, 2023; originally announced October 2023.

Comments: 31 pages, 5 figures and Supplementary Information
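
The abstract above defines image quality by the peak intensity in Fourier space. A minimal sketch of such a metric is shown below; the DC-exclusion radius, the synthetic lattice image, and the function names (`fourier_peak_intensity`, `make_lattice`) are assumptions for illustration and are not taken from the paper.

```python
import numpy as np


def fourier_peak_intensity(image, exclude_radius=2):
    """Largest FFT magnitude outside a small disc around the zero-frequency term."""
    img = np.asarray(image, dtype=float)
    img = img - img.mean()  # remove the constant offset
    spectrum = np.abs(np.fft.fftshift(np.fft.fft2(img)))
    cy, cx = img.shape[0] // 2, img.shape[1] // 2  # zero frequency after fftshift
    yy, xx = np.ogrid[: img.shape[0], : img.shape[1]]
    outside_dc = (yy - cy) ** 2 + (xx - cx) ** 2 > exclude_radius**2
    return float(spectrum[outside_dc].max())


def make_lattice(size=64, period=8, noise=0.0, seed=0):
    """Hypothetical stand-in for an atomically resolved scan: a square lattice plus noise."""
    rng = np.random.default_rng(seed)
    y, x = np.mgrid[:size, :size]
    lattice = np.cos(2 * np.pi * x / period) + np.cos(2 * np.pi * y / period)
    return lattice + noise * rng.standard_normal((size, size))


resolved = fourier_peak_intensity(make_lattice(noise=0.1))
featureless = fourier_peak_intensity(np.random.default_rng(1).standard_normal((64, 64)))
print(f"resolved lattice: {resolved:.1f}, featureless noise: {featureless:.1f}")
```

A periodic lattice concentrates spectral weight in a few sharp peaks, so this score is much higher for the resolved image than for the featureless one, which is what makes it usable as a scalar optimization target.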

arXiv:2310.13187 [pdf]

Dynamic STEM-EELS for single atom and defect measurement during electron beam transformations

Authors: Kevin M. Roccapriore, Riccardo Torsi, Joshua Robinson, Sergei V. Kalinin, Maxim Ziatdinov

Abstract: On- and off-axis electron energy loss spectroscopy (EELS) is a powerful method for probing local electronic structure at the single-atom level. However, many materials undergo electron-beam induced transformation during scanning transmission electron microscopy (STEM) and spectroscopy, a problem particularly acute for off-axis EELS signals. Here, we propose and operationalize the rapid object detection and action system (RODAS) for dynamic exploration of the structure-property relationships in STEM-EELS. In this approach, the electron beam is used to induce dynamic transformations creating new defect types at sufficiently small rates and avoiding complete material destruction. The deep convolutional neural networks trained via the ensemble learning iterative training (ELIT) approach are used to identify the defects as they form and perform EELS measurements only at specific defect types. Overall, in this case the EEL spectra are collected only at predefined objects of interest, avoiding measurements on the ideal regions or holes. We note that this approach can be extended to identify new defect classes as they appear, allowing for efficient collection of structure-property relationship data via balanced sampling over defect types.

Submitted 19 October, 2023; originally announced October 2023.

arXiv:2310.08378 [pdf]

When the atoms dance: exploring mechanisms of electron-beam induced modifications of materials with machine-learning assisted high temporal resolution electron microscopy

Authors: Matthew G. Boebinger, Ayana Ghosh, Kevin M. Roccapriore, Sudhajit Misra, Kai Xiao, Stephen Jesse, Maxim Ziatdinov, Sergei V. Kalinin, Raymond R. Unocic

Abstract: Directed atomic fabrication using an aberration-corrected scanning transmission electron microscope (STEM) opens new pathways for atomic engineering of functional materials. In this approach, the electron beam is used to actively alter the atomic structure through electron beam induced irradiation processes. One of the impediments that has limited widespread use thus far has been the ability to understand the fundamental mechanisms of atomic transformation pathways at high spatiotemporal resolution. Here, we develop a workflow for obtaining and analyzing high-speed spiral scan STEM data, up to 100 fps, to track the atomic fabrication process during nanopore milling in monolayer MoS2. An automated feedback-controlled electron beam positioning system combined with a deep convolutional neural network (DCNN) was used to decipher fast but low signal-to-noise datasets and classify time-resolved atom positions and the nature of their evolving atomic defect configurations. Through this automated decoding, the initial atomic disordering and reordering processes leading to nanopore formation could be studied across various timescales. Using these experimental workflows, a greater degree of speed and information can be extracted from small datasets without compromising spatial resolution. This approach can be adapted to other 2D materials systems to gain further insights into the defect formation necessary to inform future automated fabrication techniques utilizing the STEM electron beam.

Submitted 12 October, 2023; originally announced October 2023.

arXiv:2310.06583 [pdf]

Physics-driven discovery and bandgap engineering of hybrid perovskites

Authors: Sheryl L. Sanchez, Elham Foadian, Maxim Ziatdinov, Jonghee Yang, Sergei V. Kalinin, Yongtao Liu, Mahshid Ahmadi

Abstract: The unique aspect of the hybrid perovskites is their tunability, allowing the bandgap to be engineered via substitution. From the application viewpoint, this allows creation of tandem cells between perovskites and silicon, or two or more perovskites, with an associated increase of efficiency beyond the single-junction Shockley-Queisser limit. However, the concentration dependence of the optical bandgap in the hybrid perovskite solid solutions can be non-linear and even non-monotonic, as determined by the band alignments between endmembers, presence of the defect states and Urbach tails, and phase separation. Exploring new compositions brings forth the joint problem of discovering the composition with the desired band gap and establishing the physical model of the band gap concentration dependence. Here we report the development of the experimental workflow based on structured Gaussian Process (sGP) models and custom sGP (c-sGP) that allow the joint discovery of the experimental behavior and the underpinning physical model. This approach is verified with simulated data sets with known ground truth, and was found to accelerate the discovery of experimental behavior and the underlying physical model. The d/c-sGP approach utilizes a few calculated thin film bandgap data points to guide targeted explorations, minimizing the number of thin film preparations. Through iterative exploration, we demonstrate that the c-sGP algorithm that combined 5 bandgap models converges rapidly, revealing a relationship in the bandgap diagram of MA1-xGAxPb(I1-xBrx)3. This approach offers a promising method for efficiently understanding the physical model of band gap concentration dependence in binary systems; the method can also be extended to ternary or higher dimensional systems.

Submitted 10 October, 2023; originally announced October 2023.

arXiv:2310.05018 [pdf]

Human-in-the-loop: The future of Machine Learning in Automated Electron Microscopy

Authors: Sergei V. Kalinin, Yongtao Liu, Arpan Biswas, Gerd Duscher, Utkarsh Pratiush, Kevin Roccapriore, Maxim Ziatdinov, Rama Vasudevan

Abstract: Machine learning methods are progressively gaining acceptance in the electron microscopy community for de-noising, semantic segmentation, and dimensionality reduction of data post-acquisition. The introduction of APIs by major instrument manufacturers now allows the deployment of ML workflows in microscopes, not only for data analytics but also for real-time decision-making and feedback for microscope operation. However, the number of use cases for real-time ML remains remarkably small. Here, we discuss some considerations in designing ML-based active experiments and pose that the likely strategy for the next several years will be human-in-the-loop automated experiments (hAE). In this paradigm, the ML agent directly controls beam position and image and spectroscopy acquisition functions, and the human operator monitors experiment progression in the real and feature space of the system and tunes the policies of the ML agent to steer the experiment towards specific objectives.

Submitted 8 October, 2023; originally announced October 2023.

arXiv:2308.09004 [pdf, other]

Towards Lightweight Data Integration using Multi-workflow Provenance and Data Observability

Authors: Renan Souza, Tyler J. Skluzacek, Sean R. Wilkinson, Maxim Ziatdinov, Rafael Ferreira da Silva

Abstract: Modern large-scale scientific discovery requires multidisciplinary collaboration across diverse computing facilities, including High Performance Computing (HPC) machines and the Edge-to-Cloud continuum. Integrated data analysis plays a crucial role in scientific discovery, especially in the current AI era, by enabling Responsible AI development, FAIR, Reproducibility, and User Steering. However, the heterogeneous nature of science poses challenges such as dealing with multiple supporting tools, cross-facility environments, and efficient HPC execution. Building on data observability, adapter system design, and provenance, we propose MIDA: an approach for lightweight runtime Multi-workflow Integrated Data Analysis. MIDA defines data observability strategies and adaptability methods for various parallel systems and machine learning tools. With observability, it intercepts the dataflows in the background without requiring instrumentation while integrating domain, provenance, and telemetry data at runtime into a unified database ready for user steering queries. We conduct experiments showing end-to-end multi-workflow analysis integrating data from Dask and MLFlow in a real distributed deep learning use case for materials science that runs on multiple environments with up to 276 GPUs in parallel. We show near-zero overhead running up to 100,000 tasks on 1,680 CPU cores on the Summit supercomputer.

Submitted 17 August, 2023; originally announced August 2023.

Comments: 10 pages, 5 figures, 2 Listings, 42 references, Paper accepted at IEEE eScience'23

MSC Class: 65Y05; 68P15 ACM Class: I.2; H.2; C.4; J.2

Journal ref: 19th IEEE International Conference on e-Science (eScience) 2023 - Limassol, Cyprus

arXiv:2307.06883 [pdf, other]

Cyber Framework for Steering and Measurements Collection Over Instrument-Computing Ecosystems

Authors: Anees Al-Najjar, Nageswara S. V. Rao, Ramanan Sankaran, Helia Zandi, Debangshu Mukherjee, Maxim Ziatdinov, Craig Bridges

Abstract: We propose a framework to develop cyber solutions to support the remote steering of science instruments and measurements collection over instrument-computing ecosystems. It is based on provisioning separate data and control connections at the network level, and developing software modules consisting of Python wrappers for instrument commands and Pyro server-client codes that make them available across the ecosystem network. We demonstrate automated measurement transfers and remote steering operations in a microscopy use case for materials research over an ecosystem of Nion microscopes and computing platforms connected over site networks. The proposed framework is currently under further refinement and being adopted to science workflows with automated remote experiments steering for autonomous chemistry laboratories and smart energy grid simulations.

Submitted 12 July, 2023; originally announced July 2023.

Comments: Paper accepted for presentation at IEEE SMARTCOMP 2023

arXiv:2304.02484 [pdf]

A dynamic Bayesian optimized active recommender system for curiosity-driven Human-in-the-loop automated experiments

Authors: Arpan Biswas, Yongtao Liu, Nicole Creange, Yu-Chen Liu, Stephen Jesse, Jan-Chi Yang, Sergei V. Kalinin, Maxim A. Ziatdinov, Rama K. Vasudevan

Abstract: Optimization of experimental materials synthesis and characterization through active learning methods has been growing over the last decade, with examples ranging from measurements of diffraction on combinatorial alloys at synchrotrons, to searches through chemical space with automated synthesis robots for perovskites. In virtually all cases, the target property of interest for optimization is defined a priori with limited human feedback during operation. In contrast, here we present the development of a new type of human-in-the-loop experimental workflow, via a Bayesian optimized active recommender system (BOARS), to shape targets on the fly, employing human feedback. We showcase examples of this framework applied to pre-acquired piezoresponse force spectroscopy of a ferroelectric thin film, and then implement this in real time on an atomic force microscope, where the optimization proceeds to find symmetric piezoresponse amplitude hysteresis loops. It is found that such features appear more affected by subsurface defects than the local domain structure. This work shows the utility of human-augmented machine learning approaches for curiosity-driven exploration of systems across experimental domains. The analysis reported here is summarized in a Colab Notebook for the purpose of tutorial and application to other data: https://github.com/arpanbiswas52/varTBO

Submitted 5 April, 2023; originally announced April 2023.

Comments: 7 figures in main text, 3 figures in Supp Material

arXiv:2304.02048 [pdf]

doi 10.1038/s41524-023-01142-0

Deep Learning for Automated Experimentation in Scanning Transmission Electron Microscopy

Authors: Sergei V. Kalinin, Debangshu Mukherjee, Kevin M. Roccapriore, Ben Blaiszik, Ayana Ghosh, Maxim A. Ziatdinov, A. Al-Najjar, Christina Doty, Sarah Akers, Nageswara S. Rao, Joshua C. Agar, Steven R. Spurgeon

Abstract: Machine learning (ML) has become critical for post-acquisition data analysis in (scanning) transmission electron microscopy, (S)TEM, imaging and spectroscopy. An emerging trend is the transition to real-time analysis and closed-loop microscope operation. The effective use of ML in electron microscopy now requires the development of strategies for microscopy-centered experiment workflow design and optimization. Here, we discuss the associated challenges with the transition to active ML, including sequential data analysis and out-of-distribution drift effects, the requirements for the edge operation, local and cloud data storage, and theory in the loop operations. Specifically, we discuss the relative contributions of human scientists and ML agents in the ideation, orchestration, and execution of experimental workflows and the need to develop universal hyper languages that can apply across multiple platforms. These considerations will collectively inform the operationalization of ML in next-generation experimentation.

Submitted 4 April, 2023; originally announced April 2023.

Comments: Review Article

arXiv:2303.14554 [pdf]

Deep Kernel Methods Learn Better: From Cards to Process Optimization

Authors: Mani Valleti, Rama K. Vasudevan, Maxim A. Ziatdinov, Sergei V. Kalinin

Abstract: The ability of deep learning methods to perform classification and regression tasks relies heavily on their capacity to uncover manifolds in high-dimensional data spaces and project them into low-dimensional representation spaces. In this study, we investigate the structure and character of the manifolds generated by classical variational autoencoder (VAE) approaches and deep kernel learning (DKL). In the former case, the structure of the latent space is determined by the properties of the input data alone, while in the latter, the latent manifold forms as a result of an active learning process that balances the data distribution and target functionalities. We show that DKL with active learning can produce a more compact and smooth latent space which is more conducive to optimization compared to previously reported methods, such as the VAE. We demonstrate this behavior using a simple cards data set and extend it to the optimization of domain-generated trajectories in physical systems. Our findings suggest that latent manifolds constructed through active learning have a more beneficial structure for optimization problems, especially in feature-rich target-poor scenarios that are common in domain sciences, such as materials synthesis, energy storage, and molecular discovery. The jupyter notebooks that encapsulate the complete analysis accompany the article.

Submitted 19 September, 2023; v1 submitted 25 March, 2023; originally announced March 2023.

Comments: 8 Figures, 26 pages

arXiv:2303.03793 [pdf]

Roadmap on Deep Learning for Microscopy

Authors: Giovanni Volpe, Carolina Wählby, Lei Tian, Michael Hecht, Artur Yakimovich, Kristina Monakhova, Laura Waller, Ivo F. Sbalzarini, Christopher A. Metzler, Mingyang Xie, Kevin Zhang, Isaac C. D. Lenton, Halina Rubinsztein-Dunlop, Daniel Brunner, Bijie Bai, Aydogan Ozcan, Daniel Midtvedt, Hao Wang, Nataša Sladoje, Joakim Lindblad, Jason T. Smith, Marien Ochoa, Margarida Barroso, Xavier Intes, Tong Qiu, et al. (50 additional authors not shown)

Abstract: Through digital imaging, microscopy has evolved from primarily being a means for visual observation of life at the micro- and nano-scale, to a quantitative tool with ever-increasing resolution and throughput. Artificial intelligence, deep neural networks, and machine learning are all niche terms describing computational methods that have gained a pivotal role in microscopy-based research over the past decade. This Roadmap is written collectively by prominent researchers and encompasses selected aspects of how machine learning is applied to microscopy image data, with the aim of gaining scientific knowledge by improved image quality, automated detection, segmentation, classification and tracking of objects, and efficient merging of information from multiple imaging modalities. We aim to give the reader an overview of the key developments and an understanding of possibilities and limitations of machine learning for microscopy. It will be of interest to a wide cross-disciplinary audience in the physical sciences and life sciences.

Submitted 7 March, 2023; originally announced March 2023.

arXiv:2302.14629 [pdf]

A processing and analytics system for microscopy data workflows: the Pycroscopy ecosystem of packages

Authors: Rama Vasudevan, Mani Valleti, Maxim Ziatdinov, Gerd Duscher, Suhas Somnath

Abstract: Major advancements in fields as diverse as biology and quantum computing have relied on a multitude of microscopic techniques. All optical, electron and scanning probe microscopy advanced with new detector technologies and integration of spectroscopy, imaging, and diffraction. Despite the considerable proliferation of these instruments, significant bottlenecks remain in terms of processing, analysis, storage, and retrieval of acquired datasets. Aside from the lack of file standards, individual domain-specific analysis packages are often disjoint from the underlying datasets. Thus, keeping track of analysis and processing steps remains tedious for the end-user, hampering reproducibility. Here, we introduce the pycroscopy ecosystem of packages, an open-source python-based ecosystem underpinned by a common data model. Our data model, termed the N-dimensional spectral imaging data format, is realized in pycroscopy's sidpy package. This package is built on top of dask arrays, thus leveraging dask array attributes but expanding them to accelerate microscopy-relevant analysis and visualization. Several examples of the use of the pycroscopy ecosystem to create workflows for data ingestion and analysis are shown. Adoption of such standardized routines will be critical to usher in the next generation of autonomous instruments where processing, computation, and meta-data storage will be critical to overall experimental operations.

Submitted 20 February, 2023; originally announced February 2023.

Comments: 14 pages, 6 figures

arXiv:2302.06577 [pdf]

Post-Experiment Forensics and Human-in-the-Loop Interventions in Explainable Autonomous Scanning Probe Microscopy

Authors: Yongtao Liu, Maxim Ziatdinov, Rama Vasudevan, Sergei V. Kalinin

Abstract: The broad adoption of machine learning (ML)-based automated and autonomous experiments (AE) in physical characterization and synthesis requires development of strategies for understanding and intervention in the experimental workflow. Here, we introduce and realize strategies for post-acquisition forensic analysis applied to the deep kernel learning based AE scanning probe microscopy. This approach yields real-time and post-acquisition indicators of the progression of an active learning process interacting with an experimental system. We further illustrate that this approach can be extended towards human-in-the-loop autonomous experiments, where human operators make high-level decisions at high latencies setting the policies for AE, and the ML algorithm performs low-level fast decisions. The proposed approach is universal and can be extended to other physical and chemical imaging techniques and applications such as combinatorial library analysis. The full forensic analysis notebook is publicly available on GitHub at https://github.com/yongtaoliu/Forensics-DKL-BEPS.

Submitted 13 February, 2023; originally announced February 2023.

Comments: 24 pages, 8 figures

arXiv:2302.04397 [pdf]

Designing Workflows for Materials Characterization

Authors: Sergei V. Kalinin, Maxim Ziatdinov, Mahshid Ahmadi, Ayana Ghosh, Kevin Roccapriore, Yongtao Liu, Rama K. Vasudevan

Abstract: Experimental science is enabled by the combination of synthesis, imaging, and functional characterization. Synthesis of a new material is typically followed by a set of characterization methods aiming to provide feedback for optimization or discover fundamental mechanisms. However, the sequence of synthesis and characterization methods and their interpretation, or research workflow, has traditionally been driven by human intuition and is highly domain specific. Here we explore concepts of scientific workflows that emerge at the interface between theory, characterization, and imaging. We discuss the criteria by which these workflows can be constructed for special cases of multi-resolution structural imaging and structural and functional characterization. Some considerations for theory-experiment workflows are provided. We further pose that the emergence of user facilities and cloud labs disrupts the classical progression through the ideation, orchestration, and execution stages of workflow development and necessitates development of universal frameworks for workflow design, including universal hyper-languages describing laboratory operation, reward functions and their integration between domains, and policy development for workflow optimization. These tools will enable knowledge-based workflow optimization, enable lateral instrumental networks, sequential and parallel orchestration of characterization between dissimilar facilities, and empower distributed research.

Submitted 7 February, 2023; originally announced February 2023.

Comments: 33 pages; 8 figures

arXiv:2302.04216 [pdf]

Combining Variational Autoencoders and Physical Bias for Improved Microscopy Data Analysis

Authors: Arpan Biswas, Maxim Ziatdinov, Sergei V. Kalinin

Abstract: Electron and scanning probe microscopy produce vast amounts of data in the form of images or hyperspectral data, such as EELS or 4D STEM, that contain information on a wide range of structural, physical, and chemical properties of materials. To extract valuable insights from these data, it is crucial to identify physically separate regions in the data, such as phases, ferroic variants, and boundaries between them. In order to derive an easily interpretable feature analysis, combined with well-defined boundaries in a principled and unsupervised manner, here we present a physics-augmented machine learning method which combines the capability of Variational Autoencoders to disentangle factors of variability within the data and a physics-driven loss function that seeks to minimize the total length of the discontinuities in images corresponding to latent representations. Our method is applied to various materials, including NiO-LSMO, BiFeO3, and graphene. The results demonstrate the effectiveness of our approach in extracting meaningful information from large volumes of imaging data. The full notebook containing the implementation of the code and analysis workflow is available at https://github.com/arpanbiswas52/PaperNotebooks

Submitted 7 June, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

Comments: 20 pages, 7 figures in main text, 4 figures in Supp Mat

arXiv:2301.02665 [pdf]

doi 10.1063/5.0157644

Discovery of structure-property relations for molecules via hypothesis-driven active learning over the chemical space

Authors: Ayana Ghosh, Sergei V. Kalinin, Maxim A. Ziatdinov

Abstract: Discovery of the molecular candidates for applications in drug targets, biomolecular systems, catalysts, photovoltaics, organic electronics, and batteries, necessitates development of machine learning algorithms capable of rapid exploration of the chemical spaces targeting the desired functionalities. Here we introduce a novel approach for the active learning over the chemical spaces based on hypothesis learning. We construct the hypotheses on the possible relationships between structures and functionalities of interest based on a small subset of data and introduce them as (probabilistic) mean functions for the Gaussian process. This approach combines the elements from the symbolic regression methods such as SISSO and active learning into a single framework. The primary focus of constructing this framework is to approximate physical laws in an active learning regime toward a more robust predictive performance, as traditional evaluation on hold-out sets in machine learning doesn't account for out-of-distribution effects and may lead to a complete failure on unseen chemical space. Here, we demonstrate it for the QM9 dataset, but it can be applied more broadly to datasets from both domains of molecular and solid-state materials sciences.

Submitted 8 May, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

Report number: APL Mach. Learn. 1, 046102 (2023)

Journal ref: APL Mach. Learn. 1, 046102 (2023)

arXiv:2212.14442 [pdf, ps, other]

Deterministic Construction of QFAs based on the Quantum Fingerprinting Technique

Authors: Aliya Khadieva, Mansur Ziatdinov

Abstract: It is known that for some languages quantum finite automata are more efficient than classical counterparts. Particularly, a QFA recognizing the language $MOD_p$ has an exponential advantage over the classical finite automata. However, the construction of such QFA is probabilistic. In the current work, we propose a deterministic construction of the QFA for the language $MOD_p$. We construct a QFA for a promise problem $Palindrome_s$ and implement this QFA on the IBMQ simulator using qiskit library tools.

Submitted 29 December, 2022; originally announced December 2022.

arXiv:2212.07310 [pdf]

Exploring the microstructural origins of conductivity and hysteresis in metal halide perovskites via active learning driven automated scanning probe microscopy

Authors: Yongtao Liu, Jonghee Yang, Rama K. Vasudevan, Kyle P. Kelley, Maxim Ziatdinov, Sergei V. Kalinin, Mahshid Ahmadi

Abstract: Electronic transport and hysteresis in metal halide perovskites (MHPs) are key to the applications in photovoltaics, light emitting devices, and light and chemical sensors. These phenomena are strongly affected by the material's microstructure, including grain boundaries, ferroic domain walls, and secondary phase inclusions. Here, we demonstrate an active machine learning framework for 'driving' an automated scanning probe microscope (SPM) to discover the microstructures responsible for specific aspects of transport behavior in MHPs. In our setup, the microscope can discover the microstructural elements that maximize the onset of conduction, hysteresis, or any other characteristic that can be derived from a set of current-voltage spectra. This approach opens new opportunities for exploring the origins of materials functionality in complex materials by SPM and can be integrated with other characterization techniques either before (prior knowledge) or after (identification of locations of interest for detailed studies) functional probing.

Submitted 14 December, 2022; originally announced December 2022.

Comments: 19 pages; 7 figures

arXiv:2210.14138 [pdf]

Disentangling electronic transport and hysteresis at individual grain boundaries in hybrid perovskites via automated scanning probe microscopy

Authors: Yongtao Liu, Jonghee Yang, Benjamin J. Lawrie, Kyle P. Kelley, Maxim Ziatdinov, Sergei V. Kalinin, Mahshid Ahmadi

Abstract: Underlying the rapidly increasing photovoltaic efficiency and stability of metal halide perovskites (MHPs) is the advance in the understanding of the microstructure of polycrystalline MHP thin films. Over the past decade, intense efforts have aimed to understand the effect of microstructure on MHP properties, including chemical heterogeneity, strain disorder, phase impurity, etc. It has been found that grains and grain boundaries (GBs) are closely tied to many microscale and nanoscale behaviors in MHP thin films. Atomic force microscopy (AFM) is widely used to observe grain and boundary structures in topography and subsequently to study the correlative surface potential and conductivity of these structures. So far, most AFM measurements have been performed in imaging mode to study static behavior. In contrast, AFM spectroscopy mode allows us to investigate the dynamic behavior of materials, e.g. conductivity under sweeping voltage. However, a major limitation of AFM spectroscopy measurements is that they require manual operation by human operators; as such, only limited data can be obtained, hindering systematic investigations of these microstructures. In this work, we designed a workflow combining the conductive AFM measurement with a machine learning (ML) algorithm to systematically investigate grain boundaries in MHPs. The trained ML model can extract GB locations from the topography image, and the workflow drives the AFM probe to each GB location to measure a current-voltage (IV) curve automatically. Then, we are able to obtain IV curves at all GB locations, allowing us to systematically understand the properties of GBs. Using this method, we discover that the GB junction points are more photoactive, while most previous works only focused on the difference between GBs and grains.

Submitted 25 October, 2022; originally announced October 2022.

Comments: 19 pages, 8 figures

arXiv:2210.11197 [pdf, ps, other]

Noisy Tree Data Structures and Quantum Applications

Authors: Kamil Khadiev, Nikita Savelyev, Mansur Ziatdinov, Denis Melnikov

Abstract: The paper presents a technique for constructing noisy data structures called a walking tree. We apply it to a Red-Black tree (an implementation of a Self-Balanced Binary Search Tree) and a segment tree. We obtain the same complexity of the main operations for these data structures as in the case without noise (asymptotically). We present several applications of the data structures for quantum algorithms. Finally, we suggest a new quantum solution for the string sorting problem and show the lower bound. The upper and lower bounds are the same up to a log factor. At the same time, it is more effective than classical counterparts.

Submitted 15 May, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

arXiv:2210.09791 [pdf, other]

Enabling Autonomous Electron Microscopy for Networked Computation and Steering

Authors: Anees Al-Najjar, Nageswara S. V. Rao, Ramanan Sankaran, Maxim Ziatdinov, Debangshu Mukherjee, Olga Ovchinnikova, Kevin Roccapriore, Andrew R. Lupini, Sergei V. Kalinin

Abstract: Advanced electron microscopy workflows require an ecosystem of microscope instruments and computing systems possibly located at different sites to conduct remotely steered and automated experiments. Current workflow executions involve manual operations for steering and measurement tasks, which are typically performed from control workstations co-located with microscopes; consequently, their operational tempo and effectiveness are limited. We propose an approach based on separate data and control channels for such an ecosystem of Scanning Transmission Electron Microscopes (STEM) and computing systems, for which no general solutions presently exist, unlike the neutron and light source instruments. We demonstrate automated measurement transfers and remote steering of Nion STEM physical instruments over site networks. We propose a Virtual Infrastructure Twin (VIT) of this ecosystem, which is used to develop and test our steering software modules without requiring access to the physical instrument infrastructure. Additionally, we develop a VIT for a multiple laboratory scenario, which illustrates the applicability of this approach to ecosystems connected over wide-area networks, for the development and testing of software modules and their later field deployment.

Submitted 18 October, 2022; originally announced October 2022.

Comments: 11 pages, 16 figures, accepted at IEEE eScience 2022 conference

arXiv:2210.06526 [pdf]

Microscopy is All You Need

Authors: Sergei V. Kalinin, Rama Vasudevan, Yongtao Liu, Ayana Ghosh, Kevin Roccapriore, Maxim Ziatdinov

Abstract: We pose that microscopy offers an ideal real-world experimental environment for the development and deployment of active Bayesian and reinforcement learning methods. Indeed, the tremendous progress achieved by machine learning (ML) and artificial intelligence over the last decade has been largely achieved via the utilization of static data sets, from the paradigmatic MNIST to the bespoke corpora of text and image data used to train large models such as GPT3, DALLE and others. However, it is now recognized that continuous, minute improvements to state-of-the-art do not necessarily translate to advances in real-world applications. We argue that a promising pathway for the development of ML methods is via the route of domain-specific deployable algorithms in areas such as electron and scanning probe microscopy and chemical imaging. This will benefit both fundamental physical studies and serve as a test bed for more complex autonomous systems such as robotics and manufacturing. Favorable environment characteristics of scanning and electron microscopy include low risk, extensive availability of domain-specific priors and rewards, relatively small effects of exogenous variables, and often the presence of both upstream first principles as well as downstream learnable physical models for both statics and dynamics. Recent developments in programmable interfaces, edge computing, and access to APIs facilitating microscope control, all render the deployment of ML codes on operational microscopes straightforward. We discuss these considerations and hope that these arguments will lead to creating a novel set of development targets for the ML community by accelerating both real-world ML applications and scientific progress.

Submitted 12 October, 2022; originally announced October 2022.

arXiv:2210.02538 [pdf, other]

doi 10.1017/S1551929522001286

A roadmap for edge computing enabled automated multidimensional transmission electron microscopy

Authors: Debangshu Mukherjee, Kevin M. Roccapriore, Anees Al-Najjar, Ayana Ghosh, Jacob D. Hinkle, Andrew R. Lupini, Rama K. Vasudevan, Sergei V. Kalinin, Olga S. Ovchinnikova, Maxim A. Ziatdinov, Nageswara S. Rao

Abstract: The advent of modern, high-speed electron detectors has made the collection of multidimensional hyperspectral transmission electron microscopy datasets, such as 4D-STEM, routine. However, many microscopists find such experiments daunting since the analysis, collection, long-term storage, and networking of such datasets remain challenging. Some common issues are the large and unwieldy size of these datasets, often running into several gigabytes, non-standardized data analysis routines, and a lack of clarity about the computing and network resources needed to utilize the electron microscope fully. However, the existing computing and networking bottlenecks introduce significant penalties in each step of these experiments, and thus, real-time analysis-driven automated experimentation for multidimensional TEM is exceptionally challenging. One solution is integrating microscopy with edge computing, where moderately powerful computational hardware performs the preliminary analysis before handing off the heavier computation to HPC systems. In this perspective, we trace the roots of computation in modern electron microscopy, demonstrate deep learning experiments running on an edge system, and discuss the networking requirements for tying together microscopes, edge computers, and HPC systems.

Submitted 5 October, 2022; originally announced October 2022.

Comments: Perspective on automated microscopy. 3 figures

Journal ref: Microscopy Today, 30(6), 10-19. (2022)

arXiv:2208.03861 [pdf]

Learning and predicting photonic responses of plasmonic nanoparticle assemblies via dual variational autoencoders

Authors: Muammer Y. Yaman, Sergei V. Kalinin, Kathryn N. Guye, David Ginger, Maxim Ziatdinov

Abstract: We demonstrate the application of machine learning for rapid and accurate extraction of plasmonic particle cluster geometries from hyperspectral image data via a dual variational autoencoder (dual-VAE). In this approach, the information is shared between the latent spaces of two VAEs acting on the particle shape data and spectral data, respectively, but enforcing a common encoding on the shape-spectra pairs. We show that this approach can establish the relationship between the geometric characteristics of nanoparticles and their far-field photonic responses, demonstrating that we can use hyperspectral darkfield microscopy to accurately predict the geometry (number of particles, arrangement) of multiparticle assemblies below the diffraction limit in an automated fashion with high fidelity (0.96 for monomers, 0.86 for dimers, and 0.58 for trimers). This approach of building structure-property relationships via shared encoding is universal and should have applications to a broader range of materials science and physics problems in imaging of both molecular and nanomaterial systems.

Submitted 7 August, 2022; originally announced August 2022.

Comments: 12 pages, 5 figures

arXiv:2207.12882 [pdf]

Probing electron beam induced transformations on a single defect level via automated scanning transmission electron microscopy

Authors: Kevin M. Roccapriore, Matthew G. Boebinger, Ondrej Dyck, Ayana Ghosh, Raymond R. Unocic, Sergei V. Kalinin, Maxim Ziatdinov

Abstract: The robust approach for real-time analysis of the scanning transmission electron microscopy (STEM) data streams, based on the ensemble learning and iterative training (ELIT) of deep convolutional neural networks, is implemented on an operational microscope, enabling the exploration of the dynamics of specific atomic configurations under electron beam irradiation via an automated experiment in STEM. Combined with beam control, this approach allows studying beam effects on selected atomic groups and chemical bonds in a fully automated mode. Here, we demonstrate atomically precise engineering of single vacancy lines in transition metal dichalcogenides and the creation and identification of topological defects in graphene. The ELIT-based approach opens the pathway toward the direct on-the-fly analysis of the STEM data and engendering real-time feedback schemes for probing electron beam chemistry, atomic manipulation, and atom by atom assembly.

Submitted 26 July, 2022; originally announced July 2022.

arXiv:2207.03039 [pdf]

Learning the right channel in multimodal imaging: automated experiment in Piezoresponse Force Microscopy

Authors: Yongtao Liu, Rama K. Vasudevan, Kyle P. Kelley, Hiroshi Funakubo, Maxim Ziatdinov, Sergei V. Kalinin

Abstract: We report the development and experimental implementation of the automated experiment workflows for the identification of the best predictive channel for a phenomenon of interest in spectroscopic measurements. The approach is based on the combination of ensembled deep kernel learning for probabilistic predictions and a basic reinforcement learning policy for channel selection. It allows the identification of which of the available observational channels, sampled sequentially, are most predictive of selected behaviors, and hence have the strongest correlations. We implement this approach for multimodal imaging in Piezoresponse Force Microscopy (PFM), with the behaviors of interest manifesting in piezoresponse spectroscopy. We show that the best predictive channel for polarization-voltage hysteresis loop and frequency-voltage hysteresis loop areas is amplitude in the model samples. The same workflow and code are universal and applicable for any multimodal imaging and local characterization methods.

Submitted 13 February, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

Comments: 17 pages, 5 figures

arXiv:2207.00128 [pdf]

Optimizing Training Trajectories in Variational Autoencoders via Latent Bayesian Optimization Approach

Authors: Arpan Biswas, Rama Vasudevan, Maxim Ziatdinov, Sergei V. Kalinin

Abstract: Unsupervised and semi-supervised ML methods such as variational autoencoders (VAE) have become widely adopted across multiple areas of physics, chemistry, and materials sciences due to their capability in disentangling representations and ability to find latent manifolds for classification and regression of complex experimental data. Like other ML problems, VAEs require hyperparameter tuning, e.g., balancing the Kullback-Leibler (KL) and reconstruction terms. However, the training process and resulting manifold topology and connectivity depend not only on hyperparameters, but also on their evolution during training. Because of the inefficiency of exhaustive search in a high-dimensional hyperparameter space for expensive-to-train models, here we explore a latent Bayesian optimization (zBO) approach for the hyperparameter trajectory optimization for unsupervised and semi-supervised ML and demonstrate it for joint-VAE with rotational invariances. We demonstrate an application of this method for finding joint discrete and continuous rotationally invariant representations for MNIST and experimental data of a plasmonic nanoparticles material system. The performance of the proposed approach has been discussed extensively, where it allows for any high dimensional hyperparameter tuning or trajectory optimization of other ML models.

Submitted 30 June, 2022; originally announced July 2022.

Comments: 32 pages, including 11 figures in the main text and Appendixes with 2 figures. arXiv admin note: text overlap with arXiv:2108.12889

arXiv:2206.15110 [pdf]

Automated Experiments of Local Non-linear Behavior in Ferroelectric Materials

Authors: Yongtao Liu, Kyle P. Kelley, Rama K. Vasudevan, Wanlin Zhu, John Hayden, Jon-Paul Maria, Hiroshi Funakubo, Maxim A. Ziatdinov, Susan Trolier-McKinstry, Sergei V. Kalinin

Abstract: We develop and implement an automated experiment in multimodal imaging to probe structural, chemical, and functional behaviors in complex materials and elucidate the dominant physical mechanisms that control device function. Here the emergence of non-linear electromechanical responses in piezoresponse force microscopy (PFM) is explored. Non-linear responses in PFM can originate from multiple mechanisms, including intrinsic material responses often controlled by domain structure, surface topography that affects the mechanical phenomena at the tip-surface junction, and, potentially, the presence of surface contaminants. Using an automated experiment to probe the origins of non-linear behavior in model ferroelectric lead titanate (PTO) and ferroelectric Al0.93B0.07N films, it was found that PTO showed asymmetric nonlinear behavior across a/c domain walls and a broadened high nonlinear response region around c/c domain walls. In contrast, for Al0.93B0.07N, well-poled regions showed high linear piezoelectric responses paired with low non-linear responses, and regions that were multidomain indicated low linear responses and high nonlinear responses. We show that formulating dissimilar exploration strategies in deep kernel learning as alternative hypotheses allows for establishing the preponderant physical mechanisms behind the non-linear behaviors, suggesting that this approach to automated experiments can potentially discern between competing physical mechanisms. This technique can also be extended to electron, probe, and chemical imaging.

Submitted 30 June, 2022; originally announced June 2022.

Comments: 18 pages, 5 figures

arXiv:2206.12435 [pdf]

Bayesian Optimization in Continuous Spaces via Virtual Process Embeddings

Authors: Mani Valleti, Rama K. Vasudevan, Maxim A. Ziatdinov, Sergei V. Kalinin

Abstract: Automated chemical synthesis, materials fabrication, and spectroscopic physical measurements often bring forth the challenge of process trajectory optimization, i.e., discovering the time dependence of temperature, electric field, or pressure that gives rise to optimal properties. Due to the high dimensionality of the corresponding vectors, these problems are not directly amenable to Bayesian Optimization (BO). Here we propose an approach based on the combination of the generative statistical models, specifically variational autoencoders, and Bayesian optimization. Here, the set of potential trajectories is formed based on best practices in the field, domain intuition, or human expertise. The variational autoencoder is used to encode the thus generated trajectories as a latent vector, and also allows for the generation of trajectories via sampling from latent space. In this manner, Bayesian Optimization of the process is realized in the latent space of the system, reducing the problem to a low-dimensional one. Here we apply this approach to a ferroelectric lattice model and demonstrate that this approach allows discovering the field trajectories that maximize curl in the system. The analysis of the corresponding polarization and curl distributions allows the relevant physical mechanisms to be decoded.

Submitted 24 June, 2022; originally announced June 2022.

Comments: 22 pages and 9 figures

arXiv:2206.11457 [pdf]

Exploring Physics of Ferroelectric Domain Walls in Real Time: Deep Learning Enabled Scanning Probe Microscopy

Authors: Yongtao Liu, Kyle P. Kelley, Hiroshi Funakubo, Sergei V. Kalinin, Maxim Ziatdinov

Abstract: The functionality of ferroelastic domain walls in ferroelectric materials is explored in real-time via the in-situ implementation of computer vision algorithms in a scanning probe microscopy (SPM) experiment. The robust deep convolutional neural network (DCNN) is implemented based on a deep residual learning framework (Res) and holistically-nested edge detection (Hed), and ensembled to minimize the out-of-distribution drift effects. The DCNN is implemented for real-time operations on SPM, converting the data stream into the semantically segmented image of domain walls and the corresponding uncertainty. We further demonstrate the pre-selected experimental workflows on thus discovered domain walls, and report alternating high- and low-polarization dynamic (out-of-plane) ferroelastic domain walls in a PbTiO3 (PTO) thin film and high polarization dynamic (out-of-plane) at short ferroelastic walls (compared with long ferroelastic walls) in a lead zirconate titanate (PZT) thin film. This work establishes the framework for real-time DCNN analysis of data streams in scanning probe and other microscopies and highlights the role of out-of-distribution effects and strategies to ameliorate them in real time analytics.

Submitted 22 June, 2022; originally announced June 2022.

Comments: 21 pages, 7 figures

arXiv:2205.15458 [pdf]

Bayesian Active Learning for Scanning Probe Microscopy: from Gaussian Processes to Hypothesis Learning

Authors: Maxim Ziatdinov, Yongtao Liu, Kyle Kelley, Rama Vasudevan, Sergei V. Kalinin

Abstract: Recent progress in machine learning methods, and the emerging availability of programmable interfaces for scanning probe microscopes (SPMs), have propelled automated and autonomous microscopies to the forefront of attention of the scientific community. However, enabling automated microscopy requires the development of task-specific machine learning methods, understanding the interplay between physics discovery and machine learning, and fully defined discovery workflows. This, in turn, requires balancing the physical intuition and prior knowledge of the domain scientist with rewards that define experimental goals and machine learning algorithms that can translate these to specific experimental protocols. Here, we discuss the basic principles of Bayesian active learning and illustrate its applications for SPM. We progress from the Gaussian Process as a simple data-driven method and Bayesian inference for physical models as an extension of physics-based functional fits to more complex deep kernel learning methods, structured Gaussian Processes, and hypothesis learning. These frameworks allow for the use of prior data, the discovery of specific functionalities as encoded in spectral data, and exploration of physical laws manifesting during the experiment. The discussed framework can be universally applied to all techniques combining imaging and spectroscopy, SPM methods, nanoindentation, electron microscopy and spectroscopy, and chemical imaging methods, and can be particularly impactful for destructive or irreversible measurements.

Submitted 18 August, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

Comments: 39 pages, 10 figures

arXiv:2204.05095 [pdf]

Physics is the New Data

Authors: Sergei V. Kalinin, Maxim Ziatdinov, Bobby G. Sumpter, Andrew D. White

Abstract: The rapid development of machine learning (ML) methods has fundamentally affected numerous applications ranging from computer vision, biology, and medicine to accounting and text analytics. Until now, it was the availability of large and often labeled data sets that enabled significant breakthroughs. However, the adoption of these methods in classical physical disciplines has been relatively slow, a tendency that can be traced to the intrinsic differences between correlative approaches of purely data-based ML and the causal hypothesis-driven nature of physical sciences. Furthermore, anomalous behaviors of classical ML necessitate addressing issues such as explainability and fairness of ML. We also note the sequence in which deep learning became mainstream in different scientific disciplines - starting from medicine and biology and then towards theoretical chemistry, and only after that, physics - is rooted in the progressively more complex level of descriptors, constraints, and causal structures available for incorporation in ML architectures. Here we put forth that over the next decade, physics will become the new data, and this will continue the transition from dot-coms and scientific computing concepts of the 1990s to big data of 2000-2010 to deep learning of 2010-2020 to physics-enabled scientific ML.

Submitted 11 April, 2022; originally announced April 2022.

arXiv:2203.10181 [pdf]

Active learning in open experimental environments: selecting the right information channel(s) based on predictability in deep kernel learning

Authors: Maxim Ziatdinov, Yongtao Liu, Sergei V. Kalinin

Abstract: Active learning methods are rapidly becoming the integral component of automated experiment workflows in imaging, materials synthesis, and computation. The distinctive aspect of many experimental scenarios is the presence of multiple information channels, including both the intrinsic modalities of the measurement system and the exogenous environment and noise signals. One of the key tasks in experimental studies is hence establishing which of these channels is predictive of the behaviors of interest. Here we explore the problem of discovery of the optimal predictive channel for structure-property relationships (in microscopy) using deep kernel learning for modality selection in an active experiment setting. We further pose that this approach can be directly applicable to similar active learning tasks in automated synthesis and the discovery of quantitative structure-activity relations in molecular systems.

Submitted 18 March, 2022; originally announced March 2022.

arXiv:2203.03122 [pdf]

Physical discovery in representation learning via conditioning on prior knowledge: applications for ferroelectric domain dynamics

Authors: Yongtao Liu, Bryan D Huey, Maxim A. Ziatdinov, Sergei V. Kalinin

Abstract: Recent advances in electron, scanning probe, optical, and chemical imaging and spectroscopy yield bespoke data sets containing the information of structure and functionality of complex systems. In many cases, the resulting data sets are underpinned by low-dimensional simple representations encoding the factors of variability within the data. The representation learning methods seek to discover these factors of variability, ideally further connecting them with relevant physical mechanisms. However, generally the task of identifying the latent variables corresponding to actual physical mechanisms is extremely complex. Here, we explore an approach based on conditioning the data on the known (continuous) physical parameters, and systematically compare it with the previously introduced approach based on the invariant variational autoencoders. The conditional variational autoencoders (cVAE) approach does not rely on the existence of the invariant transforms, and hence allows for much greater flexibility and applicability. Interestingly, cVAE allows for limited extrapolation outside of the original domain of the conditional variable. However, this extrapolation is limited compared to the cases when true physical mechanisms are known, and the physical factor of variability can be disentangled in full. We further show that introducing the known conditioning results in the simplification of the latent distribution if the conditioning vector is correlated with the factor of variability in the data, thus allowing the relevant physical factors to be separated. We initially demonstrate this approach using 1D and 2D examples on a synthetic dataset and then extend it to the analysis of experimental data on ferroelectric domain dynamics visualized via Piezoresponse Force Microscopy.

Submitted 6 March, 2022; originally announced March 2022.

Comments: 20 pages, 8 figures

arXiv:2202.01089 [pdf]

Hypothesis-Driven Automated Experiment in Scanning Probe Microscopy: Exploring the Domain Growth Laws in Ferroelectric Materials

Authors: Yongtao Liu, Anna Morozovska, Eugene Eliseev, Kyle P. Kelley, Rama Vasudevan, Maxim Ziatdinov, Sergei V. Kalinin

Abstract: We report the development and implementation of a hypothesis learning based automated experiment, in which the microscope operating in the autonomous mode identifies the physical laws behind the material's response. Specifically, we explore the bias induced transformations that underpin the functionality of broad classes of devices and functional materials from batteries and memristors to ferroelectrics and antiferroelectrics. Optimization and design of these materials require probing the mechanisms of these transformations on the nanometer scale as a function of the broad range of control parameters such as applied potential and time, often leading to experimentally intractable scenarios. At the same time, often the behaviors of these systems are understood within potentially competing theoretical models, or hypotheses. Here, we develop a hypothesis list that covers the possible limiting scenarios for the domain growth, including thermodynamic, domain wall pinning, and screening limited. We further develop and experimentally implement the hypothesis driven automated experiment in Piezoresponse Force Microscopy, autonomously identifying the mechanisms of the bias induced domain switching. This approach can be applied for a broad range of physical and chemical experiments with relatively low dimensional control parameter space and for which the possible competing models of the system behavior that ideally cover the full range of physical eventualities are known or can be created. These include other scanning probe microscopy modalities such as force distance curve measurements and nanoindentation, as well as materials synthesis and optimization.

Submitted 2 February, 2022; originally announced February 2022.

Comments: 25 pages, 6 figures

arXiv:2202.00657 [pdf]

Discovering Invariant Spatial Features in Electron Energy Loss Spectroscopy Images on the Mesoscopic and Atomic Levels

Authors: Kevin M. Roccapriore, Maxim Ziatdinov, Andrew R. Lupini, Abhay P. Singh, Usha Philipose, Sergei V. Kalinin

Abstract: Over the last two decades, Electron Energy Loss Spectroscopy (EELS) imaging with a scanning transmission electron microscope (STEM) has emerged as a technique of choice for visualizing complex chemical, electronic, plasmonic, and phononic phenomena in complex materials and structures. The availability of the EELS data necessitates the development of methods to analyze multidimensional datasets with complex spatial and energy structures. Traditionally, the analysis of these data sets has been based on analysis of individual spectra, one at a time, whereas the spatial structure and correlations between individual spatial pixels containing the relevant information of the physics of underpinning processes have generally been ignored and analyzed only via the visualization as 2D maps. Here we develop a machine learning-based approach and workflows for the analysis of spatial structures in 3D EELS data sets using a combination of dimensionality reduction and multichannel rotationally-invariant variational autoencoders. This approach is illustrated for the analysis of both the plasmonic phenomena in a system of nanowires and in the core excitations in functional oxides using low loss and core loss EELS, respectively. The code developed in this manuscript is open sourced and freely available and provided as a Jupyter notebook for the interested reader here.

Submitted 1 February, 2022; originally announced February 2022.

Comments: Associated Jupyter Notebook: https://github.com/kevinroccapriore/Multichannel-rVAE

arXiv:2112.06649 [pdf]

doi 10.1002/adma.202201345

Hypothesis Learning in Automated Experiment: Application to Combinatorial Materials Libraries

Authors: Maxim Ziatdinov, Yongtao Liu, Anna N. Morozovska, Eugene A. Eliseev, Xiaohang Zhang, Ichiro Takeuchi, Sergei V. Kalinin

Abstract: Machine learning is rapidly becoming an integral part of experimental physical discovery via automated and high-throughput synthesis, and active experiments in scattering and electron/probe microscopy. This, in turn, necessitates the development of active learning methods capable of exploring relevant parameter spaces with the smallest number of steps. Here we introduce an active learning approach based on co-navigation of the hypothesis and experimental spaces. This is realized by combining the structured Gaussian Processes containing probabilistic models of the possible system's behaviors (hypotheses) with reinforcement learning policy refinement (discovery). This approach closely resembles classical human-driven physical discovery, when several alternative hypotheses realized via models with adjustable parameters are tested during an experiment. We demonstrate this approach for exploring concentration-induced phase transitions in combinatorial libraries of Sm-doped BiFeO3 using Piezoresponse Force Microscopy, but it is straightforward to extend it to higher-dimensional parameter spaces and more complex physical problems once the experimental workflow and hypothesis-generation are available.

Submitted 20 April, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

Comments: Fixed typo in Eq. 1. Expanded the introduction part. The code reproducing Algorithm 1 is available at https://github.com/ziatdinovmax/hypoAL

Journal ref: Adv. Mater. 2022, 2201345

arXiv:2112.04479 [pdf]

Automated experiment in 4D-STEM: exploring emergent physics and structural behaviors

Authors: Kevin M. Roccapriore, Ondrej Dyck, Mark P. Oxley, Maxim Ziatdinov, Sergei V. Kalinin

Abstract: Automated experiments in 4D Scanning Transmission Electron Microscopy are implemented for rapid discovery of local structures, symmetry-breaking distortions, and internal electric and magnetic fields in complex materials. Deep kernel learning enables active learning of the relationship between local structure and 4D-STEM based descriptors. With this, efficient and "intelligent" probing of dissimilar structural elements to discover desired physical functionality is made possible. This approach allows effective navigation of the sample in an automated fashion guided by either a pre-determined physical phenomenon, such as strongest electric field magnitude, or in an exploratory fashion. We verify the approach first on pre-acquired 4D-STEM data, and further implement it experimentally on an operational STEM. The experimental discovery workflow is demonstrated using graphene, and subsequently extended towards a lesser-known layered 2D van der Waals material, MnPS3. This approach establishes a paradigm for physics-driven automated 4D-STEM experiments that enable probing the physics of strongly correlated systems and quantum materials and devices, as well as exploration of beam sensitive materials.

Submitted 20 April, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

Comments: The data used for analysis as well as additional materials are available through the Jupyter notebook located at: https://github.com/kevinroccapriore/AE-DKL-4DSTEM

arXiv:2110.06888 [pdf]

Exploring causal physical mechanisms via non-gaussian linear models and deep kernel learning: applications for ferroelectric domain structures

Authors: Yongtao Liu, Maxim Ziatdinov, Sergei V. Kalinin

Abstract: Rapid emergence of multimodal imaging in scanning probe, electron, and optical microscopies has brought forth the challenge of understanding the information contained in these complex data sets, targeting both the intrinsic correlations between different channels and further exploring the underpinning causal physical mechanisms. Here, we develop such an analysis framework for Piezoresponse Force Microscopy. We argue that under certain conditions, we can bootstrap experimental observations with the prior knowledge of materials structure to get information on certain non-observed properties, and demonstrate linear causal analysis for PFM observables. We further demonstrate that this approach can be extended to complex descriptors using the deep kernel learning (DKL) model. In this DKL analysis, we use the prior information on domain structure within the image to predict the physical properties. This analysis demonstrates the correlative relationships between morphology, piezoresponse, elastic property, etc. at nanoscale. The prediction of morphology and other physical parameters illustrates a mutual interaction between surface condition and physical properties in ferroelectric materials. This analysis is universal and can be extended to explore the correlative relationships of other multi-channel datasets.

Submitted 13 October, 2021; originally announced October 2021.

Comments: 19 pages, 7 figures

arXiv:2109.07350 [pdf]

doi 10.1038/s41567-022-01666-0

Describing condensed matter from atomically resolved imaging data: from structure to generative and causal models

Authors: Sergei V. Kalinin, Ayana Ghosh, Rama Vasudevan, Maxim Ziatdinov

Abstract: The development of high-resolution imaging methods such as electron and scanning probe microscopy and atomic probe tomography has provided a wealth of information on structure and functionalities of solids. The availability of this data in turn necessitates development of approaches to derive quantitative physical information, much like the development of scattering methods in the early 20th century, which have given one of the most powerful tools in the condensed matter physics arsenal. Here, we argue that this transition requires adapting classical macroscopic definitions, which can in turn enable fundamentally new opportunities in understanding physics and chemistry. For example, many macroscopic definitions such as symmetry can be introduced locally only in a Bayesian sense, balancing the prior knowledge of materials' physics and experimental data to yield posterior probability distributions. At the same time, a wealth of local data allows fundamentally new approaches for the description of solids based on construction of statistical and physical generative models, akin to Ginzburg-Landau thermodynamic models. Finally, we note that availability of observational data opens pathways towards exploring causal mechanisms underpinning solid structure and functionality.

Submitted 15 September, 2021; originally announced September 2021.

arXiv:2109.04541 [pdf]

doi 10.1038/s41524-022-00733-7

Bridging microscopy with molecular dynamics and quantum simulations: An AtomAI based pipeline

Authors: Ayana Ghosh, Maxim Ziatdinov, Ondrej Dyck, Bobby Sumpter, Sergei V. Kalinin

Abstract: Recent advances in (scanning) transmission electron microscopy have enabled routine generation of large volumes of high-veracity structural data on 2D and 3D materials, naturally offering the challenge of using these as starting inputs for atomistic simulations. In this fashion, theory will address experimentally emerging structures, as opposed to the full range of theoretically possible atomic configurations. However, this challenge is highly non-trivial due to the extreme disparity between intrinsic time scales accessible to modern simulations and microscopy, as well as latencies of microscopy and simulations per se. Addressing this issue requires as a first step bridging the instrumental data flow and physics-based simulation environment, to enable the selection of regions of interest and exploring them using physical simulations. Here we report the development of the machine learning workflow that directly bridges the instrument data stream into Python-based molecular dynamics and density functional theory environments using pre-trained neural networks to convert imaging data to physical descriptors. The pathways to ensure the structural stability and compensate for the observational biases universally present in the data are identified in the workflow. This approach is used for a graphene system to reconstruct optimized geometry and simulate temperature-dependent dynamics including adsorption of Cr as an ad-atom and graphene healing effects. However, it is universal and can be used for other material systems.

Submitted 21 December, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

arXiv:2108.12889 [pdf]

doi 10.1063/5.0068903

Multi-objective Bayesian optimization of ferroelectric materials with interfacial control for memory and energy storage applications

Authors: Arpan Biswas, Anna N. Morozovska, Maxim Ziatdinov, Eugene A. Eliseev, Sergei V. Kalinin

Abstract: Optimization of materials performance for specific applications often requires balancing multiple aspects of materials functionality. Even for the cases where generative physical model of material behavior is known and reliable, this often requires search over multidimensional parameter space to identify low-dimensional manifold corresponding to required Pareto front. Here we introduce the multi-objective Bayesian Optimization (MOBO) workflow for the ferroelectric/anti-ferroelectric performance optimization for memory and energy storage applications based on the numerical solution of the Ginzburg-Landau equation with electrochemical or semiconducting boundary conditions. MOBO is a low computational cost optimization tool for expensive multi-objective functions, where we update posterior surrogate Gaussian process models from prior evaluations, and then select future evaluations from maximizing an acquisition function. Using the parameters for a prototype bulk antiferroelectric (PbZrO3), we first develop a physics-driven decision tree of target functions from the loop structures. We further develop a physics-driven MOBO architecture to explore multidimensional parameter space and build Pareto-frontiers by maximizing two target functions jointly: energy storage and loss. This approach allows for rapid initial materials and device parameter selection for a given application and can be further expanded towards the active experiment setting. The associated notebooks provide both the tutorial on MOBO and allow to reproduce the reported analyses and apply them to other systems (https://github.com/arpanbiswas52/MOBO_AFI_Supplements).

Submitted 29 August, 2021; originally announced August 2021.

Comments: 40 pages, including 12 figures in the main text and Appendixes with 4 figures

arXiv:2108.10280 [pdf]

Physics makes the difference: Bayesian optimization and active learning via augmented Gaussian process

Authors: Maxim Ziatdinov, Ayana Ghosh, Sergei V. Kalinin

Abstract: Both experimental and computational methods for the exploration of structure, functionality, and properties of materials often necessitate the search across broad parameter spaces to discover optimal experimental conditions and regions of interest in the image space or parameter space of computational models. The direct grid search of the parameter space tends to be extremely time-consuming, leading to the development of strategies balancing exploration of unknown parameter spaces and exploitation towards required performance metrics. However, classical Bayesian optimization strategies based on the Gaussian process (GP) do not readily allow for the incorporation of the known physical behaviors or past knowledge. Here we explore a hybrid optimization/exploration algorithm created by augmenting the standard GP with a structured probabilistic model of the expected system's behavior. This approach balances the flexibility of the non-parametric GP approach with a rigid structure of physical knowledge encoded into the parametric model. The fully Bayesian treatment of the latter allows additional control over the optimization via the selection of priors for the model parameters. The method is demonstrated for a noisy version of the classical objective function used to evaluate optimization algorithms and further extended to physical lattice models. This methodology is expected to be universally suitable for injecting prior knowledge in the form of physical models and past data in the Bayesian optimization framework.

Submitted 29 August, 2021; v1 submitted 23 August, 2021; originally announced August 2021.

Comments: Expanded the discussion and added additional info to the supplemental materials

Showing 1–50 of 118 results for author: Ziatdinov, M