Search | arXiv e-print repository

Towards Unlocking Insights from Logbooks Using AI

Authors: Antonin Sulc, Alex Bien, Annika Eichler, Daniel Ratner, Florian Rehm, Frank Mayet, Gregor Hartmann, Hayden Hoschouer, Henrik Tuennermann, Jan Kaiser, Jason St. John, Jennefer Maldonado, Kyle Hazelwood, Raimund Kammering, Thorsten Hellert, Tim Wilksen, Verena Kain, Wan-Lin Hu

Abstract: Electronic logbooks contain valuable information about activities and events concerning their associated particle accelerator facilities. However, the highly technical nature of logbook entries can hinder their usability and automation. As natural language processing (NLP) continues advancing, it offers opportunities to address various challenges that logbooks present. This work explores jointly t… ▽ More Electronic logbooks contain valuable information about activities and events concerning their associated particle accelerator facilities. However, the highly technical nature of logbook entries can hinder their usability and automation. As natural language processing (NLP) continues advancing, it offers opportunities to address various challenges that logbooks present. This work explores jointly testing a tailored Retrieval Augmented Generation (RAG) model for enhancing the usability of particle accelerator logbooks at institutes like DESY, BESSY, Fermilab, BNL, SLAC, LBNL, and CERN. The RAG model uses a corpus built on logbook contributions and aims to unlock insights from these logbooks by leveraging retrieval over facility datasets, including discussion about potential multimodal sources. Our goals are to increase the FAIR-ness (findability, accessibility, interoperability, and reusability) of logbooks by exploiting their information content to streamline everyday use, enable macro-analysis for root cause analysis, and facilitate problem-solving automation. △ Less

Submitted 25 May, 2024; originally announced June 2024.

Comments: 5 pages, 1 figure, 15th International Particle Accelerator Conference

arXiv:2307.05253 [pdf, other]

doi 10.1088/2058-9565/ad0389

Precise Image Generation on Current Noisy Quantum Computing Devices

Authors: Florian Rehm, Sofia Vallecorsa, Kerstin Borras, Dirk Krücker, Michele Grossi, Valle Varo

Abstract: The Quantum Angle Generator (QAG) is a new full Quantum Machine Learning model designed to generate accurate images on current Noise Intermediate Scale (NISQ) Quantum devices. Variational quantum circuits form the core of the QAG model, and various circuit architectures are evaluated. In combination with the so-called MERA-upsampling architecture, the QAG model achieves excellent results, which ar… ▽ More The Quantum Angle Generator (QAG) is a new full Quantum Machine Learning model designed to generate accurate images on current Noise Intermediate Scale (NISQ) Quantum devices. Variational quantum circuits form the core of the QAG model, and various circuit architectures are evaluated. In combination with the so-called MERA-upsampling architecture, the QAG model achieves excellent results, which are analyzed and evaluated in detail. To our knowledge, this is the first time that a quantum model has achieved such accurate results. To explore the robustness of the model to noise, an extensive quantum noise study is performed. In this paper, it is demonstrated that the model trained on a physical quantum device learns the noise characteristics of the hardware and generates outstanding results. It is verified that even a quantum hardware machine calibration change during training of up to 8% can be well tolerated. For demonstration, the model is employed in indispensable simulations in high energy physics required to measure particle energies and, ultimately, to discover unknown particles at the Large Hadron Collider at CERN. △ Less

Submitted 23 October, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

Journal ref: IOP Quantum Science and Technology (October 2023)

arXiv:2305.07284 [pdf, other]

A Full Quantum Generative Adversarial Network Model for High Energy Physics Simulations

Authors: Florian Rehm, Sofia Vallecorsa, Michele Grossi, Kerstin Borras, Dirk Krücker

Abstract: The prospect of quantum computing with a potential exponential speed-up compared to classical computing identifies it as a promising method in the search for alternative future High Energy Physics (HEP) simulation approaches. HEP simulations, such as employed at the Large Hadron Collider at CERN, are extraordinarily complex and require an immense amount of computing resources in hardware and time.… ▽ More The prospect of quantum computing with a potential exponential speed-up compared to classical computing identifies it as a promising method in the search for alternative future High Energy Physics (HEP) simulation approaches. HEP simulations, such as employed at the Large Hadron Collider at CERN, are extraordinarily complex and require an immense amount of computing resources in hardware and time. For some HEP simulations, classical machine learning models have already been successfully developed and tested, resulting in several orders of magnitude speed-up. In this research, we proceed to the next step and explore whether quantum computing can provide sufficient accuracy, and further improvements, suggesting it as an exciting direction of future investigations. With a small prototype model, we demonstrate a full quantum Generative Adversarial Network (GAN) model for generating downsized eight-pixel calorimeter shower images. The advantage over previous quantum models is that the model generates real individual images containing pixel energy values instead of simple probability distributions averaged over a test sample. To complete the picture, the results of the full quantum GAN model are compared to hybrid quantum-classical models using a classical discriminator neural network. △ Less

Submitted 29 April, 2024; v1 submitted 12 May, 2023; originally announced May 2023.

Comments: Submitted for proceedings to the ACAT 2022 Conference. Paper accepted and revised version uploaded. Proceeding publication process still ongoing

arXiv:2203.01007 [pdf, other]

doi 10.1088/1742-6596/2438/1/012093

Impact of quantum noise on the training of quantum Generative Adversarial Networks

Authors: Kerstin Borras, Su Yeon Chang, Lena Funcke, Michele Grossi, Tobias Hartung, Karl Jansen, Dirk Kruecker, Stefan Kühn, Florian Rehm, Cenk Tüysüz, Sofia Vallecorsa

Abstract: Current noisy intermediate-scale quantum devices suffer from various sources of intrinsic quantum noise. Overcoming the effects of noise is a major challenge, for which different error mitigation and error correction techniques have been proposed. In this paper, we conduct a first study of the performance of quantum Generative Adversarial Networks (qGANs) in the presence of different types of quan… ▽ More Current noisy intermediate-scale quantum devices suffer from various sources of intrinsic quantum noise. Overcoming the effects of noise is a major challenge, for which different error mitigation and error correction techniques have been proposed. In this paper, we conduct a first study of the performance of quantum Generative Adversarial Networks (qGANs) in the presence of different types of quantum noise, focusing on a simplified use case in high-energy physics. In particular, we explore the effects of readout and two-qubit gate errors on the qGAN training process. Simulating a noisy quantum device classically with IBM's Qiskit framework, we examine the threshold of error rates up to which a reliable training is possible. In addition, we investigate the importance of various hyperparameters for the training process in the presence of different error rates, and we explore the impact of readout error mitigation on the results. △ Less

Submitted 2 March, 2022; originally announced March 2022.

Comments: 6 pages, 5 figures, Proceedings of the 20th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2021)

Report number: MIT-CTP/5400

arXiv:2105.08960 [pdf, other]

doi 10.1051/epjconf/202125103042

Physics Validation of Novel Convolutional 2D Architectures for Speeding Up High Energy Physics Simulations

Authors: Florian Rehm, Sofia Vallecorsa, Kerstin Borras, Dirk Krücker

Abstract: The precise simulation of particle transport through detectors remains a key element for the successful interpretation of high energy physics results. However, Monte Carlo based simulation is extremely demanding in terms of computing resources. This challenge motivates investigations of faster, alternative approaches for replacing the standard Monte Carlo approach. We apply Generative Adversaria… ▽ More The precise simulation of particle transport through detectors remains a key element for the successful interpretation of high energy physics results. However, Monte Carlo based simulation is extremely demanding in terms of computing resources. This challenge motivates investigations of faster, alternative approaches for replacing the standard Monte Carlo approach. We apply Generative Adversarial Networks (GANs), a deep learning technique, to replace the calorimeter detector simulations and speeding up the simulation time by orders of magnitude. We follow a previous approach which used three-dimensional convolutional neural networks and develop new two-dimensional convolutional networks to solve the same 3D image generation problem faster. Additionally, we increased the number of parameters and the neural networks representational power, obtaining a higher accuracy. We compare our best convolutional 2D neural network architecture and evaluate it versus the previous 3D architecture and Geant4 data. Our results demonstrate a high physics accuracy and further consolidate the use of GANs for fast detector simulations. △ Less

Submitted 19 May, 2021; originally announced May 2021.

Comments: Paper published at vCHEP2021 conference

arXiv:2104.03780 [pdf, other]

Enabling Cross-Domain Communication: How to Bridge the Gap between AI and HW Engineers

Authors: Michael J. Klaiber, Axel J. Acosta, Ingo Feldner, Falk Rehm

Abstract: A key issue in system design is the lack of communication between hardware, software and domain expert. Recent research work shows progress in automatic HW/SW co-design flows of neural accelerators that seems to make this kind of communication obsolete. Most real-world systems, however, are a composition of multiple processing units, communication networks and memories. A HW/SW co-design process o… ▽ More A key issue in system design is the lack of communication between hardware, software and domain expert. Recent research work shows progress in automatic HW/SW co-design flows of neural accelerators that seems to make this kind of communication obsolete. Most real-world systems, however, are a composition of multiple processing units, communication networks and memories. A HW/SW co-design process of (reconfigurable) neural accelerators, therefore, is an important sub-problem towards a common co-design methodology. The ultimate challenge is to define the constraints for the design space exploration on system level - a task which requires deep knowledge and understanding of hardware architectures, mapping of workloads onto hardware and the application domain, e.g. artificial intelligence. For most projects, these skills are distributed among several people or even different teams which is one of the major reasons why there is no established end-to-end development methodology for digital systems. This position paper discusses possibilities how to establish such a methodology for systems that include (reconfigurable) dedicated accelerators and outlines the central role that languages and tools play in the process. △ Less

Submitted 8 April, 2021; originally announced April 2021.

Comments: LATTE 2021 Workshop on Languages, Tools, and Techniques for Accelerator Design

arXiv:2103.13698 [pdf, other]

Validation of Deep Convolutional Generative Adversarial Networks for High Energy Physics Calorimeter Simulations

Authors: Florian Rehm, Sofia Vallecorsa, Kerstin Borras, Dirk Krücker

Abstract: In particle physics the simulation of particle transport through detectors requires an enormous amount of computational resources, utilizing more than 50% of the resources of the CERN Worldwide Large Hadron Collider Grid. This challenge has motivated the investigation of different, faster approaches for replacing the standard Monte Carlo simulations. Deep Learning Generative Adversarial Networks a… ▽ More In particle physics the simulation of particle transport through detectors requires an enormous amount of computational resources, utilizing more than 50% of the resources of the CERN Worldwide Large Hadron Collider Grid. This challenge has motivated the investigation of different, faster approaches for replacing the standard Monte Carlo simulations. Deep Learning Generative Adversarial Networks are among the most promising alternatives. Previous studies showed that they achieve the necessary level of accuracy while decreasing the simulation time by orders of magnitudes. In this paper we present a newly developed neural network architecture which reproduces a three-dimensional problem employing 2D convolutional layers and we compare its performance with an earlier architecture consisting of 3D convolutional layers. The performance evaluation relies on direct comparison to Monte Carlo simulations, in terms of different physics quantities usually employed to quantify the detector response. We prove that our new neural network architecture reaches a higher level of accuracy with respect to the 3D convolutional GAN while reducing the necessary computational resources. Calorimeters are among the most expensive detectors in terms of simulation time. Therefore we focus our study on an electromagnetic calorimeter prototype with a regular highly granular geometry, as an example of future calorimeters. △ Less

Submitted 25 March, 2021; originally announced March 2021.

Comments: AAAI-MLPS 2021 Spring Symposium at Stanford University

arXiv:2103.10142 [pdf, other]

doi 10.5220/0010245002510258

Reduced Precision Strategies for Deep Learning: A High Energy Physics Generative Adversarial Network Use Case

Authors: Florian Rehm, Sofia Vallecorsa, Vikram Saletore, Hans Pabst, Adel Chaibi, Valeriu Codreanu, Kerstin Borras, Dirk Krücker

Abstract: Deep learning is finding its way into high energy physics by replacing traditional Monte Carlo simulations. However, deep learning still requires an excessive amount of computational resources. A promising approach to make deep learning more efficient is to quantize the parameters of the neural networks to reduced precision. Reduced precision computing is extensively used in modern deep learning a… ▽ More Deep learning is finding its way into high energy physics by replacing traditional Monte Carlo simulations. However, deep learning still requires an excessive amount of computational resources. A promising approach to make deep learning more efficient is to quantize the parameters of the neural networks to reduced precision. Reduced precision computing is extensively used in modern deep learning and results to lower execution inference time, smaller memory footprint and less memory bandwidth. In this paper we analyse the effects of low precision inference on a complex deep generative adversarial network model. The use case which we are addressing is calorimeter detector simulations of subatomic particle interactions in accelerator based high energy physics. We employ the novel Intel low precision optimization tool (iLoT) for quantization and compare the results to the quantized model from TensorFlow Lite. In the performance benchmark we gain a speed-up of 1.73x on Intel hardware for the quantized iLoT model compared to the initial, not quantized, model. With different physics-inspired self-developed metrics, we validate that the quantized iLoT model shows a lower loss of physical accuracy in comparison to the TensorFlow Lite model. △ Less

Submitted 18 March, 2021; originally announced March 2021.

Comments: Submitted at ICPRAM 2021; from CERN openlab - Intel collaboration

Journal ref: ICPRAM 2021

arXiv:1809.04071 [pdf, other]

doi 10.1103/PhysRevC.99.025503

First ultracold neutrons produced at TRIUMF

Authors: S. Ahmed, E. Altiere, T. Andalib, B. Bell, C. P. Bidinosti, E. Cudmore, M. Das, C. A. Davis, B. Franke, M. Gericke, P. Giampa, P. Gnyp, S. Hansen-Romu, K. Hatanaka, T. Hayamizu, B. Jamieson, D. Jones, S. Kawasaki, T. Kikawa, M. Kitaguchi, W. Klassen, A. Konaka, E. Korkmaz, F. Kuchler, M. Lang , et al. (28 additional authors not shown)

Abstract: We installed a source for ultracold neutrons at a new, dedicated spallation target at TRIUMF. The source was originally developed in Japan and uses a superfluid-helium converter cooled to 0.9$\,$K. During an extensive test campaign in November 2017, we extracted up to 325000 ultracold neutrons after a one-minute irradiation of the target, over three times more than previously achieved with this so… ▽ More We installed a source for ultracold neutrons at a new, dedicated spallation target at TRIUMF. The source was originally developed in Japan and uses a superfluid-helium converter cooled to 0.9$\,$K. During an extensive test campaign in November 2017, we extracted up to 325000 ultracold neutrons after a one-minute irradiation of the target, over three times more than previously achieved with this source. The corresponding ultracold-neutron density in the whole production and guide volume is 5.3$\,$cm$^{-3}$. The storage lifetime of ultracold neutrons in the source was initially 37$\,$s and dropped to 24$\,$s during the eighteen days of operation. During continuous irradiation of the spallation target, we were able to detect a sustained ultracold-neutron rate of up to 1500$\,$s$^{-1}$. Simulations of UCN production, UCN transport, temperature-dependent UCN yield, and temperature-dependent storage lifetime show excellent agreement with the experimental data and confirm that the ultracold-neutron-upscattering rate in superfluid helium is proportional to $T^7$. △ Less

Submitted 16 December, 2018; v1 submitted 10 September, 2018; originally announced September 2018.

Comments: 8 pages, 10 figures

Journal ref: Phys. Rev. C 99, 025503 (2019)

Showing 1–9 of 9 results for author: Rehm, F