Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–18 of 18 results for author: Herman, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04733  [pdf

    cs.NE q-bio.NC

    Unsupervised representation learning with Hebbian synaptic and structural plasticity in brain-like feedforward neural networks

    Authors: Naresh Ravichandran, Anders Lansner, Pawel Herman

    Abstract: Neural networks that can capture key principles underlying brain computation offer exciting new opportunities for developing artificial intelligence and brain-like computing algorithms. Such networks remain biologically plausible while leveraging localized forms of synaptic learning rules and modular network architecture found in the neocortex. Compared to backprop-driven deep learning approches,… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2406.03054  [pdf

    cs.NE q-bio.NC

    Spiking representation learning for associative memories

    Authors: Naresh Ravichandran, Anders Lansner, Pawel Herman

    Abstract: Networks of interconnected neurons communicating through spiking signals offer the bedrock of neural computations. Our brains spiking neural networks have the computational capacity to achieve complex pattern recognition and cognitive functions effortlessly. However, solving real-world problems with artificial spiking neural networks (SNNs) has proved to be difficult for a variety of reasons. Cruc… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  3. Self-Supervised Learning of Time Series Representation via Diffusion Process and Imputation-Interpolation-Forecasting Mask

    Authors: Zineb Senane, Lele Cao, Valentin Leonhard Buchner, Yusuke Tashiro, Lei You, Pawel Herman, Mats Nordahl, Ruibo Tu, Vilhelm von Ehrenheim

    Abstract: Time Series Representation Learning (TSRL) focuses on generating informative representations for various Time Series (TS) modeling tasks. Traditional Self-Supervised Learning (SSL) methods in TSRL fall into four main categories: reconstructive, adversarial, contrastive, and predictive, each with a common challenge of sensitivity to noise and intricate data nuances. Recently, diffusion-based method… ▽ More

    Submitted 17 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Published as a full paper by KDD 2024 Research Track (12 pages as main paper and 11 pages as appendix). Source code available at https://github.com/llcresearch/TSDE

    ACM Class: G.3; I.6.5; I.2.4

  4. arXiv:2401.00335  [pdf

    cs.NE

    Benchmarking Hebbian learning rules for associative memory

    Authors: Anders Lansner, Naresh B Ravichandran, Pawel Herman

    Abstract: Associative memory or content addressable memory is an important component function in computer science and information processing and is a key concept in cognitive and computational brain science. Many different neural network architectures and learning rules have been proposed to model associative memory of the brain while investigating key functions like pattern completion and rivalry, noise re… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: 24 pages, 9 figures

  5. arXiv:2309.16888  [pdf, other

    cs.LG cs.AI cs.CE q-fin.PM

    Beyond Gut Feel: Using Time Series Transformers to Find Investment Gems

    Authors: Lele Cao, Gustaf Halvardsson, Andrew McCornack, Vilhelm von Ehrenheim, Pawel Herman

    Abstract: This paper addresses the growing application of data-driven approaches within the Private Equity (PE) industry, particularly in sourcing investment targets (i.e., companies) for Venture Capital (VC) and Growth Capital (GC). We present a comprehensive review of the relevant approaches and propose a novel approach leveraging a Transformer-based Multivariate Time Series Classifier (TMTSC) for predict… ▽ More

    Submitted 14 June, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: Published by ICANN (33rd International Conference on Artificial Neural Networks) 2024 as full paper (15 pages and 7 figures)

    Report number: EQT-Motherbrain-Research-2023SIT MSC Class: 91B84 (Primary) 68T07 (Secondary) ACM Class: I.2.6; I.2.1; H.4.0

  6. arXiv:2305.03866  [pdf

    cs.NE cs.LG

    Spiking neural networks with Hebbian plasticity for unsupervised representation learning

    Authors: Naresh Ravichandran, Anders Lansner, Pawel Herman

    Abstract: We introduce a novel spiking neural network model for learning distributed internal representations from data in an unsupervised procedure. We achieved this by transforming the non-spiking feedforward Bayesian Confidence Propagation Neural Network (BCPNN) model, employing an online correlation-based Hebbian-Bayesian learning and rewiring mechanism, shown previously to perform representation learni… ▽ More

    Submitted 10 May, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

  7. arXiv:2304.06626  [pdf

    q-bio.NC cs.NE

    Hebbian fast plasticity and working memory

    Authors: Anders Lansner, Florian Fiebig, Pawel Herman

    Abstract: Theories and models of working memory (WM) were at least since the mid-1990s dominated by the persistent activity hypothesis. The past decade has seen rising concerns about the shortcomings of sustained activity as the mechanism for short-term maintenance of WM information in the light of accumulating experimental evidence for so-called activity-silent WM and the fundamental difficulty in explaini… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: 12 pages, 2 figures, 1 box, submitted

  8. arXiv:2303.02876  [pdf, other

    physics.comp-ph cond-mat.mtrl-sci cs.AI cs.LG

    Metaheuristic conditional neural network for harvesting skyrmionic metastable states

    Authors: Qichen Xu, I. P. Miranda, Manuel Pereiro, Filipp N. Rybakov, Danny Thonig, Erik Sjöqvist, Pavel Bessarab, Anders Bergman, Olle Eriksson, Pawel Herman, Anna Delin

    Abstract: We present a metaheuristic conditional neural-network-based method aimed at identifying physically interesting metastable states in a potential energy surface of high rugosity. To demonstrate how this method works, we identify and analyze spin textures with topological charge $Q$ ranging from 1 to $-13$ (where antiskyrmions have $Q<0$) in the Pd/Fe/Ir(111) system, which we model using a classical… ▽ More

    Submitted 29 May, 2023; v1 submitted 5 March, 2023; originally announced March 2023.

  9. arXiv:2301.00207  [pdf, other

    physics.comp-ph cs.NE

    Genetic-tunneling driven energy optimizer for spin systems

    Authors: Qichen Xu, Zhuanglin Shen, Manuel Pereiro, Pawel Herman, Olle Eriksson, Anna Delin

    Abstract: A long-standing and difficult problem in, e.g., condensed matter physics is how to find the ground state of a complex many-body system where the potential energy surface has a large number of local minima. Spin systems containing complex and/or topological textures, for example spin spirals or magnetic skyrmions, are prime examples of such systems. We propose here a genetic-tunneling-driven varian… ▽ More

    Submitted 27 February, 2023; v1 submitted 31 December, 2022; originally announced January 2023.

    Journal ref: Commun Phys 6, 239 (2023)

  10. arXiv:2206.15036  [pdf

    cs.NE

    Brain-like combination of feedforward and recurrent network components achieves prototype extraction and robust pattern recognition

    Authors: Naresh Balaji Ravichandran, Anders Lansner, Pawel Herman

    Abstract: Associative memory has been a prominent candidate for the computation performed by the massively recurrent neocortical networks. Attractor networks implementing associative memory have offered mechanistic explanation for many cognitive phenomena. However, attractor memory models are typically trained using orthogonal or random patterns to avoid interference between memories, which makes them unfea… ▽ More

    Submitted 3 September, 2022; v1 submitted 30 June, 2022; originally announced June 2022.

  11. arXiv:2203.14769  [pdf

    cs.CV physics.med-ph

    A Long Short-term Memory Based Recurrent Neural Network for Interventional MRI Reconstruction

    Authors: Ruiyang Zhao, Zhao He, Tao Wang, Suhao Qiu, Pawel Herman, Yanle Hu, Chencheng Zhang, Dinggang Shen, Bomin Sun, Guang-Zhong Yang, Yuan Feng

    Abstract: Interventional magnetic resonance imaging (i-MRI) for surgical guidance could help visualize the interventional process such as deep brain stimulation (DBS), improving the surgery performance and patient outcome. Different from retrospective reconstruction in conventional dynamic imaging, i-MRI for DBS has to acquire and reconstruct the interventional images sequentially online. Here we proposed a… ▽ More

    Submitted 12 April, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

  12. arXiv:2106.15546  [pdf

    cs.LG cs.NE

    Semi-supervised learning with Bayesian Confidence Propagation Neural Network

    Authors: Naresh Balaji Ravichandran, Anders Lansner, Pawel Herman

    Abstract: Learning internal representations from data using no or few labels is useful for machine learning research, as it allows using massive amounts of unlabeled data. In this work, we use the Bayesian Confidence Propagation Neural Network (BCPNN) model developed as a biologically plausible model of the cortex. Recent work has demonstrated that these networks can learn useful internal representations fr… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

  13. arXiv:2106.05373  [pdf, other

    cs.DC cs.LG cs.NE

    StreamBrain: An HPC Framework for Brain-like Neural Networks on CPUs, GPUs and FPGAs

    Authors: Artur Podobas, Martin Svedin, Steven W. D. Chien, Ivy B. Peng, Naresh Balaji Ravichandran, Pawel Herman, Anders Lansner, Stefano Markidis

    Abstract: The modern deep learning method based on backpropagation has surged in popularity and has been used in multiple domains and application areas. At the same time, there are other -- less-known -- machine learning algorithms with a mature and solid theoretical foundation whose performance remains unexplored. One such example is the brain-like Bayesian Confidence Propagation Neural Network (BCPNN). In… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted for publication at the International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies (HEART 2021)

  14. arXiv:2010.05348  [pdf, other

    physics.comp-ph cs.LG

    Automatic Particle Trajectory Classification in Plasma Simulations

    Authors: Stefano Markidis, Ivy Peng, Artur Podobas, Itthinat Jongsuebchoke, Gabriel Bengtsson, Pawel Herman

    Abstract: Numerical simulations of plasma flows are crucial for advancing our understanding of microscopic processes that drive the global plasma dynamics in fusion devices, space, and astrophysical systems. Identifying and classifying particle trajectories allows us to determine specific on-going acceleration mechanisms, shedding light on essential plasma processes. Our overall goal is to provide a gener… ▽ More

    Submitted 11 October, 2020; originally announced October 2020.

    Comments: Accepted for publication at AI4S: Workshop on Artificial Intelligence and Machine Learning for Scientific Applications

  15. arXiv:2005.03476  [pdf, other

    cs.NE cs.LG

    Brain-like approaches to unsupervised learning of hidden representations -- a comparative study

    Authors: Naresh Balaji Ravichandran, Anders Lansner, Pawel Herman

    Abstract: Unsupervised learning of hidden representations has been one of the most vibrant research directions in machine learning in recent years. In this work we study the brain-like Bayesian Confidence Propagating Neural Network (BCPNN) model, recently extended to extract sparse distributed high-dimensional representations. The usefulness and class-dependent separability of the hidden representations whe… ▽ More

    Submitted 16 April, 2021; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: arXiv admin note: text overlap with arXiv:2003.12415

  16. Learning representations in Bayesian Confidence Propagation neural networks

    Authors: Naresh Balaji Ravichandran, Anders Lansner, Pawel Herman

    Abstract: Unsupervised learning of hierarchical representations has been one of the most vibrant research directions in deep learning during recent years. In this work we study biologically inspired unsupervised strategies in neural networks based on local Hebbian learning. We propose new mechanisms to extend the Bayesian Confidence Propagating Neural Network (BCPNN) architecture, and demonstrate their capa… ▽ More

    Submitted 27 March, 2020; originally announced March 2020.

    Journal ref: 2020 International Joint Conference on Neural Networks (IJCNN)

  17. arXiv:1908.05715  [pdf, other

    physics.space-ph cs.LG eess.IV

    Automated classification of plasma regions using 3D particle energy distributions

    Authors: Vyacheslav Olshevsky, Yuri V. Khotyaintsev, Ahmad Lalti, Andrey Divin, Gian Luca Delzanno, Sven Anderzen, Pawel Herman, Steven W. D. Chien, Levon Avanov, Andrew P. Dimmock, Stefano Markidis

    Abstract: We investigate the properties of the ion sky maps produced by the Dual Ion Spectrometers (DIS) from the Fast Plasma Investigation (FPI). We have trained a convolutional neural network classifier to predict four regions crossed by the MMS on the dayside magnetosphere: solar wind, ion foreshock, magnetosheath, and magnetopause using solely DIS spectrograms. The accuracy of the classifier is >98%. We… ▽ More

    Submitted 21 September, 2021; v1 submitted 15 August, 2019; originally announced August 2019.

    Comments: Accepted to JGR: Space Physics

  18. Characterizing Deep-Learning I/O Workloads in TensorFlow

    Authors: Steven W. D. Chien, Stefano Markidis, Chaitanya Prasad Sishtla, Luis Santos, Pawel Herman, Sai Narasimhamurthy, Erwin Laure

    Abstract: The performance of Deep-Learning (DL) computing frameworks rely on the performance of data ingestion and checkpointing. In fact, during the training, a considerable high number of relatively small files are first loaded and pre-processed on CPUs and then moved to accelerator for computation. In addition, checkpointing and restart operations are carried out to allow DL computing frameworks to resta… ▽ More

    Submitted 6 October, 2018; originally announced October 2018.

    Comments: Accepted for publication at pdsw-DISCS 2018