-
Hebbian Learning from First Principles
Authors:
Linda Albanese,
Adriano Barra,
Pierluigi Bianco,
Fabrizio Durante,
Diego Pallara
Abstract:
Recently, the original storage prescription for the Hopfield model of neural networks -- as well as for its dense generalizations -- has been turned into a genuine Hebbian learning rule by postulating the expression of its Hamiltonian for both the supervised and unsupervised protocols. In these notes, first, we obtain these explicit expressions by relying upon maximum entropy extremization à la Ja…
▽ More
Recently, the original storage prescription for the Hopfield model of neural networks -- as well as for its dense generalizations -- has been turned into a genuine Hebbian learning rule by postulating the expression of its Hamiltonian for both the supervised and unsupervised protocols. In these notes, first, we obtain these explicit expressions by relying upon maximum entropy extremization à la Jaynes. Beyond providing a formal derivation of these recipes for Hebbian learning, this construction also highlights how Lagrangian constraints within entropy extremization force network's outcomes on neural correlations: these try to mimic the empirical counterparts hidden in the datasets provided to the network for its training and, the denser the network, the longer the correlations that it is able to capture. Next, we prove that, in the big data limit, whatever the presence of a teacher (or its lacking), not only these Hebbian learning rules converge to the original storage prescription of the Hopfield model but also their related free energies (and, thus, the statistical mechanical picture provided by Amit, Gutfreund and Sompolinsky is fully recovered). As a sideline, we show mathematical equivalence among standard Cost functions (Hamiltonian), preferred in Statistical Mechanical jargon, and quadratic Loss Functions, preferred in Machine Learning terminology. Remarks on the exponential Hopfield model (as the limit of dense networks with diverging density) and semi-supervised protocols are also provided.
△ Less
Submitted 13 January, 2024;
originally announced January 2024.
-
Unsupervised and Supervised learning by Dense Associative Memory under replica symmetry breaking
Authors:
Linda Albanese,
Andrea Alessandrelli,
Alessia Annibale,
Adriano Barra
Abstract:
Statistical mechanics of spin glasses is one of the main strands toward a comprehension of information processing by neural networks and learning machines. Tackling this approach, at the fairly standard replica symmetric level of description, recently Hebbian attractor networks with multi-node interactions (often called Dense Associative Memories) have been shown to outperform their classical pair…
▽ More
Statistical mechanics of spin glasses is one of the main strands toward a comprehension of information processing by neural networks and learning machines. Tackling this approach, at the fairly standard replica symmetric level of description, recently Hebbian attractor networks with multi-node interactions (often called Dense Associative Memories) have been shown to outperform their classical pairwise counterparts in a number of tasks, from their robustness against adversarial attacks and their capability to work with prohibitively weak signals to their supra-linear storage capacities. Focusing on mathematical techniques more than computational aspects, in this paper we relax the replica symmetric assumption and we derive the one-step broken-replica-symmetry picture of supervised and unsupervised learning protocols for these Dense Associative Memories: a phase diagram in the space of the control parameters is achieved, independently, both via the Parisi's hierarchy within then replica trick as well as via the Guerra's telescope within the broken-replica interpolation. Further, an explicit analytical investigation is provided to deepen both the big-data and ground state limits of these networks as well as a proof that replica symmetry breaking does not alter the thresholds for learning and slightly increases the maximal storage capacity. Finally the De Almeida and Thouless line, depicting the onset of instability of a replica symmetric description, is also analytically derived highlighting how, crossed this boundary, the broken replica description should be preferred.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
Inverse modeling of time-delayed interactions via the dynamic-entropy formalism
Authors:
Elena Agliari,
Francesco Alemanno,
Adriano Barra,
Michele Castellana,
Daniele Lotito,
Matthieu Piel
Abstract:
Although instantaneous interactions are unphysical, a large variety of maximum entropy statistical inference methods match the model-inferred and the empirically-measured equal-time correlation functions. Focusing on collective motion of active units, this constraint is reasonable when the interaction timescale is much faster than that of the interacting units, as in starling flocks, yet it fails…
▽ More
Although instantaneous interactions are unphysical, a large variety of maximum entropy statistical inference methods match the model-inferred and the empirically-measured equal-time correlation functions. Focusing on collective motion of active units, this constraint is reasonable when the interaction timescale is much faster than that of the interacting units, as in starling flocks, yet it fails in a number of counter examples, as in leukocyte coordination (where signalling proteins diffuse among two cells). Here, we relax this assumption and develop a path integral approach to maximum-entropy framework, which includes delay in signalling. Our method is able to infer the strength of couplings and fields, but also the time required by the couplings to completely transfer information among the units. We demonstrate the validity of our approach providing excellent results on synthetic datasets of non-Markovian trajectories generated by the Heisenberg-Kuramoto and Vicsek models equipped with delayed interactions. As a proof of concept, we also apply the method to experiments on dendritic migration, where matching equal-time correlations results in a significant information loss.
△ Less
Submitted 10 July, 2024; v1 submitted 3 September, 2023;
originally announced September 2023.
-
Parallel Learning by Multitasking Neural Networks
Authors:
Elena Agliari,
Andrea Alessandrelli,
Adriano Barra,
Federico Ricci-Tersenghi
Abstract:
A modern challenge of Artificial Intelligence is learning multiple patterns at once (i.e.parallel learning). While this can not be accomplished by standard Hebbian associative neural networks, in this paper we show how the Multitasking Hebbian Network (a variation on theme of the Hopfield model working on sparse data-sets) is naturally able to perform this complex task. We focus on systems process…
▽ More
A modern challenge of Artificial Intelligence is learning multiple patterns at once (i.e.parallel learning). While this can not be accomplished by standard Hebbian associative neural networks, in this paper we show how the Multitasking Hebbian Network (a variation on theme of the Hopfield model working on sparse data-sets) is naturally able to perform this complex task. We focus on systems processing in parallel a finite (up to logarithmic growth in the size of the network) amount of patterns, mirroring the low-storage level of standard associative neural networks at work with pattern recognition. For mild dilution in the patterns, the network handles them hierarchically, distributing the amplitudes of their signals as power-laws w.r.t. their information content (hierarchical regime), while, for strong dilution, all the signals pertaining to all the patterns are raised with the same strength (parallel regime). Further, confined to the low-storage setting (i.e., far from the spin glass limit), the presence of a teacher neither alters the multitasking performances nor changes the thresholds for learning: the latter are the same whatever the training protocol is supervised or unsupervised. Results obtained through statistical mechanics, signal-to-noise technique and Monte Carlo simulations are overall in perfect agreement and carry interesting insights on multiple learning at once: for instance, whenever the cost-function of the model is minimized in parallel on several patterns (in its description via Statistical Mechanics), the same happens to the standard sum-squared error Loss function (typically used in Machine Learning).
△ Less
Submitted 8 August, 2023;
originally announced August 2023.
-
Statistical Mechanics of Learning via Reverberation in Bidirectional Associative Memories
Authors:
Martino Salomone Centonze,
Ido Kanter,
Adriano Barra
Abstract:
We study bi-directional associative neural networks that, exposed to noisy examples of an extensive number of random archetypes, learn the latter (with or without the presence of a teacher) when the supplied information is enough: in this setting, learning is heteroassociative -- involving couples of patterns -- and it is achieved by reverberating the information depicted from the examples through…
▽ More
We study bi-directional associative neural networks that, exposed to noisy examples of an extensive number of random archetypes, learn the latter (with or without the presence of a teacher) when the supplied information is enough: in this setting, learning is heteroassociative -- involving couples of patterns -- and it is achieved by reverberating the information depicted from the examples through the layers of the network. By adapting Guerra's interpolation technique, we provide a full statistical mechanical picture of supervised and unsupervised learning processes (at the replica symmetric level of description) obtaining analytically phase diagrams, thresholds for learning, a picture of the ground-state in plain agreement with Monte Carlo simulations and signal-to-noise outcomes. In the large dataset limit, the Kosko storage prescription as well as its statistical mechanical picture provided by Kurchan, Peliti, and Saber in the eighties is fully recovered. Computational advantages in dealing with information reverberation, rather than storage, are discussed for natural test cases. In particular, we show how this network admits an integral representation in terms of two coupled restricted Boltzmann machines, whose hidden layers are entirely built of by grand-mother neurons, to prove that by coupling solely these grand-mother neurons we can correlate the patterns they are related to: it is thus possible to recover Pavlov's Classical Conditioning by adding just one synapse among the correct grand-mother neurons (hence saving an extensive number of these links for further information storage w.r.t. the classical autoassociative setting).
△ Less
Submitted 17 July, 2023;
originally announced July 2023.
-
Ultrametric identities in glassy models of Natural Evolution
Authors:
Elena Agliari,
Francesco Alemanno,
Miriam Aquaro,
Adriano Barra
Abstract:
Spin-glasses constitute a well-grounded framework for evolutionary models. Of particular interest for (some of) these models is the lack of self-averaging of their order parameters (e.g. the Hamming distance between the genomes of two individuals), even in asymptotic limits, much as like the behavior of the overlap between the configurations of two replica in mean-field spin-glasses. In the latter…
▽ More
Spin-glasses constitute a well-grounded framework for evolutionary models. Of particular interest for (some of) these models is the lack of self-averaging of their order parameters (e.g. the Hamming distance between the genomes of two individuals), even in asymptotic limits, much as like the behavior of the overlap between the configurations of two replica in mean-field spin-glasses. In the latter, this lack of self-averaging is related to peculiar fluctuations of the overlap, known as Ghirlanda-Guerra identities and Aizenman-Contucci polynomials, that cover a pivotal role in describing the ultrametric structure of the spin-glass landscape. As for evolutionary models, such identities may therefore be related to a taxonomic classification of individuals, yet a full investigation on their validity is missing. In this paper, we study ultrametric identities in simple cases where solely random mutations take place, while selective pressure is absent, namely in {\em flat landscape} models. In particular, we study three paradigmatic models in this setting: the {\em one parent model} (which, by construction, is ultrametric at the level of single individuals), the {\em homogeneous population model} (which is replica symmetric), and the {\em species formation model} (where a broken-replica scenario emerges at the level of species). We find analytical and numerical evidence that in the first and in the third model nor the Ghirlanda-Guerra neither the Aizenman-Contucci constraints hold, rather a new class of ultrametric identities is satisfied; in the second model all these constraints hold trivially. Very preliminary results on a real biological human genome derived by {\em The 1000 Genome Project Consortium} and on two artificial human genomes (generated by two different types neural networks) seem in better agreement with these new identities rather than the classic ones.
△ Less
Submitted 23 June, 2023;
originally announced June 2023.
-
About the de Almeida-Thouless line in neural networks
Authors:
Linda Albanese,
Andrea Alessandrelli,
Adriano Barra,
Alessia Annibale
Abstract:
In this work we present a rigorous and straightforward method to detect the onset of the instability of replica-symmetric theories in information processing systems, which does not require a full replica analysis as in the method originally proposed by de Almeida and Thouless for spin glasses. The method is based on an expansion of the free-energy obtained within one-step of replica symmetry break…
▽ More
In this work we present a rigorous and straightforward method to detect the onset of the instability of replica-symmetric theories in information processing systems, which does not require a full replica analysis as in the method originally proposed by de Almeida and Thouless for spin glasses. The method is based on an expansion of the free-energy obtained within one-step of replica symmetry breaking (RSB) around the RS value. As such, it requires solely continuity and differentiability of the free-energy and it is robust to be applied broadly to systems with quenched disorder. We apply the method to the Hopfield model and to neural networks with multi-node Hebbian interactions, as case studies. In the appendices we test the method on the Sherrington-Kirkpatrick and the Ising P-spin models, recovering the AT lines known in the literature for these models, as a special limit, which corresponds to assuming that the transition from the RS to the RSB phase can be obtained by varying continuously the order parameters. Our method provides a generalization of the AT approach, which does not rely on this limit and can be applied to systems with discontinuous phase transitions, as we show explicitly for the spherical P-spin model, recovering the known RS instability line.
△ Less
Submitted 12 November, 2023; v1 submitted 11 March, 2023;
originally announced March 2023.
-
Dense Hebbian neural networks: a replica symmetric picture of supervised learning
Authors:
Elena Agliari,
Linda Albanese,
Francesco Alemanno,
Andrea Alessandrelli,
Adriano Barra,
Fosca Giannotti,
Daniele Lotito,
Dino Pedreschi
Abstract:
We consider dense, associative neural-networks trained by a teacher (i.e., with supervision) and we investigate their computational capabilities analytically, via statistical-mechanics of spin glasses, and numerically, via Monte Carlo simulations. In particular, we obtain a phase diagram summarizing their performance as a function of the control parameters such as quality and quantity of the train…
▽ More
We consider dense, associative neural-networks trained by a teacher (i.e., with supervision) and we investigate their computational capabilities analytically, via statistical-mechanics of spin glasses, and numerically, via Monte Carlo simulations. In particular, we obtain a phase diagram summarizing their performance as a function of the control parameters such as quality and quantity of the training dataset, network storage and noise, that is valid in the limit of large network size and structureless datasets: these networks may work in a ultra-storage regime (where they can handle a huge amount of patterns, if compared with shallow neural networks) or in a ultra-detection regime (where they can perform pattern recognition at prohibitive signal-to-noise ratios, if compared with shallow neural networks). Guided by the random theory as a reference framework, we also test numerically learning, storing and retrieval capabilities shown by these networks on structured datasets as MNist and Fashion MNist. As technical remarks, from the analytic side, we implement large deviations and stability analysis within Guerra's interpolation to tackle the not-Gaussian distributions involved in the post-synaptic potentials while, from the computational counterpart, we insert Plefka approximation in the Monte Carlo scheme, to speed up the evaluation of the synaptic tensors, overall obtaining a novel and broad approach to investigate supervised learning in neural networks, beyond the shallow limit, in general.
△ Less
Submitted 2 July, 2023; v1 submitted 25 November, 2022;
originally announced December 2022.
-
Microscopic parameters of the van der Waals CrSBr antiferromagnet from microwave absorption experiments
Authors:
C. W. Cho,
A. Pawbake,
N. Aubergier,
A. L. Barra,
K. Mosina,
Z. Sofer,
M. E. Zhitomirsky,
C. Faugeras,
B. A. Piot
Abstract:
Microwave absorption experiments employing a phase-sensitive external resistive detection are performed for a topical van der Waals antiferromagnet CrSBr. The field dependence of two resonance modes is measured in an applied field parallel to the three principal crystallographic directions, revealing anisotropies and magnetic transitions in this material. To account for the observed results, we fo…
▽ More
Microwave absorption experiments employing a phase-sensitive external resistive detection are performed for a topical van der Waals antiferromagnet CrSBr. The field dependence of two resonance modes is measured in an applied field parallel to the three principal crystallographic directions, revealing anisotropies and magnetic transitions in this material. To account for the observed results, we formulate a microscopic spin model with a bi-axial single-ion anisotropy and inter-plane exchange. Theoretical calculations give an excellent description of full magnon spectra enabling us to precisely determine microscopic interaction parameters for CrSBr.
△ Less
Submitted 25 November, 2022;
originally announced November 2022.
-
Dense Hebbian neural networks: a replica symmetric picture of unsupervised learning
Authors:
Elena Agliari,
Linda Albanese,
Francesco Alemanno,
Andrea Alessandrelli,
Adriano Barra,
Fosca Giannotti,
Daniele Lotito,
Dino Pedreschi
Abstract:
We consider dense, associative neural-networks trained with no supervision and we investigate their computational capabilities analytically, via a statistical-mechanics approach, and numerically, via Monte Carlo simulations. In particular, we obtain a phase diagram summarizing their performance as a function of the control parameters such as the quality and quantity of the training dataset and the…
▽ More
We consider dense, associative neural-networks trained with no supervision and we investigate their computational capabilities analytically, via a statistical-mechanics approach, and numerically, via Monte Carlo simulations. In particular, we obtain a phase diagram summarizing their performance as a function of the control parameters such as the quality and quantity of the training dataset and the network storage, valid in the limit of large network size and structureless datasets. Moreover, we establish a bridge between macroscopic observables standardly used in statistical mechanics and loss functions typically used in the machine learning. As technical remarks, from the analytic side, we implement large deviations and stability analysis within Guerra's interpolation to tackle the not-Gaussian distributions involved in the post-synaptic potentials while, from the computational counterpart, we insert Plefka approximation in the Monte Carlo scheme, to speed up the evaluation of the synaptic tensors, overall obtaining a novel and broad approach to investigate neural networks in general.
△ Less
Submitted 2 July, 2023; v1 submitted 25 November, 2022;
originally announced November 2022.
-
Thermodynamics of bidirectional associative memories
Authors:
Adriano Barra,
Giovanni Catania,
Aurélien Decelle,
Beatriz Seoane
Abstract:
In this paper we investigate the equilibrium properties of bidirectional associative memories (BAMs). Introduced by Kosko in 1988 as a generalization of the Hopfield model to a bipartite structure, the simplest architecture is defined by two layers of neurons, with synaptic connections only between units of different layers: even without internal connections within each layer, information storage…
▽ More
In this paper we investigate the equilibrium properties of bidirectional associative memories (BAMs). Introduced by Kosko in 1988 as a generalization of the Hopfield model to a bipartite structure, the simplest architecture is defined by two layers of neurons, with synaptic connections only between units of different layers: even without internal connections within each layer, information storage and retrieval are still possible through the reverberation of neural activities passing from one layer to another. We characterize the computational capabilities of a stochastic extension of this model in the thermodynamic limit, by applying rigorous techniques from statistical physics. A detailed picture of the phase diagram at the replica symmetric level is provided, both at finite temperature and in the noiseless regimes. Also for the latter, the critical load is further investigated up to one step of replica symmetry breaking. An analytical and numerical inspection of the transition curves (namely critical lines splitting the various modes of operation of the machine) is carried out as the control parameters - noise, load and asymmetry between the two layer sizes - are tuned. In particular, with a finite asymmetry between the two layers, it is shown how the BAM can store information more efficiently than the Hopfield model by requiring less parameters to encode a fixed number of patterns. Comparisons are made with numerical simulations of neural dynamics. Finally, a low-load analysis is carried out to explain the retrieval mechanism in the BAM by analogy with two interacting Hopfield models. A potential equivalence with two coupled Restricted Boltmzann Machines is also discussed.
△ Less
Submitted 27 March, 2023; v1 submitted 17 November, 2022;
originally announced November 2022.
-
Pavlov Learning Machines
Authors:
Elena Agliari,
Miriam Aquaro,
Adriano Barra,
Alberto Fachechi,
Chiara Marullo
Abstract:
As well known, Hebb's learning traces its origin in Pavlov's Classical Conditioning, however, while the former has been extensively modelled in the past decades (e.g., by Hopfield model and countless variations on theme), as for the latter modelling has remained largely unaddressed so far; further, a bridge between these two pillars is totally lacking. The main difficulty towards this goal lays in…
▽ More
As well known, Hebb's learning traces its origin in Pavlov's Classical Conditioning, however, while the former has been extensively modelled in the past decades (e.g., by Hopfield model and countless variations on theme), as for the latter modelling has remained largely unaddressed so far; further, a bridge between these two pillars is totally lacking. The main difficulty towards this goal lays in the intrinsically different scales of the information involved: Pavlov's theory is about correlations among \emph{concepts} that are (dynamically) stored in the synaptic matrix as exemplified by the celebrated experiment starring a dog and a ring bell; conversely, Hebb's theory is about correlations among pairs of adjacent neurons as summarized by the famous statement {\em neurons that fire together wire together}. In this paper we rely on stochastic-process theory and model neural and synaptic dynamics via Langevin equations, to prove that -- as long as we keep neurons' and synapses' timescales largely split -- Pavlov mechanism spontaneously takes place and ultimately gives rise to synaptic weights that recover the Hebbian kernel.
△ Less
Submitted 2 July, 2022;
originally announced July 2022.
-
Recurrent neural networks that generalize from examples and optimize by dreaming
Authors:
Miriam Aquaro,
Francesco Alemanno,
Ido Kanter,
Fabrizio Durante,
Elena Agliari,
Adriano Barra
Abstract:
The gap between the huge volumes of data needed to train artificial neural networks and the relatively small amount of data needed by their biological counterparts is a central puzzle in machine learning. Here, inspired by biological information-processing, we introduce a generalized Hopfield network where pairwise couplings between neurons are built according to Hebb's prescription for on-line le…
▽ More
The gap between the huge volumes of data needed to train artificial neural networks and the relatively small amount of data needed by their biological counterparts is a central puzzle in machine learning. Here, inspired by biological information-processing, we introduce a generalized Hopfield network where pairwise couplings between neurons are built according to Hebb's prescription for on-line learning and allow also for (suitably stylized) off-line sleeping mechanisms. Moreover, in order to retain a learning framework, here the patterns are not assumed to be available, instead, we let the network experience solely a dataset made of a sample of noisy examples for each pattern. We analyze the model by statistical-mechanics tools and we obtain a quantitative picture of its capabilities as functions of its control parameters: the resulting network is an associative memory for pattern recognition that learns from examples on-line, generalizes and optimizes its storage capacity by off-line sleeping. Remarkably, the sleeping mechanisms always significantly reduce (up to $\approx 90\%$) the dataset size required to correctly generalize, further, there are memory loads that are prohibitive to Hebbian networks without sleeping (no matter the size and quality of the provided examples), but that are easily handled by the present "rested" neural networks.
△ Less
Submitted 17 April, 2022;
originally announced April 2022.
-
Supervised Hebbian Learning
Authors:
Francesco Alemanno,
Miriam Aquaro,
Ido Kanter,
Adriano Barra,
Elena Agliari
Abstract:
In neural network's Literature, Hebbian learning traditionally refers to the procedure by which the Hopfield model and its generalizations store archetypes (i.e., definite patterns that are experienced just once to form the synaptic matrix). However, the term "Learning" in Machine Learning refers to the ability of the machine to extract features from the supplied dataset (e.g., made of blurred exa…
▽ More
In neural network's Literature, Hebbian learning traditionally refers to the procedure by which the Hopfield model and its generalizations store archetypes (i.e., definite patterns that are experienced just once to form the synaptic matrix). However, the term "Learning" in Machine Learning refers to the ability of the machine to extract features from the supplied dataset (e.g., made of blurred examples of these archetypes), in order to make its own representation of the unavailable archetypes. Here, given a sample of examples, we define a supervised learning protocol by which the Hopfield network can infer the archetypes, and we detect the correct control parameters (including size and quality of the dataset) to depict a phase diagram for the system performance. We also prove that, for structureless datasets, the Hopfield model equipped with this supervised learning rule is equivalent to a restricted Boltzmann machine and this suggests an optimal and interpretable training routine. Finally, this approach is generalized to structured datasets: we highlight a quasi-ultrametric organization (reminiscent of replica-symmetry-breaking) in the analyzed datasets and, consequently, we introduce an additional "replica hidden layer" for its (partial) disentanglement, which is shown to improve MNIST classification from 75% to 95%, and to offer a new perspective on deep architectures.
△ Less
Submitted 7 September, 2022; v1 submitted 2 March, 2022;
originally announced March 2022.
-
Anisotropic long-range spin transport in canted antiferromagnetic orthoferrite YFeO$_3$
Authors:
Shubhankar Das,
A. Ross,
X. X. Ma,
S. Becker,
C. Schmitt,
F. van Duijn,
F. Fuhrmann,
M. -A. Syskaki,
U. Ebels,
V. Baltz,
A. -L. Barra,
H. Y. Chen,
G. Jakob,
S. X. Cao,
J. Sinova,
O. Gomonay,
R. Lebrun,
M. Kläui
Abstract:
In antiferromagnets, the efficient propagation of spin-waves has until now only been observed in the insulating antiferromagnet hematite, where circularly (or a superposition of pairs of linearly) polarized spin-waves propagate over long distances. Here, we report long-distance spin-transport in the antiferromagnetic orthoferrite YFeO$_3$, where a different transport mechanism is enabled by the co…
▽ More
In antiferromagnets, the efficient propagation of spin-waves has until now only been observed in the insulating antiferromagnet hematite, where circularly (or a superposition of pairs of linearly) polarized spin-waves propagate over long distances. Here, we report long-distance spin-transport in the antiferromagnetic orthoferrite YFeO$_3$, where a different transport mechanism is enabled by the combined presence of the Dzyaloshinskii-Moriya interaction and externally applied fields. The magnon decay length is shown to exceed hundreds of nano-meters, in line with resonance measurements that highlight the low magnetic damping. We observe a strong anisotropy in the magnon decay lengths that we can attribute to the role of the magnon group velocity in the propagation of spin-waves in antiferromagnets. This unique mode of transport identified in YFeO$_3$ opens up the possibility of a large and technologically relevant class of materials, i.e., canted antiferromagnets, for long-distance spin transport.
△ Less
Submitted 11 December, 2021;
originally announced December 2021.
-
Replica symmetry breaking in dense neural networks
Authors:
Linda Albanese,
Francesco Alemanno,
Andrea Alessandrelli,
Adriano Barra
Abstract:
Understanding the glassy nature of neural networks is pivotal both for theoretical and computational advances in Machine Learning and Theoretical Artificial Intelligence. Keeping the focus on dense associative Hebbian neural networks, the purpose of this paper is two-fold: at first we develop rigorous mathematical approaches to address properly a statistical mechanical picture of the phenomenon of…
▽ More
Understanding the glassy nature of neural networks is pivotal both for theoretical and computational advances in Machine Learning and Theoretical Artificial Intelligence. Keeping the focus on dense associative Hebbian neural networks, the purpose of this paper is two-fold: at first we develop rigorous mathematical approaches to address properly a statistical mechanical picture of the phenomenon of {\em replica symmetry breaking} (RSB) in these networks, then -- deepening results stemmed via these routes -- we aim to inspect the {\em glassiness} that they hide. In particular, regarding the methodology, we provide two techniques: the former is an adaptation of the transport PDE to the case, while the latter is an extension of Guerra's interpolation breakthrough. Beyond coherence among the results, either in replica symmetric and in the one-step replica symmetry breaking level of description, we prove the Gardner's picture and we identify the maximal storage capacity by a ground-state analysis in the Baldi-Venkatesh high-storage regime.
In the second part of the paper we investigate the glassy structure of these networks: in contrast with the replica symmetric scenario (RS), RSB actually stabilizes the spin-glass phase. We report huge differences w.r.t. the standard pairwise Hopfield limit: in particular, it is known that it is possible to express the free energy of the Hopfield neural network as a linear combination of the free energies of an hard spin glass (i.e. the Sherrington-Kirkpatrick model) and a soft spin glass (the Gaussian or "spherical" model). This is no longer true when interactions are more than pairwise (whatever the level of description, RS or RSB): for dense networks solely the free energy of the hard spin glass survives, proving a huge diversity in the underlying glassiness of associative neural networks.
△ Less
Submitted 25 November, 2021;
originally announced November 2021.
-
Determining Sidon Polynomials on Sidon Sets over $\mathbb{F}_q\times \mathbb{F}_q$
Authors:
Muhammad Afifurrahman,
Aleams Barra
Abstract:
Let $p$ be a prime, and $q=p^n$ be a prime power. In his works on Sidon sets over $\mathbb{F}_q\times \mathbb{F}_q$, Cilleruelo conjectured about polynomials that could generate $q$-element Sidon sets over $\mathbb{F}_q\times \mathbb{F}_q$.
Here, we derive some criteria for determining polynomials that could generate $q$-element Sidon set over $\mathbb{F}_q\times \mathbb{F}_q$. Using these crite…
▽ More
Let $p$ be a prime, and $q=p^n$ be a prime power. In his works on Sidon sets over $\mathbb{F}_q\times \mathbb{F}_q$, Cilleruelo conjectured about polynomials that could generate $q$-element Sidon sets over $\mathbb{F}_q\times \mathbb{F}_q$.
Here, we derive some criteria for determining polynomials that could generate $q$-element Sidon set over $\mathbb{F}_q\times \mathbb{F}_q$. Using these criteria, we prove that certain classes of monomials and cubic polynomials over $\mathbb{F}_p$ cannot be used to generate $p$-element Sidon set over $\mathbb{F}_p\times \mathbb{F}_p$. We also discover a connection between the needed polynomials and planar polynomials.
△ Less
Submitted 17 June, 2023; v1 submitted 16 November, 2021;
originally announced November 2021.
-
The emergence of a concept in shallow neural networks
Authors:
Elena Agliari,
Francesco Alemanno,
Adriano Barra,
Giordano De Marzo
Abstract:
We consider restricted Boltzmann machine (RBMs) trained over an unstructured dataset made of blurred copies of definite but unavailable ``archetypes'' and we show that there exists a critical sample size beyond which the RBM can learn archetypes, namely the machine can successfully play as a generative model or as a classifier, according to the operational routine. In general, assessing a critical…
▽ More
We consider restricted Boltzmann machine (RBMs) trained over an unstructured dataset made of blurred copies of definite but unavailable ``archetypes'' and we show that there exists a critical sample size beyond which the RBM can learn archetypes, namely the machine can successfully play as a generative model or as a classifier, according to the operational routine. In general, assessing a critical sample size (possibly in relation to the quality of the dataset) is still an open problem in machine learning. Here, restricting to the random theory, where shallow networks suffice and the grand-mother cell scenario is correct, we leverage the formal equivalence between RBMs and Hopfield networks, to obtain a phase diagram for both the neural architectures which highlights regions, in the space of the control parameters (i.e., number of archetypes, number of neurons, size and quality of the training set), where learning can be accomplished. Our investigations are led by analytical methods based on the statistical-mechanics of disordered systems and results are further corroborated by extensive Monte Carlo simulations.
△ Less
Submitted 1 September, 2021;
originally announced September 2021.
-
Robust magnetic anisotropy of a monolayer of hexacoordinate Fe( ii ) complexes assembled on Cu(111)
Authors:
Massine Kelai,
Benjamin Cahier,
Mihail Atanasov,
Frank Neese,
Yongfeng Tong,
Luqiong Zhang,
Amandine Bellec,
Olga Iasco,
Eric Rivière,
Régis Guillot,
Cyril Chacon,
Yann Girard,
Jérôme Lagoute,
Sylvie Rousset,
Vincent Repain,
Edwige Otero,
Marie-Anne Arrio,
Philippe Sainctavit,
Anne-Laure Barra,
Marie-Laure Boillot,
Talal Mallah
Abstract:
The tris pyrazolyl borate ligand imposes a rigid scaffold around Fe( ii ) ensuring a robust magnetic anisotropy when the molecules assembled as monolayers suffer from the dissymmetric environment of the substrate/vacuum interface.
The tris pyrazolyl borate ligand imposes a rigid scaffold around Fe( ii ) ensuring a robust magnetic anisotropy when the molecules assembled as monolayers suffer from the dissymmetric environment of the substrate/vacuum interface.
△ Less
Submitted 10 May, 2021;
originally announced May 2021.
-
Effective strain manipulation of the antiferromagnetic state of polycrystalline NiO
Authors:
A. Barra,
A. Ross,
O. Gomonay,
L. Baldrati,
A. Chavez,
R. Lebrun,
J. D. Schneider,
P. Shirazi,
Q. Wang,
J. Sinova,
G. P. Carman,
M. Kläui
Abstract:
As a candidate material for applications such as magnetic memory, polycrystalline antiferromagnets offer the same robustness to external magnetic fields, THz spin dynamics, and lack of stray field as their single crystalline counterparts, but without the limitation of epitaxial growth and lattice matched substrates. Here, we first report the detection of the average Neel vector orientiation in pol…
▽ More
As a candidate material for applications such as magnetic memory, polycrystalline antiferromagnets offer the same robustness to external magnetic fields, THz spin dynamics, and lack of stray field as their single crystalline counterparts, but without the limitation of epitaxial growth and lattice matched substrates. Here, we first report the detection of the average Neel vector orientiation in polycrystalline NiO via spin Hall magnetoresistance (SMR). Secondly, by applying strain through a piezo-electric substrate, we reduce the critical magnetic field required to reach a saturation of the SMR signal, indicating a change of the anisotropy. Our results are consistent with polycrystalline NiO exhibiting a positive sign of the in-plane magnetostriction. This method of anisotropy-tuning offers an energy efficient, on-chip alternative to manipulate a polycrystalline antiferromagnets magnetic state.
△ Less
Submitted 24 March, 2021;
originally announced March 2021.
-
Chemical tuning of spin clock transitions in molecular monomers based on nuclear spin-free Ni(II)
Authors:
Marcos Rubín-Osanz,
François Lambert,
Feng Shao,
Eric Rivière,
Régis Guillot,
Nicolas Suaud,
Nathalie Guihéry,
David Zueco,
Anne-Laure Barra,
Talal Mallah,
Fernando Luis
Abstract:
We report the existence of a sizeable quantum tunnelling splitting between the two lowest electronic spin levels of mononuclear Ni complexes. The level anti-crossing, or magnetic clock transition, associated with this gap has been directly monitored by heat capacity experiments. The comparison of these results with those obtained for a Co derivative, for which tunnelling is forbidden by symmetry,…
▽ More
We report the existence of a sizeable quantum tunnelling splitting between the two lowest electronic spin levels of mononuclear Ni complexes. The level anti-crossing, or magnetic clock transition, associated with this gap has been directly monitored by heat capacity experiments. The comparison of these results with those obtained for a Co derivative, for which tunnelling is forbidden by symmetry, shows that the clock transition leads to an effective suppression of intermolecular spin-spin interactions. In addition, we show that the quantum tunnelling splitting admits a chemical tuning via the modification of the ligand shell that determines the crystal field and the magnetic anisotropy. These properties are crucial to realize model spin qubits that combine the necessary resilience against decoherence, a proper interfacing with other qubits and with the control circuitry and the ability to initialize them by cooling.
△ Less
Submitted 4 March, 2021;
originally announced March 2021.
-
Towards the development of human immune-system-on-a-chip platforms
Authors:
Alessandro Polini,
Loretta L. del Mercato,
Adriano Barra,
Yu Shrike Zhang,
Franco Calabi,
Giuseppe Gigli
Abstract:
Organ-on-a-chip (OoCs) platforms could revolutionize drug discovery and might ultimately become essential tools for precision therapy. Although many single-organ and interconnected systems have been described, the immune system has been comparatively neglected, despite its pervasive role in the body and the trend towards newer therapeutic products (i.e., complex biologics, nanoparticles, immune ch…
▽ More
Organ-on-a-chip (OoCs) platforms could revolutionize drug discovery and might ultimately become essential tools for precision therapy. Although many single-organ and interconnected systems have been described, the immune system has been comparatively neglected, despite its pervasive role in the body and the trend towards newer therapeutic products (i.e., complex biologics, nanoparticles, immune checkpoint inhibitors, and engineered T cells) that often cause, or are based on, immune reactions. In this review, we recapitulate some distinctive features of the immune system before reviewing microfluidic devices that mimic lymphoid organs or other organs and/or tissues with an integrated immune system component.
△ Less
Submitted 25 February, 2021;
originally announced February 2021.
-
Replica symmetry breaking in neural networks: a few steps toward rigorous results
Authors:
Elena Agliari,
Linda Albanese,
Adriano Barra,
Gabriele Ottaviani
Abstract:
In this paper we adapt the broken replica interpolation technique (developed by Francesco Guerra to deal with the Sherrington-Kirkpatrick model, namely a pairwise mean-field spin-glass whose couplings are i.i.d. standard Gaussian variables) in order to work also with the Hopfield model (i.e., a pairwise mean-field neural-network whose couplings are drawn according to Hebb's learning rule): this is…
▽ More
In this paper we adapt the broken replica interpolation technique (developed by Francesco Guerra to deal with the Sherrington-Kirkpatrick model, namely a pairwise mean-field spin-glass whose couplings are i.i.d. standard Gaussian variables) in order to work also with the Hopfield model (i.e., a pairwise mean-field neural-network whose couplings are drawn according to Hebb's learning rule): this is accomplished by grafting Guerra's telescopic averages on the transport equation technique, recently developed by some of the Authors. As an overture, we apply the technique to solve the Sherrington-Kirkpatrick model with i.i.d. Gaussian couplings centered at $J_0$ and with finite variance $J$; the mean $J_0$ plays the role of a signal to be detected in a noisy environment tuned by $J$, hence making this model a natural test-case to be investigated before addressing the Hopfield model. For both the models, an explicit expression of their quenched free energy in terms of their natural order parameters is obtained at the K-th step (K arbitrary, but finite) of replica-symmetry-breaking. In particular, for the Hopfield model, by assuming that the overlaps respect Parisi's decomposition (following the ziqqurat ansatz) and that the Mattis magnetization is self-averaging, we recover previous results obtained via replica-trick by Amit, Crisanti and Gutfreund (1RSB) and by Steffan and Kühn (2RSB).
△ Less
Submitted 30 May, 2020;
originally announced June 2020.
-
Long-distance spin-transport across the Morin phase transition up to room temperature in ultra-low damping single crystals of the antiferromagnet α-Fe2O3
Authors:
Romain Lebrun,
Andrew Ross,
Olena Gomonay,
Vincent Baltz,
Ursula Ebels,
Anne Laure Barra,
Alireza Qaiumzadeh,
Arne Brataas,
Jairo Sinova,
Mathias Kläui
Abstract:
Antiferromagnetic materials can host spin-waves with polarizations ranging from circular to linear depending on their magnetic anisotropies. Until now, only easy-axis anisotropy antiferromagnets with circularly polarized spin-waves were reported to carry spin-information over long distances of micrometers. In this article, we report long-distance spin-transport in the easy-plane canted antiferroma…
▽ More
Antiferromagnetic materials can host spin-waves with polarizations ranging from circular to linear depending on their magnetic anisotropies. Until now, only easy-axis anisotropy antiferromagnets with circularly polarized spin-waves were reported to carry spin-information over long distances of micrometers. In this article, we report long-distance spin-transport in the easy-plane canted antiferromagnetic phase of hematite and at room temperature, where the linearly polarized magnons are not intuitively expected to carry spin. We demonstrate that the spin-transport signal decreases continuously through the easy-axis to easy-plane Morin transition, and persists in the easy-plane phase through current induced pairs of linearly polarized magnons with dephasing lengths in the micrometer range. We explain the long transport distance as a result of the low magnetic damping, which we measure to be below 0.0001 as in the best ferromagnets. All of this together demonstrates that long-distance transport can be achieved across a range of anisotropies and temperatures, up to room temperature, highlighting the promising potential of this insulating antiferromagnet for magnon-based devices.
△ Less
Submitted 28 April, 2021; v1 submitted 29 May, 2020;
originally announced May 2020.
-
Fractional Brownian Motions and their multifractal analysis applied to Parana river flow
Authors:
M. N. Piacquadio Losada,
R. Seoane,
A. de la Barra,
L. F. Caram
Abstract:
A number of different analysis techniques have been used to analyze long-term time series data from different rivers, starting with the determination of the Hurst coefficient. We summarize the concept of fractals, multifractals and Fractional Brownian Motion (FBM), and apply some such techniques to daily stream flow data from the Parana River recorded at Corrientes, Argentina, for 106 years. After…
▽ More
A number of different analysis techniques have been used to analyze long-term time series data from different rivers, starting with the determination of the Hurst coefficient. We summarize the concept of fractals, multifractals and Fractional Brownian Motion (FBM), and apply some such techniques to daily stream flow data from the Parana River recorded at Corrientes, Argentina, for 106 years. After determining the Hurst coefficient for the entire data set (H = 0.76), we analyze the data for each of four seasons and draw the corresponding FBM graphs and their multifractal spectra (MFS). Three of the seasons are similar, but autumn is very different for both FBM and MFS. Based on the MFS results, we propose a number of indices for measuring variations in stream flow, and determine the values of the indices for the three similar seasons. The indices are based on important parameters of the multifractal spectra. The geometry of the spectra as well as the indices all indicate that Winter is the most stable season. This is in contrast to the Boxplot of seasonal stream flow data where Winter shows the largest variation. Thus, these indices provide insight into river flow stability, not detected in, and indeed contradictory to, that from basic statistical analysis.
△ Less
Submitted 4 February, 2020;
originally announced February 2020.
-
Annealing and replica-symmetry in Deep Boltzmann Machines
Authors:
Diego Alberici,
Adriano Barra,
Pierluigi Contucci,
Emanuele Mingione
Abstract:
In this paper we study the properties of the quenched pressure of a multi-layer spin-glass model (a deep Boltzmann Machine in artificial intelligence jargon) whose pairwise interactions are allowed between spins lying in adjacent layers and not inside the same layer nor among layers at distance larger than one. We prove a theorem that bounds the quenched pressure of such a K-layer machine in terms…
▽ More
In this paper we study the properties of the quenched pressure of a multi-layer spin-glass model (a deep Boltzmann Machine in artificial intelligence jargon) whose pairwise interactions are allowed between spins lying in adjacent layers and not inside the same layer nor among layers at distance larger than one. We prove a theorem that bounds the quenched pressure of such a K-layer machine in terms of K Sherrington-Kirkpatrick spin glasses and use it to investigate its annealed region. The replica-symmetric approximation of the quenched pressure is identified and its relation to the annealed one is considered. The paper also presents some observation on the model's architectural structure related to machine learning. Since escaping the annealed region is mandatory for a meaningful training, by squeezing such region we obtain thermodynamical constraints on the form factors. Remarkably, its optimal escape is achieved by requiring the last layer to scale sub-linearly in the network size.
△ Less
Submitted 21 January, 2020;
originally announced January 2020.
-
A statistical-inference approach to reconstruct inter-cellular interactions in cell-migration experiments
Authors:
Elena Agliari,
Pablo J. Sáez,
Adriano Barra,
Matthieu Piel,
Pablo Vargas,
Michele Castellana
Abstract:
Migration of cells can be characterized by two, prototypical types of motion: individual and collective migration. We propose a statistical-inference approach designed to detect the presence of cell-cell interactions that give rise to collective behaviors in cell-motility experiments. Such inference method has been first successfully tested on synthetic motional data, and then applied to two exper…
▽ More
Migration of cells can be characterized by two, prototypical types of motion: individual and collective migration. We propose a statistical-inference approach designed to detect the presence of cell-cell interactions that give rise to collective behaviors in cell-motility experiments. Such inference method has been first successfully tested on synthetic motional data, and then applied to two experiments. In the first experiment, cell migrate in a wound-healing model: when applied to this experiment, the inference method predicts the existence of cell-cell interactions, correctly mirroring the strong intercellular contacts which are present in the experiment. In the second experiment, dendritic cells migrate in a chemokine gradient. Our inference analysis does not provide evidence for interactions, indicating that cells migrate by sensing independently the chemokine source. According to this prediction, we speculate that mature dendritic cells disregard inter-cellular signals that could otherwise delay their arrival to lymph vessels.
△ Less
Submitted 4 December, 2019;
originally announced December 2019.
-
Generalized Guerra's interpolation schemes for dense associative neural networks
Authors:
Elena Agliari,
Francesco Alemanno,
Adriano Barra,
Alberto Fachechi
Abstract:
In this work we develop analytical techniques to investigate a broad class of associative neural networks set in the high-storage regime. These techniques translate the original statistical-mechanical problem into an analytical-mechanical one which implies solving a set of partial differential equations, rather than tackling the canonical probabilistic route. We test the method on the classical Ho…
▽ More
In this work we develop analytical techniques to investigate a broad class of associative neural networks set in the high-storage regime. These techniques translate the original statistical-mechanical problem into an analytical-mechanical one which implies solving a set of partial differential equations, rather than tackling the canonical probabilistic route. We test the method on the classical Hopfield model - where the cost function includes only two-body interactions (i.e., quadratic terms) - and on the "relativistic" Hopfield model - where the (expansion of the) cost function includes p-body (i.e., of degree p) contributions. Under the replica symmetric assumption, we paint the phase diagrams of these models by obtaining the explicit expression of their free energy as a function of the model parameters (i.e., noise level and memory storage). Further, since for non-pairwise models ergodicity breaking is non necessarily a critical phenomenon, we develop a fluctuation analysis and find that criticality is preserved in the relativistic model.
△ Less
Submitted 16 April, 2020; v1 submitted 28 November, 2019;
originally announced November 2019.
-
Neural networks with redundant representation: detecting the undetectable
Authors:
Elena Agliari,
Francesco Alemanno,
Adriano Barra,
Martino Centonze,
Alberto Fachechi
Abstract:
We consider a three-layer Sejnowski machine and show that features learnt via contrastive divergence have a dual representation as patterns in a dense associative memory of order P=4. The latter is known to be able to Hebbian-store an amount of patterns scaling as N^{P-1}, where N denotes the number of constituting binary neurons interacting P-wisely. We also prove that, by keeping the dense assoc…
▽ More
We consider a three-layer Sejnowski machine and show that features learnt via contrastive divergence have a dual representation as patterns in a dense associative memory of order P=4. The latter is known to be able to Hebbian-store an amount of patterns scaling as N^{P-1}, where N denotes the number of constituting binary neurons interacting P-wisely. We also prove that, by keeping the dense associative network far from the saturation regime (namely, allowing for a number of patterns scaling only linearly with N, while P>2) such a system is able to perform pattern recognition far below the standard signal-to-noise threshold. In particular, a network with P=4 is able to retrieve information whose intensity is O(1) even in the presence of a noise O(\sqrt{N}) in the large N limit. This striking skill stems from a redundancy representation of patterns -- which is afforded given the (relatively) low-load information storage -- and it contributes to explain the impressive abilities in pattern recognition exhibited by new-generation neural networks. The whole theory is developed rigorously, at the replica symmetric level of approximation, and corroborated by signal-to-noise analysis and Monte Carlo simulations.
△ Less
Submitted 28 November, 2019;
originally announced November 2019.
-
Permutation codes over finite fields
Authors:
Irwansyah,
Intan Muchtadi-Alamsyah,
Aleams Barra
Abstract:
In this paper we describe a class of codes called {\it permutation codes}. This class of codes is a generalization of cyclic codes and quasi-cyclic codes. We also give some examples of optimal permutation codes over binary, ternary, and $5$-ary. Then, we describe its structure as submodules over a polynomial ring.
In this paper we describe a class of codes called {\it permutation codes}. This class of codes is a generalization of cyclic codes and quasi-cyclic codes. We also give some examples of optimal permutation codes over binary, ternary, and $5$-ary. Then, we describe its structure as submodules over a polynomial ring.
△ Less
Submitted 14 April, 2019;
originally announced April 2019.
-
Dreaming neural networks: rigorous results
Authors:
Elena Agliari,
Francesco Alemanno,
Adriano Barra,
Alberto Fachechi
Abstract:
Recently a daily routine for associative neural networks has been proposed: the network Hebbian-learns during the awake state (thus behaving as a standard Hopfield model), then, during its sleep state, optimizing information storage, it consolidates pure patterns and removes spurious ones: this forces the synaptic matrix to collapse to the projector one (ultimately approaching the Kanter-Sompolink…
▽ More
Recently a daily routine for associative neural networks has been proposed: the network Hebbian-learns during the awake state (thus behaving as a standard Hopfield model), then, during its sleep state, optimizing information storage, it consolidates pure patterns and removes spurious ones: this forces the synaptic matrix to collapse to the projector one (ultimately approaching the Kanter-Sompolinksy model). This procedure keeps the learning Hebbian-based (a biological must) but, by taking advantage of a (properly stylized) sleep phase, still reaches the maximal critical capacity (for symmetric interactions). So far this emerging picture (as well as the bulk of papers on unlearning techniques) was supported solely by mathematically-challenging routes, e.g. mainly replica-trick analysis and numerical simulations: here we rely extensively on Guerra's interpolation techniques developed for neural networks and, in particular, we extend the generalized stochastic stability approach to the case. Confining our description within the replica symmetric approximation (where the previous ones lie), the picture painted regarding this generalization (and the previously existing variations on theme) is here entirely confirmed. Further, still relying on Guerra's schemes, we develop a systematic fluctuation analysis to check where ergodicity is broken (an analysis entirely absent in previous investigations). We find that, as long as the network is awake, ergodicity is bounded by the Amit-Gutfreund-Sompolinsky critical line (as it should), but, as the network sleeps, sleeping destroys spin glass states by extending both the retrieval as well as the ergodic region: after an entire sleeping session the solely surviving regions are retrieval and ergodic ones and this allows the network to achieve the perfect retrieval regime (the number of storable patterns equals the number of neurons in the network).
△ Less
Submitted 21 December, 2018;
originally announced December 2018.
-
A novel derivation of the Marchenko-Pastur law through analog bipartite spin-glasses
Authors:
Elena Agliari,
Francesco Alemanno,
Adriano Barra,
Alberto Fachechi
Abstract:
In this work we consider the {\em analog bipartite spin-glass} (or {\em real-valued restricted Boltzmann machine} in a neural network jargon), whose variables (those quenched as well as those dynamical) share standard Gaussian distributions. First, via Guerra's interpolation technique, we express its quenched free energy in terms of the natural order parameters of the theory (namely the self- and…
▽ More
In this work we consider the {\em analog bipartite spin-glass} (or {\em real-valued restricted Boltzmann machine} in a neural network jargon), whose variables (those quenched as well as those dynamical) share standard Gaussian distributions. First, via Guerra's interpolation technique, we express its quenched free energy in terms of the natural order parameters of the theory (namely the self- and two-replica overlaps), then, we re-obtain the same result by using the replica-trick: a mandatory tribute, given the special occasion. Next, we show that the quenched free energy of this model is the functional generator of the moments of the correlation matrix among the weights connecting the two layers of the spin-glass (i.e., the Wishart matrix in random matrix theory or the Hebbian coupling in neural networks): as weights are quenched stochastic variables, this plays as a novel tool to inspect random matrices. In particular, we find that the Stieltjes transform of the spectral density of the correlation matrix is determined by the (replica-symmetric) quenched free energy of the bipartite spin-glass model. In this setup, we re-obtain the Marchenko-Pastur law in a very simple way.
△ Less
Submitted 20 November, 2018;
originally announced November 2018.
-
Dreaming neural networks: forgetting spurious memories and reinforcing pure ones
Authors:
Alberto Fachechi,
Elena Agliari,
Adriano Barra
Abstract:
The standard Hopfield model for associative neural networks accounts for biological Hebbian learning and acts as the harmonic oscillator for pattern recognition, however its maximal storage capacity is $α\sim 0.14$, far from the theoretical bound for symmetric networks, i.e. $α=1$. Inspired by sleeping and dreaming mechanisms in mammal brains, we propose an extension of this model displaying the s…
▽ More
The standard Hopfield model for associative neural networks accounts for biological Hebbian learning and acts as the harmonic oscillator for pattern recognition, however its maximal storage capacity is $α\sim 0.14$, far from the theoretical bound for symmetric networks, i.e. $α=1$. Inspired by sleeping and dreaming mechanisms in mammal brains, we propose an extension of this model displaying the standard on-line (awake) learning mechanism (that allows the storage of external information in terms of patterns) and an off-line (sleep) unlearning$\&$consolidating mechanism (that allows spurious-pattern removal and pure-pattern reinforcement): this obtained daily prescription is able to saturate the theoretical bound $α=1$, remaining also extremely robust against thermal noise. Both neural and synaptic features are analyzed both analytically and numerically. In particular, beyond obtaining a phase diagram for neural dynamics, we focus on synaptic plasticity and we give explicit prescriptions on the temporal evolution of the synaptic matrix. We analytically prove that our algorithm makes the Hebbian kernel converge with high probability to the projection matrix built over the pure stored patterns. Furthermore, we obtain a sharp and explicit estimate for the "sleep rate" in order to ensure such a convergence. Finally, we run extensive numerical simulations (mainly Monte Carlo sampling) to check the approximations underlying the analytical investigations (e.g., we developed the whole theory at the so called replica-symmetric level, as standard in the Amit-Gutfreund-Sompolinsky reference framework) and possible finite-size effects, finding overall full agreement with the theory.
△ Less
Submitted 29 October, 2018;
originally announced October 2018.
-
The Relativistic Hopfield network: rigorous results
Authors:
Elena Agliari,
Adriano Barra,
Matteo Notarnicola
Abstract:
The relativistic Hopfield model constitutes a generalization of the standard Hopfield model that is derived by the formal analogy between the statistical-mechanic framework embedding neural networks and the Lagrangian mechanics describing a fictitious single-particle motion in the space of the tuneable parameters of the network itself. In this analogy the cost-function of the Hopfield model plays…
▽ More
The relativistic Hopfield model constitutes a generalization of the standard Hopfield model that is derived by the formal analogy between the statistical-mechanic framework embedding neural networks and the Lagrangian mechanics describing a fictitious single-particle motion in the space of the tuneable parameters of the network itself. In this analogy the cost-function of the Hopfield model plays as the standard kinetic-energy term and its related Mattis overlap (naturally bounded by one) plays as the velocity. The Hamiltonian of the relativisitc model, once Taylor-expanded, results in a P-spin series with alternate signs: the attractive contributions enhance the information-storage capabilities of the network, while the repulsive contributions allow for an easier unlearning of spurious states, conferring overall more robustness to the system as a whole. Here we do not deepen the information processing skills of this generalized Hopfield network, rather we focus on its statistical mechanical foundation. In particular, relying on Guerra's interpolation techniques, we prove the existence of the infinite volume limit for the model free-energy and we give its explicit expression in terms of the Mattis overlaps. By extremizing the free energy over the latter we get the generalized self-consistent equations for these overlaps, as well as a picture of criticality that is further corroborated by a fluctuation analysis. These findings are in full agreement with the available previous results.
△ Less
Submitted 29 October, 2018;
originally announced October 2018.
-
Free energies of Boltzmann Machines: self-averaging, annealed and replica symmetric approximations in the thermodynamic limit
Authors:
Elena Agliari,
Adriano Barra,
Brunello Tirozzi
Abstract:
Restricted Boltzmann machines (RBMs) constitute one of the main models for machine statistical inference and they are widely employed in Artificial Intelligence as powerful tools for (deep) learning. However, in contrast with countless remarkable practical successes, their mathematical formalization has been largely elusive: from a statistical-mechanics perspective these systems display the same (…
▽ More
Restricted Boltzmann machines (RBMs) constitute one of the main models for machine statistical inference and they are widely employed in Artificial Intelligence as powerful tools for (deep) learning. However, in contrast with countless remarkable practical successes, their mathematical formalization has been largely elusive: from a statistical-mechanics perspective these systems display the same (random) Gibbs measure of bi-partite spin-glasses, whose rigorous treatment is notoriously difficult. In this work, beyond providing a brief review on RBMs from both the learning and the retrieval perspectives, we aim to contribute to their analytical investigation, by considering two distinct realizations of their weights (i.e., Boolean and Gaussian) and studying the properties of their related free energies. More precisely, focusing on a RBM characterized by digital couplings, we first extend the Pastur-Shcherbina-Tirozzi method (originally developed for the Hopfield model) to prove the self-averaging property for the free energy, over its quenched expectation, in the infinite volume limit, then we explicitly calculate its simplest approximation, namely its annealed bound. Next, focusing on a RBM characterized by analogical weights, we extend Guerra's interpolating scheme to obtain a control of the quenched free-energy under the assumption of replica symmetry: we get self-consistencies for the order parameters (in full agreement with the existing Literature) as well as the critical line for ergodicity breaking that turns out to be the same obtained in AGS theory. As we discuss, this analogy stems from the slow-noise universality. Finally, glancing beyond replica symmetry, we analyze the fluctuations of the overlaps for an estimate of the (slow) noise affecting the retrieval of the signal, and by a stability analysis we recover the Aizenman-Contucci identities typical of glassy systems.
△ Less
Submitted 8 March, 2019; v1 submitted 20 October, 2018;
originally announced October 2018.
-
Voltage Control of Magnetic Monopoles in Artificial Spin Ice
Authors:
Andres C. Chavez,
Anthony Barra,
Gregory P. Carman
Abstract:
Current research on artificial spin ice (ASI) systems has revealed unique hysteretic memory effects and mobile quasi-particle monopoles controlled by externally applied magnetic fields. Here, we numerically demonstrate a strain-mediated multiferroic approach to locally control the ASI monopoles. The magnetization of individual lattice elements is controlled by applying voltage pulses to the piezoe…
▽ More
Current research on artificial spin ice (ASI) systems has revealed unique hysteretic memory effects and mobile quasi-particle monopoles controlled by externally applied magnetic fields. Here, we numerically demonstrate a strain-mediated multiferroic approach to locally control the ASI monopoles. The magnetization of individual lattice elements is controlled by applying voltage pulses to the piezoelectric layer resulting in strain-induced magnetic precession timed for 180 degree reorientation. The model demonstrates localized voltage control to move the magnetic monopoles across lattice sites, in CoFeB, Ni, and FeGa based ASI$'$s. The switching is achieved at frequencies near ferromagnetic resonance and requires energies below 620 aJ. The results demonstrate that ASI monopoles can be efficiently and locally controlled with a strain-mediated multiferroic approach.
△ Less
Submitted 22 March, 2018;
originally announced March 2018.
-
Strain-mediated spin-orbit torque switching for magnetic memory
Authors:
Qianchang Wang,
John Domann,
Guoqiang Yu,
Anthony Barra,
Kang L. Wang,
Gregory P. Carman
Abstract:
Spin-orbit torque (SOT) represents an energy efficient method to control magnetization in magnetic memory devices. However, deterministically switching perpendicular memory bits usually requires the application of an additional bias field for breaking lateral symmetry. Here we present a new approach of field-free deterministic perpendicular switching using a strain-mediated SOT switching method. T…
▽ More
Spin-orbit torque (SOT) represents an energy efficient method to control magnetization in magnetic memory devices. However, deterministically switching perpendicular memory bits usually requires the application of an additional bias field for breaking lateral symmetry. Here we present a new approach of field-free deterministic perpendicular switching using a strain-mediated SOT switching method. The strain-induced magnetoelastic anisotropy breaks the lateral symmetry, and the resulting symmetry-breaking is controllable. A finite element model and a macrospin model are used to numerically simulate the strain-mediated SOT switching mechanism. The results show that a relatively small voltage (${\pm}0.5$ V) along with a modest current ($3.5 \times 10^{7} A/cm^{2}$) can produce a 180° perpendicular magnetization reversal. The switching direction (up or down) is dictated by the voltage polarity (positive or negative) applied to the piezoelectric layer in the magnetoelastic/heavy metal/piezoelectric heterostructure. The switching speed can be as fast as 10 GHz. More importantly, this control mechanism can be potentially implemented in a magnetic random-access memory system with small footprint, high endurance and high tunnel magnetoresistance (TMR) readout ratio.
△ Less
Submitted 8 December, 2017;
originally announced February 2018.
-
An evolutionary game model for behavioral gambit of loyalists: Global awareness and risk-aversion
Authors:
Eleonora Alfinito,
Adriano Barra,
Matteo Beccaria,
Alberto Fachechi,
Guido Macorini
Abstract:
We study the phase diagram of a minority game where three classes of agents are present. Two types of agents play a risk-loving game that we model by the standard Snowdrift Game. The behaviour of the third type of agents is coded by {\em indifference} w.r.t. the game at all: their dynamics is designed to account for risk-aversion as an innovative behavioral gambit. From this point of view, the cho…
▽ More
We study the phase diagram of a minority game where three classes of agents are present. Two types of agents play a risk-loving game that we model by the standard Snowdrift Game. The behaviour of the third type of agents is coded by {\em indifference} w.r.t. the game at all: their dynamics is designed to account for risk-aversion as an innovative behavioral gambit. From this point of view, the choice of this solitary strategy is enhanced when innovation starts, while is depressed when it becomes the majority option. This implies that the payoff matrix of the game becomes dependent on the global awareness of the agents measured by the relevance of the population of the indifferent players. The resulting dynamics is non-trivial with different kinds of phase transition depending on a few model parameters. The phase diagram is studied on regular as well as complex networks.
△ Less
Submitted 25 February, 2018; v1 submitted 16 January, 2018;
originally announced January 2018.
-
Complex Reaction Kinetics in Chemistry: A unified picture suggested by Mechanics in Physics
Authors:
Elena Agliari,
Adriano Barra,
Giulio Landolfi,
Sara Murciano,
Sarah Perrone
Abstract:
Complex biochemical pathways or regulatory enzyme kinetics can be reduced to chains of elementary reactions, which can be described in terms of chemical kinetics. This discipline provides a set of tools for quantifying and understanding the dialogue between reactants, whose framing into a solid and consistent mathematical description is of pivotal importance in the growing field of biotechnology.…
▽ More
Complex biochemical pathways or regulatory enzyme kinetics can be reduced to chains of elementary reactions, which can be described in terms of chemical kinetics. This discipline provides a set of tools for quantifying and understanding the dialogue between reactants, whose framing into a solid and consistent mathematical description is of pivotal importance in the growing field of biotechnology. Among the elementary reactions so far extensively investigated, we recall the socalled Michaelis-Menten scheme and the Hill positive-cooperative kinetics, which apply to molecular binding and are characterized by the absence and the presence, respectively, of cooperative interactions between binding sites, giving rise to qualitative different phenomenologies. However, there is evidence of reactions displaying a more complex, and by far less understood, pattern: these follow the positive-cooperative scenario at small substrate concentration, yet negative-cooperative effects emerge and get stronger as the substrate concentration is increased. In this paper we analyze the structural analogy between the mathematical backbone of (classical) reaction kinetics in Chemistry and that of (classical) mechanics in Physics: techniques and results from the latter shall be used to infer properties on the former.
△ Less
Submitted 5 January, 2018;
originally announced January 2018.
-
A relativistic extension of Hopfield neural networks via the mechanical analogy
Authors:
Adriano Barra,
Matteo Beccaria,
Alberto Fachechi
Abstract:
We propose a modification of the cost function of the Hopfield model whose salient features shine in its Taylor expansion and result in more than pairwise interactions with alternate signs, suggesting a unified framework for handling both with deep learning and network pruning. In our analysis, we heavily rely on the Hamilton-Jacobi correspondence relating the statistical model with a mechanical s…
▽ More
We propose a modification of the cost function of the Hopfield model whose salient features shine in its Taylor expansion and result in more than pairwise interactions with alternate signs, suggesting a unified framework for handling both with deep learning and network pruning. In our analysis, we heavily rely on the Hamilton-Jacobi correspondence relating the statistical model with a mechanical system. In this picture, our model is nothing but the relativistic extension of the original Hopfield model (whose cost function is a quadratic form in the Mattis magnetization which mimics the non-relativistic Hamiltonian for a free particle). We focus on the low-storage regime and solve the model analytically by taking advantage of the mechanical analogy, thus obtaining a complete characterization of the free energy and the associated self-consistency equations in the thermodynamic limit. On the numerical side, we test the performances of our proposal with MC simulations, showing that the stability of spurious states (limiting the capabilities of the standard Hebbian construction) is sensibly reduced due to presence of unlearning contributions in this extended framework.
△ Less
Submitted 5 January, 2018;
originally announced January 2018.
-
$Θ_S-$cyclic codes over $A_k$
Authors:
Irwansyah,
Aleams Barra,
Steven T. Dougherty,
Ahmad Muchlis,
Intan Muchtadi-Alamsyah,
Patrick Solé,
Djoko Suprijanto,
Olfa Yemen
Abstract:
We study $Θ_S-$cyclic codes over the family of rings $A_k.$ We characterize $Θ_S-$cyclic codes in terms of their binary images. A family of Hermitian inner-products is defined and we prove that if a code is $Θ_S-$cyclic then its Hermitian dual is also $Θ_S-$cyclic. Finally, we give constructions of $Θ_S-$cyclic codes.
We study $Θ_S-$cyclic codes over the family of rings $A_k.$ We characterize $Θ_S-$cyclic codes in terms of their binary images. A family of Hermitian inner-products is defined and we prove that if a code is $Θ_S-$cyclic then its Hermitian dual is also $Θ_S-$cyclic. Finally, we give constructions of $Θ_S-$cyclic codes.
△ Less
Submitted 14 July, 2017;
originally announced July 2017.
-
A Note on The Enumeration of Euclidean Self-Dual Skew-Cyclic Codes over Finite Fields
Authors:
Irwansyah,
Intan Muchtadi-Alamsyah,
Ahmad Muchlis,
Aleams Barra,
Djoko Suprijanto
Abstract:
In this paper we give the enumeration formulas for Euclidean self-dual skew-cyclic codes over finite fields when $(n,|θ|)=1$ and for some cases when $(n,|θ|)>1,$ where $n$ is the length of the code and $|θ|$ is the order of automorphism $θ.$
In this paper we give the enumeration formulas for Euclidean self-dual skew-cyclic codes over finite fields when $(n,|θ|)=1$ and for some cases when $(n,|θ|)>1,$ where $n$ is the length of the code and $|θ|$ is the order of automorphism $θ.$
△ Less
Submitted 17 May, 2017;
originally announced May 2017.
-
Skew-Cyclic Codes over $B_k$
Authors:
Irwansyah,
Aleams Barra,
Intan Muchtadi-Alamsyah,
Ahmad Muchlis,
Djoko Suprijanto
Abstract:
In this paper we study the structure of $θ$-cyclic codes over the ring $B_k$ including its connection to quasi-$\tildeθ$-cyclic codes over finite field $\mathbb{F}_{p^r}$ and skew polynomial rings over $B_k.$ We also characterize Euclidean self-dual $θ$-cyclic codes over the rings. Finally, we give the generator polynomial for such codes and some examples of optimal Euclidean $θ$-cyclic codes.
In this paper we study the structure of $θ$-cyclic codes over the ring $B_k$ including its connection to quasi-$\tildeθ$-cyclic codes over finite field $\mathbb{F}_{p^r}$ and skew polynomial rings over $B_k.$ We also characterize Euclidean self-dual $θ$-cyclic codes over the rings. Finally, we give the generator polynomial for such codes and some examples of optimal Euclidean $θ$-cyclic codes.
△ Less
Submitted 17 May, 2017;
originally announced May 2017.
-
Neural Networks retrieving Boolean patterns in a sea of Gaussian ones
Authors:
Elena Agliari,
Adriano Barra,
Chiara Longo,
Daniele Tantari
Abstract:
Restricted Boltzmann Machines are key tools in Machine Learning and are described by the energy function of bipartite spin-glasses. From a statistical mechanical perspective, they share the same Gibbs measure of Hopfield networks for associative memory. In this equivalence, weights in the former play as patterns in the latter. As Boltzmann machines usually require real weights to be trained with g…
▽ More
Restricted Boltzmann Machines are key tools in Machine Learning and are described by the energy function of bipartite spin-glasses. From a statistical mechanical perspective, they share the same Gibbs measure of Hopfield networks for associative memory. In this equivalence, weights in the former play as patterns in the latter. As Boltzmann machines usually require real weights to be trained with gradient descent like methods, while Hopfield networks typically store binary patterns to be able to retrieve, the investigation of a mixed Hebbian network, equipped with both real (e.g., Gaussian) and discrete (e.g., Boolean) patterns naturally arises. We prove that, in the challenging regime of a high storage of real patterns, where retrieval is forbidden, an extra load of Boolean patterns can still be retrieved, as long as the ratio among the overall load and the network size does not exceed a critical threshold, that turns out to be the same of the standard Amit-Gutfreund-Sompolinsky theory. Assuming replica symmetry, we study the case of a low load of Boolean patterns combining the stochastic stability and Hamilton-Jacobi interpolating techniques. The result can be extended to the high load by a non rigorous but standard replica computation argument.
△ Less
Submitted 15 March, 2017;
originally announced March 2017.
-
Phase Diagram of Restricted Boltzmann Machines and Generalised Hopfield Networks with Arbitrary Priors
Authors:
Adriano Barra,
Giuseppe Genovese,
Peter Sollich,
Daniele Tantari
Abstract:
Restricted Boltzmann Machines are described by the Gibbs measure of a bipartite spin glass, which in turn corresponds to the one of a generalised Hopfield network. This equivalence allows us to characterise the state of these systems in terms of retrieval capabilities, both at low and high load. We study the paramagnetic-spin glass and the spin glass-retrieval phase transitions, as the pattern (i.…
▽ More
Restricted Boltzmann Machines are described by the Gibbs measure of a bipartite spin glass, which in turn corresponds to the one of a generalised Hopfield network. This equivalence allows us to characterise the state of these systems in terms of retrieval capabilities, both at low and high load. We study the paramagnetic-spin glass and the spin glass-retrieval phase transitions, as the pattern (i.e. weight) distribution and spin (i.e. unit) priors vary smoothly from Gaussian real variables to Boolean discrete variables. Our analysis shows that the presence of a retrieval phase is robust and not peculiar to the standard Hopfield model with Boolean patterns. The retrieval region is larger when the pattern entries and retrieval units get more peaked and, conversely, when the hidden units acquire a broader prior and therefore have a stronger response to high fields. Moreover, at low load retrieval always exists below some critical temperature, for every pattern distribution ranging from the Boolean to the Gaussian case.
△ Less
Submitted 29 July, 2017; v1 submitted 20 February, 2017;
originally announced February 2017.
-
Phase transitions in Restricted Boltzmann Machines with generic priors
Authors:
Adriano Barra,
Giuseppe Genovese,
Peter Sollich,
Daniele Tantari
Abstract:
We study Generalised Restricted Boltzmann Machines with generic priors for units and weights, interpolating between Boolean and Gaussian variables. We present a complete analysis of the replica symmetric phase diagram of these systems, which can be regarded as Generalised Hopfield models. We underline the role of the retrieval phase for both inference and learning processes and we show that retrie…
▽ More
We study Generalised Restricted Boltzmann Machines with generic priors for units and weights, interpolating between Boolean and Gaussian variables. We present a complete analysis of the replica symmetric phase diagram of these systems, which can be regarded as Generalised Hopfield models. We underline the role of the retrieval phase for both inference and learning processes and we show that retrieval is robust for a large class of weight and unit priors, beyond the standard Hopfield scenario. Furthermore we show how the paramagnetic phase boundary is directly related to the optimal size of the training set necessary for good generalisation in a teacher-student scenario of unsupervised learning.
△ Less
Submitted 6 September, 2017; v1 submitted 9 December, 2016;
originally announced December 2016.
-
Complete integrability of information processing by biochemical reactions
Authors:
Elena Agliari,
Adriano Barra,
Lorenzo Dello Schiavo,
Antonio Moro
Abstract:
Statistical mechanics provides an effective framework to investigate information processing in biochemical reactions. Within such framework far-reaching analogies are established among (anti-) cooperative collective behaviors in chemical kinetics, (anti-)ferromagnetic spin models in statistical mechanics and operational amplifiers/flip-flops in cybernetics. The underlying modeling -- based on spin…
▽ More
Statistical mechanics provides an effective framework to investigate information processing in biochemical reactions. Within such framework far-reaching analogies are established among (anti-) cooperative collective behaviors in chemical kinetics, (anti-)ferromagnetic spin models in statistical mechanics and operational amplifiers/flip-flops in cybernetics. The underlying modeling -- based on spin systems -- has been proved to be accurate for a wide class of systems matching classical (e.g. Michaelis--Menten, Hill, Adair) scenarios in the infinite-size approximation. However, the current research in biochemical information processing has been focusing on systems involving a relatively small number of units, where this approximation is no longer valid. Here we show that the whole statistical mechanical description of reaction kinetics can be re-formulated via a mechanical analogy -- based on completely integrable hydrodynamic-type systems of PDEs -- which provides explicit finite-size solutions, matching recently investigated phenomena (e.g. noise-induced cooperativity, stochastic bi-stability, quorum sensing). The resulting picture, successfully tested against a broad spectrum of data, constitutes a neat rationale for a numerically effective and theoretically consistent description of collective behaviors in biochemical reactions.
△ Less
Submitted 11 November, 2016; v1 submitted 5 May, 2016;
originally announced May 2016.
-
Inertial terms to magnetization dynamics in ferromagnetic thin films
Authors:
Yi Li,
Anne-Laure Barra,
Stephane Auffret,
Ursula Ebels,
William E. Bailey
Abstract:
Inertial magnetization dynamics have been predicted at ultrahigh speeds, or frequencies approaching the energy relaxation scale of electrons, in ferromagnetic metals. Here we identify inertial terms to magnetization dynamics in thin Ni$_{79}$Fe$_{21}$ and Co films near room temperature. Effective magnetic fields measured in high-frequency ferromagnetic resonance (115-345 GHz) show an additional st…
▽ More
Inertial magnetization dynamics have been predicted at ultrahigh speeds, or frequencies approaching the energy relaxation scale of electrons, in ferromagnetic metals. Here we identify inertial terms to magnetization dynamics in thin Ni$_{79}$Fe$_{21}$ and Co films near room temperature. Effective magnetic fields measured in high-frequency ferromagnetic resonance (115-345 GHz) show an additional stiffening term which is quadratic in frequency and $\sim$ 80 mT at the high frequency limit of our experiment. Our results extend understanding of magnetization dynamics at sub-picosecond time scales.
△ Less
Submitted 9 September, 2015;
originally announced September 2015.
-
Insights in Economical Complexity in Spain: the hidden boost of migrants in international tradings
Authors:
Elena Agliari,
Adriano Barra,
Andrea Galluzzi,
Francisco Requena-Silvente,
Daniele Tantari
Abstract:
We consider extensive data on Spanish international trades and population composition and, through statistical-mechanics and graph-theory driven analysis, we unveil that the social network made of native and foreign-born individuals plays a role in the evolution and in the diversification of trades. Indeed, migrants naturally provide key information on policies and needs in their native countries,…
▽ More
We consider extensive data on Spanish international trades and population composition and, through statistical-mechanics and graph-theory driven analysis, we unveil that the social network made of native and foreign-born individuals plays a role in the evolution and in the diversification of trades. Indeed, migrants naturally provide key information on policies and needs in their native countries, hence allowing firm's holders to leverage transactional costs of exports and duties. As a consequence, international trading is affordable for a larger basin of firms and thus results in an increased number of transactions, which, in turn, implies a larger diversification of international traded products. These results corroborate the novel scenario depicted by "Economical Complexity", where the pattern of production and trade of more developed countries is highly diversified. We also address a central question in Economics, concerning the existence of a critical threshold for migrants (within a given territorial district) over which they effectively contribute to boost international trades: in our physically-driven picture, this phenomenon corresponds to the emergence of a phase transition and, tackling the problem from this perspective, results in a novel successful quantitative route. Finally, we can infer that the pattern of interaction between native and foreign-born population exhibits small-world features as small diameter, large clustering, and weak ties working as optimal cut-edge, in complete agreement with findings in "Social Complexity".
△ Less
Submitted 20 March, 2015;
originally announced March 2015.
-
Emerging heterogeneities in Italian customs and comparison with nearby countries
Authors:
Elena Agliari,
Adriano Barra,
Andrea Galluzzi,
Marco Alberto Javarone,
Andrea Pizzoferrato,
Daniele Tantari
Abstract:
In this work we apply techniques and modus operandi typical of Statistical Mechanics to a large dataset about key social quantifiers and compare the resulting behaviours of five European nations, namely France, Germany, Italy, Spain and Switzerland. The social quantifiers considered are $i.$ the evolution of the number of autochthonous marriages (i.e. between two natives) within a given territoria…
▽ More
In this work we apply techniques and modus operandi typical of Statistical Mechanics to a large dataset about key social quantifiers and compare the resulting behaviours of five European nations, namely France, Germany, Italy, Spain and Switzerland. The social quantifiers considered are $i.$ the evolution of the number of autochthonous marriages (i.e. between two natives) within a given territorial district and $ii.$ the evolution of the number of mixed marriages (i.e. between a native and an immigrant) within a given territorial district. Our investigations are twofold. From a theoretical perspective, we develop novel techniques, complementary to classical methods (e.g. historical series and logistic regression), in order to detect possible collective features underlying the empirical behaviours; from an experimental perspective, we evidence a clear outline for the evolution of the social quantifiers considered. The comparison between experimental results and theoretical predictions is excellent and allows speculating that France, Italy and Spain display a certain degree of {\em internal heterogeneity}, that is not found in Germany and Switzerland; such heterogeneity, quite mild in France and in Spain, is not negligible in Italy and highlights quantitative differences in the customs of Northern and Southern regions. These findings may suggest the persistence of two culturally distinct communities, long-term lasting heritages of different and well-established cultures.
△ Less
Submitted 23 November, 2015; v1 submitted 2 March, 2015;
originally announced March 2015.