Abstract
Neuromorphic machines are intriguing for building energy-efficient intelligent systems, where spiking neurons are pivotal components. Recently, memristive neurons with promising bio-plausibility have been developed, but with limited reliability, bulky capacitors or additional reset circuits. Here, we propose an anti-ferroelectric field-effect transistor neuron based on the inherent polarization and depolarization of Hf0.2Zr0.8O2 anti-ferroelectric film to meet these challenges. The intrinsic accumulated polarization/spontaneous depolarization of Hf0.2Zr0.8O2 films implements the integration/leaky behavior of neurons, avoiding external capacitors and reset circuits. Moreover, the anti-ferroelectric neuron exhibits low energy consumption (37âfJ/spike), high endurance (>1012), high uniformity and high stability. We further construct a two-layer fully ferroelectric spiking neural networks that combines anti-ferroelectric neurons and ferroelectric synapses, achieving 96.8% recognition accuracy on the Modified National Institute of Standards and Technology dataset. This work opens the way to emulate neurons with anti-ferroelectric materials and provides a promising approach to building high-efficient neuromorphic hardware.
Similar content being viewed by others
Introduction
In the past few decades, neuromorphic computing, mimicking the human brainâs architecture and operation with electronic devices, has attracted great interest due to its high biomimetic and high-energy efficiency1,2,3. Artificial neurons are the core components of neuromorphic computing implementation, emulating biological neurons functions of potential accumulation and firing4,5. For the hardware implementation of neurons, hardware overhead, energy efficiency, and reliability are the critical evaluation criteria5,6. Yet, current hardware demonstrations of neurons struggle to satisfy these key metrics simultaneously.
Generally, the complementary metal-oxide-semiconductor (CMOS) circuit is the most mature and stable scheme for emulating biological neurons. Nevertheless, due to the lack of intrinsic biological resemblance and the complexity of circuits, CMOS neurons face many challenges in density or energy efficiency7,8,9. Recently, various emerging devices have been extensively explored to emulate biological neurons benefiting from their biological resemblance and scalability. Memristive neurons have trigged the most interest among them, including redox memristors10,11,12,13, Mott memristors14,15,16,17,18,19, phase-change memristors (PCM)20,21,22, magnetic random access memory (MRAM)23,24, etc. These neurons utilize the gradual switching of conductance to mimic membrane potential evolution, successfully emulating essential biological neuron functions with low hardware cost. However, high electroforming voltage and limited reproducibility due to temporal and spatial variations are still open questions25,26. In addition, capacitors are usually needed to realize the integration in memristive neurons, which limits their practical applications in large-scale neuromorphic computing systems10,17,27. In the very recent research, novel ferroelectric polarization-based neurons are proposed and experimentally demonstrated28,29,30,31,32. They utilize gradual polarization to mimic the integration process of biological neurons without additional capacitors33. Moreover, polarization is the intrinsic property of ferroelectric materials, which is recognized to be reproducible, reliable, and energy-efficient29,34. These features are promising to implement neurons. However, ferroelectric devices are nonvolatile, and thus need a feedback path28,29,30 or a special design of ferroelectric layer31,32 to achieve spontaneous reset after firing. The feedback path will increase the hardware cost and energy consumption of neuron implementation. In addition, it will increase the complexity of the operation, as each new input must wait for the completion of the previous reset process, especially in a system with a rate coding scheme. Thus, demonstrating an ideal electronic device that processes advanced and balanced neuronal performance without additional capacitors and reset feedback path deserves more attention.
In this work, we report a leaky integrate-and-fire (LIF) neuron based on a CMOS-compatible anti-ferroelectric field-effect transistor (AFeFET). The intrinsic polarization/depolarization processes of the Hf0.2Zr0.8O2 AFeFET successfully emulate the integrate/leaky neuronal functions without any capacitors and reset peripheral circuits. Furthermore, attributing to the plentiful merits of ferroelectric materials, AFeFET neuron exhibits many superiorities: electroforming-free, ultra-low-energy consumption (37âfJ/spike), high endurance (>1012), small cycle-to-cycle variation (as low as 3.93%) and device-to-device variations (7.57%). Also, we present that the temporal integration speed in such an AFeFET neuron depends on the intensity of postsynaptic potential, illustrating the fundamental features for performing classification tasks. Subsequently, we demonstrate a two-layer spiking neural network (SNN) with full-ferroelectric architecture for learning and recognizing the Modified National Institute of Standards and Technology (MNIST) datasets by simulation, obtaining the maximum recognition accuracy 96.8% comparable to ideal neurons. These results demonstrate that the proposed AFeFET neuron is a competitive candidate for constructing neuromorphic systems.
Results
Volatile AFeFET as LIF neurons
Figure 1a shows the architecture and processing model of biological neurons. In specific, neuronal dendrites receive input spike information from pre-neurons and transmit it to soma. Then soma integrates information and triggers an action potential when the membrane potential reaches a threshold value. The axon transmits the generated action potentials to post-neurons, and the membrane potential depolarizes to a resting state5. The increase/decrease of membrane potential corresponds to the opening or shutting of Na+/K+ channels, corresponding to three stages in Fig. 1b. Here, the dynamic process of membrane potential can be mimicked vividly by the intrinsic polarization/depolarization of anti-ferroelectric (AFE) materials. Under a silent state, AFE materials have spontaneous polarizations, but the orientations of adjacent diploes are opposite, resulting in zero net macroscopic remanent polarization, as shown in the inset of Fig. 1c. However, the diploes can be aligned by the sufficient electric field, and the phase switches from AFE to ferroelectric (FE)35. Usually, the electric field-induced FE phase is not stable, which will recover to AFE phase when the electric field is released. Thus, AFE materials exhibit volatile characteristics and representative double hysteresis (Prâââ0 at 0âMV/cm) as shown in Fig. 1c. We establish the dynamic relation between the intrinsic volatile characteristic of AFE and integrate/leaky neuronal functions by constructing an AFeFET, which is integrated by an AFE capacitor (TiN/Hf0.2Zr0.8O2/TiN) and a MOSFET (see âMethodsâ for the details of fabrication processes), as shown in the inset of Fig. 1d. Figure 1d shows that the typical transfer characteristic of AFeFET exhibits volatility, which differs from that of nonvolatile FeFET at Vgâ=â0 (Supplementary Fig. 1).
The volatility of AFeFET emulates the self-recovery of biological neurons, which helps avoid external peripheral reset circuits. Furthermore, the AFeFET device is electroforming-free, which saves an additional high-voltage forming circuit. In order to investigate the LIF function of AFeFET, continuous gate pulses (100âμs width, 100âμs interval, 1.5âV amplitude) representing postsynaptic potentials are applied to the gate of the AFeFET. The corresponding drain current (Id) representing the membrane potential demonstrates replicable LIF behavior under gate pulse trains (Fig. 1e). The Id increases under the excitatory spike trains and the neuron fires when Id reaches a threshold (1âμA). After firing, Id decreases spontaneously in a millisecond of free time eventually, which means the AFeFET neuron recovers and gets prepared for the next firing. These merits make AFeFET suitable for emulating the integration and recovery process of neurons.
Device characteristics and mechanism of the AFeFET
In this work, the volatile characteristics of the AFeFET neuron are dominated by the composition of zirconium in HfxZr1-xO2 film. The HfxZr1-xO2 exhibits paraelectric-FE-AFE transition with increasing the concentration of zirconium element (0â100%) (Supplementary Fig. 2). Actually, HfO2 and ZrO2 exhibit very similar physical and chemical properties, such as crystal phases, lattice parameters except the dielectric properties. The pure HfO2 shows linear dielectric characteristics under electric field due to the centrosymmetric monoclinic structure36. The FE properties occur with the increasing zirconium content in doped HfO2, which is induced by the existence of a non-centrosymmetric o-phase structure. The hafnium-rich ferroelectric HfxZr1-xO2 oxides exhibit nonvolatility37,38,39, thus generally serve as memory materials. With further increasing the zirconium content, the volatile AFE properties occur in zirconium-rich HfxZr1-xO2 oxides. The polarization of AFE can be triggered by an electric field and increases under a higher electric field. But it can still revert to the initial state as the applied electric field is removed (Supplementary Fig. 3). Usually, the polarization of AFE can be ascribed to the phase transition from AFE to FE phase under the influence of electric field. The electric field-induced phase transition is always accompanied by a large-volume change40. When the electric field is released, the induced FE phase will recover to the AFE phase due to the strains resulting from volume expansion38,41,42. As a result, the zirconium-rich HfxZr1-xO2 oxides exhibit intrinsic volatility. This is the charm of AFE materials used for constructing artificial neurons.
Figure 2a, b shows the plane structure and the detailed cross-sectional image of the AFeFET neuron. According to these images, the structure of the AFeFET neuron can be observed clearly, in which an AFE capacitor (yellow square) integrates on the gate of a conventional MOSFET. The energy-dispersive X-ray spectroscopy (EDS) mapping and line scan EDS were performed to further identify the elements and structure of AFeFET (Fig. 2c, d). The thickness of TiN/HfxZr1-xO2/TiN is 40ânm/10ânm/40ânm, and the interfaces of all layers are clean and flat. In addition, the Hf, Zr, Ti, N, W elements distribute uniform and are free of inter-diffused. Then we focus on investigating the characteristics of AFE layer due to its dominant role in neuronal behavior. The composition of AFE layer is controlled by alternate deposition (one cycle HfO2 and four cycles ZrO2), and is confirmed (hafnium:zirconium â1:4) by the peak areas and the relative sensitivity factors in X-ray photoelectron spectroscopy (XPS) results (Supplementary Fig. 4). The high-resolution transmission electron microscopy (HRTEM) image for the details of the AFE films is presented in Fig. 2e. The polycrystalline nature and the lattice fringes of different crystals can be observed clearly. Figure 2f, g depicts the crystal structure and corresponding fast Fourier transform (FFT) image of the white square area in Fig. 2e. The relative angle and distance between two lattice planes and diffraction spots indicate the existence of [0-10]-oriented AFE tetragonal P42/nmc phase. In addition, the arrangement of zirconium atoms (green dots) is very regular, and the relative angle and lattice constants are measured directly as 55.6°, 3.49âà , and 3.2âà , respectively. These zirconium atoms parameters match the atomic model of [0-10]-oriented plane of tetragonal P42/nmc phase exactly in Supplementary Fig. 5. These results confirm the existence of t-phase in AFE films, which is the foundation of the AFeFET neuron.
In order to demonstrate the dynamic of AFeFET neuron, the mechanism is shown in Fig. 2h, which is related to the transformation between AFE and FE domains. At stage 1, several input pulses as postsynaptic signals are applied to the gate of AFeFET neuron, and the electric field-induced FE orthorhombic phase (o-phase) domains nucleate, which transform from AFE tetragonal phase (t-phase) domains under the gate pulse stimuli. The polarized charges accumulate in the AFE layer and modulate the channel resistance of MOSFET, resulting in the Id begins stepping up gradually. This process is just as the small portion of Na+ channels opening. With further applying gate pulses, it comes to stage 2, at which the electric field-induced FE o-phase domains grow and expand. As a result, more attracted electrons accumulate in the channel of AFeFET, and the Id increases greatly, corresponding more Na+ channels opening. Once the Id surpasses the threshold, the AFeFET neuron would fire. After firing, the electric field-induced FE o-phase domains transform back to AFE t-phase domains due to the release of gate pulse. Consequently, the attracted electrons discharge and the channel of AFeFET switches off, indicating the AFeFET neuron returns to resting potential (stage 3). This process corresponds to the opening of K+ channels in biological neurons. Then, the AFeFET neuron fires again under another gate pulse stimuli. This repeatable and stable electric field-induced phase transition accounts for the intrinsic neuronal resemblance of AFeFET. The atomic-scale phase transition between t-phase and o-phase under the influence of electric field has been observed clearly by Lombardo et al. via in situ HRTEM42.
To present the gradual electric field-induced phase transition of AFE t-phase, we investigate the tendencies of Id under continuous gate pulses. Figure 2i shows the tendencies of Id under different gate pulse amplitude, while the gate pulse interval and width are fixed to 100âμs, respectively. Under the first 20 continuous gate pulse stimuli, an obvious integration process of Id can be observed. This resulted from the gradual formation of electric field-induced o-phase, which induces the electrons accumulation in the channel of AFeFET. The gate pulses with larger amplitude result in more o-phase e domains formation and quicker growth of Id. With further gate pulse stimuli, the reversible domains tend to reach a saturation regime. This represents the dynamic balance between the electric field-induced phase transition and the recovery of the AFE t-phase, and the Id does not increase anymore. Compared between stimuli with different amplitudes, it is clearly that the higher pulse amplitude will lead to faster growth speed and a larger saturation value of Id, which is because that more AFE t-phase domains can be switched to FE o-phase domains. Noting that a similar tendency of Id can be observed under different gate pulse intervals and widths, as illustrated in Supplementary Fig. 6. Input stimuli pulses with shorter pulse intervals or wider widths induce faster integration speed and larger saturation value of Id. In all cases, the Id increases gradually and then tends to saturate corresponds to the gradual electric field-induced phase transition and saturation processes of AFE t-phase. In addition, the tendencies of Id also illustrate the intrinsic plasticity of neurons43, demonstrating that the AFeFET has high potentiality for hardware implementation of artificial LIF neurons.
Neuronal characteristics of the AFeFET
To investigate the strength-modulated integration process of AFeFET neurons, we apply gate pulse stimuli with different amplitudes and widths to implement LIF neuron functions, as shown in Fig. 3a, b. As the input pulse intensity (amplitude/width) increasing, the AFeFET neuron needs fewer input spikes to reach the threshold (1âμA), which indicates a higher firing rate under stronger stimuli strength. This is because more electric field-induced o-phases are formed under stronger pulse intensity, resulting in faster charge integration speed. Correspondingly, a higher Id of the AFeFET neuron needs longer time to leak (Supplementary Fig. 7), which could be clarified as adaptive recovery. To further study how the leaky behavior influences the integration process, we measure the Id under gate pulse stimuli with different intervals (50â600âμs), as shown in Fig. 3c. As the interval increasing, more input pulses are needed to integrate the Id to reach the threshold value. This is because that more charges are released during the free interval time, and more input pulses are required to compensate for that. It is worth noting that, when the interval time is wider enough, the Id cannot reach the threshold anyway. This feature represents the filtering capability of the neuron for weaker input signals, which is important in biological systems and neuromorphic systems44. To further evaluate the stability of the AFeFET neuron, we extract the statistical data of input spike numbers for firing as a function of input amplitudes, as shown in Fig. 3d. The firing event needs fewer input spikes and tends to be more stable as the stimuli intensity increasing. This phenomenon exhibits that the AFeFET neuron performs high-precision computation under enough stimuli intensity, which is favorable for performing high-precision tasks. A similar relationship between input spike numbers for firing and pulse widths (or intervals) is observed, as shown in Fig. 3e, f, respectively. To directly present the stability of the AFeFET neuron, we calculate the standard deviation (Ï) of integration pulse number under each stimuli condition and label them out in Fig. 3dâf. Furthermore, the cycle-to-cycle variation is calculated by dividing standard deviation (Ï) by mean value (μ). The lowest variation (3.93%) between cycles is obtained under 1.7âV gate pulse amplitude, 50-μs interval, and 100âμs width. It should be noted that optimizing the gate pulse parameters may further enhance the uniformity between cycles. These results demonstrate that the AFeFET neurons can successfully emulate the strength-modulated spike frequency characteristics of biological neurons with high stability, making the AFeFET neurons firstcapable of carrying out the classification tasks11,45.
To further access the compatibility of the AFeFET neuron for implementing unsupervised learning, we investigated the lateral inhibitory property. During the accumulation of AFeFET neuron membrane potential, the excitability will be inhibited immediately when the AFeFET neuron receives inhibitory stimuli from adjacent neurons, as shown in Supplementary Fig. 8a,b. Moreover, the AFeFET neuron needs more excitatory inputs for the next firing under stronger lateral inhibition intensity (Supplementary Fig. 8c). This behavior is similar to the suppressive phenomenon in biological neurons between each other, which is valuable for performing competitive learning tasks.
For artificial neurons, low-energy consumption, high endurance, and high reliability are critical merits. To investigate the energy consumption of the AFeFET neuron, we performed a systematic analysis under different input pulse parameters and threshold values, as shown in Fig. 3g (extracted from Supplementary Fig. 9). The lowest energy consumption of 37âfJ/per spike can be obtained under the 50ânA threshold, 1âμs pulse width, and 1-μs pulse interval. Furthermore, the energy consumption decreases remarkably as the threshold and pulse-width (interval) decrease. Thus, it is reasonable to infer that the energy consumption can be further reduced by decreasing the threshold and pulse width (interval). Moreover, the AFeFET neurons demonstrate considerable repeatability. Supplementary Fig. 10a shows 5âÃâ105 stable firing cycles of the AFeFET neuron without any significant deterioration. In order to speed up the measurement, the AFE MIM structure, which is the endurance bottleneck of AFeFET, is measured for higher endurance (1012 cycles) (Supplementary Fig. 10b). Based on the endurance measurements above, it is reasonable to believe the AFeFET neuron could support more than 1012 firing events (Fig. 3h). Figure 3i shows the histograms of input spike numbers for firing, which are collected from 100 firing activities of each AFeFET neuron. The required pulse number for firing is concentrated nearby 12, and the device-to-device variation is calculated to be as low as 7.57%. This variation is extracted from 500 firing activities in five AFeFET neurons. As we claimed before, the uniformity could be further enhanced by optimizing the gate pulse parameters. These results indicate that the AFeFET neuron has high uniformity and great potential in large-scale applications.
The basic integration and fire functionality of the proposed neuron can be achieved by only one AFeFET, while the examination of current threshold, the generation of output spike, and controllable refractory period need additional circuits as shown in Supplementary Fig. 11. The detail of this circuit design is described in the supporting information. For clearly presenting the comprehensive merits of our AFeFET neuron, a benchmark comparison with other typical neurons based on emerging devices is summarized in Table 1. Considering the energy consumption, endurance, and hardware overhead, which are the critical evaluation criteria of the artificial neurons, the AFeFET neuron exhibits attractive performances.
Network-level performance of AFeFET neuron
We have demonstrated that the AFeFET neurons can provide better energy efficiency and higher uniformity compared to the other neurons based on emerging devices. It is also essential to evaluate the network-level performance using the AFeFET neuron for the hardware implementation of SNNs. Subsequently, we construct a two-layer SNN (784âÃâ400âÃâ10) for classifying MNIST datasets, as shown in Fig. 4a. In this network, we adopt a time-to-first-spike coding method, in which all input neurons fire exactly one spike per stimulus, but the firing order carries information. A larger input corresponds to an earlier spike of the neuron. In the output layer, the first fired neuron determines the class of stimulus. As soon as one of the output neurons fires, the network assigns the corresponding category to the input, and the inference process stop. Thus, such a coding scheme is much more suitable for hardware implementation. The right panel of Fig. 4a presents the proposed hardware implementation of the network based on ferroelectric field-effect transistor (FeFET) synapses and AFeFET neurons. During inference, the input signal is applied to the drains of FeFET synapses (BLs), pulse-width modulators (PWMs) collect current on source lines (SLs) and convert to pulses with fixed amplitude and various widths. The outputs of PWMs serve as postsynaptic potentials and be applied to the gates of AFeFET neurons for performing the integration process. Then, we train such a network to learn MNIST datasets for illustrating the feasibility. During training, we adopt a supervised temporal backpropagation algorithm proposed by Kheradpisheh46. Figure 4b shows the training results with the AFeFET neurons under 1âμA threshold, achieving ~95% recognition accuracy. These results demonstrate that our AFeFET neurons have great potential to be used for fabricating SNNs chips.
During training, the threshold value determines the number of integrated inputs (the number of membrane states) and thus affects the network performance. Then, the relation between the threshold value and network performance is further investigated as shown in Fig. 4c. It should be noted that the recognition accuracy is related to the threshold, with the highest 96.8% accuracy under the 2âμA threshold, which is nearly identical to ideal IF neurons. The pulse number for firing is equivalent to the number of the membrane potential during training. When increasing the threshold, the number of potential membranes increases, corresponding to the increasing precision of the membrane potential. Thus, with increasing the threshold, the recognition accuracy increases. Nonetheless, the inference time increases with increasing the threshold value because more integration number is required to trigger neuron firing at higher threshold cases. Thus, there should be a trade-off between recognition accuracy and inference time, and an appropriate threshold value should be selected to balance the network performance in practical applications. Fortunately, the network can still achieve high accuracy (>86%) even at a threshold value low to 62.5ânA, which is favorable for applications that need faster inference time but not rigorous accuracy. As we claimed before, the threshold also affects the energy consumption of the neuron, which is the key parameter for SNNs chip applications. We extract the spike number and energy consumption of the neurons in the system with different training thresholds to evaluate this feature, as shown in Fig. 4d. When the threshold is higher than 0.125âμA, the spike number in the hidden layer (2nd layer) decreases with increasing the threshold. This is because the hidden neurons with a higher threshold are hard to fire. On the contrary, we observe that the total energy consumption of neurons decreases as the threshold decreases. When the threshold reduces to 62.5ânA, the spike number abruptly decreases to be less than 100. This is because under a low threshold value, the winner neuron in the output layer fires earlier, and the network could finish the inference process faster. In that case, only a minority of neurons in the hidden layers fire, and thus the total neurons consume less energy. These results demonstrate that decreasing the threshold supports fast inference speed and could greatly decrease the energy consumption of hardware neurons. In addition, device variation is another important parameter in practical applications. Figure 4e presents the inference accuracy under various device variations after training under the 1âμA threshold. As the red dots are shown, the inference accuracy only decreases 1% even the variation increases to ±16.7%, illustrating nearly no network performance degradation. These results further demonstrate that the proposed AFeFET neuron is suitable for performing SNNs tasks and has great potential for the hardware implementation of SNNs chips.
Discussion
SNNs, inspired by the human brain, are powerful platforms for enabling low-power event-driven neuromorphic hardware. In SNNs, spiking neurons are the key units that enable spikes, which exchange information through connected plastic synapses. With rich physical dynamics, memristive devices are considered promising devices to emulate spiking neurons. However, the high-energy consumption or limited reliability hinders the applications of memristive neurons in neuromorphic computing.
In this work, we demonstrated a leaky integrate-and-fire neuron based on an AFeFET. The dynamic relationship between the intrinsic polarization/depolarization process of the Hf0.2Zr0.8O2 AFeFET and integrate/leaky neuronal functions are successfully built. The AFeFET neuron features CMOS-compatible, tunable firing frequency, ultra-low hardware cost (no capacitance and additional reset circuit), ultra-low-energy consumption (37âfJ/spike), high endurance (>1012), and high uniformity among different cycles and devices, showing advanced overall performances compared with emerging devices-based neurons in literature. To verify the feasibility of the neuron, we constructed a two-layer SNN combined with FeFET synapses, achieving high recognition accuracy (96.8%), low-energy consumption, and high robustness on MNIST datasets. These results demonstrate that the AFeFET neuron is a promising candidate for constructing high-efficient SNN systems and may promote the industrial landing of neuromorphic machines based on anti-ferroelectric materials.
Methods
Sample fabrication
The fabrication processes of AFeFET neuron devices are as follows: (1) After ultraviolet lithography and lift-off process, the bottom electrode TiN (40ânm) was deposited on the gate terminal of the NMOS transistor by ion beam sputtering. The NMOS transistor was fabricated by 0.18âμm CMOS technology. The W/L of NMOS is 10âμm/1âμm and its dielectric thickness is 4ânm. (2) Then, 10ânm Hf0.2Zr0.8O2 AFE thin films were deposited on 40ânm-thick TiN bottom electrode by atomic layer deposition (ALD) process at 280â°C substrate temperature. The Hf[N(C2H5)CH3]4, Zr[N(C2H5)CH3]4, and H2O were used as hafnium precursor, Zr precursor and oxygen source, respectively. The hafnium/zirconium ratio was controlled by alternate deposition of one cycle HfO2 and four cycles ZrO2. (3) Then after the ultraviolet lithography process, 40ânm-thick TiN was grown by an ion beam top electrode was released. The two-terminal metal-insulatorâmetal structure was integrated on the gate of the NMOS transistor. (4) The fabricated device was annealed for 30âs at 500â°C in nitrogen atmosphere to crystallize.
Measurement method
The element ratio is confirmed by X-ray photoelectron spectroscopy (ESCALAB 250Xi). The cross-section TEM, high-resolution TEM, energy-dispersive spectroscopy and crystal structure were analyzed by transmission electron microscopy (FEI Tecnai TF-20, UK). The DC mode is measured by Agilent B1500 semiconductor parameter analyzer. A B1530A fast measurement unit module was used for generating the voltage pulse and measure the response current at the same time. Capacitance-electric field (C-E) tests are performed with 10âkHz AC probing frequency and 30âmV amplitude by Agilent B1500.
SNN simulation method
In this work, we use time-to-first-spike time coding to encode the input image into a sparse spike train. Each pixel of the input image is encoded into a single spike whose spiking time is inversely proportional to its pixel value. The dense input corresponds to earlier spiking time. And each input pixel will only generate one spike, resulting in sparser spike train than the rate coding method and significantly reducing energy consumption in hardware implementation. We constructed a 784âÃâ400âÃâ10 fully ferroelectric SNN for MNIST recognition based on such a coding method. FeFET synapses were considered during training, whose conductance was between 5âμS and 60âμS according to the experimental data in ref. 47. The pulse number (64) for firing under 10âμs pulse width is used due to the highest number counts that correspond to the bits number (6 bits) of the neuronâs membrane potentials. The 9% cycle-to-cycle variation is extracted from the statistical data of 10âμs pulse width in Fig. 3e. In addition, the leaky time constant (800âμs) is extracted from the integrate-and-fire process under 10âμs pulse width (Supplementary Fig. 7b). The energy consumption was calculated according to the data of 10âμs pulse width in Fig. 3g. When the neuron reaches the firing threshold, it will emit a spike to the subsequent layer. After emitting a spike, neurons will remain resting state until the end of the time window. In the output layer, the first spiking neuron determines the network decision. Before training, the synapse weights are initialized randomly. In the training process, the target firing time of the correct output neuron is the earliest time that all neurons fire, and the target firing time of other neurons is set to be later than the earliest firing time. According to the defined error function, synapses of the fired neuron before the actual firing time will be updated. In order to update the weights of the hidden layer, a backpropagation algorithm is used to calculate the error of the hidden layer46. During training, all time steps need to calculate the error function. In the inference process, the recognition result is obtained when the first spike is generated in the output layer, so there is no need to perform the later time step. Fewer time steps mean lower recognition latency and less energy consumption. By adjusting the neuronâs threshold, the spike generation time can be adjusted, resulting in the adjustable recognition speed and energy consumption with acceptable accuracy loss.
Data availability
All data needed to evaluate the conclusions in the paper are present in the paper and/or the Supplementary Materials. Additional data related to this paper can be requested from the authors. Source data are provided with this paper.
Code availability
The code of SNN simulation will be available from the corresponding authors upon request.
References
Silver, D. et al. Mastering the game of Go with deep neural networks and tree search. Nature 529, 484â489 (2016).
Zhang, W. et al. Neuro-inspired computing chips. Nat. Electron. 3, 371â382 (2020).
Merolla, P. A. et al. A million spiking-neuron integrated circuit with a scalable communication network and interface. Science 345, 668â673 (2014).
Kandel, E. R. et al. Principles of Neural Science (McGraw-Hill, 2000).
Zhu, J. et al. A comprehensive review on emerging artificial neuromorphic devices. Appl. Phys. Rev. 7, 011312 (2020).
Wang, Z. et al. Resistive switching materials for information processing. Nat. Rev. Mater. 5, 173â195 (2020).
Indiveri, G. et al. A VLSI array of low-power spiking neurons and bistable synapses with spike-timing dependent plasticity. IEEE Trans. Neural Netw. 17, 211â221 (2006).
Indiveri, G. et al. Neuromorphic silicon neuron circuits. Front. Neurosci. 5, 73 (2011).
Sourikopoulos, I. et al. A 4-fJ/spike artificial neuron in 65 nm CMOS technology. Front. Neurosci. 11, 123 (2017).
Wang, Z. et al. Fully memristive neural networks for pattern classification with unsupervised learning. Nat. Electron. 1, 137â145 (2018).
Zhang, X. et al. An artificial neuron based on a threshold switching memristor. IEEE Electron Device Lett. 39, 308â311 (2018).
Wang, Z. et al. Capacitive neural network with neuro-transistors. Nat. Commun. 9, 3208 (2018).
Lee, D. et al. Various threshold switching devices for integrate and fire neuron applications. Adv. Electron. Mater. 5, 1800866 (2019).
Zhang, X. et al. Experimental demonstration of conversion-based SNNs with 1âT1R Mott neurons for neuromorphic Inference. In 2019 IEEE International Electron Devices Meeting (IEDM) 6.7.1â6.7.4 (IEEE, 2019). https://ieeexplore.ieee.org/document/8993519.
Bo, Y. et al. NbO2 memristive neurons for burst-based perceptron. Adv. Intell. Syst. 2, 2000066 (2020).
Jerry, M. et al. Ultra-low power probabilistic IMT neurons for stochastic sampling machines. In 2017 Symposium on VLSI Technology T186âT187 (IEEE, 2017). https://ieeexplore.ieee.org/document/7998148.
Yi, W. et al. Biological plausibility and stochasticity in scalable VO2 active memristor neurons. Nat. Commun. 9, 4661 (2018).
Stoliar, P. et al. A leaky-integrate-and-fire neuron analog realized with a Mott insulator. Adv. Funct. Mater. 27, 1604740 (2017).
Pickett, M. D. et al. A scalable neuristor built with Mott memristors. Nat. Mater. 12, 114â117 (2013).
Tuma, T. et al. Stochastic phase-change neurons. Nat. Nanotechnol. 11, 693â699 (2016).
Cobley, R. A. et al. A self-resetting spiking phase-change neuron. Nanotechnology 29, 195202 (2018).
Wright, C. D. et al. Beyond von-Neumann computing with nanoscale phase-change memory devices. Adv. Funct. Mater. 23, 2248â2254 (2012).
Sengupta, A. et al. Magnetic tunnel junction mimics stochastic cortical spiking neurons. Sci. Rep. 6, 30039 (2016).
Wu, M.-H. et al. Extremely compact integrate-and-fire STT-MRAM neuron: a pathway toward all-spin artificial deep neural network. In 2019 Symposium on VLSI Technology T34âT35 (IEEE, 2019). https://ieeexplore.ieee.org/document/8776569.
Li, Y. et al. Anomalous resistive switching in memristors based on two-dimensional palladium diselenide using heterophase grain boundaries. Nat. Electron. 4, 348â356 (2021).
Lanza, M. et al. Memristive technologies for data storage, computation, encryption, and radio-frequency communication. Science 376, eabj9979 (2022).
Wu, Z. et al. A habituation sensory nervous system with memristors. Adv. Mater. 32, e2004398 (2020).
Dutta, S. et al. Biologically plausible ferroelectric quasi-leaky integrate and fire neuron. In 2019 Symposium on VLSI Technology T140âT141 (IEEE, 2019). https://ieeexplore.ieee.org/abstract/document/8776487.
Dutta, S. et al. Supervised learning in all FeFET-based spiking neural network: opportunities and challenges. Front. Neurosci. 14, 634 (2020).
Wang, Z. et al. Experimental demonstration of ferroelectric spiking neurons for unsupervised clustering. in 2018 IEEE International Electron Devices Meeting (IEDM) 13.13.11â13.13.14 (IEEE, 2018).
Chen, C. et al. Bio-inspired neurons based on novel leaky-FeFET with ultra-low hardware cost and advanced functionality for all-ferroelectric neural network. In 2019 Symposium on VLSI Technology T136âT137 (IEEE, 2019). https://ieeexplore.ieee.org/abstract/document/8776495.
Luo, J. et al. Capacitor-less stochastic leaky-FeFET neuron of both excitatory and inhibitory connections for SNN with reduced hardware cost. in 2019 IEEE International Electron Devices Meeting (IEDM) 6.4.1â6.4.4 (IEEE, 2019).
Mulaosmanovic, H. et al. Ferroelectric field-effect transistors based on HfO2: a review. Nanotechnology 32, 502002 (2021).
Salahuddin, S. et al. The era of hyper-scaling in electronics. Nat. Electron. 1, 442â450 (2018).
Hao, X. et al. A comprehensive review on the progress of lead zirconate-based antiferroelectric materials. Prog. Mater. Sci. 63, 1â57 (2014).
Sang, X. et al. On the structural origins of ferroelectricity in HfO2 thin films. Appl. Phys. Lett. 106, 162905 (2015).
Muller, J. et al. Ferroelectricity in simple binary ZrO2 and HfO2. Nano Lett. 12, 4318â4323 (2012).
Park, M. H. et al. Ferroelectricity and antiferroelectricity of doped thin HfO2-based films. Adv. Mater. 27, 1811â1831 (2015).
Müller, J. et al. Ferroelectric Zr0.5Hf0.5O2 thin films for nonvolatile memory applications. Appl. Phys. Lett. 99, 112901 (2011).
Zhang, S.-T. et al. High-strain lead-free antiferroelectric electrostrictors. Adv. Mater. 21, 4716â4720 (2009).
Pan, W. et al. Field-forced antiferroelectric-to-ferroelectric switching in modified lead zirconate titanate stannate ceramics. J. Am. Ceram. Soc. 72, 571â578 (1989).
Lombardo, S. et al. Atomic-scale imaging of polarization switching in an (anti-)ferroelectric memory material: zirconia (ZrO2). In 2020 Symposium on VLSI Technology 1â2 (IEEE, 2020).
Baek, E. et al. Intrinsic plasticity of silicon nanowire neurotransistors for dynamic memory and learning functions. Nat. Electron. 3, 398â408 (2020).
Khan, A. I. et al. The future of ferroelectric field-effect transistor technology. Nat. Electron. 3, 588â597 (2020).
Burkitt, A. N. A review of the integrate-and-fire neuron model: I. Homogeneous synaptic input. Biol. Cybern. 95, 1â19 (2006).
Kheradpisheh, S. R. & Masquelier, T. Temporal backpropagation for spiking neural networks with one spike per neuron. Int. J. Neural Syst. 30, 2050027 (2020).
Jerry, M. et al. Ferroelectric FET analog synapse for acceleration of deep neural network training. in 2017 IEEE International Electron Devices Meeting (IEDM) 6.2.1â6.2.4 (IEEE, 2017).
Li, Y. et al. High-uniformity threshold switching HfO2-based selectors with patterned Ag nanodots. Adv. Sci. 7, 2002251 (2020).
Song, J. et al. Effects of liner thickness on the reliability of AgTe/TiO2-based threshold switching devices. IEEE Trans. Electron Dev. 64, 4763â4767 (2017).
Jeonghwan, S. et al. Threshold selector with high selectivity and steep slope for cross-point memory array. IEEE Electron Device Lett. 36, 681â683 (2015).
Grisafe, B. et al. Performance enhancement of Ag/HfO2 metal ion threshold switch cross-point selectors. IEEE Electron Device Lett. 40, 1602â1605 (2019).
Chen, A. et al. Multiâfunctional controllable memory devices applied for 3D integration based on a single niobium oxide layer. Adv. Electron. Mater. 6, 1900756 (2019).
Lin, C.-Y. et al. A high-speed MIM resistive memory cell with an inherent vanadium selector. Appl. Mater. Today 21, 100848 (2020).
Kang, M. & Son, J. Off-state current reduction in NbO2-based selector device by using TiO2 tunneling barrier as an oxygen scavenger. Appl. Phys. Lett. 109, 202101 (2016).
Sebastian, A. et al. Computational phase-change memory: beyond von Neumann computing. J. Phys. D: Appl. Phys. 52, 443002 (2019).
Dorrance, R. et al. Scalability and design-space analysis of a 1T-1MTJ memory cell for STT-RAMs. IEEE Trans. Electron Dev. 59, 878â887 (2012).
Vincent, A. F. et al. Spin-transfer torque magnetic memory as a stochastic memristive synapse for neuromorphic systems. IEEE Trans. Biomed. Circuits Syst. 9, 166â174 (2015).
Wang, K. et al. Reconfigurable codesign of STT-MRAM under process variations in deeply scaled technology. IEEE Trans. Electron Dev. 62, 1769â1777 (2015).
Cao, R. et al. Effects of capping electrode on ferroelectric properties of Hf0.5Zr0.5O2 thin films. IEEE Electron Device Lett. 39, 1207â1210 (2018).
Sun, C. et al. First demonstration of BEOL-compatible ferroelectric TCAM featuring a-IGZO Fe-TFTs with large memory window of 2.9 V, scaled channel length of 40 nm, and high endurance of 108 cycles. in 2021 Symposium on VLSI Technology 1â2 (IEEE, 2021).
Acknowledgements
This work was supported by the National Key R&D Program of China under Grant No. 2018YFA0701500, 2019YFB2205102, the National Natural Science Foundation of China under Grant Nos. 62004220, 62004219, 61974164, 62074166, 61804181, 61825404, 61888102, 62104044, 61732020, 61821091, 62104256, and 61851402, the Strategic Priority Research Program of the Chinese Academy of Sciences under Grant XDB44000000 and the China Postdoctoral Science Foundation under Grant No. 2020M673696. The authors thank the valuable discussion from W. Wang, X. Li with National University of Defense Technology, Y. Li and Z.H. Wu with University of Chinese Academy of Sciences.
Author information
Authors and Affiliations
Contributions
R.R.C. and X.M.Z. designed the experiments. R.R.C., X.M.Z., and Y.Z.S. designed and fabricated the AFeFET devices. R.R.C. carried out the electrical experiments. X.M.Z., J.K.L., and Y.Z.W. conducted the simulation. R.R.C., Y.Y., and W.W. carried out the TEM and EDS tests. S.L., H.J., H.X., J.L.W., and Q.L. assisted with data analysis and interpretation. R.R.C., X.M.Z., and S.L. co-wrote the manuscript. All authors discussed the results and revised the manuscript. Q.J.L. and Q.L. supervised the research.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Jang-Sik Lee, Stefan Slesazeck and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisherâs note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleâs Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Cao, R., Zhang, X., Liu, S. et al. Compact artificial neuron based on anti-ferroelectric transistor. Nat Commun 13, 7018 (2022). https://doi.org/10.1038/s41467-022-34774-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-022-34774-9
This article is cited by
-
Realization of sextuple polarization states and interstate switching in antiferroelectric CuInP2S6
Nature Communications (2024)
-
Recent progress of hafnium oxide-based ferroelectric devices for advanced circuit applications
Science China Information Sciences (2023)