Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2023 Feb 17;4(1):14. doi: 10.1038/s43246-023-00342-x

A multi-timescale synaptic weight based on ferroelectric hafnium zirconium oxide

Mattia Halter 1,2,3,, Laura Bégon-Lours 1, Marilyne Sousa 1, Youri Popoff 1,2, Ute Drechsler 1, Valeria Bragaglia 1, Bert Jan Offrein 1
PMCID: PMC9936949  PMID: 36843629

Abstract

Brain-inspired computing emerged as a forefront technology to harness the growing amount of data generated in an increasingly connected society. The complex dynamics involving short- and long-term memory are key to the undisputed performance of biological neural networks. Here, we report on sub-µm-sized artificial synaptic weights exploiting a combination of a ferroelectric space charge effect and oxidation state modulation in the oxide channel of a ferroelectric field effect transistor. They lead to a quasi-continuous resistance tuning of the synapse by a factor of 60 and a fine-grained weight update of more than 200 resistance values. We leverage a fast, saturating ferroelectric effect and a slow, ionic drift and diffusion process to engineer a multi-timescale artificial synapse. Our device demonstrates an endurance of more than 1010 cycles, a ferroelectric retention of more than 10 years, and various types of volatility behavior on distinct timescales, making it well suited for neuromorphic and cognitive computing.

Subject terms: Electronic devices, Electronic devices, Computational science


Brain-inspired neuromorphic computing is a key technology for processing an ever-growing amount of data. Here, an artificial synapse with dual resistance modulation mechanisms is demonstrated, achieving a dynamic range of 60, an endurance exceeding 1010 cycles, and more than 10 years of retention.

Introduction

The amount of data created during the last thirty years (~320 ZB) compares with what will be created during the next three years (~364 ZB projected for 2022–2024), a trend also accelerated by the covid-19 pandemic13. A considerable amount will be created by the rapidly growing Internet of Things (IoT)4, which connects the physical world and computing entities. The development of sensors and actuators that connect to the internet, comes in pair with the emergence of Artificial Neural Networks (ANNs) for data processing. The conventional von-Neumann architecture cannot sustain such evolution, because of the energy and performance bottleneck coming from the massive data movement between the physically separated memory and processing units. Novel processing architectures, device technologies, and computational paradigms have therefore recently emerged. In-memory computing5 co-locates memory and processing, eliminating inefficient data movement, which is especially beneficial for ANN training by data-intense machine-learning algorithms. In addition to ANNs, computing systems like Spiking Neural Networks (SNNs), which mimic the type of information processing in the human brain, promise low-power computation and dynamic learning in the context of complex data6. A major challenge for SNNs remains in supporting operations under a wide range of effective timescales7. This requirement arises from the need to adapt the computation to the input timescale in real-time online applications (e.g., real-world sensory signals) and because multi-timescales are inherent in spiking neurodynamics (e.g., neural activation decay, combination of short- and long-term plasticity mechanisms79). To emulate the complex and multi-timescale plasticity processes of biological synapses7, physical effects acting at different timescales must be orchestrated in an artificial synapse9. Being able to tune the timescale (tunable volatility) in SNNs would bring the ability to mimic many of the basic processing and storage operations of the mammalian brain10 and facilitate reservoir computing or unsupervised learning11.

In ANNs and SNNs, analog non-volatile resistive memory elements, so-called memristors, are used to emulate synaptic functionality, to locally store network parameters12,13, and to serve as analog computing element14. Lately, multi-terminal memristive devices have gained a lot of attention due to the ability of using extra terminals to tune their switching dynamics (volatility, plasticity timescales), opening opportunities to implement advanced bio-inspired learning rules. For example, reward-modulated spike-timing-dependent plasticity was demonstrated in a 4-terminal cell based on ferroelectric P(VDFTrFE)15. The concept of multi-terminal can also be extended to combining electrical and light spikes to control the synapse plasticity16. Hafnia-based technologies1719 are CMOS-compatible; the low-power and multi-state nature of hafnia-based Ferroelectric Field-Effect Transistors (FeFETs) makes them a viable candidate for the development of a multi-timescale synaptic element in neuromorphic circuits. FeFETs2022 have the advantage over two-terminal devices of separating the read and write path23. On one hand, this permits to write to a high-impedance gate with low-power. On the other hand, the reading current flows through the channel that can be engineered to be ohmic24.

Scaled FeFETs nevertheless suffer from a finite number of conductance levels21 due to the discrete number of ferroelectric domains available in each device. Integrating the FeFETs in the Back-End-Of-Line (BEOL) leaves more area for the control electronics while at the same time it enables analog weight updates through relaxed size constraints2426.

In this work we present a BEOL-compatible, back-gated FeFET utilizing a 10 nm thick HfZrO4 (HZO) ferroelectric gate dielectric and a 4 nm tungsten oxide (WOx) thin-film channel. The screening of the ferroelectric polarization charges of HZO results in an accumulation or depletion of electrons in the WOx channel, effectively changing its resistivity. Beyond the absence of interfacial dielectric layers (which limit the performance of FeFETs with doped-Si channels27), oxide channels are of interest for the intrinsic presence of slow ionic drift and diffusion mechanisms, and hence are viable candidates for a slow plasticity timescale effect. WOx was chosen due to its relatively mobile oxygen-ions and thereby tunable conductivity as a function of oxygen content28,29. Furthermore, the junction less contacts to the WOx permit a ohmic conduction between source and drain24.

We engineered our device by using a channel with the appropriate stoichiometry and by having it in direct contact with HZO. In the first section, we capture how these measures facilitate a resistance modulation of the channel when the device is programmed with different timescales: on one hand the resistance modulation is driven by fast electronic screening of the HZO polarization charges, and on the other hand by the slow ionic drift and diffusion process of oxygen in the WOx. In the second section, we characterize the synaptic weight and provide an evaluation of the proposed technology’s performance when implemented in an ANN.

Results and discussion

Artificial synaptic weight structure

We developed a back-gated FeFET with two metal lines (M1, M2) for the Gate (G) and two (M3, M4) for the Source (S) and Drain (D) to establish a three-terminal device configuration. The FeFET gate stack is made of a TiN contact and a 10 nm HZO ferroelectric gate dielectric. The thin-film channel consists of a 4 nm WOx layer on top of the HZO. The S and D contacts are made of Pt/W and the passivation layer above the WOx consists of an Al2O3 and a SiO2 layer. In our design, the gate fully overlaps with S and D to avoid large topographic steps beneath the channel. The device schematic is depicted in Fig. 1a. In this work, single, stand-alone devices are studied. In view of their integration in passive pseudo-crossbar circuits, the geometry of the channel is designed to optimize the channel resistance (we refer to Supplementary Note 1 where the pseudo-crossbar operation is further discussed). For the integration in active pseudo-crossbar circuits (as proposed for example by Jerry et al.20), the devices would be embedded in at least four metal levels. Therefore, within our stand-alone device processing scheme, we include these challenges inherent to the complexity of the active crossbar realization.

Fig. 1. Structural data of the FeFET.

Fig. 1

a Schematic illustration of the FeFET, indicating a source (S), a drain (D), a gate (G), a WOx channel, a ferroelectric HZO gate dielectric, G access metal line (M1, M2), S access metal line (M3), and D access metal line (M4). b Channel length effect: The dynamic range of 120 devices with a channel width of 600 nm and a varying channel length from 300 nm to 2 μm. Write pulses with an amplitude (Vw) from −6 V to 6 V and width (tw) of 500 μs were applied. The same data for (tw = 500 ms are available in Supplementary Fig. 6a. The boxes extend from the lower to upper quartile values of the data, with a line at the median. The whiskers extend from the box to show the range of the data. Flier points are those past the end of the whiskers.

The structural analysis of our device material stack is described in Supplementary Note 2. Grazing-Incidence X-Ray Diffraction GIXRD scans confirm the absence of the monoclinic phase, the peak at 30.7 corresponds to the ferroelectric orthorhombic phase of HZO30. Bright-field Scanning Transmission Electron Microscopy (BF-STEM) analysis confirms the expected layer thickness. Energy-dispersive X-ray spectroscopy reveals the diffusion of Al into the WOx layer, a metal expected to promote the reduction of WOx31.

In addition to the ferroelectric field-effect, this work exploits the high mobility of oxygen in WOx, as observed during reduction anneals32 and in WOx nanowires33. More generally, WOx was chosen as a channel for its electron-doping ability through the removal of oxygen34,35. In this work, the as-deposited WOx was close to a stoichiometry of x=3 (insulating). We then leverage the mobility of oxygen: during the deposition of the passivation layers (Al2O3 and SiO2), the WOx was slightly reduced (consistently with the Al diffusion found earlier). The resulting mobility and carrier concentration are suitable for a large channel conductance modulation by the polarization charges via Coulomb coupling29: the screening of the HZO polarization charges in the WOx occurs over a distance comparable to the channel thickness24. A 2D time-dependent Ginzburg-Landau model of the HZO/WOx FeFET36 provided guidelines for the optimization of the channel thickness and stoichiometry. Moreover, the resulting channel resistances in the MΩ to GΩ range are well suited for energy-efficient and scalable crossbar array operation37.

The capacitance, resistance, and polarization characterization of the gate stack are found in the Supplementary Notes 3 and 4.

Switching components and timescales

Potentiation (depression) cycles on 120 devices with constant channel width (Wch = 600 nm) and a channel length varying from Lch = 300 nm to 2 μm were measured by applying write pulses with an amplitude (Vw) up to 6 V (−6 V) and a width (tw) of 500 μs. In this work, the device is programmed symmetrically, i.e., by applying Vw on the gate while grounding both the S and D contacts. The dynamic range (DR) is defined as the ratio of the source-to-drain resistance (RSD) in the high resistive state (HRS) to the low resistive state (LRS). First, the influence of the channel length on the DR is examined by performing a statistical analysis on 120 devices to average out variations due to local fabrication inhomogeneities (see further discussion in section “Characteristics for ANNs”). The average DR and the distribution are represented as a function of Lch in Fig. 1b. A detailed analysis of the DR dependence on Lch is found in the Supplementary Note 5. We found that the two components of the source-to-drain resistance RSD=Rch+2Rc, the channel resistance Rch and the contact resistance Rc, are modulated to a different extend by the write pulses. While Rc displays a modulation of DRRc=8.4, the dynamic range of the channel DRRCh=2.8 is three times less. Consequently, the total DR in Fig. 1b, that includes both contributions, decreases for longer channels as the relative contribution of DRRCh increases. Moreover, an overall increase of the DR by a factor of ~2.8 was observed by increasing tw to 500 ms, a first indication of another RSD modulation than caused by the polarization switching at short tw (500 μs).

The influence of tw on the dynamic range was further analyzed by performing a sequence of 20 constant amplitude potentiation (Vw=6V) and depression (Vw=6V) cycles with changing tw from 10 ns to 1 s (Fig. 2a). In Fig. 2b, the same 20 potentiation/depression cycles are superposed, to highlight the low cycle-to-cycle variability and better visualize the potentiation and depression shape. A dynamic range of almost 60 was reached (average of 20 cycles), a considerable increase as compared to the state-of-the-art oxide-channel FeFETs24,25. The RSD shows a strong dependence on tw: there are two different regimes for both the potentiation and depression. For short pulses (tw > 3 μs, Vw=6V) a steep depression is observed, followed by a less steep change (tw > 3 μs, Vw=6V) that does not saturate up to the maximum tw of 1 s. In the long pulse regime, cumulative switching is observed: repeating the same pulse (e.g., tw = 100 ms, Vw=6V) 100 times increased RSD after the first 10 cycles still by a factor 2 (Supplementary Note 6). The absence of a saturation and a change of slope indicates that the resistance modulation for tw > 3 μs is based on an additional and slower physical process with respect to the previous regime (tw > 3 μs): the first regime (tw > 3 μs) is attributed to the ferroelectric switching as it is known to occur at very fast timescales38. In the second regime (tw > 3 μs), the energy of each pulse was potentially large enough to additionally enable oxygen migration33,39 between HZO and WOx. The oxidation or reduction of the WOx channel in turn adds to the modulation of RSD.

Fig. 2. Potentiation and depression cycles with a constant pulse amplitude scheme.

Fig. 2

The write pulse width tw was modulated from 10 ns to 1 s while keeping the write pulse amplitude Vw constant at 6 V (−6 V) for the potentiation (depression) on a FeFET with L = 400 nm and W = 1 μm. a Source-drain resistance as a function of the pulse number for 20 consecutive cycles. On the bottom, the corresponding duration tw of each pulse is provided. b Superposition of the data points shown in a) and represented as a function of the pulse number in each cycle, to visualize the small cycle-to-cycle variations and potentiation and depression shape.

The underlying conduction mechanisms of the two regimes are discussed next. Several electrode-limited and bulk-limited conduction mechanisms depend on temperature in different ways40. Temperature-dependent IDSVDS measurements of the channel were performed (20° C–60 °C) after programming the device in the LRS (Vw = 6 V) and in the HRS (Vw=6V). From Fig. 2b, we observe that fast, ferroelectric effects occur already from pulse widths of 100 ns. The onset of the change of regime is observed for pulses of 3 μs. To reveal that there are indeed two different mechanisms to modulate the resistance, the effect of the temperature on the DR was investigated at different timescales. Write pulses of tw = 500 μs (where the ferroelectric effect is fully saturated and oxygen migration just starts) and longer write pulse trains of 90 · tw = 100 ms (to enhance the oxygen migration) were applied.

No difference in the conduction mechanism between the two timescales was observed, as discussed in Supplementary Note 7. Both the LRS and HRS show a linear ID-VDS characteristic and are best fitted with the ohmic conduction model41.

In contrast, comparing the DR as a function of temperature for the two timescales (Fig. 3a, b) displays a clear difference. For the pulses where the ferroelectric effect dominates (tw = 500 μs), the DR decreases with increasing temperature. This moderate effect can be attributed to a phase transition from the orthorhombic ferroelectric phase of HZO to its tetragonal, anti-ferroelectric phase at elevated temperatures, as observed in ZrO242 and Si:HfO243. From 50° C–60 °C, an increase in the dynamic range is observed, indicating an additional mechanism that modulates the resistance becoming dominant. This is explained by the chosen pulse width of tw = 500 μs that ensures saturation of the polarization but is also large enough for the second effect to become noticeable (predicted by Fig. 2b for pulses above 3 µs). In contrast, for the slower timescale, an increase in the dynamic range with temperature is observed in the whole temperature range and is one to two orders of magnitude larger. From 50° C–60 °C, the relative increase in the dynamic range is the highest. The temperature range was limited to 20° C–60 °C since above, irreversible changes of the gate were observed for some devices when applying a bias of 6 V. These observations are consistent with increased oxygen-ion mobility at elevated temperatures, which facilitates the channels oxidation or reduction. In this scenario, the observed increase of the DR for 500 μs pulses above 50° C suggests a dual dependency of the oxygen-ion mobility on the temperature and on the pulse duration. The different types of behavior with temperature confirm the presence of two effects modulating our channel resistance. A schematic diagram illustrating the two effects can be found in Supplementary Fig. 10.

Fig. 3. Temperature-dependent measurements.

Fig. 3

Dynamic range modulation with temperature for a tw = 500 μs and b longer write pulse trains of 90 · tw = 100 ms. c Retention measurements at elevated temperature (85 °C) for long set pulses tw = 500 ms.

Retention measurements at elevated temperatures (85 °C) and with long write pulses (tw = 500 ms display a DR of almost three orders of magnitude, a direct consequence of the aforementioned increased oxygen-ion mobility (Fig. 3c). The larger the resistance change obtained within this retention study, the faster (hours to minutes) it drifted back. The improved retention of the intermediate states compared to the extreme states rules out the effect of a depolarization field, as observed by Muller et al. in TiN/SiHfO2/TiN capacitors44. Furthermore, the observation that both the LRS and HRS relax towards intermediate resistances confirm that the dominant mechanism is not a depolarization field due to a limited number of carriers in the WOx. Therefore, a resistance modulation by displacing oxygen-ions will eventually relax back to equilibrium by ion-diffusion processes, introducing the possibility to tune the plasticity timescale by controlling the oxygen-ion mobility. The more consecutive pulses are applied, the larger the resistance change and the longer the relaxation back to a stable long-term state. This characteristic can be used to implement short-term plasticity45. In dedicated neuromorphic hardware, the above-mentioned relaxation occurs on long timescales (seconds, minutes, hours): this should then also hold for the stimuli or events processed by the network, as for example in the application field of health monitoring or autonomous driving. At shorter timescales, our ferroelectric synaptic functionalities ascribed to the existence of a threshold voltage for domain switching (coercive field), makes our devices good candidates for implementation in spiking neural networks. For example, spike-timing-dependent plasticity (STDP) was reported in perovskite46 and hafnia47 based thin films. Finally, the gradual and field-driven switching of the ferroelectric domains is well suited for implementing voltage-based STDP models4850. In recent work, Garg et al.51 showed that such unsupervised, voltage-dependent-plasticity learning rules outperforms STDP in classification tasks.

Building on this finding of multiple timescales and physical locations where the RSD modulation occurs, we envision adding a 4th electrode with a non-ferroelectric gate on top of the WOx dielectric to further oxidize or reduce the channel without affecting the ferroelectric state. This additional terminal provides a volatile resistance component to tune the switching dynamics, similarly to52. Having such an additional volatile component permits the implementation of a wide range of neuromorphic engineering paradigms53.

Characteristics for ANNs

In contrast to SNNs, where a tunable plasticity is desired, the efficient operation of ANNs on analog memristive crossbar arrays requires artificial synapses that behave as long-term memory with long data retention and low variability. Figure 4a shows a single potentiation and (depression) cycle of RSD by increasing (decreasing) Vw from 0 V to 6 V (−0.6 V to −6 V) in 25 mV steps. The programming and RSD measurement protocols are described in the section “Methods, Electrical characterization”; for a sampling bias of 200 mV, the conduction in the channel is Ohmic. The pulse duration tw was kept constant at 500 μs. The 241 (217) steps for the potentiation (depression) exhibit a quasi-continuous resistance range with a monotonic change of the resistance and a DR of 16. The Cycle-to-Cycle Variability (CtCV) was analyzed by performing 40 sub-range cycles of depression (−0.6 V to −5 V) and potentiation (−1.25 V to −5.5 V) as shown in Fig. 4b. The corresponding Vw to each pulse is given at the bottom. Figure 4c is a superposition of all 40 cycles to help visualize the CtCV. By reducing the Vw range, the DR decreases to 10.4 on average. Figure 4d is a visualization of the CtCV (standard deviation) as a percentage of the channel resistance range (HRS-LRS). The CtCV does not exceed 6% and is as low as 1.9% on average.

Fig. 4. Potentiation and depression cycles with a constant pulse width scheme.

Fig. 4

The FeFET has a channel length of 300 nm and a channel width of 2 μm: a one potentiation (0 V to 6 V) and depression (−0.6 V to −6 V) cycle with a constant tw = 500 μs showing quasi-continuous channel resistance states (dep: 241 states, pot: 217). b 40 sub-range potentiation (0.6 V to 5 V, 171 levels) and depression (1.25 V to −5.5 V, 169 levels) cycles. Channel resistance RSD as a function of pulse number (top) and the corresponding Vw (bottom) are reported. c Superposition of all 40 cycles from (b) to visualize the cycle-to-cycle variability. d Standard deviation of a channel resistance state normalized by the resistance window (HRS-LRS) for each Vw.

To study the device-to-device variability, 20 identical FeFETs were measured and the normalized standard deviation of the HRS (39%), LRS (39%), and DR (28%) was extracted. The device-to-device variability can be explained by process variations across the sample. The polycrystalline nature of HZO results in different ferroelectric properties from device to device19. Also, the WOx conductivity is rather sensitive to its oxygen content and can be reduced at elevated temperatures (>250 °C) by other oxides or nitrides interfacing it, such as SiO2, Al2O3, or SiN. Hence, local temperature differences during processing can cause different local reduction states of the WOx.

The performance of our FeFETs as artificial synapses in a crossbar array was investigated by using the MLP+ NeuroSimV3.054 framework. The on-line learning accuracy of a pseudo-crossbar array of 400 input, 250 hidden, and 10 output neurons trained on the MNIST database was simulated by using the aforementioned values. The non-linearity (NL) parameters were extracted by fitting the potentiation and depression curves according to ref. 54 (Fig. 5a). They are 2.32 and 4.63 for the potentiation and depression, respectively. Moreover, Fig. 4a shows that operating the FeFET in a subloop regime (±3V) would improve the linearity even further, but at the cost of a smaller dynamic range. Here, the MLP+ NeuroSimV3.0 code was slightly adapted (Supplementary information Note 8) to apply a random Conductance Range Variation (CRV) to every device in the network, which takes into account our device-to-device variability. The spread of the HRS, LRS, and DR around the average (No var) are depicted in Fig. 5b, c. Figure 5d reports the learning accuracy on the MNIST database. By only considering the NL parameters and the Finite Number-of-States (FNoS) as a non-ideality, an excellent performance of 92% recognition accuracy is achieved. This metrics show that the proposed technology compares to state-of-the-art FeFET evaluated with the same platform (Supplementary Table 3). By further introducing the CtCV to the simulation, the performance remains as high as 89% and with the HRS, LRS, and DR spread included, still 88% is reached.

Fig. 5. Online learning performed with MLP+ NeuroSimV3.0.

Fig. 5

a Exponential fit of the potentiation and depression curves to extract the corresponding non-linearity parameter. b, c Histogram of the HRS, LRS, and DR after taking the device-to-device variability into account. d MNIST classification performance of our FeFETs with different degrees of non-idealities included: non-linearity factors and finite number of steps (purple), + cycle to cycle variation (gray), and + conductance range variation (blue).

We then perform endurance and retention tests to evaluate the benefit of using a metallic oxide channel, compared to state-of-the-art FeFET on Si, for which is typically reported an endurance of 10619 cycles, although lately up to 10855 cycles were reported. The endurance of a FeFET with L = 800 nm and W = 600 nm is shown in Fig. 6a. Cycling pulses of ±3 V and ±4 V with a frequency of 100 KHz were applied to the gate, while the source and drain were grounded. When switching at ±3 V we observed a small continuous decay of the DR to ~70% of its initial value after 1010 cycles, but no failure could be identified. To the best of our knowledge this is the best endurance reported on hafnia-based FeFETs. Especially when compared with Si-based channel FeFETs this is a major improvement. Increasing the cycling voltage to ±4V (on another device with the same dimensions) accelerated the fatigue and the device failed after 8109 cycles. Cycling at even higher fields was not tested as online learning happens in small changes and not by constantly switching between the extreme states.

Fig. 6. Endurance and retention.

Fig. 6

a Endurance of a FeFET with L = 800 nm and W = 600 nm. Triangular pulses with a frequency of 100 kHz were applied up to 1010 cycles. The amplitude of the pulses was ±3 V and ±4 V. The evolution of the HRS and LRS (left axis) and the corresponding dynamic range (right axis) are shown. b Retention measurements at room temperature for a FeFET with L = 300 nm and W = 2 μm showing a good retention of >10 years for the four programmed states. Only the HRS has a small drift. The solid lines are the experimental data and the dashed lines are linear extrapolations in the log-log scale.

With a channel resistance between 20 MΩ and 2 GΩ (depending on the geometry) and a read voltage of 100 mV, between 5 pW and 500 pW are dissipated during a read operation (Pread=VSD2/RSD). The write operation of a single device (tw = 500 μs) to the high gate impedance (Supplementary Fig. 4b) has a lower energy consumption, between 1.7 fJ and 1.2 pJ for Vw of 1V and 6V, respectively (Ewrite=twVw2/RG). The low-power operation, long endurance, and promising MNIST classification performance make this a viable candidate for large crossbar implementations for ANN training.

Retention measurements are depicted in Fig. 6b. By fitting a linear regression in the log-log scale, no drift is observed for the intermediate states. Both, the LRS and HRS display a small drift towards lower values. Extrapolating the fit to 10 years yields a change of about 11% and 15%, respectively. This excellent retention time confirms the advantage of using metal-oxide thin films over Si as channels as there is no back-switching of the ferroelectric domains due to charge trapping at the oxide interlayer formed between Si and the ferroelectric56. This low drift opens the path to inference and memory applications for the FeFET devices presented in this study.

Conclusion

We demonstrated a scaled (sub-µm) BEOL-compatible FeFET artificial synapse with an amorphous WOx < 3 channel. The device concept was engineered to leverage two controllable resistance modulation mechanisms activated on two different write pulse timescales: a fast ferroelectric field effect (tw < 3 μs) and an oxidation/reduction of the channel by oxygen movement at a slower timescale (tw > 3 μs). Key enablers were the control of the channel oxidation state and of its thickness down to 4 nm, as well as having the channel in direct contact with the ferroelectric HZO gate without the formation of a spurious interlayer.

The dual nature of the resistance modulation mechanisms was derived from the write time-dependent dynamics and from the two different potentiation and depression slopes. The temperature-dependent current measurements showed opposite dynamic range trends for the two timescales, confirming that the resistance changes originate from two different mechanisms. Moreover, the temperature-dependent retention measurements highlighted the role of the oxygen drift across the layers in the slow regime (tw > 3 μs).

With this extra option to extend the dynamic range, our scaled FeFETs have a dynamic HRS/LRS ratio at room temperature that is 30 times larger than the state-of-the art24. The plasticity of our synapse shows a different response over multiple timescales, making our FeFETs interesting candidates for neuromorphic engineering. The ohmic nature and a resistance of our WOx channel in the MΩ regime are excellent features for precise and low-power readout operations. The extremely fine-grained, quasi-continuous monotonic resistance changes with more than 200 steps between the LRS and HRS, together with an excellent cycle-to-cycle variability led to a good MNIST classification accuracy of 88% with the NeuroSim framework. The endurance was extended to >1010 cycles and an excellent retention of >10 years was obtained with only little dynamic range loss. Therefore, our FeFET technology is not only promising for online learning but also for in-memory computing and neural network inference applications. A 4th electrode acting as top gate could introduce a volatile component, a desired property for bio-inspired neuromorphic engineering paradigms.

Methods

Sample preparation

Our FeFET is a back-gated device to allow the crystallization of the ferroelectric HZO gate dielectric before the deposition of the thin-film WOx channel. First, 500 nm SiO2 were formed by thermally oxidizing the Si substrate. A 100 nm thick W layer was deposited by sputtering and consecutively etched in a Reactive Ion Etcher (RIE) with an SF6 plasma to form the first gate metal level (M1). Then, a 100 nm thick SiO2 passivation was deposited by Plasma-Enhanced Chemical Vapor Deposition (PECVD) at 300 °C and vias to M1 were etched using an RIE with a CHF3/O2 plasma. M2 was deposited as M1 and continued by a 10 nm TiN layer deposited using a tetrakis- (dimethylamino)titanium (TDMAT) precursor and N2/H2 plasma in an Oxford Instruments plasma-enhanced atomic layer deposition (PEALD) system. To avoid large topographies, 600 nm of SiO2 were deposited on top of M2 and then removed by chemical mechanical polishing (CMP) until only a thin layer of SiO2 was left above M2. This effectively removed the topography introduced by M1 and M2 (Supplementary Fig. 3d). The last few nm of SiO2 were then etched by RIE to expose the TiN on top of M2. An ~10 nm thick layer of HZO was grown in a process using alternating cycles of tetrakis-(ethylmethylamino)-hafnium (TEMAH) and bis-(methyl-η5-cyclopentadienyl)-methoxymethyl-zirconium (ZrCMMM) at 300 °C and then capped by 30 nm of W deposited by sputtering. The crystallization of the HZO was then performed by a millisecond flash lamp anneal (ms-FLA)30 where the sample was heated to 375 °C. After the crystallization the W was removed by a wet etch in H2O2 at 50 °C. 4 nm of WOx was then deposited using a bis-(tert-butylimino)-bis-(dimethylamino)-tungsten precursor and an oxygen plasma at 375 °C in a PEALD system. To fully oxidize the WOx, it was annealed in a rapid thermal annealer at 350 °C with 50 sccm O2 for 6min. After the structuring of the WOx by an SF6 plasma in the RIE, source (S) and drain (D) contacts (Pt/W) were deposited on top of the WOx channel by lift-off. Finally, the structures were passivated by 100 nm of SiO2 and two metal levels (M3, M4) were fabricated as M1 and M2.

Structural characterization

Grazing-Incidence X-Ray Diffraction (GIXRD) measurements were executed on a Bruker D8 Discover diffractometer equipped with a copper rotating anode generator. Cross-sectional cuts and lamellas for Scanning Transmission Electron Microscope (STEM) analyses were prepared by Focused Ion Beam (FIB) using an FEI Helios NanoLab 450 S. STEM analysis was carried out on a double spherical aberration-corrected JEOL JEM-ARM200F microscope. Bright-Field STEM (BF-STEM) images were acquired at 200 kV, and energy-dispersive X-ray spectroscopy (EDS) line profiles were performed using a liquid-nitrogen-free silicon drift detector.

Electrical characterization

The electrical characterization was performed using a probestation in atmosphere. For the temperature-dependent measurements, the stage was heated and the temperature controlled by a thermocouple. Otherwise, the measurements were performed at room temperature. Potentiation and depression, resistance, retention, endurance, and capacitance measurements were performed on an Agilent B1500A with a B1530A waveform generator/fast measurement unit and a B1520A multi-frequency capacitance measurement unit. Write pulses were generated by a remote sense unit module close to the probe and applied to the gate while source and drain were grounded. To read the channel resistance, four sampling points at VDS = 200 mV and VDS = −200 mV were taken with the source measurement unit at the source while the drain was grounded and the gate was floating. PUND measurements on MFSM capacitors were performed on a TF analyzer 2000 from aixACCT. The PUND signal of ±5 V at 1 kHz was applied to the W/TiN contact (equivalent of the gate in a FeFET) of the W/TiN/HZO/WOx/W capacitor, while the other side was grounded. Before the PUND measurement, devices were woken up by 105 fully switching cycles with an amplitude of ±4 V.

Supplementary information

Acknowledgements

We thank the Cleanroom Operations Team of the Binnig and Rohrer Nanotechnology Center (BRNC) for their help and support. This work is funded by H2020: ULPEC (no. 732642) and BeFerroSynaptic (no. 871737).

Author contributions

L.B.L. contributed to the temperature-dependent conduction mechanism measurements and interpretation, to manuscript revision, and to technical guidance; M.S. contributed to the STEM analysis; Y.P. and U.D. contributed by doing CMP; V.B. contributed with result interpretation and manuscript revision; B.J.O. contributed with manuscript revision and technical guidance; M.H. contributed with the design and fabrication of the devices, with the acquisition and interpretation of all not yet mentioned data, adaption of software code and writing the manuscript.

Peer review

Peer review information

Communications Materials thanks Kai Ni and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Aldo Isidori.

Data availability

The data that support the findings of this study have been included in the manuscript and supplementary information. Any additional data are available from the corresponding author upon reasonable request.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

The online version contains supplementary material available at 10.1038/s43246-023-00342-x.

References

  • 1.Völske, M. et al. Web Archive Analytics. In INFORMATIK 2020 (eds. Reussner, R. H., Koziolek, A. & Heinrich, R.) 61–72 10.18420/inf2020_05 (Gesellschaft für Informatik, Bonn, 2021).
  • 2.Umair M, Cheema MA, Cheema O, Li H, Lu H. Impact of COVID-19 on IoT adoption in healthcare, smart homes, smart buildings, smart cities, transportation and industrial IoT. Sensors. 2021;21:3838. doi: 10.3390/s21113838. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Reinsel, D., Rydning, J. & Gantz, J. Worldwide Global DataSphere Forecast, 2021–2025: the world keeps creating more data — now, what do we do with it all? Doc # US46410421. Available at: https://www.idc.com/getdoc.jsp?containerId=US46410421 (2021).
  • 4.Abdo, G. & Mahale, V. Worldwide managed edge services forecast, 2021–2025. Doc # US47308121 Available at: https://www.idc.com/getdoc.jsp?containerId=US47308121 (2021).
  • 5.Yang R. In-memory computing with ferroelectrics. Nat. Electron. 2020;3:237–238. doi: 10.1038/s41928-020-0411-2. [DOI] [Google Scholar]
  • 6.Christensen, D. V. et al. 2022 roadmap on neuromorphic computing and engineering. 2, 022501 (2021).
  • 7.Feldman DE. Synaptic mechanisms for plasticity in neocortex. Annu. Rev. Neurosci. 2009;32:33–55. doi: 10.1146/annurev.neuro.051508.135516. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Kasabov N, Dhoble K, Nuntalid N, Indiveri G. Dynamic evolving spiking neural networks for on-line spatio- and spectro-temporal pattern recognition. Neural Networks. 2013;41:188–201. doi: 10.1016/j.neunet.2012.11.014. [DOI] [PubMed] [Google Scholar]
  • 9.Beilliard Y, Alibart F. Multi-terminal memristive devices enabling tunable synaptic plasticity in neuromorphic hardware: a mini-review. Front. Nanotechnol. 2021;3:1–9. doi: 10.3389/fnano.2021.779070. [DOI] [Google Scholar]
  • 10.Youngblood N, et al. Tunable volatility of Ge 2 Sb 2 Te 5 in integrated photonics. Adv. Funct. Mater. 2019;29:1807571. doi: 10.1002/adfm.201807571. [DOI] [Google Scholar]
  • 11.Diehl PU, Cook M. Unsupervised learning of digit recognition using spike-timing-dependent plasticity. Front. Comput. Neurosci. 2015;9:1–9. doi: 10.3389/fncom.2015.00099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Payvand M, Nair MV, Müller LK, Indiveri G. A neuromorphic systems approach to in-memory computing with non-ideal memristive devices: from mitigation to exploitation. Faraday Discuss. 2019;213:487–510. doi: 10.1039/C8FD00114F. [DOI] [PubMed] [Google Scholar]
  • 13.Indiveri G, Linares-Barranco B, Legenstein R, Deligeorgis G, Prodromakis T. Integration of nanoscale memristor synapses in neuromorphic computing architectures. Nanotechnology. 2013;24:384010. doi: 10.1088/0957-4484/24/38/384010. [DOI] [PubMed] [Google Scholar]
  • 14.Kub FJ, Moon KK, Mack IA, Long FM. Programmable analog vector-matrix multipliers. IEEE J. Solid-State Circuits. 1990;25:207–214. doi: 10.1109/4.50305. [DOI] [Google Scholar]
  • 15.Zhou Y, et al. A reconfigurable Two‐WSe 2 ‐transistor synaptic cell for reinforcement learning. Adv. Mater. 2022;34:e2107754. doi: 10.1002/adma.202107754. [DOI] [PubMed] [Google Scholar]
  • 16.Xue F, et al. Optoelectronic ferroelectric domain‐wall memories made from a single van der waals ferroelectric. Adv. Funct. Mater. 2020;30:2004206. doi: 10.1002/adfm.202004206. [DOI] [Google Scholar]
  • 17.Han H, Yu H, Wei H, Gong J, Xu W. Recent progress in three‐terminal artificial synapses: from device to system. Small. 2019;15:1900695. doi: 10.1002/smll.201900695. [DOI] [PubMed] [Google Scholar]
  • 18.Yu S, et al. Ferroelectric HfO 2 -based synaptic devices: recent trends and prospects. Semicond. Sci. Technol. 2021;36:104001. doi: 10.1088/1361-6641/ac1b11. [DOI] [Google Scholar]
  • 19.Mulaosmanovic H, et al. Ferroelectric field-effect transistors based on HfO 2: a review. Nanotechnology. 2021;32:502002. doi: 10.1088/1361-6528/ac189f. [DOI] [PubMed] [Google Scholar]
  • 20.Jerry, M. et al. Ferroelectric FET analog synapse for acceleration of deep neural network training. in 2017 IEEE International Electron Devices Meeting (IEDM)6, 6.2.1-6.2.4 (IEEE, 2017).
  • 21.Mulaosmanovic H, et al. Switching kinetics in nanoscale hafnium oxide based ferroelectric field-effect transistors. ACS Appl. Mater. Interfaces. 2017;9:3792–3798. doi: 10.1021/acsami.6b13866. [DOI] [PubMed] [Google Scholar]
  • 22.Dunkel, S. et al. A FeFET-based super-low-power ultra-fast embedded NVM technology for 22nm FDSOI and beyond. in 2017 IEEE International Electron Devices Meeting (IEDM)1, 19.7.1-19.7.4 (IEEE, 2017).
  • 23.Breyer ET, Mulaosmanovic H, Mikolajick T, Slesazeck S. Perspective on ferroelectric, hafnium oxide based transistors for digital beyond von-Neumann computing. Appl. Phys. Lett. 2021;118:050501. doi: 10.1063/5.0035281. [DOI] [Google Scholar]
  • 24.Halter M, et al. Back-end, CMOS-compatible ferroelectric field-effect transistor for synaptic weights. ACS Appl. Mater. Interfaces. 2020;12:17725–17732. doi: 10.1021/acsami.0c00877. [DOI] [PubMed] [Google Scholar]
  • 25.Mo, F. et al. Experimental demonstration of ferroelectric HfO 2 FET with ultrathin-body IGZO for high-density and low-power memory application. In: 2019 Symposium on VLSI Technology T42–T43 10.23919/VLSIT.2019.8776553 (IEEE, 2019).
  • 26.Kim M-K, Lee J-S. Ferroelectric analog synaptic transistors. Nano Lett. 2019;19:2044–2050. doi: 10.1021/acs.nanolett.9b00180. [DOI] [PubMed] [Google Scholar]
  • 27.Mulaosmanovic H, Chicca E, Bertele M, Mikolajick T, Slesazeck S. Mimicking biological neurons with a nanoscale ferroelectric transistor. Nanoscale. 2018;10:21755–21763. doi: 10.1039/C8NR07135G. [DOI] [PubMed] [Google Scholar]
  • 28.CHARLTON MG. Hydrogen reduction of tungsten trioxide. Nature. 1952;169:109–110. doi: 10.1038/169109a0. [DOI] [Google Scholar]
  • 29.Ingham B, Hendy SC, Chong SV, Tallon JL. Density-functional studies of tungsten trioxide, tungsten bronzes, and related systems. Phys. Rev. B. 2005;72:075109. doi: 10.1103/PhysRevB.72.075109. [DOI] [Google Scholar]
  • 30.O’Connor É, et al. Stabilization of ferroelectric Hf x Zr 1−x O 2 films using a millisecond flash lamp annealing technique. APL Mater. 2018;6:121103. doi: 10.1063/1.5060676. [DOI] [Google Scholar]
  • 31.Haubner R, Schubert WD, Lassner E, Lux B. Influence of aluminum on the reduction of tungsten oxide to tungsten powder. Int. J. Refract. Hard Met. 1987;6:161–167. [Google Scholar]
  • 32.Aird A, Domeneghetti MC, Mazzi F, Tazzoli V, Salje EKH. Sheet superconductivity in: crystal structure of the tetragonal matrix. J. Phys. Condens. Matter. 1998;10:L569–L574. doi: 10.1088/0953-8984/10/33/002. [DOI] [Google Scholar]
  • 33.He X, et al. Memristive properties of hexagonal WO3 nanowires induced by oxygen vacancy migration. Nanoscale Res. Lett. 2013;8:50. doi: 10.1186/1556-276X-8-50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Georg A, Graf W, Wittwer V. Comparison of electrical conductivity and optical properties of substoichiometrically and electrochemically coloured WOx films of different crystallinity. Sol. Energy Mater. Sol. Cells. 1998;51:353–370. doi: 10.1016/S0927-0248(97)00255-9. [DOI] [Google Scholar]
  • 35.Chien WC, et al. A study of the switching mechanism and electrode material of fully CMOS compatible tungsten oxide ReRAM. Appl. Phys. A Mater. Sci. Process. 2011;102:901–907. doi: 10.1007/s00339-011-6271-x. [DOI] [Google Scholar]
  • 36.Wen X, Halter M, Bégon-Lours L, Luisier M. Physical modeling of HZO-based ferroelectric field-effect transistors with a WOx channel. Front. Nanotechnol. 2022;4:1–11. doi: 10.3389/fnano.2022.900592. [DOI] [Google Scholar]
  • 37.Gokmen T, Vlasov Y. Acceleration of deep neural network training with resistive cross-point devices: design considerations. Front. Neurosci. 2016;10:1–13. doi: 10.3389/fnins.2016.00333. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Li J, et al. Ultrafast polarization switching in thin-film ferroelectrics. Appl. Phys. Lett. 2004;84:1174–1176. doi: 10.1063/1.1644917. [DOI] [Google Scholar]
  • 39.Xiao Z, et al. Controlling resistance switching performances of Hf 0.5 Zr 0.5 O 2 films by substrate stress and potential in neuromorphic computing. Adv. Intell. Syst. 2022;4:2100244. doi: 10.1002/aisy.202100244. [DOI] [Google Scholar]
  • 40.Chiu F-C. A review on conduction mechanisms in dielectric films. Adv. Mater. Sci. Eng. 2014;2014:1–18. [Google Scholar]
  • 41.Sze, S. M. & Ng, K. K. Physics of semiconductor devices. America10, (John Wiley & Sons, Inc., 2006).
  • 42.Müller J, et al. Ferroelectricity in simple binary ZrO 2 and HfO 2. Nano Lett. 2012;12:4318–4323. doi: 10.1021/nl302049k. [DOI] [PubMed] [Google Scholar]
  • 43.Böscke TS, et al. Phase transitions in ferroelectric silicon doped hafnium oxide. Appl. Phys. Lett. 2011;99:112904. doi: 10.1063/1.3636434. [DOI] [Google Scholar]
  • 44.Mueller, S., Muller, J., Schroeder, U. & Mikolajick, T. Reliability characteristics of ferroelectric Si:HfO2 thin films for memory applications. IEEE Trans. Device Mater. Reliab.13, 93–97 (2013).
  • 45.Ilyas N, et al. Analog switching and artificial synaptic behavior of Ag/SiOx:Ag/TiOx/p++-Si memristor device. Nanoscale Res. Lett. 2020;15:30. doi: 10.1186/s11671-020-3249-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Boyn S, et al. Learning through ferroelectric domain dynamics in solid-state synapses. Nat. Commun. 2017;8:14736. doi: 10.1038/ncomms14736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Max B, Hoffmann M, Mulaosmanovic H, Slesazeck S, Mikolajick T. Hafnia-based double-layer ferroelectric tunnel junctions as artificial synapses for neuromorphic computing. ACS Appl. Electron. Mater. 2020;2:4023–4033. doi: 10.1021/acsaelm.0c00832. [DOI] [Google Scholar]
  • 48.Clopath C, Büsing L, Vasilaki E, Gerstner W. Connectivity reflects coding: a model of voltage-based STDP with homeostasis. Nat. Neurosci. 2010;13:344–352. doi: 10.1038/nn.2479. [DOI] [PubMed] [Google Scholar]
  • 49.Ziegler M, Riggert C, Hansen M, Bartsch T, Kohlstedt H. Memristive hebbian plasticity model: device requirements for the emulation of hebbian plasticity based on memristive devices. IEEE Trans. Biomed. Circuits Syst. 2015;9:197–206. doi: 10.1109/TBCAS.2015.2410811. [DOI] [PubMed] [Google Scholar]
  • 50.Diederich N, Bartsch T, Kohlstedt H, Ziegler M. A memristive plasticity model of voltage-based STDP suitable for recurrent bidirectional neural networks in the hippocampus. Sci. Rep. 2018;8:9367. doi: 10.1038/s41598-018-27616-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Garg N, et al. Voltage-dependent synaptic plasticity: unsupervised probabilistic Hebbian plasticity rule based on neurons membrane potential. Front. Neurosci. 2022;16:1–12. doi: 10.3389/fnins.2022.983950. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.He C, et al. Artificial synapse based on van der waals heterostructures with tunable synaptic functions for neuromorphic computing. ACS Appl. Mater. Interfaces. 2020;12:11945–11954. doi: 10.1021/acsami.9b21747. [DOI] [PubMed] [Google Scholar]
  • 53.Xi F, et al. Artificial synapses based on ferroelectric schottky barrier field-effect transistors for neuromorphic applications. ACS Appl. Mater. Interfaces. 2021;13:32005–32012. doi: 10.1021/acsami.1c07505. [DOI] [PubMed] [Google Scholar]
  • 54.Chen, P.-Y., Peng, X. & Yu, S. NeuroSim+: an integrated device-to-algorithm framework for benchmarking synaptic devices and array architectures. In: 2017 IEEE International Electron Devices Meeting (IEDM) 6.1.1–6.1.4 10.1109/IEDM.2017.8268337 (IEEE, 2017).
  • 55.Bae JH, et al. Highly scaled, high endurance, Ω-gate, nanowire ferroelectric FET memory transistors. IEEE Electron Device Lett. 2020;41:1637–1640. doi: 10.1109/LED.2020.3028339. [DOI] [Google Scholar]
  • 56.Yurchuk E, et al. Origin of the endurance degradation in the novel HfO2-based 1T ferroelectric non-volatile memories. IEEE Int. Reliab. Phys. Symp. Proc. 2014;2:1–5. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The data that support the findings of this study have been included in the manuscript and supplementary information. Any additional data are available from the corresponding author upon reasonable request.


Articles from Communications Materials are provided here courtesy of Nature Publishing Group

RESOURCES