Skip to main content
Science Advances logoLink to Science Advances
. 2021 Jul 16;7(29):eabh0648. doi: 10.1126/sciadv.abh0648

Analog memristive synapse based on topotactic phase transition for high-performance neuromorphic computing and neural network pruning

Xing Mou 1, Jianshi Tang 1,2,*, Yingjie Lyu 3, Qingtian Zhang 2, Siyao Yang 1, Feng Xu 1, Wei Liu 1, Minghong Xu 1, Yu Zhou 3, Wen Sun 1, Yanan Zhong 1, Bin Gao 1,2, Pu Yu 2,3,*, He Qian 1,2, Huaqiang Wu 1,2
PMCID: PMC8284889  PMID: 34272239

Memristor with topotactic phase transition demonstrates controllable analog switching and implements neural network pruning.

Abstract

Inspired by the human brain, nonvolatile memories (NVMs)–based neuromorphic computing emerges as a promising paradigm to build power-efficient computing hardware for artificial intelligence. However, existing NVMs still suffer from physically imperfect device characteristics. In this work, a topotactic phase transition random-access memory (TPT-RAM) with a unique diffusive nonvolatile dual mode based on SrCoOx is demonstrated. The reversible phase transition of SrCoOx is well controlled by oxygen ion migrations along the highly ordered oxygen vacancy channels, enabling reproducible analog switching characteristics with reduced variability. Combining density functional theory and kinetic Monte Carlo simulations, the orientation-dependent switching mechanism of TPT-RAM is investigated synergistically. Furthermore, the dual-mode TPT-RAM is used to mimic the selective stabilization of developing synapses and implement neural network pruning, reducing ~84.2% of redundant synapses while improving the image classification accuracy to 99%. Our work points out a new direction to design bioplausible memristive synapses for neuromorphic computing.

INTRODUCTION

The growth of computing power in digital hardware, including central processing unit and graphics processing unit, has driven the rapid development of artificial intelligence. This, in turn, raises higher and higher demand on the hardware performance, even exceeding the pace of Moore’s law. One of the key bottlenecks arises from the physical separation of memory and computing units in the widely adopted von Neumann architecture, which leads to a grand challenge of memory wall problem. Inspired by neurobiological systems, neuromorphic computing has emerged as a promising computing paradigm with the feature of massively parallel computation in memory to break the so-called von Neumann bottleneck (1, 2). Various nonvolatile memories (NVMs), such as resistive random-access memory (RRAM) (3, 4) and phase-change memory (PCM) (5), have been extensively studied as artificial synapses and neurons to build prototype artificial intelligence chips (68). Different from digital memory applications, here, reproducible analog switching characteristics (e.g., multilevel conductance states, weight update linearity and symmetry, and low variability) are desired to meet the requirement of high computing accuracy and energy efficiency (9, 10).

Unfortunately, those existing emerging NVMs still suffer from nonideal device characteristics (fig. S1), which are one of the main challenges for the hardware implementation of large-scale neuromorphic computing systems. For example, conventional filament-type RRAM relies on the random oxygen vacancy (Vo) migration in the amorphous switching oxides, leading to intrinsically large device variations, while the absence of local Joule heating effect in interface-type RRAM usually results in poor retention and low speed (11). For PCM, it typically shows asymmetric switching due to the abrupt quench process in the crystalline-to-amorphous phase transition and also suffers from the conductance drift issue (12). It is noted that those imperfect device characteristics originate from their intrinsic working mechanisms and hence are difficult to be eliminated by simply optimizing their device structures (13, 14). In addition, so far, these devices are mainly limited to mimic the functionalities of an individual neuron or synapse (synaptic plasticity, neuronal firing, etc.), while the biomimicry of many important network-level properties, such as neural network pruning that is critical for cognitive learning in biology, has not been explored yet. Therefore, for future high-performance neuromorphic computing, innovations in materials and devices with new working mechanisms are highly desired to yield more controllable analog switching characteristics and further construct more bioplausible neural networks (15, 16).

In search of new materials and structures for low-variability analog switching memristors, here, we propose a novel synapse, namely, topotactic phase transition RAM (TPT-RAM), using brownmillerite (BM) oxides [such as SrCoO2.5 (SCO) (17, 18) and SrFeO2.5 (1921)] as the resistive switching oxide. We chose SCO as an exemplary material whose unique crystal structure formed by alternating stacks of oxygen octahedra and oxygen tetrahedra provides the favorable conditions to achieve uniform analog switching: (i) The highly ordered one-dimensional oxygen vacancy channels (OVCs) provide predefined freeway for the migration of oxygen ions to induce phase transition and resistive switching (22). Compared with other methods intended to confine the ion migration, such as metal doping (23) and dislocation engineering (15), the highly ordered and atomically precise OVCs in BM oxides are more uniform and easier to manipulate without additional ex situ processes (24). (ii) The multivalent cobalt ions change reversibly between BM and perovskite (PV) structures on the basis of the adjustment of oxygen stoichiometry without losing the intrinsic lattice architecture (2527), which can yield gradual switching. (iii) High-quality, stable BM oxide as the resistive switching layer ensures excellent retention at multilevel conductance states and also enhanced endurance.

In this work, to implement low-power neuromorphic computing, we designed and fabricated SCO-based TPT-RAM with tunable OVCs as memristive synapses. Such TPT-RAM provided an excellent platform to thoroughly study the topotactic phase transition–associated switching mechanism by correlating electrical and structural characterizations with comprehensive atomic-device modeling and simulations (2830), which, however, is difficult to perform for conventional RRAM with amorphous oxide like HfO2 (31). Experimentally, we demonstrated that the high-speed and uniform analog TPT-RAM can be achieved by manipulating OVCs through the top and bottom electrodes (BEs). Furthermore, inspired by the selective stabilization of developing synapses in biological neural networks, we implemented the online training of a sparse neural network through automatic pruning, realizing a substantial reduction of both network size and power consumption.

RESULTS

Orientation-dependent switching characteristics of TPT-RAM

Figure 1 (A and B) illustrates the schematic of TPT-RAM synapses with tunable OVCs and their phase transition mechanism. To start, 35-/17-nm-thick single-crystalline BM-SCO/SrRuO3 (SRO) thin films were epitaxially grown on both (001)- and (110)-oriented SrTiO3 (STO) substrates by pulsed laser deposition (PLD) method. The alternating stacking of oxygen octahedra and tetrahedra in the BM-SCO results in highly ordered OVCs. Because of the epitaxial strain and crystalline symmetry, the OVCs orients within the plane for films grown on SCO (001)pc and have a large out-of-plane component for the films grown on SCO (110)pc (3234). The metallic SRO was used as the BE, enabling an atomically clean epitaxial interface without misfit dislocations (fig. S2) (35). After the film growth, a 20-nm-thick Al2O3 capping layer was deposited on top to protect the SCO layer. Then, the TPT-RAM device area was defined by etching contact vias in Al2O3 with sizes ranging from (4 μm)2 to (100 μm)2. Last, the top electrodes (TEs) were made by sputtering of 100-nm-thick Au. Both Au and SRO could form an ohmic contact with SCO (36), which is crucial to eliminate the effect of interfacial barriers and obtain symmetric I-V characteristics. The different crystal structures of these two SCO thin films were further verified by both x-ray diffraction (XRD) (Fig. 1C) and aberration-corrected scanning transmission electron microscopy (Fig. 1D), where the oxygen tetrahedral layers (ordered OVCs) are labeled with pink arrows.

Fig. 1. Design of the SCO-based TPT-RAM synaptic devices with tunable orientations of OVCs.

Fig. 1

(A) Schematics of TPT-RAM based on different crystal orientations of SCO. (B) Reversible field–driven topotactic phase transition between BM-SCO and PV-SCO depends on the oxygen stoichiometry. (C) XRD θ-2θ patterns of the two epitaxial SCO thin films grown on the SRO-buffered STO (110) substrate (top) and (001) substrate (bottom). (D) Atomic-resolution scanning transmission electron microscopy images of SCO (110)pc (top) and SCO (001)pc (bottom) exhibit two different orientations of OVCs, where the oxygen tetrahedral layers are labeled with pink arrows. The insets show the corresponding structure models. a.u., arbitrary units.

To investigate the device switching mechanism, we compared the switching characteristics of SCO (001)pc and SCO (110)pc TPT-RAM. Figure 2A presents the typical forming processes and subsequent consecutive I-V sweeps for the two SCO devices. Figure 2B plots the statistical distributions of the forming voltage measured from 30 devices to affiliate the oxygen ion migration. The corresponding values in SCO (001)pc devices (5.04 ± 0.07 V) are notably higher than that in SCO (110)pc devices (3.98 ± 0.05 V). It also shows markedly enhanced uniformity in SCO (110)pc devices. Fig. 2C displays the uniformity of TPT-RAM device conductance (measured at a read voltage of 0.2 V) over 500 switching cycles. The results in fig. S3 (A and B) show that the cycle-to-cycle variations (σ/μ) are very low in SCO (110)pc TPT-RAM: only 1.8% for set voltage and 0.9% for reset voltage, while 2.25 and 13.74% variations for the high-resistance state (HRS) and the low-resistance state (LRS), respectively. The cycle-to-cycle uniformity of TPT-RAM can be attributed to the highly anisotropic ion migration pathways in the anisotropic SCO crystal structure. In addition, fig. S3 (C and D) also shows excellent reproducibility with low device-to-device variation (down to 4.9%) and good batch-to-batch uniformity, which is mainly due to the high-quality epitaxial SCO thin film.

Fig. 2. Effect of the tunable OVCs on the switching characteristics of TPT-RAM.

Fig. 2

(A) Typical DC forming and I-V curves for 100 consecutive sweeps. (B) Statistical analysis of the forming voltages extracted from 32 different devices of each SCO orientation. (C) Cumulative probability plots of the device conductance over 500 I-V sweeps with 0.2-V read voltage. (D) The required pulse operation condition to program the device from the same initial conductance Ginital to the target value Gtarget. The on/off ratio is defined as n = Gtarget/Ginital. (E) The oxygen migration pathways within the oxygen tetrahedral layers and (F) across the oxygen tetrahedral layers in SCO.

Furthermore, we developed a pulse test scheme to evaluate the operation speed of these two different SCO devices (fig. S4). Each device was started from a similar low-conductance state (Ginitial = 40 μS), and then a series of pulses with fixed amplitude (0.8 to 4 V) but different widths (100 ns to 10 ms) were continuously applied. The device conductance value was read out after each operation pulse, until it reached the target conductance Gtarget (80 and 160 μS, corresponding to a conductance on/off ratio of n = Gtarget/Ginitial = 2 and 4, respectively). After that, the device was reset to the initial low-conductance state, and then the pulse amplitude was changed to program the device again. In this way, by adding up the pulse widths applied to program the device from its initial state to the target state, we can estimate the pulse operation conditions needed to reach the target state. This test method can avoid the effect of different initial resistance states and test as many pulse conditions as possible. The results in Fig. 2D show that the conductance change of SCO (110)pc was much easier than SCO (001)pc, where lower operation voltage (down to 0.8 V) and faster speed (up to 100 ns, which was limited by our measurement equipment) were achieved in the SCO (110)pc TPT-RAM device as compared to the values (~2.0 V and ~10 μs, respectively) for SCO (001)pc.

To better understand the electrical characteristics of TPT-RAM devices, we established a synergistic modeling approach to study the underlying switching mechanism (Fig. 2, E and F, and figs. S5 to S8). First, density functional theory (DFT) calculations were performed to evaluate the migration barriers along different directions within SCO at the atomic scale (fig. S5) (22). Figure 2E presents two different migration pathways for oxygen ions within the oxygen tetrahedral layers. Take two adjacent positions as the initial and final states, respectively, and the migration barrier along the OVCs is estimated to be 0.56 eV. In comparison, the lowest migration barrier perpendicular to the OVCs is 0.97 eV, where the oxygen at site X moves to site Y, while another oxygen atom at Y hops to site Z. Besides, Fig. 2F presents that the migration barrier from one tetrahedral layer to another through the octahedral layer is 1.84 eV, where there are three oxygen ions involved. The calculated values summarized in fig. S6 suggest that the oxygen ions favor the migration along the orientation of OVCs.

Besides, we performed kinetic Monte Carlo (KMC) simulations to capture the complete physical processes of the resistive switching (fig. S7) (3739). In the SCO (110)pc device, the OVCs aligned with the TE-BE electric field direction provide preferable migration paths for oxygen ions so that the BM-to-PV phase transition occurred easily and multiple filaments were formed. In comparison, oxygen ions in the SCO (001)pc device were driven by the electric field perpendicular to the OVCs, so there was a lower probability for ions moving along the electric field direction, leading to detrimental random phase transitions (fig. S8 and movie S1). Therefore, the SCO (110)pc device was easier to form conducting filaments and hence could operate at a higher speed and lower voltage as shown in Fig. 2D. It is also consistent with the observed smaller variation (σ/μ = 0.9%) in the write voltage for SCO (110)pc devices (fig. S3). These results suggest that the formation of conducting filaments depends on the oriented oxygen ion migrations along the OVCs in the SCO (110)pc. Moreover, the atomic-scale spatial uniformity shows a significant improvement compared to the conventional amorphous oxide–based RRAM devices, in which the random Vo formations and movements would induce a larger switching variation (38). In addition, the weak dependence of LRS on the device area in SCO (110)pc in fig. S9A suggests that the switching mechanism is filamentary type, which is consistent with the KMC simulation results. The local electric field was enhanced at the tips of filaments after forming, which resulted in a localized region of filament formation within the device area. On the contrary, there is a clear area dependence of the device conductance for SCO (001)pc, which suggests that the phase transition is not localized with a small region as in the case of SCO (110)pc (fig. S9B) (18).

Analog switching characteristics of TPT-RAM

On the basis of the above results, we used SCO (110)pc–based TPT-RAM to further investigate its high-performance analog switching characteristics for neuromorphic computing. The long-term potentiation (LTP) and long-term depression (LTD) characteristics were tested with identical set and reset pulses (Fig. 3A). Each cycle consists of 50 set and 50 reset pulses with the pulse width of 1 μs followed by the read pulses of 0.2 V and 1 μs. The result confirms that the analog switching of TPT-RAM can be realized with identical low-voltage pulses (1 V), and it also exhibited nearly ideal linear switching with nonlinearity factors of ν = 0.20 and 1.29 for LTP and LTD, respectively (fig. S10A). Both values are much lower than those reported for the typical filament-type RRAM (9). Besides, the cycle-to-cycle pulse programming tests in Fig. 3B also show reproducible analog switching characteristics with both identical pulses and amplitude-increasing pulses. It is shown that the latter programming scheme could achieve better linearity (fig. S10) and larger on/off ratio (40).

Fig. 3. Analog switching characteristics of SCO (110)pc–based TPT-RAM for neuromorphic computing.

Fig. 3

(A) LTP and LTD with identical set and reset pulses. The write pulse train consists of 50 set pulses (amplitudes of 1, 1.5, and 2 V, the width of 1 μs) followed by 50 reset pulses (amplitudes of −1, −1.5, and −2 V, the width of 1 μs). The device conductance was measured by a read pulse (amplitude of 0.2 V, the width of 1 μs) after each write pulse. (B) Reproducible and uniform analog switching behavior over multiple cycles: under identical pulses with an amplitude of 1 V, a width of 1 μs (top) and increasing pulse amplitudes with 1 to 2.89 V, width of 100 ns (bottom). (C) Retention test over a long period of 3000 s at 85°C for five conductance levels. (D) Device endurance measurement with different resistive switching windows (fixed HRS and varied LRS). For a given switching window, here, we use pulses with amplitudes of 1.8 to 3.2 V and a width of 1 μs to program the device to the desired HRS and LRS in each cycle, which were recorded at a read voltage of 0.2 V. The device remains stable after 108 cycles in both cases, demonstrating excellent reliability.

The reversible phase transition without losing parent crystal structure enables excellent retention and endurance of TPT-RAM. Figure 3C shows that the device has long retention of more than 3000 s at 85°C. Furthermore, using the extracted activation energy from temperature-dependent measurements, the device retention at room temperature can be predicted to be more than 10 years (fig. S11) (41). The presence of oriented OVCs in the SCO (110)pc–based TPT-RAM enables the device to be programmed readily with a lower migration barrier and also have excellent retention with a higher diffusion barrier (fig. S12). Figure 3D shows reliable multilevel switching with different resistive switching windows over 108 pulses. All these results demonstrate that developed TPT-RAM based on SCO (110)pc thin films can serve as a high-performance synaptic device (for comparison with other devices, see fig. S1 and table S1).

The nonvolatile analog switching characteristics presented above were measured after the forming process, before which the device behaved as a diffusive memristor instead (42). Figure 4A schematically illustrates the working mechanism under different pulse conditions, where the device initially exhibited a high resistance (>1 megohm) due to the insulating BM phase. A weak excitation below its forming voltage, e.g., 2 V (Fig. 4C), would extract oxygen ions from the SRO layer to the SRO/SCO interface and lower the device resistance. However, it could not facilitate a fully stable phase transition of SCO that requires a sufficiently large amount of oxygen ions (27, 43). Once removing the voltage bias, the chemical potential difference at the interface would induce the diffusion of oxygen ions back to the SRO layer, causing the device to gradually relax back to the initial HRS. A read voltage pulse of 0.2 V was applied before and after the write pulse to record the device conductance state. The device was first programmed to a relatively low resistance state but then spontaneously relaxed back to the initial state within ~1 s (Fig. 4C). As a result, the TPT-RAM device exhibited diffusive memristor behavior under weak excitation. In addition, the endurance test in fig. S13 demonstrates a device endurance of more than 1500 cycles in the diffusive mode.

Fig. 4. The diffusive nonvolatile dual mode of the TPT-RAM synapse.

Fig. 4

(A) Schematic of the memristive switching mechanism. In the initial state, a weak excitation cannot induce a stable phase transition, and the device behaves as a diffusive memristor. After a strong excitation, sufficient amount of oxygen ions migrate across the SRO/SCO interface so that conductive filaments are formed, and the device exhibits nonvolatile characteristics. (B) The pulse forming process. A small number (≤50) of pulses could only induce diffusive switching behavior, but more pulses (≥100) could electroform the device and turn it into a nonvolatile memristor. (C) Pulse test before strong excitation (forming) shows diffusive characteristics under weak pulse. PN, pulse number; PW, pulse width. (D) Pulse test after strong excitation (forming) shows nonvolatile memristor characteristics, where the pulse condition is the same as in (C). (E)The multicycle pulse test in the diffusive mode also shows good cycle-to-cycle uniformity (low variation of σ/μ = 1.35%) and analog switching characteristics.

On the contrary, under strong excitation, e.g., DC forming at about 4 V (Fig. 3A), sufficient oxygen ions were driven into SCO, which triggered a stable phase transition. After that, a small positive voltage could attract more oxygen ions to migrate and induce a stable phase transition from BM-SCO to PV-SCO (Fig. 3A), increasing the device conductance (set process). The reverse phase transition would occur when a negative voltage is applied to drive oxygen ions back to SRO, reducing the device conductance (reset process). As a result, in this nonvolatile mode, relatively low resistance and excellent retention (>3000 s at 85°C) were observed. The electrical test results in the two different modes are shown in Fig. 4 (C and D, respectively). To further clarify the forming process, we tested the device using different numbers of identical pulses (Vpulse = 3 V, pulse width = 10 ms) as shown in Fig. 4B, and the transition from diffusive mode to nonvolatile mode can also be observed. Starting from the same initial conductance state, both 10 and 50 pulses could induce incremental conductance change, but it gradually decayed after the pulse operations, suggesting that the device operated in the diffusive mode. However, as the number of consecutive pulses increased to 100, an abrupt conductance change was observed, turning the TPT-RAM device into the nonvolatile mode. These results indicate that the device can be formed by either a single large voltage or an adequate amount of repeated small voltage pulses. Besides, the cycling test with 50 set pulses of 3 V and 10 ms in Fig. 4E shows good uniformity (variation down to σ/μ = 1.35%) and analog switching characteristics (nonlinearity factor of TPT-RAM down to 1.8 as shown in fig. S14) in the diffusive mode as well.

Implementation of neural network pruning

Furthermore, we would like to highlight that the unique diffusive nonvolatile dual mode of TPT-RAM memristive synapse as revealed in Fig. 4 can be used to achieve better emulation of neurobiological functions beyond synaptic weight representation (12). In the past decade, there have been extensive studies on the emulation of synaptic behaviors in biology, such as synaptic plasticity, short-term memory, and long-term memory, using emerging memristive devices (42, 44). However, so far, they are mainly limited to device-level biomimicry, while the emulation of many important network-level properties has not been explored yet (2), such as neural network pruning that is critical in human brain development (45). As a result, there is still a huge gap between biological and artificial neural networks, and more attention should be paid to systematically co-optimize the architectural designs and bioplausible hardware properties (2). In addition, the deployment of deep neural networks in resource-limited applications (e.g., portable electronics and Internet of Things) is largely limited by the high power consumption and lack of real-time processing capability. Neural network pruning is considered an effective pathway to reduce network complexity and avoid overfitting (46). However, this usually results in irregular network connections and may require extra efforts to represent sparse locations, thus incurring additional hardware overhead and computational cost (47). Therefore, neural network pruning has not been demonstrated using memristive synapses yet.

In this work, we used the developed TPT-RAM to implement the automatic neural network pruning in the network training with deep learning algorithm (46). Such automatic pruning process is inspired by the synapse-developing process in the human brain (45), as illustrated in Fig. 5A. In the human brain, the number of synaptic connections reaches its maximum in early childhood, and then active synapses are selectively stabilized, while redundant synapses that are rarely used are gradually eliminated. This natural synaptic pruning process is essential for refining the neural network and increasing network efficiency. Here, a differential pair of two TPT-RAM devices was used to represent one synaptic weight. Initially, all the synaptic devices worked in the diffusive mode. During training, the devices that were frequently updated would be electroformed and turned into the nonvolatile state eventually (representing stabilized synaptic connections), while the others would naturally decay to off state. In this way, the neural network could enhance those important synaptic weight connections and trim down other irrelevant synaptic connections and eventually became a sparse network after the training. The simulation results in Fig. 5 (B and C) are compared to the baselines trained with conventional nonvolatile synapses in both multilayer perceptron (MLP) and convolutional neural network (CNN). Here, the device cycle-to-cycle variations extracted from experiments were taken into consideration in simulation. The results reveal that training with the diffusive nonvolatile dual-mode feature, the pruned network could reduce up to 84.2% of synapses and also save up to 63% in power while improving the accuracy to ~99% for the Mixed National Institute of Standards and Technology database (MNIST) recognition task by avoiding overfitting. These computational advantages brought by TPT-RAM may become more prominent when training a larger neural network with automatic pruning.

Fig. 5. Implementation of bioplausible neural network pruning.

Fig. 5

(A) The online training with SCO-based TPT-RAM synapses is similar to the selective stabilization of synapse development in the human brain. To start, all the synapses in the neural network are in the diffusive mode. During training, if one synaptic device is frequently updated and the conductance reaches a certain threshold, then it is electroformed into the nonvolatile mode (which implies that the synaptic connection is stabilized). Otherwise, the device conductance spontaneously decays back to almost zero, and the connection is trimmed. (B) The recognition error rate is reduced in the pruned neural networks (blue and red) by avoiding overfitting. The highest accuracy achieved is about 99% for the Mixed National Institute of Standards and Technology database (MNIST) recognition. (C) The simulated synaptic weight distribution after training. The pruning rates are 73.8 and 84.2% for the MLP and CNN, respectively. In addition, the synaptic power consumption of the network is reduced by 38 and 63% for the pruned MLP and CNN, respectively.

DISCUSSION

To sum up, we demonstrated TPT-RAM as a new type of memristive synapse relying on the topotactic phase transition in SCO. The unique oxygen migrations along the highly ordered OVCs led to excellent analog switching characteristics with a much reduced cycle-to-cycle variability of ~0.9% and a device-to-device variability of ~4.9%, the low operation voltage of 0.8 V, and a fast speed below 100 ns. DFT calculations and KMC simulations further confirmed the resistive switching mechanism consistency with the measured device electrical characteristics. These results demonstrated the significance of controlling the ion migration paths to improve the uniformity of RRAM, which is beneficial to guide the optimization of future neuromorphic devices. For future integration with silicon transistors to build functional synaptic arrays based on TPT-RAM, new techniques such as remote epitaxy and sacrificial layer–assisted film transfer could be adopted (48, 49). Furthermore, the SCO-based synapse exhibited a unique diffusive nonvolatile dual mode, which was used to mimic the developing synapses of the human brain and implement neural network pruning during the online training, reducing up to 82.5% redundant synapses and improving the MNIST recognition accuracy to 99%. Our work points out a new direction to design and explore bioplausible analog switching memristive synapses for high-performance neuromorphic computing.

MATERIALS AND METHODS

Growth and characterization of the SCO thin films

Thirty-five–nanometer SCO and 17-nm SRO thin films were grown on a STO (001) and STO (110) substrate by using a reflection high-energy electron diffraction–assisted PLD system. The growth conditions were optimized at the temperature of 750°C with an oxygen environment of 100 mtorr. The laser energy (KrF, λ = 248 nm) was fixed at 1.2 J/cm2 with the repetition rate of 2 Hz. After the film growth, the samples were cooled down to room temperature at the cooling rate of 7°C/min in the oxygen atmosphere of 100 mtorr. Each sample thickness was controlled by the growth time, and the crystalline structures of the thin films were characterized by XRD and reciprocal space mapping. The atomic structures of the SCO films were characterized using an ARM 200CF (JEOL, Tokyo, Japan) transmission electron microscope.

Device fabrication and characterization

A 20-nm isolation dielectric Al2O3 layer was formed by atomic layer deposition, and contact vias were open with the size from 4 × 4 to 100 × 100 μm2, which defined the active device size. The Au (100 nm in thickness) electrodes with the size of 100 × 100 μm2 were deposited on top of the vias by a magnetron sputtering method. The electrical measurements were performed using a semiconductor parameter analyzer (Agilent B1500), a pulse generator (Agilent 81110A), and a switch matrix (Keithley 707).

Neural network simulation

An MLP of 784 × 100 × 10 and a typical CNN LeNet-5 (50) were used to demonstrate neural network pruning. LeNet-5, which consists of three convolution layers, two pooling layers, a fully connected layer, and a radial base function (RBF) layer, is simulated in this work. The input is an image of 32 × 32 pixels. The first convolutional layer (C1) measures 28 × 28 × 6 after convolution with a kernel of 1 × 5 × 5 × 6. The result is subsampled by a pooling layer (S2) using a 2 × 2 average pooling with a sliding stride of 2 and then passed through a sigmoid function. The second convolutional layer (C3) measures 10 × 10 × 6 after convolution with a kernel of 6 × 5 × 5 × 16. Another subsampling layer (S4) subsequently formed in a similar way as C1 and S2. The third convolutional layer (C5) measures 1 × 1 × 120 after convolution with a kernel of 16 × 5 × 5 × 120. Then, the outputs are fed into the fully connected layer (F6), which has 84 neurons with tanh activation function (F6). Last, the output layer is composed of 10 Euclidean RBF units for each class.

In the network simulations, a differential pair of two SCO (110)pc–based TPT-RAM devices was used to represent one synaptic weight. The neural networks were trained with standard backpropagation and stochastic gradient descent algorithms. The batch size was 200. The learning rate was 0.01. The MLP was trained for 10 epochs, and the CNN was trained for 20 epochs. Initially, all the synaptic devices were in the diffusive mode, and the measured curves in Fig. 4B were used for the simulation. The devices exhibited ~100 conductance levels before forming and were programmed with a write pulse of 3 V/10 ms (interval of 10 ms). To mimic the learning process of human, a series of rest stages were used in the training process. The frequency of rest was once an epoch for MLP and once every two epochs for CNN. In the rest stage, there were no learning tasks, and the synapses that were frequently updated would be electroformed and turned into the nonvolatile state eventually (representing stabilized synaptic connections), while other irrelevant synaptic connections that were still in the diffusive mode would be automatically decayed (fig. S13) to realize the pruning function. The remaining stabilized synaptic connections after pruning were simulated with a pulse of 1 V/1 μs (interval of 1 μs) (Fig. 3B) for neural network training. The read voltage was 0.2 V, and the weights were normalized in the range of (−1,1). The cycle-to-cycle variation extracted from the experimental data and a noise model obeying Gaussian distribution were included in the simulation. The classification accuracies of the network were measured on the test dataset after each rest stage. The pruning rates were calculated as the ratio of the zero synaptic weights, specifically, the number of synapses whose weights are less than 1 × 10−6 divide the total number of synapses. The synaptic power consumption of the network is estimated by Psynapse = (Vinput)2 × Gsynapse.

Acknowledgments

Funding: This work was, in part, supported by the National Natural Science Foundation of China (91964104, 61974081, 51872155, and 52025024) and the National Basic Research Program of China (2016YFA0301004). Author contributions: X.M., J.T., Y.L., and P.Y. conceived and designed the experiments. Y.L., Y. Zhou, and P.Y. contributed to the thin-film growth and characterizations. W.S., H.W., and H.Q. contributed to the transmission electron microscopy analysis. X.M. and Y. Zhong performed the experiments and data analysis. S.Y., W.L., F.X., B.G., and M.X. performed the device modeling and simulation. Q.Z. carried out the neural network simulations. X.M. and J.T. wrote the paper. All authors discussed the results and commented on the manuscript. J.T. and P.Y. supervised the project. Competing interests: The authors declare that they have no competing interests. Data and materials availability: All data needed to evaluate the conclusions in the paper are present in the paper and/or the Supplementary Materials. Additional data related to this paper may be requested from the authors.

SUPPLEMENTARY MATERIALS

Supplementary material for this article is available at http://advances.sciencemag.org/cgi/content/full/7/29/eabh0648/DC1

REFERENCES AND NOTES

  • 1.Zidan M. A., Strachan J. P., Lu W. D., The future of electronics based on memristive systems. Nat. Electron. 1, 22–29 (2018). [Google Scholar]
  • 2.Zhang W., Gao B., Tang J., Yao P., Yu S., Chang M.-F., Yoo H.-J., Qian H., Wu H., Neuro-inspired computing chips. Nat. Electron. 3, 371–382 (2020). [Google Scholar]
  • 3.H. Wu, P. Yao, B. Gao, W. Wu, Q. Zhang, W. Zhang, N. Deng, D. Wu, H.-S. P. Wong, S. Yu, H. Qian, Device and circuit optimization of RRAM for neuromorphic computing, in Proceedings of the 2017 IEEE International Electron Devices Meeting (IEDM) (IEEE, 2017), pp. 11.5.1–11.5.4. [Google Scholar]
  • 4.Ielmini D., Resistive switching memories based on metal oxides: Mechanisms, reliability and scaling. Semicond. Sci. Technol. 31, 063002 (2016). [Google Scholar]
  • 5.Wong H.-S. P., Raoux S., Kim S., Liang J., Reifenberg J. P., Rajendran B., Asheghi M., Goodson K. E., Phase change memory. Proc. IEEE 98, 2201–2227 (2010). [Google Scholar]
  • 6.Wang Z., Wu H., Burr G. W., Hwang C. S., Wang K. L., Xia Q., Yang J. J., Resistive switching materials for information processing. Nat. Rev. Mater. 5, 173–195 (2020). [Google Scholar]
  • 7.Yao P., Wu H., Gao B., Tang J., Zhang Q., Zhang W., Yang J. J., Qian H., Fully hardware-implemented memristor convolutional neural network. Nature 577, 641–646 (2020). [DOI] [PubMed] [Google Scholar]
  • 8.Li X., Tang J., Zhang Q., Gao B., Yang J. J., Song S., Wu W., Zhang W., Yao P., Deng N., Deng L., Xie Y., Qian H., Wu H., Power-efficient neural network with artificial dendrites. Nat. Nanotechnol. 15, 776–782 (2020). [DOI] [PubMed] [Google Scholar]
  • 9.Xi Y., Gao B., Tang J., Chen A., Chang M. F., Hu X. S., Van Der Spiegel J., Qian H., Wu H., In-memory learning with analog resistive switching memory: A review and perspective. Proc. IEEE 109, 14–42 (2021). [Google Scholar]
  • 10.Yu S., Neuro-inspired computing with emerging nonvolatile memorys. Proc. IEEE 106, 260–285 (2018). [Google Scholar]
  • 11.B. Gao, H. Wu, J. Kang, H. Yu, H. Qian, Oxide-based analog synapse: Physical modeling, experimental characterization, and optimization, in Proceedings of the 2016 IEEE International Electron Devices Meeting (IEDM) (IEEE, 2016), pp. 7.3.1–7.3.4. [Google Scholar]
  • 12.Tang J., Yuan F., Shen X., Wang Z., Rao M., He Y., Sun Y., Li X., Zhang W., Li Y., Gao B., Qian H., Bi G., Song S., Yang J. J., Wu H., Bridging biological and artificial neural networks with emerging neuromorphic devices: Fundamentals, progress, and challenges. Adv. Mater. 31, 1902761 (2019). [DOI] [PubMed] [Google Scholar]
  • 13.Fang Y., Yu Z., Wang Z., Zhang T., Yang Y., Cai Y., Huang R., Improvement of HfOx-Based RRAM device variation by inserting ALD TiN buffer layer. IEEE Electron Dev. Lett. 39, 819–822 (2018). [Google Scholar]
  • 14.Jiang R., Han Z., Du X., Reliability/uniformity improvement induced by an ultrathin TiO2 insertion in Ti/HfO2/Pt resistive switching memories. Microelectron. Reliab. 63, 37–41 (2016). [Google Scholar]
  • 15.Choi S., Tan S. H., Li Z., Kim Y., Choi C., Chen P.-Y., Yeon H., Yu S., Kim J., SiGe epitaxial memory for neuromorphic computing with reproducible high performance based on engineered dislocations. Nat. Mater. 17, 335–340 (2018). [DOI] [PubMed] [Google Scholar]
  • 16.Zhang F., Zhang H., Krylyuk S., Milligan C. A., Zhu Y., Zemlyanov D. Y., Bendersky L. A., Burton B. P., Davydov A. V., Appenzeller J., Electric-field induced structural transition in vertical MoTe2- and Mo1–xWxTe2-based resistive memories. Nat. Mater. 18, 55–61 (2019). [DOI] [PubMed] [Google Scholar]
  • 17.Tambunan O. T., Parwanta K. J., Acharya S. K., Lee B. W., Jung C. U., Kim Y. S., Park B. H., Jeong H., Park J.-Y., Cho M. R., Park Y. D., Choi W. S., Kim D. W., Jin H., Lee S., Song S. J., Kang S.-J., Kim M., Hwang C. S., Resistance switching in epitaxial SrCoOx thin films. Appl. Phys. Lett. 105, 063507 (2014). [Google Scholar]
  • 18.Li H.-B., Lu N., Zhang Q., Wang Y., Feng D., Chen T., Yang S., Duan Z., Li Z., Shi Y., Wang W., Wang W.-H., Jin K., Liu H., Ma J., Gu L., Nan C., Yu P., Electric-field control of ferromagnetism through oxygen ion gating. Nat. Commun. 8, 2156 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Nallagatla V. R., Kim J., Lee K., Chae S. C., Hwang C. S., Jung C. U., Complementary resistive switching and synaptic-like memory behavior in an epitaxial SrFeO2.5 thin film through oriented oxygen-vacancy channels. ACS Appl. Mater. Interfaces 12, 41740–41748 (2020). [DOI] [PubMed] [Google Scholar]
  • 20.Nallagatla V. R., Heisig T., Baeumer C., Feyer V., Jugovac M., Zamborlini G., Schneider C. M., Waser R., Kim M., Jung C. U., Dittmann R., Topotactic phase transition driving memristive behavior. Adv. Mater. 31, 1903391 (2019). [DOI] [PubMed] [Google Scholar]
  • 21.Tian J., Wu H., Fan Z., Zhang Y., Pennycook S. J., Zheng D., Tan Z., Guo H., Yu P., Lu X., Zhou G., Gao X., Liu J. M., Nanoscale topotactic phase transformation in SrFeOx epitaxial thin films for high-density resistive switching memory. Adv. Mater. 31, 1903679 (2019). [DOI] [PubMed] [Google Scholar]
  • 22.Mitra C., Meyer T., Lee H. N., Reboredo F. A., Oxygen diffusion pathways in brownmillerite SrCoO2.5: Influence of structure and chemical potential. J. Chem. Phys. 141, 084710 (2014). [DOI] [PubMed] [Google Scholar]
  • 23.Yoon J. H., Han J. H., Jung J. S., Jeon W., Kim G. H., Song S. J., Seok J. Y., Yoon K. J., Lee M. H., Hwang C. S., Highly improved uniformity in the resistive switching parameters of TiO2 thin films by inserting Ru nanodots. Adv. Mater. 25, 1987–1992 (2013). [DOI] [PubMed] [Google Scholar]
  • 24.Acharya S. K., Nallagatla R. V., Togibasa O., Lee B. W., Liu C., Jung C. U., Park B. H., Park J. Y., Cho Y., Kim D. W., Jo J., Kwon D. H., Kim M., Hwang C. S., Chae S. C., Epitaxial brownmillerite oxide thin films for reliable switching memory. ACS Appl. Mater. Interfaces 8, 7902–7911 (2016). [DOI] [PubMed] [Google Scholar]
  • 25.Lu N., Zhang P., Zhang Q., Qiao R., He Q., Li H.-B., Wang Y., Guo J., Zhang D., Duan Z., Li Z., Wang M., Yang S., Yan M., Arenholz E., Zhou S., Yang W., Gu L., Nan C.-W., Wu J., Tokura Y., Yu P., Electric-field control of tri-state phase transformation with a selective dual-ion switch. Nature 546, 124–128 (2017). [DOI] [PubMed] [Google Scholar]
  • 26.Jeen H., Choi W. S., Biegalski M. D., Folkman C. M., Tung I. C., Fong D. D., Freeland J. W., Shin D., Ohta H., Chisholm M. F., Lee H. N., Reversible redox reactions in an epitaxially stabilized SrCoOx oxygen sponge. Nat. Mater. 12, 1057–1063 (2013). [DOI] [PubMed] [Google Scholar]
  • 27.Jeen H., Choi W. S., Freeland J. W., Ohta H., Jung C. U., Lee H. N., Topotactic phase transformation of the brownmillerite SrCoO2.5 to the perovskite SrCoO3-δ. Adv. Mater. 25, 3651–3656 (2013). [DOI] [PubMed] [Google Scholar]
  • 28.Zhang Q., He X., Shi J., Lu N., Li H., Yu Q., Zhang Z., Chen L.-Q., Morris B., Xu Q., Yu P., Gu L., Jin K., Nan C.-W., Atomic-resolution imaging of electrically induced oxygen vacancy migration and phase transformation in SrCoO2.5-σ. Nat. Commun. 8, 104 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Yao L., Inkinen S., van Dijken S., Direct observation of oxygen vacancy-driven structural and resistive phase transitions in La2/3Sr1/3MnO3. Nat. Commun. 8, 14544 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Ferreiro-Vila E., Blanco-Canosa S., Lucas del Pozo I., Vasili H. B., Magén C., Ibarra A., Rubio-Zuazo J., Castro G. R., Morellón L., Rivadulla F., Room-temperature AFM electric-field-induced topotactic transformation between perovskite and brownmillerite SrFeOx with sub-micrometer spatial resolution. Adv. Funct. Mater. 29, 1901984 (2019). [Google Scholar]
  • 31.Sun W., Gao B., Chi M., Xia Q., Yang J. J., Qian H., Wu H., Understanding memristive switching via in situ characterization and device modeling. Nat. Commun. 10, 3453 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Rossell M. D., Lebedev O. I., Van Tendeloo G., Hayashi N., Terashima T., Takano M., Structure of epitaxial Ca2Fe2O5 films deposited on different perovskite-type substrates. J. Appl. Phys. 95, 5145–5152 (2004). [Google Scholar]
  • 33.Young J., Rondinelli J. M., Crystal structure and electronic properties of bulk and thin film brownmillerite oxides. Phys. Rev. B Condens. Matter Mater. Phys. 92, 174111 (2015). [Google Scholar]
  • 34.Inoue S., Kawai M., Ichikawa N., Kageyama H., Paulus W., Shimakawa Y., Anisotropic oxygen diffusion at low temperature in perovskite-structure iron oxides. Nat. Chem. 2, 213–217 (2010). [DOI] [PubMed] [Google Scholar]
  • 35.Jo J., Nallagatlla V. R., Acharya S. K., Kang Y., Kim Y., Yoon S., Lee S., Baik H., Han S., Kim M., Jung C. U., Effects of the heterointerface on the growth characteristics of a brownmillerite SrFeO2.5 thin film grown on SrRuO3 and SrTiO3 perovskites. Sci. Rep. 10, 3807 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Saib F., Bellal B., Trari M., Preparation and characterization of the brownmillerite Sr2Co2O5 as novel photocatalyst in the hydrogen generation. Mater. Sci. Semicond. Process. 63, 122–126 (2017). [Google Scholar]
  • 37.F. Pan, V. Subramanian, Kinetic Monte Carlo simulation of resistive switching and filament growth in electrochemical RRAMs, in Proceedings of the 68th Device Research Conference (IEEE, 2010), pp. 255–256. [Google Scholar]
  • 38.B. Gao, H. Wu, W. Wu, X. Wang, P. Yao, Y. Xi, W. Zhang, N. Deng, P. Huang, X. Liu, J. Kang, H. Y. Chen, S. Yu, H. Qian, Modeling disorder effect of the oxygen vacancy distribution in filamentary analog RRAM for neuromorphic computing, in Proceedings of the 2017 IEEE International Electron Devices Meeting (IEDM) (IEEE, 2017), pp. 4.4.1–4.4.4. [Google Scholar]
  • 39.Hang C. Z., Wang C., Gao B., Chen H., Xu M.-H., Hao L., Lu H.-L., Sub-nanosecond pulse programming and device design strategy for analog resistive switching in HfOx-based resistive random access memory. Appl. Phys. Lett. 114, 112102 (2019). [Google Scholar]
  • 40.M. Jerry, P. Y. Chen, J. Zhang, P. Sharma, K. Ni, S. Yu, S. Datta, Ferroelectric FET analog synapse for acceleration of deep neural network training, in Proceedings of the 2017 IEEE International Electron Devices Meeting (IEDM) (IEEE, 2017), pp. 6.2.1–6.2.4. [Google Scholar]
  • 41.M. Zhao, H. Wu, B. Gao, Q. Zhang, W. Wu, S. Wang, Y. Xi, D. Wu, N. Deng, S. Yu, H. Chen, H. Qian, Investigation of statistical retention of filamentary analog RRAM for neuromophic computing, in Proceedings of the 2017 IEEE International Electron Devices Meeting (IEDM) (IEEE, 2017), pp. 39.4.1–39.4.4. [Google Scholar]
  • 42.Wang Z., Joshi S., Savel’ev S. E., Jiang H., Midya R., Lin P., Hu M., Ge N., Strachan J. P., Li Z., Wu Q., Barnell M., Li G. L., Xin H. L., Williams R. S., Xia Q., Yang J. J., Memristors with diffusive dynamics as synaptic emulators for neuromorphic computing. Nat. Mater. 16, 101–108 (2017). [DOI] [PubMed] [Google Scholar]
  • 43.Wang L., Yang Z., Bowden M. E., Du Y., Brownmillerite phase formation and evolution in epitaxial strontium ferrite heterostructures. Appl. Phys. Lett. 114, 231602 (2019). [Google Scholar]
  • 44.Yang J.-T., Ge C., Du J.-Y., Huang H.-Y., He M., Wang C., Bin Lu H., Yang G.-Z., Jin K.-J., Artificial synapses emulated by an electrolyte-gated tungsten-oxide transistor. Adv. Mater. 30, 1801548 (2018). [DOI] [PubMed] [Google Scholar]
  • 45.Changeux J.-P., Danchin A., Selective stabilisation of developing synapses as a mechanism for the specification of neuronal networks. Nature 264, 705–712 (1976). [DOI] [PubMed] [Google Scholar]
  • 46.S. Han, H. Mao, W. J. Dally, Deep compression: Compressing deep neural networks with pruning, trained quantization and Huffman coding, in Proceedings of the 2017 IEEE International Conference on Multimedia and Expo (ICME) (IEEE, 2017), pp. 667–672. [Google Scholar]
  • 47.Anwar S., Hwang K., Sung W., Structured pruning of deep convolutional neural networks. ACM J. Emerg. Technol. Comput. Syst. 13, 1–18 (2017). [Google Scholar]
  • 48.Kim Y., Cruz S. S., Lee K., Alawode B. O., Choi C., Song Y., Johnson J. M., Heidelberger C., Kong W., Choi S., Qiao K., Almansouri I., Fitzgerald E. A., Kong J., Kolpak A. M., Hwang J., Kim J., Remote epitaxy through graphene enables two-dimensional material-based layer transfer. Nature 544, 340–343 (2017). [DOI] [PubMed] [Google Scholar]
  • 49.Lu D., Baek D. J., Hong S. S., Kourkoutis L. F., Hikita Y., Hwang H. Y., Synthesis of freestanding single-crystal perovskite films and heterostructures by etching of sacrificial water-soluble layers. Nat. Mater. 15, 1255–1260 (2016). [DOI] [PubMed] [Google Scholar]
  • 50.LeCun Y., Bottou L., Bengio Y., Haffner P., Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998). [Google Scholar]
  • 51.QuantumATK version Q-2019.12, Synopsys QuantumATK; www.synopsys.com/silicon/quantumatk.html.
  • 52.Yeon H., Lin P., Choi C., Tan S. H., Park Y., Lee D., Lee J., Xu F., Gao B., Wu H., Qian H., Nie Y., Kim S., Kim J., Alloying conducting channels for reliable neuromorphic computing. Nat. Nanotechnol. 15, 574–579 (2020). [DOI] [PubMed] [Google Scholar]
  • 53.Chen S., Mahmoodi M. R., Shi Y., Mahata C., Yuan B., Liang X., Wen C., Hui F., Akinwande D., Strukov D. B., Lanza M., Wafer-scale integration of two-dimensional materials in high-density memristive crossbar arrays for artificial neural networks. Nat. Electron. 3, 638–645 (2020). [Google Scholar]
  • 54.Yang R., Li H., Smithe K. K. H., Kim T. R., Okabe K., Pop E., Fan J. A., Wong H.-P., Ternary content-addressable memory with MoS2 transistors for massively parallel data search. Nat. Electron. 2, 108–114 (2019). [Google Scholar]
  • 55.S. Park, A. Sheri, J. Kim, J. Noh, J. Jang, M. Jeon, B. Lee, B. R. Lee, B. H. Lee, H. Hwang, Neuromorphic speech systems using advanced ReRAM-based synapse, in Proceedings of the 2013 EEE International Electron Devices Meeting (IEDM) (IEEE, 2013), pp. 625–628. [Google Scholar]
  • 56.Ding K., Wang J., Zhou Y., Tian H., Lu L., Mazzarello R., Jia C., Zhang W., Rao F., Ma E., Phase-change heterostructure enables ultralow noise and drift for memory operation. Science 366, 210–215 (2019). [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material for this article is available at http://advances.sciencemag.org/cgi/content/full/7/29/eabh0648/DC1


Articles from Science Advances are provided here courtesy of American Association for the Advancement of Science

RESOURCES