Abstract
A neuromorphic visual system integrating optoelectronic synapses to perform the in‐sensor computing is triggering a revolution due to the reduction of latency and energy consumption. Here it is demonstrated that the dwell time of photon‐generated carriers in the space‐charge region can be effectively extended by embedding a potential well on the shoulder of Schottky energy barrier. It permits the nonlinear interaction of photocurrents stimulated by spatiotemporal optical signals, which is necessary for in‐sensor reservoir computing (RC). The machine vision with the sensor reservoir constituted by designed self‐powered Au/P(VDF‐TrFE)/Cs2AgBiBr6/ITO devices is competent for both static and dynamic vision tasks. It shows an accuracy of 99.97% for face classification and 100% for dynamic vehicle flow recognition. The in‐sensor RC system takes advantage of near‐zero energy consumption in the reservoir, resulting in decades‐time lower training costs than a conventional neural network. This work paves the way for ultralow‐power machine vision using photonic devices.
Keywords: Cs2AgBiBr6 , in‐sensors, lead‐free double perovskites, machine vision, reservoir
In this work, photovoltaic devices are innovatively used as self‐powered reservoirs. Ultralow‐power machine vision is achieved by the self‐powered sensor reservoir with Cs2AgBiBr6 photonic devices. Both image processing and dynamic video analysis are energy‐efficiently achieved in this in‐sensor reservoir computing system. The accuracy is 99.67% for face classification and 100% for dynamic vehicle flow recognition.

1. Introduction
Machine vision mainly conducts heavy matrix multiplication for image‐processing tasks, such as pattern recognition and object detection.[ 1 ] In a traditional machine vision system, visual information is captured by an analog camera and converted into digital signals for memory units and afterward computing units.[ 2 , 3 ] The separation of processor and memory in the traditional computing unit based on a von Neumann architecture causes huge latency and energy consumption during the matrix‐multiplication processing,[ 4 , 5 , 6 ] which hinders the delay‐sensitive applications such as driverless vehicles, robotics, or industrial manufacturing. Replacing the von Neumann architecture with an in‐memory computing technology is the most convenient strategy.[ 7 , 8 ] But it suffers additional digital‐to‐analog conversion (DAC) due to the analog processing in the In‐memory artificial intelligence (AI) chip (Figure 1a,c). Furthermore, the honest transmission of redundant data from camera causes high power consumption.
Figure 1.

Schematics of machine vision systems. a) A traditional machine vision system where visual information is captured by an analog camera, converted into digital/analog signals for memory unit/afterward in‐memory AI chip. b) A bionic visual system by an in‐sensor RC. The full‐analog processing and reduced data movement result in a low latency and high energy efficiency. c) The in‐memory AI chip with a high‐density memristor crossbar array.
It gives a better solution in human visual system. The sensory neurons detect analog optical signals and perform a first‐stage image processing in which the amount of data is greatly reduced before the following visual signal processing in visual cortex of the human brain.[ 9 ] The full‐analog processing and reduced data movement result in a low latency and high energy efficiency (Figure 1b). A neuromorphic visual system integrating optoelectronic synaptic devices to form the in‐sensor computing is newly proposed[ 9 , 10 ] and triggers much attention.[ 1 , 2 , 3 , 11 , 12 , 13 , 14 , 15 , 16 ] It is experimentally demonstrated that the visual pre‐processing by a photoconductive synaptic device array can improve the processing efficiency and image recognition rate.[ 1 , 9 , 12 , 14 ] Mennel et al. show that both supervised and unsupervised learning algorithms can be performed in photovoltaic sensors with an ultrahigh throughput of 20 million bins per second.[ 3 ] Very recently, Sun et al. reported that the reservoir algorithm can also be achieved based on photoconductive sensors.[ 17 ] Thus, the reservoir computing (RC) opens a way for high energy‐efficient machine vision because it can efficiently process complex and temporal data with low‐training cost, taking advantage of that only the readout function needs to be trained.[ 18 , 19 , 20 , 21 ] Among these optoelectronic synapses, the photocurrent other than photoconductance is preferred to simulate the synaptic weight because the photovoltaic work mode makes the photocurrent device free of energy consumption. Aiming the high energy efficiency, using self‐powered photovoltaic devices to constitute the in‐sensor RC is extremely appealing, but not realized yet due to the transient characteristic of photocurrents.
In this work, an ultralow‐power machine vision is implemented based on the in‐sensor RC constituted by self‐powered Au/P(VDF‐TrFE)/Cs2AgBiBr6/ITO devices. The designed band alignment with a potential well on the shoulder of Schottky energy barrier greatly extends the dwell time of photon‐generated carriers in the space‐charge region, permitting the nonlinear interaction of photocurrents stimulated by spatiotemporal optical signals. This mechanism is unambiguously confirmed by the polarization‐modulated coupling strength between optoelectronic responses. We demonstrate both image processing, such as face classification, and dynamic video analysis, such as vehicle flow detection, can be energy‐efficiently performed by the photocurrent‐encoded in‐sensor RC. It shows an accuracy of 99.97% to classify face images with decades‐lower training cost than conventional neural networks and an accuracy of 100% to classify orientation of dynamic vehicle flow.
2. Results and Discussion
Organic‐inorganic hybrid halide perovskites (OHPs, CH3NH3PbX3) are widely used in optoelectronic devices due to its excellent optoelectronic properties. However, the intrinsic instability and environmental unfriendliness of OHPs greatly limit their further commercial application.[ 22 , 23 ] All‐inorganic lead‐free double perovskites, especially Cs2AgBiBr6, have become one of the most promising alternatives to OHPs for optoelectronics[ 24 ] due to their advantages of environmental stability, low toxicity, and simple solution‐processing method,[ 25 , 26 ] allowing for applications requiring large area, low cost or mechanical flexibility. In addition, ferroelectric materials are a kind of functional materials with tunable spontaneous polarization by external electric field[ 27 , 28 ] and are always adopted in synaptic transistors as gate insulators to control the conductance variations.[ 29 , 30 , 31 ]
2.1. Device Structure
The photonic synapse in this work is designed for directly receiving optical signals and giving spatiotemporal‐linked responses, which simulates the function of photoreceptors and following synapses in human visual system (Figure 2a). A simple structure combining a photosensitive layer of Cs2AgBiBr6 and ferroelectric polymer layer was fabricated with top Au electrode and bottom ITO glass electrode to form the bionic photonic synapse (Figure 2b). The Cs2AgBiBr6/P(VDF‐TrFE) heterostructure with clear interface and undamaged phase was confirmed by using scanning electron microscopy (SEM) (Figure S1, Supporting Information) and X‐ray diffraction (XRD) spectroscopy (Figure S2, Supporting Information). The UV–vis absorption spectra show that Cs2AgBiBr6 has an absorption peak around 445 nm, and the additional layer of P(VDF‐TrFE) doesn't affect its absorption characteristic (Figure S3, Supporting Information). Thus, optical pulses with a wavelength of 445 nm were used to testify the device functions.
Figure 2.

Tunable EPSC in Cs2AgBiBr6 photonic synapses. a) Schematic diagrams of human visual system. b) The diagram of the designed photonic synapse. Inset shows the crystal structure of Cs2AgBiBr6. c) The band energy alignment of Au, P(VDF‐TrFE), Cs2AgBiBr6, and ITO. d) The band diagram of the Au/Cs2AgBiBr6/ITO device under optical stimulus. e) EPSC triggered by 10 optical pulses on the Au/Cs2AgBiBr6/ITO device. f) The band diagram of the Au/P(VDF‐TrFE)/Cs2AgBiBr6/ITO device under optical stimulus. g) EPSC triggered by 10 optical pulses on the Au/P(VDF‐TrFE)/Cs2AgBiBr6/ITO device. h) The evolution of EPSC coupling for three polarization states that is up after negative pooling, random for the pristine film, and down after positive pooling, respectively.
2.2. Tunable Photocurrents in Cs2AgBiBr6 Photonic Synapses
The ferroelectric polymer layer plays a crucial role in the modulation of optoelectronic response in the Cs2AgBiBr6 photonic synapse. The band energy alignment of Au, P(VDF‐TrFE), Cs2AgBiBr6, and ITO materials obtained from literatures[ 32 , 33 ] are demonstrated in Figure 2c. The work function of 5.1 eV in Au electrode and 4.7 eV in ITO electrode facilitate the Schottky contact at Au/n‐type Cs2AgBiBr6 interface and almost ohmic contact at n‐type Cs2AgBiBr6/ITO interface. Without the P(VDF‐TrFE) layer, a direct Schottky barrier at the Au side (Figure 2d) permits photocurrent through the device when it is under optical pulses, and the photocurrent fade away rapidly after removing the light pulses (Figure 2e). This photocurrent represents excitatory post‐synaptic current (EPSC) as we treat our device as a synapse. The EPSC in the Au/Cs2AgBiBr6/ITO device originates from the separation of photon‐generated carriers by the built‐in electric field in the Schottky barrier. When the source of photon‐generated carriers is shut down with the turning off of optical pulse, the built‐in electric field could remove these residual carriers away and the EPSC disappears. There is slight coupling between ESPC under ten successive optical pulses with a width of 100 ms, power density of 36.6 µW cm−2, and an internal span of 100 ms (Figure 2e). When a P(VDF‐TrFE) layer is introduced between the Au electrode and Cs2AgBiBr6 layer, the high binding energy of the valence electron states in P(VDF‐TrFE) layer gives an energy potential well for hole carriers at the P(VDF‐TrFE)/Cs2AgBiBr6 interface (Figure 2f). This energy potential well makes the photon‐generated hole carriers difficult to migrate to the Au electrode, which weakens the EPSC under the optical pulse, but much extends the lifetime of EPSC after removing the optical stimulus. Furthermore, a strong coupling between EPSC is observed and the amplitude of EPSC continually strengthened under the ten successive optical pulses (Figure 2g). The EPSC coupling can be quantified by the facilitation factor (An/A1), where An is the amplitude of EPSC under the nth optical pulse. The amplitude of EPSC and their coupling strength can be tuned by altering the ferroelectric polarization orientation through modulating the shape of the energy potential well (Figures S4 and S5, Supporting Information). The EPSC coupling for three polarization states, that is up after negative pooling, random for the pristine film, and down after positive pooling, are presented in Figure 2h, from which one can see that the polarization reversal induces a variation of 234.56% in the EPSC coupling. It is noted that the positive relationship between the coupling strength of optoelectronic response and the depth of energy potential well which is modulated by polarization further confirms the validity of the mechanism (Note S2, Supporting Information). The tunable EPSC enables Cs2AgBiBr6 photonic synapse potential for the rapid In‐sensor computing.[ 2 , 3 ]
2.3. The Nonlinear and Spatiotemporal‐Linked EPSC and Device Variation
Besides the tunability, the long‐lifetime EPSC itself and their couplings are also appealing for optical‐involved information processing. The EPSC characteristics in Cs2AgBiBr6 photonic synapse are further explored. Under the stimulus of an optical pulse with a width of 100 ms and power density of 36.6 µW cm−2, the EPSC instantly reaches to ≈10.6 pA and then decreases gradually after removing the stimulus (Figure S6, Supporting Information). The amplitude of EPSC can be modulated by the width (Figure S7a,b, Supporting Information) and power density (Figure S7c,d, Supporting Information) of the optical pulses. With a constant power density of 36.6 µW cm−2, as the pulse width increases from 0.1 to 4.5 s, the EPSC increases from 10.9 to 145.3 pA, achieving nonlinear 1233% enhancement (Figure S7b, Supporting Information). With a constant pulse width of 100 ms, the EPSC increased 524% (from 10.1 to 63.1 pA) as the power density increased from 36.6 to 383.4 µW cm−2 (Figure S7d, Supporting Information).
Pair‐pulse facilitation (PPF) is one of the most representative behaviors of biological synapses. When two optical pulses are applied to the device sequentially, the EPSC value of the second optical pulse is higher than the first one because of the EPSC coupling. Figure 3a presents the PPF behavior stimulated by two optical pulses with a width of 100 ms, power density of 36.6 µW cm−2, and Δt of 100 ms. The PPF ratio can be modulated by the interval time (Δt) between the pair of optical pulses. As shown in Figure 3b, the PPF ratio (defined as A2/A1 × 100%) decreases from 152% to 112% as the Δt increases from 150 ms to 9 s. The experimental data (blue dots) could be well fitted by the double exponential decay function:[ 34 ]
| (1) |
where τ 1 = 0.16 s and τ 2 = 3.57 s are the characteristic relaxation time and C is the initial facilitation magnitude. Correspondingly, temporal plasticity such as spiking‐rate dependent plasticity (SRDP) and spike‐number dependent plasticity (SNDP) can be implemented by using our synaptic device. Figure 3c shows the EPSC triggered by 10 optical pulses with different frequencies of 5, 1.1, and 0.33 Hz, respectively. The EPSC nonlinearly depends on the spiking rate, and the higher rate triggers the larger EPSC. Similar nonlinear EPSC response with a spiking number is also observed (Figure 3d). The EPSC decay curves of SNDP (Figure 3e) can be well fitted by an exponential decay function[ 35 ]
| (2) |
where τ is the characteristic relaxation time of the decay process. Figure 3f shows the decay time (τ) increases from 5.44 to 11.77 s as the optical spiking number increases from 5 to 40. These nonlinear, spatiotemporal‐linked, and short‐term EPSC responses to either spiking width, power density, rate, or spiking number, make the device potential to process complex temporal information.
Figure 3.

The nonlinear and spatiotemporal‐linked EPSC and device variation. a) PPF behavior stimulated by a pair of optical pulses with the time interval (Δt) of 100 ms. b) Dependence of the PPF ratio (defined as A2/A1 × 100%) on the time interval (Δt). c) SRDP triggered by different rates of optical pulse. d) SNDP triggered by different numbers of optical pulses. e,f) Decay curves and corresponding decay time (τ) of SNDP in Figure 3d. g,h) The uniformity and robustness were investigated by evaluating the EPSC response from 28 random‐picked devices stimulated by 2 optical pulses and 5 optical pulses. All measurements were performed at 0 V bias.
Before the application demonstration, the uniformity and robustness of our Cs2AgBiBr6 photonic synapses were investigated by evaluating the EPSC responses of 28 random‐picked devices stimulated by 2 and 5 optical pulses, respectively, and each device was repeatedly inspected for 20 times. The amplitude of EPSC at 1 s (Figure 3g) was taken for statistical analyses. As demonstrated in Figure 3h, all 28 devices show good uniformity and robustness. The typical cycle‐to‐cycle variation is 0.078 for 2 stimuli and 0.075 for 5 stimuli, and the device‐to‐device variation is 0.221 for 2 stimuli and 0.159 for 5 stimuli. The decrease of variation for either cycle to cycle or device to device may attribute to the canceling effect of superimposed random variations.
2.4. In‐Sensor Reservoir Computing for Image Classification
RC is a simple and efficient brain‐like algorithm suitable for processing timing signals. In an RC system, a dynamic “reservoir” of synapses with short‐term memory maps complex timing signals into a new space, represented by the state of the node in the reservoir. Because only the weight of the output connection between the reservoir and the output layer needs to be trained, RC greatly reduces the training cost of the network. The necessary short‐term memory in RC can be simulated by the nonlinear and spatiotemporal‐linked EPSC responses that are triggered directly by optical signals. Thus, it is possible to build an In‐sensor RC system with Cs2AgBiBr6 photonic synapses.
Figure 4a shows a conceptual RC system, including input layer, dynamic reservoir, and output layer. The transient input of all synapses in reservoir at time t forms the input state u(t). With the dynamic evolutions of reservoir internal state x(t), the input u(t) can be successfully mapped to the high‐dimensional feature space as y(t). Then the high‐dimensional feature space y(t) is used as new inputs to the following network for processing. By encoding the space information to time‐series u(t), the network size for image processing is greatly reduced. For example, to classify gray‐scale face images with 28 columns × 35 rows pixels. A conventional neural network will cost 28 × 35 × n weights to recognize the topic one from n candidates. On the contrary, the RC system only needs 28 × n weights to implement the same recognition task by encoding 35 pixels in each column as time‐series u(t). Here 28 devices were selected to build the reservoir to map four gray‐scale face images (28 × 35 pixels) and a following 28 × 4 memristor network is simulated to accomplish the classification task.
Figure 4.

In‐sensor RC using Cs2AgBiBr6 photonic synapses. a) Schematic of a conceptual RC system, including input layer, dynamic reservoir, and output layer. b) Schematic of the RC system with optical pulse sequences as the inputs, the memristor reservoir, and a readout network. c) A concrete example for the process of face classification. d) EPSC evolutions of reservoir device one (D1) and device nine (D9) in Figure 4c.
Figure 4b illustrates the schematic of the RC system, consisting of input, reservoir, and readout. The optical pulse sequences are fed to the reservoir as inputs, which will cause a change in the EPSC values in the devices in the reservoir. A concrete example is shown in Figure 4c to explain the process of face classification. Four identification photos (ID photo) from East China Normal University are chosen and preloaded to 28 × 35 pixels size (Figure 5a). These ID photos are in grayscale and each pixel value ranges from 0 to 5 to reflect their gray level where the pure white color corresponds to 0 and pure dark color to 5, respectively. Pixels in the same column are serially encoded to one optical pulse sequence where the pixel value matches the optical pulse number.
Figure 5.

Face image classification by the In‐sensor RC system. a) Four ID photos were chosen for face recognition and preloaded to 980 pixels in 28 × 35 size. b) One group of normalized y(t) for four ID photos (class 1, class 2, class 3, and class 4). c) Weights of memristors in 28 × 4 array are updated during each training epoch. d–g) The readout function (f) value of the corresponding targeted neuron. h) 20 experimental y(t) spaces of class 1. i) Top panel: the recognition rate for classification of class 1. Both experimental y(t) spaces and simulated y(t) spaces with a random variation of 0.16 and 0.22 are involved to obtain the recognition rate. Bottom panel: The failed number for classification of class 1 in 10 240 tests.
In our devices, the EPSC decay curves of SRDP can be well fitted by the exponential decay function of Equation (2) (Figure S8, Supporting Information). Figure S8c, Supporting Information, shows the decay time (τ) increases from 11 s to 21.73 s as the spiking frequency of ten optical pulses increases from 0.2 to 8.33. The SRDP ratio (defined as A10/A1) increases from 130% to 301% with the pulse frequency increasing from 0.1 to 5.55 Hz and approaching saturation at 5 Hz (Figure S8d, Supporting Information). Thus, to fully utilize the coupling between stimulus, a frequency of 5 Hz (width of 100 ms and interval of 100 ms) are used to determine the gray level of each pixel. Considering the decay time of tens of seconds, a frequency of 1 Hz (1 s for each pixel) is used to handle these images with 35 pixels in each column. Thus, each pixel costs a period of 1 s in which each optical pulse survives 0.1 s followed by a cease span of 0.1 s. For example, as demonstrated in Figure 4c,d, there are no optical pulses during the period of 1 s if the pixel value is 0. There are three optical pulses at the first 0.6 s if the pixel value is 3, and there are five optical pulses during the whole 1 s if the pixel value is 5. All 28 optical pulse sequences are input to the 28 reservoir devices and the EPSC responses are recorded. Figure 4d shows EPSC evolutions of reservoir device one (D1) and device nine (D9) in Figure 4c. Because of the nonlinear EPSC coupling, different optical pulse sequences trigger unequable EPSC responses. The amplitude of EPSC at 35 s is collected from the 28 reservoir devices as the new feature space y(t). An activation function is used to further differentiate the intensity distribution in y(t) and converts y(t) to normalized voltage inputs for the following network (Note S4, Supporting Information).
Through our in‐sensor reservoir system, the 28 × 35 pixels information in each photo are abstracted to a new feature y(t) space with only 28 elements. One group of normalized y(t) for four ID photos (class 1, class 2, class 3, and class 4 in Figure 5a are shown in Figure 5b. Note that the abstraction from u(t) to y(t) is not a rigorous one‐two‐one process since it cannot be guaranteed that all 535 kinds of optical pulse sequence triggers distinguishable EPSC responses. To test the function of our in‐sensor reservoir system, a virtual 28 × 4 memristor network is trained and used for the face classification task. Following the flow chart in the right panel of Figure 4c, the training simulation is performed using the new y(t) inputs and a Manhattan update rule.[ 36 , 37 ] Ten groups of experimentally collected y(t) inputs for class 1 are used to train the readout memristor network. All weights of memristors in 28 × 4 arrays are initially randomly selected with a conductance range between 3 and 7 µS. Then all weights are updated during each training epoch (Figure 5c). Correspondingly, the value of readout function (f) of the targeted neuron continuously increases as the training epoch and exceeds others after 20 epochs (Figure 5d–g). To check the accuracy, 10 240 y(t) spaces are created by randomly picking experimental y i(t) that is not used during the training process, that is, from the other ten groups of experimentally collected y(t) inputs in Figure 5h. After each training epoch, all 10 240 y(t) spaces and simulated y(t) spaces with a random variation of 0.16 and 0.22 are inputted to the 28 × 4 memristor network to test the recognition rate. The results show that the in‐sensor reservoir system based on Cs2AgBiBr6 photonic synapses can achieve a satisfactory recognition rate of 99.97% for face classification which accords well with the simulation results with the variation of 0.16 and 0.22 (Figure 5i).
We have performed another training simulation on a readout memristor network with uniform initial weights. All weights of memristors in 28 × 4 arrays are initially programmed to the same conductance of 5 µS and updated during each training epoch (Figure S9e, Supporting Information). Interestingly, the value of readout function (f) of the targeted neuron equals to others at the initial, exceeds others after the first epoch, and continuously increases as the training epoch evolves (Figure S9a–d, Supporting Information). After each training epoch, all 10 240 y(t) spaces same as that in the case of random initial weights and simulated y(t) spaces with a random variation of 0.16 and 0.22 are inputted to the 28 × 4 memristor network to test the recognition rate. The results show that the in‐sensor reservoir system can achieve a satisfactory recognition rate of 99.67% for face classification (Figure S9f, Supporting Information), which approaches the result of 99.97% for the readout memristor network with random initial weights. These results imply that the accuracy is slightly dependent on the initial weights of the readout memristor network.
2.5. In‐Sensor Reservoir Computing for Detecting Vehicle Flow
By directly collecting the spatiotemporal optoelectronic signals as u(t), the In‐sensor RC is suitable for processing dynamic video information. A scene of road camera with in‐sensor RC for monitoring vehicle flow is designed. As demonstrated in Figure 6a,b, an array of 5 × 5 Cs2AgBiBr6 photonic synapses forms the sensor reservoir. The spatial vehicle information is optically inputted to the sensor reservoir by synchronizing vehicle pixels from the dynamic vision sensor (DSV) video (bottom panel in Figure 6a). Figure 6c shows how the reservoir array responds to the up‐to‐down vehicle flow. During the vehicle movement, all 25 Cs2AgBiBr6 photonic synapses are active and release EPSC once they receive the five optical stimuli that symbolize the position of vehicle pixels. Vehicle flows with different orientations give disparate spatiotemporal EPSC responses, resulting in distinguishable EPSC pattern.
Figure 6.

In‐sensor RC for detecting vehicle flow. a) Optical (top panel) and DSV (bottom panel) image at a crossroad. b) The sensor reservoir constituted by an array of 5 × 5 Cs2AgBiBr6 photonic synapses, in which the moving vehicle can be detected. c) A concrete example of the reservoir array responding to the up‐to‐down vehicle flow. The new y(t) spaces are inputted to the following 25 × 4 memristor array for classification. d–g) The readout function (f) value of the corresponding targeted neuron that connects to the outputs of the 25 × 4 memristor array. h,i) The weights of memristors in the 25 × 4 array before and after the 50 training epochs. j) The recognition rate for detecting vehicle flow. Both experimental y(t) spaces and simulated y(t) spaces with random variation of 0.16 are involved to obtain the recognition rate.
Following the flow chart in the right panel of Figure 4c, a virtual 25 × 4 memristor network is trained and used for the vehicle flow recognition task. Weights of memristors in 25 × 4 arrays are randomly programmed around conductance of 5 µS (Figure 6h) and updated during each training epoch. During the training process, the supervised f value continuously increases for the targeted y(t) inputs and decreases for other unintended y(t) inputs. After 50 times training epoch, the new‐arranged conductance weights array (Figure 6i) gives the maximum f value for each orientation neuron when the vehicle flows with the same orientation are inputted (Figure 6d–j). 20 vehicle flows with down orientation are experimentally inputted to this In‐sensor RC system after each training epoch to evaluate the accuracy. As shown in Figure 6j, a pleasant recognition rate of 100% after 24 training epochs are obtained, which is comparable with the simulation results with a variation of 0.16.
2.6. Stability and Energy Consumption
Cs2AgBiBr6 is one of the most promising alternatives to OHPs for optoelectronic devices due to their advantages of environmental stability. The stability of the Cs2AgBiBr6‐based photonic device was investigated by keeping in an ambient environment (the relative humidity was about 60% and the temperature was around 25 °C) for 8 weeks. Figure S10a, Supporting Information, shows the XRD patterns of Cs2AgBiBr6/P(VDF‐TrFE) hybrid film before and after eight week's storage, and no obvious changes were observed. The response to single optical pulse (power density and pulse width were 28.3 µW cm−2 and 100 ms, respectively) was almost the same with after 56 days’ storage, as illustrated in Figure S10b, Supporting Information (Inset was the EPSC triggered by the same input from day 1 to day 56). The results further confirm the good reproducibility and environmental stability of the Cs2AgBiBr6‐based photonic device.
A significant advantage of using the RC system is the reduction of network size and training cost. For example, in our in‐sensor RC system, the spatial information can be encoded to temporal signals so only a small network needs to be trained. For example, the 28 × 35 pixels information of ID photo in this work are abstracted to new feature y(t) space with only 28 elements. The following memristor network has 28 inputs for the 28 y(t) elements and 4 outputs for classifying four ID photos, 28 × 4 weights need to be trained. However, a conventional neural network will have 28 × 35 inputs for 28 × 35 pixels and 4 outputs for classifying four ID photos, 28 × 35 × 4 weights need to be trained.[ 36 , 37 ] On the bias of hardware size, both the energy consumption and training cost are decades time reduced. This saved energy value will increase very quickly if one or more hidden layers are involved.
Compared with other conductance‐based reservoir layers, the photovoltaic reservoir devices in this work not only avoid both the analog‐to‐digital conversion (ADC), from an analog camera to memory unit, and the DAC, from memory unit to the neural network hardware, but also free of energy consumption in itself. The energy consumption of per synaptic event can be calculated by V × I × t, where V is the measuring voltage, I is the EPSC of the device, and t is the optical pulse duration. Near‐zero energy is in principle consumed in our two‐terminal device, due to its voltage‐free photovoltaic (i.e., self‐powered) work mode. The comparation of energy cost per reservoir operation in the reservoir layer between reported reservoir systems is summarized in Table 1 . It demonstrates that our reservoir layer takes advantage of near‐zero energy consumption. The energy is consumed only in the following processes, such as current‐to‐voltage conversion and weights programming in the readout memristor network. Note that, when it processes spatial information, the spatial information is converted to temporal information by the reservoir layer to save the following hardware size,[ 1 ] which is time‐consuming. The reservoir system is very suitable for directly processing temporal information that has a similar time scale, such as trajectory analysis for car, steamer, or animals in our reservoir devices with a time scale of seconds.
Table 1.
Comparation of energy cost per reservoir operation in the reservoir layer between reported reservoir systems
| Device unit | mechanism | Classifier task | Energy cost per reservoir operation | Ref. |
|---|---|---|---|---|
| W/WOx/Pd | Resistive synapse | Digit classification | ≈ 1 pJ | [ 18 , 20 ] |
| Pt/Ag‐doped SiO2/Pd | Diffusive synapse | Digit classification | ≈ 1 nJ | [ 19 ] |
| Au/Cr/SnS/Cr/Au | Optoelectronic synapse | Language classification |
≈ 5 nJ for electrical ≈ 100 nJ for optical |
[ 17 ] |
| Au/P(VDF‐TrFE) /Cs2AgBiBr6/ITO | Photonic synapse | Face classification | 0 | This work |
3. Conclusion
In this work, we experimentally demonstrate that robust nonlinear interaction of photocurrents responding to spatiotemporal optical signals can be achieved by embedding a potential well on the shoulder of Schottky energy barrier in Au/P(VDF‐TrFE)/Cs2AgBiBr6/ITO device. The tunable photocurrent by the polarization reversal through external electric field makes the Cs2AgBiBr6 photonic synapses appealing for conventional In‐sensor computing.[ 3 ] Besides this, these nonlinear and spatiotemporal‐linked photocurrents triggered directly by optical signals are intently used to constitute an In‐sensor RC system. Both image processing (face classification) and dynamic video analysis (vehicle flow recognition) are energy‐efficiently achieved with high accuracy. Notably, near‐zero energy is in principle consumed in Cs2AgBiBr6 photonic synapses due to its voltage‐free photovoltaic work mode. Furthermore, by encoding the spatial information to temporal signals, only a much smaller network needs to be trained in the In‐sensor RC system. Thus, the proposed machine vision by self‐powered sensor RC based on Cs2AgBiBr6 photonic synapses takes significant advantage of low energy consumption.
4. Experimental Section
Materials
Cesium bromide (CsBr, 99.9%), silver bromide (AgBr, 98%), P(VDF‐TrFE) (70:30), and diethyl carbonate (99%) were purchased from Aladdin Company. Bismuth bromide (BiBr3, 99%) was obtained from Alfa Aesar. Dimethylsulfoxide (DMSO, 99.9%) was sourced from J&K company.
Preparation of Cs2AgBiBr6 Solution
CsBr (2 mmol, 426 mg), AgBr (1 mmol, 188 mg), and BiBr3 (1 mmol, 449 mg) were mixed in 2 mL DMSO to acquire 0.5 m Cs2AgBiBr6 precursor, and stirred at 70 °C for 2 h. A 0.22‐µm filter was used to filter the precursor solution to obtain a clear yellow solution.
Device Fabrication
Indium‐tin oxide (ITO) glass was first cleaned by ultrasonication for 20 min with deionized water, acetone, alcohol, and isopropyl alcohol, respectively, and dried with nitrogen gas for further use. The Cs2AgBiBr6 film was synthesized by a simple one‐step spin‐coated method and treated by a typical low‐pressure‐assisted (LPA) method according to previous reports.[ 38 , 39 ] First of all, 60 µL 0.5 m Cs2AgBiBr6 solution was spin‐coated on the ultraviolet‐ozone treated ITO substrate at a low speed of 500 rpm for 5 s, followed by a high speed of 3000 rpm for 60 s to get a transparent yellow film. Then, the film was transferred to a low‐pressure chamber for 10 min. Next, the Cs2AgBiBr6 film was annealed at 280 °C for 5 min. After cooling down to the room temperature, 70 µL 2.5 wt.% P(VDF‐TrFE) (70:30) in diethyl carbonate was spin‐coated on the fabricated Cs2AgBiBr6 film at a low speed of 500 rpm for 5 s, followed by a high speed of 4000 rpm for 30 s. The sample was annealed at 140 °C for 4 h to enhance the crystallinity of the ferroelectric hybrid film. At last, about 50 nm Au was deposited by thermal evaporation to form the Au/P(VDF‐TrFE)/Cs2AgBiBr6/ITO artificial photonic synapse device. The Au top electrodes were fabricated under very “soft” deposition conditions to minimize the ferroelectric polymer film damage (background vacuum of ≈10−4 mbar and evaporation vacuum of ≈10−3 mbar, distance between Au source and sample of ≈25 cm).
Characterization
XRD patterns of Cs2AgBiBr6 and the hybrid layer film were recorded on a diffractometer (PAN analytical EMPTREAN S3) equipped with Cu Kα radiation source (λ = 1.5406 Å). The UV–vis spectra were measured with an ultraviolet‐visible spectrophotometer (TU‐1901). The surface morphology SEM images were recorded via a SEM (Zeiss Gemini 450) while the cross‐sectional image was obtained using a transmission electron microscope (TEM, JEM‐2100F). All the photonic synapse characteristics of the device were measured with a self‐constructed photoelectric test system, which included a semiconductor parameter analyzer (Keithley 2636B), a laser at 445 nm wavelength, TTL‐controlled optical shutters, and a dual‐channel arbitrary function generator (Tektronix AFG3022C). An optical power meter (Field Max II) was employed to obtain the power density of optical pulses. The polarization voltage was applied to the top Au electrode while the ITO bottom electrode was ground. All the characteristic tests were performed under ambient atmosphere and at room temperature.
Face Recognition Method
The simulation for face recognition was built with MATLAB. Eventually, β was defined as 1000 A−1, and η equals to 10−7. Besides, the target value of the activation function was 0.5 for the right class and −0.5 for other wrong classes during training process.
Statistical Analysis
To evaluate the device variation, a waiting time of 5 min before each test were inputted to eliminate the influence of the previous stimulus. 2 optical pulses and 5 optical pulses were used to repeatedly stimuli the device 20 times, respectively. This variation test was performed on 28 devices. All beginnings of each stimulus were set to be the zero time and the value of EPSC at 1 s was taken for statistical analyses (Figure 3h). The blue and red dots shown in Figure 3h were the experiment results tested by our photonic devices, and the boxes contain values within the upper and lower quartiles (25%–75%), which reflect the central location and spread range of data distribution. Origin pro and Excel software were used for the analysis of the variation of 28 devices. The variation value (0.078 for 2 stimuli and 0.075 for 5 stimuli) of cycle to cycle was calculated by (max − min)/average).
Conflict of Interest
The authors declare no conflict of interest.
Supporting information
Supporting Information
Acknowledgements
J.L. and M.Y. contributed equally to this work. This work was supported by NSFC (No. 62174053, 61804055, and 62004204), Shanghai Science and Technology Innovation Action Plan (No. 19JC141670, 21JC1402000, and 21520714100), Open Research Projects of Zhejiang Lab (2021MD0AB03), Natural Science Foundation of Shanghai (No. 18ZR1410900), and the Fundamental Research Funds for the Central Universities. The authors thank ECNU Multifunctional Platform for Innovation (004 and 006) for technology support. ID photos of class 1, class 2, class 3, and class 4 used in this article were from authors J.L., Q.Z., H.P., and B.T. respectively. All of them declare the acknowledgement of these ID photos as research content in the article and agree to their publication and reproduction.
Lao J., Yan M., Tian B., Jiang C., Luo C., Xie Z., Zhu Q., Bao Z., Zhong N., Tang X., Sun L., Wu G., Wang J., Peng H., Chu J., Duan C., Ultralow‐Power Machine Vision with Self‐Powered Sensor Reservoir. Adv. Sci. 2022, 9, 2106092. 10.1002/advs.202106092
Contributor Information
Bobo Tian, Email: bbtian@ee.ecnu.edu.cn.
Hui Peng, Email: hpeng@ee.ecnu.edu.cn.
Chungang Duan, Email: cgduan@clpm.ecnu.edu.cn.
Data Availability Statement
The data that support the findings of this study are available from the corresponding author upon reasonable request.
References
- 1. Wang C.‐Y., Liang S.‐J., Wang S., Wang P., Li Z. a., Wang Z., Gao A., Pan C., Liu C., Liu J., Yang H., Liu X., Song W., Wang C., Cheng B., Wang X., Chen K., Wang Z., Watanabe K., Taniguchi T., Yang J. J., Miao F., Sci. Adv. 2020, 6, eaba6173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Yang C., Nature 2020, 579, 32.32132685 [Google Scholar]
- 3. Mennel L., Symonowicz J., Wachter S., Polyushkin D. K., Molina‐Mendoza A. J., Mueller T., Nature 2020, 579, 62. [DOI] [PubMed] [Google Scholar]
- 4. Manipatruni S., Nikonov D. E., Young I. A. J. N. P., Nat. Phys. 2018, 14, 338. [Google Scholar]
- 5. Liu C., Yan X., Song X., Ding S., Zhang D. W., Zhou P., Nat. Nanotechnol. 2018, 13, 404. [DOI] [PubMed] [Google Scholar]
- 6. Yang C. M., Chen T. C., Verma D., Li L. J., Liu B., Chang W. H., Lai C. S., Adv. Funct. Mater. 2020, 30, 2001598. [Google Scholar]
- 7. Yao P., Wu H., Gao B., Tang J., Zhang Q., Zhang W., Yang J. J., Qian H., Nature 2020, 577, 641. [DOI] [PubMed] [Google Scholar]
- 8. Cai F., Correll J. M., Lee S. H., Lim Y., Bothra V., Zhang Z., Flynn M. P., Lu W. D., Nat. Electron. 2019, 2, 290. [Google Scholar]
- 9. Zhou F., Zhou Z., Chen J., Choy T. H., Wang J., Zhang N., Lin Z., Yu S., Kang J., Wong H.‐S. P., Nat. Nanotechnol. 2019, 14, 776. [DOI] [PubMed] [Google Scholar]
- 10. Zhou F., Chai Y., Nat. Electron. 2020, 3, 664. [Google Scholar]
- 11. John R. A., Acharya J., Zhu C., Surendran A., Bose S. K., Chaturvedi A., Tiwari N., Gao Y., He Y., Zhang K. K., Nat. Commun. 2020, 11, 3211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Zhu Q.‐B., Li B., Yang D.‐D., Liu C., Feng S., Chen M.‐L., Sun Y., Tian Y.‐N., Su X., Wang X.‐M., Nat. Commun. 2021, 12, 1798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Yu J., Yang X., Gao G., Xiong Y., Wang Y., Han J., Chen Y., Zhang H., Sun Q., Wang Z. L., Sci. Adv. 2021, 7, eabd9117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Wang S., Wang C.‐Y., Wang P., Wang C., Li Z.‐A., Pan C., Dai Y., Gao A., Liu C., Liu J., Natl. Sci. Rev. 2021, 8, nwaa172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Wang T.‐Y., Meng J.‐L., Li Q.‐X., He Z.‐Y., Zhu H., Ji L., Sun Q.‐Q., Chen L., Zhang D. W., Nano Energy 2021, 89, 106291. [Google Scholar]
- 16. Feng G., Jiang J., Li Y., Xie D., Tian B., Wan Q., Adv. Funct. Mater. 2021, 31, 2104327. [Google Scholar]
- 17. Sun L., Wang Z., Jiang J., Kim Y., Joo B., Zheng S., Lee S., Yu W. J., Kong B.‐S., Yang H., Sci. Adv. 2021, 7, eabg1455. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Du C., Cai F., Zidan M. A., Ma W., Lee S. H., Lu W. D., Nat. Commun. 2017, 8, 2204. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Midya R., Wang Z., Asapu S., Zhang X., Rao M., Song W., Zhuo Y., Upadhyay N., Xia Q., Yang J. J., Adv. Intell. Syst. 2019, 1, 1900084. [Google Scholar]
- 20. Moon J., Ma W., Shin J. H., Cai F., Du C., Lee S. H., Lu W. D., Nat. Electron. 2019, 2, 480. [Google Scholar]
- 21. Zhong Y., Tang J., Li X., Gao B., Qian H., Wu H., Nat. Commun. 2021, 12, 408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Lee J. W., Kim D. H., Kim H. S., Seo S. W., Cho S. M., Park N. G., Adv. Energy Mater. 2015, 5, 1501310. [Google Scholar]
- 23. Conings B., Drijkoningen J., Gauquelin N., Babayigit A., D'Haen J., D'Olieslaeger L., Ethirajan A., Verbeeck J., Manca J., Mosconi E., Adv. Energy Mater. 2015, 5, 1500477. [Google Scholar]
- 24. Ning W., Gao F., Adv. Mater. 2019, 31, 1900326. [DOI] [PubMed] [Google Scholar]
- 25. Longo G., Mahesh S., Buizza L. R., Wright A. D., Ramadan A. J., Abdi‐Jalebi M., Nayak P. K., Herz L. M., Snaith H. J., ACS Energy Lett. 2020, 5, 2200. [Google Scholar]
- 26. Yang X., Chen Y., Liu P., Xiang H., Wang W., Ran R., Zhou W., Shao Z., Adv. Funct. Mater. 2020, 30, 2001557. [Google Scholar]
- 27. Tian B., Wang J., Fusil S., Liu Y., Zhao X., Sun S., Shen H., Lin T., Sun J., Duan C., Bibes M., Barthélémy A., Dkhil B., Garcia V., Meng X., Chu J., Nat. Commun. 2016, 7, 11502. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Wu G., Tian B., Liu L., Lv W., Wu S., Wang X., Chen Y., Li J., Wang Z., Wu S., Shen H., Lin T., Zhou P., Liu Q., Duan C., Zhang S., Meng X., Wu S., Hu W., Wang X., Chu J., Wang J., Nat. Electron. 2020, 3, 43. [Google Scholar]
- 29. Tian B., Liu L., Yan M., Wang J., Zhao Q., Zhong N., Xiang P., Sun L., Peng H., Shen H., Lin T., Dkhil B., Meng X., Chu J., Duan C., Adv. Electron. Mater. 2019, 5, 1800600. [Google Scholar]
- 30. Yan M., Zhu Q., Wang S., Ren Y., Feng G., Liu L., Peng H., He Y., Wang J., Zhou P., Meng X., Tang X., Chu J., Dkhil B., Tian B., Duan C., Adv. Electron. Mater. 2021, 7, 2001276. [Google Scholar]
- 31. Tian B., Zhong N., Duan C., Chin. Phys. B 2020, 29, 097701. [Google Scholar]
- 32. Duan C., Mei W. R., Hardy J., Ducharme S., Choi J., Dowben P. A., Europhys. Lett. 2003, 61, 81. [Google Scholar]
- 33. Kumar S., Hassan I., Regue M., Gonzalez‐Carrero S., Rattner E., Isaacs M. A., Eslava S., J. Mater. Chem. A 2021, 9, 12179. [Google Scholar]
- 34. Gong J., Yu H., Zhou X., Wei H., Ma M., Han H., Zhang S., Ni Y., Li Y., Xu W., Adv. Funct. Mater. 2020, 30, 2005413. [Google Scholar]
- 35. Luo Z.‐D., Xia X., Yang M.‐M., Wilson N. R., Gruverman A., Alexe M., ACS Nano 2019, 14, 746. [DOI] [PubMed] [Google Scholar]
- 36. Yao P., Wu H., Gao B., Eryilmaz S. B., Huang X., Zhang W., Zhang Q., Deng N., Shi L., Wong H.‐S. P., Nat. Commun. 2017, 8, 15199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Prezioso M., Merrikh‐Bayat F., Hoskins B., Adam G. C., Likharev K. K., Strukov D. B., Nature 2015, 521, 61. [DOI] [PubMed] [Google Scholar]
- 38. Igbari F., Wang R., Wang Z.‐K., Ma X.‐J., Wang Q., Wang K.‐L., Zhang Y., Liao L.‐S., Yang Y., Nano Lett. 2019, 19, 2066. [DOI] [PubMed] [Google Scholar]
- 39. Lao J., Xu W., Jiang C., Zhong N., Tian B., Lin H., Luo C., Travas‐sejdic J., Peng H., Duan C., J. Mater. Chem. C 2021, 9, 5706. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supporting Information
Data Availability Statement
The data that support the findings of this study are available from the corresponding author upon reasonable request.
