Abstract
Optical logic operations lie at the heart of optical computing, and they enable many applications such as ultrahigh-speed information processing. However, the reported optical logic gates rely heavily on the precise control of input light signals, including their phase difference, polarization, and intensity and the size of the incident beams. Due to the complexity and difficulty in these precise controls, the two output optical logic states may suffer from an inherent instability and a low contrast ratio of intensity. Moreover, the miniaturization of optical logic gates becomes difficult if the extra bulky apparatus for these controls is considered. As such, it is desirable to get rid of these complicated controls and to achieve full logic functionality in a compact photonic system. Such a goal remains challenging. Here, we introduce a simple yet universal design strategy, capable of using plane waves as the incident signal, to perform optical logic operations via a diffractive neural network. Physically, the incident plane wave is first spatially encoded by a specific logic operation at the input layer and further decoded through the hidden layers, namely, a compound Huygens’ metasurface. That is, the judiciously designed metasurface scatters the encoded light into one of two small designated areas at the output layer, which provides the information of output logic states. Importantly, after training of the diffractive neural network, all seven basic types of optical logic operations can be realized by the same metasurface. As a conceptual illustration, three logic operations (NOT, OR, and AND) are experimentally demonstrated at microwave frequencies.
Subject terms: Metamaterials; Electronics, photonics and device physics
Metasurface provides clarity for optical logic
Carefully designed diffractive metasurfaces enable accurate logical operations for optical computing. Using photons instead of electrons to convey information could reduce energy consumption and increase computing speed. However, the properties of the light must be controlled with very high precision in order to clearly distinguish between ‘0’ and ‘1’ logical states, and this is difficult to do without adding extra bulky components. Now, Hongsheng Chen at Zhejiang University in China and co-workers have used diffractive metasurfaces (carefully structured optical components designed from the atomic scale upwards) to realize all seven of the basic logic operations used in computation. Their components, which can be shrunk to the chip scale, clearly direct light into one of two designated areas indicating ‘0’ or ‘1’, even when the properties of the input light have not been optimally controlled.
Introduction
Optical computing, which operates with photons instead of electrons, is becoming increasingly important, since it promises to increase the efficiency of information processing beyond traditional electron-based computing1. Owing to its unique features of signal propagation at the speed of light, low power consumption, and the capability of parallel processing2–5, optical computing holds huge potential in many practical scenarios, particularly those involving high-throughput and on-the-fly data processing, such as augmented reality and autonomous driving6. The logic operation lies at the heart of all computers7. Correspondingly, optical logic gates8–13, including plasmonic logic gates, are essential for the further exploration and development of optical analog computing, nanophotonic processing14,15, and the field of cryptographically secured wireless communication16. As such, there is strong and growing interest in providing optical logic gates with complete logic functionality in photonic systems with compact dimensions.
Previous methodologies towards optical logic gates relied mainly on constructive/destructive interference effects, including linear8–11 and nonlinear interference12,13, between the input light signals. We note that the reported works depend heavily on the precise control of the basic properties of two input light signals, the control light and/or the pump light, including their phase difference, polarization, and intensity7 (Supplementary Note 6); if the two nanowires are close to each other, such as for the plasmonic logic gate, there is also a stringent requirement on the size of the input light beams to avoid a potential false input. As a result, more precise control of the input light realizes constructive or destructive interference more completely and leads to a larger intensity contrast ratio between the two output optical logic states “1” and “0”, which is a key figure of merit for the performance of an optical logic gate.
The heavy reliance on the precise control of input light has two unfavourable influences on the design of compact optical logic gates. First, their miniaturization becomes difficult if the additional bulky apparatus needed to achieve these controls is taken into consideration. Second, owing to the difficulty and complexity of achieving ideal control of the input light, their performance may suffer from an inherent instability, and the intensity contrast ratio between the two output logic states may become quite low in practical scenarios10. For miniaturized optical logic gates, it is thus highly desirable to get rid of these critical requirements on the input light. Such a goal remains a long-sought open challenge, owing to its importance for the development of novel architectures for all-optical devices and systems.
To this end, here we introduce a simple yet universal design strategy, namely, a diffractive neural network17, to realize all seven basic optical logic operations in a compact system, simply using plane waves as the input signal. The diffractive neural network is implemented by a compound Huygens’ metasurface18, and it can partially mimic the functionality of an artificial neural network. After training, the compound metasurface can directionally scatter or focus the input encoded light into one of the two designated small areas/points, one of which represents logic state ‘1’ and the other stands for ‘0’. As a conceptual demonstration, three basic logic gates, i.e., NOT, OR, and AND, are experimentally verified using a two-layer high-efficiency dielectric metasurface at microwave frequency. Our design strategy features two distinct advantages. First, the realization of optical logic operations here gets rid of the complicated and necessarily precise control of the features of input light; such a scheme is thus totally different from previous works. Moreover, the design of the input layer is very general and powerful, and it can be flexibly modified into other user-favoured and programmable forms. Second, the proposed strategy can enable complete logic functionalities in a single optical network if the transmittance state of the input layer is dynamically tuneable, e.g., electrically tuneable if the optical mask is constructed by a spatial light modulator. Therefore, the revealed universal design strategy has the potential to facilitate a single miniaturized programmable photonic processor for arbitrary logic operations.
Results
Design principle and underlying physics of the optical logic operation
We start with the design principle of the optical logic operation. For binary optical logic operation, the output has only two cases, ‘1’ or ‘0’, which is very similar to a classification/decision-making task from the perspective of machine learning19 and can be readily tackled by an artificial neural network; Supplementary Note 1 verifies the theoretical feasibility. Analogous to an artificial neural network (Fig. 1a), in the optical regime, a diffractive neural network (composed of one input layer, at least one hidden layer and one output layer) has been found to allow powerful wavefront manipulation and communicate information among layers at the speed of light. As delineated in Fig. 1b, the input layer is a common optical mask and is patterned to form multiple regions. Without loss of generality, each region in the optical mask is set to have two different states for the transmittance of light, and its high (low) transmittance state indicates that it is (is not) selected for optical computing. Then, it is possible and convenient to directly define all seven basic optical logic operators and the input logic states in the optical mask, simply by assigning each of them to a specific region. The hidden layers are designed to decode the encoded input light and image the calculated result at the output layer.
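The input-layer encoding described above can be sketched in a few lines. This is only an illustration under stated assumptions: the region names, the choice of one region per operator and two regions per operand (one for each logic value), and the function `encode` are hypothetical and are not taken from Fig. 1b; only the idea of switching regions between a high and a low transmittance state comes from the text.

```python
# Hypothetical region layout (an assumption for illustration, not the layout of Fig. 1b):
# one region per logic operator and, per operand, one region for each logic value.
OPERATORS = ("NOT", "OR", "AND")
INPUT_REGIONS = ("A=0", "A=1", "B=0", "B=1")
REGIONS = OPERATORS + INPUT_REGIONS


def encode(operator, a, b=None):
    """Return {region: transmittance}: 1.0 for selected regions, 0.0 for all others."""
    if operator not in OPERATORS:
        raise ValueError("unknown operator: %r" % (operator,))
    if a not in (0, 1) or (b is not None and b not in (0, 1)):
        raise ValueError("logic inputs must be 0 or 1")
    selected = {operator, "A=%d" % a}
    if operator != "NOT":
        if b is None:
            raise ValueError("%s needs two inputs" % operator)
        selected.add("B=%d" % b)
    # High transmittance (1.0) marks a region as selected for the computation.
    return {region: (1.0 if region in selected else 0.0) for region in REGIONS}


mask = encode("OR", 1, 0)
open_regions = [name for name, t in mask.items() if t == 1.0]
print(open_regions)
```

With a dynamically tuneable mask, changing the logic operation amounts to calling `encode` with a different operator while the hidden layers stay fixed.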
We then progress to the introduction of the underlying physics of the design of hidden layers. We use a metasurface made up of a dense array of subwavelength meta-atoms to construct each hidden layer. Each meta-atom behaves like an independent neuron in the neural network and interconnects to other meta-atoms of the following layers through the diffraction of light. Based on Rayleigh–Sommerfeld diffraction20, the meta-atom/neuron in the lth hidden layer, e.g., located at $\mathbf{r}_i^l = (x_i^l, y_i^l, z_i^l)$, serves as a secondary source. The Huygens wavelet of such a source arises as a z-derivative of the spherical wave (Fig. 1b) and can be described by $a_i^l\, w_i^l(\mathbf{r})$, where

$$w_i^l(\mathbf{r}) = \frac{1}{2\pi}\,\frac{z - z_i^l}{R}\left(\frac{1}{R} - jk\right)\frac{e^{jkR}}{R} \qquad (1)$$

In Eq. (1), $R = |\mathbf{r} - \mathbf{r}_i^l|$, and k is the wavevector of light in free space. The complex-valued factor $a_i^l$ is determined by the product of the input wave to the neuron $E_i^l$ and its transmission coefficient $t_i^l$, i.e., $a_i^l = t_i^l E_i^l$. As such, the total propagation field is the summation of the field excited by all neurons in the lth layer, and it can be expressed as

$$E^{l+1}(\mathbf{r}) = \sum_i t_i^l\, E_i^l\, w_i^l(\mathbf{r}) \qquad (2)$$

For the first hidden layer with l = 1, $E_i^1$ is the transmitted light spatially encoded by the input layer.
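As a rough numerical illustration of this forward model, the sketch below evaluates the wavelet as the z-derivative of the spherical wave exp(jkR)/R, normalised by −1/(2π), and sums the contributions of all neurons in one layer, each weighted by its input field and transmission coefficient. The function names, the five-neuron row, and the target points are assumptions chosen for illustration; the wavelength and the 0.57λ0 pitch are borrowed from the experimental design reported later in the paper.

```python
import cmath
import math


def wavelet(src, dst, k):
    """Huygens wavelet radiated from `src`, evaluated at `dst` (both (x, y, z) in metres)."""
    dx, dy, dz = (d - s for s, d in zip(src, dst))
    R = math.sqrt(dx * dx + dy * dy + dz * dz)
    # -(1/2pi) * d/dz [exp(jkR)/R] = (1/2pi) * (dz/R) * (1/R - jk) * exp(jkR)/R
    return dz / (2 * math.pi * R) * (1 / R - 1j * k) * cmath.exp(1j * k * R) / R


def propagate(sources, inputs, transmissions, targets, k):
    """Field at each target: sum over neurons of transmission * input field * wavelet."""
    fields = []
    for dst in targets:
        total = 0j
        for src, e_in, t in zip(sources, inputs, transmissions):
            total += t * e_in * wavelet(src, dst, k)
        fields.append(total)
    return fields


lam0 = 17.6e-3                      # free-space wavelength (m)
k = 2 * math.pi / lam0              # free-space wavenumber (rad/m)
pitch = 0.57 * lam0                 # meta-atom width used as the neuron pitch

# Illustrative layer: a row of five neurons, unit plane-wave input, zero phase delay.
sources = [(i * pitch, 0.0, 0.0) for i in range(-2, 3)]
inputs = [1.0] * len(sources)
transmissions = [cmath.exp(1j * 0.0)] * len(sources)   # phase-only coefficients

# Two observation points placed symmetrically about the axis, 0.3 m downstream.
targets = [(-0.05, 0.0, 0.3), (0.05, 0.0, 0.3)]
fields = propagate(sources, inputs, transmissions, targets, k)
intensities = [abs(e) ** 2 for e in fields]
print(intensities)
```

Cascading layers amounts to feeding the fields computed at the positions of the next layer back in as `inputs`. Training then adjusts the phases stored in `transmissions`.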
Following the forward propagation model in Eq. (2), the encoded input light can be directed into any desired location at the output layer via all learnable parameters, i.e., the transmission coefficients $t_i^l$ of the meta-atoms. As shown in Fig. 1b, we designate two small regions with a radius of less than half a wavelength. If most of the field intensity is focused in the left (right) region, the computing result is “1” (“0”). Note that this judgement criterion remains valid and consistent for all logic operations being considered, distinct from the case in refs. 11,16. Before implementing the diffractive neural network, the transmission coefficients at each hidden layer should be adequately trained via an error back-propagation algorithm. In doing so, we define a mean-square-error loss function $\mathcal{L} = \frac{1}{K}\sum_{m=1}^{K}(s_m - g_m)^2$ to evaluate the difference between the output intensity $s_m$ and the ground truth target $g_m$, where K is the number of the measurement points. The gradient of the loss function with respect to all the trainable network variables is backpropagated to iteratively update the network during each cycle of the training phase until the network converges; see Supplementary Note 2 and “Methods” section for details. Note that, in our case, we do not split the input data into training, validation and test sets as done in the traditional manner, since our goal is to achieve zero-error classifications for all cases.
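The two ingredients of this paragraph, the mean-square-error loss over K measurement points and the left/right readout rule, can be written down as a minimal sketch. The function names and the sample numbers are hypothetical. The gradient-based update itself is not reproduced here, since it depends on the training details given in Supplementary Note 2.

```python
def mse_loss(output_intensity, target_intensity):
    """Mean square error between measured and target intensities at K points."""
    if len(output_intensity) != len(target_intensity):
        raise ValueError("output and target must have the same number of points")
    K = len(output_intensity)
    return sum((s - g) ** 2 for s, g in zip(output_intensity, target_intensity)) / K


def read_logic_state(intensity_left, intensity_right):
    """Readout rule from the text: more intensity in the left region means '1', else '0'."""
    return 1 if intensity_left > intensity_right else 0


# Hypothetical example with K = 2 points, one per designated region (left, right).
target = [1.0, 0.0]          # ground truth for a computation whose result is '1'
output = [0.9, 0.1]          # illustrative network output, not measured data
print(mse_loss(output, target), read_logic_state(*output))
```

Because the same readout rule is used for every operation, a single loss of this form can be accumulated over all operator and input combinations during training.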
Experimental demonstration of three basic logic operations, NOT, OR, and AND
As a conceptual demonstration, we first numerically realize three basic logic operations (Fig. 2), i.e., NOT, OR, and AND, at the designed frequency f0, since the combination of them can realize any other logic operation9. Our proposed design strategy for optical logic operations is, in principle, applicable for arbitrary frequencies. To facilitate the following experimental verification, f0 = 17 GHz (wavelength λ0 = 17.6 mm) is chosen here. Figure 2a shows the pattern of the input layer. For simplicity, the high (low) transmittance state for each region is assumed to have a transmittance of 100% (0%).
The hidden layers are composed of a cascaded two-layer transmission metasurface21,22 with an axial distance of 17λ0 (one of the tuneable parameters in the training process of the diffractive neural network). Each metasurface consists of 30 × 42 meta-atoms (inset in Fig. 2b), where each meta-atom has a square cross section with a width of 0.57λ0. Here, we adopt a facile yet viable high-efficiency dielectric metasurface by taking advantage of its unique properties such as high transmittance and polarization insensitivity. The local transmission response of the designed meta-atoms is shown in Fig. 2b, where the constituent F4B dielectric has a relative permittivity of 3.5 + 0.003i and is fabricated by mechanical processing with an error <0.05 mm. The transmission phase ϕ varies smoothly over the height h of the meta-atom. Approximately, we have $\phi \approx k\,\Delta n\,h$, where Δn is the refractive index difference between free space and the chosen dielectric. In contrast, the magnitude of the transmission coefficients is almost uniform and close to unity. Thus, only the phase of the diffractive modulation layers needs to be trained. The training details are left to Supplementary Note 2. Figures 2c–l depict the numerical field intensity after training. As expected, most of the fields are correctly focused into one of the two small designated regions.
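The quoted dimensions can be cross-checked with a short calculation. The sketch below assumes a simple path-length model for the phase, ϕ ≈ kΔn·h with k = 2π/λ0, and takes the refractive index as the square root of the real part of the permittivity (loss and interface reflections are ignored). These are simplifying assumptions for illustration; the actual meta-atom response is the one plotted in Fig. 2b.

```python
import math

C = 299_792_458.0                # speed of light in vacuum (m/s)
f0 = 17e9                        # design frequency (Hz)
lam0 = C / f0                    # free-space wavelength (m)

width = 0.57 * lam0              # meta-atom width (m)
aperture = (30 * width, 42 * width)   # footprint of one 30 x 42 metasurface (m)
spacing = 17 * lam0              # axial distance between the two layers (m)

# Assumption: lossless dielectric, index from the real part of the permittivity.
n_dielectric = math.sqrt(3.5)
delta_n = n_dielectric - 1.0     # index difference to free space


def transmission_phase(height_m):
    """Path-length estimate of the transmission phase (rad) for a given height."""
    return 2 * math.pi * delta_n * height_m / lam0


h_2pi = lam0 / delta_n           # height giving a full 2*pi phase range (m)

print("wavelength (mm):", round(lam0 * 1e3, 2))
print("meta-atom width (mm):", round(width * 1e3, 2))
print("aperture (mm):", tuple(round(a * 1e3, 1) for a in aperture))
print("height for 2*pi phase (mm):", round(h_2pi * 1e3, 2))
```

Under these assumptions a height range of roughly one wavelength divided by Δn is enough to cover all phase values, which is why a phase-only parameterisation of each neuron is natural here.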
Figure 3 shows the microwave experimental demonstration of the theoretical proposal in Fig. 2. The experimental setup is depicted in Fig. 3a and described in “Methods” section. A horn antenna excites transverse electric (TE or s-polarized) waves with the electric field along the x-axis, and it is placed far from the input layer (~45λ0), so that the incident light signal can be reasonably treated as plane waves23 (see Fig. S5). The transmitted fields at the output layer, including their relative phase and amplitude, are measured by an E-field probe (a small monopole antenna24). For example, the inset at the output layer in Fig. 3a shows the measured 2D field intensity for the optical logic operation of “1+0”. Moreover, the experimental performance of all optical logic operations is shown in Fig. 3b. As expected, all the peaks of field intensity appear within one of the two designated regions, consistent with Fig. 2c–l. Quantitatively, the contrast ratios between the measured intensities of the two designated regions are all larger than 9.6 dB. The weak fields outside the two designated regions might be caused by the impedance mismatch at the air–dielectric interfaces, and this mismatch can be further reduced by introducing periodic antireflection structures25.
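To put the 9.6 dB figure in linear terms, the following sketch converts between an intensity ratio and decibels. It assumes the contrast ratio is defined on intensity, i.e., as 10·log10 of the ratio between the two designated regions; the text does not spell out the definition, and the function names are hypothetical.

```python
import math


def contrast_ratio_db(intensity_high, intensity_low):
    """Contrast ratio in dB between the brighter and the dimmer designated region."""
    return 10.0 * math.log10(intensity_high / intensity_low)


def db_to_linear(ratio_db):
    """Inverse conversion: intensity ratio corresponding to a value in dB."""
    return 10.0 ** (ratio_db / 10.0)


# Under the stated assumption, 9.6 dB corresponds to roughly a ninefold intensity ratio.
print(round(db_to_linear(9.6), 2))
```

In other words, the region encoding the correct logic state receives about nine times more intensity than the other region in the worst measured case.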
Discussion
Direct realization of all seven optical logic gates and cascaded optical logic gates
We emphasize that the proposed design strategy can, in principle, directly construct any type (basic and compound) of optical logic operation, such as all seven basic logic operations as shown in Figs. 4 and S6. This can be done by extending the encoding manner at the input layer and developing a more sophisticated neural network configuration. For more complete functionalities, we can cascade multiple logic gates. As shown in Fig. S7, the output waves from one logic gate couple into the waveguides and then are guided to the input layer of another logic gate as the inputs26; see the details in Supplementary Note 5.
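The statement that NOT, OR, and AND suffice to build the remaining operations can be checked at the Boolean level. The sketch below is pure truth-table bookkeeping, not a model of the optical network, and it assumes that the seven basic operations are the conventional set NOT, AND, OR, NAND, NOR, XOR, and XNOR, which the text does not list explicitly.

```python
from itertools import product


# The three experimentally demonstrated primitives.
def NOT(a):
    return 1 - a


def OR(a, b):
    return a | b


def AND(a, b):
    return a & b


# The remaining gates, composed only from the three primitives above.
def NAND(a, b):
    return NOT(AND(a, b))


def NOR(a, b):
    return NOT(OR(a, b))


def XOR(a, b):
    return AND(OR(a, b), NAND(a, b))


def XNOR(a, b):
    return NOT(XOR(a, b))


for gate in (AND, OR, NAND, NOR, XOR, XNOR):
    table = {(a, b): gate(a, b) for a, b in product((0, 1), repeat=2)}
    print(gate.__name__, table)
```

In a cascaded optical implementation each nested call would correspond to one gate whose output feeds the input layer of the next, whereas the direct approach described above trains a single network to produce each truth table in one pass.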
Optical logic gates at higher frequencies
Although our experimental design in Fig. 3 only works at microwave frequencies, our theoretical design strategy in Fig. 1 should in principle be applicable to various frequency regimes, including terahertz and optical frequencies. The reason is that the main underlying mechanism in this work follows the universal law of diffraction, which is scalable according to Maxwell's equations. For our proposed idea to work at higher frequencies, at least four key ingredients must be scaled down accordingly, namely, the metasurfaces, the input light encoder (or the spatial light modulator), the light source, and the detector. These ingredients are accessible to experimental investigations with current technology17,25,27.
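The scaling argument can be made concrete by expressing the geometry in units of the wavelength and re-evaluating it at another frequency. The sketch below reuses the 0.57λ0 meta-atom width and the 17λ0 layer spacing of the microwave design. It assumes that the same wavelength-normalised geometry is kept and that material dispersion is ignored, which is a strong simplification at terahertz and optical frequencies. The function name and the dictionary keys are hypothetical.

```python
C = 299_792_458.0   # speed of light in vacuum (m/s)


def scale_design(frequency_hz, width_in_wavelengths=0.57, spacing_in_wavelengths=17.0):
    """Physical dimensions of the wavelength-normalised design at a given frequency."""
    wavelength = C / frequency_hz
    return {
        "wavelength_m": wavelength,
        "meta_atom_width_m": width_in_wavelengths * wavelength,
        "layer_spacing_m": spacing_in_wavelengths * wavelength,
    }


microwave = scale_design(17e9)    # the experimental design frequency
terahertz = scale_design(1e12)    # an illustrative target frequency

for name, design in (("17 GHz", microwave), ("1 THz", terahertz)):
    print(name, {key: "%.3e" % value for key, value in design.items()})
```

All lengths shrink by the frequency ratio, so a meta-atom of about 10 mm at 17 GHz becomes a feature of about 0.17 mm at 1 THz under these assumptions.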
Comparison with traditional related designs
Our design principles of a multi-functional optical logic gate and its switching behaviour are both different from those of the traditional related design; see Supplementary Note 6. First, the traditional multi-functional optical logic gate essentially relies on several single-functional logic gates, which are independent of each other and stacked for multi-functional capability. In contrast, our design relies on just one integrated multi-functional optical logic gate. Second, traditional switches generally need to precisely control the input light, or involve the nonlinearity and refractive indices of materials. These stringent controls unfavourably incur a high complexity, high cost, large volume, and even inherent instability of the system. In contrast, our switch gets rid of these stringent requirements, and it just allows or prevents light passing through the corresponding regions/channels. This simplified switch in our design makes a step towards a future miniaturized multi-functional optical logic gate.
Other platforms to facilitate optical logic gates
Apart from the multi-layer metasurfaces, there are also other platforms to facilitate optical logic gates, for example, metamaterials/nanophotonics, which can offer ultra-high computing density in a compact and layer-free fashion26. By suitably engineering the spatial inhomogeneity of such a structure, we can obtain an optical neural network on the chip scale, and some optical computing tasks such as image recognition and wavelength demultiplexing have already been demonstrated28. In Fig. S9, we design a compact integrated-nanophotonic optical XOR logic gate as an example using topology optimization and finite-difference time-domain (FDTD) simulation29,30.
To sum up, we have demonstrated a general framework for all optical logic operations using a diffractive neural network implemented by a compound Huygens’ metasurface, making a step towards multi-functional optical logic gates and high computing density. In a proof-of-concept microwave experiment, we successfully realize three basic logic operations, i.e., NOT, OR, and AND, on a two-layer dielectric metasurface. Implementing our proposed architecture with metamaterials/nanophotonics may lead to chip-scale, ultrafast computing elements and promise the option of all-optical or hybrid optical–electronic technology. Looking forward, our proposed approach will also lead to a broad scope of applications, for example, real-time object recognition in surveillance systems and intelligent wave shaping inside biological tissues in microscope imaging31.
Materials and methods
Training of the diffractive neural network
The diffractive neural network is trained using Python version 3.5.0 and the TensorFlow framework version 1.10.0 (Google Inc.) on a server (GeForce 10 GTX TITAN X GPU and Intel(R) Xeon(R) CPU X5570 @2.93 GHz with 48 GB RAM, running a Linux operating system). The network takes dozens of minutes to converge. Note that our process does not involve a nonlinear activation function. We leave that to future work, in which its absence could be compensated for experimentally by a nonlinear optical medium, such as a photorefractive crystal or a magneto-optical trap.
Experiment setup
A near-field 3D scanning system was used for measurements. A horn antenna centred on the two-layer metasurface was used as the excitation source. A small monopole probe oriented perpendicular to the ground was used to scan the relative amplitude and phase (S21) of the electric field Ex. In the measurement, the source and probe were connected to port 1 and port 2 of a vector network analyser, respectively, and the parameter S21 was recorded. The scan resolution in the xy plane was 2 mm × 2 mm.
Acknowledgements
The work at Zhejiang University was sponsored by the National Natural Science Foundation of China (NNSFC) under Grants Nos. 61625502, 11961141010, and 61975176, the Top-Notch Young Talents Programme of China, the Fundamental Research Funds for the Central Universities, Nanyang Technological University for NAP Start-Up Grant, and the Singapore Ministry of Education (Grant Nos. MOE2018-T2-1-022 (S), MOE2016-T3-1-006 and Tier 1 RG174/16 (S)). C.Q. was supported by the Chinese Scholarship Council (CSC No. 201906320294) and Zhejiang University Academic Award for Outstanding Doctoral Candidates.
Author contributions
C.Q. conceived the idea and conducted the numerical simulation and experiment; Y.S. helped prepare the experimental samples. C.Q. and X.L. interpreted detailed results and contributed extensively to the writing of the manuscript. X.L., B.Z. and H.C. supervised the project. All members contributed to the discussion and analysis of the results.
Conflict of interest
The authors declare that they have no conflict of interest.
Contributor Information
Xiao Lin, Email: xiaolinbnwj@ntu.edu.sg.
Baile Zhang, Email: blzhang@ntu.edu.sg.
Hongsheng Chen, Email: hansomchen@zju.edu.cn.
Supplementary information
Supplementary information is available for this paper at 10.1038/s41377-020-0303-2.
References
- 1. Caulfield HJ, Dolev S. Why future supercomputing requires optics. Nat. Photonics. 2010;4:261–263. doi: 10.1038/nphoton.2010.94.
- 2. Silva A, et al. Performing mathematical operations with metamaterials. Science. 2014;343:160–163. doi: 10.1126/science.1242818.
- 3. Zhu TF, et al. Plasmonic computing of spatial differentiation. Nat. Commun. 2017;8:15391. doi: 10.1038/ncomms15391.
- 4. Guo C, et al. Photonic crystal slab Laplace operator for image differentiation. Optica. 2018;5:251–256. doi: 10.1364/OPTICA.5.000251.
- 5. Graves A, et al. Hybrid computing using a neural network with dynamic external memory. Nature. 2016;538:471–476. doi: 10.1038/nature20101.
- 6. Lane ND, et al. Squeezing deep learning into mobile and embedded devices. IEEE Pervasive Comput. 2017;16:82–88. doi: 10.1109/MPRV.2017.2940968.
- 7. Miller DAB. Are optical transistors the logical next step? Nat. Photonics. 2010;4:3–5. doi: 10.1038/nphoton.2009.240.
- 8. Wei H, et al. Quantum dot-based local field imaging reveals plasmon-based interferometric logic in silver nanowire networks. Nano Lett. 2011;11:471–475. doi: 10.1021/nl103228b.
- 9. Wei H, et al. Cascaded logic gates in nanophotonic plasmon networks. Nat. Commun. 2011;2:387. doi: 10.1038/ncomms1388.
- 10. Fu YL, et al. All-optical logic gates based on nanoscale plasmonic slot waveguides. Nano Lett. 2012;12:5784–5790. doi: 10.1021/nl303095s.
- 11. Sang YG, et al. Broadband multifunctional plasmonic logic gates. Adv. Opt. Mater. 2018;6:1701368. doi: 10.1002/adom.201701368.
- 12. Xu QF, Lipson M. All-optical logic based on silicon micro-ring resonators. Opt. Express. 2007;15:924–929. doi: 10.1364/OE.15.000924.
- 13. McCutcheon MW, et al. All-optical conditional logic with a nonlinear photonic crystal nanocavity. Appl. Phys. Lett. 2009;95:221102. doi: 10.1063/1.3265736.
- 14. Lee SW, et al. A fast and low-power microelectromechanical system-based non-volatile memory device. Nat. Commun. 2011;2:220. doi: 10.1038/ncomms1227.
- 15. Driscoll T, et al. Memory metamaterials. Science. 2009;325:1518–1521. doi: 10.1126/science.1176580.
- 16. Manjappa M, et al. Reconfigurable MEMS Fano metasurfaces with multiple-input–output states for logic operations at terahertz frequencies. Nat. Commun. 2018;9:4056. doi: 10.1038/s41467-018-06360-5.
- 17. Lin X, et al. All-optical machine learning using diffractive deep neural networks. Science. 2018;361:1004–1008. doi: 10.1126/science.aat8084.
- 18. Raeker BO, Grbic A. Compound metaoptics for amplitude and phase control of wave fronts. Phys. Rev. Lett. 2019;122:113901. doi: 10.1103/PhysRevLett.122.113901.
- 19. Esteva A, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017;542:115–118. doi: 10.1038/nature21056.
- 20. Goodman, J. W. Introduction to Fourier Optics 3rd edn (Roberts and Company, Greenwood Village, 2005).
- 21. Cai T, et al. High-performance bifunctional metasurfaces in transmission and reflection geometries. Adv. Opt. Mater. 2017;5:1600506. doi: 10.1002/adom.201600506.
- 22. Wu LW, et al. High-transmission ultrathin Huygens’ metasurface with 360° phase control by using double-layer transmitarray elements. Phys. Rev. Appl. 2019;12:024012. doi: 10.1103/PhysRevApplied.12.024012.
- 23. Qian C, et al. Experimental observation of superscattering. Phys. Rev. Lett. 2019;122:063901. doi: 10.1103/PhysRevLett.122.063901.
- 24. Ye DX, et al. Observation of reflectionless absorption due to spatial Kramers–Kronig profile. Nat. Commun. 2017;8:51. doi: 10.1038/s41467-017-00123-4.
- 25. Yi H, et al. 3-D printed millimeter-wave and terahertz lenses with fixed and frequency scanned beam. IEEE Trans. Antennas Propag. 2016;64:442–449. doi: 10.1109/TAP.2015.2505703.
- 26. Estakhri NM, Edwards B, Engheta N. Inverse-designed metastructures that solve equations. Science. 2019;363:1333–1338. doi: 10.1126/science.aaw2498.
- 27. Qian, C. et al. Deep-learning-enabled self-adaptive microwave cloak without human intervention. Nat. Photonics https://www.nature.com/articles/s41566-020-0604-2 (2020).
- 28. Molesky S, et al. Inverse design in nanophotonics. Nat. Photonics. 2018;12:659–670. doi: 10.1038/s41566-018-0246-9.
- 29. Qian C, et al. Transient response of a signal through a dispersive invisibility cloak. Opt. Lett. 2016;41:4911–4914. doi: 10.1364/OL.41.004911.
- 30. Qian C, et al. Observing the transient buildup of a superscatterer in the time domain. Opt. Express. 2017;25:4967–4974. doi: 10.1364/OE.25.004967.
- 31. Jang M, et al. Wavefront shaping with disorder-engineered metasurfaces. Nat. Photonics. 2018;12:84–90. doi: 10.1038/s41566-017-0078-z.