Abstract
Programmable photonic integrated circuits represent an emerging technology that amalgamates photonics and electronics, paving the way for light-based information processing at high speeds and low power consumption. Programmable photonics provides a flexible platform that can be reconfigured to perform multiple tasks, thereby holding great promise for revolutionizing future optical networks and quantum computing systems. Over the past decade, there has been constant progress in developing several different architectures for realizing programmable photonic circuits that allow for realizing arbitrary discrete unitary operations with light. Here, we systematically investigate a general family of photonic circuits for realizing arbitrary unitaries based on a simple architecture that interlaces a fixed intervening layer with programmable phase shifter layers. We introduce a criterion for the intervening operator that guarantees the universality of this architecture for representing arbitrary unitary operators with phase layers. We explore this criterion for different photonic components, including photonic waveguide lattices and meshes of directional couplers, which allows the identification of several families of photonic components that can serve as the intervening layers in the interlacing architecture. Our findings pave the way for efficiently designing and realizing novel families of programmable photonic integrated circuits for multipurpose analog information processing.
Keywords: On-chip photonic unit, Unitary programmable unit, Random matrices, Waveguide arrays, Coupled mode theory, Interlaced architectures
Subject terms: Integrated optics, Optoelectronic devices and components
Levering the unique properties of light to perform computations in novel ways is a subject with a long history1,2. Although an all-optical processor for universal computing seems to be a far reach goal, photonics can provide exciting opportunities for unconventional computing building on analog logic and with novel information processing configurations. What makes photonics an intriguing option for unconventional computing are exotic potentials such as the intrinsically high speed and low energy consumption, the capabilities for massive parallelization, and long-range interactions. Nevertheless, a significant challenge in optical computing lies in the absence of appropriate computing paradigms, methods, and algorithms that harness the unique capabilities of this technology to develop efficient and application-specific photonic processors. In particular, matrix-by-vector multiplication is one of the most basic mathematical operations that lies at the core of various tasks ranging from optical convolution schemes3,4 and matrix eigenvalue solver5 to novel optical memristors6,7 and optical artificial neural networks8,9. In the past decade, with rapid technological progress, there has been a resurrection in efforts devoted to developing such programmable photonic integrated circuit that performs matrix-vector multiplication10–12. The utility of such a device as an energy-efficient photonic accelerator in conjunction with electronic processors appears to be a distant possibility, considering the inherent difficulties associated with scaling and precision. However, there is no doubt that an on-chip programmable photonic matrix-vector multiplier can create exciting opportunities in classical and quantum computing through various applications that range from quantum information and quantum transport simulations13–15, to optical signal processing16, neuromorphic computing17, and optical neural networks3,18,19, as well as putting forward a platform for rapid prototyping of linear multiport photonic devices20,21.
Indeed, the optical realization of arbitrary unitary operations has been known since the seminal paper of Reck et al.22, which originally concerned free space optics but successfully translated to photonic integrated circuits by Miller23–25. This architecture builds on breaking down unitary matrices of any order into lower-dimension unitary matrices, which ensures the existence of an optical realization through two fundamental building blocks that are beam splitters (couplers) and phase shifters. Despite its generality, this method uses a pyramid-shaped array of Mach-Zehnder interferometers (MZI), which is impractical for larger implementations because the number of beam splitters grows quadratically with the number of ports. In turn, Clements et al.26 introduced an alternative and symmetric rectangular-shaped array, resulting in a device with half the total optical depth and, consequently, more loss-tolerant. Such a rectangular array has been proved robust enough to create photonic realizations of Haar-random matrices27. Further unitary realizations related to other mesh geometries have been explored in28,29, as well as topological photonic lattices with hexagonal-shaped arrays of MZI30,31. In turn, free-space propagation setups have been devised based on plane-light conversion32,33 and using diffractive surface layers34.
While the latter devices originally consisted of bulky optical components, the principle has recently been applied to on-chip structures as well. Recent studies have explored the use of particular transfer matrices (henceforth called F) alternating with phase mask layers to obtain an arbitrary unitary transformation10,12,35–37. Pastor et al.12 considers wave propagation in multimode slab waveguides to implement a Discrete Fourier transform (DFT) as their transformation F. They showed that an arbitrary transformation could be performed when phase layers and 6N DFT elements, where N is the number of ports. Tanomura et al.35 interleave the phase masks with multimode interference couplers connected with single-mode waveguides and use simulated annealing optimization to argue for well-approximated conversions when . Fully functional unitary four-, eight-, ten-, and twelve-port devices have been proposed and manufactured38–41. Moreover, an alternative device using polarization and multiple wavelength degrees of freedom has been considered in42. Markowitz and Miri37 have explored similar structures and have found rigorous numerical evidence that interleaved phase arrays and discrete fractional Fourier transform (DFrFT) are indeed universal, whereas the use of Haar-random unitary matrices has been proved to lead to the desired universality10. A further waveguide array with varying propagation constants with step-like profiles has been reported43. The interlacing architecture appears to exhibit interesting auto-calibrating properties, which makes it resilient to fabrication errors44. Furthermore, we recently have shown that the intervening structure can go beyond implementing unitaries to directly implementing arbitrary non-unitary operations when the diagonal matrices are relaxed to leave the unitary circle in the complex domain45. In this sense, by utilizing both amplitude and phase modulations, one can realize a fully programmable device for arbitrary matrix operations45. This shows an important generalization of the previous results that show by itself an advantage of the interlacing architecture over the mesh geometries.
This manuscript discusses a broad class of universal on-chip photonic architectures based on a layered configuration of phase mask layers as programmable units interlaced with a passive random matrix F. It is shown that the proposed interlaced architecture is far more flexible by showing that broad families of matrices F can serve as the fixed intervening operator. The phases are steered to reconstruct a unitary target matrix, provided that F has well-posed properties. Numerical evidence based on rigorous optimization algorithms reveals that universality is reached for dense matrices F, while a phase transition in the accuracy of reconstructed targets occurs at , with M the total number of phase mask layers. Tests using the discrete Fourier transform (DFT) and discrete fractional Fourier transform (DFrFT) confirm the latter claim, and Haar-random matrices also show outstanding convergence. To generalize the domain of the valid intervening operators, a density criterion is derived so that matrices F can be classified according to their elements to ensure universality. To demonstrate this result, photonic lattices with uniform, nonuniform, and disordered coupling coefficients are considered as photonic realizations for the matrix F, while using the proposed density criterion, it is shown that universality is reached for specific length intervals. Furthermore, we explore waveguide coupler meshes as an alternative intervening unit F and determine the minimum number of coupler layers required to guarantee the universality of the interlacing architecture.
Results
Architecture and mathematical foundation
Let us consider an arbitrary unitary matrix , with U(N) the group of unitary matrices. Our goal is to implement a proper factorization of in terms of another unitary matrix F to be defined and a set composed of phase matrices , with a diagonal matrix and , where and . The factorization proposed here is such that it intercalates F with a phase matrix through the relation
1 |
which is resourceful as it allows for optical implementations. The phase matrices can be implemented through layers of phase shifters (active optical elements), and the matrix F (passive optical element) has to be selected so that arbitrary unitary target matrices can be reconstructed with minimal error by adequately tuning the phase shifters . If the latter is achieved, it is said that the universality property has been met. An arbitrary matrix requires real parameters to be fully defined. Therefore, while performing the factorization, it is vital to consider at least the same number of parameters. For the device proposed in (1), we have MN free parameters in total and expect that in order to achieve the desired universality. Although there are some cases where has a particular symmetry that reduces the number of parameters, we aim for the general case. The proposed architecture (1) and its optical implementation are sketched in Fig. 1. On the one hand, the mathematical structure of the passive unitary F (see top panels of Fig. 1) can be that of the discrete Fourier transform (DFT), discrete fractional Fourier transform37, or simply a Haar-random matrix. On the other hand, the photonic implementation of F can be performed through different optical realizations involving a unitary wave evolution (see bottom panels of Fig. 1), such as waveguide arrays45,46, meshes of directional coupler26,28,29, and multimode interference (MMI)12,47. Particularly, MMI couplers have been shown to be suitable to represent the DFT matrix12,48, whereas waveguide arrays lead to simple representations of the DFrFT46.
Here, we focus on architectures based on the first two solutions, the numerical analysis and universality of which are discussed below. Layered architectures akin to Eq. (1) have been numerically validated in previous works using different optical arrays and MZI meshes, where different optimization algorithms such as gradient-descent, stochastic gradient descent, simulated annealing, and basin-hopping have been implemented. See for instance10,40,43,49. In order to demonstrate the universality of this device, we optimize the NM phases for a variety of randomly chosen target matrices generated in accordance with the Haar measure50. The objective function to be minimized, also called error norm, is defined by
2 |
where stands for the Frobenius definition of the norm, is the target matrix being tested, and is the reconstructed matrix using the factorization (1). The Levenberg-Marquardt (LM) algorithm51,52 is used to find the minimum of this function. For a given target , the phases are randomly initialized between 0 and . The optimization was performed in MATLAB. In Fig. 2a, the norm for 100 target unitary matrices is shown for various cases, fixing the default tolerance values to . A phase transition occurs between the and layers, which is unsurprising given the system becomes over-determined by N parameters. These jumps are larger than reported by Tang et al.53 due to their usage of a probabilistic algorithm (Simulated Annealing) rather than a gradient-based one such as LMA. A downside of gradient-based approaches is they may require many runs with different starting conditions. We can decrease the overhead by using a stopping criteria for the norm along with a maximum iteration for each run of LMA. Using a maximum iteration of 50, we find that we rarely need more than 100 runs in the case , to achieve norms less than (Fig. 2b). For systems with a lower number of ports N, the distribution skews towards lower values. To more confidently label choices of F which are not Haar-random generated as “bad” mixing layers, we set the maximum number of runs somewhat higher to 250 or 500 as found appropriate.
The density criterion and the Goldilocks principle
Our preliminary numerical results suggest that Haar random matrices F also possess the required properties to render a universal architecture from (1). Random matrices drawn from the Haar measure are typically dense matrices whose sparsity is low, suggesting that density might be a criterion to classify F matrices as good candidates. To test this idea, it is thus desirable to find a proper measure to quantify the density for unitary matrices. Indeed, random matrix theory establishes robust criteria for studying random complex-valued matrices at the limit of large size N based on the singular value decomposition (SVD)54. On the one hand, in the current setup, we focus on unitary matrices of relatively small size, as the number of ports in our architecture is not necessarily long enough to take into account the asymptotic analysis of random matrix theory. On the other hand, the singular value decomposition is not particularly helpful when dealing with unitary matrices, as for a complex-valued matrix A, the SVD requires the computation of the spectral properties of and , which for unitary matrices is always equal to the identity matrix. For these reasons, the density criterion discussed below suits better for the interlaced architectures here constructed.
To better understand the importance of dense matrices in our universal architecture, let us further inspect the factorization in (1). By fixing as a diagonal unitary matrix, for for , it is straightforward to notice that (1) reduces to a diagonal unitary matrix as well, which is far from representing a universal device. That is, diagonal F matrices can only reconstruct diagonal unitary matrices. This behavior extends to any of the different permutations allowed to the diagonal matrix F, as the factorized matrix acquires the same structure as that of the permuted F matrix. Now, let us consider the set of unitary matrices , where and , such that is an N-dimensional block-diagonal unitary matrix. This particular selection for F leads to a unitary operator with the same structure, lacking the required universality property. Although block-diagonal matrices F lose their structure when randomly rearranged, anti-diagonal block matrices will always result in either block-diagonal or anti-block-diagonal matrices, neither of which are universal.
Thus, we have identified a particular class of bad-performing interlacing matrices F, which are useful to trace a suitable Goldilocks region where the passive F matrices have the required density property. To this end, let us first remark that any N-dimensional unitary matrix can be written as , where are complex-valued column vectors that form an orthogonal set through the Euclidean inner product in , i.e., . From the orthogonality condition, we can simply focus on the columns (or equivalently the rows) of , whereas the normalization imposes a constraint on elements across the columns (rows). Since the elements of are complex numbers, we alternative work with the matrix , composed of the modulus of the elements of . Let us define as the q-th element of the p-th column vector , with , so that . From the unitarity of , it follows that for all p, q, so that we can focus on the density of either the columns or rows of . Without loss of generality, we work with the columns. Since we are interested in how the elements are sparse across each column, we compute the corresponding variance
for each column p.
Following the normalization condition, it is straightforward to prove that the variance is a bounded quantity in the closed interval , where the lower bound corresponds to the case where all the elements of are equal, i.e., is maximally spread. The upper bound corresponds to the case where is one of the canonical basis vectors, . Thus, a given column p of is said to be denser if its corresponding variance approaches . The more sparse the elements of the column p, the more approaches 0. These are the key ideas we use henceforth to characterize density across the full matrix . Let be the set of variances associated with each column of . We define the mean and standard deviation associated with the elements of , so that the density of a given unitary matrix can be characterized by defining the point
3 |
Since row permutation leaves invariant and column permutation only permutes the index p, the quantities and , and consequently the point , are permutation invariant.
There are two note-worthy extremal cases, namely the maximally dense and the diagonal cases (sparsest cases) unitary matrices. In the former case, is composed of column vectors so that the variances vanish, , for all . The DFT matrix of dimension N is such an example, leading to , which we consider as the ideal case. For the second case, the variances are maximal per each column, , for all , and the statistical information of the matrix reduces to and . We thus have two comparison points, from which we find the bounded interval . Additional reference points can be traced out if we take the block-diagonal matrices , with unitary and maximally dense (DFT) matrices of dimension for , , and , so that bad-performing matrices are generated (see discussion above). Particularly, let us consider the case so that , with . One can assign the indexes and , with nonequivalent ways to define the two block-diagonal matrices F that leads to the reference points
Although further reference points exist for , those points are farther from the ideal (maximally dense) case than those marked with , and are thus disregarded.
In this form, we can just focus on the area spanned by the points marked between and . Interestingly, for and , one obtains the reference points with smaller standard deviation, which are and for even and odd N, respectively. Note that in the limit , the mean converges to the non-vanishing value 1/2. In turn, the maximum standard deviation is determined by minimizing in terms of , from which one obtains the critical value and the maximum standard deviation . That is, the standard deviation is bounded to the interval , where the upper bound is independent of N and is given by . The latter allows us forming a polygon with vertices at and , the area of which is non-null and finite even for (see Fig. 3). We focus on unitary matrices whose vector lies inside the latter polygon while avoiding the vertices, as the latter are the well-known bad-performing cases (with the exception of ). We can go a step further and make a better prediction of unitary matrices by reducing the area of the polygon and imposing a threshold to and . For the former, we already know that the standard deviation reaches its maximum value at for all N. We thus implement the threshold at half of the maximum allowed standard deviation, i.e., . For even N, the reference points are above such a threshold for . Likewise, we fix the threshold for the mean at .
Therefore, the Goldilocks region is defined as the region spanned by the interception of the polygon spanned by the set of points and the thresholds and . This region is illustrated in Fig. 3a by the blue-shaded area. Any unitary matrix F whose associated vector lies inside the Goldilocks region is said to fulfill the Goldilocks principle; i.e., the components of F have the statistical properties to be deemed as dense enough to render a universal factorization in (1).
Random matrices and performance test
To test the performance of the Goldilocks principle, we generate sets of random unitary matrices (not necessarily Haar random) and determine the corresponding in the plane . To ensure control over the testing matrices, we generate random matrices using the decomposition , with a Hermitian matrix in to be defined. Furthermore, we introduce the four non-symmetric matrices , , , and , with the column null-vector in . The column vectors are composed of zeros in the first j inputs and random numbers elsewhere, with . We thus consider the Hermitian construction as , with . In this form, the number of random parameters increases in each case as additional non-null columns are included, rendering random unitary matrices defined by higher number of random parameters. In other words, the density of the random matrices is expected to be higher for D than A, B, C.
In this form, we establish a controlled benchmark for the Goldilocks principle and the subsequent universality of the matrix under consideration. Here, 50 target unitary matrices are considered for each testing unitary matrix so that the relative error of the optimized targets from (1) and the corresponding vector can be analyzed for a broad number of cases. Particularly, Fig. 3a depicts the reference points (filled-circles) and the points associated with the sets of random unitary matrices for and . This allows determining which unitary matrices have points lying inside the Goldilocks region. One may notice that, for the random unitary matrices , only the points and are expected to fulfill the Goldilocks principle. In turn, we expect more well-behaved matrices , with only outside of the Goldilocks region. This is corroborated in Fig. 3b, where the optimization routine using the LMA has been implemented for each testing target matrix. Indeed, the error norm for the matrices and render values within the preestablished tolerance values, as predicted in Fig. 3a. On the other hand, the numerical optimization also reveals that and should be good candidates, whereas the density criterion has ruled them out. This is an example of false-negative outcomes. As seen from both Fig. 3b,c, this is usually the case for matrices F with lying in the vicinity of the Goldilocks region. Due to the existence of false-negative results, we deem the density criterion as only a sufficient condition. Likewise, a similar analysis can be carried out for the testing matrices and . To complement the analysis, and to better visualize and assess the Goldilocks region, we put forward an alternative representation in Fig. 3c, where we depict both and separately. Here, the Goldilocks principle is established if both quantities lie below their respective thresholds and . In this form, it is no longer required to draw the universality region, and one can assess the Goldilocks principle in a simple plot.
As previously discussed, the the computational time required to optimize (1) and test the corresponding g universality for a given choice of F increases with the total number of ports N. In turn, the identification of the Goldilocks region and the associated vector for a specific matrix F enables a quick classification for preselecting the feasibility of the matrix. Furthermore, the Goldilocks region spanned in the -space is finite and non-null for , making it a suitable measure for architectures with an arbitrary number of ports.
Photonic platform and feasible realizations
So far, the universality of the proposed architecture has been numerically established using different choices of the intervening matrix F as the passive mixing layer possessing the required density criterion. In the following, we will discuss potential candidates for creating F matrices using photonic systems, with a specific emphasis on photonic lattices and meshes of directional couplers. Such systems can be readily implemented with silicon photonics. One can build such structures, e.g., using buried silicon waveguides at the telecommunication wavelength of 1550nm. The waveguide system comprises a silicon (Si) core surrounded by a silica (SiO2) cladding and substrate with refractive indices and , respectively. The core dimensions are 500nm in width and 220nm in height. Such a geometry renders the fundamental quasi-TE01 mode with an effective mode index of , which is the operational mode used for the unitary devices discussed in the following. This applies to passive F matrix solutions based on both waveguide arrays and directional coupler meshes. For the active layers, phase shifters based on thermo-optical effects can be considered. This might include solutions based on metal heaters55,56, which are widely implemented by open-access foundries and occupy an approximate area of 370 30 . Alternatively, one can consider ultra-compact phase shifters based on phase-change materials (PCMs)7, achieving phase shifts of approximately radians per one-micron length57. Although the latter is not as broadly implemented as metal-heater solutions in foundries, it lights up the path for the future of dense programmable photonic chips.
Waveguide lattices
Waveguide arrays can be modeled with high precision using coupled-mode theory58. The latter takes into account the coupling of evanescent waves from one waveguide interacting with its nearest neighbor while neglecting farther neighbors due to their weak coupling. In this form, the effective Hamiltonian describing an array of N waveguides is characterized by a tridiagonal and symmetric matrix Hamiltonian of dimension N. The wave evolution through the lattice is ruled by the dynamical law , where is the mode field amplitude at each waveguide at the propagation distance z. Since , the wave evolution is determined through the unitary evolution operator as .
We can thus implement waveguide arrays in the universal architecture, provided they fulfill the desired universality. To this end, we can test the behavior of a given lattice evolution operator for specific lengths using the Goldilocks principle. Particularly, we consider the photonic lattice46, the homogeneous lattice59, and the disordered homogeneous lattice60 as the physical waveguide arrays under consideration described by the respective Hamiltonians , , and . The matrix elements of the latter are explicitly given by
4 |
with . Here, stands for the coupling parameter between nearest waveguide neighbors in the lattice, where is a design scaling factor, and are random numbers taken from the normal distribution characterizing the disorder effects. The coupled waveguide implementation for each Hamiltonian is depicted in Fig. 4.
The corresponding unitary evolution operators are simply given by and . Although both are functions of the lattice length z, the lattice (Fig. 4a) has equidistant eigenvalues that lead to a periodic unitary evolution operator in z, so that we can simply focus on the interval . We first estimate the lengths that induce universality in our architecture, which is depicted in Fig. 4b. In the latter, we mark the particular lengths and that denote the local minima and maxima of the standard deviation , the exact values of which have been determined numerically and presented in Table 1. In turn, the black-thick line in Fig. 4b denotes the lengths where both and are below the universality threshold; i.e., the lengths where universality is expected. Without any prior performance test, one can see that the lengths , , and may fail in obtaining the desired universality. Recall that our estimation criterion is only a sufficient condition and may rule out positive cases. Nevertheless, all the other marked points can be considered candidates for the matrix F in our architecture, as no false positive cases will be included. This is indeed verified in the performance test portrayed in Fig. 4c, where, for each point, we have used fifty randomly generated unitary matrices as targets. The latter confirms our predictions, where the only bad-performing length is found at , reinforcing the fact that only two positive cases were discarded, but no false positives were included. In this form, we can confidently conclude that a universal architecture can be built using lattices with lengths as small as for .
Table 1.
z | Homogeneous | z | Homogeneous | ||
---|---|---|---|---|---|
0.48781 | 1.1968 | 0.6579 | 1.6099 | ||
0.78735 | 1.9211 | 0.9256 | 2.2285 | ||
1.11393 | 2.8088 | 1.2227 | 2.9441 | ||
1.32647 | 3.3290 | 1.4293 | 3.5156 | ||
1.52812 | 4.1988 | 4.3984 | |||
5.1835 | 4.8832 | ||||
5.6182 | 5.4811 | ||||
6.1544 |
We alternatively consider the homogeneous lattice, which contains waveguide arrays homogeneously distributed (Fig. 4d). The eigenvalues accordingly distributed as , and no periodic behavior is expected. We thus focus on the interval for this particular lattice. The density criterion shown in Fig. 4e reveals that lattice lengths in the interval are suitable for our universal architecture. Particularly, note that the interval contains lengths so that and remain mostly constant with minor variations. Thus, the performance test in this interval is expected to perform well. In the Goldilocks region, there is a local minimum isolated and associated with a shorter lattice length. This reference point may be useful for reducing the size of the universal structure. Figure 4f displays the corresponding performance test, which supports our previous statements. As expected, the performance for lattices with length and is particularly poor. However, the length has a generally good performance, with only two test targets displaying slightly higher errors than the other well-performing cases.
We additionally take into account the effects of disorder on the homogeneous lattice, which may be caused by impurities or imperfections during the manufacturing process, resulting in waveguides not being in their ideal positions or displaying deviations in their sizes (see Fig. 4g). The defects here considered are such that the nearest-neighbor interactions deviate from the ideal homogeneous lattice by a factor of twenty percent; i.e., the disorder couplings in (4) take values from the normal distribution as . Although the lattice structure is modified in the latter disorder, the estimation of density does not differ significantly from the ideal case, as shown in Fig. 4h–i.
Directional coupler mesh
Alternatively, the matrix F can be optically realized through proper mesh arrays of directional couplers. Particularly, we consider a construction based on two-port passive elements, which act as a power divider (3-dB directional coupler) equivalent, up to a global phase, to the unitary matrix , with the conventional Pauli matrices for and the identity matrix. The latter can be used as a building block to construct other U(2) matrices61 as well as higher dimensional unitary matrices U(N) through appropriate Kronecker products22. The silicon photonics platform discussed above allows the implementation of each 3-dB directional coupler through a coupling length and waveguide separation of 29.35 m and 630 nm, respectively. In this section, we consider the symmetric 10-port array portrayed in Fig. 5a, composed of power dividers interconnected through different layers and . Here, each layer is described by the U(10) matrices and , with and the Kronecker product and Kronecker sum (Such operations are also known as direct product and direct sum.), respectively, and the identity matrix. The p-layered unitary matrix describing the power divider array is thus given by
5 |
where we have truncated the maximum number of layers to ten.
It is not mandatory to truncate these layers, and the procedure can involve additional layers if needed. However, for practical physical implementations and to reduce the device footprint, we aim for devices with a minimal number of layers. To this end, we estimate the Goldilocks region in Fig. 5b as a function of the number of layers p, revealing that , are indeed good candidates. This is further corroborated through the LMA optimization results shown in Fig. 5c, which indeed shows that layers render a well-performing F layer. Notably, the latter also indicates that is also a valid choice. The case was originally deemed inadequate from the Goldilocks principle, but Fig. 5b shows that is in the vicinity of the threshold, which, as discussed above, usually renders false negative outcomes. Despite the latter, no false positives were detected during the analysis; that is, the Goldilocks principle did not show good-performing cases that contained high error norms L. This is strictly necessary to avoid faulty designs in the final architecture.
Conclusions
We have introduced the design for a lossless universal photonic architecture based on a layered scheme of interlaced active phase shifter layers and passive random matrices. Numerical results obtained from the LMA optimization revealed that generating Haar random matrices F leads, in a vast majority of cases, to the desired universal architecture. It is observed that well-behaved matrices F show a phase transition on the error norm L of the reconstructed target at , with M the total number of phase shifter layers. In such a layer number, the error drops significantly to numerical noise values. While this is not proof that the factorization is exact, the error involved in the reconstruction process lies in the numerical error regime, and it is thus low enough to ensure that any unitary matrix is reconstructed with the desired accuracy.
Despite the accuracy of the LMA optimization, the computational time required for testing the universality of the random matrices F scales with the total number of ports, which becomes impractical for particularly large architectures. Numerical evidence shows that denser matrices perform better than sparse ones, usually involving relatively large errors. Therefore, a density criterion has been devised and introduced to classify the candidates for the matrix F used in the architecture. This criterion is built on preliminary knowledge of bad-performing matrices, such as diagonal and block diagonal matrices, which are analytically known to fail but serve as reference points to look for good-performing matrices, such as the DFT case. In this form, instead of performing a long optimization routine on the candidate for F, we simply analyze the standard deviation of the modulus of its columns or rows, which provides information about its density. This allows defining a mapping , which renders a vector that estimates whether F is suitable for the architecture. We thus possess a tool to preselect matrices F beforehand, making the design process more practical than generating and testing several random matrices.
Our tests using randomly generated unitary matrices showed that matrices within the threshold marked by the density criterion led to the required universality. Thus, universality is not limited to a specific realization of F; as shown in the results, infinitely many unitary matrices can meet our requirements. This paves the way for more efficient construction and optimization of compact devices that are simultaneously resilient to random defects. Particularly, the photonic Jx lattice was found suitable for this task at lengths different than the previously reported critical value 37. This defines intervals in the lattice length for which the architecture is universal, leading to more flexibility in the manufacturing process so that one can allow for deviations in lattice length. This fact is further supported in the context of homogeneous lattices, which are also suitable for our architecture and robust against disorder effects due to waveguide impurities or mismatching sizes. The latter was tested by introducing deviations of up to into the homogeneous lattice, from which the density estimation showed no significant difference in the universality performance for the lattice lengths considered. Further constructions for the F matrices are indeed allowed, and an alternative construction based on a layered array of power dividers was shown to be efficient for our purposes, the analysis of which allowed us to determine the optimal number of passive elements required for the architecture.
The universality of the interlaced architecture in (1) can be further assessed using the density criterion for optical implementations of the passive matrix F beyond the waveguide arrays and directional coupler mesh discussed in the manuscript, as long as F is described by a unitary matrix. For instance, the DFT can be implemented using MMI couplers using the self-image property48. The MMI construction of the DFT has been used to develop an interlaced unitary akin to (1), but in such a construction, it was analytically proved that phase layers are required using a DFT as the intervening layer12. Our results in Fig. 2b provide numerical evidence that such a construction can be realized with only phase layers with the prescribed accuracy. Furthermore, a microwave implementation of the DFrFT has been theoretically and experimentally validated62, offering an alternative approach for the interlaced architecture. The latter helps facilitate the design of the passive layer to reduce the overall architecture size, account for potential manufacturing errors, and diminish the device footprint. This is particularly handy when deploying more complex optical circuits.
Acknowledgements
This project is supported by the U.S. Air Force Office of Scientific Research (AFOSR) Young Investigator Program (YIP) Award# FA9550-22-1-0189 and the City University of New York (CUNY) Junior Faculty Research Award in Science and Engineering (JFRASE) funded by the Alfred P. Sloan Foundation.
Author contributions
All authors have accepted responsibility for the entire content of this manuscript and approved its submission.
Data availability
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
Competing interest
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Kevin Zelaya, Matthew Markowitz and Mohammad-Ali Miri.
References
- 1.Joannopoulos, J. D., Villeneuve, P. R. & Fan, S. Photonic crystals: Putting a new twist on light. Nature386, 143–149. 10.1038/386143a0 (1997). [Google Scholar]
- 2.Yan, R., Gargas, D. & Yang, P. Nanowire photonics. Nat. Photon.3, 569–576. 10.1038/nphoton.2009.184 (2009). [Google Scholar]
- 3.Shen, Y. et al. Deep learning with coherent nanophotonic circuits. Nat. Photon.11, 441–446. 10.1038/nphoton.2017.93 (2017). [Google Scholar]
- 4.Zelaya, K. & Miri, M.-A. Integrated photonic fractional convolution accelerator (2023). ArXiv:2307.10976 [physics.optics]
- 5.Liao, K. et al. Matrix eigenvalue solver based on reconfigurable photonic neural network. Nanophotonics11, 4089–4099. 10.1515/nanoph-2022-0109 (2022). [Google Scholar]
- 6.Mao, J.-Y., Zhou, L., Zhu, X., Zhou, Y. & Han, S.-T. Photonic memristor for future computing: A perspective. Adv. Opt. Mater.7, 1900766. 10.1002/adom.201900766 (2019). [Google Scholar]
- 7.Youngblood, N., Ríos Ocampo, C. A., Pernice, W. H. & Bhaskaran, H. Integrated optical memristors. Nat. Photon.10.1038/s41566-023-01217-w (2023). [Google Scholar]
- 8.Ashtiani, F., Geers, A. J. & Aflatouni, F. An on-chip photonic deep neural network for image classification. Nature606, 501–506. 10.1038/s41586-022-04714-0 (2022). [DOI] [PubMed] [Google Scholar]
- 9.Liao, K., Dai, T., Yan, Q., Hu, X. & Gong, Q. Integrated photonic neural networks: Opportunities and challenges. ACS Photon.10.1021/acsphotonics.2c01516 (2023). [Google Scholar]
- 10.Saygin, M. Y. et al. Robust architecture for programmable universal unitaries. Phys. Rev. Lett.124, 010501. 10.1103/PhysRevLett.124.010501 (2020). [DOI] [PubMed] [Google Scholar]
- 11.Zhou, H. et al. Photonic matrix multiplication lights up photonic accelerator and beyond. Light Sci. Appl.11, 30. 10.1038/s41377-022-00717-8 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Pastor, V. L., Lundeen, J. & Marquardt, F. Arbitrary optical wave evolution with Fourier transforms and phase masks. Opt. Express29, 38441–38450. 10.1364/OE.432787 (2021). [DOI] [PubMed] [Google Scholar]
- 13.Madsen, L. S. et al. Quantum computational advantage with a programmable photonic processor. Nature606, 75–81. 10.1038/s41586-022-04725-x (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Harris, N. C. et al. Quantum transport simulations in a programmable nanophotonic processor. Nat. Photon.11, 447–452. 10.1038/nphoton.2017.95 (2017). [Google Scholar]
- 15.Slussarenko, S. & Pryde, G. J. Photonic quantum information processing: A concise review. Appl. Phys. Rev.10.1063/1.5115814 (2019). [Google Scholar]
- 16.Notaros, J. et al. Programmable dispersion on a photonic integrated circuit for classical and quantum applications. Opt. Express25, 21275–21285. 10.1364/OE.25.021275 (2017). [DOI] [PubMed] [Google Scholar]
- 17.Xu, X. et al. Neuromorphic computing based on wavelength-division multiplexing. IEEE J. Sel. Top. Quantum Electron.29, 1–12. 10.1109/JSTQE.2022.3203159 (2023). [Google Scholar]
- 18.Zhu, H. et al. Space-efficient optical computing with an integrated chip diffractive neural network. Nat. Commun.13, 1044. 10.1038/s41467-022-28702-0 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Li, X.-K. et al. High-efficiency reinforcement learning with hybrid architecture photonic integrated circuit. Nat. Commun.15, 1044. 10.1038/s41467-024-45305-z (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Tang, R. et al. Two-layer integrated photonic architectures with multiport photodetectors for high-fidelity and energy-efficient matrix multiplications. Opt. Express30, 33940–33954. 10.1364/OE.457258 (2022). [DOI] [PubMed] [Google Scholar]
- 21.Xu, S. et al. Parallel optical coherent dot-product architecture for large-scale matrix multiplication with compatibility for diverse phase shifters. Opt. Express30, 42057–42068. 10.1364/OE.471519 (2022). [DOI] [PubMed] [Google Scholar]
- 22.Reck, M., Zeilinger, A., Bernstein, H. J. & Bertani, P. Experimental realization of any discrete unitary operator. Phys. Rev. Lett.73, 58. 10.1103/PhysRevLett.73.58 (1994). [DOI] [PubMed] [Google Scholar]
- 23.Miller, D. A. All linear optical devices are mode converters. Opt. Express20, 23985–23993. 10.1364/OE.20.023985 (2012). [DOI] [PubMed] [Google Scholar]
- 24.Miller, D. A. Self-configuring universal linear optical component. Photon. Res.1, 1–15. 10.1364/PRJ.1.000001 (2013). [Google Scholar]
- 25.Miller, D. A. Self-aligning universal beam coupler. Opt. Express21, 6360–6370. 10.1364/OE.21.006360 (2013). [DOI] [PubMed] [Google Scholar]
- 26.Clements, W. R., Humphreys, P. C., Metcalf, B. J., Kolthammer, W. S. & Walmsley, I. A. Optimal design for universal multiport interferometers. Optica3, 1460–1465. 10.1364/OPTICA.3.001460 (2016). [Google Scholar]
- 27.Burgwal, R. et al. Using an imperfect photonic network to implement random unitaries. Opt. Express25, 28236–28245. 10.1364/OE.25.028236 (2017). [Google Scholar]
- 28.Shokraneh, F., Geoffroy-Gagnon, S. & Liboiron-Ladouceur, O. The diamond mesh, a phase-error-and loss-tolerant field-programmable mzi-based optical processor for optical neural networks. Opt. Express28, 23495–23508. 10.1364/OE.395441 (2020). [DOI] [PubMed] [Google Scholar]
- 29.Mojaver, K. H. R., Zhao, B., Leung, E., Safaee, S. M. R. & Liboiron-Ladouceur, O. Addressing the programming challenges of practical interferometric mesh based optical processors. Opt. Express31, 23851–23866. 10.1364/OE.489493 (2023). [DOI] [PubMed] [Google Scholar]
- 30.On, M. B. et al. Programmable integrated photonics for topological hamiltonians (2023). Arxiv:2307.05003 [physics.optics] [DOI] [PMC free article] [PubMed]
- 31.Wang, M. et al. Topologically protected entangled photonic states. J. Nanophoton.8, 1327–1335. 10.1515/nanoph-2019-0058 (2019). [Google Scholar]
- 32.Labroille, G. et al. Efficient and mode selective spatial mode multiplexer based on multi-plane light conversion. Opt. Express22, 15599–15607. 10.1364/OE.22.015599 (2014). [DOI] [PubMed] [Google Scholar]
- 33.Morizur, J.-F. et al. Programmable unitary spatial mode manipulation. JOSA A27, 2524–2531. 10.1364/JOSAA.27.002524 (2010). [DOI] [PubMed] [Google Scholar]
- 34.Kulce, O., Mengu, D., Rivenson, Y. & Ozcan, A. All-optical synthesis of an arbitrary linear transformation using diffractive surfaces. Light Sci. Appl.10, 196. 10.1038/s41377-021-00623-5 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Tanomura, R., Tang, R., Ghosh, S., Tanemura, T. & Nakano, Y. Robust integrated optical unitary converter using multiport directional couplers. J. Lightw. Technol.38, 60–66. 10.1109/JLT.2019.2943116 (2020). [Google Scholar]
- 36.Tanomura, R. et al. Scalable and robust photonic integrated unitary converter based on multiplane light conversion. Phys. Rev. Appl.17, 024071. 10.1103/PhysRevApplied.17.024071 (2022). [Google Scholar]
- 37.Markowitz, M. & Miri, M.-A. Universal unitary photonic circuits by interlacing discrete fractional fourier transform and phase modulation (2023). ArXiv:2307.07101 [physics.optics]
- 38.Ribeiro, A., Ruocco, A., Vanacker, L. & Bogaerts, W. Demonstration of a 4 × 4-port universal linear circuit. Optica3, 1348–1357. 10.1364/OPTICA.3.001348 (2016). [Google Scholar]
- 39.Taballione, C. et al. 8 × 8 reconfigurable quantum photonic processor based on silicon nitride waveguides. Opt. Express27, 26842–26857. 10.1364/OE.27.026842 (2019). [DOI] [PubMed] [Google Scholar]
- 40.Tang, R., Tanomura, R., Tanemura, T. & Nakano, Y. Ten-port unitary optical processor on a silicon photonic chip. ACS Photon.8, 2074–2080. 10.1021/acsphotonics.1c00419 (2021). [Google Scholar]
- 41.Taballione, C. et al. A universal fully reconfigurable 12-mode quantum photonic processor. Mater. Quant. Technol.1, 035002. 10.1088/2633-4356/ac168c (2021). [Google Scholar]
- 42.Tanomura, R., Tanomura, T. & Nakano, Y. Multi-wavelength dual-polarization optical unitary processor using integrated multi-plane light converter. Jpn. J. Appl. Phys.62, 1029. 10.35848/1347-4065/acab70 (2023). [Google Scholar]
- 43.Skryabin, N., Dyakonov, I., Saygin, M. Y. & Kulik, S. Waveguide-lattice-based architecture for multichannel optical transformations. Opt. Express29, 26058–26067. 10.1364/OE.426738 (2021). [DOI] [PubMed] [Google Scholar]
- 44.Markowitz, M., Zelaya, K. & Miri, M.-A. Auto-calibrating universal programmable photonic circuits: Hardware error-correction and defect resilience. Opt. Express31, 37673–37682. 10.1364/OE.502226 (2023). [DOI] [PubMed] [Google Scholar]
- 45.Markowitz, M., Zelaya, K. & Miri, M.-A. Learning arbitrary complex matrices by interlacing amplitude and phase masks with fixed unitary operations (2023). ArXiv:2312.05648 [physics.optics]
- 46.Weimann, S. et al. Implementation of quantum and classical discrete fractional Fourier transforms. Nat. Commun.7, 11027. 10.1038/ncomms11027 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Cooney, K. & Peters, F. H. Analysis of multimode interferometers. Opt. Express24, 22481–22515. 10.1364/OE.24.022481 (2016). [DOI] [PubMed] [Google Scholar]
- 48.Bachmann, M., Besse, P. A. & Melchior, H. General self-imaging properties in n × n multimode interference couplers including phase relations. Appl. Opt.33, 3905–3911. 10.1364/AO.33.003905 (1994). [DOI] [PubMed] [Google Scholar]
- 49.Taguchi, Y., Wang, Y., Tanomura, R., Tanemura, T. & Ozeki, Y. Iterative configuration of programmable unitary converter based on few-layer redundant multiplane light conversion. Phys. Rev. Appl.19, 054002. 10.1103/PhysRevApplied.19.054002 (2023). [Google Scholar]
- 50.Mezzadri, F. How to generate random matrices from the classical compact groups (2007). ArXiv:math-ph/0609050
- 51.Levenberg, K. A method for the solution of certain non-linear problems in least squares. Q. Appl. Math.2, 164–168 (1944). [Google Scholar]
- 52.Marquardt, D. W. An algorithm for least-squares estimation of nonlinear parameters. J. Soc. Ind. Appl. Math.11, 431–441 (1963). [Google Scholar]
- 53.Tang, R., Tanemura, T. & Nakano, Y. Integrated reconfigurable unitary optical mode converter using mmi couplers. IEEE Photon. Technol. Lett.29, 971–974. 10.1109/LPT.2017.2700619 (2017). [Google Scholar]
- 54.Livan, G., Novaes, M. & Vivo, P. Introduction to Random Matrices: Theory and Practice (Springer, 2018). [Google Scholar]
- 55.Harris, N. C. et al. Efficient, compact and low loss thermo-optic phase shifter in silicon. Opt. Express22, 10487–10493. 10.1364/OE.22.010487 (2014). [DOI] [PubMed] [Google Scholar]
- 56.Liu, S. et al. Thermo-optic phase shifters based on silicon-on-insulator platform: State-of-the-art and a review. Front. Optoelectron.15, 9. 10.1007/s12200-022-00012-9 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Ríos, C. et al. Ultra-compact nonvolatile phase shifter based on electrically reprogrammable transparent phase change materials. PhotoniX3, 26. 10.1186/s43074-022-00070-4 (2022). [Google Scholar]
- 58.Huang, W.-P. Coupled-mode theory for optical waveguides: An overview. JOSA A11, 963–983. 10.1364/JOSAA.11.000963 (1994). [Google Scholar]
- 59.Christodoulides, D. N., Lederer, F. & Silberberg, Y. Discretizing light behaviour in linear and nonlinear waveguide lattices. Nature424, 817–823. 10.1038/nature01936 (2003). [DOI] [PubMed] [Google Scholar]
- 60.Miri, M.-A. Integrated random projection and dimensionality reduction by propagating light in photonic lattices. Opt. Lett.46, 4936–4939. 10.1364/OL.433101 (2021). [DOI] [PubMed] [Google Scholar]
- 61.de Guise, H., Di Matteo, O. & Sánchez-Soto, L. L. Simple factorization of unitary transformations. Phys. Rev. A97, 022328. 10.1103/PhysRevA.97.022328 (2018). [Google Scholar]
- 62.Keshavarz, R., Shariati, N. & Miri, M.-A. Real-time discrete fractional Fourier transform using metamaterial coupled lines network. IEEE Trans. Microw. Theory Tech.71, 3414–3423. 10.1109/TMTT.2023.3278929 (2023). [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.