Design of task-specific optical systems using broadband diffractive neural networks

Luo, Yi; Mengu, Deniz; Yardimci, Nezih T.; Rivenson, Yair; Veli, Muhammed; Jarrahi, Mona; Ozcan, Aydogan

doi:10.1038/s41377-019-0223-1

Article
Open access
Published: 02 December 2019

Yi Luo (ORCID: orcid.org/0000-0001-9442-547X)^1,2,3 na1,
Deniz Mengu^1,2,3 na1,
Nezih T. Yardimci^1,3,
Yair Rivenson^1,2,3,
Muhammed Veli^1,2,3,
Mona Jarrahi^1,3 &
Aydogan Ozcan (ORCID: orcid.org/0000-0002-0717-683X)^1,2,3,4

Light: Science & Applications volume 8, Article number: 112 (2019)

Abstract

Deep learning has been transformative in many fields, motivating the emergence of various optical computing architectures. Diffractive optical network is a recently introduced optical computing framework that merges wave optics with deep-learning methods to design optical neural networks. Diffraction-based all-optical object recognition systems, designed through this framework and fabricated by 3D printing, have been reported to recognize hand-written digits and fashion products, demonstrating all-optical inference and generalization to sub-classes of data. These previous diffractive approaches employed monochromatic coherent light as the illumination source. Here, we report a broadband diffractive optical neural network design that simultaneously processes a continuum of wavelengths generated by a temporally incoherent broadband source to all-optically perform a specific task learned using deep learning. We experimentally validated the success of this broadband diffractive neural network architecture by designing, fabricating and testing seven different multi-layer, diffractive optical systems that transform the optical wavefront generated by a broadband THz pulse to realize (1) a series of tuneable, single-passband and dual-passband spectral filters and (2) spatially controlled wavelength de-multiplexing. Merging the native or engineered dispersion of various material systems with a deep-learning-based design strategy, broadband diffractive neural networks help us engineer the light–matter interaction in 3D, diverging from intuitive and analytical design methods to create task-specific optical components that can all-optically perform deterministic tasks or statistical inference for optical machine learning.

Introduction

Deep learning has been redefining the state-of-the-art results in various fields, such as image recognition^1,2, natural language processing³ and semantic segmentation^4,5. The photonics community has also benefited from deep-learning methods in various applications, such as microscopic imaging^6,7,8,9,10 and holography^11,12,13, among many others^14,15,16,17. Aside from optical imaging, deep learning and related optimization tools have recently been utilized to solve inverse problems in optics related to, e.g., nanophotonic designs and nanoplasmonics^{18,19,20,21,22}. These successful demonstrations and many others have also inspired a resurgence in the design of optical neural networks and other optical computing techniques motivated by their advantages in terms of parallelization, scalability, power efficiency, and computation speed^{23,24,25,26,27,28,29}. A recent addition to this family of optical computing methods is Diffractive Deep Neural Networks (D²NNs)^27,30,31, which are based on light–matter interaction engineered by successive diffractive layers designed in a computer by deep-learning methods such as error backpropagation and stochastic gradient descent. Once the training phase is finalized, a diffractive optical network that is composed of transmissive and/or reflective layers is physically fabricated using, e.g., 3D printing or lithography. Each diffractive layer consists of elements (termed neurons) that modulate the phase and/or amplitude of the incident beam at their corresponding location in space, connecting one diffractive layer to successive ones through spherical waves based on the Huygens–Fresnel principle²⁷. Using spatially and temporally coherent illumination, these neurons at different layers collectively compute the spatial light distribution at the desired output plane based on a given task that is learned. Diffractive optical neural networks designed using this framework and fabricated by 3D printing were experimentally demonstrated to achieve all-optical inference and data generalization for object classification²⁷, a fundamental application in machine learning. Overall, multi-layer diffractive neural networks have been shown to achieve improved blind testing accuracy, diffraction efficiency and signal contrast with additional diffractive layers, exhibiting a depth advantage even when using linear optical materials^27,30,31. In all these previous studies on diffractive optical networks, the input light was both spatially and temporally coherent, i.e., a monochromatic plane wave was utilized at the input.

In general, diffractive optical networks with multiple layers enable generalization and perform all-optical blind inference on new input data (never seen by the network before), beyond the deterministic capabilities of the previous diffractive surfaces^{32,33,34,35,36,37,38,39,40,41,42} that were designed using different optimization methods to provide wavefront transformations without any data generalization capability. These previous demonstrations cover a variety of applications over different regions of the electromagnetic spectrum, providing unique capabilities compared to conventional optical components that are designed by analytical methods. While these earlier studies revealed the potential of single-layer designs using diffractive surfaces under temporally coherent radiation^33,34, the extension of these methods to broadband designs operating with a continuum of wavelengths has been a challenging task. Operating at a few discrete wavelengths, different design strategies have been reported using a single-layer phase element based on, e.g., composite materials³⁵ and thick layouts covering multiple 2π modulation cycles^{36,37,38,39,40}. In a recent work, a low numerical aperture (NA ~ 0.01) broadband diffractive cylindrical lens design was also demonstrated⁴³. In addition to these diffractive surfaces, metasurfaces also present engineered optical responses, devised through densely packed subwavelength resonator arrays that control their dispersion behaviour^{44,45,46,47,48}. Recent advances in metasurfaces have enabled several broadband, achromatic lens designs for, e.g., imaging applications^49,50,51. On the other hand, the design space of broadband optical components that process a continuum of wavelengths relying on these elegant techniques has been restrained to single-layer architectures, mostly with an intuitive analytical formulation of the desired surface function⁵².

Here, we demonstrate a broadband diffractive optical network that unifies deep-learning methods with the angular spectrum formulation of broadband light propagation and the material dispersion properties to design task-specific optical systems through 3D engineering of the light–matter interaction. Designed in a computer, a broadband diffractive optical network, after its fabrication, can process a continuum of input wavelengths all in parallel and perform a learned task at its output plane, resulting from the diffraction of broadband light through multiple layers. The success of broadband diffractive optical networks is demonstrated experimentally by designing, fabricating and testing different types of optical components using a broadband THz pulse as the input source (see Fig. 1). First, a series of single-passband and dual-passband spectral filters are demonstrated, where each design uses three diffractive layers fabricated by 3D printing, experimentally tested using the set-up shown in Fig. 1. A classical tradeoff between the Q-factor and the power efficiency is observed, and we demonstrate that our diffractive neural network framework can control and balance these design parameters on demand, i.e., based on the selection of the diffractive network training loss function. Combining the spectral filtering operation with spatial multiplexing, we also demonstrate spatially controlled wavelength de-multiplexing using three diffractive layers that are trained to de-multiplex a broadband input source onto four output apertures located at the output plane of the diffractive network, where each aperture has a unique target passband. Our experimental results obtained with these seven different diffractive optical networks that were 3D printed provided very good fits to our trained diffractive models.

**Fig. 1: Schematic of spectral filter design using broadband diffractive neural networks and the experimental set-up.**

We believe that broadband diffractive optical neural networks provide a powerful framework for merging the dispersion properties of various material systems with deep-learning methods to engineer light–matter interactions in 3D and help us create task-specific optical components that can perform deterministic tasks as well as statistical inference and data generalization. In the future, we also envision the presented framework to be empowered by various metamaterial designs as part of the layers of a diffractive optical network and to bring additional degrees of freedom by engineering and encoding the dispersion of the fabrication materials to further improve the performance of broadband diffractive networks.

Results

Design of broadband diffractive optical networks

Designing broadband, task-specific and small-footprint compact components that can perform arbitrary optical transformations is highly sought in all parts of the electromagnetic spectrum for various applications, including e.g., tele-communications⁵³, biomedical imaging⁵⁴ and chemical identification⁵⁵, among others. We approach this general broadband inverse optical design problem from the perspective of diffractive optical neural network training and demonstrate its success with various optical tasks. Unlike the training process of the previously reported monochromatic diffractive neural networks^27,30,31, in this work, the optical forward model is based on the angular spectrum formulation of broadband light propagation within the diffractive network, precisely taking into account the dispersion of the fabrication material to determine the light distribution at the output plane of the network (see the Methods section). Based on a network training loss function, a desired optical task can be learned through error backpropagation within the diffractive layers of the optical network, converging to an optimized spectral and/or spatial distribution of light at the output plane.
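As a rough illustration of the angular spectrum formulation mentioned above, the following sketch evaluates the free-space transfer function for a single temporal frequency and a single pair of spatial frequencies. It is a minimal sketch rather than the forward model used in this work: the function name, its arguments and the choice of a lossless medium with index `n_medium` are assumptions made for illustration. A full propagation step would multiply the 2D Fourier transform of the field by this factor and transform back, repeating the calculation for every frequency in the band.

```python
import cmath
import math

C = 299_792_458.0  # speed of light in vacuum, m/s


def angular_spectrum_transfer(fx, fy, freq_hz, z_m, n_medium=1.0):
    """Free-space transfer function H(fx, fy) for one temporal frequency.

    fx, fy   : spatial frequencies in cycles per metre
    freq_hz  : temporal frequency of the spectral component
    z_m      : axial distance between the two planes
    n_medium : refractive index of the (assumed lossless) gap medium
    """
    wavelength = C / (n_medium * freq_hz)
    arg = (1.0 / wavelength) ** 2 - fx ** 2 - fy ** 2
    # cmath.sqrt gives an imaginary root when arg < 0 (evanescent waves),
    # so the exponential below decays instead of oscillating.
    kz = 2.0 * math.pi * cmath.sqrt(arg)
    return cmath.exp(1j * kz * z_m)


# Example: on-axis component at 350 GHz over a 3 cm layer-to-layer gap.
h_on_axis = angular_spectrum_transfer(0.0, 0.0, 350e9, 0.03)
```

Because the wavelength enters both the propagating phase and the evanescent cut-off, each frequency of a broadband input sees a different transfer function for the same physical gap.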

In its general form, our broadband diffractive network design assumes an input spectral frequency band between f_min and f_max. Uniformly covering this range, M discrete frequencies are selected for use in the training phase. In each update step of the training, an input beam carrying a random subset of B frequencies out of these M discrete frequencies is propagated through the diffractive layers, and a loss function is calculated at the output plane, tailored according to the desired task; without loss of generality, B/M has been selected in our designs to be less than 0.5% (refer to the Methods section). At the final step of each iteration, the resultant error is backpropagated to update the physical parameters of the diffractive layers controlling the optical modulation within the optical network. The training cycle continues until either a predetermined design criterion at the network output plane is satisfied or the maximum number of epochs (where each epoch involves M/B successive iterations, going through all the frequencies between f_min and f_max) is reached. In our broadband diffractive network designs, the physical parameter to be optimized was selected as the thickness of each neuron within the diffractive layers, enabling the control of the phase modulation profile of each diffractive layer in the network. In addition, the material dispersion, including the real and imaginary parts of the refractive index of the network material as a function of the wavelength, was also taken into account to correctly represent the forward model of the broadband light propagation within the optical neural network. As a result of this, for each wavelength within the input light spectrum, we have a unique complex (i.e., phase and amplitude) modulation, corresponding to the transmission coefficient of each neuron, determined by its physical thickness, which is a trainable parameter for all the layers of the diffractive optical network.
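The two ingredients of this training loop, the thickness-dependent complex transmission of a neuron and the partition of the M training frequencies into batches of B, can be sketched as follows. This is an illustrative sketch under stated assumptions: the function names are hypothetical, the refractive index values in the example (n = 1.7, κ = 0.03) are placeholders rather than measured material data, and B = 20 is chosen only so that B/M stays below 0.5% for M = 7500.

```python
import cmath
import math
import random

C = 299_792_458.0  # speed of light in vacuum, m/s


def neuron_transmission(thickness_m, freq_hz, n_real, kappa, n_air=1.0):
    """Complex transmission of one neuron of a given thickness.

    n_real and kappa are the real and imaginary parts of the material
    refractive index at freq_hz; the phase is taken relative to air.
    """
    wavelength = C / freq_hz
    amplitude = math.exp(-2.0 * math.pi * kappa * thickness_m / wavelength)
    phase = 2.0 * math.pi * (n_real - n_air) * thickness_m / wavelength
    return amplitude * cmath.exp(1j * phase)


def frequency_batches(f_min, f_max, m, b, rng):
    """Shuffle M uniformly spaced frequencies and split them into batches of B.

    One pass over the returned list corresponds to one epoch of M/B iterations.
    """
    step = (f_max - f_min) / (m - 1)
    freqs = [f_min + i * step for i in range(m)]
    rng.shuffle(freqs)
    return [freqs[i:i + b] for i in range(0, m, b)]


# Example with placeholder material values (not measured data).
t = neuron_transmission(0.5e-3, 350e9, n_real=1.7, kappa=0.03)
batches = frequency_batches(0.25e12, 1.0e12, m=7500, b=20, rng=random.Random(0))
```

The same thickness value yields a different phase delay and attenuation at every frequency, which is what couples the single trainable parameter per neuron to the whole spectrum.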

Upon completion of this digital training phase in a computer, which typically takes ~5 h (see the Methods section for details), the designed diffractive layers were fabricated using a 3D printer, and the resulting optical networks were experimentally tested using the THz time-domain spectroscopy (TDS) system illustrated in Fig. 1, which has a noise-equivalent power bandwidth of 0.1–5 THz⁵⁶.

Single-passband spectral filter design and testing

Our diffractive single-passband spectral filter designs are composed of three diffractive layers, with a layer-to-layer separation of 3 cm and an output aperture positioned 5 cm away from the last diffractive layer, serving as a spatial filter, as shown in Fig. 1. For our spectral filter designs, the parameters M, f_min and f_max were taken as 7500, 0.25 THz and 1 THz, respectively. Using this broadband diffractive network framework employing three successive layers, we designed four different spectral bandpass filters with centre frequencies of 300 GHz, 350 GHz, 400 GHz and 420 GHz, as shown in Fig. 2a–d, respectively. For each design, the target spectral profile was set to have a flat-top bandpass over a narrow band (±2.5 GHz) around the corresponding centre frequency. During the training of these designs, we used a loss function that solely focused on increasing the power efficiency of the target band, without a specific penalty on the Q-factor of the filter (see the Methods section). As a result of this design choice during the training phase, our numerical models converged to bandpass filters centred around each target frequency, as shown in Fig. 2a–d. These trained diffractive models reveal the peak frequencies (and the Q-factors) of the corresponding designs to be 300.1 GHz (6.21), 350.4 GHz (5.34), 399.7 GHz (4.98) and 420.0 GHz (4.56), respectively. After the fabrication of each of these trained models using a 3D printer, we also experimentally tested these four different diffractive networks (Fig. 1) to find a very good match between our numerical testing results and the physical diffractive network results. Based on the blue-dashed lines depicted in Fig. 2a–d, the experimental counterparts of the peak frequencies (and the Q-factors) of the corresponding designs were calculated as 300.4 GHz (4.88), 351.8 GHz (7.61), 393.8 GHz (4.77) and 418.6 GHz (4.22).
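For readers who wish to extract comparable figures of merit from a sampled power spectrum, the sketch below estimates the peak frequency and a Q-factor. It assumes the common definition Q = f_peak / FWHM, with the half-maximum crossings found by linear interpolation; the exact definition used for the values quoted above is given in the Methods section and may differ, and the function name is hypothetical.

```python
import math


def peak_and_q_factor(freqs, power):
    """Return (peak frequency, Q) for a single-peaked power spectrum.

    Q is taken as f_peak divided by the full width at half maximum.
    freqs must be increasing and the passband must lie inside the range.
    """
    i_peak = max(range(len(power)), key=power.__getitem__)
    half = power[i_peak] / 2.0

    def crossing(i_in, i_out):
        # Linear interpolation between a sample above half maximum (i_in)
        # and the neighbouring sample at or below it (i_out).
        f_a, f_b = freqs[i_in], freqs[i_out]
        p_a, p_b = power[i_in], power[i_out]
        return f_a + (half - p_a) * (f_b - f_a) / (p_b - p_a)

    lo = i_peak
    while lo > 0 and power[lo] > half:
        lo -= 1
    hi = i_peak
    while hi < len(power) - 1 and power[hi] > half:
        hi += 1
    if power[lo] > half or power[hi] > half:
        raise ValueError("passband is not fully contained in the sampled range")

    f_low = crossing(lo + 1, lo)
    f_high = crossing(hi - 1, hi)
    return freqs[i_peak], freqs[i_peak] / (f_high - f_low)
```

The function only considers the lobe around the global maximum, so a multi-band spectrum would need to be windowed around each passband before calling it.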

**Fig. 2: Single-passband spectral filter designs using broadband diffractive neural networks and their experimental validation.**

Furthermore, the power efficiencies of these four different bandpass filter designs, calculated at the corresponding peak wavelength, were determined to be 23.13, 20.93, 21.76 and 18.53%, respectively. To shed more light on these efficiency values of our diffractive THz systems and estimate the specific contribution due to the material absorption, we analysed the expected power efficiency at 350 GHz by modelling each diffractive layer as a uniform slab (see the Methods section for details). Based on the extinction coefficient of the 3D-printing polymer at 350 GHz (Supplementary Figure S1), three successive flat layers, each with a 1 mm thickness, provide 27.52% power efficiency when the material absorption is assumed to be the only source of loss. This comparison reveals that the main source of power loss in our spectral filter models is in fact the material absorption, which can be circumvented by selecting different types of fabrication materials with lower absorption compared to our 3D printer material (VeroBlackPlus RGD875).
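The uniform-slab estimate can be reproduced with a Beer–Lambert calculation, sketched below under the assumption that absorption is the only loss (no interface reflections) and that the intensity absorption coefficient is 4πκ/λ. The function names are hypothetical. Because the extinction coefficient itself is reported only in Supplementary Figure S1, the example inverts the relation to obtain the value implied by the quoted 27.52% figure; that number is a back-calculation under these assumptions, not a measured quantity.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s


def slab_power_transmission(kappa, thickness_m, freq_hz, n_slabs=1):
    """Fraction of power transmitted through n_slabs identical absorbing slabs."""
    wavelength = C / freq_hz
    alpha = 4.0 * math.pi * kappa / wavelength  # intensity absorption, 1/m
    return math.exp(-alpha * thickness_m * n_slabs)


def kappa_from_transmission(power_fraction, thickness_m, freq_hz, n_slabs=1):
    """Extinction coefficient implied by a given absorption-only transmission."""
    wavelength = C / freq_hz
    total = thickness_m * n_slabs
    return -math.log(power_fraction) * wavelength / (4.0 * math.pi * total)


# Implied extinction coefficient for three 1 mm slabs at 350 GHz (about 0.029).
kappa_350 = kappa_from_transmission(0.2752, 1e-3, 350e9, n_slabs=3)
```

With this value in hand, the same function can be used to explore how thinner layers or a lower-loss material would change the absorption budget.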

To further exemplify the different degrees of freedom in our diffractive network-based design framework, Fig. 2e illustrates another bandpass filter design centred at 350 GHz, same as in Fig. 2b; however, different from Fig. 2b, this particular case represents a design criterion where the desired spectral filter profile was set as a Gaussian with a Q-factor of 10. Furthermore, the training loss function was designed to favour a high Q-factor rather than better power efficiency by penalizing Q-factor deviations from the target value more severely compared to poor power efficiency (see the Methods section for details). To provide a fair comparison between Figs. 2b and 2e, all the other design parameters, e.g., the number of diffractive layers, the size of the output aperture and the relative distances, are kept identical. Based on this new design (Fig. 2e), the numerical (experimental) values of the peak frequency and the Q-factor of the final model can be calculated as 348.2 GHz (352.9 GHz) and 10.68 (12.7), once again providing a very good match between our numerical testing and experimental results, following the 3D printing of the designed network model. Compared to the results reported in Fig. 2b, this improvement in the Q-factor also comes at the expense of a power efficiency drop to 12.76%, which is expected by design, i.e., the choice of the training loss function.
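The two kinds of target spectral profile discussed so far, a flat-top band of ±2.5 GHz and a Gaussian with a prescribed Q-factor, can be written down explicitly. The sketch below is illustrative only: the function names are hypothetical, and the Gaussian width assumes that the Q-factor refers to the full width at half maximum of the power profile.

```python
import math


def flat_top_target(freq_ghz, centre_ghz, half_width_ghz=2.5):
    """Unit-height flat-top passband of +/- half_width_ghz around the centre."""
    return 1.0 if abs(freq_ghz - centre_ghz) <= half_width_ghz else 0.0


def gaussian_target(freq_ghz, centre_ghz, q_factor):
    """Unit-height Gaussian whose FWHM equals centre_ghz / q_factor."""
    fwhm = centre_ghz / q_factor
    sigma = fwhm / (2.0 * math.sqrt(2.0 * math.log(2.0)))
    return math.exp(-0.5 * ((freq_ghz - centre_ghz) / sigma) ** 2)


# Example: Gaussian target centred at 350 GHz with Q = 10 (FWHM of 35 GHz).
target = [gaussian_target(f, 350.0, 10.0) for f in range(250, 1001)]
```

Under this assumption a Q-factor of 10 at 350 GHz corresponds to a 35 GHz wide passband, considerably broader than the 5 GHz flat-top target, which is why the shape of the target alone does not fix the Q-factor reached after training.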

Another important difference between the designs depicted in Figs. 2b, e lies in the structures of their diffractive layers. A comparison of the 3rd layers shown in Figs. 2b, e reveals that while the former design demonstrates a pattern at its 3rd layer that is intuitively similar to a diffractive lens, the thickness profile of the latter design (Fig. 2e) does not evoke any physically intuitive explanation of its immediate function within the diffractive network; the same conclusion is also evident if one examines the 1st diffractive layers reported in Fig. 2e as well as in Figs. 3 and 4. Convergence to physically non-intuitive designs, such as in these figures, in the absence of a tailored initial condition or prior design shows the power of our diffractive computational framework in the context of broadband, task-specific optical system design.

**Fig. 3: Dual-passband spectral filter design using a broadband diffractive neural network and its experimental validation.**

**Fig. 4: Broadband diffractive neural network design for spatially controlled wavelength de-multiplexing and its experimental validation.**

Dual-passband spectral filter design and testing

Having presented the design and experimental validation of five different bandpass filters using broadband diffractive neural networks, we next used the same design framework for a more challenging task: a dual-passband spectral filter that directs two separate frequency bands onto the same output aperture while rejecting the remaining spectral content of the broadband input light. The physical layout of the diffractive network design is the same as before, being composed of three diffractive layers and an output aperture plane. The goal of this diffractive optical network is to produce a power spectrum at the same aperture that is the superposition of two flat-top passband filters around the centre frequencies of 250 and 450 GHz (see Fig. 3). Following the deep-learning-based design and 3D fabrication of the resulting diffractive network model, our experimental measurement results (dashed blue line in Fig. 3a) provide very good agreement with the numerical results (red line in Fig. 3a); the numerical diffractive model has peak frequencies at 249.4 and 446.4 GHz, which closely agree with our experimentally observed peak frequencies, i.e., 253.6 and 443.8 GHz, for the two target bands.

Despite the fact that we did not impose any restrictions or loss terms related to the Q-factor during our training phase, the power efficiencies of the two peak frequencies were calculated as 11.91 and 10.51%. These numbers indicate a power efficiency drop compared to the single-passband diffractive designs reported earlier (Fig. 2); however, we should note that the total power transmitted from the input plane to the output aperture (which has the same size as before) is maintained at approximately 20% in both the single-passband and the double-passband filter designs.

A projection of the intensity distributions produced by our 3-layer design on the xz plane (at y = 0) is also illustrated in Fig. 3b, which exemplifies the operation principles of our diffractive network regarding the rejection of the spectral components residing between the two targeted passbands. For example, one of the undesired frequency components at 350 GHz is focused onto a location between the 3rd layer and the output aperture, with a higher numerical aperture (NA) compared to the waves in the target bands. As a result, this frequency quickly diverges as it propagates until reaching the output plane; hence, its contribution to the transmitted power beyond the aperture is significantly decreased, as desired. In general, the diffractive layers of a broadband neural network define a tuneable 3D space that can be optimized to approximate different sets of wavelength-dependent grating-like structures that couple the input broadband light into different modes of radiation that are engineered depending on the target function in space and/or spectrum (see, e.g., Supplementary Figure S3).

From the spectrum reported in Fig. 3a, it can also be observed that there is a difference between the Q-factors of the two passbands. The main factor causing this variation in the Q-factor is the increasing material loss at higher frequencies (Supplementary Figure S1), which is a limitation due to our 3D printing material. If one selects the power efficiency as the main design priority in a broadband diffractive network, the optimization of a larger Q-factor optical filter function is relatively more cumbersome for higher frequencies due to the higher material absorption that we experience in the physically fabricated, 3D-printed system. As a general rule, maintaining both the power efficiencies and the Q-factors over K bands in a multi-band filter design requires optimizing the relative contributions of the training loss function sub-terms associated with each design criterion (refer to the Methods section for details); this balance among the sub-constituents of the loss function should be carefully engineered during the training phase of a broadband diffractive network depending on the specific task of interest.
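One simple way to picture the balance between sub-terms is a weighted sum over the K bands, sketched below. This is a hypothetical form chosen for illustration and is not the loss function of this work (which is defined in the Methods section): the per-band power term is taken as one minus the power efficiency, the shape term is any non-negative measure of deviation from the target spectral profile, and the weights `alpha` and `beta` are free design choices.

```python
def multiband_loss(power_efficiencies, shape_errors, alpha, beta):
    """Weighted sum of per-band power and spectral-shape penalties.

    power_efficiencies : efficiency of each band, in the range [0, 1]
    shape_errors       : non-negative shape deviation of each band
    alpha, beta        : per-band weights for the two kinds of penalty
    """
    k = len(power_efficiencies)
    if not (len(shape_errors) == len(alpha) == len(beta) == k):
        raise ValueError("all inputs must have one entry per band")
    power_term = sum(a * (1.0 - eta) for a, eta in zip(alpha, power_efficiencies))
    shape_term = sum(b * err for b, err in zip(beta, shape_errors))
    return power_term + shape_term


# Example: two bands, with the shape penalty weighted more heavily on the
# higher-frequency band (all numbers are placeholders).
loss = multiband_loss([0.12, 0.11], [0.05, 0.08], alpha=[1.0, 1.0], beta=[2.0, 4.0])
```

Raising `beta` for a band pushes the optimization towards matching its target profile at the cost of efficiency, mirroring the Q-factor versus power efficiency trade-off described in the text.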

Spatially controlled wavelength de-multiplexing

Next, we focused on the simultaneous control of the spatial and spectral content of the diffracted light at the output plane of a broadband diffractive optical network and demonstrated its utility for spatially controlled wavelength de-multiplexing by training three diffractive layers (Fig. 4b) that channel the broadband input light onto four separate output apertures on the same plane, corresponding to four passbands centred at 300, 350, 400 and 450 GHz (Fig. 4a). The numerically designed spectral profiles based on our diffractive optical network model (red) and its experimental validation (dashed blue), following the 3D printing of the trained model, are reported in Fig. 4c for each sub-band, providing once again a very good match between our numerical model and the experimental results. Based on Fig. 4c, the numerically estimated and experimentally measured peak frequency locations are (297.5, 348.0, 398.5 and 450.0 GHz) and (303.5, 350.1, 405.1 and 454.8 GHz), respectively. The corresponding Q-factors calculated based on our simulations (11.90, 10.88, 9.84, and 8.04) are also in accordance with their experimental counterparts (11.0, 12.7, 9.19, and 8.68), despite various sources of experimental errors, as detailed in our Discussion section. Similar to our earlier observations in the dual-passband filter results, higher bands exhibit a relatively lower Q-factor that is related to the increased material losses at higher frequencies (Supplementary Figure S1). Since this task represents a more challenging optimization problem involving four different detector locations corresponding to four different passbands, the power efficiency values also exhibit a relative compromise compared to earlier designs, yielding 6.99, 7.43, 5.14 and 5.30% for the corresponding peak wavelengths of each passband. To further highlight the challenging nature of spatially controlled wavelength de-multiplexing, Supplementary Figure S4 reports that the same task cannot be successfully achieved using only two learnable diffractive layers, which demonstrates the advantage of additional layers in a diffractive optical network to perform more sophisticated tasks through deep-learning-based optimization.

In addition to the material absorption losses, there are two other factors that need to be considered for wavelength multiplexing- or de-multiplexing-related applications using diffractive neural networks. First, the lateral resolution of the fabrication method that is selected to manufacture a broadband diffractive network might be a limiting factor at higher frequencies; for example, the lateral resolution of our 3D printer dictates a feature size of ~λ/2 at 300 GHz that restricts the diffraction cone of the propagating waves at higher frequencies. Second, the limited axial resolution of a 3D fabrication method might impose a limitation on the thickness levels of the neurons of a diffractive layer design; for example, using our 3D printer, the associated modulation functions of individual neurons are quantized with a step size of 0.0625 mm, which provides 4 bits (within a range of 1 mm) in terms of the dynamic range, which is sufficient over a wide range of frequencies. With increasing frequencies, however, the same axial step size will limit the resolution of the phase modulation steps available per diffractive layer, partially hindering the associated performance and the generalization capability of the diffractive optical network. Nevertheless, with dispersion engineering methods (using, e.g., metamaterials) and/or higher-resolution 3D fabrication technologies, including, e.g., optical lithography or two-photon polymerization-based 3D printing, multi-layer wavelength multiplexing/de-multiplexing systems operating at various parts of the electromagnetic spectrum can be designed and tested using broadband diffractive optical neural networks.
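The arithmetic behind the axial quantization argument can be checked with a few lines. The sketch below computes the number of thickness levels for a 0.0625 mm step over a 1 mm range and the phase increment that one such step represents at a given frequency. The function names are hypothetical and the refractive index used in the example (n = 1.7) is a placeholder, not the measured index of the printing material.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s


def thickness_levels(step_m, range_m):
    """Number of distinct thickness levels and the equivalent bit depth."""
    levels = round(range_m / step_m)
    return levels, math.log2(levels)


def phase_step_rad(step_m, freq_hz, n_real, n_air=1.0):
    """Phase change (relative to air) produced by one axial thickness step."""
    return 2.0 * math.pi * (n_real - n_air) * step_m * freq_hz / C


levels, bits = thickness_levels(0.0625e-3, 1e-3)   # 16 levels, 4 bits
step_300 = phase_step_rad(0.0625e-3, 300e9, 1.7)   # placeholder index n = 1.7
step_450 = phase_step_rad(0.0625e-3, 450e9, 1.7)
```

Since the phase increment per step scales linearly with frequency, the same printer step is 1.5 times coarser in phase at 450 GHz than at 300 GHz, which is the effect described above.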

Discussion

There are several factors that might have contributed to the relatively minor discrepancies observed between our numerical simulations and the experimental results reported. First, any mechanical misalignment (lateral and/or axial) between the diffractive layers due to, e.g., our 3D printer's resolution can cause some deviation from the expected output. In addition, the THz pulse incident on the input plane is assumed to be spatially uniform, propagating parallel to the optical axis, which might introduce additional experimental errors in our results due to the imperfect beam profile and alignment with respect to the optical axis. Moreover, the wavelength-dependent properties of our THz detector, such as the acceptance angle and the coupling efficiency, are not modelled as part of our forward model, which might also introduce error. Finally, potential inaccuracies in the characterization of the dispersion of the 3D-printing materials used in our experiments could also contribute some error in our measurements compared to our trained model numerical results.

For all the designs presented in this manuscript, the width of each output aperture is selected as 2 mm, which is approximately 2.35 times the largest wavelength (corresponding to f_min = 0.25 THz) targeted in our design. The reason behind this specific design choice is to mitigate some of the unknown effects of the Si lens attached in front of our THz detector, since the theoretical wave optics model of this lens is not available. Consequently, for some of our single-passband spectral filter designs (Fig. 2a–d), the last layer before the output aperture intuitively resembles a diffractive lens. However, unlike a standard diffractive lens, our diffractive neural network, which is composed of multiple layers, can provide a targeted Q-factor even for a large range of output apertures, as illustrated in Supplementary Figure S5.

It is interesting to note that our diffractive single-passband filter designs reported in Fig. 2 can be tuned by changing the distance between the diffractive neural network and the detector/output plane (see Fig. 1c), establishing a simple passband tunability method for a given fabricated diffractive network. Figure 5a reports our simulations and experimental results at five different axial distances using our 350 GHz diffractive network design, where Δ_Z denotes the axial displacement around the ideal, designed location of the output plane. As the aperture gets closer to the final diffractive layer, the passband experiences a redshift (centre frequency decreases), and any change in the opposite direction causes a blueshift (centre frequency increases). However, deviations from the ideal position of the output aperture also decrease the resulting Q-factor (see Fig. 5b); this is expected since these distances with different Δ_Z values were not considered as part of the optical system design during the network training phase. Interestingly, a given diffractive spectral filter model can be used as the initial condition of a new diffractive network design and be further trained with multiple loss terms around the corresponding frequency bands at different propagation distances from the last diffractive layer to yield a better-engineered tuneable frequency response that is improved from that of the original diffractive design. To demonstrate the efficacy of this approach, Figs. 5c, d report the output power spectra of this new model (centred at 350 GHz) and the associated Q-factors, respectively. As desired, the resulting Q-factors are now enhanced and more uniform across the targeted Δ_Z range due to the additional training with a band tunability constraint, which can be regarded as the counterpart of the transfer learning technique (frequently used in machine learning) within the context of optical system design using diffractive neural network models. Supplementary Figure S6 also reports the differences in the thickness distributions of the diffractive layers of these two designs, i.e., before and after the transfer learning, corresponding to Fig. 5a–d, respectively.

**Fig. 5: Tunability of broadband diffractive networks.**

In conclusion, the presented results of this manuscript indicate that the D²NN framework can be generalized to broadband sources and process optical waves over a continuous, wide range of frequencies. Furthermore, the computational capacity of diffractive deep neural networks performing machine learning tasks, e.g., object recognition or classification^27,30,31, can potentially be increased significantly through multi-wavelength operation enabled by the broadband diffractive network framework presented in this manuscript, under the assumption that the available fabrication technology can provide adequate resolution, especially for shorter wavelengths of the desired band of operation. The design framework described in this manuscript is not limited to THz wavelengths and can be applied to other parts of the electromagnetic spectrum, including the visible band, and therefore, it represents a vital progress towards expanding the application space of diffractive optical neural networks for scenarios where broadband operation is more attractive and essential. Finally, we anticipate that the presented framework can be further strengthened using metasurfaces^{49,50,57,58,59,60} that engineer and encode the dispersion of the fabrication materials in unique ways.

Materials and methods

Terahertz TDS system

A Ti:sapphire laser (Coherent MIRA-HP) is used in mode-locked operation to generate femtosecond optical pulses at a wavelength of 780 nm. Each optical pulse is split into two beams. One part of the beam illuminates the THz emitter, a high-power plasmonic photoconductive nano-antenna array⁶¹. The THz pulse generated by the THz emitter is collimated and guided through an off-axis parabolic mirror to a THz detector, which is another plasmonic nano-antenna array that offers high-sensitivity and broadband operation⁵⁶. The other part of the optical beam passes through an optical delay line and illuminates the THz detector. The generated signal as a function of the delay line position and incident THz/optical fields is amplified with a current pre-amplifier (Femto DHPCA-100) and detected with a lock-in amplifier (Zurich Instruments MFLI). For each measurement, traces are collected for 5 s, and 10 pulses are averaged to obtain the time-domain signal. Overall, the system offers signal-to-noise ratio levels over 90 dB and observable bandwidths up to 5 THz. Each time-domain signal is acquired within a time window of 400 ps.
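As a rough consistency check on the 400 ps acquisition window, the spacing between frequency bins can be estimated as the reciprocal of the window length. This sketch assumes that the spectrum is obtained by a discrete Fourier transform of the full trace without zero-padding, which is an assumption because the processing steps are not detailed here; the function name is illustrative.

```python
def frequency_resolution_hz(window_s):
    """Bin spacing of a discrete Fourier transform over a window of length window_s."""
    return 1.0 / window_s

# 400 ps window from the text; the 0.25-1 THz band is the design band of the networks.
resolution_hz = frequency_resolution_hz(400e-12)        # about 2.5 GHz
bins_in_band = (1.0e12 - 0.25e12) / resolution_hz       # about 300 bins
```

Under this assumption the measured spectra are sampled far more coarsely than the frequency grid used during training, so narrow spectral features can only be resolved down to a few GHz.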

Each diffractive neural network model, after its 3D printing, was positioned between the emitter and the detector, coaxial with the THz beam, as shown in Fig. 1d, e. With a limited input beam size, the first layer of each diffractive network was designed with a 1 × 1 cm input aperture (as shown in, e.g., Fig. 1b). After their training, all the diffractive neural networks were fabricated using a commercial 3D printer (Objet30 Pro, Stratasys Ltd.). The apertures at the input and output planes were also 3D-printed and coated with aluminium (Figs. 1a and 4a).

Without loss of generality, a flat input spectrum was assumed during the training of our diffractive networks. Since the power spectrum of the incident THz pulse at the input plane is not flat, we measured its spectrum with only the input aperture present in the optical path (i.e., without any diffractive layers and output apertures). Based on this reference spectrum measurement of the input pulse, all the experimentally measured spectra generated by our 3D-printed network models were normalized; accordingly, Figs. 2–5 reflect the input-normalized power spectrum produced by the corresponding 3D-printed network model.
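The normalization step can be illustrated with a short sketch, assuming that the measured and reference power spectra share the same frequency bins and that the normalization is a bin-by-bin division. The function name, the small `floor` guard against division by zero and the numerical values are hypothetical.

```python
def normalize_spectrum(measured, reference, floor=1e-12):
    """Divide a measured power spectrum by the reference (input) spectrum, bin by bin."""
    if len(measured) != len(reference):
        raise ValueError("spectra must be sampled on the same frequency bins")
    return [m / max(r, floor) for m, r in zip(measured, reference)]

# Hypothetical power values (arbitrary units) on four shared frequency bins.
reference = [0.5, 2.0, 4.0, 1.0]    # input aperture only
measured = [0.05, 1.0, 0.4, 0.01]   # with the diffractive network in place
normalized = normalize_spectrum(measured, reference)
```

Dividing by the reference removes the non-flat shape of the incident pulse, so the result can be compared with simulations that assume a flat input spectrum.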

Forward propagation model

The broadband diffractive optical neural network framework performs optical computation through diffractive layers connected by free space propagation in air. We model the diffractive layers as thin modulation elements, where each pixel on the lth layer at a spatial location (x_i, y_i, z_i) provides a wavelength (λ) dependent modulation, t,

$$\begin{array}{*{20}{c}} {t^l\left( {x_i,y_i,z_i,\lambda } \right) = a^l\left( {x_i,y_i,z_i,\lambda } \right)\exp \left( {j\phi ^l\left( {x_i,y_i,z_i,\lambda } \right)} \right)} \end{array}$$

(1)

where a and φ denote the amplitude and phase, respectively.

Between the layers, free space light propagation is calculated following the Rayleigh-Sommerfeld equation^27,30. The ith pixel on the lth layer at location (x_i, y_i, z_i) can be viewed as the source of a secondary wave $w_i^l\left( {x,y,z,\lambda } \right)$, which is given by

$$\begin{array}{*{20}{c}} {w_i^l\left( {x,y,z,\lambda } \right) = \frac{{z - z_i}}{{r^2}}\left( {\frac{1}{{2\pi r}} + \frac{1}{{j\lambda }}} \right)\exp \left( {\frac{{j2\pi r}}{\lambda }} \right)} \end{array}$$

(2)

where $r = \sqrt {\left( {x - x_i} \right)^2 + \left( {y - y_i} \right)^2 + \left( {z - z_i} \right)^2}$ and $j = \sqrt { - 1}$. Treating the incident field as the 0th layer, the modulated optical field u^l by the lth layer at location (x_i, y_i, z_i) is given by

$$\begin{array}{*{20}{c}} \begin{array}{l}u^l\left( {x_i,y_i,z_i,\lambda } \right) = t^l\left( {x_i,y_i,z_i,\lambda } \right) \cdot \mathop {\sum }\limits_{k \in I} u^{l - 1}\left( {x_k,\,y_k,z_k,\lambda } \right) \cdot \\ w_k^{l - 1}\left( {x_i,y_i,z_i,\lambda } \right),\,l \ge 1\end{array} \end{array}$$

(3)

where I denotes all pixels on the previous layer.
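Equations (2) and (3) can be evaluated directly for a small number of pixels. The sketch below is a minimal, unoptimized illustration in plain Python: it applies Eq. (2) between every pair of pixels and then Eq. (3) for one layer. The 3 × 3 grid, the 30 mm layer spacing, the unit transmission coefficients and all function names are assumptions chosen for illustration; a practical simulation would use a much larger, oversampled grid and a faster propagation routine.

```python
import cmath
import math

def secondary_wave(src, dst, wavelength):
    """Eq. (2): secondary wave from a pixel at `src`, evaluated at `dst`."""
    dx, dy, dz = dst[0] - src[0], dst[1] - src[1], dst[2] - src[2]
    r = math.sqrt(dx * dx + dy * dy + dz * dz)
    envelope = (dz / r**2) * (1.0 / (2.0 * math.pi * r) + 1.0 / (1j * wavelength))
    return envelope * cmath.exp(2j * math.pi * r / wavelength)

def modulated_field(prev_field, prev_coords, coords, transmission, wavelength):
    """Eq. (3): sum the secondary waves of the previous layer, then apply t^l."""
    field = []
    for dst, t in zip(coords, transmission):
        incident = sum(u * secondary_wave(src, dst, wavelength)
                       for u, src in zip(prev_field, prev_coords))
        field.append(t * incident)
    return field

def square_grid(n, pitch, z):
    """Pixel centres of an n x n layer centred on the optical axis (units: mm)."""
    offset = (n - 1) * pitch / 2.0
    return [(ix * pitch - offset, iy * pitch - offset, z)
            for iy in range(n) for ix in range(n)]

wavelength_mm = 0.857                  # roughly 350 GHz in air
layer0 = square_grid(3, 0.5, 0.0)      # incident field, treated as the 0th layer
layer1 = square_grid(3, 0.5, 30.0)     # hypothetical 30 mm spacing
u0 = [1.0 + 0.0j] * 9                  # unit-amplitude plane wave
t1 = [1.0 + 0.0j] * 9                  # placeholder transmission coefficients
u1 = modulated_field(u0, layer0, layer1, t1, wavelength_mm)
```

For a broadband design, the same computation is repeated for each wavelength in the training batch, with the transmission coefficients recomputed per wavelength.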

Digital implementation

Without loss of generality, a flat input spectrum was used during the training phase, i.e., for each distinct λ value, a plane wave with unit intensity and a uniform phase profile was assumed. The assumed frequency range at the input plane was taken as 0.25–1 THz for all the designs, and this range was uniformly partitioned into M = 7500 discrete frequencies. A square input aperture with a width of 1 cm was chosen to match the beam width of the incident THz pulse.
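The frequency partition can be sketched as follows. This is a minimal illustration that assumes both band edges are included in the grid and that the speed of light in air is approximated by its vacuum value; neither detail is stated in the text, and the names are illustrative.

```python
C_M_PER_S = 299_792_458.0  # vacuum speed of light, used here as an approximation for air

def frequency_grid(f_min_hz, f_max_hz, count):
    """Uniformly spaced frequencies including both band edges (an assumption)."""
    step = (f_max_hz - f_min_hz) / (count - 1)
    return [f_min_hz + k * step for k in range(count)]

freqs_hz = frequency_grid(0.25e12, 1.0e12, 7500)
wavelengths_mm = [1e3 * C_M_PER_S / f for f in freqs_hz]
spacing_hz = freqs_hz[1] - freqs_hz[0]   # about 100 MHz between neighbouring frequencies
```

The corresponding wavelengths run from about 1.2 mm at 0.25 THz down to about 0.3 mm at 1 THz.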

Restricted by our fabrication method, a pixel size of 0.5 mm was used as the smallest printable feature size. To accurately model the wave propagation over a wide range of frequencies based on the Rayleigh–Sommerfeld diffraction integral, the simulation window was oversampled four times with respect to the smallest feature size, i.e., the space was sampled with 0.125 mm steps. Accordingly, each feature of the diffractive layers of a given network design was represented on a 4 × 4 grid, all 16 elements sharing the same physical thickness. The printed thickness value, h, is the superposition of two parts, h_m and h_base, as depicted in Eq. (4b). h_m denotes the part where the wave modulation takes place and is confined between h_min = 0 and h_max = 1 mm. The second term, h_base = 0.5 mm, is a constant, non-trainable thickness value that ensures robust 3D printing, helping with the stiffness of the diffractive layers. To achieve the constraint applied to h_m, we defined the thickness of each diffractive feature over an associated latent (trainable) variable, h_p, using the following analytical form:

$$h_m = \left( {\sin \left( {h_p} \right) + 1} \right) \times \frac{{h_{{\rm{max}}}}}{2}$$

(4a)

$$h = q\left( {h_m} \right) + h_{{\rm{base}}}$$

(4b)

where q(.) denotes a 16-level uniform quantization (0.0625 mm for each level, with h_max = 1 mm).
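The mapping from the latent variable to the printed thickness, together with the 4 × 4 oversampling of each feature, can be sketched as follows. The rounding rule of q(.) is not specified in the text; the sketch assumes that each value is assigned to the lower edge of one of 16 equal bins, so the quantized h_m ranges from 0 to 0.9375 mm. All function names are illustrative.

```python
import math

H_MAX_MM = 1.0      # upper bound of the modulation thickness h_m
H_BASE_MM = 0.5     # constant, non-trainable base thickness
LEVELS = 16         # number of quantization levels
OVERSAMPLING = 4    # simulation samples per printed feature along each axis

def modulation_thickness(h_p):
    """Eq. (4a): map an unbounded latent variable to the range [0, h_max]."""
    return (math.sin(h_p) + 1.0) * H_MAX_MM / 2.0

def quantize(h_m):
    """16-level uniform quantizer; lower-bin-edge rounding is an assumption."""
    step = H_MAX_MM / LEVELS                    # 0.0625 mm
    level = min(int(h_m / step), LEVELS - 1)
    return level * step

def printed_thickness(h_p):
    """Eq. (4b): quantized modulation thickness plus the base thickness."""
    return quantize(modulation_thickness(h_p)) + H_BASE_MM

def upsample(features, factor=OVERSAMPLING):
    """Repeat every feature on a factor x factor block of the simulation grid."""
    grid = []
    for row in features:
        wide = [value for value in row for _ in range(factor)]
        grid.extend([list(wide) for _ in range(factor)])
    return grid

latent = [[0.0, math.pi / 2.0], [-math.pi / 2.0, 0.3]]   # hypothetical h_p values
thickness = [[printed_thickness(h_p) for h_p in row] for row in latent]
simulation_grid = upsample(thickness)                     # 8 x 8 samples at 0.125 mm
```

A latent value of 0 gives h_m = 0.5 mm, i.e., the middle of the allowed range, and a printed thickness of 1 mm including the base.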

The amplitude and phase components of the ith neuron on layer l, i.e., a^l(x_i, y_i, z_i, λ) and φ^l(x_i, y_i, z_i, λ) in Eq. (1), can be defined as a function of the thickness of each individual neuron, h_i, and the incident wavelength as follows:

$$\begin{array}{*{20}{c}} {a^l\left( {x_i,y_i,z_i,\lambda } \right) = \exp \left( { - \frac{{2\pi \kappa \left( \lambda \right)h_i}}{\lambda }} \right)} \end{array}$$

(5)

$$\begin{array}{*{20}{c}} {\phi ^l\left( {x_i,y_i,z_i,\lambda } \right) = \left( {n\left( \lambda \right) - n_{air}} \right)\frac{{2\pi h_i}}{\lambda }} \end{array}$$

(6)

The wavelength-dependent parameters, n(λ) and the extinction coefficient κ(λ), are defined over the real and imaginary parts of the refractive index, $\tilde n\left( \lambda \right) = n\left( \lambda \right) + j\kappa \left( \lambda \right)$, characterized by the dispersion analysis performed over a broad range of frequencies (Supplementary Figure S1).
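Equations (1), (5) and (6) combine into a single complex transmission coefficient per neuron and wavelength. The sketch below is a minimal illustration; the refractive index and extinction coefficient used in the example are placeholder values, not the measured dispersion data of Supplementary Figure S1, and the function name is hypothetical.

```python
import cmath
import math

def layer_transmission(h, wavelength, n, kappa, n_air=1.0):
    """Complex transmission of one neuron, following Eqs. (1), (5) and (6).

    h and wavelength must share the same length unit; n and kappa are the real
    and imaginary parts of the refractive index at this wavelength.
    """
    amplitude = math.exp(-2.0 * math.pi * kappa * h / wavelength)   # Eq. (5)
    phase = (n - n_air) * 2.0 * math.pi * h / wavelength            # Eq. (6)
    return amplitude * cmath.exp(1j * phase)                        # Eq. (1)

# Placeholder material values (assumptions), thickness and wavelength in mm.
t = layer_transmission(h=0.5, wavelength=1.0, n=1.5, kappa=0.02)
```

With these placeholder numbers the phase delay relative to air is a quarter wave (π/2), and the absorption reduces the field amplitude by about 6%.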

Loss function and training-related details

After light propagation through the layers of a diffractive network, a 2 mm wide output aperture was used at the output plane, right before the integrated detector lens, which is made of Si and has the shape of a hemisphere with a radius of 0.5 cm. In our simulations, we modelled the detector lens as an achromatic flat Si slab with a refractive index of 3.4 and a thickness of 0.5 cm. After propagating through this Si slab, the light intensity residing within a designated detector active area was integrated and denoted by I_out. The power efficiency was defined by

$$\begin{array}{*{20}{c}} {\eta = \frac{{I_{{\rm{out}}}}}{{I_{{\rm{in}}}}}} \end{array}$$

(7)

where I_in denotes the power of the incident light within the input aperture of the diffractive network. For each diffractive network model, the reported power efficiency reflects the result of Eq. (7) for the peak wavelength of a given passband.

The loss term, L, used for single-passband filter designs was devised to achieve a balance between the power efficiency and the Q-factor, defined as

$$\begin{array}{*{20}{c}} {L = \alpha L_p + \beta L_Q} \end{array}$$

(8)

where L_p denotes the power loss and L_Q denotes the Q-factor loss term; α and β are the relative weighting factors for these two loss terms, which were calculated using the following equations:

$$\begin{array}{*{20}{c}} {L_p = \mathop {\sum }\limits_{\omega \in B} {\rm{rect}}\left( {\frac{{\omega - \omega _0}}{{{\mathrm{\Delta }}\omega _P}}} \right) \times \left( {I_{{\rm{in}}} - I_{{\rm{out}}}} \right)} \end{array}$$

(9a)

$$\begin{array}{*{20}{c}} {L_Q = \mathop {\sum }\limits_{\omega \in B} \left( {1 - {\rm{rect}}\left( {\frac{{\omega - \omega _0}}{{{\mathrm{\Delta }}\omega _Q}}} \right)} \right) \times I_{{\rm{out}}}} \end{array}$$

(9b)

with B, ω₀ and Δω_P denoting the number of frequencies used in a training batch, the centre frequency of the target passband and the associated bandwidth around the centre frequency, respectively. The rect(ω) function is defined as

$$\begin{array}{*{20}{c}} {{\rm{rect}}\left( \omega \right) = \left\{ {\begin{array}{*{20}{c}} {1,\,\left| \omega \right| \le \frac{1}{2}} \\ {0,\,\left| \omega \right| \, > \ \frac{1}{2}} \end{array}} \right.} \end{array}$$

(10)
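The loss terms of Eqs. (8), (9a) and (9b) can be written compactly once the input and output powers are known for every frequency in a batch. The following is a minimal sketch in plain Python rather than the authors' TensorFlow implementation; the batch values, the bandwidths and the weights are hypothetical and serve only to show how the rect windows select the in-band and out-of-band frequencies.

```python
def rect(x):
    """Eq. (10): 1 inside [-1/2, 1/2], 0 outside."""
    return 1.0 if abs(x) <= 0.5 else 0.0

def power_loss(freqs, i_in, i_out, f0, bw_p):
    """Eq. (9a): power not delivered to the detector inside the passband."""
    return sum(rect((f - f0) / bw_p) * (p_in - p_out)
               for f, p_in, p_out in zip(freqs, i_in, i_out))

def q_factor_loss(freqs, i_out, f0, bw_q):
    """Eq. (9b): output power that leaks outside the window of width bw_q."""
    return sum((1.0 - rect((f - f0) / bw_q)) * p_out
               for f, p_out in zip(freqs, i_out))

def total_loss(freqs, i_in, i_out, f0, bw_p, bw_q, alpha, beta):
    """Eq. (8): weighted sum of the two terms."""
    return (alpha * power_loss(freqs, i_in, i_out, f0, bw_p)
            + beta * q_factor_loss(freqs, i_out, f0, bw_q))

# Hypothetical batch (frequencies in GHz, powers in arbitrary units).
batch_ghz = [300.0, 349.0, 350.0, 351.0, 400.0]
i_in = [1.0] * 5
i_out = [0.10, 0.30, 0.40, 0.30, 0.05]
l_p = power_loss(batch_ghz, i_in, i_out, f0=350.0, bw_p=5.0)
l_q = q_factor_loss(batch_ghz, i_out, f0=350.0, bw_q=40.0)
loss = total_loss(batch_ghz, i_in, i_out, 350.0, 5.0, 40.0, alpha=1.0, beta=10.0)
```

In this example only the three frequencies within 2.5 GHz of the centre contribute to the power loss, and only the two frequencies more than 20 GHz away contribute to the Q-factor loss.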

Assuming a power spectrum profile with a Gaussian distribution N(ω₀, σ²) with a full-width-half-maximum (FWHM) bandwidth of Δω, the standard deviation and the associated Δω_Q were defined as

$$\begin{array}{*{20}{c}} {\sigma ^2 = - \frac{{\left( {\frac{{\omega _0}}{Q}} \right)^2}}{{8\log \left( {0.5} \right)}}} \end{array}$$

(11a)

$$\begin{array}{*{20}{c}} {\Delta \omega _Q = 6\sigma } \end{array}$$

(11b)

The Q-factor was defined as

$$\begin{array}{*{20}{c}} {Q = \frac{{\omega _0}}{{{\mathrm{\Delta }}\omega }}} \end{array}$$

(12)
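As a worked example of these definitions, the sketch below converts a target Q-factor into the FWHM bandwidth, the Gaussian standard deviation and the window Δω_Q = 6σ. It relies on the standard Gaussian relation FWHM = 2·sqrt(2 ln 2)·σ, i.e., σ² = −Δω²/(8 log 0.5) with Δω = ω₀/Q. The centre frequency of 350 GHz and Q = 10 are illustrative inputs, and the function name is hypothetical.

```python
import math

def gaussian_band(f0, q):
    """Return (FWHM, sigma, 6*sigma) for a Gaussian passband with Q = f0 / FWHM."""
    fwhm = f0 / q                                           # Eq. (12) rearranged
    sigma = math.sqrt(-(fwhm ** 2) / (8.0 * math.log(0.5)))
    return fwhm, sigma, 6.0 * sigma                         # Eq. (11b)

fwhm_ghz, sigma_ghz, bw_q_ghz = gaussian_band(350.0, 10.0)

# Sanity check: the Gaussian falls to half of its peak at f0 +/- FWHM / 2.
half_value = math.exp(-((fwhm_ghz / 2.0) ** 2) / (2.0 * sigma_ghz ** 2))
```

For these inputs the FWHM is 35 GHz, σ is just under 15 GHz and the 6σ window spans roughly 89 GHz around the centre frequency.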

For the single-passband diffractive spectral filter designs reported in Fig. 2a–d and the dual-passband spectral filter reported in Fig. 3, Δω_P for each band was taken as 5 GHz. For these five diffractive designs, β in Eq. (8) was set to 0 to force the network model to maximize the power efficiency without any restriction or penalty on the Q-factor. For the diffractive spectral filter design illustrated in Fig. 2e, on the other hand, the $\frac{\alpha }{\beta }$ ratio (balancing the power efficiency and Q-factor) was set to 0.1 in Eq. (8).

In the design phase of the spatially controlled wavelength de-multiplexing system (Fig. 4), following the strategy used in the filter design depicted in Fig. 2e, the target spectral profile around each centre frequency was taken as a Gaussian with a Q-factor of 10. For simplicity, the $\frac{\alpha }{\beta }$ ratio in Eq. (8) was set to 0.1 for each band and detector location, i.e., $\frac{{\alpha _1}}{{\beta _1}} = \frac{{\alpha _2}}{{\beta _2}} = \frac{{\alpha _3}}{{\beta _3}} = \frac{{\alpha _4}}{{\beta _4}} = \frac{1}{{10}}$, where the indices refer to the four different apertures at the detector/output plane. Although not implemented in this work, the $\frac{\alpha }{\beta }$ ratios among different bands/channels can also be separately tuned to better compensate for the material losses as a function of the wavelength. In general, to design an optical component that maintains the photon efficiency and Q-factor over K different bands based on our broadband diffractive optical network framework, a set of 2K coefficients, i.e., (α₁, α₂, …, α_K, β₁, β₂, …, β_K), must be tuned according to the material dispersion properties for all the subcomponents of the loss function.

In our training phase, M = 7500 frequencies were randomly sampled in batches of B = 20, which is mainly limited by our GPU memory. The trainable variables, h_p in Eq. (4a), were updated following the standard error backpropagation method using the Adam optimizer⁶² with a learning rate of 1 × 10^−3. The initial conditions of all the trainable parameters were set to 0. For the diffractive network models with more than one detector location reported in this manuscript, the loss values were individually calculated for each detector in random order, and the design parameters were updated thereafter. In other words, for a d-detector optical system, loss calculations and parameter updates were performed d times, once for each detector, in random order.
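The sampling schedule described above can be sketched without any deep-learning library. The example below assumes that the frequencies are drawn without replacement within an epoch, so that each epoch covers all 7500 frequencies exactly once, and that the detector order is reshuffled for every batch. Both details, the function names and the fixed random seed are assumptions; the loss evaluation and the Adam update are only indicated by a comment.

```python
import random

def epoch_batches(num_freqs, batch_size, rng):
    """Shuffle all frequency indices and split them into consecutive batches."""
    indices = list(range(num_freqs))
    rng.shuffle(indices)
    return [indices[i:i + batch_size] for i in range(0, num_freqs, batch_size)]

def detector_order(num_detectors, rng):
    """Random order in which the per-detector losses are applied for one batch."""
    order = list(range(num_detectors))
    rng.shuffle(order)
    return order

rng = random.Random(0)                      # fixed seed for reproducibility
batches = epoch_batches(7500, 20, rng)      # 375 batches per epoch
updates_per_epoch = 0
for batch in batches:
    for detector in detector_order(4, rng):
        # Here one would evaluate the loss of this detector on `batch`
        # and apply one optimizer step to the latent thickness variables.
        updates_per_epoch += 1
```

With four detectors, as in a four-channel de-multiplexer, this schedule performs 4 × 375 = 1500 parameter updates per epoch.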

Our models were simulated using Python (v3.7.3) and TensorFlow (v1.13.0, Google Inc.). All the models were trained using 200 epochs (the network saw all 7500 frequencies at the end of each epoch) with a GeForce GTX 1080 Ti graphical processing unit (GPU, Nvidia Inc.), an Intel® Core™ i9-7900X central processing unit (CPU, Intel Inc.) and 64 GB of RAM, running the Windows 10 operating system (Microsoft). Training of a typical diffractive network model takes ~5 h to complete with 200 epochs. The thickness profile of each diffractive layer was then converted into the .stl file format using MATLAB.

Code availability

The deep-learning models reported in this work used standard libraries and scripts that are publicly available in TensorFlow.

Data availability

All the data and methods needed to evaluate the conclusions of this work are present in the main text and the Supplementary Materials. Additional data can be requested from the corresponding author.

References

1. Russakovsky, O. et al. ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115, 211–252 (2015).
2. LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
3. Collobert, R. & Weston, J. A unified architecture for natural language processing: deep neural networks with multitask learning. In Proc. 25th International Conference on Machine Learning (eds McCallum, A. & Roweis, S.) 160–167 (Helsinki, Finland: ACM, 2008). https://doi.org/10.1145/1390156.1390177.
4. Chen, L. C. et al. DeepLab: semantic image segmentation with deep convolutional nets, Atrous convolution, and fully connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell. 40, 834–848 (2018).
5. Long, J., Shelhamer, E. & Darrell, T. Fully convolutional networks for semantic segmentation. In Proc. 2015 IEEE Conference on Computer Vision and Pattern Recognition 3431–3440 (Boston, MA, USA: IEEE, 2015).
6. Rivenson, Y. et al. Deep learning enhanced mobile-phone microscopy. ACS Photonics 5, 2354–2364, https://doi.org/10.1021/acsphotonics.8b00146 (2018).
7. Rivenson, Y. et al. Deep learning microscopy. Optica 4, 1437–1443 (2017).
8. Nehme, E. et al. Deep-STORM: super-resolution single-molecule microscopy by deep learning. Optica 5, 458–464 (2018).
9. Kim, T., Moon, S. & Xu, K. Information-rich localization microscopy through machine learning. Nat. Commun. 10, 1996 (2019).
10. Ouyang, W. et al. Deep learning massively accelerates super-resolution localization microscopy. Nat. Biotechnol. 36, 460–468 (2018).
11. Rivenson, Y. et al. Phase recovery and holographic image reconstruction using deep learning in neural networks. Light Sci. Appl. 7, 17141 (2018).
12. Rivenson, Y. et al. PhaseStain: the digital staining of label-free quantitative phase microscopy images using deep learning. Light Sci. Appl. 8, 23 (2019).
13. Sinha, A. et al. Lensless computational imaging through deep learning. Optica 4, 1117–1125 (2017).
14. Barbastathis, G., Ozcan, A. & Situ, G. On the use of deep learning for computational imaging. Optica 6, 921–943 (2019).
15. Li, Y. Z., Xue, Y. J. & Tian, L. Deep speckle correlation: a deep learning approach toward scalable imaging through scattering media. Optica 5, 1181–1190 (2018).
16. Rahmani, B. et al. Multimode optical fiber transmission with a deep learning network. Light Sci. Appl. 7, 69 (2018).
17. Rivenson, Y. et al. Virtual histological staining of unlabelled tissue-autofluorescence images via deep learning. Nat. Biomed. Eng. 3, 466–477 (2019).
18. Malkiel, I. et al. Plasmonic nanostructure design and characterization via Deep Learning. Light Sci. Appl. 7, 60 (2018).
19. Liu, D. J. et al. Training deep neural networks for the inverse design of Nanophotonic structures. ACS Photonics 5, 1365–1369 (2018).
20. Peurifoy, J. et al. Nanophotonic particle simulation and inverse design using artificial neural networks. Sci. Adv. 4, eaar4206 (2018).
21. Ma, W., Cheng, F. & Liu, Y. Deep-learning-enabled on-demand design of chiral metamaterials. ACS Nano 12, 6326–6334 (2018).
22. Piggott, A. Y. et al. Inverse design and demonstration of a compact and broadband on-chip wavelength demultiplexer. Nat. Photonics 9, 374–377 (2015).
23. Psaltis, D. et al. Holography in artificial neural networks. Nature 343, 325–330 (1990).
24. Krishnamoorthy, A. V., Yayla, G. & Esener, S. C. Design of a scalable Opto-electronic neural system using free-space optical interconnects. In Proc. IJCNN-91-Seattle International Joint Conference on Neural Networks 527–534 (Seattle, WA, USA: IEEE, 1991).
25. Shen, Y. C. et al. Deep learning with coherent nanophotonic circuits. Nat. Photonics 11, 441–446 (2017).
26. Shastri, B. J. et al. in Unconventional Computing: A Volume in the Encyclopedia of Complexity and Systems Science 2nd edn (ed. Adamatzky, A.) 83–118 (Springer, New York, NY, 2018). https://doi.org/10.1007/978-1-4939-6883-1_702.
27. Lin, X. et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004–1008 (2018).
28. Chang, J. L. et al. Hybrid optical-electronic convolutional neural networks with optimized diffractive optics for image classification. Sci. Rep. 8, 12324 (2018).
29. Estakhri, N. M., Edwards, B. & Engheta, N. Inverse-designed metastructures that solve equations. Science 363, 1333–1338 (2019).
30. Mengu, D. et al. Analysis of diffractive optical neural networks and their integration with electronic neural networks. IEEE J. Sel. Top. Quantum Electron. 26, 1–14 (2020).
31. Li, J. X. et al. Class-specific differential detection in diffractive optical neural networks improves inference accuracy. Adv. Photonics 1, 46001 (2019).
32. O'Shea, D. C. et al. Diffractive Optics: Design, Fabrication, and Test. (SPIE Optical Engineering Press, Bellingham, WA, 2004).
33. Piestun, R. & Shamir, J. Control of wave-front propagation with diffractive elements. Opt. Lett. 19, 771–773 (1994).
34. Abrahamsson, S. et al. Multifocus microscopy with precise color multi-phase diffractive optics applied in functional neuronal imaging. Biomed. Opt. Express 7, 855–869 (2016).
35. Arieli, Y. et al. Design of diffractive optical elements for multiple wavelengths. Appl. Opt. 37, 6174–6177 (1998).
36. Sweeney, D. W. & Sommargren, G. E. Harmonic diffractive lenses. Appl. Opt. 34, 2469–2475 (1995).
37. Faklis, D. & Morris, G. M. Spectral properties of multiorder diffractive lenses. Appl. Opt. 34, 2462–2468 (1995).
38. Sales, T. R. M. & Raguin, D. H. Multiwavelength operation with thin diffractive elements. Appl. Opt. 38, 3012–3018 (1999).
39. Kim, G., Domínguez-Caballero, J. A. & Menon, R. Design and analysis of multi-wavelength diffractive optics. Opt. Express 20, 2814–2823 (2012).
40. Banerji, S. & Sensale-Rodriguez, B. A computational design framework for efficient, fabrication error-tolerant, planar THz diffractive optical elements. Sci. Rep. 9, 5801 (2019).
41. Salo, J. et al. Holograms for shaping radio-wave fields. J. Opt. A: Pure Appl. Opt. 4, S161–S167 (2002).
42. Jacob, Z., Alekseyev, L. V. & Narimanov, E. Optical Hyperlens: far-field imaging beyond the diffraction limit. Opt. Express 14, 8247–8256 (2006).
43. Wang, P., Mohammad, N. & Menon, R. Chromatic-aberration-corrected diffractive lenses for ultra-broadband focusing. Sci. Rep. 6, 21545 (2016).
44. Aieta, F. et al. Multiwavelength achromatic metasurfaces by dispersive phase compensation. Science 347, 1342–1345 (2015).
45. Arbabi, E. et al. Controlling the sign of chromatic dispersion in diffractive optics with dielectric metasurfaces. Optica 4, 625–632 (2017).
46. Wang, Q. et al. A broadband Metasurface-based terahertz flat-lens array. Adv. Opt. Mater. 3, 779–785 (2015).
47. Avayu, O. et al. Composite functional metasurfaces for multispectral achromatic optics. Nat. Commun. 8, 14992 (2017).
48. Lin, Z. et al. Topology-optimized multilayered Metaoptics. Phys. Rev. Appl. 9, 044030 (2018).
49. Wang, S. M. et al. Broadband achromatic optical metasurface devices. Nat. Commun. 8, 187 (2017).
50. Chen, W. T. et al. A broadband achromatic metalens for focusing and imaging in the visible. Nat. Nanotechnol. 13, 220–226 (2018).
51. Wang, S. M. et al. A broadband achromatic metalens in the visible. Nat. Nanotechnol. 13, 227–232 (2018).
52. Campbell, S. D. et al. Review of numerical optimization techniques for meta-device design [Invited]. Opt. Mater. Express 9, 1842–1863 (2019).
53. Karl, N. J. et al. Frequency-division multiplexing in the terahertz range using a leaky-wave antenna. Nat. Photonics 9, 717–720 (2015).
54. Hu, B. B. & Nuss, M. C. Imaging with terahertz waves. Opt. Lett. 20, 1716–1718 (1995).
55. Shen, Y. C. et al. Detection and identification of explosives using terahertz pulsed spectroscopic imaging. Appl. Phys. Lett. 86, 241116 (2005).
56. Yardimci, N. T. & Jarrahi, M. High sensitivity terahertz detection through large-area plasmonic nano-antenna arrays. Sci. Rep. 7, 42667 (2017).
57. Li, Y. & Engheta, N. Capacitor-inspired metamaterial inductors. Phys. Rev. Appl. 10, 054021 (2018).
58. Liberal, I., Li, Y. & Engheta, N. Reconfigurable epsilon-near-zero metasurfaces via photonic doping. Nanophotonics 7, 1117–1127 (2018).
59. Chaudhary, K. et al. Engineering phonon polaritons in van der Waals heterostructures to enhance in-plane optical anisotropy. Sci. Adv. 5, eaau7171 (2019).
60. Yu, N. F. & Capasso, F. Flat optics with designer metasurfaces. Nat. Mater. 13, 139–150 (2014).
61. Yardimci, N. T. et al. High-power terahertz generation using large-area Plasmonic photoconductive emitters. IEEE Trans. Terahertz Sci. Technol. 5, 223–229 (2015).
62. Kingma, D. P. Variational Inference & Deep Learning: A New Synthesis https://hdl.handle.net/11245.1/8e55e07f-e4be-458f-a929-2f9bc2d169e8 (2017).

Acknowledgements

The Ozcan Research Group at UCLA acknowledges the support of Fujikura (Japan).

Author information

These authors contributed equally: Yi Luo, Deniz Mengu

Authors and Affiliations

Electrical and Computer Engineering Department, University of California, 420 Westwood Plaza, Los Angeles, CA, 90095, USA
Yi Luo, Deniz Mengu, Nezih T. Yardimci, Yair Rivenson, Muhammed Veli, Mona Jarrahi & Aydogan Ozcan
Bioengineering Department, University of California, Los Angeles, CA, 90095, USA
Yi Luo, Deniz Mengu, Yair Rivenson, Muhammed Veli & Aydogan Ozcan
California NanoSystems Institute, University of California, Los Angeles, CA, 90095, USA
Yi Luo, Deniz Mengu, Nezih T. Yardimci, Yair Rivenson, Muhammed Veli, Mona Jarrahi & Aydogan Ozcan
Department of Surgery, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
Aydogan Ozcan

Authors

Yi Luo
Deniz Mengu
Nezih T. Yardimci
Yair Rivenson
Muhammed Veli
Mona Jarrahi
Aydogan Ozcan

Contributions

Y.L. performed the design and fabrication of the diffractive systems, and N.T.Y. performed the experimental testing. D.M. provided assistance with the design and experimental testing of the diffractive models. M.V. provided assistance with the fabrication. All the authors participated in the analysis and discussion of the results. Y.L., D.M., Y.R., M.J. and A.O. wrote the manuscript with assistance from all the authors. A.O. initiated and supervised the project.

Corresponding author

Correspondence to Aydogan Ozcan.

Ethics declarations

Conflict of interest

A.O., Y.L., D.M. and Y.R. are co-inventors of a patent application on Broadband Diffractive Neural Networks.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Luo, Y., Mengu, D., Yardimci, N.T. et al. Design of task-specific optical systems using broadband diffractive neural networks. Light Sci Appl 8, 112 (2019). https://doi.org/10.1038/s41377-019-0223-1

Received: 04 October 2019
Revised: 08 November 2019
Accepted: 15 November 2019
Published: 02 December 2019
DOI: https://doi.org/10.1038/s41377-019-0223-1
