Advanced Photonics, Volume. 5, Issue 1, 016003(2023)

Massively parallel universal linear transformations using a wavelength-multiplexed diffractive optical network

Jingxi Li1,2,3, Tianyi Gan1,3, Bijie Bai1,2,3, Yi Luo1,2,3, Mona Jarrahi1,3, and Aydogan Ozcan1,2,3、*
Author Affiliations
  • 1University of California, Electrical and Computer Engineering Department, Los Angeles, California, United States
  • 2University of California, Bioengineering Department, Los Angeles, California, United States
  • 3University of California, California NanoSystems Institute, Los Angeles, California, United States
  • show less
    Figures & Tables(11)
    Schematic of massively parallel, wavelength-multiplexed diffractive optical computing. Optical layout of the wavelength-multiplexed diffractive neural network, where the diffractive layers are jointly trained to perform Nw different arbitrarily selected, complex-valued linear transformations between the input field i and the output field o′ using wavelength multiplexing. The optical fields at the input FOV, i1,i2,…,iNw, are encoded at a predetermined set of distinct wavelengths λ1,λ2,…,λNw, respectively, using a wavelength multiplexing (“MUX”) scheme. At the output FOV of the broadband diffractive network, wavelength demultiplexing (“DEMUX”) is performed to extract the diffractive output fields o1′,o2′,…,oNw′ at the corresponding wavelengths λ1,λ2,…,λNw, respectively, which represent the all-optical estimates of the target output fields o1,o2,…,oNw, corresponding to the target linear transformations (A1,A2,…,ANw). Through this diffractive architecture, Nw different arbitrarily selected complex-valued linear transformations can be all-optically performed at distinct wavelengths, running in parallel channels of the broadband diffractive processor.
    All-optical transformation performances of broadband diffractive networks using different numbers of wavelength channels. (a) As examples, we show the amplitude and phase of the first eight matrices in {A1,A2,…,A32} that were randomly generated, serving as the ground truth (target) for the diffractive all-optical transformations. See Fig. S1 in the Supplementary Material for the cosine similarity values calculated between any two combinations of these 32 target linear transformation matrices. (b) The mean values of the normalized MSE between the ground-truth transformation matrices (Aw) and the corresponding all-optical transforms (Aw′) across different wavelength channels are reported as a function of the number of diffractive neurons N. The results of the diffractive networks using different numbers of wavelength channels (Nw) are encoded with different colors, and the space between the simulation data points is linearly interpolated. Nw ∈ {1, 2, 4, 8, 16, and 32}, N ∈ {3.9k, 8.2k, 16.9k, 32.8k, 64.8k, 131.1k, 265.0k} and Ni=No=82. (c) Same as (b) but the cosine similarity values between the all-optical transforms and their ground truth are reported. (d) Same as (b) but the MSE values between the diffractive network output fields and the ground-truth output fields are reported.
    All-optical transformation performances of the individual wavelength channels in broadband diffractive network designs with N≈2NwNiNo and Ni=No=82. The output field errors (MSEOutput) for the all-optical linear transforms achieved by the wavelength-multiplexed diffractive network models with (a) 2-channel wavelength multiplexing (Nw=2), N≈4NiNo; (b) 4-channel wavelength multiplexing (Nw=4), N≈8NiNo; (c) 8-channel wavelength multiplexing (Nw=8), N≈16NiNo; (d) 16-channel wavelength multiplexing (Nw=16), N≈32NiNo; and (e) 32-channel wavelength multiplexing (Nw=32), N≈64NiNo. The standard deviations (error bars) of these metrics are calculated across the entire testing data set.
    All-optical transformation matrices estimated by two different wavelength-multiplexed broadband diffractive networks with Nw=8 and Ni=No=82. The first broadband diffractive network has N≈2NwNiNo=16NiNo=64,800 trainable diffractive neurons. The second broadband diffractive network has N≈4NwNiNo=32NiNo=131,100 trainable diffractive neurons. The absolute differences between these all-optical transformation matrices and the corresponding ground-truth (target) matrices are also shown in each case. N=131,100 diffractive design achieves a much smaller and negligible absolute error due to the increased degrees of freedom.
    Examples of the input/output complex fields for the ground-truth (target) transformations along with the all-optical output fields resulting from the 8-channel wavelength-multiplexed diffractive design using N≈4NwNiNo=32NiNo=131,100. Absolute errors between the ground-truth output fields and the all-optical diffractive network output fields are negligible. Note that |∠o−∠o′^|π indicates the wrapped phase difference between the ground-truth output field o and the normalized diffractive network output field o′^.
    Exploration of the limits of the number of wavelength channels (Nw) that can be implemented in a broadband diffractive network. (a) The mean values of the normalized MSE between the ground-truth transformation matrices (Aw) and the all-optical transforms (Aw′) across different wavelength channels are reported as a function of Nw∈{1,2,4,8,16,32,64,128,184}. The results of the broadband diffractive networks using different numbers of diffractive neurons (N) are presented with different colors: N∈{1.5NwNiNo,2NwNiNo,3NwNiNo}. Dotted lines are fitted based on the data points whose diffractive networks share the same N. (b) Same as (a) but the cosine similarity values between the all-optical transforms and their ground truth are reported. (c) Same as (a) but the MSE values between the diffractive network output fields and the ground-truth output fields are reported. Ni=No=52.
    The impact of material dispersion and losses on the all-optical transformation performance of wavelength-multiplexed broadband diffractive networks. (a) The mean values of the normalized MSE between the ground-truth transformation matrices (Aw) and the all-optical transforms (Aw′) across different wavelength channels are reported as a function of the material of the diffractive layers. The results of the diffractive networks trained with and without diffraction efficiency penalty are presented in yellow and purple colors, respectively. Nw=128, N=3NwNiNo, and Ni=No=52. (b) Same as (a) but the cosine similarity values between the all-optical transforms and their ground truth are reported. (c) Same as (a) but the MSE values between the diffractive network output fields and the ground-truth fields are reported. (d) The mean diffraction efficiencies of the presented diffractive models across all the wavelength channels. (e) Diffraction efficiency of the individual wavelength channels for the broadband diffractive network model presented in (a)–(d) that uses the dielectric material without the diffraction efficiency-related penalty term in its loss function. (f) Same as (e), but the diffractive network was trained using a loss function with the diffraction efficiency-related penalty term.
    All-optical transformation performance of broadband diffractive network designs with Nw=184, reported as a function of N and the bit depth of the diffractive neurons. (a) The mean values of normalized MSE between the ground-truth transformation matrices (Aw) and the all-optical transforms (Aw′) across different wavelength channels are reported as a function of N. The results of the diffractive networks using different bit depths of the diffractive neurons, including 4, 8, 12, and 32, are encoded with different colors, and the space between the data points is linearly interpolated. N∈{0.5NwNiNo=56,000,NwNiNo=115.000,2NwNiNo=231,000,4NwNiNo=461,000}, and Ni=No=52. (b) Same as (a) but the cosine similarity values between the all-optical transforms and their ground truth are reported. (c) Same as (a) but the MSE values between the diffractive network output fields and the ground-truth output fields are reported.
    The impact of the encoding wavelength error on the all-optical linear transformation performance of a wavelength-multiplexed broadband diffractive network; Nw=4, N≈2NwNiNo=8NiNo, and Ni=No=82. (a) The normalized MSE values between the ground-truth transformation matrices (Aw) and the all-optical transforms (Aw′) for the four different wavelength channels are reported as a function of the wavelengths used during the testing. The results of the different wavelength channels are shown with different colors, and the space between the simulation data points is linearly interpolated. (b) Same as (a) but the cosine similarity values between the all-optical transforms and their ground truth are reported. (c) Same as (a) but the MSE values between the diffractive network output fields and the ground-truth output fields are reported. The shaded areas indicate the standard deviation values calculated based on all the samples in the testing data set.
    An example of a wavelength-multiplexed diffractive network (Nw=8, N≈2NwNiNo=16NiNo=64,800) that all-optically performs eight different permutation (encoding) operations between its input and output FOVs, with each target permutation matrix assigned to a unique wavelength. (a) Input/output examples. Each one of the Nw=8 wavelength channels in the diffractive processor is assigned to a different permutation matrix Pw. The absolute differences between the diffractive network output fields and the ground-truth (target) permuted (encoded) output fields are also shown in the last column. (b) Arbitrarily generated permutation matrices P1,P2,…,P8 that serve as the ground truth (target) for the wavelength-multiplexed diffractive permutation transformations shown in (a).
    Experimental validation of a wavelength-multiplexed diffractive network with Nw=2 and Ni=No=32. (a) Photograph of the experimental setup, including the schematic of the THz setup. (b) The fabricated wavelength-multiplexed diffractive processor. (c) The learned thickness profiles of the diffractive layers. (d) Photographs of the 3D-printed diffractive layers. (e) Experimental results of the diffractive processor for the two wavelength channels λ1=0.667 mm and λ2=0.698 mm using the fabricated diffractive layers, which reveal a good agreement with their numerical counterparts and the ground truth. λm=(λ1+λ2)/2=0.6825 mm.
    Tools

    Get Citation

    Copy Citation Text

    Jingxi Li, Tianyi Gan, Bijie Bai, Yi Luo, Mona Jarrahi, Aydogan Ozcan, "Massively parallel universal linear transformations using a wavelength-multiplexed diffractive optical network," Adv. Photon. 5, 016003 (2023)

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Research Articles

    Received: Sep. 15, 2022

    Accepted: Dec. 27, 2022

    Posted: Jan. 4, 2023

    Published Online: Jan. 11, 2023

    The Author Email: Ozcan Aydogan (ozcan@ucla.edu)

    DOI:10.1117/1.AP.5.1.016003

    Topics