Chinese Optics Letters, Volume 19, Issue 1, 011301 (2021)
Intelligent algorithms: new avenues for designing nanophotonic devices [Invited]
The research on nanophotonic devices has made great progress during the past decades. Researchers have pursued various device functions to meet the needs of practical applications. However, most traditional methods rely on human experience and physical intuition for structural design and parameter optimization, which usually require considerable resources, and the performance of the resulting devices is limited. Intelligent algorithms, which comprise a rich family of optimization algorithms, have shown a vigorous development trend in the field of nanophotonic devices in recent years. Designing nanophotonic devices with intelligent algorithms can break the restrictions of traditional methods and predict novel configurations, and the approach is universal and efficient across different materials, structures, modes, wavelengths, etc. In this review, intelligent algorithms for designing nanophotonic devices are introduced from their concepts to their applications, including deep learning methods, the gradient-based inverse design method, swarm intelligence algorithms, individual inspired algorithms, and some other algorithms. The design principles based on intelligent algorithms and the design of typical new nanophotonic devices are reviewed. Intelligent algorithms can play an important role in designing complex functions and improving the performance of nanophotonic devices, providing new avenues for the realization of photonic chips.
1. Introduction
The various technical challenges that traditional electronic devices have faced in recent years suggest that Moore’s law is becoming increasingly difficult to maintain.
Intelligent algorithms are, in many cases, practical alternative techniques for solving a variety of challenging engineering problems.
In this review article, the deep learning method, the gradient-based inverse design method, swarm intelligence algorithms [including genetic algorithm (GA), particle swarm optimization (PSO), and ant colony algorithm (ACA)], individual inspired algorithms [including the simulated annealing algorithm (SAA), the hill climbing algorithm, and tabu search (TS)], and some other algorithms [including the direct binary search (DBS) algorithm, topology optimization, and Monte Carlo method] are introduced from research background or concept to applications for designing nanophotonic devices. A summary of the intelligent algorithms and their applications for designing nanophotonic devices is shown in Fig. 1. Corresponding application examples of nanophotonic devices are listed under each mentioned intelligent algorithm. The advances in the design of nanophotonic devices using various intelligent algorithms may bring new inspiration for further research of nanophotonic structures and devices. Recently, our group has developed an intelligent algorithm by combining GA and the finite element method (FEM), and we have realized on-chip wavelength routers.
Figure 1.Summary of intelligent algorithms and their applications for designing nanophotonic devices in this review.
This review includes seven sections. The first section is the introduction, where we illustrate the purpose of writing this review. The second section is about the deep learning method, especially the artificial neural network, where the history, principle, and applications are demonstrated. In the third section, the gradient-based inverse design is introduced, including the adjoint algorithm for optimizing the parameters of nanophotonic devices, which is a further improvement on the gradient-based inverse design method for optimizing systems that follow known laws of physics. The fourth section focuses on swarm intelligence algorithms, introducing GA and PSO, which have been widely used in recent years, as well as ACA, which is often used to optimize the design of solar devices. The fifth section covers individual inspired algorithms, including the SAA, the hill climbing algorithm, and the TS algorithm, which are introduced from the aspects of concept, development process, and application. The sixth section covers some other intelligent algorithms, including DBS, topology optimization, and the Monte Carlo method, which play an important role in designing multiplexers, band structures, optical imaging, etc. The last section is the summary, which summarizes the advantages of intelligent algorithms in designing complex functions and improving device performance for nanophotonic devices, and explains the development trend of using intelligent algorithms, especially in the design of nanophotonic devices in the future.
2. Nanophotonic Devices Based on Deep Learning Methods
In 2016, after the artificial intelligence (AI) program “AlphaGo” defeated the Go world champion Lee Sedol, the term “deep learning” became firmly imprinted in people’s minds.
To get rid of these troubles, researchers have tried to develop algorithms that integrate the process of feature learning into the process of machine learning, which is so-called representation learning. Deep learning is a typical kind of representation learning (see Fig. 2 for the inclusion relation of these three with AI). Deep learning went through a long period of obscurity before AlphaGo made it a blockbuster, and in the past people had almost given it up. It was not until 2006, when Hinton et al. proposed a model called the “deep belief network,” that deep learning came back into the spotlight.
Figure 2.Inclusion relation of machine learning, representation learning, deep learning, and artificial intelligence.
The core of deep learning is the design of the artificial neural network (ANN). As the term suggests, the structure of the ANN is based on the simulation of the neural network of the human brain. Some neurons apply an activation to the messages received from elsewhere and then pass them on to other neurons. That is to say, deep learning methods are representation learning methods with multiple levels of representation, obtained by composing simple but non-linear modules that each transform the representation at one lower level into a representation at a higher and slightly more abstract level. With the composition of enough transformations, complex functions can be learned.
2.1 Introduction to the deep learning method
In this part, the use of the deep learning method for nanophotonic devices will be introduced and illustrated. First of all, when applying the deep learning method, a certain number of training data need to be generated, and the quantitative characteristics of each data sample, in the form of a one-dimensional vector, are input to the neural network. The input information is processed in the first layer (i.e., the layer after the input layer) and then transferred to the next layer. Taking the neurons in the $l$th layer as an example (see Fig. 3), this layer has $n^{[l]}$ neurons (the number of layers, the number of neurons in each layer, and other parameters preset before training are called hyperparameters). $Z^{[l]}$ and $A^{[l]}$ are used to represent the information before and after processing with the activation function, respectively. The processing of information by the neurons in the $l$th layer can then be expressed as follows: $$Z^{[l]} = W^{[l]} A^{[l-1]} + b^{[l]}, \qquad A^{[l]} = g\big(Z^{[l]}\big).$$
Figure 3.Neurons in each layer process and transfer data in the form of column vectors, and the weights of the neural network are expressed as matrices.
Here, $W^{[l]}$ is an $n^{[l]} \times n^{[l-1]}$ weight matrix. The function $g$ is the activation function, and the commonly used non-linear activation functions are the sigmoid function, the ReLU function, the Tanh function, etc. When the information is transferred to the last layer and activated, the so-called prediction value is obtained. After selecting an appropriate cost function, the chain rule is used for backpropagation, and the stochastic gradient descent (SGD) algorithm is used to update the parameters (weights and biases) of each neuron in the neural network, which ends one round of training. After feeding a large number of training data, it is expected that the parameter set Θ of the ANN will be updated to values more suitable for dealing with similar problems, i.e., the cost function converges to a local minimum. By using the test set after the training process, problems such as underfitting and overfitting can be detected. Specifically, this is done by calculating the variance and bias and drawing the learning curve during the testing process.
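As an illustration of the forward pass and gradient-descent update described above, the following is a minimal sketch (our own toy example, not taken from the reviewed works) of a one-hidden-layer network trained on an arbitrary regression task; the layer sizes, learning rate, and target function are illustrative choices, and full-batch gradient descent is used for brevity in place of SGD.

```python
import numpy as np

rng = np.random.default_rng(0)
n_in, n_hidden, n_out = 4, 8, 1           # hyperparameters (layer sizes)
W1, b1 = rng.normal(0, 0.5, (n_hidden, n_in)), np.zeros((n_hidden, 1))
W2, b2 = rng.normal(0, 0.5, (n_out, n_hidden)), np.zeros((n_out, 1))
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))
lr = 0.1                                   # learning rate

X = rng.normal(size=(n_in, 64))            # 64 training samples as column vectors
Y = np.sum(X, axis=0, keepdims=True)       # toy regression target: sum of the inputs

for epoch in range(200):
    # forward pass: Z[l] = W[l] A[l-1] + b[l],  A[l] = g(Z[l])
    Z1 = W1 @ X + b1;  A1 = sigmoid(Z1)
    Z2 = W2 @ A1 + b2; A2 = Z2             # linear output layer (regression)
    cost = np.mean((A2 - Y) ** 2)          # mean-squared-error cost function
    # backpropagation via the chain rule
    dZ2 = 2 * (A2 - Y) / X.shape[1]
    dW2, db2 = dZ2 @ A1.T, dZ2.sum(axis=1, keepdims=True)
    dZ1 = (W2.T @ dZ2) * A1 * (1 - A1)     # sigmoid'(Z1) = A1 * (1 - A1)
    dW1, db1 = dZ1 @ X.T, dZ1.sum(axis=1, keepdims=True)
    # gradient-descent parameter update
    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2
    if epoch % 50 == 0:
        print(epoch, float(cost))
```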
Usually, an ANN can deal with two kinds of problems: regression problems and classification problems. The time consumed to train an ANN is one reference for evaluating it. Also, when evaluating the performance of a neural network on classification problems, metrics such as precision and recall are often introduced, even though it is sometimes necessary to trade off these two metrics. Several strategies can be used to improve the performance, such as collecting more training data, applying regularization, and adjusting the network architecture and other hyperparameters.
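For reference, precision and recall can be computed directly from the prediction counts, as in the short sketch below (a generic toy example, not tied to any specific device-design task).

```python
import numpy as np

# Toy binary classification results (hypothetical labels and predictions).
y_true = np.array([1, 0, 1, 1, 0, 1, 0, 0, 1, 0])
y_pred = np.array([1, 0, 1, 0, 0, 1, 1, 0, 1, 0])

tp = np.sum((y_pred == 1) & (y_true == 1))   # true positives
fp = np.sum((y_pred == 1) & (y_true == 0))   # false positives
fn = np.sum((y_pred == 0) & (y_true == 1))   # false negatives

precision = tp / (tp + fp)   # fraction of predicted positives that are correct
recall = tp / (tp + fn)      # fraction of actual positives that are recovered
print(f"precision = {precision:.2f}, recall = {recall:.2f}")
```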
Deep learning methods have been applied in many fields, including the design of nanophotonic devices. In order to design and evaluate a nanophotonic device, it is necessary to predict the optical response, and the prediction is usually implemented by solving Maxwell’s equations using dedicated numerical methods.
2.2 Typical architectures of ANNs
In this part, typical architectures of ANNs will be introduced and illustrated. There are several typical architectures of ANNs that are often adopted to design and optimize nanostructures with different functions.
Malkiel et al. trained and tested a bidirectional deep-learning architecture with the capability of predicting the geometry of nanostructures solely based on the far-field response of the nanostructures, and the prediction is accurate.
Figure 4.(a) Bidirectional network used for inverse design[13]. (b) The TN consists of an inverse design network and a forward modeling network[14]. (c) A CNN consists of two bidirectional neural networks, and it is capable of automatically designing and optimizing three-dimensional (3D) chiral metamaterials with strong chiral-optical responses at specified wavelengths[17]. (d) A DNN for forward and inverse design of a power splitter[16].
In most cases, neural networks with more layers perform better, whereas fully connected deep neural networks (FCDNNs) generally suffer from the problem of vanishing gradients. As a result, increasing the depth of an FCDNN does not necessarily improve the performance. Kojima et al. solved this problem by using a residual deep neural network [ResNet, see Fig. 4(d)] to increase the depth of training up to 8 hidden layers for both the forward and inverse problems.
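The key idea behind such residual networks is the skip connection, sketched below in a generic numpy form (an illustration of the concept only, not the architecture used by Kojima et al.; sizes and weights are arbitrary).

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def residual_block(a_in, W, b):
    """One simple residual block: the input is added back to the transformed
    signal (a skip connection), which helps gradients flow through deep networks."""
    z = W @ a_in + b
    return relu(z) + a_in          # one common variant of the skip connection

# Example: chain a few blocks of width 8 (sizes chosen arbitrarily).
rng = np.random.default_rng(1)
a = rng.normal(size=(8, 1))
for _ in range(4):
    W = rng.normal(0, 0.3, (8, 8))
    b = np.zeros((8, 1))
    a = residual_block(a, W, b)
```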
As a typical neural network structure, the convolutional neural network (CNN) has been successfully applied in the field of image recognition and is now also used in the design of nanophotonic devices. The two main advantages of CNNs over FCDNNs are parameter sharing and sparsity of connections (i.e., in each layer, each output value depends only on a small number of inputs, which somewhat avoids the problem of overfitting and is more suitable for design problems with more parameters). Ma et al. reported a CNN model comprising two bidirectional neural networks assembled by a partial stacking strategy [see Fig. 4(c)] to automatically design and optimize 3D chiral metamaterials with strong chiral-optical responses at predesignated wavelengths.
Figure 5.(a) CNN used to predict the invariance of 1D photonic crystal[18]. (b) A novel CAVE for the design of a power splitter[23].
Another type of neural network whose range of application is expanding rapidly is the generative adversarial network (GAN).
Recently, benefiting from the development of deep learning itself and open-source software libraries such as TensorFlow, there have been more and more reports of applying neural networks to the design of nanophotonic devices, and the overall trend is that the employed neural network structures are becoming more advanced and complex. In addition, taking advantage of the “black box” characteristic of deep learning (i.e., people do not care about its internal structure, but only its input and output), some novel algorithms have been invented by modifying the deep learning method. Zhou et al. designed two programmable optical signal processing chips with a learning ability based on the idea of the deep learning method.
2.3 Discussion and outlook
Deep learning methods have many advantages over traditional algorithms. First, once trained, a neural network costs less time than traditional algorithms (i.e., it has a lower computational cost) and is more likely to find better local optimal solutions. For example, using neural networks to predict the spectrum of a nanoscale optical device tends to be more accurate than traditional algorithms. Hammond et al. trained ANNs to model both strip waveguides and chirped Bragg gratings, and they found that the trained ANNs decreased the computational cost relative to traditional design methodologies by more than 4 orders of magnitude.
However, deep learning methods also have some limitations and drawbacks. First, since the design of nanophotonic devices is a non-convex problem, it is impossible to guarantee that the designed devices are optimal. Jiang et al. presented a global optimizer that performs a global search for the optimal device within the design space, but the final devices may still not be optimal.
At the end of this section, some prospects for neural networks are given. Based on the outstanding performance of the deep learning method in the nanophotonic field and the analysis of a number of papers, we can confidently predict that fewer and fewer nanophotonic device design works will be carried out in the future without a deep learning algorithm. Its flexibility also makes it an excellent candidate for handling other nanophotonic problems.
3. Nanophotonic Devices Based on the Gradient-Based Inverse Design
In recent decades, the importance of inverse problems has grown considerably in many fields. The mathematical expression of a physical law is a rule that defines a mapping T of a set of functions ξ, called the parameters, into a set of functions δ, called the results. According to the above expression, by finding inverse mappings of δ into ξ, inverse problems can be defined in a precise mathematical form that excludes the so-called “fitting procedure,” in which models depending on a few parameters and giving a good fit of the experimental results are obtained by trial and error or any other techniques.
3.1 Introduction to the gradient-based inverse design
The Vuckovic group at Stanford University reported an inverse design algorithm, and a variety of nanophotonic devices have been designed by the algorithm, such as multi-channel devices and power splitters (routers).
Here, $\hat{\mathbf{n}}$ is a unit vector pointing in the propagation direction, and $\mathbf{r}_\perp$ denotes the coordinates perpendicular to the propagation direction. Faraday’s law relates the corresponding magnetic field of the mode to its electric field.
More generally, the output mode amplitude $\alpha$ can be specified as a linear function of the electric field $\mathbf{E}$, which can be written as $\alpha = \mathbf{c}^{\dagger}\mathbf{E}$ for a suitable overlap vector $\mathbf{c}$.
After the problem formulation, the gradient-based inverse design algorithm solves Maxwell’s equations numerically and employs numerical optimization techniques to design devices. It uses two methods to solve this problem: the ‘objective first’ method and a ‘steepest descent’ method. In the objective first method, the algorithm constrains the electric fields to satisfy the performance constraints in Eq. (7). Then the algorithm minimizes the violation of physics using the alternating direction method of multipliers (ADMM) optimization algorithm.
3.2 Application of the gradient-based inverse design
The gradient-based inverse design algorithm is a relatively general computational method for nanophotonic design that is widely used in the design of nanophotonic devices. In this section, we present some typical nanophotonic devices designed by the inverse design algorithm. The multi-channel device, which is called a hub by its designers, is shown in Fig. 6(a).
Figure 6.Nanophotonic devices designed by the gradient-based inverse design. (a) The structure diagram of the multi-channel device (hub).
In addition to the multi-channel device in a single polarization mode, the algorithm is also used to design devices that can exhibit different functionality for different input excitations, such as mode converters. Figure 6(c) is a schematic diagram of the TE mode converter, which is a mode conversion device operating in TE polarization.
The researchers then improved the algorithm further by introducing the adjoint method to compute the gradient efficiently using a single time-reversed electromagnetic simulation. Usually, when we optimize the parameters of a system, we know the laws of physics (usually expressed as a partial differential equation, PDE) that the system follows. This type of problem, called PDE-constrained optimization, has many application scenarios.
The adjoint method can optimize parameters well and solve practical problems; it is relatively mature and can be widely used. The time-dependent adjoint method can also be used to solve optimal control problems. If the state transition of the control problem itself is too complex to admit a closed-form solution, gradient descent is a good choice considering the constraints. On the other hand, in optimal control we can consider the randomness of the system transfer, so the adjoint method can obviously take similar randomness into account. Given a final time $T$, we consider the optimization over the time period $[0, T]$.
System: $\dot{x}(t) = f\big(x(t), \theta, t\big)$. The initial state $x(0)$ is given, and the subsequent evolution of $x(t)$ follows this ODE.
Loss function: $L(\theta) = \int_0^T \ell\big(x(t), \theta, t\big)\,\mathrm{d}t$. This is an integral over time.
Similarly, we define the Lagrange function $$\mathcal{L} = \int_0^T \Big[\ell(x,\theta,t) + \lambda(t)^{\mathrm T}\big(f(x,\theta,t) - \dot{x}(t)\big)\Big]\,\mathrm{d}t.$$
After taking the derivative with respect to $\theta$, we can get $$\frac{\mathrm{d}\mathcal{L}}{\mathrm{d}\theta} = \int_0^T \Big[\ell_\theta + \lambda^{\mathrm T} f_\theta + \big(\ell_x + \lambda^{\mathrm T} f_x\big)x_\theta - \lambda^{\mathrm T}\dot{x}_\theta\Big]\,\mathrm{d}t.$$
And then we simplify the term containing $\dot{x}_\theta$ by integration by parts, using $x_\theta(0) = 0$: $$\int_0^T \lambda^{\mathrm T}\dot{x}_\theta\,\mathrm{d}t = \lambda(T)^{\mathrm T} x_\theta(T) - \int_0^T \dot{\lambda}^{\mathrm T} x_\theta\,\mathrm{d}t.$$
The end result is $$\frac{\mathrm{d}\mathcal{L}}{\mathrm{d}\theta} = \int_0^T \big(\ell_\theta + \lambda^{\mathrm T} f_\theta\big)\,\mathrm{d}t + \int_0^T \big(\ell_x + \lambda^{\mathrm T} f_x + \dot{\lambda}^{\mathrm T}\big)x_\theta\,\mathrm{d}t - \lambda(T)^{\mathrm T} x_\theta(T).$$
We can take the multiplier $\lambda(t)$ such that $\dot{\lambda}^{\mathrm T} = -\big(\ell_x + \lambda^{\mathrm T} f_x\big)$ with the terminal condition $\lambda(T) = 0$, so that both of the terms in the brackets are zero; the gradient then reduces to $\mathrm{d}L/\mathrm{d}\theta = \int_0^T \big(\ell_\theta + \lambda^{\mathrm T} f_\theta\big)\,\mathrm{d}t$.
In a word, during the whole optimization process, only three steps are needed for a single gradient-descent update: simulate the system forward in time to obtain $x(t)$, solve the adjoint ODE backward in time to obtain $\lambda(t)$, and evaluate the integral above to obtain the gradient with which $\theta$ is updated.
Thus, for each step of gradient descent, we only need to do a few simulations and then solve a few ODEs. The computation is greatly reduced.
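To make the three steps concrete, here is a minimal numerical sketch (our own toy example, not from the cited works) for the scalar system $\dot{x} = -\theta x$ with loss $L = \int_0^T x(t)^2\,\mathrm{d}t$; the adjoint gradient is compared with a finite-difference estimate. The parameter values and grid size are arbitrary.

```python
import numpy as np

# Toy adjoint-gradient example: dx/dt = -theta*x, x(0)=1, L = int_0^T x^2 dt.
theta, T, N = 1.5, 1.0, 20000
dt = T / N

def forward(theta):
    """Step 1: simulate the system forward in time (forward Euler)."""
    x = np.empty(N + 1); x[0] = 1.0
    for k in range(N):
        x[k + 1] = x[k] + dt * (-theta * x[k])
    return x

def adjoint_gradient(theta):
    x = forward(theta)
    # Step 2: solve the adjoint ODE backward in time,
    #   dlam/dt = -(dl/dx + lam*df/dx) = -2*x + theta*lam,  lam(T) = 0.
    lam = np.empty(N + 1); lam[N] = 0.0
    for k in range(N, 0, -1):
        lam[k - 1] = lam[k] - dt * (-2.0 * x[k] + theta * lam[k])
    # Step 3: dL/dtheta = int_0^T (dl/dtheta + lam*df/dtheta) dt,
    #   with dl/dtheta = 0 and df/dtheta = -x.
    return np.sum(lam[:-1] * (-x[:-1])) * dt

def loss(theta):
    x = forward(theta)
    return np.sum(x[:-1] ** 2) * dt

eps = 1e-5
fd = (loss(theta + eps) - loss(theta - eps)) / (2 * eps)   # finite-difference check
print("adjoint gradient:", adjoint_gradient(theta), " finite difference:", fd)
```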
The application of the adjoint method to optimization problems with constraints has two main aspects.
In the improved gradient-based inverse design algorithm, the adjoint algorithm is used to calculate the gradient and optimize the parameters and the structure. With boundary parameterization and structure optimization, a broadband optimization to produce a robust device can be performed.
Figure 7.Nanophotonic devices designed by the gradient-based inverse design. (a) The structure diagram of the TE/TM router[62]. (b) The electromagnetic energy density of the TE/TM router at 1550 nm. (c) Measured transmission of the three-channel router[65]. (d) Simulated electromagnetic energy density of the three-channel router at the three operating wavelengths.
With the development of artificial intelligence and information technology, more and more types of nanophotonic devices have been recently designed by the inverse design algorithm. It tends to be used in the design of multifunctional devices and cascaded devices, such as laser-driven particle accelerators, resonators, interfacing grating couplers of conceptual photonic circuits, and switches.
Figure 8.Nanophotonic devices designed by the gradient-based inverse design. (a) SEM image of cascaded Fano–Lorentzian resonators implemented on a silicon-on-insulator platform[67].
3.3 Discussions
The gradient-based inverse design can automatically design photonic devices and only requires the user to input high-level parameters. The algorithm can handle a large parameter space and design devices that exploit the full space of fabricable devices. It tends to require fewer simulations than genetic or particle swarm optimization, as it does not rely on parameter sweeps or random perturbations to find its minima. The gradient-based inverse design algorithm can be used to design photonic devices with any passive and linear photonic element. However, the design achieved by the inverse design algorithm typically exhibits a continuous topography, and some very small components may be formed in the structures during the inverse design process, which brings challenges for sample fabrication. Moreover, the gradient-based inverse design method usually produces a local optimal solution and cannot realize true global optimization.
4. Nanophotonic Devices Based on Swarm Intelligence Algorithms
Swarm intelligence refers to the phenomenon in which non-intelligent agents exhibit intelligent collective behavior through cooperation; it is a class of computing techniques based on the laws of biological group behavior. In recent years, various algorithms have appeared in the research field of swarm intelligence theory, such as the genetic algorithm (GA), particle swarm optimization (PSO), and the ant colony algorithm (ACA). Research on both theory and application has proved that swarm intelligence algorithms are effective methods that can solve most optimization problems effectively.
4.1 Genetic algorithm
GA is an adaptive global search optimization algorithm that simulates the genetic and evolutionary process of organisms in natural environments.
According to individual fitness and certain rules, some individuals with excellent traits are selected from the $t$th generation population and passed on to the next, $(t+1)$th, generation. In this selection process, the greater the fitness of an individual, the greater its chance of being selected for the next generation. For an individual $i$ with fitness $f_i$ in a population of size $N$, the probability of being selected is $$P_i = \frac{f_i}{\sum_{j=1}^{N} f_j}.$$
Individuals selected from the population are randomly paired and, for each pair, parts of their chromosomes (portions of the encoded bit string) are swapped with a certain probability (crossover probability, typically 0.25–1.0). In this way, the search ability of GA is extended. Figure 9 is the flow chart of the GA.
Figure 9.The flow chart of GA[77].
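As a concrete illustration of the selection–crossover–mutation loop in Fig. 9, the following is a minimal binary GA sketch (a generic toy example, not a device-design code; the population size, rates, and fitness function are arbitrary choices).

```python
import numpy as np

rng = np.random.default_rng(0)
n_pop, n_bits, n_gen = 30, 16, 60          # population size, chromosome length, generations
p_cross, p_mut = 0.8, 1.0 / n_bits         # crossover and per-bit mutation probabilities

def fitness(pop):
    # Toy objective: maximize the number of 1-bits in each chromosome.
    return pop.sum(axis=1).astype(float) + 1e-9   # small offset avoids division by zero

pop = rng.integers(0, 2, size=(n_pop, n_bits))
for gen in range(n_gen):
    f = fitness(pop)
    probs = f / f.sum()                    # roulette-wheel selection: P_i = f_i / sum_j f_j
    parents = pop[rng.choice(n_pop, size=n_pop, p=probs)]
    children = parents.copy()
    for i in range(0, n_pop - 1, 2):       # single-point crossover on consecutive pairs
        if rng.random() < p_cross:
            cut = rng.integers(1, n_bits)
            children[i, cut:], children[i + 1, cut:] = (
                parents[i + 1, cut:].copy(), parents[i, cut:].copy())
    mutate = rng.random(children.shape) < p_mut    # bit-flip mutation
    children = np.where(mutate, 1 - children, children)
    pop = children

print("best fitness:", int(fitness(pop).max()), "of", n_bits)
```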
To obtain the desired optical properties, Huntington et al. designed a lattice evolution algorithm that allows lattice optical materials to exhibit simple properties or focus light on discrete points.
Figure 10.Nanophotonic devices designed by GA. (a) Lattice optical materials capable of focusing light into several different focal points in the far field. The left is a schematic diagram of the experimental device. The right shows light focused on several different points through a lattice of lattice optical materials[6]. (b) Simulated reflection characteristics of antireflection coatings[76]. (c) The left is the initial silicon plate and the corresponding electric field distribution before optimization, and the right is the structure and electric field distribution of the reflector after optimization[77]. (d) The structure obtained after GA and simulated transmittance spectrum[78].
Yu et al. used GA to optimize the design of the prevalent thin-film-on-insulator platform for reflectors.
Our group has constructed an intelligent algorithm by combining GA and FEM to design a wavelength router[7] and a polarization router[8], as shown in Fig. 11.
Figure 11.Nanophotonic devices designed by GA. (a) The structure diagram of wavelength router and (b) the simulated transmittance[7]. (c) The optimized structure of the polarization router. (d) and (e) are the simulated transmission spectra of the polarization router’s O1 and O2 ports[8].
Chen et al. proposed a GA-based method for optimizing field emission (FE) devices, as shown in Figs. 12(a) and 12(b)[79].
Figure 12.Nanophotonic devices designed by GA. (a) Measured data and calculated results (red solid line); the inset is a schematic of carbon nanotube films and diode FE measurements. (b) Optimized electron beam trajectories for this type of FE device[79]. (c) The total normalized scattering efficiency (black line) and the contributions of the induced electric dipole (ED) and magnetic dipole (MD) moments of core-shell nanoparticles[80].
GA is well suited to complex problems, such as those in which many system parameters must be optimized at the same time or in which the application problem has no well-defined, unique optimal value. GA can not only solve single-objective optimization problems, but also plays an even more important role in multi-objective optimization problems. A common selection strategy in multi-objective GA is to define individual fitness through different methods. Although the local search ability of GA is poor, it is often used in combination with other algorithms, taking advantage of its easy parallel implementation, to improve the overall performance, as demonstrated in many works.
Just like GA, which is based on biological evolution, the cultural algorithm (CA) uses cultural or social evolution to simulate human society and solves optimization problems by using domain knowledge to reduce the search space.
4.2 Particle swarm optimization
The PSO algorithm is derived from simulation studies of the migration and aggregation behavior of birds during foraging. The basic idea is to find the optimal solution through cooperation and information sharing among the individuals of a group. It combines the characteristics of evolutionary computation and swarm intelligence, and it is essentially a kind of random search algorithm.
In PSO, the velocity and position of each particle in the solution space are initialized over the set of possible solutions.
The whole process can be represented by the equations $$v_i^{t+1} = \omega v_i^{t} + c_1 r_1\big(p_{\mathrm{best},i} - x_i^{t}\big) + c_2 r_2\big(g_{\mathrm{best}} - x_i^{t}\big), \qquad x_i^{t+1} = x_i^{t} + v_i^{t+1},$$ where $\omega$ is the inertia weight, $c_1$ and $c_2$ are acceleration coefficients, $r_1$ and $r_2$ are random numbers in $[0,1]$, $p_{\mathrm{best},i}$ is the best position found by particle $i$, and $g_{\mathrm{best}}$ is the best position found by the whole swarm.
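A minimal PSO sketch implementing these update equations on a toy continuous objective is given below (the objective, swarm size, and coefficients are arbitrary illustrative choices).

```python
import numpy as np

rng = np.random.default_rng(0)
n_particles, n_dim, n_iter = 20, 2, 100
w, c1, c2 = 0.7, 1.5, 1.5                      # inertia weight, acceleration coefficients

def objective(x):
    # Toy objective to minimize: the sphere function, minimum at the origin.
    return np.sum(x**2, axis=-1)

x = rng.uniform(-5, 5, (n_particles, n_dim))   # positions
v = np.zeros_like(x)                           # velocities
p_best = x.copy()
p_best_val = objective(x)
g_best = p_best[np.argmin(p_best_val)].copy()

for _ in range(n_iter):
    r1 = rng.random((n_particles, n_dim))
    r2 = rng.random((n_particles, n_dim))
    v = w * v + c1 * r1 * (p_best - x) + c2 * r2 * (g_best - x)
    x = x + v
    val = objective(x)
    improved = val < p_best_val                       # update personal bests
    p_best[improved], p_best_val[improved] = x[improved], val[improved]
    g_best = p_best[np.argmin(p_best_val)].copy()     # update global best

print("best position:", g_best, "objective:", objective(g_best))
```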
Using the PSO algorithm to optimize the parameters, Djavid et al. proposed an evolutionary design approach for the photonic crystal notch filter.
Figure 13.Nanophotonic devices designed by PSO. (a) A notch filter based on a microcavity and (b) single frame extracted from a video recording of the electric field intensity of the notch filter at the wavelength of 1500 nm[86]. (c) The structure of the taper optimized by PSO and the electric field distribution inside it[5]. (d) The optimized geometry of the silver nanoparticle array and (e) the magnitude of its Fourier transform[87].
In order to design a binary mask, Rogers et al. used a binary PSO algorithm to optimize the mask.
Figure 14.Nanophotonic devices designed by PSO. (a) The SEM image of SOL and (b) the SEM image of the cluster of nanoholes on the metal membrane. The SOL image shows all the main features of the cluster[88]. (c) Optimized power splitter device and (d) normalized strength[90]. (e) The white rectangle represents the spatial distribution of the nanometer aperture of the two-channel multiplexing lens. (f) The simulated intensity profiles of the radiated beam of the two-channel multiplexing metalens in the xz plane[91].
Ha et al. proposed a design method for an ultra-compact, small-footprint lens, combining the PSO algorithm with spatial technology.
PSO approaches the optimal solution fairly quickly and can effectively optimize the parameters of a system. The advantage of PSO is that it can be applied to continuous function optimization problems. The main drawback of this method is that it easily produces premature convergence, especially when dealing with complex problems with multiple optima, and its local optimization ability is poor. PSO falls into local minima mainly because of the loss of population diversity in the search space. To further improve it, we can either combine it with other algorithms or add a mutation operation. PSO has been used to optimize nanostructures and design nanophotonic devices, and it can be used to optimize multidimensional problems. Although PSO has high requirements for parameter settings, its process is easy to understand and its convergence speed is fast.
4.3 Ant colony algorithm
ACA is derived by simulating the process of ants finding their way in nature, and it is an intelligent algorithm for searching the shortest path. ACA has the advantage of strong robustness.
The basic ACA is expressed as follows: at the initial moment, $m$ ants are randomly placed, and the initial amount of pheromone on each path is equal. At moment $t$, the probability of the $k$th ant moving from node $i$ to node $j$ is $$p_{ij}^{k}(t) = \frac{\big[\tau_{ij}(t)\big]^{\alpha}\big[\eta_{ij}\big]^{\beta}}{\sum_{s \in \mathrm{allowed}_k}\big[\tau_{is}(t)\big]^{\alpha}\big[\eta_{is}\big]^{\beta}},$$ where $\tau_{ij}(t)$ is the pheromone on edge $(i, j)$, $\eta_{ij}$ is a heuristic visibility factor, $\alpha$ and $\beta$ weight their relative importance, and $\mathrm{allowed}_k$ is the set of nodes the $k$th ant may visit next.
Figure 15.The flow chart of ACA optimization process[94].
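For illustration, the sketch below applies this transition rule, together with a simple pheromone evaporation and deposit update, to a tiny four-node tour problem (all distances, coefficients, and ant counts are arbitrary toy values, not from the cited works).

```python
import numpy as np

rng = np.random.default_rng(0)

# Tiny symmetric distance matrix between 4 nodes (illustrative values).
d = np.array([[0, 2, 9, 10],
              [2, 0, 6, 4],
              [9, 6, 0, 3],
              [10, 4, 3, 0]], dtype=float)
n = d.shape[0]
tau = np.ones((n, n))                      # initial pheromone, equal on each path
eta = 1.0 / (d + np.eye(n))                # visibility = 1/distance (diagonal padded)
alpha, beta, rho, Q = 1.0, 2.0, 0.5, 1.0   # pheromone/visibility weights, evaporation, deposit

def build_tour(start=0):
    """One ant builds a tour using the transition probability p_ij."""
    tour, allowed = [start], set(range(n)) - {start}
    while allowed:
        i = tour[-1]
        cand = np.array(sorted(allowed))
        weights = (tau[i, cand] ** alpha) * (eta[i, cand] ** beta)
        tour.append(rng.choice(cand, p=weights / weights.sum()))
        allowed.discard(tour[-1])
    return tour

for it in range(50):                       # a few iterations with 5 ants each
    tours = [build_tour() for _ in range(5)]
    tau *= (1.0 - rho)                     # pheromone evaporation
    for t in tours:                        # deposit pheromone along each closed tour
        length = sum(d[t[k], t[(k + 1) % n]] for k in range(n))
        for k in range(n):
            i, j = t[k], t[(k + 1) % n]
            tau[i, j] += Q / length
            tau[j, i] += Q / length

tour = build_tour()
print("tour:", tour, "length:", sum(d[tour[k], tour[(k + 1) % n]] for k in range(n)))
```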
Using ACA, Saouane et al. obtained the optimal inclination angle for a photovoltaic collector through simulation and improved the efficiency of the collector.
Figure 16.Nanophotonic devices designed by ACA. (a) The ACA-based method was used to calculate the reflection coefficient of the antireflection coating system on silicon substrate and (b) the simulation results show that the reflectivity of the antireflection coating system is changed with wavelength and incident angle by ACA[95].
However, if the parameters are not set properly, the solution speed will be very slow and the quality of the solution will be poor. In the early stage, ACA requires a long search time and a large amount of calculation, which leads to a long overall solution time. In the design of nanophotonic devices, ACA is suitable for combinatorial optimization and continuous function optimization. The whole process of the algorithm is intuitive, but it takes a long time to solve.
For swarm intelligence algorithms, the overhead of each individual in the system is very small, and the functions that each individual realizes are very simple, so the execution time of each individual is short. Therefore, the implementation is relatively simple, and it is convenient for researchers to program and parallelize on a computer. However, parameter sensitivity is a problem that needs attention, because improper parameter selection will increase the time cost and complexity of subsequent calculations.
5. Nanophotonic Devices Based on Individual Inspired Algorithms
5.1 Simulated annealing algorithm
The SAA was first introduced by Kirkpatrick et al. in 1983, mainly for discrete optimization problems. Originating from the physical process in which a crystalline solid slowly cools down from a relatively high temperature and gradually forms a regular crystal configuration during annealing, the algorithm provides a strategy to escape local optima in the hope of reaching the global optimum.
Figure 17 illustrates the flow chart of simulated annealing. The algorithm begins with a customized initial temperature, which is lowered at a given rate at the end of each iteration. At each temperature, a newly found solution is compared with the current one based on a given objective function. A better solution is always accepted, while a worse solution with a higher objective function value can also be accepted according to the Metropolis criterion; that is, the algorithm accepts a worse solution with the probability $$P = \exp\!\left(-\frac{\Delta E}{T}\right),$$ where $\Delta E$ is the increase of the objective function value and $T$ is the current temperature.
Figure 17.The flow chart of SAA[98].
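The following is a minimal simulated-annealing sketch following the flow chart above, applied to a toy one-dimensional objective (the objective, initial temperature, cooling rate, and trial counts are illustrative choices only).

```python
import numpy as np

rng = np.random.default_rng(0)

def objective(x):
    # Toy multimodal objective to minimize (illustrative only).
    return 0.1 * x**2 + np.sin(3 * x)

x = rng.uniform(-10, 10)                  # random initial guess
fx = objective(x)
T, T_min, cooling = 10.0, 1e-3, 0.95      # initial temperature, stop temperature, cooling rate

while T > T_min:
    for _ in range(50):                   # trials per temperature
        x_new = x + rng.normal(0, 1.0)    # propose a neighboring solution
        f_new = objective(x_new)
        # Metropolis criterion: always accept improvements, sometimes accept worse ones.
        if f_new < fx or rng.random() < np.exp(-(f_new - fx) / T):
            x, fx = x_new, f_new
    T *= cooling                          # lower the temperature

print("final solution:", x, "objective:", fx)
```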
Different from swarm intelligence algorithms, SAA has a simple structure that allows it to be applied under various circumstances. As another advantage, SAA requires no knowledge of the specific problem and is thus robust to a random initial guess. The convergence of SAA has been proven with strict mathematical demonstration.
The analogous physical annealing process suggests setting a high initial temperature to avoid an insufficient cooling process, that is, a loss of the ability to escape local minima. But this wastes computing budget, because the algorithm loses the ability to judge the quality of newly found solutions and accepts all of them until the excessively high initial temperature cools to a critical temperature. The critical temperature represents a balance point at which better objective function values are preferred, but the temperature is still warm enough to tunnel through such solutions. Since the algorithm needs no knowledge of the problem, we have no a priori idea of an appropriate initial temperature. In that case, experiments are needed to identify the initial temperature, and such a method was proposed by Basu et al.
Due to the mechanism of SAA, a large computing budget is always expected in the search for the optimum. The situation deteriorates even more with an excessive initial temperature. Considering the efficiency and computing time required in the field of nanophotonic devices, it is not appropriate to employ such a time-consuming algorithm alone, which might be the reason why SAA has not been widely used to design nanophotonic devices. However, strategies such as combining SAA with other algorithms to improve its efficiency can still be a good option when it comes to devices with discrete parameters to be optimized.
The use of SAA for optical inverse design was proposed by Hara.
Figure 18.Nanophotonic devices designed by SAA. (a) A schematic of the twisted light emitter. (b) Details of the structure parameters.
Figure 19.Nanophotonic devices designed by SAA. (a) Schematic of the photonic spin element. Incident light is coupled into different waveguides according to the spin states. (b) The core component of an optical element. The design area is divided into 288 pixels. The green blocks stand for optimized structures filled with silicon and the white blocks stand for air. (c) The measured output power at different ports when the polarization of incident light varies[103].
5.2 Hill-climbing algorithm
The hill-climbing algorithm is a local search algorithm. Its advantage is that it does not need to traverse the whole solution space to reach its highest point; instead, it heuristically selects neighboring nodes with higher values, which greatly improves the efficiency.
Figure 20.Nanophotonic devices designed by the hill-climbing algorithm. (a) An example of the target function in which the difficulties of hill climbing are shown. (b) The schematic of the photonic crystal split-beam nanocavity. R1, R2, and R3 are optimized by the algorithm. (c) Experimental transmission spectrum of the split-beam cavity under 0.6 mW input power over the whole measurement range for the 2nd TE mode individually and (d) for the 4th TE mode individually[105].
Figure 21.(a) Flowchart of the hill climbing algorithm. (b) An example of the target function in which the difficulties of hill climbing are shown.
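A minimal hill-climbing sketch on a toy objective is given below (the objective, step size, and neighborhood are arbitrary illustrative choices); it also shows how the algorithm stalls at whichever local maximum is nearest to the starting point.

```python
import numpy as np

rng = np.random.default_rng(0)

def objective(x):
    # Toy objective to maximize, with several local maxima (illustrative only).
    return -0.05 * x**2 + np.cos(2 * x)

x = rng.uniform(-8, 8)            # starting point determines which peak is reached
step = 0.05

while True:
    # Evaluate the two neighbors and move to the better one, if it improves.
    neighbors = [x - step, x + step]
    best = max(neighbors, key=objective)
    if objective(best) <= objective(x):
        break                      # no uphill neighbor: stop at a local maximum
    x = best

print("local maximum found at x =", x, "objective =", objective(x))
```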
The optimization of nanophotonic devices is often a complex problem. It is necessary to find the optimal solution in the full parameter space, and the form of the objective function is often complicated. Therefore, hill climbing is not an excellent method for designing nanophotonic devices. However, when the initial structure has been proved to possess an effective function, using hill climbing can further improve the performance of the device. In the design of a photonic crystal split-beam nanocavity, for example, hill climbing was used to fine-tune a few structural parameters (R1, R2, and R3 in Fig. 20) of an already effective initial structure[105].
The hill-climbing algorithm is a relatively basic algorithm that is easy to start with. However, with the development of intelligent algorithms, more complex algorithms have obvious advantages in the design of nanophotonic devices and are more widely adopted.
5.3 Tabu search
The TS algorithm is a metaheuristic local search method that uses a memory structure, the tabu list, to prevent the search from revisiting recently explored solutions and thus to escape local optima.
In order to avoid repeated searching, a flexible “memory” technique, the establishment of the tabu list, is used in the TS search to record and select the optimization process that has been performed to guide the next search direction. The tabu list has an associated size, which can be a fixed size or change during the iterative process and can be visualized as a window on accepted moves. The moves that tend to undo moves within this window are forbidden.
Figure 22.The flowchart of TS.
The advantage of TS is that it provides a very effective way to jump out of local optimal solutions, and it has a fast convergence speed, finding the optimal solution with fewer iterations. Since TS does not guarantee traversal of the full parameter space, it may still end up at a local optimal solution. The search path is determined by the direction from the current solution to its neighborhood, so the structure of the neighborhood, that is, the mapping relationship between the current solution and its neighbors, is particularly important.
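A minimal tabu-search sketch on a small discrete problem is shown below (the objective, neighborhood, tabu-list size, and iteration count are all arbitrary illustrative choices).

```python
import numpy as np
from collections import deque

rng = np.random.default_rng(0)

def objective(x):
    # Toy objective over integer points, to maximize (illustrative only).
    return -(x - 17) ** 2 + 5 * np.cos(x)

x = int(rng.integers(-50, 50))            # initial solution
best_x, best_f = x, objective(x)
tabu = deque(maxlen=8)                    # tabu list: window of recently visited solutions

for _ in range(200):
    # Neighborhood: integer steps around the current solution, excluding tabu moves.
    neighbors = [x + d for d in (-3, -2, -1, 1, 2, 3) if (x + d) not in tabu]
    if not neighbors:
        break
    x = max(neighbors, key=objective)     # move to the best non-tabu neighbor,
    tabu.append(x)                        # even if it is worse than the current solution
    if objective(x) > best_f:             # keep track of the best solution seen so far
        best_x, best_f = x, objective(x)

print("best solution:", best_x, "objective:", best_f)
```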
Gagnon et al. used the TS algorithm to solve inverse design problems in integrated photonics.
Figure 23.Nanophotonic devices designed by TS. (a) Basic photonic lattice configuration for the beam shaping problem. (b) Best possible trade-off between the amplitude and the phase profile of the beam in the beam shaping problem[109]. (c) The |Ez| field profile (arbitrary units) and comparison of orthogonal polarization components along target plane of optimized TM polarized Gaussian beam. (d) The |Hz| field profile (arbitrary units) and comparison of orthogonal polarization components along target plane of optimized TE polarized Gaussian beam[110].
TS has its opportunities in optimizing nanophotonic devices, especially when the parameter space is finite with discrete numeric values. However, due to relatively few reports, the prospects of this field need to be further explored.
6. Nanophotonic Devices Based on Other Algorithms
6.1 Direct binary search
As mentioned above, compared with conventional approaches, intelligent algorithms are beneficial to the design of compact devices and can search the full parameter space. As one of the crucial algorithms, the DBS algorithm has drawn more and more attention recently. DBS is an iterative search algorithm that was first used for the synthesis of digital holograms.
Figure 24.The flow chart of DBS algorithm[112].
With the development of intelligent optimization algorithms, the DBS algorithm has found more application domains, and there are some improved versions of the DBS algorithm. The modified version of the DBS algorithm operates in an iterative fashion. In the application of this method, the device should first be discretized into “pixels”. The possible pixel states are two different materials, and the two states are denoted by 1 and 0. During each iteration of the DBS algorithm, each pixel is toggled between these two states, and the pixel to be perturbed is chosen at random. Then, a figure of merit (FOM) or objective function is calculated for the resulting device. If the FOM is improved, the perturbation is kept, and the next parameter is perturbed and the FOM is evaluated. If the FOM is not improved, the perturbation is discarded. At this point, an alternate perturbation (of the opposite sign) may be applied and the FOM re-evaluated. This perturbation cycle continues until all the parameters have been addressed, which completes one iteration of the DBS algorithm. Such iterations are continued until the FOM converges to a stable value. An upper bound on the total number of iterations and a minimum change in FOM are defined to enforce numerical convergence.
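The pixel-toggling loop described above can be sketched as follows; this is a generic skeleton in which `compute_fom` is a hypothetical placeholder for the electromagnetic simulation that would be used in a real design flow.

```python
import numpy as np

rng = np.random.default_rng(0)

def compute_fom(pixels):
    """Hypothetical placeholder for the figure of merit. In practice this would run
    an electromagnetic simulation of the pixelated device; here we simply reward
    matching an arbitrary target pattern so the sketch is self-contained."""
    target = np.indices(pixels.shape).sum(axis=0) % 2   # checkerboard target
    return np.mean(pixels == target)

pixels = rng.integers(0, 2, size=(20, 20))   # device discretized into binary pixels
fom = compute_fom(pixels)

max_iters, min_delta = 50, 1e-6
for it in range(max_iters):
    fom_start = fom
    order = rng.permutation(pixels.size)     # visit every pixel once, in random order
    for idx in order:
        i, j = np.unravel_index(idx, pixels.shape)
        pixels[i, j] ^= 1                    # toggle the pixel between the two states
        new_fom = compute_fom(pixels)
        if new_fom > fom:
            fom = new_fom                    # keep the perturbation
        else:
            pixels[i, j] ^= 1                # discard it (toggle back)
    if fom - fom_start < min_delta:          # FOM has converged to a stable value
        break

print("final FOM:", fom)
```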
The algorithm provides an effective approach to designing on-chip nanophotonic devices, such as the design of diffractive optics.
Figure 25.Nanophotonic devices designed by DBS. (a) Panel a, structure diagram of a free-space to multi-mode waveguide coupler and polarization splitter; panels b and c are simulated time-averaged intensity distribution for light polarized along X and that polarized along Y, respectively[120]. (b) The structure diagram of a polarization splitter. (c) and (d) The simulated steady-state intensity distributions for TE and TM polarized light at the design wavelength of 1550 nm, respectively[113]. (e) and (f) Reference coupled system and the cloak for micro-ring resonator[124].
In the device designs, they made use of the concept of free-form metamaterials and found that allowing the geometry of the metamaterials to be freely optimized enables highly functional devices. Moreover, nanopatterning makes it possible to engineer the refractive index in space at a deep sub-wavelength scale. In this way, devices that achieve high-efficiency mode conversion in an extremely small area become feasible. They then designed a polarization beam splitter with an ultra-compact footprint in the same way, which is shown in Fig. 25(b), and the simulated steady-state intensity distributions for TE and TM polarized light at the design wavelength of 1550 nm are shown in Figs. 25(c) and 25(d), respectively.
With the development of the photonic integrated circuit, a higher density integration is required. One of the options to increase integration density is to decrease the spacing between the individual devices. An optical waveguide in the plane of the photonic integrated circuit is one of the most fundamental structures. However, the integration density of the waveguide is limited by the leakage of light from one waveguide to its neighbor if the spacing between them is too small. The DBS algorithm is employed to design an integrated cloak with a footprint of just a few micrometers to decrease this spacing without considerably increasing cross talk.
The DBS algorithm was also used to optimize the structure of the nanohole distribution. The microscope image of the circuit with a four-stage cascaded crossing is shown in Fig. 26(a), and the zoomed-in SEM image of the nanostructured crossing is shown in Fig. 26(b). The transmission spectra of the cascaded crossing were measured and normalized, as shown in Fig. 26(c). Moreover, Han et al. theoretically designed three power splitters based on a photonic-crystal-like metamaterial structure using the DBS algorithm.
Figure 26.Nanophotonic devices designed by DBS. (a) The top-view microscope image of the mode-division multiplexing circuit (top), and the lower left corner is the microscope image of the four-cascaded crossing[126]. (b) The scanning electron microscope image. (c) The measured transmission spectra for the mode-division multiplexing circuit. (d) The top view of the 1 × 4 power splitter (top), and the bottom is optical field distribution[129]. (e) Excess loss of each output port.
The DBS algorithm is a simpler iterative algorithm for the design of nanophotonic devices. The discrete structure generated by DBS algorithms is more favorable to the fabrication using traditional manufacturing techniques like focused ion beam milling or electron beam lithography. However, the DBS algorithm has some limitations. First, the algorithm is guaranteed to converge, but not necessarily to a global minimum. It inherently produces a suboptimal result, as the DBS algorithm converges to the first local minimum during the search process. Second, it is computationally expensive and suitable for discrete solution space and small parameter space. The cost of the calculation and the probability of the DBS algorithm falling into the local optimal value will increase as the search space increases. Third, the algorithm is sensitive to the starting point. In view of the above analysis, there is an urgent need to develop an algorithm to design the optimal and multi-function integrated device.
6.2 Topology optimization
Topology optimization is a mathematical method for optimizing the distribution of materials in a given area according to given loads, constraints, and performance indicators. It is one of the most promising branches of structural optimization, offering greater design freedom and design space. Continuous topology optimization methods include the homogenization method, the variable density method, the level set method, etc. The homogenization method uses the finite element method to discretize the design area and assumes that the entire design space consists of microstructure units (unit cells) similar to a “stomata distribution”. The unit cells are evenly distributed and of the same size before the optimization starts. In the process of topology optimization, the unit cell density distribution changes; that is, the unit cell density in high-stress areas becomes larger while the unit cell density in low-stress areas becomes smaller. A load-bearing structure is formed during the optimization process, which is “dense” in high-stress areas and “sparse” in low-stress areas. When the iterative calculations are completed, a reasonable minimum density is defined, and the areas in the design space where the unit cell density is lower than this minimum are removed to produce a weight-optimized load-bearing structure that uses the material most effectively. The variable density method expresses the relationship between the relative density of an element and the elastic modulus of the material in the form of a density function of continuous variables, seeks the best force transmission route of the structure, and optimizes the distribution of materials in the design area. It has the advantages of easy program implementation, high calculation efficiency, and good calculation accuracy. However, the result of this method has fuzzy boundaries. The level set method is discussed below. The topology optimization of discrete structures is mainly based on the ground structure method, using different algorithms to solve the problem. Topology optimization is more and more widely used due to its advantages.
Figure 27.(a) The structure of the topology optimization algorithm used in the work. (b) The 3D model of gold nanoparticle dimer with predefined key parameters in geometry and material.
The level set method is a numerical technique for interface tracking and shape modeling. One advantage of the level set method is that curves and surfaces can be numerically calculated on a Cartesian grid without parameterizing them (the so-called Eulerian approach). Another advantage is that it is easy to track topology changes of the object; for example, an object may be divided into two parts, merged into one, or develop a new cavity or a new entity. All of these make the level set method a powerful tool for modeling time-varying objects, such as the expansion of an airbag or oil droplets falling into water. However, the level set function needs to be updated by solving a PDE, and during the process it must be periodically re-initialized to keep this update well behaved, which can greatly reduce the convergence speed of the optimization or even prevent convergence.
As an example, the optimal design of photonic bandgaps for 2D square lattices has been considered.
Let $\phi(\mathbf{x})$ be the level set function whose zero contour defines the interface between the two dielectric materials in the unit cell.
The main approach is to evolve the level set function, and hence the dielectric interface, iteratively so as to enlarge the bandgap, recomputing the band structure at each step.
The evolution of the dielectric distribution is shown in Fig. 28(a). The change of the bandgap as the number of iterations increases is shown in Fig. 28(b). The final band structure for maximizing the bandgap between two adjacent bands is shown in Fig. 28(c).
Figure 28.Nanophotonic devices designed by the level set method. (a) The evolution of the dielectric distribution[133]. (b) The bandgap versus the iteration. (c) The final band structure with the largest bandgap between two adjacent bands.
The level set method can calculate the curves and surfaces in the evolution process numerically on the Cartesian grid without parametric curves and surfaces. It has a larger application space, and it is believed that the level set algorithm can solve more problems.
6.3 Monte Carlo method
The Monte Carlo method, also known as a statistical simulation method, is a very important numerical calculation method guided by the theory of probability and statistics; it was proposed in the mid-1940s with the development of science and technology and the invention of electronic computers. The Monte Carlo method uses random numbers to solve many computing problems and is widely used in financial engineering, macroeconomics, computational physics, and other fields.
The Monte Carlo method usually solves mathematical problems by constructing random numbers that conform to certain rules. It is an effective method for finding numerical solutions to problems that are too complex to solve analytically or that have no analytical solution at all. The most common application of the Monte Carlo method in mathematics is the Monte Carlo integral.
Applying the Monte Carlo method to practical problems involves two main parts: constructing a probabilistic model whose statistical quantity (such as an expectation) corresponds to the solution of the problem, and generating random samples from this model to estimate that quantity.
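As a simple illustration of both parts, the sketch below estimates the integral $\int_0^1 \sin(\pi x)\,\mathrm{d}x$ (exact value $2/\pi$) by averaging the integrand over uniform random samples; the sample count is an arbitrary choice.

```python
import numpy as np

rng = np.random.default_rng(0)

# Part 1: model the integral as an expectation, I = E[sin(pi * X)] with X ~ U(0, 1).
# Part 2: estimate the expectation from random samples.
n_samples = 100_000
x = rng.uniform(0.0, 1.0, n_samples)
samples = np.sin(np.pi * x)

estimate = samples.mean()
std_error = samples.std(ddof=1) / np.sqrt(n_samples)   # statistical error of the estimate

print(f"Monte Carlo estimate: {estimate:.5f} +/- {std_error:.5f}")
print(f"exact value 2/pi    : {2 / np.pi:.5f}")
```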
With the help of computer technology, the Monte Carlo method has many advantages; it is simple and fast, eliminating the need for complicated mathematical derivation and calculation. Moreover, the Monte Carlo method has strong adaptability, and the geometric complexity of the problem has little influence on it. It is believed that the Monte Carlo method will have more applications in the field of nanophotonics.
7. Summary and Outlook
In this review article, we extensively discuss a variety of intelligent algorithms including deep learning methods, the gradient-based inverse design method, swarm intelligence algorithms, individual inspired algorithms, and other intelligent algorithms, as well as nanophotonic devices designed using these algorithms. Some representative examples are used to analyze various intelligent algorithms for different situations. In many practical applications, intelligent algorithms are practical methods to deal with various challenging problems. The advantages, disadvantages, characteristics, and suitable devices of the algorithms discussed in this paper are presented in Table 1.
Table 1. Advantages, disadvantages, characteristics, and suitable devices of the intelligent algorithms discussed in this review.
Compared with traditional design methods, intelligent algorithms are universal and efficient. For example, the advantages of deep learning are that, once trained, it takes less time (i.e., less computational cost) than traditional algorithms and is more likely to find better solutions. In addition, compared with traditional algorithms, the deep learning method can realize inverse design more easily. ANNs have many typical architectures and strong flexibility; according to the design requirements of the device and the problems encountered during training, an appropriate neural network can be chosen for the optimal design. However, deep learning also has drawbacks. First, the design of nanophotonic devices is non-convex, and there is no guarantee that the designed devices are optimal. Second, preparing training sets and training neural networks require a lot of computing time and cost, especially when dealing with complex learning tasks. Third, further analysis using trained neural networks is difficult because the learning mechanisms of ANNs (note that they are sometimes useful) operate as black boxes. However, useful information about the features of photonic structures can be extracted by introducing proper techniques such as the latent space.
The gradient-based inverse design method can automatically design nanophotonic devices and only requires the user to input high-level parameters. This method can handle a large parameter space and design devices using the full space of manufacturable devices, and it often requires fewer simulations than GA or PSO because it does not rely on parameter sweeps or random perturbations to find its minima. This method can be used to design any passive, linear photonic device.
However, the resulting design usually presents a continuous topography, and some very small structural components may be formed during the inverse design process, which presents a challenge for sample fabrication. In addition, the gradient-based inverse design method usually produces only local optimal solutions and cannot realize a true global optimum.
Swarm intelligence algorithms have certain robustness and strong evolutionary or search ability. GA can not only solve single-objective optimization problems, but also plays a greater role in multi-objective optimization problems. It has the characteristics of group search and is suitable for solving complex optimization problems, such as those in which multiple system parameters must be optimized at the same time or in which the application problem has no clear, unique optimal value. Moreover, GA is scalable and easy to combine with other algorithms. However, the search efficiency of GA in the later stage of evolution is relatively low, and it is prone to premature convergence. Although the local search ability of the genetic algorithm is poor, it is often used in combination with other algorithms to improve its performance, owing to its easy parallel implementation. PSO approaches the optimal solution quickly and can effectively optimize the parameters of a system, and its process is simple and easy to understand. The advantage of PSO is that it can be applied to continuous function optimization problems. Its main drawbacks are that it is sensitive to parameter settings, that it easily produces premature convergence when dealing with complex problems with multiple optima, and that its local optimization ability is poor. PSO falls into local minima mainly because of the loss of diversity in the search space; it can be improved by combining it with other algorithms or by adding mutation operations. PSO has been used to optimize nanostructures and design nanophotonic devices, and it can be used to optimize multidimensional problems. The ACA is suitable for combinatorial optimization and continuous function optimization. The whole algorithm process is intuitive and easy to understand, but it takes a long time to solve. ACA is robust in solving performance and easy to implement in parallel; therefore, other algorithms are usually combined with ACA to improve the overall performance and to design more ideal nanophotonic devices.
Individual inspired algorithms can give a good solution within an acceptable time, but cannot guarantee that it is optimal. The calculation process of SAA is simple, and it has strong universality and robustness. However, it is very sensitive to customized parameters, especially the initial temperature. When faced with a large number of parameters to be optimized, SAA randomly selects new solutions from the solution space, which weakens its performance; with many unknown parameters, the search efficiency and the possibility of finding the optimal solution decrease. The hill-climbing algorithm is more intuitive and has small memory requirements, but it cannot solve large-scale, multi-constraint problems. The TS algorithm has a fast convergence speed and needs few iterations, but the results depend on the initial solution and the neighborhood structure.
The DBS algorithm is a simple iterative algorithm for designing nanophotonic devices. The discrete structures generated by the DBS algorithm are more compatible with traditional manufacturing techniques such as focused ion beam milling or electron beam lithography. However, the DBS algorithm has some limitations. First, the algorithm is guaranteed to converge, but not necessarily to the global minimum; when it converges to the first local minimum during the search, it inherently produces a suboptimal result. Second, it is suitable for discrete solution spaces and small parameter spaces because of its large computational cost; the calculation cost and the probability of falling into a local optimum increase as the search space increases. Third, the algorithm is sensitive to the starting point. Topology optimization offers more design freedom and design space. Among its variants, the level set method used in the design of nanophotonic devices can numerically calculate the evolving curves and surfaces on a Cartesian grid without parameterizing them, but the process is more complex and requires a certain mathematical foundation. The level set equation needs to be updated with a partial differential equation and periodically re-initialized to keep this update well behaved, which greatly reduces the convergence rate of the optimization or even prevents convergence. The Monte Carlo method has strong adaptability and can solve probability and statistics problems easily and quickly. However, the number of samples must be large enough, and the calculation process is long.
As the need for nanophotonic devices to achieve more functions grows, intelligent algorithms, especially the currently popular deep learning method with its higher efficiency and better performance, are expected to play an increasingly important role in the design of nanophotonic devices.
[16] M. H. Tahersima, K. Kojima, T. Koike-Akino, D. Jha, B. Wang, C. Lin, K. Parsons. Deep neural network inverse design of integrated photonic power splitters. Sci. Rep., 9, 1368(2019).
[18] B. Wu, K. Ding, C. T. Chan, Y. Chen. Machine prediction of topological transitions in photonic crystals(2017).
[20] I. J. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, Y. Bengio. Generative adversarial nets, 1(2014).
[23] Y. Tang, K. Kojima, T. Koike-Akino, Y. Wang, P. Wu, M. TaherSima, D. Jha, K. Parsons, M. Qi. Generative deep learning model for a multi-level nano-optic broadband power splitter, Th1A.1(2020).
[24] H. Zhou, Y. Zhao, G. Xu, X. Wang, Z. Tan, J. Dong, X. Zhang. Chip-scale optical matrix computation for PageRank algorithm. IEEE J. Sel. Top. Quant, 26, 8300910(2020).
[29] W. J. Brouwer, J. D. Kubicki, J. O. Sofo, C. L. Giles. An investigation of machine learning methods applied to structure prediction in condensed matter(2014).
[39] W. Ma, Y. Liu. A data-efficient self-supervised deep learning model for design and characterization of nanophotonic structures. Sci. China Phys. Mech. Astron., 63, 284212(2020).
[40] Z. Liu, D. Zhu, K. Lee, A. S. Kim, L. Raju, W. Cai. Compounding meta-atoms into meta-molecules with hybrid artificial intelligence techniques. Adv. Mater., 32, 1904790(2019).
[47] E. Khoram, A. Chen, D. Liu, L. Ying, Q. Wang, M. Yuan, Z. Yu. Nanophotonic media for artificial neural inference. Opt. Lett., 7, 823(2019).
[48] J. Chang, V. Sitzmann, X. Dun, W. Heidrich, G. Wetzstein. Hybrid optical-electronic convolutional neural networks with optimized diffractive optics for image classification. Sci. Rep., 8, 12324(2018).
[51] Y. Qu, L. Jing, Y. Shen, M. Qiu, M. Soljačić. Basic instincts. ACS Photon., 6, 1168(2019).
[59] K. Chadan, P. C. Sabatier, R. G. Newton. Inverse Problems in Quantum Scattering Theory(1988).
[60] A. Y. Piggott, J. Petykiewicz, L. Su, J. Vučković. Fabrication-constrained nanophotonic inverse design. Sci. Rep., 7, 1786(2017).
[67] K. Y. Yang, J. Skarda, M. Cotrufo, A. Dutt, G. H. Ahn, M. Sawaby, D. Vercruysse, A. Arbabian, S. Fan, A. Alù, J. Vučković. Inverse-designed non-reciprocal pulse router for chip-based LiDAR. Nat. Photon., 14, 369(2020).
[68] P. Camayd-Muñoz, G. Roberts, C. Ballew, M. Debbas, A. Faraon. Inverse designed shape-reconfigurable multifunctional photonics, FW3B.2(2020).
[81] K. Liao, T. Gan, X. Hu, Q. Gong. AI-assisted on-chip nanophotonic convolver based on silicon metasurface. Nanophotonics, 9, 3315(2020).
[84] N. Padhye. Topology optimization of compliant mechanism using multi-objective particle swarm optimization, 1831(2008).
[97] M. Čepin. Assessment of Power System Reliability(2011).
[101] Y. T. Lu, Y. Q. Zhou. Design of multilayer microwave absorbers using hybrid binary lightning search algorithm and simulated annealing. Photon. Network Commun., 78, 75(2017).
[104] S. J. Russell, P. Norvig. Artificial Intelligence: A Modern Approach(2003).
[107] E.-G. Talbi. Metaheuristics: From Design to Implementation(2009).
[123] A. Majumder, B. Shen, R. Polson, T. Andrew, R. Menon. An ultra-compact nanophotonic optical modulator using multi-state topological optimization(2017).
[136] A. Keller, S. Heinrich, H. Niederreiter. Monte Carlo and Quasi-Monte Carlo Methods(2006).
[137] K. Binder. Applications of the Monte Carlo Method in Statistical Physics(1987).
[138] R. Y. Rubinstein, D. P. Kroese. Simulation and the Monte Carlo Method(2008).
Lifeng Ma, Jing Li, Zhouhui Liu, Yuxuan Zhang, Nianen Zhang, Shuqiao Zheng, Cuicui Lu, "Intelligent algorithms: new avenues for designing nanophotonic devices [Invited]," Chin. Opt. Lett. 19, 011301 (2021)
Category: Integrated Optics
Received: Jun. 10, 2020
Accepted: Sep. 4, 2020
Published Online: Dec. 28, 2020
The Author Email: Cuicui Lu (cuicuilu@bit.edu.cn)