Advanced Photonics Nexus, Volume. 3, Issue 6, 066014(2024)

AI-enabled universal image-spectrum fusion spectroscopy based on self-supervised plasma modeling

Feiyu Guan1、†, Yuanchao Liu2, Xuechen Niu1, Weihua Huang1, Wei Li3, Peichao Zheng3, Deng Zhang4, Gang Xu5、*, and Lianbo Guo1、*
Author Affiliations
  • 1Huazhong University of Science and Technology, Wuhan National Laboratory for Optoelectronics, Wuhan, China
  • 2City University of Hong Kong, Department of Physics, Hong Kong, China
  • 3Chongqing University of Posts and Telecommunications, School of Optoelectronic Engineering, Chongqing, China
  • 4Nanjing Normal University, School of Computer and Electronic Information, Nanjing, China
  • 5Huazhong University of Science and Technology, School of Optical and Electronic Information, Wuhan, China
  • show less

    Spectroscopy, especially for plasma spectroscopy, provides a powerful platform for biological and material analysis with its elemental and molecular fingerprinting capability. Artificial intelligence (AI) has the tremendous potential to build a universal quantitative framework covering all branches of plasma spectroscopy based on its unmatched representation and generalization ability. Herein, we introduce an AI-based unified method called self-supervised image-spectrum twin information fusion detection (SISTIFD) to collect twin co-occurrence signals of the plasma and to intelligently predict the physical parameters for improving the performances of all plasma spectroscopic techniques. It can fuse the spectra and plasma images in synchronization, derive the plasma parameters (total number density, plasma temperature, electron density, and other implicit factors), and provide accurate results. The experimental data demonstrate their excellent utility and capacity, with a reduction of 98% in evaluation indices (root mean square error, relative standard deviation, etc.) and an analysis frequency of 143 Hz (much faster than the mainstream detection frame rate of 1 Hz). In addition, as a completely end-to-end and self-supervised framework, the SISTIFD enables automatic detection without manual preprocessing or intervention. With these advantages, it has remarkably enhanced various plasma spectroscopic techniques with state-of-the-art performance and unsealed their possibility in industry, especially in the regions that require both capability and efficiency. This scheme brings new inspiration to the whole field of plasma spectroscopy and enables in situ analysis with a real-world scenario of high throughput, cross-interference, various analyte complexity, and diverse applications.

    Keywords

    1 Introduction

    With the technological advances in analytical chemistry over the past decades, the study of spectroscopy has become popular, providing rich and comprehensive information about chemical and mechanical properties, such as Raman spectroscopy,13 infrared spectroscopy,46 Brillouin spectroscopy,79 etc.10,11 Plasma spectroscopy, a main branch of spectroscopy, utilizes plasma as the source of emission spectra, offering a readily available and reliable technique commonly employed to observe elemental content and distribution. Nowadays, many plasma-based spectroscopic techniques have been developed, such as direct reading spark emission spectroscopy, laser-induced breakdown spectroscopy (LIBS), glow discharge optical emission spectroscopy (GD-OES), inductively coupled plasma emission spectroscopy (ICP-OES), and so on.1215 However, in many cases, plasmas typically have high temperatures, small sizes, and short lifetimes, making their behavior highly unstable and unpredictable. They are susceptible to various interferences, such as spectral fluctuation, matrix effect, etc., which can significantly deteriorate the final analytical results.

    Finding an effective and universal method to realize accurate analysis is one of the predominant challenges for complex analyte analysis, high throughput detection, and industrial applications. To this end, researchers have proposed various improved methods.1619 Although they have made significant progress, a high degree of specificity confines them to aspecific spectroscopic techniques and prevents developments in the whole field of plasma spectroscopy. In addition, some algorithms have also been developed.2022 However, they are constrained by weak physical interpretation and the insufficiency of the spectral information and have a limited processing effect in improving accuracy. In summary, existing methods are only suitable for certain conditions, and a solution for practical applications in real-world scenarios has not yet been reported.

    As previously mentioned, in most cases, the source of issues is the plasma, so the key lies in getting more information from the plasma, thus assessing the severity of interference and correcting the results. Many studies have shown the influence of plasma parameters on spectral results, including plasma temperature, total number density, electron density, etc.2325 Unfortunately, it is difficult to obtain them directly due to the small size and short lifetime of the plasma. A promising concept is the plasma image, which can potentially be designed to provide more information than the conventional spectrum, another kind of presentation of plasma.2628 Although the combination of plasma image and spectrum showed higher performance than conventional methods, they were not widely adopted. Limited by the unsatisfactory data-processing algorithms, only simple features can be extracted manually, while further information is still beyond reach. More recently, artificial intelligence (AI)-enabled spectral processing schemes have been noted, which offer an appealing approach for optimized measurement and intelligent rapid analysis. However, the lack of data sets is a barrier to their universal adoption. The performance of conventional supervised learning depends on the quality and quantity of manual labels, but ideal spectra and labels cannot be experimentally obtained in real-world scenarios.

    To address the aforementioned issues, in this work, we have proposed a self-learning scheme avoiding the need for data and have explored the extension of conventional single-spectrum spectroscopy techniques to plasma image-spectrum fusion detection. A novel quantitative method termed by self-supervised image-spectrum twin information fusion detection (SISTIFD) is proposed for auto-extract deep features, faithful reconstruction of the plasma state, and optimization of quantitative accuracy in a physics-driven manner. In essence, the image and spectrum are a pair of twin signals originating from the plasma, providing us with a different kind of information about the plasma. With this approach, image-spectrum twin information fusion acquisition was achieved, which can provide the multidimensional information about plasma and lay a solid foundation for plasma modeling. After that, a self-supervised framework called a plasma image-spectrum attention network (PISA-Net) was established. It combines physical modeling with deep learning and can provide reliable results for accurate quantification. As a result, we successfully reconstruct the plasma state and get excellent quantitative results. PISA-Net has two pipelines to process plasma images and spectra, respectively, and a submodule specially designed based on the characteristics of the task, takes into consideration the challenges posed by the sparsity and less texture of the data. This scheme is completely self-supervised and enables parameters self-optimization without human intervention or preprocessing, thus achieving automatic analysis. To the best of our knowledge, this is the first method that is applicable to all fields of plasma spectroscopy based on AI-driven plasma modeling.

    We used numerous experiments to evaluate SISTIFD. The first data set is constructed to overcome the lack of available data in this field. With this aim, we performed the experiment based on the LIBS platform, which is an ideal and representative testbed for plasma spectroscopy because of its poor quantitative performance. In addition, GD-OES is another common technique that is suitable for evaluation. Using LIBS and GD-OES as the application examples, the results show that SISTIFD outperforms other existing methods and achieves state-of-the-art performance in terms of accuracy, precision, complexity, and efficiency. Except for the experimental data trial, we validated its capability on each specific interference using different conditions. All the evidence points to the fact that SISTIFD significantly eliminates the interferences and achieves fantastic quantitative accuracy, which shows that it has the potential to become the universal issue of all plasma spectroscopy and bring revolutionary guidance to this field.

    2 Results

    2.1 Self-Supervised Image-Spectrum Twin Information Fusion Detection

    According to classical plasma emission theory, for a plasma under local thermal equilibrium (LTE) condition, the emission intensity of a spectral line Iij is expressed as29Iij=Fnsrgiexp[Ei/(kT)]U(T),where most of the variables are constant, with only the plasma temperature T, the total number density ns, and the electron density ne yet to be determined. In other words, as long as these three parameters can be accurately measured, the performance can be improved. However, the common methods cannot meet the requirements.30 Because the generation of plasma is complex and nonlinear, there are still more implicit interference factors that cannot be directly quantified but significantly affect the results. Consequently, we propose the SISTIFD to overcome existing shortcomings and achieve accurate quantitation (Fig. 1) (see the Supplementary Material for more derivations and details).

    Overview of SISTIFD. (a) The schematic diagram of the system. When the plasma is produced, an intensified charge-coupled device (ICCD) camera and a spectrometer collect signals synchronously according to predetermined procedures. (b) The conventional single-spectrum technique in plasma spectroscopy only uses a spectrum, which exhibits some shortcomings, and the results are not satisfactory. (c) The SISTIFD we proposed can synchronously capture images and emission spectra of plasma and extract many deep features, such as area and brightness to predict physical parameters based on an AI model, thus achieving accurate and precise detection. (The size of symbols represents the standard deviation.)

    Figure 1.Overview of SISTIFD. (a) The schematic diagram of the system. When the plasma is produced, an intensified charge-coupled device (ICCD) camera and a spectrometer collect signals synchronously according to predetermined procedures. (b) The conventional single-spectrum technique in plasma spectroscopy only uses a spectrum, which exhibits some shortcomings, and the results are not satisfactory. (c) The SISTIFD we proposed can synchronously capture images and emission spectra of plasma and extract many deep features, such as area and brightness to predict physical parameters based on an AI model, thus achieving accurate and precise detection. (The size of symbols represents the standard deviation.)

    SISTIFD has two key parts. First, for the detection, the strategy based on the image-spectrum fusion acquisition is considered. In essence, plasma’s images and spectra are a pair of twin signals from plasma, providing a different kind of information. With this method, spectra can provide information about the types and quantities of particles within the plasma, while plasma images can offer insight into the spatial distributions of these particles. Unlike conventional single-spectrum acquisition, we regard spectra and images as distinct but equal representations of the same information source. In other words, their weights for the results are balanced. By asynchronously acquiring plasma images and emission spectra at the same frequency and performing pixel-level fusion, a solid foundation for plasma reconstruction and parameter retrieval is established. Generally speaking, the process of plasma evolution is highly complex and nonlinear, making it difficult to understand directly. Fortunately, AI provides a novel routine. In a recent study, deep learning was used to discover potential patterns and relationships between physical objects.3133 Inspired by these, we propose a universal and efficient framework named PISA-Net. PISA-Net employs an attention mechanism to extract both explicit features and implicit information from fusion data, infer plasma parameters, and correct spectral biases. Based on such a process, the parameters of the plasma are accurately calculated, and accurate and precise quantitative results can be obtained. It is completely self-supervised and end-to-end, thus needing no manual intervention and achieving automatic detection.

    The workflow of PISA-Net is shown in Fig. 2. The core purpose of training is to make the network learn the fitting relationship of plasma parameters and integrate it into the signal correction process, thus outputting the final result directly. The advantage is obvious: we do not need to know the exact plasma parameters, and we can complete the extraction of plasma parameters while achieving the correction of the final result. The preprocessing is performed automatically by the system, as illustrated in Fig. 2(a). The images will be cropped first, and then be normalized and subsampled to decrease the feature space for the model. Similarly, PISA-Net will find the spectral peak of the spectra, normalize the total spectral line, and subsample it to a lightweight structure. Figure 2(b) is the main body of PISA-Net, which is trained on pairs of plasma images and plasma emission spectra. There are two pipelines to process. The image pipeline’s target is to extract features from plasma images based on the novel layers termed by residual attention cells (RACell) we specially designed. For the spectrum pipeline, the one-dimensional data are processed through some preprocess layers, convolution layers, and a leaky ReLU layer,34 resulting in spectrum features. After that, all the features, including plasma parameters and encoded vectors, will be merged into the plasma feature stack and fused to calculate the correct results. The RACell illustrated in Fig. 2(c) is an innovative structure for extracting features from the plasma image. Inspired by Transformer,35 we have a small net that can learn a set of weight coefficients for each channel to represent its importance. The channel-dependent self-gate attention mechanism learns the activation of specific samples for each channel, thus using global information to selectively emphasize specific features and suppress less useful features. In addition, we added soft thresholding as a simple but effective trick for denoising. (See the Supplementary Material for more details.)

    Workflow of PISA-Net. (a) Auto-preprocessing. After the image-spectrum fusion acquisition, the images and spectra need to be preprocessed automatically as the input of PISA-Net. The table contains examples of data that are influenced by different interferences. The images need to be cropped to keep the plasma centered and then be normalized and subsampled for better training. As for the spectra, the first step is finding the spectral peaks of interest. Similarly, the spectra also need to be normalized and subsampled before training. (b) The architecture of PISA-Net consists of an image pipeline and a spectrum pipeline to process different signals and merges them into a plasma feature stack. The feature stack contains both plasma physical parameters and high dimensional eigenvectors. Finally, PISA-Net outputs the final results based on these features. (c) The residual attention cell (RACell), including the channel-attention network and soft thresholding mechanism, is well-designed for sparsity and less texture of the plasma image.

    Figure 2.Workflow of PISA-Net. (a) Auto-preprocessing. After the image-spectrum fusion acquisition, the images and spectra need to be preprocessed automatically as the input of PISA-Net. The table contains examples of data that are influenced by different interferences. The images need to be cropped to keep the plasma centered and then be normalized and subsampled for better training. As for the spectra, the first step is finding the spectral peaks of interest. Similarly, the spectra also need to be normalized and subsampled before training. (b) The architecture of PISA-Net consists of an image pipeline and a spectrum pipeline to process different signals and merges them into a plasma feature stack. The feature stack contains both plasma physical parameters and high dimensional eigenvectors. Finally, PISA-Net outputs the final results based on these features. (c) The residual attention cell (RACell), including the channel-attention network and soft thresholding mechanism, is well-designed for sparsity and less texture of the plasma image.

    2.2 Parameters of Plasma

    Benefiting from these advanced structures, our method has achieved excellent performance in monitoring and predicting plasma parameters. As shown in Fig. 3, we obtained plasma images and spectra with different temperatures by changing the laser energy [Figs. 3(a) and 3(b)], and we compared the results obtained by our method with the plasma temperature calculated using the Boltzmann plot method [Fig. 3(c)]. For the intercept m of the Boltzmann plot, the plasma temperature T can be calculated as T=1/mkb, where kb is the Boltzmann constant.

    Results of plasma parameter calculation. (a) Collected plasma images and (b) spectra at different temperatures. (c) The Boltzmann plot of the Fe element for measurement of plasma temperature. A total of 144 spectral lines were selected for fitting. The plasma temperature can be calculated according to the intercept. (d) The comparison results of the plasma temperature T under different energy conditions. The results of t-test indicate that there is no statistical difference. (e) The relative deviation of the total 50 results obtained by the Boltzmann plot method and SISTIFD under the same condition. (f) Spectra of the Fe I 404.581 nm spectral line after Lorentz fitting under different energy conditions. The electron density can be approximate, estimated based on the FWHM of the spectral lines. (g) The comparison results of electron density ne under different energy conditions. (h) The relative deviation of the total 50 results obtained by the Boltzmann plot method and PISA-Net under the same energy. The results of a t-test indicate that there is no statistical difference. (i) Spectra of soil samples for comparison. (j) The quantitative analysis using original data. (k) The quantitative analysis corrected by T and ne. (l) The quantitative analysis corrected by all parameters (T, ne, ns, and δ).

    Figure 3.Results of plasma parameter calculation. (a) Collected plasma images and (b) spectra at different temperatures. (c) The Boltzmann plot of the Fe element for measurement of plasma temperature. A total of 144 spectral lines were selected for fitting. The plasma temperature can be calculated according to the intercept. (d) The comparison results of the plasma temperature T under different energy conditions. The results of t-test indicate that there is no statistical difference. (e) The relative deviation of the total 50 results obtained by the Boltzmann plot method and SISTIFD under the same condition. (f) Spectra of the Fe I 404.581 nm spectral line after Lorentz fitting under different energy conditions. The electron density can be approximate, estimated based on the FWHM of the spectral lines. (g) The comparison results of electron density ne under different energy conditions. (h) The relative deviation of the total 50 results obtained by the Boltzmann plot method and PISA-Net under the same energy. The results of a t-test indicate that there is no statistical difference. (i) Spectra of soil samples for comparison. (j) The quantitative analysis using original data. (k) The quantitative analysis corrected by T and ne. (l) The quantitative analysis corrected by all parameters (T, ne, ns, and δ).

    To better illustrate the capability for plasma temperature calculation, we measured 50 times under each laser energy condition and calculated their plasma temperature using both the Boltzmann plot method and PISA-Net. Figure 3(d) provides a comparison of the results for energies ranging from 20 to 80 mJ. Statistical testing is performed with a standard two-tailed t-test, and the results indicate that there is no significant difference between them (p>0.05), showing the accuracy of temperature calculations under various conditions. To be more specific, we selected all the 50 measurement results at the laser energy of 50 mJ and calculated the relative deviations between the two methods. As shown in Fig. 3(e), the relative deviation between the two methods is <1%, and in the majority of cases, it is <0.4%. This result primarily indicates the stability and robustness of the plasma temperature calculation of SISTIFD.

    Similarly, for electron density ne, we obtained the full width at half-maximum (FWHM) through Lorentzian fitting of the spectral line of Fe at 404.581 nm and calculated the plasma electron density at different energies based on the Stark broadening method, as shown in Fig. 3(f). For typical plasma, the contribution from ion broadening is generally negligible. Thus, the electron density ne of a plasma can be expressed as ne=FWHM×1016/2ω, where the electron impact parameter ω can be found in the literature.

    We also calculated the electron density ne of 50 plasmas under each energy condition. Figures 3(g) and 3(h) show the final results; the results of two-tailed t-test indicated the deviation is not statistically significant. It is worth noting that although the Stark broadening method is the currently better-performing method, the calculated results are not perfectly accurate. Therefore, larger standard deviations and relative deviations between the results of the two methods are understandable. All the results are relatively advanced compared to existing methods, demonstrating that our method can calculate the plasma parameters very well.

    A comparison of final quantitative results can better evaluate the reliability of the plasma parameter calculation. We used seven soil samples with varying Mg concentrations and induced plasma by laser with random fluctuations in the energy range of 47.2 to 53.9 mJ. Figure 3(i) shows the spectra we collected. We conducted linear fitting on the original spectral intensity [Fig. 3(j)], the intensity corrected by plasma temperature T and electron density ne that we calculated before [Fig. 3(k)], and the intensity corrected by all of the parameters (T, ne, ns, and δ) of PIAS-Net [Fig. 3(l)], to obtain calibration curves for Mg II 279.553 nm spectral line. It can be observed that the unprocessed spectra performed the worst, with the determination coefficient R2 of the calibration curve being only 0.8717. The curve’s linearity significantly improved after the correction by T and ne, reaching an R2 of 0.9314, which further validates the previous conclusions. Finally, comparing it with the calibration curve corrected finally, the R2 of the curve further increased to 0.9989, indicating the accuracy of our calculation for all parameters.

    2.3 Quantitative Results of LIBS

    Previously, we analyzed the theoretical feasibility of calculating the plasma parameters. Next, using LIBS [Fig. 4(a)] as the example, we will further verify the quantitative effectiveness of the method. Under real-world conditions, the spectral intensity is influenced by various interferences, including spectral fluctuation, matrix effect, etc. (Detailed explanations are in Sec. 2.5.) It is challenging to establish reliable regression curves for quantitative analysis. We set a series of factors to simulate this complex process. To verify the performance of our method, we set the laser energy fluctuating between 45.4 and 54.7 mJ, and two different kinds of samples, including K2CO3-soil and potash feldspar samples, were used. In addition, because of the high concentration of K element, the occurrence of self-absorption is foreseeable. Figure 4(b) shows examples of plasma spectra and images under different conditions.

    The quantitative results in LIBS. (a) The schematic diagram of LIBS. A laser is focused on the target, and the generated plasma will emit a spectrum with its elemental fingerprinting capability. (b) Examples of plasma spectra and images under different interference conditions, including spectral fluctuation, unstable excitation, matrix effect, and self-absorption. The spectra and images show significant inconsistencies. (c) Part of the original images and (d) spectra in the experiment under cross-interference, composite, and high throughput conditions. (e) Calibration curves of K I 766.490 nm of conventional LIBS. Different kinds of samples (potash feldspar and soil) were used; the energy of the excitation laser was unstable. Herein, the calibration curves of conventional LIBS were influenced by all sorts of interferences mentioned above, indicating unacceptable spectral analytical results. (f) The results of SISTIFD. These interferences can be overcome by SISTIFD to achieve accurate quantification. Error bars in (e) and (f) represent standard deviation (s.d.) for each data point (n=100), and points are average values.

    Figure 4.The quantitative results in LIBS. (a) The schematic diagram of LIBS. A laser is focused on the target, and the generated plasma will emit a spectrum with its elemental fingerprinting capability. (b) Examples of plasma spectra and images under different interference conditions, including spectral fluctuation, unstable excitation, matrix effect, and self-absorption. The spectra and images show significant inconsistencies. (c) Part of the original images and (d) spectra in the experiment under cross-interference, composite, and high throughput conditions. (e) Calibration curves of K I 766.490 nm of conventional LIBS. Different kinds of samples (potash feldspar and soil) were used; the energy of the excitation laser was unstable. Herein, the calibration curves of conventional LIBS were influenced by all sorts of interferences mentioned above, indicating unacceptable spectral analytical results. (f) The results of SISTIFD. These interferences can be overcome by SISTIFD to achieve accurate quantification. Error bars in (e) and (f) represent standard deviation (s.d.) for each data point (n=100), and points are average values.

    Part of the collected images and spectra in the quantitative experiments are shown in Figs. 4(c) and 4(d), and we can see the difference clearly because of the interferences. The visual calibration curves are as shown in Figs. 4(e) and 4(f). Figure 4(e) shows the original results, and the data points cannot be fit with a straight line, demonstrating the significant influence on the quantitative results by interferences. The results corrected by SISTIFD are shown in Fig. 4(f). We selected three samples as the test set that was completely excluded from the training process. The evaluation parameters are in Table 1. The R2 increases from 0.0359 to 0.9996, and the root mean square error (RMSE) and mean relative error (MRE), the indicators of accuracy, decline from 0.6032 and 0.3471 to 0.0109 and 0.0062, respectively. In addition, the relative standard deviation (RSD) improves from 0.1428 to 0.0037, which means the precision of results has also been improved. The results show that, compared with the poor accuracy and precision of the original spectra, the samples in the test set could be calibrated with almost no errors in our method, especially under extremely harsh conditions, which fully demonstrates the capability of our method.

    • Table 1. Evaluation parameters of the calibration curves established with different methods in LIBS.

      Table 1. Evaluation parameters of the calibration curves established with different methods in LIBS.

      MethodR2RMSEMRERSD
      Conventional LIBS0.03590.60320.34710.1428
      PISA-Net0.99960.01090.00620.0037

    2.4 Results of GD-OES

    To illustrate the universality of SISTIFD for plasma spectroscopy, we also verified its effectiveness using GD-OES [Fig. 5(a)]. We used samples with different concentrations of Cs and Li elements. By applying the voltage at both ends to excite the plasma, we collected their emission spectra using the spectrometer. To obtain reliable results, an 852.1 nm spectral line of Cs element and a 670.8 nm spectral line of Li element were selected for analysis. Figures 5(b) and 5(c) present the plasma images of the Li and Cs elements, respectively.

    The quantitative results in GD-OES. (a) The schematic diagram of GD-OES. High voltage is applied to both ends to excite the plasma, and the sample is passed through a capillary tube. (b) The plasma image of the Li element. (c) The plasma image of the Cs element. (d) The spectra of Li 670.8 nm in different concentrations. (e) Calibration curve of Li 670.8 nm based on conventional GD-OES. (f) Calibration curve of Li 670.8 nm based on SISTIFD. (g) The spectra of Cs 852.1 nm in different concentrations. (h) Calibration curve of Cs 852.1 nm based on conventional GD-OES. (i) Calibration curve of Cs 852.1 nm based on SISTIFD.

    Figure 5.The quantitative results in GD-OES. (a) The schematic diagram of GD-OES. High voltage is applied to both ends to excite the plasma, and the sample is passed through a capillary tube. (b) The plasma image of the Li element. (c) The plasma image of the Cs element. (d) The spectra of Li 670.8 nm in different concentrations. (e) Calibration curve of Li 670.8 nm based on conventional GD-OES. (f) Calibration curve of Li 670.8 nm based on SISTIFD. (g) The spectra of Cs 852.1 nm in different concentrations. (h) Calibration curve of Cs 852.1 nm based on conventional GD-OES. (i) Calibration curve of Cs 852.1 nm based on SISTIFD.

    The results are shown in Figs. 5(d)5(i) and Table 2. Figure 5(d) shows the corresponding spectra of Li; the calibration curves are shown in Figs. 5(e) and 5(f), where Fig. 5(e) is the result using conventional GD-OES; the result of SISTIFD is given in Fig. 5(f). The R2 of the calibration curve was increased from 0.9355 to 0.9997, and the MRE of the test set declined from 0.1033 to 0.0033, which supports our idea that SISTIFD is also effective for other plasma spectroscopy techniques and can help to improve their performance. To provide more evidence, Fig. 5(g) shows the spectra of the Cs 852.1 nm element; the calibration curves are in Figs. 5(h) and 5(i). The R2 of the calibration curve increased from 0.9784 to 0.9993, and the MRE of the test set improved from 0.0293 to 0.0110, demonstrating a very significant improvement in quantitative results. Based on the results of the GD-OES, SISTIFD has also demonstrated significant effectiveness and universality. We have reason to believe that it has the potential to be applied to all detection techniques based on plasma emission spectroscopy as a “universal method.”

    • Table 2. Evaluation parameters of the calibration curves established with different methods in GD-OES.

      Table 2. Evaluation parameters of the calibration curves established with different methods in GD-OES.

      Case 1 (Li)Reference value (ppm)Analytical result (ppm)R2MRE
      Conventional GD-OES0.03000.03310.93550.1033
      SISTIFD0.03010.99970.0033
      Case 2 (Cs)Reference value (ppm)Analytical result (ppm)R2MRE
      Conventional GD-OES0.60000.61760.97840.0293
      SISTIFD0.59340.99930.0110

    2.5 Results of Single Interference

    To better illustrate the mechanism of SISTIFD to provide accurate results, it is necessary to analyze the individual effect of the different interferences specifically. It is proper to use LIBS as the example because the influence is more significant. In this research, both atomic and ionic spectral lines of different elements are selected for quantitative analysis to support our scheme. We studied the contents of Si, Fe, Mg, Mg, and K elements in powder-pressed samples (potash feldspar and soil) and metal samples (aluminum alloy, micro-alloyed steel). The method was compared with the original LIBS, an advanced method called IA-LIBS.36 All of these worked well and were representative. We used both metal and pressed samples to conduct the experimental study and followed the same setting to execute algorithms.

    As shown in Fig. 6, we researched four different interferences, including spectral fluctuation, unstable excitation, matrix effect, and self-absorption. They are the dominant interferences that may be faced in real-world applications. Among them, experimental case 1 is spectral fluctuation, which shows the influence of slight changes in environmental factors. It results in different intensities for the same experimental parameters. Experimental case 2 is unstable excitation, which represents the consequences of the instability of the excitation source. Both of them are precision-influencing conditions. After that, the interferences of the matrix effect and self-absorption are set as the experimental conditions 3 and 4 to illustrate their accuracy. The reason for the matrix effect is the difference in the samples; self-absorption refers to the fact that particles in the outer layer of the plasma absorb some photons, resulting in an abnormally low spectral intensity.

    Comparison of results obtained by conventional LIBS (left) and SISTIFD (right) under different interference conditions. (a) Calibration curve of Si I 250.690 nm spectral line using conventional LIBS within experimental condition 1 that displayed spectral fluctuation. (b) The corresponding results using SISTIFD. (c) Calibration curves of Si I 250.690 nm spectral line using conventional LIBS within experimental condition 2 that existed in unstable excitation. (d) The corresponding results using SISTIFD. (e) Calibration curves of Mn II 293.931 nm spectral line using conventional LIBS in experimental condition 3 that existed as a matrix effect. (f) The corresponding results using SISTIFD. (g) Calibration curves of K I 766.490 nm spectral line using conventional LIBS in experimental condition 4 that existed as self-absorption. (h) The corresponding results using SISTIFD. All the error bars represent s.d. for each data point (n=100), and points are average values.

    Figure 6.Comparison of results obtained by conventional LIBS (left) and SISTIFD (right) under different interference conditions. (a) Calibration curve of Si I 250.690 nm spectral line using conventional LIBS within experimental condition 1 that displayed spectral fluctuation. (b) The corresponding results using SISTIFD. (c) Calibration curves of Si I 250.690 nm spectral line using conventional LIBS within experimental condition 2 that existed in unstable excitation. (d) The corresponding results using SISTIFD. (e) Calibration curves of Mn II 293.931 nm spectral line using conventional LIBS in experimental condition 3 that existed as a matrix effect. (f) The corresponding results using SISTIFD. (g) Calibration curves of K I 766.490 nm spectral line using conventional LIBS in experimental condition 4 that existed as self-absorption. (h) The corresponding results using SISTIFD. All the error bars represent s.d. for each data point (n=100), and points are average values.

    2.5.1 Experimental condition 1 spectral fluctuation

    Due to the sensitivity of the plasma, the slight change of experimental conditions, such as temperature, sample state, and instrument noise, can lead to significant fluctuation in spectral intensities, greatly compromising the precision of detection. Therefore, we explored the effectiveness of our method for spectral fluctuation. We used a Si I 250.690 nm line in potash feldspar samples and Fe II 239.563 nm in aluminum alloy and collected the spectral intensities of each sample under 50 laser pulses.

    The calibration curves are shown in Figs. 6(a) and 6(b); evaluation parameters are given in Table 3. The RSD of the calibration curves declined from 0.0743 and 0.0983 to 0.0014 and 0.0032, which suggested that the precision and stability improved significantly. Besides, the R2, RMSE, and MRE of Si I 250.690 nm were enhanced from 0.9801, 0.9813, and 0.0589 to 0.9998, 0.0091, and 0.0014. The R2, RMSE, and MRE of Fe II 293.931 nm were optimized from 0.9210, 0.0538, and 0.3490 to 0.9984, 0.0017, and 0.0049. The results indicate that our method can effectively eliminate the spectral fluctuation according to the information in plasma. Compared to existing methods, our approach can distribute appropriate weights to the twin information and extract adequate information to achieve better performance.

    • Table 3. Evaluation indices of the calibration curves established with different methods. Two different spectral lines were selected for each interference condition. The results show that our SISTIFD achieved SOTA performance in every indicator.

      Table 3. Evaluation indices of the calibration curves established with different methods. Two different spectral lines were selected for each interference condition. The results show that our SISTIFD achieved SOTA performance in every indicator.

      ConditionElementEvaluation indexMethod
      ConventionalIA-LIBS36SISTIFD (ours)
      Condition 1 spectral fluctuationSi (250.690 nm)R20.98010.98870.9998
      RMSE0.98130.69010.0091
      MRE0.05890.03520.0014
      RSD0.07430.06450.0014
      Fe (239.563 nm)R20.92100.95870.9984
      RMSE0.05380.04210.0017
      MRE0.34900.06540.0049
      RSD0.09830.06990.0032
      Condition 2 unstable excitationSi (250.690 nm)R20.22100.76010.9959
      RMSE0.51230.25110.0260
      MRE0.80810.36210.0647
      RSD2.24650.63320.0501
      Mg (279.553 nm)R20.13200.84660.9992
      RMSE0.39310.27390.0155
      MRE1.63590.83560.0505
      RSD1.71540.81430.0448
      Condition 3 matrix effectMn (293.931 nm)R20.41510.96130.9952
      RMSE0.43260.20610.0044
      MRE0.44570.08170.0099
      Mg (285.213 nm)R20.02440.94200.9998
      RMSE0.13820.01730.0013
      MRE0.63300.07330.0064
      Condition 4 self-absorptionK (766.490 nm)R2Exp fitting0.96210.9995
      RMSE0.4266 (Exp)0.11710.0041
      MRE0.2138 (Exp)0.09240.0019
      K (769.896 nm)R2Exp fitting0.94670.9989
      RMSE0.3669 (Exp)0.24310.0053
      MRE0.3852 (Exp)0.10130.0046

    2.5.2 Experimental condition 2 unstable excitation

    As the excitation source for the plasma, the laser is the most important external factor affecting the spectrum. In practical detection, many laser parameters will fluctuate, and the change in energy can have the most significant impact on the results. In this study, we evaluate the performance of our proposal and other methods under different energy conditions (40, 45, 50, 55, and 60 mJ). We used Si in potash feldspar samples with a 250.690 nm spectral line and Mg in soil with a 279.553 nm spectral line for experiments.

    The calibration curves of Si are shown in Figs. 6(c) and 6(d); the evaluation parameters are shown in Table 3, which suggests the R2, RMSE, MRE, and RSD of the curves improved greatly. The indicators of precision, RSD for Si I 250.690 nm and Mg II 279.553 nm, diminished from 2.2465 and 1.7154 to 0.0501 and 0.0448. The values of R2 improved from 0.2210 and 0.1320 to 0.9959 and 0.9992, respectively. Likewise, for Si, the RMSE and MRE decreased from 0.5123 and 0.8081 to 0.0260 and 0.0647. As for Mg, the RMSE and MRE lessened from 0.3931 and 1.6359 to 0.0155 and 0.0505. Figures S2 and S3 in the Supplementary Material provide more results about the Si and Mg elements.

    2.5.3 Experimental condition 3 matrix effect

    In general, the matrix effect refers to the inconsistent spectral response of the same concentration of elements in different kinds of samples. The matrix effect is primarily manifested in the inconsistency of the calibration curves, which greatly limits the accuracy and precision of quantification, which hinders its further popularization in the industry. Our SISTIFD has the potential to be applied to the correction of the matrix effect, and the experimental results support our claim. The spectral line at 293.931 nm of Mn element in metal samples was selected for visual analysis in Figs. 6(e) and 6(f); Mg I 285.213 nm, Si I 250.690 nm, and Mn II 257.610 nm were also used for the test.

    The intensities of Mn II 293.931 nm and Mg I 285.213 nm in different kinds of samples were studied to indicate the validity of our network; the results are shown in Table 3. There are also some other quantitative results in Fig. S4 and Table S2 in the Supplementary Material. We can see that the spectral intensities could be fitted very well by the calibration curves, although they come from totally different kinds of samples, which illustrates the effectiveness of our proposal. The R2 of calibration curves of Mn and Mg increased from 0.4151 and 0.0244 to 0.9952 and 0.9998. The RMSE and MRE of Mn II 293.931 nm in aluminum alloy and micro-alloyed steel were improved from 0.4326 and 0.4457 to 0.0042 and 0.0098 with our model. In the same way, the RMSE and MRE of Mg I 285.213 nm in pressed samples declined from 0.1382 and 0.6330 to 0.0013 and 0.0064, respectively, proving that SISTIFD achieves the SOTA performance.

    2.5.4 Experimental condition 4 self-absorption

    Self-absorption is due to the absorption of photons emitted from the inner layer by the outer plasma, and it is another reason for inaccuracy in LIBS quantitation because of its nonlinear performance. To overcome the potential self-absorption, SISTIFD is designed to ameliorate this effect to improve the accuracy and precision of LIBS quantification. Considering that potassium element (K) is prone to induce self-absorption and is easy to obtain, we use K I 766.490 nm in K2CO3-soil mixed samples and K I 769.896 nm in potash feldspar samples for the experiment.

    The calibration curves of K I 766.490 nm are shown in Figs. 6(g) and 6(h) as an example; the results are given in Table 3. For the original data, with the increasing of K content, the spectral intensity value grows more and more slowly, and the calibration curve is obviously concave and can no longer be fitted by a straight line. Using exponential fitting, the RMSE and MRE were 0.4266, 0.3669 and 0.2138, 0.3852, respectively. Corrected by our model, the value of R2 was improved to 0.9995 and 0.9989 even using the linear fitting, and RMSE and MRE decreased to 0.0041, 0.0053 and 0.0019, 0.0046, which proves that our proposal can effectively eliminate the self-absorption effect and improve the quantitative accuracy.

    2.6 Ablation Study and Efficiency

    In this experiment, we analyzed the necessity and efficiency of PISA-Net, as shown in Table 4. First, we examined the model size by changing the number of channels. In this case, the model degraded regardless of whether the parameter count increased or decreased. Next, we removed several key structures, including attention net, soft thresholding, and skip connection. However, the absence of them brought a larger loss. This is because certain features were not accurately extracted. Finally, we compared the performance of different loss functions. The conclusion is that the weighted loss we used is most beneficial for the model.

    • Table 4. Evaluation of models with varying architectures. Each variation (columns) combined by different components (rows) is indicated by ticks (√), e.g., variation “large” is a model with 98.2 M parameters using attention, skip connection, soft thresholding, and setting loss coefficient α=0.6 in the pre-training stage while α=0.2 in the fine-tune stage.

      Table 4. Evaluation of models with varying architectures. Each variation (columns) combined by different components (rows) is indicated by ticks (√), e.g., variation “large” is a model with 98.2 M parameters using attention, skip connection, soft thresholding, and setting loss coefficient α=0.6 in the pre-training stage while α=0.2 in the fine-tune stage.

      PropertySmallLargeNo attentionNo soft thresholdingNo skip connectionLinearity loss onlyPISA-Net (pre-trained)PISA-Net
      8.8 M parameters
      26.4 M parameters
      98.2 M parameters
      Attention
      Soft thresholding
      Skip connection
      Loss coefficient α0.6 → 0.20.6 → 0.20.6 → 0.20.6 → 0.20.6 → 0.20.6 → 0.20.60.6 → 0.2
      Loss (relative)15.87511.62339.8558.3319.74420.815102.3241.000
      R20.9230.9740.9820.9910.9890.9980.9910.999

    As for the runtime, our approach takes an average of 7 ms (143 Hz) for calculation on a computer with Intel Xeon 6230R CPU and Nvidia GeForce RTX 2080Ti GPU, which can meet the requirement of real-time detection. In general, the total time cost for the mainstream detection is about only 1 s (1 Hz), which means our scheme does not lead to a significant increase in detection time compared to conventional techniques.

    3 Discussion

    Here, we report what we believe is a novel, accurate, effective, and universal scheme SISTIFD for improving the performance of plasma spectroscopy, especially in cross-interference scenarios. The proposed SISTIFD relies on the innovative construction of a self-supervised learning approach based on the relationship between spectral twin signals and physical parameters. The first stage of the workflow is the experimental strategy based on image-spectrum twin information fusion acquisition, which contributes to much more information than conventional techniques about plasma. The second stage is an AI-driven framework called PISA-Net, which injects plasma images and spectra into different pipelines and applies residual attention cells to conduct self-supervised training, enabling one to obtain the plasma parameters and compute appropriate correction coefficients for accurate analysis. Using LIBS and GD-OES as the testbed, we achieved state-of-the-art performance and all the evaluation indices, including RMSE, MRE, and RSD, declined 2 orders of magnitude, demonstrating the effectiveness and universality of SISTIFD.

    The improvements based on conventional single-spectrum techniques have shown remarkable capability in the face of single interference or simple applications, but insufficient information in the spectrum under cross-interference conditions is the barrier to widespread application. Detecting multidimensional twin signals of plasma is a novel idea that can provide a wealth of information to better understand the behavior of plasma. Although deep learning-based methods showed higher efficiency than rule-based methods, they were not widely adopted. The performance of supervision-based neural networks relies on the quality and quantity of training data. However, ideal spectra without interferences and noises cannot be experimentally obtained. To solve this problem, we propose a self-supervised network to learn the pattern from unordered data to physical parameters as part of our quantitative scheme. SISTIFD provides an efficient, accurate, precise, and robust workflow for quantitative analysis in high throughput, cross-interference, complex, and diverse applications, by retaining the rapid, easy to operate, and in situ advantages of conventional spectroscopic techniques. In addition, the scheme introduces excessive time overhead; of course, some cost increase is necessary (an extra ICCD camera is needed).

    With the development of advanced analytical techniques over the past years, the universality of a new proposal interdisciplinary field is becoming increasingly important. To the best of our knowledge, existing spectroscopic techniques only focused on the problems in their own field, and there is no one universal method that works for the entire field of plasma spectroscopy. SISTIFD is a new method that establishes a unified framework for materials analysis based on plasma spectroscopy. The achievement has been verified in LIBS and GD-OES, and we believe that our concept can also be applied to other techniques, such as ICP-OES, spark-emission spectroscopy, and others. This may boost multispectral technology because all the above-mentioned concepts still rely on the need for more spectral information acquisition and more accurate algorithms. One can expect other novel universal methods in the future, and we believe our proposal is the first facilitator for a universal analysis paradigm, making spectroscopy more accessible to academia as well as industry.

    Lianbo Guo is a full professor at the Wuhan National Laboratory for Optoelectronics (WNLO), Huazhong University of Science and Technology (HUST). He is a spectroscopy scientist and his research interests include laser-induced breakdown spectroscopy (LIBS), Raman spectroscopy, photoacoustic spectroscopy, etc., and their applications in fields such as biology, geology, and metallurgy. He has authored over 100 papers and served as a reviewer for over 40 international journals.

    Biographies of the other authors are not available.

    [34] A. L. Maas, A. Y. Hannun, A. Y. Ng. Rectifier nonlinearities improve neural network acoustic models, 3(2013).

    Tools

    Get Citation

    Copy Citation Text

    Feiyu Guan, Yuanchao Liu, Xuechen Niu, Weihua Huang, Wei Li, Peichao Zheng, Deng Zhang, Gang Xu, Lianbo Guo, "AI-enabled universal image-spectrum fusion spectroscopy based on self-supervised plasma modeling," Adv. Photon. Nexus 3, 066014 (2024)

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Research Articles

    Received: Jun. 26, 2024

    Accepted: Oct. 15, 2024

    Published Online: Dec. 9, 2024

    The Author Email: Xu Gang (gang_xu@hust.edu.cn), Guo Lianbo (lbguo@hust.edu.cn)

    DOI:10.1117/1.APN.3.6.066014

    Topics