Spectroscopy and Spectral Analysis, Vol. 45, Issue 6, 1566 (2025)

Study on Improving Stability of Near-Infrared Spectra by Normal Distribution Screening Method

LI Xiao-xing1,2, XIAO Jin-feng1,*, ZHANG Hong-ming2, LÜ2,3, YIN Xiang-hui1, ZHAO Ming4, MA Fei5, FU Jia2, HU Yan1,2, LI Zhi-hao1,2, WANG Fu-di2, SHEN Yong-cai6, and DAI Shu-yu7
Author Affiliations
  • 1College of Electrical Engineering, University of South China, Hengyang 421001, China
  • 2Institute of Plasma Physics, Hefei Institutes of Physical Sciences, Chinese Academy of Sciences, Hefei 230031, China
  • 3Institute of Plasma Physics, Hefei Institutes of Physical Sciences, Chinese Academy of Sciences, Hefei 230031, China
  • 4College of Biological and Food Engineering, Anhui Polytechnic University, Hefei 241000, China
  • 5College of Food and Biological Engineering, Hefei University of Technology, Hefei 230009, China
  • 6College of Physics and Materials Engineering, Hefei Normal University, Hefei 230601, China
  • 7College of Physics, Dalian University of Technology, Dalian 116024, China

    In near-infrared (NIR) online monitoring of fermentation, oxygen must be passed continuously into the fermentation broth to support microbial growth and metabolism, and this generates bubbles. When a bubble passes in front of the probe, it disturbs the intensity of the NIR spectrum. To eliminate the abnormal spectra caused by bubbles and reduce spectral fluctuation, this study proposes a normal distribution screening method. A 600 g glucose solution with a mass fraction of 10% was prepared. Every 30 s, 2 g of this solution was added to a reactor containing 600 mL of distilled water and stirred well, and the glucose mass fraction in the reactor was calculated and recorded. Bubbles were generated by passing oxygen into the bottom of the reactor, and the NIR spectra of the glucose solution in the reactor were collected with an NIR spectrometer. The anomalous spectra affected by bubbles were removed by four methods: principal component analysis (PCA) combined with the Mahalanobis distance, the Euclidean distance method, isolation forest, and the normal distribution screening method. In each case the remaining spectra were randomly divided into a calibration set and a prediction set at a ratio of 4:1. After spectral preprocessing, a glucose concentration prediction model was built on the calibration set using partial least squares regression (PLSR) and then applied to the prediction set. The correlation coefficient and root mean square error of the calibration set and of the prediction set were then compared.
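
The abstract does not spell out how the normal distribution screening is computed, so the following is only a minimal sketch of one plausible reading: each spectrum is reduced to a single score, the scores of bubble-free spectra are assumed to be roughly normally distributed, and spectra whose score lies more than k standard deviations from the mean are discarded, repeating until nothing more is removed. The function name `normal_screen`, the use of mean intensity as the score, the threshold `k=2.0`, and the toy data are all assumptions made for illustration, not the authors' implementation.

```python
from statistics import mean, stdev


def normal_screen(spectra, k=2.0, max_rounds=10):
    """Return indices of spectra kept after iterative k-sigma screening.

    Assumption: the per-spectrum score is the mean intensity over all
    wavelengths; the paper may use a different statistic.
    """
    scores = [mean(s) for s in spectra]
    kept = list(range(len(spectra)))
    for _ in range(max_rounds):
        if len(kept) < 3:
            break  # too few spectra left to estimate a spread
        kept_scores = [scores[i] for i in kept]
        mu = mean(kept_scores)
        sigma = stdev(kept_scores)
        if sigma == 0:
            break  # all remaining scores identical
        survivors = [i for i in kept if abs(scores[i] - mu) <= k * sigma]
        if len(survivors) == len(kept):
            break  # converged: no spectrum was rejected this round
        kept = survivors
    return kept


# Toy data (hypothetical): nine stable spectra and one with a bubble-like
# intensity drop. Each "spectrum" is just three intensity values.
levels = [1.00, 1.01, 0.99, 1.02, 0.98, 1.00, 1.01, 0.99, 1.00, 0.60]
spectra = [[0.9 * v, v, 1.1 * v] for v in levels]

kept = normal_screen(spectra)
print("kept indices:", kept)
```

In this toy run the last spectrum is rejected in the first round and the remaining nine pass the second round. The threshold matters with small batches: a single outlier among n samples can never exceed a z-score of (n-1)/sqrt(n), which is about 2.85 for n = 10, so a 3-sigma rule would not flag it here.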
The models built after removing the bubble-affected spectra with the four methods gave the following results. With PCA combined with the Mahalanobis distance, the calibration set correlation coefficient Rc² was 0.998 208 and the root mean square error RMSECV was 0.000 764; the prediction set correlation coefficient Rp² was 0.997 994 and the root mean square error RMSEP was 0.000 764. With the Euclidean distance method, Rc² was 0.998 628 and RMSECV was 0.000 652; Rp² was 0.998 628 and RMSEP was 0.000 655. With isolation forest, Rc² was 0.998 255 and RMSECV was 0.000 739; Rp² was 0.998 132 and RMSEP was 0.000 740. With the normal distribution screening method, Rc² was 0.998 641 and RMSECV was 0.000 645; Rp² was 0.998 628 and RMSEP was 0.000 636. Of the four methods, normal distribution screening gave the highest Rc² and the lowest RMSECV and RMSEP, indicating that it reduces spectral intensity fluctuation and removes abnormal spectra more effectively than the other methods.
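
For readers who want to reproduce this kind of comparison, the two figures of merit reported above can be computed from reference and predicted concentrations as sketched below. This is an illustration under assumptions: the abstract calls the first quantity a correlation coefficient, and the sketch uses the coefficient of determination, which may differ from the definition used in the paper. The function names and the four sample values are hypothetical and are not data from the study.

```python
from math import sqrt


def rmse(y_true, y_pred):
    """Root mean square error between reference and predicted values."""
    n = len(y_true)
    return sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical glucose mass fractions: reference values and model predictions.
reference = [0.01, 0.02, 0.03, 0.04]
predicted = [0.011, 0.019, 0.031, 0.039]

print("RMSE:", rmse(reference, predicted))
print("R^2: ", r_squared(reference, predicted))
```

Applying the same two functions to the calibration set and to the prediction set yields one pair of values for each, which is the form in which the four screening methods are compared.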

    LI Xiao-xing, XIAO Jin-feng, ZHANG Hong-ming, LÜ, YIN Xiang-hui, ZHAO Ming, MA Fei, FU Jia, HU Yan, LI Zhi-hao, WANG Fu-di, SHEN Yong-cai, DAI Shu-yu. Study on Improving Stability of Near-Infrared Spectra by Normal Distribution Screening Method[J]. Spectroscopy and Spectral Analysis, 2025, 45(6): 1566

    Paper Information

    Received: Jul. 25, 2024

    Accepted: Jun. 27, 2025

    Published Online: Jun. 27, 2025

    The Author Email: XIAO Jin-feng (806609919@qq.com)

    DOI:10.3964/j.issn.1000-0593(2025)06-1566-12
