Spectroscopy and Spectral Analysis, Volume. 45, Issue 2, 386(2025)

Simulated Estimation of BOD Content in Water Bodies Based on PCA Transmission Spectrum Reconstruction With Noise Reduction

WANG Yi-ming1, WANG Cai-ling1, and WANG Hong-wei2、*
Author Affiliations
  • 1College of Computer Science, Xi'an Shiyou University, Xi'an 710065, China
  • 2School of Artificial Intelligence, Optics and Electronics, Northwestern Polytechnical University, Xi'an 710072, China
  • show less

    Biochemical oxygen demand (BOD) is an important indicator that can directly reflect water bodies' degree of organic pollution. Real-time monitoring of water BOD is significant in water resource protection and water environment improvement. The traditional BOD measurement method will consume a lot of human and material resources, and the measurement cycle is long, which can not quickly reflect the changing conditions of the water body, and can not realize the timely and effective early warning of sudden water pollution events. With the wide application of machine learning in the field of water monitoring, to solve the problem of difficulty in obtaining the input variables of the machine learning model and the existence of missing values, we further combine the hyperspectral technology to realize the accurate and rapid estimation of the BOD content of the water body. The raw spectral data of ten BOD standard liquids with different concentrations were collected, and 100 sets of transmission spectral data were obtained by whiteboard correction. A noise reduction technique based on PCA transmission spectra reconstruction is proposed, which utilizes the PCA algorithm to extract the principal component eigenvectors of the original transmission spectra and then reconstructs the whole dataset by using the first part of the principal component eigenvectors whose cumulative variance contribution rate reaches a certain percentage. The first 2, 10, and 15 principal component feature vectors were used in the experiment to reconstruct the transmission spectral data and compared with the traditional noise reduction methods for spectral data. We combined the SVM model and BP neural network model to establish a model for estimating the BOD content of water bodies. The results showed that the BPNN model was superior to the SVM model regarding regression accuracy and degree of fit, and the noise reduction effect was more significant. The model using the first 2 feature vectors reconstructed for noise reduction did not fit as expected, probably due to the loss of information. The BPNN model with the first 10 feature vectors reconstructed for noise reduction performed the best with an RMSE of 0.040 6 and an R2 of 0.980 3. The reconstruction of the first 15 feature vectors did not improve the noise reduction effect, probably because more than 10 feature vectors added redundant information. The experiments verified the feasibility of noise reduction using PCA reconstruction of transmission spectra and provided a new idea for estimating the BOD content of water bodies.

    Tools

    Get Citation

    Copy Citation Text

    WANG Yi-ming, WANG Cai-ling, WANG Hong-wei. Simulated Estimation of BOD Content in Water Bodies Based on PCA Transmission Spectrum Reconstruction With Noise Reduction[J]. Spectroscopy and Spectral Analysis, 2025, 45(2): 386

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Received: Sep. 5, 2023

    Accepted: May. 21, 2025

    Published Online: May. 21, 2025

    The Author Email: WANG Hong-wei (wanghongwei@nwpu.edu.cn)

    DOI:10.3964/j.issn.1000-0593(2025)02-0386-08

    Topics