Laser & Optoelectronics Progress, Volume. 62, Issue 1, 0100011(2025)

Application of Infrared Spectroscopy Combined with Chemometrics in Material Evidence Analysis

Ruiting Feng*, Hongda Li, Zhichao Yang, and Yumu Liu
Author Affiliations
  • School of Criminal Science and Technology, Criminal Investigation Police University of China, Shenyang 110035, Liaoning , China
  • show less
    Figures & Tables(11)
    Chemometrics application flow chart
    Discriminant function diagrams of PCA+FDA model. (a) Raw spectral data; (b) first derivative spectral data; (c) fusion spectral data
    Principal component analysis of camellia oil and adulterated oil. (a) MIR spectra; (b) NIR spectra
    PLS regression of predicted vs actual soybean oil content in pure camellia oil samples. (a) ATR-MIR spectra; (b) FODR-NIR spectra
    Comparative ATR-FTIR spectra of asian elephant, indian leopard, and royal Bengal tiger
    PLS-DA plot to discriminate the blood spectra of asian elephant, indian leopard, and royal Bengal tiger
    Infrared spectra of toner before and after baseline correction
    System clustering diagram of different laser printer toner
    • Table 1. Several spectral preprocessing methods commonly used in infrared spectroscopy

      View table

      Table 1. Several spectral preprocessing methods commonly used in infrared spectroscopy

      AimMethodExplanation
      Data enhancement transformMean centeringBefore establishing a model, it is often necessary to adopt a data enhancement algorithm to eliminate redundant information in the collected data, so as to enhance the differences between data samples, so as to effectively improve the representativeness and prediction ability of the built model
      Autoscaling
      Normalization
      SmoothingSavitzky-Golay convolution smoothingSmoothing is to eliminate the random noise in the spectral signal and improve the signal to noise ratio of the sample signal, which can improve the prediction effect of the model to a certain extent
      Moving window smoothing
      Baseline correction1st derivativeIn spectrum preprocessing, derivative algorithm is mainly used to analyze the rate of change of spectral data, which is very effective for eliminating baseline drift, smoothing background interference and resolving overlapping peaks
      2nd derivative
      Scattering correctionStandard normal variate (SNV)They are mainly used to eliminate the influence of scattering on the spectrum caused by uneven particle distribution and particle size difference
      Multiplicative scatter correction(MSC)
    • Table 2. Comparison of chemical pattern recognition methods commonly used in infrared spectroscopy

      View table

      Table 2. Comparison of chemical pattern recognition methods commonly used in infrared spectroscopy

      ClassificationPattern recognition methodBasic ideaCharacteristicRange of applicationLimitationReference
      SupervisedPLS-DABy way of projection, the predictor and observed variables are projected into a new space, and then a linear regression model is found in this new space to achieve the classification or discrimination taskNoise filtering, dimensionality reduction, discrimination and classification, sample prediction, good interpretation abilityThe situation where the number of explanatory variables is large and there is multicollinearity, the number of sample observations is small and the interference noise is largeWhen there is a high correlation between variables, PLS-DA may overfit the data11-12
      LDAMaximize the distance between classes and minimize the distance within classesDimensionality reduction, discrimination and classification, high computational efficiencySample data with clear class label, high dimensional feature space and approximate Gaussian distributionIt is not suitable for samples with non-Gaussian distribution and cannot handle nonlinear problems13-15
      SupervisedKNNBy majority rule, a sample belongs to a class if most of its neighbors belong to that classNoise filtering, discrimination and classification, sample prediction, no training requiredMultiple classification problems, nonlinear problems and rare event classificationLarge computational complexity and sensitivity to distance measurement16
      SVMAn optimal hyperplane is found to divide the samples so that the distance between the two types of samples closest to the plane is maximizedDiscrimination and classification, sample predictionBinary classification problem, small sample high-dimensional data set, separable data setSensitive to parameter adjustment and kernel function selection17-18
      SIMCAPCA analysis is used to build a model for each class, calculate the distance between the unknown samples and these models, and determine the category of the unknown samples according to the distance discriminationDimensionality reduction, discrimination and classification, sample predictionSamples with clear category labelsLarge computational complexity19-20
      Non-supervisedPCAUsing a small number of basic principal components (PCS) to explain the correlation between a large number of variables, the dimensionality of the original data set is reduced and the original information content is preserved as much as possibleDimensionality reduction, sample classification, maximum retention of original variable informationData sets with a large number of features and sufficient sample size, especially when dimensionality reduction is needed to simplify the problemWhen the sample size is insufficient or there are too many variables, PCA may not be effective in extracting representative principal components21-22
      HCAThe variables or samples that are less similar are aggregated into one class, and the variables or samples that are more similar are aggregated into one classDiscrimination and classification, intuitiveness and visualizationSamples that are not clearly labeled or classifiedIt is sensitive to distance calculation methods and feature weights and difficult to deal with large data sets23
    • Table 3. Summary of chemometric methods used in physical evidence analysis

      View table

      Table 3. Summary of chemometric methods used in physical evidence analysis

      Research objectSampleAimAnalytical techniqueApplied statistical methodReference
      Trace evidenceHousehold paintsClassificationFTIRPCA27
      Textile fibresIdentificationATR-FTIRPCA28
      Automobile lampshadesClassificationFTIRPCA-FDA29
      ExplosivesIdentificationNIRPLS-DA30
      ExplosivesIdentificationNIRPLSR31
      Physical evidence of food, medicine, and environmental factorsIllicit liquorsOrigin identificationATR-FTIRLDA32
      Crocus sativusOrigin identificationNIR、MIRPLS-DA33
      Sika deer antler cap powderAdulteration detectionFTIRPCA-SVM34
      Camellia oilsAdulteration detectionMIR-ATR,FODR-NIRSIMCA,PLS35
      SoilPollutant concentrationNIRPLS36
      Biological evidenceBloodSpecies differentiationATR-FTIRPLS-DA38
      BloodGender and racial discriminationATR-FTIRPLS-DA39
      BloodPrediction of the formation time of bloodstainsATR-FTIRMLR,PLSR40
      HairIdentificationATR-FTIRPLS-DA41
      Seminal fluidClassification and identificationATR-FTIRPCA-LDA,PLSR42
      Document evidenceErasable pensClassificationFTIRPCA44
      Black tonerClassificationμ-FTIRPCA-HCA45
      BanknotesIdentificationNIRSIMCA,SPA-LDA46
      PapersIdentificationATR-FTIRPCA47
      Paper express document bagsClassificationFTIRPCA,FDA48
    Tools

    Get Citation

    Copy Citation Text

    Ruiting Feng, Hongda Li, Zhichao Yang, Yumu Liu. Application of Infrared Spectroscopy Combined with Chemometrics in Material Evidence Analysis[J]. Laser & Optoelectronics Progress, 2025, 62(1): 0100011

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category: Reviews

    Received: Apr. 15, 2024

    Accepted: May. 22, 2024

    Published Online: Jan. 3, 2025

    The Author Email:

    DOI:10.3788/LOP241107

    CSTR:32186.14.LOP241107

    Topics