Electronics Optics & Control, Volume. 29, Issue 4, 117(2022)

A Sampled Data Cleaning Technology Based on Regression Model

LI Honglie, XIA Dong, and WANG Qian
Author Affiliations
  • [in Chinese]
  • show less

    Data cleaning is an important content in data preprocessingbut problems such as outlier missing and outlier influence exist in current data cleaning technology.A dynamic and fine identification algorithm for outliers based on regression model is proposedin which the regressive values of two data segments ahead of and after the current position are set as referenced values after the elimination of potential outlierswhich is used together with the limits of parameters change rate to give the judgement of outliers.Data cleaning procedure based on regression model is also givenin which steps of coarse identificationfine identification and regressive estimation are adopted to improve the efficiency and effects of data cleaning.A set of real aeronautical data sampled is used to certify the proposed methodand the processing results show that the data cleaning technology based on regression model is able to identify and estimate outliers accurately.

    Tools

    Get Citation

    Copy Citation Text

    LI Honglie, XIA Dong, WANG Qian. A Sampled Data Cleaning Technology Based on Regression Model[J]. Electronics Optics & Control, 2022, 29(4): 117

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Mar. 2, 2021

    Accepted: --

    Published Online: Apr. 22, 2022

    The Author Email:

    DOI:10.3969/j.issn.1671-637x.2022.04.022

    Topics