Journal of Terahertz Science and Electronic Information Technology , Volume. 23, Issue 5, 482(2025)

Sensitive data leakage risk prediction driven by data explicit and implicit relationships

LIANG Hua1, JIN Min2, YAN Hua1, HAN Shihai1, and LI Wei2
Author Affiliations
  • 1Electric Power Research Institute, Chongqing 401123, China, State Grid Chongqing Electric Power Company
  • 2Digitization Department, Chongqing 400014, China, State Grid Chongqing Electric Power Company
  • show less

    With the rapid development of Internet of Things(IoT), big data, and Artificial Intelligence(AI) technologies, massive amounts of data are being generated and utilized on an unprecedented scale. These data contain a large amount of sensitive information, and how to securely store sensitive data has become a realistic problem that needs to be solved. The existing data storage schemes usually focus on the direct protection of sensitive data, while ignoring the leakage risks associated with explicit and implicit associations between sensitive and non-sensitive data. The explicit and implicit relationships among data are deeply analyzed from the perspective of information entropy, and a method is proposed to quickly assess the explicit and implicit relationships and predict the leakage risk of sensitive data. By introducing the information Lift Ratio(LR) and the Probability of Information Control(PIC), the method can effectively identify the influence of non-sensitive data on the risk of sensitive data leakage. In the simulation experiments, the maximum single-attribute LR in the Statistical Property Dataset(SPD) is 0.308, and the joint-attribute LR can be up to 0.891, and the predicted value of the sensitive data leakage risk is significantly improved, up to 23.2%. The simulation results show that the method can effectively identify and cope with the security risks caused by explicit and implicit relationships, thus significantly improving the overall security level of sensitive data storage.

    Tools

    Get Citation

    Copy Citation Text

    LIANG Hua, JIN Min, YAN Hua, HAN Shihai, LI Wei. Sensitive data leakage risk prediction driven by data explicit and implicit relationships[J]. Journal of Terahertz Science and Electronic Information Technology , 2025, 23(5): 482

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Aug. 16, 2024

    Accepted: Jun. 5, 2025

    Published Online: Jun. 5, 2025

    The Author Email:

    DOI:10.11805/tkyda2024383

    Topics