Risk Assessment Framework for Data Leakage Prevention Using Machine Learning Techniques

Authors

  • Xiaolan Wu Northeastern University Computer Science Author
  • Juan Li Shanghai Jiao Tong University Master of Science in Communication and Information Systems Author
  • Wenkun Ren Information Technology and Management, Illinois Institute of Technology, Chicago, IL Author

DOI:

https://doi.org/10.69987/AIMLR.2024.50305

Keywords:

Data leakage prevention, Machine learning, Anomaly detection, Risk assessment

Abstract

Data leakage remains a critical concern for organizations handling sensitive information, requiring effective risk assessment methods to identify potential vulnerabilities. This paper proposes a machine learning-based framework for assessing data leakage risks in corporate environments. Our approach focuses on analyzing user access patterns and data flow characteristics to identify anomalous behaviors that may indicate potential leakage risks. We employ anomaly detection algorithms, particularly Isolation Forest and Local Outlier Factor, to detect unusual data access activities. The framework includes a risk scoring mechanism that evaluates access requests based on user roles, data sensitivity levels, and contextual factors. Additionally, we explore the use of natural language processing for identifying sensitive content in unstructured documents, enabling more comprehensive risk assessment. The proposed method provides organizations with a practical tool for prioritizing security efforts and allocating resources to high-risk areas. This work supports data protection compliance requirements and helps organizations strengthen their data governance practices.

Author Biography

  • Wenkun Ren, Information Technology and Management, Illinois Institute of Technology, Chicago, IL

     

     

Downloads

Published

2024-07-16

How to Cite

Xiaolan Wu, Juan Li, & Wenkun Ren. (2024). Risk Assessment Framework for Data Leakage Prevention Using Machine Learning Techniques. Artificial Intelligence and Machine Learning Review , 5(3), 55-66. https://doi.org/10.69987/AIMLR.2024.50305

Share