Abstract
Data normalization is a crucial step in database management systems (DBMS), ensuring consistency,
minimizing redundancy, and enhancing query performance. Traditional methods of normalization in supermarket sales
databases often demand significant manual effort and domain expertise, making the process time-consuming and prone
to errors. This paper introduces an innovative machine learning (ML)-based framework to automate data normalization
in supermarket sales databases. The proposed approach utilizes both supervised and unsupervised ML techniques to
identify functional dependencies, detect anomalies, and suggest optimal schema transformations. Experiments on
supermarket sales datasets show substantial improvements in accuracy, scalability, and processing time compared to
traditional approaches. The results emphasize the potential of incorporating ML into database management practices to
boost operational efficiency and support better decision-making.