Abstract
Abstract: This research employs a Random Forest classification model to predict and assess obesity and cardiovascular disease (CVD) risk based on a comprehensive dataset collected from individuals in Mexico, Peru, and Colombia. The dataset comprises 17 attributes, including information on eating habits, physical condition, gender, age, height, and weight. The study focuses on classifying individuals into different health risk categories using machine learning algorithms. Our Random Forest model achieved remarkable performance with an accuracy, F1-score, recall, and precision all reaching 97.23%. The model's success is attributed to its ability to effectively leverage attributes related to eating habits, physical condition, and demographic factors. Feature importance analysis reveals key factors contributing to accurate predictions. This paper contributes to the field of health analytics by demonstrating the effectiveness of Random Forest in predicting and categorizing health risks. The insights gained from this research have the potential to inform personalized health interventions and contribute to the advancement of precision health strategies. Furthermore, the study highlights the importance of leveraging machine learning techniques for health risk assessment in diverse populations.