Protecting the Hadoop Cluster on the Basis of Big Data Security

Journal of Artificial Intelligence, Machine Learning and Data Science 1 (3):831-837 (2023)
  Copy   BIBTEX

Abstract

Gathering and analyzing enormous volumes of data is known as "big data," and it includes information from users, sensors, healthcare providers, and companies. Using the Hadoop framework, large amounts of data are stored, managed, and dispersed across multiple server nodes. Big Data issues, including security holes in the Hadoop Distributed File System (HDFS), the architecture's core layer, are highlighted in this article. The methodology includes setting up a Hadoop environment, integrating Kerberos for authentication, enabling HDFS encryption zones, implementing SSL/TLS for data in transit, and utilizing Apache Ranger and Apache Knox for access control and perimeter security, respectively. The results demonstrate the successful implementation of all planned security measures, achieving a robust security framework for the Hadoop cluster. Performance testing indicates a 10% reduction in processing speed due to the security features, a trade-off deemed acceptable given the significant enhancement in data protection. Compliance testing confirms adherence to GDPR and CCPA regulations, ensuring legal and secure data management. Overall, the study underscores the feasibility of integrating comprehensive security measures within a Hadoop environment, balancing the need for robust data protection with minimal performance impact. Future work includes optimizing security configurations to further mitigate performance degradation and exploring advanced security measures for enhanced threat detection and response. This methodology provides a scalable and secure solution for managing large datasets in compliance with global data protection standards.

Other Versions

No versions found

Links

PhilArchive

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

OPTIMIZED CLOUD SECURE STORAGE: A FRAMEWORK FOR DATA ENCRYPTION, DECRYPTION, AND DISPERSION.S. Yoheswari - 2024 - Journal of Science Technology and Research (JSTAR) 5 (1):415-426.
Next-Generation Cloud Security Frameworks:Balancing Privacy, Compliance, and Data Protection in a Digital-First Era.Varad Upadhye Atharva Hasabnis - 2025 - International Journal of Advanced Research in Education and Technology (Ijarety) 12 (2):453-457.
Guardians of the Cloud: Securing the Digital Sky.Brijesh Yadav Kashyap Bhalodiya - 2022 - International Journal of Multidisciplinary Research in Science, Engineering, Technology and Management 9 (12):3185-3188.
Virtual Machine for Big _Data in Cloud Computing (13th edition).Banupriya I. Manivannan B., - 2024 - International Journal of Innovative Research in Science, Engineering and Technology 13 (11):18380-18386. Translated by Manivannan B.
Guardians of the Cloud: Securing the Digital Sky.Sree M. Harini - 2022 - International Journal of Multidisciplinary and Scientific Emerging Research 10 (4):1611-1614.
Enhanced Secure Cloud Storage: An Integrated Framework for Data Encryption and Distribution.M. Arulselvan - 2024 - Journal of Science Technology and Research (JSTAR) 5 (1):416-427.
Cloud-Based Secure Storage: A Framework for Efficient Encryption, Decryption, and Data Dispersion.M. Arul Selvan - 2024 - Journal of Science Technology and Research (JSTAR) 5 (1):427-434.
Hybrid Blockchain and Big Data Framework for PrivacyPreserving Medical Data Sharing.P. Selvaprasanth - 2024 - Journal of Theoretical and Computationsl Advances in Scientific Research (Jtcasr) 8 (1):1-7.
Multi-Cloud Environments: Reducing Security Risks in Distributed Architectures.Sharma Sidharth - 2021 - Journal of Artificial Intelligence and Cyber Security (Jaics) 5 (1):1-6.
Building Scalable Data Warehouses for Financial Analytics in Large Enterprises.Vijayan Naveen Edapurath - 2024 - International Journal of Innovative Research and Creative Technology 10 (3):1-10.

Analytics

Added to PP
2025-03-09

Downloads
19 (#1,162,759)

6 months
19 (#156,502)

Historical graph of downloads
How can I increase my downloads?

Citations of this work

Autonomous Cloud Operations: Self-Optimizing Cloud Systems Powered By AI and Machine Learning.G. Geethanjali - 2025 - International Journal of Innovative Research in Computer and Communication Engineering 13 (3):2138-2143.
The Future of Serverless Computing: Pushing the Boundaries of Cost Efficiency and Scalability in the Cloud.Satish Patkar Shraddha Sayali - 2025 - International Journal of Advanced Research in Arts, Science, Engineering and Management 12 (1):359-363.
Azure AI-Driven Automation for Supply Chain and Logistics Management In.Kshirsagar Pranav - 2025 - International Journal of Multidisciplinary Research in Science, Engineering, Technology and Management (Ijmrsetm) 12 (3):748-753.
Azure Integration with the Metaverse: Opportunities and Challenges for Future Enterprise Ecosystems.Magar Sanket - 2025 - International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering (Ijareeie) 14 (2):458-464.
Cloudshield: The Future of Cloud Security.Asma Tabassum Ateeb Baig H. - 2025 - International Journal of Advanced Research in Education and Technology 12 (2):493-497.

View all 23 citations / Add more citations

References found in this work

No references found.

Add more references