Abstract
"Big data" refers to the collection and analysis of enormous volumes of data drawn from users, sensors, healthcare
providers, and companies. The Hadoop framework stores, manages, and distributes such data across multiple server
nodes. This article highlights Big Data security issues, including vulnerabilities in the Hadoop Distributed File System (HDFS),
the core layer of the architecture. The methodology includes setting up a Hadoop environment, integrating
Kerberos for authentication, enabling HDFS encryption zones, implementing SSL/TLS for data in transit, and utilizing Apache
Ranger and Apache Knox for access control and perimeter security, respectively. The results demonstrate the successful
implementation of all planned security measures, achieving a robust security framework for the Hadoop cluster. Performance
testing indicates a 10% reduction in processing speed due to the security features, a trade-off deemed acceptable given the
significant enhancement in data protection. Compliance testing confirms adherence to GDPR and CCPA regulations, ensuring
legal and secure data management. Overall, the study underscores the feasibility of integrating comprehensive security measures within a Hadoop environment, balancing the need for robust data protection with minimal performance impact. Future work includes optimizing security configurations to further mitigate performance degradation and exploring advanced security measures for enhanced threat detection and response. This methodology provides a scalable and secure solution for managing large datasets in compliance with global data protection standards.
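As a concrete illustration of one methodology step, the sketch below shows how an HDFS encryption zone could be created programmatically with Hadoop's Java client API. It is a minimal sketch under stated assumptions, not the paper's implementation: it presumes a Kerberos-enabled cluster with a running Hadoop KMS, and that a key named "reportsKey" has already been provisioned in the KMS. The principal, keytab path, and zone directory are hypothetical placeholders.

```java
// Minimal sketch: create an HDFS encryption zone on a Kerberos-secured cluster.
// Assumes core-site.xml/hdfs-site.xml enable Kerberos and point at a Hadoop KMS,
// and that the KMS key "reportsKey" already exists (e.g. via `hadoop key create`).
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.hdfs.client.HdfsAdmin;
import org.apache.hadoop.security.UserGroupInformation;

public class EncryptionZoneSetup {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        UserGroupInformation.setConfiguration(conf);
        // Hypothetical admin principal and keytab path, shown only for illustration.
        UserGroupInformation.loginUserFromKeytab(
                "hdfs-admin@EXAMPLE.COM", "/etc/security/keytabs/hdfs.keytab");

        FileSystem fs = FileSystem.get(conf);
        Path zone = new Path("/secure/reports"); // hypothetical zone directory
        fs.mkdirs(zone);                         // zone directory must exist and be empty

        // Mark the directory as an encryption zone backed by the named KMS key;
        // files written under it are then encrypted transparently by HDFS clients.
        HdfsAdmin admin = new HdfsAdmin(FileSystem.getDefaultUri(conf), conf);
        admin.createEncryptionZone(zone, "reportsKey");
        System.out.println("Encryption zone created at " + zone);
    }
}
```
The same result is commonly achieved from the command line with `hdfs crypto -createZone -keyName <key> -path <dir>`; the programmatic form is shown here only to make the moving parts (Kerberos login, KMS key, zone directory) explicit.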