EXPLORE SCALABLE AND COST-EFFECTIVE AI DEPLOYMENTS, INCLUDING DISTRIBUTED TRAINING, MODEL SERVING, AND REAL-TIME INFERENCE ON HUMAN TASKS

International Journal of Advances in Engineering Research 24 (1):7-27 (2022)
  Copy   BIBTEX

Abstract

The rapid growth of Artificial Intelligence (AI) has sparked the demand for scalable, efficient, and cost-effective deployment solutions. In particular, these methods are crucial for handling the increasing computing demand and complexity of AI models in human-centric tasks like real-time picture classification, speech recognition, and natural language processing. The three main topics of this paper's exploration of scalable AI deployment methodologies are real-time inference, model serving, and distributed training. Optimized deployment pipelines, parallel processing, and cloud infrastructure are essential for striking a balance between performance and cost. This study offers a thorough analysis of various technologies, looking at their cost-effectiveness, suitability for use in real-world settings, and capacity to handle huge datasets. Along with evaluations, the article provides a comparative study of various approaches based on cost, efficiency, and scalability parameters. Tables are used to highlight the differences between the approaches. A survey of pertinent literature covering the years 2003 to 2022 gives context for the advancement of AI deployment technology

Other Versions

No versions found

Links

PhilArchive

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Building Scalable MLOps: Optimizing Machine Learning Deployment and Operations.Vijayan Naveen Edapurath - 2024 - International Journal of Scientific Research in Engineering and Management 8 (10):1-5.
Autonomous Cloud Operations: Self-Optimizing Cloud Systems Powered By AI and Machine Learning.G. Geethanjali - 2025 - International Journal of Innovative Research in Computer and Communication Engineering 13 (3):2138-2143.
Artificial Intelligence and Automation in Cloud Cost Management: Predicting and Optimizing Cloud Spend.Rewatkar Janhavi - 2025 - International Journal of Multidisciplinary and Scientific Emerging Research (Ijmserh) 13 (1):123-128.
Strengthening Cloud Security with AI-Based Intrusion Detection Systems.Sharma Sidharth - 2024 - Journal of Science Technology and Research (JSTAR) 5 (1):658-663.
Enhancing Cloud Security with AI-Based Intrusion Detection Systems.Sharma Sidharth - 2024 - Journal of Science Technology and Research (JSTAR) 5 (1):658-664.
AI-Powered Cloud Security: Revolutionizing Cyber Defense in the Digital Age.V. Talati Dhruvitkumar - 2024 - International Journal of Multidisciplinary Research in Science, Engineering and Technology 7 (3):4762-4768.
Optimizing AI Models for Biomedical Signal Processing Using Reinforcement Learning in Edge Computing.A. Manoj Prabaharan - 2024 - Journal of Artificial Intelligence and Cyber Security (Jaics) 8 (1):1-7.
Power Consumption and Heat Dissipation in AI Data Centers: A Comparative Analysis.Krishnaiah Narukulla Krishna Chaitanya Sunkara - 2025 - International Journal of Innovative Research in Science, Engineering and Technology 14 (3):1894-1899.

Analytics

Added to PP
2025-03-09

Downloads
56 (#422,263)

6 months
56 (#101,400)

Historical graph of downloads
How can I increase my downloads?

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references