The Indonesian Journal of Computer Science
Vol. 13 No. 2 (2024): The Indonesian Journal of Computer Science (IJCS)

Distributed Systems for Machine Learning in Cloud Computing: A Review of Scalable and Efficient Training and Inference

Sadiq, Shereen (Unknown)
R. M. Zeebaree, Subhi (Unknown)



Article Info

Publish Date
01 Apr 2024

Abstract

Traditional computer systems have been pushed to their limits as a result of the exponential rise of data and the rising complexity of machine learning (ML) models. As a result of its on-demand scalability and resource agility, cloud computing has emerged as the platform of choice for training and deploying large-scale machine learning models. However, in order to make good use of cloud resources for machine learning, it is necessary to make use of distributed systems. These systems are responsible for coordinating computations over several nodes in order to manage the demanding workloads. The purpose of this paper is to investigate the realm of distributed systems for machine learning in cloud computing, with a particular emphasis on training and inference that is both scalable and efficient. During the discussion on the need of distributed systems in machine learning, it was made clear why conventional single-machine techniques are not enough for the requirements of current machine learning and how distributed systems might help solve these difficulties. Scalability and Efficiency Considerations were reviewed in relation to the primary elements that contribute to the effectiveness of a distributed system for machine learning. These elements include task partitioning, communication overhead, fault tolerance, and resource optimization that were discussed. In the context of cloud computing, the purpose of this review research is to provide a complete overview of the fascinating topic of distributed systems for machine learning. In order to successfully traverse the intricate and ever-changing world of cloud-based machine learning, it provides vital insights and information.

Copyrights © 2024






Journal Info

Abbrev

ijcs

Publisher

Subject

Computer Science & IT Electrical & Electronics Engineering Engineering

Description

The Indonesian Journal of Computer Science (IJCS) is a bimonthly peer-reviewed journal published by AI Society and STMIK Indonesia. IJCS editions will be published at the end of February, April, June, August, October and December. The scope of IJCS includes general computer science, information ...