I. Introduction
Although Machines Learning, Deep learning, AI model building has become easily accessible and commonplace due to the availability of cheap compute [1] [2], ML skills and talent, ML tools /libraries/frameworks, and it is being increasingly employed to solve problems in various industries, the most technical debt in creating usable ML can be attributed to the process of actually productionalizing an ML model. This is because it requires considerable expertise across different skills like software development, data engineering, system design, DevOps and cloud apart from just data science and ML skills [3]–[5]. This becomes a roadblock to many individuals and small organizations or even big organizations where different teams that in fact even have these skills find it difficult to plan how the model will be put into production environment, how it will be used by the business applications, how it will be trained on new data, how it will be retired without affecting the service and how to make this process automatic or reduce manual interactions. Failure to execute this leads to wasted time and effort in only experimentation and other local optimizations versus an optimized, systematic, and automatic ML model service and pipeline roll-out. In this paper, taking the example of a Cloud Forensics ML model, we will see how to create and operationalize au-tomatic’ zero-touch and reusable ML training and serving pipelines using Acumos. [6],–[8] show other ways to create and operationalize pipelines using Acumos. These days, there are many other MLOps tools like Kubeflow [9] from Google, which is an open-source project and an end-to-end MLOps platform that helps do workflow orchestration, experiment tracking, model management, model deployment and has notebook workspace. Airflow [10] and Argo [11] are open-source projects for pipeline orchestration. MLflow [12] is another open-source project supported by Databricks and helps do experiment tracking and model management. Metaflow [13] has features for orchestrating ML pipelines. Pachyderm [14] has data versioning and pipelining features. Most of these MLOps solutions and Tensorflow Serving [15] and Seldon [16] also offer additional advanced capabilities like traffic routing etc. But Acumos offers some other distinct benefits like having inbuilt federation that allows using shared models, and having a design studio to help create AI pipelines consisting of many heterogeneous models.