DESIGN AND IMPLEMENTATION OF AI-DRIVEN MICROSERVICES ARCHITECTURE FOR SCALABLE MACHINE LEARNING WORKLOADS
Keywords:
Microservices, Machine Learning, AI-driven Architecture, Scalability, Cloud-native Systems, Containerization, OrchestrationSynopsis
This paper explores the design and implementation of a microservices-based architecture tailored for scalable machine learning (ML) workloads enhanced by artificial intelligence (AI). It discusses how microservices facilitate flexible deployment, scalability, and real-time model updates while allowing AI components to dynamically manage workload distribution. Two diagrams are included to illustrate the architecture and AI-driven orchestration, alongside two tables that compare legacy vs. microservices-based ML systems and performance metrics. This architecture is especially suited for cloud-native, continuous-integration environments, where AI drives decisions about resource scaling and service orchestration.
References
[1] Ribeiro, J. L., Figueredo, M., & Araujo, A. (2019). A microservice based architecture topology for machine learning deployment. IEEE.
[2] Gummadi, V. P. K. (2019). Microservices architecture with APIs: Design, implementation, and MuleSoft integration. Journal of Electrical Systems, 15(4), 130–134. https://doi.org/10.52783/jes.9328
[3] Felstaine, E., & Hermoni, O. (2018). Machine Learning, Containers, Cloud Natives, and Microservices. Taylor & Francis.
[4] Devarakonda, R. R. (2017). A Microservices-Based Approach for Scalable Deployment of Machine Learning Models. SSRN.
[5] Gummadi, V. P. K. (2020). API design and implementation: RAML and OpenAPI specification. Journal of Electrical Systems, 16(4). https://doi.org/10.52783/jes.9329
[6] Mohapatra, A., & Sehgal, N. (2018). Scalable Deep Learning on Cloud Platforms. IJTMH.
[7] Chishti, N., & Dine, F. (2018). Building Scalable and Resilient Enterprise Architectures. ResearchGate.
[8] Rodríguez-Gracia, D., et al. (2019). Microservices and Machine Learning Algorithms for Adaptive Green Buildings. Sustainability, 11(16), 4320.
[9] Lee, G. M., Um, T. W., & Choi, J. K. (2018). AI as a Microservice over 5G Networks. IEEE.
[10] Alipour, H., & Liu, Y. (2017). Online Machine Learning for Cloud Resource Provisioning. IEEE.
[11] Chakraborty, A., Ganguly, S., & Das, S. (2019). Architectural Pattern for XAI as a Service. Taylor & Francis.
[12] Magableh, B., & Almiani, M. (2019). Deep Q Learning for Self-Adaptive Microservices. IEEE Access.
Published
Series
Categories
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.