Comparative Study of Python and C++ in High-Performance AI Model Deployment
Keywords:
AI deployment, Python, C++, performance benchmarking, inference optimization, real-time systems, model servingSynopsis
References
(1) Jia, Yangqing, et al. “Caffe: Convolutional Architecture for Fast Feature Embedding.” Proceedings of the ACM International Conference on Multimedia, ACM, 2014, pp. 675–678.
(2) Abadi, Martín, et al. “TensorFlow: A System for Large-Scale Machine Learning.” Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation, USENIX Association, 2016, pp. 265–283.
(3) Paszke, Adam, et al. “PyTorch: An Imperative Style, High-Performance Deep Learning Library.” Advances in Neural Information Processing Systems, vol. 32, Curran Associates Inc., 2019, pp. 8024–8035.
(4) Pedregosa, Fabian, et al. “Scikit-Learn: Machine Learning in Python.” Journal of Machine Learning Research, vol. 12, 2011, pp. 2825–2830.
(5) He, Kaiming, et al. “Deep Residual Learning for Image Recognition.” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, IEEE, 2016, pp. 770–778.
(6) Sirimalla A. Autonomous Performance Tuning Framework for Databases Using Python and Machine Learning. J Artif Intell Mach Learn & Data Sci 2023 1(4), 3139-3147. DOI: doi.org/10.51219/JAIMLD/adithya-sirimalla/642
(7) Chen, Tianqi, et al. “Learning to Optimize Tensor Programs.” International Conference on Learning Representations, OpenReview, 2018.
(8) Kraska, Tim, et al. “The Case for Learned Index Structures.” Proceedings of the 2018 International Conference on Management of Data, ACM, 2018, pp. 489–504.
(9) Li, Mu, et al. “Scaling Distributed Machine Learning with the Parameter Server.” Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation, USENIX Association, 2014, pp. 583–598.
(10) Howard, Andrew G., et al. “MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications.” arXiv preprint arXiv:1704.04861, 2017.
(11) Reddi, Vijay Janapa, et al. “MLPerf Inference Benchmark.” Proceedings of the ACM/IEEE International Symposium on Computer Architecture, IEEE, 2020, pp. 446–459.
(12) Micikevicius, Paulius, et al. “Mixed Precision Training.” International Conference on Learning Representations, OpenReview, 2018.
(13) Sirimalla, A. (2022). End-to-end automation for cross-database DevOps deployments: CI/CD pipelines, schema drift detection, and performance regression testing in the cloud. World Journal of Advanced Research and Reviews, 14(3), 871–889. https://doi.org/10.30574/wjarr.2022.14.3.0555
(14) Wang, Jue, et al. “TVM: An Automated End-to-End Optimizing Compiler for Deep Learning.” Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, USENIX Association, 2018, pp. 578–594.
Published
Series
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.