Innovative Devops for Artificial Intelligence

Open access


Developing Artificial Intelligence is a labor intensive task. It implies both storage and computational resources. In this paper, we present a state-of-the-art service based infrastructure for deploying, managing and serving computational models alongside their respective data-sets and virtual environments. Our architecture uses key-based values to store specific graphs and datasets into memory for fast deployment and model training, furthermore leveraging the need for manual data reduction in the drafting and retraining stages. To develop the platform, we used clustering and orchestration to set up services and containers that allow deployment within seconds. In this article, we cover high performance computing concepts such as swarming, GPU resource management for model implementation in production environments with emphasis on standardized development to reduce integration tasks and performance optimization.

[1] Mo, Y. J., Kim, J., Kim, J.-K., Mohaisen, A. and Lee, W., Performance of deep learning computation with TensorFlow software library in GPU-capable multi-core computing platforms, In: 2017 Ninth International Conference on Ubiquitous and Future Networks (ICUFN). doi: 10.1109/icufn.2017.7993784.

[2] Agile Admin “What is DevOps: As of 19 May 2018.

[3] Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich, Computer Vision and Pattern Recognition Going Deeper with Convolutions, In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] K. Wongsuphasawat et al., Visualizing Dataflow Graphs of Deep Learning Models in TensorFlow, IEEE Transactions on Visualization and Computer Graphics, vol. 24, no. 1, pp. 1-12, Jan. 2018.

[5] Rutkowski, Leszek, Image classification with recurrent attention models, In: Artificial Intelligence and Soft Computing, 11th International Conference, ICAISC 2012, Zakopane, Poland, April 29 -May 3, 2012: Proceedings. Springer, 2012.

[6] Ian Miell, Aidan Hobson Sayers, Docker in Practice, 1st Manning Publications Co. 2016.

[7] Nishant Shukla, Machine Learning with TensorFlow Manning Publications Co. 2018.

[8] Srdjan Grubor, A practical guide to rapidly and efficiently mastering Docker containers, along with tips and tricks learned in the field Packt Publishing Ltd, Nov 22, 2017.

[9] Jeremy Nelson, Mastering Redis, Packt Publishing Ltd, May 31, 2016.

[10] Maxwell Dayvson Da Silva, Hugo Lopes Tavares, Redis Essentials, Packt Publishing Ltd, Sep 8, 2015.

[11] P. Mehra and S. Fineberg, Fast and flexible persistence: the magic potion for fault-tolerance, scalability and performance in online data stores, In: 18th International Parallel and Distributed Processing Symposium, 2004.

[12] Fabrizio Soppelsa, Chanwit Kaewkasi Native Docker Clustering with Swarm, Packt Publishing Ltd, Dec 20, 2016.

[13] Clouds Andrew J. Younge, Kevin Pedretti, Ryan E. Grant, Ron Brightwell, A Tale of Two Systems: Using Containers to Deploy, In: HPC Applications on Supercomputers and 2017 IEEE 9th International Conference on Cloud Computing Technology and Science.

[14] Adochiei Felix Constantin, “Contributions to Biological Signal Processing using Embedded Systems”, PhD Thesis, POSDRU CUANTUMDOC -RESEARCH GRANT.

Journal Information


All Time Past Year Past 30 Days
Abstract Views 0 0 0
Full Text Views 94 94 17
PDF Downloads 86 86 17