The management of distributed systems infrastructure requires dedicated set of tools. The one tool that helps visualize current operational state of all systems and notify when failure occurs is available within monitoring solution. This paper provides an overview of monitoring approaches for gathering data from distributed systems and what are the major factors to consider when choosing a monitoring solution. Finally we discuss the tools currently available on the market.
If the inline PDF is not rendering correctly, you can download the PDF file here.
 Aceto G. Botta A. De Donato W. Pescape A. Cloud monitoring: A survey Computer Networks vol. 57 pp. 2093-2115 2013.
 Boccia V. et al. Infrastructure Monitoring for distributed Tier1: The ReCaS project use-case International Conference on Intelligent Networking and Collaborative Systems Salerno Italy 2014.
 Fatema K. Emeakaroha V. C. Healy P. D. Morrison J. P. Lynn T. A survey of Cloud monitoring tools: Taxonomy capabilities and objectives Journal of Parallel and Distributed Computing vol. 74 no. 10 pp. 2918-2933 2014.
 Hakulinen T. Ninin P. Nunes R. Riesco-Hernandez T. Revisiting CERN Safety System Monitoring (SSM) Proceedings of International Conference on Accelerator & Large Experimental Physics Control Systems San Francisco California USA 2013.
 Hernantes J. Gallardo G. Serrano N. IT Infrastructure-Monitoring Tools IEEE Software vol. 32 no. 4 pp. 88-93 2015.
 Horalek J. Sobeslav V. Proactive ICT Application Monitoring Latest Trends in Information Technology Wseas Press pp. 49-54 2012.
 Kent K. Souppaya M. Guide to Computer Security Log Management US Nat'l Inst. Standards and Technology Sept. 2006; http://csrc.nist.gov/publications/nistpubs/800-92SP800-92.pdf.
 Kufel L. Security Event Monitoring in a Distributed Systems Environment IEEE Security & Privacy vol. 11 no. 1 pp. 36-43 2013.
 Massie M. Li B. Nicholes B. Vuksan V. Monitoring with Ganglia Book published by O’Reilly Media 2013.
 Smit M. Simmons B. Litoiu M. Distributed application-level monitoring for heterogeneous clouds using stream processing Future Generation Computer Systems vol. 29 pp. 2103-2114 2013.
 Spellmann A. Gimarc R. Capacity Planning: A Revolutionary Approach for Tomorrow’s Digital Infrastructure Computer Measurement Group Conference La Jolla California USA 2013.
 Terenziani P. Coping with Events in Temporal Relational Databases IEEE Trans. Knowledge and Data Eng. vol. 25 no. 5 pp. 1181-1185 2013.
 Tierney B. Crowley B. Gunter D. Holding M. Lee J. Thompson M. A Monitoring Sensor Management System for Grid Environments Proceedings of The Ninth International Symposium On High-performance Distributed Computing IEEE CS pp. 97-104 2000.