A flexible monitoring and notification system for distributed resources
Smith, G. and Baker, M. (2008) A flexible monitoring and notification system for distributed resources. In: Tudruj, M. (ed.) Proceedings of the International Symposium on Parallel and Distributed Computing. IEEE Computer Soc, pp. 31-38. ISBN 9780769534725
Full text not archived in this repository.
Resource monitoring in distributed systems is required to understand the 'health' of the overall system and to help identify particular problems, such as dysfunctional hardware, a faulty, system or application software. Desirable characteristics for monitoring systems are the ability to connect to any number of different types of monitoring agents and to provide different views of the system, based on a client's particular preferences. This paper outlines and discusses the ongoing activities within the GridRM wide-area resource-monitoring project.