Monitoring and Management-Support of Distributed Systems

TitleMonitoring and Management-Support of Distributed Systems
Publication TypeTechnical Report
Year of Publication1988
AuthorsHaban, D., Wybranietz D., & Barak A.
Other Numbers493

This paper describes a tool for on-line monitoring of distributed systems. The tool consists of a hardware component and software level, i.e., a hybrid monitor, which is capable of presenting the interactive user and the local operating system with a high-level information and performance evaluation of the activities in the host system with minimal interferences. A special hardware support, which consists of a test and measurement processor (TMP), was designed and has been implemented in the nodes of an experimental multicomputer system. The main function of the TMP is to execute low level operating system functions, to manage local resources and to trigger time driven events in order to reduce the overhead of the host operating system. The operations of the TMP are completely transparent to the users with a minimal, less that 0.1%, overhead to the hardware system. In the experimental system, all the TMPs were connected with a central monitoring station, using an independent communication network in order to provide a global view of the monitored system. The central monitoring station displays the resulting information in easy-to-read charts and graphs. Our experience with the TMP shows that it promotes an improved understanding of run-time behavior and performance measurements, to derive qualitative and quantitative assessments of distributed systems.

Bibliographic Notes

ICSI Technical Report TR-88-007

Abbreviated Authors

D. Haban, D. Wybranietz, and A. Barak

ICSI Publication Type

Technical Report