Search from over 60,000 research works

Advanced Search

Handling single node failures using agents in computer clusters

Full text not archived in this repository.
Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Varghese, B., McKee, G. T. and Alexandrov, V. N. (2010) Handling single node failures using agents in computer clusters. In: Proceedings of the 2010 International Symposium on Performance Evaluation of Computer and Telecommunication Systems. Society for Modeling and Simulation International, San Diego, USA, pp. 96-101. ISBN 9781565553408

Abstract/Summary

The work reported in this paper is motivated towards handling single node failures for parallel summation algorithms in computer clusters. An agent based approach is proposed in which a task to be executed is decomposed to sub-tasks and mapped onto agents that traverse computing nodes. The agents intercommunicate across computing nodes to share information during the event of a predicted node failure. Two single node failure scenarios are considered. The Message Passing Interface is employed for implementing the proposed approach. Quantitative results obtained from experiments reveal that the agent based approach can handle failures more efficiently than traditional failure handling approaches.

Additional Information The symposium was held in Ottawa, Canada, 11-14 July 2010.
Item Type Book or Report Section
URI https://reading-clone.eprints-hosting.org/id/eprint/17481
Item Type Book or Report Section
Refereed Yes
Divisions Science
Additional Information The symposium was held in Ottawa, Canada, 11-14 July 2010.
Publisher Society for Modeling and Simulation International
Download/View statistics View download statistics for this item

University Staff: Request a correction | Centaur Editors: Update this record

Search Google Scholar