Journal Home Online First Current Issue Archive For Authors Journal Information 中文版

Strategic Study of CAE >> 2007, Volume 9, Issue 2

Research on Multi-level Software Rejuvenationof Computing System

1. Department of Computer Science & Engineering, Jiangsu Polytechnic University, Changzhou, Jiangsu 213164, China;

2. Department of Computer, Nanjing University of Science and Technology, Nanjing 210094, China

Funding project:国家自然科学基金资助项目(60273035);国防科工委基础应用项目(K1704060511) Received: 2005-09-27 Revised: 2005-11-10

Next Previous

Abstract

Recently,the phenomenon of software aging,one in which error conditions actually accrue with time and/or load,has been observed. To counteract software aging, which can cause outages resulting in high costs,a proactive restart technique called software rejuvenation is proposed and the rejuvenation cost is analyzed.In order to reduce the rejuvenation cost and improve software availability and reliability further, rejuvenation granularity should be finer than before. Therefore a fine-grained proactive technique——multilevel software rejuvenation is put forward. Firstly, the degradation law of system performance can be determined by analyzing the occupation and wastage of system resources. Based on the law and the software architecture,the two software rejuvenation policies, i.e. time-based multilevel software rejuvenation policy and detection-based multilevel software rejuvenation policy, can be drafted, and the rejuvenation granularity can be determined. Their formal description of policies is given by finite-state automaton. Finally, the entire process is illustrated with a web service case. This paper provides a case to illustrate the process, and the simulation results of the case show that the multilevel software rejuvenation policy can reduce the MTTR and rejuvenation cost further, comparing with the only system-level software rejuvenation. As a consequence,the system availability and reliability are enhanced.

Figures

图 1

图 2

图 3

图 4

图 5

图 6

图 7

图 8

References

[ 1 ] Huang Y , Kintala C , Kolettis N . Software rejuvenation : analysis ,module and applications [A] .In :Proc of FTCS- 25 [C] .Pasadena ,CA ,1995 link1

[ 2 ] Garg S , Moorsel A V , Vaidyanathan K . A Methodology for detection and estimation of software aging [A] . In : Proceedings of the 9th International Symposium on Software Reliability Engineering [C] .Paderborn ,Germany ,1998 link1

[ 3 ] Castelli V , Harper R E , Heidelberger P . Proactive management of software Aging [J] . IBM JRD , 2001 , 45 (2) :311 ~ 332 link1

[ 4 ] Vaidyanathan K . Proactive Management of Software Systems :Analysis and Implementation [D] .Department of Electrical and Computer Engineering Duke University ,NC , USA ,2002

[ 5 ] Xiea W , Hongb Y , Trivedi K . Analysis of a two-level software rejuvenation policy [J] . Reliability Engineering and System Safety ,2005 ,87(1) :13 ~ 22 link1

[ 6 ] Patterson D , Brown A , Broadwell P . Recovery oriented computing (ROC) : motivation , definition , techniques , and case studies [R] . UC Berkeley Computer Science Technical Report UCB/CSD - 02 - 1175 ,2002

[ 7 ] Candea G , Fox A . Recursive restartability : turning the reboot sledgehammer into a scalpel [ A] . In : 8th Workshop on Hot Topics in Operating Systems [ C] . Schloss Elmau ,Germany ,2001 link1

[ 8 ] Candea G , Kawamoto S , Fujiki Y . Microreboot-A technique for cheap recovery [A] . In :6th Symposium on Operating Systems Design and Implementation [C] . San Francisco ,CA ,2004 link1

[ 9 ] Candea G , Cutler J , Fox A . Improving availability with recursive microreboots :a soft-state system case study [J] . Performance Evaluation Journal ,2004 ,56(1 - 3) :213 ~ 248 link1

[10] Candea G , Fox A . Crash-only software [A] . In : 9th Workshop on Hot Topics in Operating Systems (HotOS-IX) [C] .Lihue ,Hawaii ,2003 link1

Related Research