Show simple item record

dc.contributor.advisorBhattacharyya, Chiranjib
dc.contributor.authorAgrawal, Vipul
dc.date.accessioned2011-07-12T07:12:00Z
dc.date.accessioned2018-07-31T04:40:24Z
dc.date.available2011-07-12T07:12:00Z
dc.date.available2018-07-31T04:40:24Z
dc.date.issued2011-07-12
dc.date.submitted2010
dc.identifier.urihttps://etd.iisc.ac.in/handle/2005/1285
dc.identifier.abstracthttp://etd.iisc.ac.in/static/etd/abstracts/1667/G23699-Abs.pdfen_US
dc.description.abstractThe ability to accurately predict an impending hard disk failure is important for reliable storage system design. The facility provided by most hard drive manufacturers, called S.M.A.R.T. (self-monitoring, analysis and reporting technology), has been shown by current research to have poor predictive value. The problem of finding alternatives to S.M.A.R.T. for predicting disk failure is an area of active research. In this work, we present a rule discovery methodology, and show that it is possible to construct decision support systems that can detect such failures using information recorded from live disks. It is desired that any such prediction methodology should have high accuracy and must have ease of interpretability. Black box models can deliver highly accurate solutions but do not provide an understanding of events which explains the decision given by it. To this end we explore rule based classifiers for predicting hard disk failures from various disk events. We show that it is possible to learn easy to understand rules from disk events. Our evaluation shows that our system can be tuned either to have a high failure detection rate (i.e., classify a bad disk as bad) or to have a low false alarm rate (i.e., not classify a good disk as bad). We also propose a modification of MLRules algorithm for classification of data with imbalanced class distributions. The existing algorithm, assuming relatively balanced class distributions and equal misclassfication costs, performs poorly in classification of such datasets. The performance can be considerably improved by introducing cost- sensitive learning to the existing framework.en_US
dc.language.isoen_USen_US
dc.relation.ispartofseriesG23699en_US
dc.subjectHard Drive Failure Predictionen_US
dc.subjectHard Drive Verificationen_US
dc.subjectRule-based Classifiersen_US
dc.subjectHard Disks (Computer Science)en_US
dc.subjectRule Discovery Methodologyen_US
dc.subjectDisk Events (Computer Science)en_US
dc.subjectRule-based Learningen_US
dc.subjectHard Drive Failuresen_US
dc.subjectSelf Monitoring Analysis and Reporting Technology (SMART)en_US
dc.subject.classificationComputer Scienceen_US
dc.titleHard Drive Failure Prediction : A Rule Based Approachen_US
dc.typeThesisen_US
dc.degree.nameMSc Enggen_US
dc.degree.levelMastersen_US
dc.degree.disciplineFaculty of Engineeringen_US


Files in this item

This item appears in the following Collection(s)

Show simple item record