教育管理计算机专业英文讲义汇总

下载本文档

ID 704246
格式 doc
大小 210.5 KB
约32页
收藏
点赞(0)
海报
举报

/ 32

下载本文档

文本预览下载提示常见问题

激angY,LiM,ZhouZH.SoftwaredefectdetectionwithRocus.JOURNALOFCOMPUTERSCIENCEANDTECH-NOLOGY26(2):328{342Mar.2011.DOI10.1007/s11390-011-1135-6SoftwareDefectDetectionwithROCUSYuan激ang(姜远),Member,CCF,MingLi(黎铭)¤,Member,CCF,ACM,IEEEandZhi-HuaZhou(周志华),SeniorMember,CCF,IEEE,Member,ACMNationalKeyLaboratoryforNovelSoftwareTechnology,Nan激ngUniversity,Nan激ng210093,ChinaE-mail:f激angyuan,lim,zhouzhg@nju.eduReceivedMay15,2009;revisedOctober26,2010.AbstractSoftwaredefectdetectionaimstoautomaticallyidentifydefectivesoftwaremodulesfore±cientsoftwaretestinordertoimprovethequalityofasoftwaresystem.Althoughmanymachinelearningmethodshavebeensuccessfullyappliedtothetask,mostofthemfailtoconsidertwopracticalyetimportantissuesinsoftwaredefectdetection.First,itisratherdi±culttocollectalargeamountoflabeledtrainingdataforlearningawell-performingmodel;second,inasoftwaresystemthereareusuallymuchfewerdefectivemodulesthandefect-freemodules,solearningwouldhavetobeconductedoveranimbalanceddataset.Inthispaper,weaddressthesetwopracticalissuessimultaneouslybyproposinganovelsemi-supervisedlearningapproachnamedRocus.Thismethodexploitstheabundantunlabeledexamplestoimprovethedetectionaccuracy,aswellasemploysunder-samplingtotackletheclass-imbalanceprobleminthelearningprocess.Experimentalresultsofreal-worldsoftwaredefectdetectiontasksshowthatRocusise®ectiveforsoftwaredefectdetection.Itsperformanceisbetterthanasemi-supervisedlearningmethodthatignorestheclass-imbalancenatureofthetaskandaclass-imbalancelearningmethodthatdoesnotmakee®ectiveuseofunlabeleddata.Keywordsmachinelearning,datamining,semi-supervisedlearning,class-imbalance,softwaredefectdetection1IntroductionEnabledbytechnologicaladvancesincomputerhardware,softwaresystemshavebecomeincreasinglypowerfulandversatile.However,theattendantincreaseinsoftwarecomplexityhasmadethetimelydevelop-mentofreliablesoftwaresystemsextremelychallenging.Tomakesoftwaresystemsreliable,itisveryimportanttoidentifyasmanydefectsaspossiblebeforereleas-ingthesoftware.However,duetothecomplexityofthesoftwaresystemsandthetightprojectschedule,itisalmostimpossibletoextensivelytesteverypathofthesoftwareunderallpossibleruntimeenvironment.Thus,accuratelypredictingwhetherasoftwaremodulecontainsdefectscanhelptoallocatethelimitedtestresourcese®ectively,andhence,improvethequalityofsoftwaresystems.Suchaprocessisusuallyreferredtoassoftwaredefectdetection,whichhasalreadydrawnmuchattentioninsoftwareengineeringcommunity.Machinelearningtechniqueshavebeensuccessfullyappliedtobuildingpredictivemodelsforsoftwaredefectdetection[1-7].Thestaticanddynamiccodeattributesorsoftwaremetricsareextractedfromeachsoftwaremoduletoformanexample,whichisthenlabeledas\defective"or\defect-free".Predictivemodelswhichlearnfromalargenumberofexamplesareexpectedtoaccuratelypredictwhetheragivenmoduleisdefective.However,mostofthesestudieshavenotconsideredtwopracticalyetimportantissuesinsoftwaredefectdetection.First,althoughitisrelativelyeasytoauto-maticallygenerateexamplesfromsoftwaremodulesus-ingsomestandardtools,determiningwhetheramodulecontainsdefectthroughextensivetestusuallyconsumestoomuchtimeandresource,sincethenumberofpro-gramstatusgrowsexponentiallyasthecomplexityofsoftwareincreases.Withlimitedtimeandtestresource,onecanonlyobtainthelabelsforasmallportionofmodules.However,thepredictivemodelsthatlearnfromsuch...

1、当您付费下载文档后，您只拥有了使用权限，并不意味着购买了版权，文档只能用于自身使用，不得用于其他商业用途（如 [转卖]进行直接盈利或[编辑后售卖]进行间接盈利）。
2、本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺！文档内容仅供参考，付费前请自行鉴别。
3、如文档内容存在侵犯商业秘密、侵犯著作权等，请点击“举报”。