资源描述:
《OURMINE An Open Source Data Mining Toolkit》由会员上传分享,免费在线阅读,更多相关内容在学术论文-天天文库。
1、OURMINE:AnOpenSourceDataMiningToolkitAdamR.NelsonThesissubmittedtotheCollegeofEngineeringandMineralResourcesatWestVirginiaUniversityinpartialfulfillmentoftherequirementsforthedegreeofMasterofScienceinComputerScienceTimMenzies,Ph.D.,ChairFrancesVanScoy,Ph.
2、D.TimMcGraw,Ph.D.LaneDepartmentofComputerScienceandElectricalEngineeringMorgantown,WestVirginia2010Keywords:DataMining,Toolkit,SoftwareDefectPrediction,Bash,Awk,Scriptingc2010AdamR.NelsonAbstractOURMINE:AnOpenSourceDataMiningToolkitAdamR.NelsonWhenresear
3、cherswanttorepeat,improveorrefutepriorconclusions,itisusefultohaveacompleteandoperationaldescriptionofpriorexperiments.Ifthosedescriptionsareoverlylongorcomplex,thensharingtheirdetailsmaynotbeinformative.OURMINEisascriptingenvironmentforthedevelopmentand
4、deploymentofdataminingex-periments.UsingOURMINE,dataminingnovicescanspecifyandexecuteintricateexperiments,whileresearcherscanpublishtheircompleteexperimentalrigalongsidetheirconclusions.ThisisachievablebecauseofOURMINEssuccinctness.Forexample,thisthesisp
5、resentsthreecasestudiesdocumentedintheOURMINEsyntax.Thus,thebrevityandsimplicityofOURMINErecommendsitasabettertoolfordocumenting,executing,andsharingdataminingexperiments.AcknowledgmentsAmongthosetothank,Iwouldfirstliketoacknowledgemyparentsandmysisterfor
6、theirutmostsupportinbothgoodtimes,andbad.Theirdedicationtomyhappinesswillneverbeforgotten.Iwouldalsoliketothankmyadvisor,Dr.Menzies,whobelievedinmeandalwayslookedoutformyprofessionalwell-being.Finally,IwouldliketoacknowledgeGregGay,TomiPrifti,AndrewMathe
7、ny,andeveryoneelsewhosecontribution,directorindirect,wasmonumentalinmyresearch.iContents1Introduction11.1StatementofThesis..................................21.2ContributionsofthisThesis..............................31.3PapersfromthisWork.................
8、...............31.4StructureofthisThesis................................32RelatedWork52.1ExistingOpenSourceDataMiningTools......................52.1.1ADaM....................................52.1.2DatabionicESOMTools.......