ID: 7051545
Size: 1.32 MB
Pages: 169
Date: 2018-02-02
"Introduction to Data Mining: Exercise Solutions" (《数据挖掘导论习题答案》), uploaded and shared by a site member on 天天文库.
Introduction to Data Mining
Instructor's Solution Manual
Pang-Ning Tan, Michael Steinbach, Vipin Kumar
Copyright © 2006 Pearson Addison-Wesley. All rights reserved.

Contents
1 Introduction
2 Data
3 Exploring Data
4 Classification: Basic Concepts, Decision Trees, and Model Evaluation
5 Classification: Alternative Techniques
6 Association Analysis: Basic Concepts and Algorithms
7 Association Analysis: Advanced Concepts
8 Cluster Analysis: Basic Concepts and Algorithms
9 Cluster Analysis: Additional Issues and Algorithms
10 Anomaly Detection

1 Introduction

1. Discuss whether or not each of the following activities is a data mining task.

(a) Dividing the customers of a company according to their gender.
No. This is a simple database query.

(b) Dividing the customers of a company according to their profitability.
No. This is an accounting calculation, followed by the application of a threshold. However, predicting the profitability of a new customer would be data mining.

(c) Computing the total sales of a company.
No. Again, this is simple accounting.

(d) Sorting a student database based on student identification numbers.
No. Again, this is a simple database query.

(e) Predicting the outcomes of tossing a (fair) pair of dice.
No. Since the die is fair, this is a probability calculation. If the die were not fair, and we needed to estimate the probabilities of each outcome from the data, then this is more like the problems considered by data mining. However, in this specific case, solutions to this problem were developed by mathematicians a long time ago, and thus, we wouldn't consider it to be data mining.

(f) Predicting the future stock price of a company using historical records.
Yes. We would attempt to create a model that can predict the continuous value of the stock price. This is an example of the area of data mining known as predictive modelling. We could use regression for this modelling, although researchers in many fields have developed a wide variety of techniques for predicting time series.

(g) Monitoring the heart rate of a patient for abnormalities.
Yes. We would build a model of the normal behavior of heart rate and raise an alarm when an unusual heart behavior occurred. This would involve the area of data mining known as anomaly detection. This could also be considered as a clas
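
To make the idea in (g) concrete, here is a minimal sketch of one simple way to model normal behavior and raise an alarm. It is an illustration under stated assumptions, not the method the text prescribes: we assume normal heart rate can be summarized by a mean and standard deviation, and we flag a reading when its z-score exceeds a threshold. The function names, the threshold of 3.0, and the sample readings are all hypothetical.

```python
from statistics import mean, stdev


def fit_normal_model(baseline_bpm):
    """Summarize normal behavior as (mean, sample standard deviation)."""
    return mean(baseline_bpm), stdev(baseline_bpm)


def find_anomalies(readings_bpm, model, z_threshold=3.0):
    """Return (index, value) pairs whose z-score exceeds the threshold.

    Assumes the baseline had nonzero spread, so sigma > 0.
    """
    mu, sigma = model
    return [
        (i, bpm)
        for i, bpm in enumerate(readings_bpm)
        if abs(bpm - mu) / sigma > z_threshold
    ]


# Hypothetical resting heart-rate readings in beats per minute.
baseline = [72, 75, 71, 74, 73, 76, 70, 74, 72, 73]
model = fit_normal_model(baseline)

# New readings to monitor; 118 and 41 lie far outside the baseline.
new_readings = [73, 75, 118, 72, 41]
alarms = find_anomalies(new_readings, model)
print(alarms)  # [(2, 118), (4, 41)]
```

A fixed z-score threshold is only one possible choice. It treats each reading independently and ignores trends over time, so a realistic monitor would likely need a richer model of normal behavior.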