资源描述:
《数据挖掘英文题目》由会员上传分享,免费在线阅读,更多相关内容在教育资源-天天文库。
1、2.4.Supposethatthedataforanalysisincludestheattributeage.Theagevaluesforthedatatuplesare(inincreasingorder)13,15,16,16,19,20,20,21,22,22,25,25,25,25,30,33,33,35,35,35,35,36,40,45,46,52,70.(a)Whatisthemeanofthedata?Whatisthemedian?(b)Whatisthemodeofthedata?Commentonthedata'sm
2、odality(i.e.,bimodal,trimodal,etc.).(c)Whatisthemidrangeofthedata?(d)Canyou¯nd(roughly)the¯rstquartile(Q1)andthethirdquartile(Q3)ofthedata?(e)Givethe¯ve-numbersummaryofthedata.(f)Showaboxplotofthedata.(g)Howisaquantile-quantileplotdi®erentfromaquantileplot?2.9.Supposeahospit
3、altestedtheageandbodyfatdatafor18randomlyselectedadultswiththefollowingresultage232327273941474950%fat9.526.57.817.831.425.927.427.231.2age525454565758586061%fat34.642.528.833.430.234.132.941.235.7(a)Calculatethemean,medianandstandarddeviationofageand%fat.(b)Drawtheboxplotsf
4、orageand%fat.(c)Drawascatterplotandaq-qplotbasedonthesetwovariables.(d)Normalizethetwovariablesbasedonz-scorenormalization.(e)Calculatethecorrelationcoe±cient(Person'sproductmomentcoe±cient).Arethesetwovariablespositivelyornegativelycorrelated?2.11.Usethetwomethodsbelowtonor
5、malizethefollowinggroupofdata:200;300;400;600;1000(a)min-maxnormalizationbysettingmin=0andmax=1(b)z-scorenormalization4.4.SupposethatabasecuboidhasthreedimensionsA;B;C,withthefollowingnumberofcells:jAj=1;000;000,jBj=100,andjCj=1000.Supposethateachdimensionisevenlypartitioned
6、into10portionsforchunking.(a)Assumingeachdimensionhasonlyonelevel,drawthecompletelatticeofthecube.(b)Ifeachcubecellstoresonemeasurewith4bytes,whatisthetotalsizeofthecomputedcubeifthecubeisdense?(c)Statetheorderforcomputingthechunksinthecubethatrequirestheleastamountofspace,a
7、ndcomputethetotalamountofmainmemoryspacerequiredforcomputingthe2-Dplanes.5.3.Adatabasehas¯vetransactions.Letminsup=60%andminconf=80%.(a)FindallfrequentitemsetsusingAprioriandFP-growth,respectively.Comparethee±ciencyofthetwominingprocesses.(b)Listallofthestrongassociationrule
8、s(withsupportsandcon¯dencec)matchingthefollowingmetarule,whereXisavariabler