Resource description:
Hadoop in Action, uploaded and shared by a member, free to read online; more related content is available at 学术论文-天天文库.
MEAP Edition
Manning Early Access Program
Copyright 2010 Manning Publications
For more information on this and other Manning titles go to www.manning.com

TABLE OF CONTENTS

PART 1 Hadoop - A Distributed Programming Framework
CHAPTER 1 Introducing Hadoop
CHAPTER 2 Starting Hadoop
CHAPTER 3 Components of Hadoop

PART 2 - Hadoop in Action
CHAPTER 4 Writing basic MapReduce programs
CHAPTER 5 Advanced MapReduce
CHAPTER 6 Programming practices
CHAPTER 7 Cookbook
CHAPTER 8 Managing Hadoop

PART 3 - Hadoop Gone Wild
CHAPTER 9 Running Hadoop in the cloud
CHAPTER 10 Programming with Pig
CHAPTER 11 Hive and the Hadoop herd
CHAPTER 12 Case studies

APPENDIX HDFS file commands

Part 1 Hadoop - A Distributed Programming Framework

Part 1 of this book introduces the basics for understanding and using Hadoop. We describe the hardware components that make up a Hadoop cluster, as well as the installation and configuration to create a working system. We cover the MapReduce framework at a high level and get your first MapReduce program up and running.

CHAPTER 1 Introducing Hadoop

This chapter covers
■ The basics of writing a scalable, distributed data-intensive program
■ Understanding Hadoop and MapReduce
■ Writing and running a basic MapReduce program

Today, we’re surrounded by data. People upload videos, take pictures on their cell phones, text friends, update their Facebook status, leave comments around the web, click on ads, and so forth. Machines, too, are generating and keeping more and more data. You may even be reading this book as digital data on your computer screen,
and certainly your purchase of this book is recorded as data with some retailer.[1]

The exponential growth of data first presented challenges to cutting-edge businesses such as Google, Yahoo, Amazon, and Microsoft. They needed to go through terabytes and petabytes of data to figure out which websites were popular,
what books were in demand, and what kinds of ads appealed to people. Existing tools were becoming inadequate to process such large data sets. Google was the first to publicize MapReduce, a system they had used to scale their data processing needs.

[1] Of course, you’re reading a legitimate copy of this, right?

This system aroused a lot of interest because many other businesses were facing similar scaling challenges, and it wasn’t feasible for everyone to reinvent their own