资源描述:
《Advanced Analytics with Spark》由会员上传分享,免费在线阅读,更多相关内容在学术论文-天天文库。
1、AdvancedAnalyticswithSparkAdvancedAnalyticswithSparkInthispracticalbook,fourClouderadatascientistspresentasetofself-containedpatternsforperforminglarge-scaledataanalysiswithSpark.TheauthorsbringSpark,statisticalmethods,andreal-worlddatasetstogethertoteachyouhowtoapproachanaly
2、ticsproblemsbyexample.You’llstartwithanintroductiontoSparkanditsecosystem,andthendiveintopatternsthatapplycommontechniques—classification,collaborativefiltering,andanomalydetection,amongothers—tofieldssuchasgenomics,security,andfinance.Ifyouhaveanentry-levelunderstandingofmac
3、hinelearningandstatistics,andyouprograminJava,Python,orScala,you’llfindthesepatternsusefulforworkingonyourowndataapplications.Patternsinclude:■RecommendingmusicandtheAudioscrobblerdataset■Predictingforestcoverwithdecisiontrees■AnomalydetectioninnetworktrafficwithK-meanscluste
4、ring■UnderstandingWikipediawithLatentSemanticAnalysis■Analyzingco-occurrencenetworkswithGraphXAdvanced■GeospatialandtemporaldataanalysisontheNewYorkCityTaxiTripsdata■EstimatingfinancialriskthroughMonteCarlosimulationAnalyticswith■AnalyzinggenomicsdataandtheBDGproject■Analyzin
5、gneuroimagingdatawithPySparkandThunder SandyRyzaisaSeniorDataScientistatClouderaandactivecontributortotheApacheSparkproject.Ryza,Laserson,UriLasersonisaSeniorDataScientistatCloudera,wherehefocusesonPythonOwen&WillsintheHadoopecosystem.SparkSeanOwenisDirectorofDataScienceforEM
6、EAatCloudera,andacommitterforApacheSpark.JoshWillsisSeniorDirectorofDataScienceatClouderaandfounderoftheApacheCrunchproject.PATTERNSFORLEARNINGFROMDATAATSCALEDATA/SPARKTwitter:@oreillymediafacebook.com/oreillyUS$49.99CAN$57.99ISBN:978-1-491-91276-8SandyRyza,UriLaserson,SeanOw
7、en&JoshWillswww.it-ebooks.infoAdvancedAnalyticswithSparkAdvancedAnalyticswithSparkInthispracticalbook,fourClouderadatascientistspresentasetofself-containedpatternsforperforminglarge-scaledataanalysiswithSpark.TheauthorsbringSpark,statisticalmethods,andreal-worlddatasetstogeth
8、ertoteachyouhowtoapproachanalyticsproblemsbyexample.You’llstartwitha