1、201306ClouderaDataAnalystTraining:UsingPig,Hive,andImpalawithHadoopHands-OnExercisesGeneral Notes..........................................................................................................................................2Hands‐On Exercise #1: Data Ingest With Hadoop Tools............
2、...............................................4Hands‐On Exercise #2: Using Pig for ETL Processing.............................................................12Hands‐On Exercise #3: Analyzing Ad Campaign Data with Pig............................................19Hands‐On Exercise #4: Analyzing Dis
3、parate Data Sets with Pig.........................................25Hands‐On Exercise #5: Extending Pig with Streaming and UDFs.......................................30Hands‐On Exercise #6: Running Hive Queries from the Shell, Scripts, and Hue............35Hands‐On Exercise #7: Data Management with
4、 Hive..............................................................40Hands‐On Exercise #8: Gaining Insight with Sentiment Analysis (Optional).................47Hands‐On Exercise #9: Data Transformation with Hive........................................................51Hands‐On Exercise #10: Interac
5、tive Analysis with Impala...................................................59Data Model Reference........................................................................................................................65Regular Expression Reference...................................................
7、mulatea real Hadoop clusteron a single machine. In addition to Hadoop itself, Pig, Hive, Impala, Sqoop, andall the other CDH components youwill use in class are already installed and configured for you.Poin