Processing WebLogs with HDInsight
In this lab, explore the weblogs in its unstructured form and curate them to bring out information that can be actionable. We use Hadoop concepts like MapReduce and bring data to a usable format with Hive. We then use Sqoop to bring the data into our Data Warehouse and then create a report upon it in Excel. This covers a complete cycle of bringing in unstructured data and making sense of it with the high volumes and velocity associated with Big Data.
Click here to run this lab in a Virtual Machine.
Click here to view the lab manual