Optimizing Apache Hive Performance in HDInsight

Play Optimizing Apache Hive Performance in HDInsight
Sign in to queue

Description

HDInsight allows you to run Big Data technologies (including Hadoop) on Microsoft Azure. If you have a Hadoop cluster, more than likely you use Hive in some capacity. Hive is the SQL engine on Hadoop and is mature, scalable, and heavily used in production scenarios. Hive can run different types of workloads including ETL, reporting, data mining and others. Each of these workloads needs to be tuned to get the best performance. At this session you will learn how to optimize your system better. We will discuss performance optimization at both an architecture layer and at the execution engine layer. Come prepared for a hands-on view of HDInsight including demos.

Embed

Download

Right click to download this episode

Download captions

The Discussion

Add Your 2 Cents