Big, Fast, and Data-Furious…with Spark

Play Big, Fast, and Data-Furious…with Spark
Sign in to queue

Description

Making teams of data scientists productive is a challenging task. The size of the data in Big Data problems is the first great hindrance to productivity. Apache Spark provides a foundation for the solution to this problem by offering interactive compute engine, but it is not sufficient in itself. In this session we review how a set of open source tools including Jupyter and Livy can be combined with advanced resource management and elasticity of Azure cloud to provide a comprehensive interactive platform for Big Data.

Embed

Download

Download this episode

Download captions

The Discussion

Add Your 2 Cents