Episode

Large-scale data processing in Azure Data Lake

with Saveen Reddy

Data scientists and data wranglers often have existing code that they want to run at scale over large data sets. In this presentation, we show how you can take your existing Python, R, and Java code and libraries, along with formats like Parquet, and use them with U-SQL in Azure Data Lake to schematize unstructured data and process large data sets.

Product info: azure.microsoft.com/en-us/solutions/data-lake/
Learn more: docs.microsoft.com/en-us/azure/data-lake-analytics/
Documentation: msdn.microsoft.com/library/azure/mt591959

Create a Free Account (Azure): https://aka.ms/c9-azurefree

Java

U-SQL