Episode
Large-scale data processing in Azure Data Lake
with Saveen Reddy
Data scientists and data wranglers often have existing code that they want to use at scale over large data sets. In this presentation, we show how you can take your existing Python, R, and Java code and libraries—and formats like Parquet—and apply them at scale to schematize unstructured data and process large amounts of data in Azure Data Lake with U-SQL.
Product info: azure.microsoft.com/en-us/solutions/data-lake/
Learn more: docs.microsoft.com/en-us/azure/data-lake-analytics/
Documentation: msdn.microsoft.com/library/azure/mt591959
Create a Free Account (Azure): https://aka.ms/c9-azurefree