Building a Scalable Data Science Platform with R on HDInsight

Sign in to queue

Description

Hadoop is famously scalable. Cloud computing is famously scalable. R, the thriving and extensible open source Data Science software, not so much. But what if we seamlessly combined Hadoop, cloud computing, and R to create a scalable Data Science platform? Imagine exploring, transforming, modeling, and scoring data at any scale from the comfort of your favorite R environment. Now, imagine calling a simple R function to operationalize your predictive model as a scalable, cloud-based Web Services API. Come learn how to use the magic of the cloud to run your R code, thousands of open source R extension packages, and distributed implementations of the most popular machine learning algorithms at scale.

Embed

Download

Download this episode

Download captions

The Discussion

  • User profile image
    alejanmi

    Loved this session! Debraj provided a very comprehensive, easy to follow and pragmatic walk through of the R capabilities in HDInsight. Nice addition was the coverage of wide set of components or landscape of the big data stack in Azure. Looking forward to your next sessions and channel9 videos!

Add Your 2 Cents