Building a Scalable Data Science Platform with R on HDInsight

Download this episode

Download Video

Download captions


Hadoop is famously scalable. Cloud computing is famously scalable. R, the thriving and extensible open source Data Science software, not so much. But what if we seamlessly combined Hadoop, cloud computing, and R to create a scalable Data Science platform? Imagine exploring, transforming, modeling, and scoring data at any scale from the comfort of your favorite R environment. Now, imagine calling a simple R function to operationalize your predictive model as a scalable, cloud-based Web Services API. Come learn how to use the magic of the cloud to run your R code, thousands of open source R extension packages, and distributed implementations of the most popular machine learning algorithms at scale.



Available formats for this video:

Actual format may change based on video formats available and browser capability.

    The Discussion

    • User profile image

      Loved this session! Debraj provided a very comprehensive, easy to follow and pragmatic walk through of the R capabilities in HDInsight. Nice addition was the coverage of wide set of components or landscape of the big data stack in Azure. Looking forward to your next sessions and channel9 videos!

    Comments closed

    Comments have been closed since this content was published more than 30 days ago, but if you'd like to send us feedback you can Contact Us.