Building a Scalable Data Science Platform with R on HDInsight
Hadoop is famously scalable. Cloud computing is famously scalable. R, the thriving and extensible open source Data Science software, not so much. But what if we seamlessly combined Hadoop, cloud computing, and R to create a scalable Data Science platform? Imagine exploring, transforming, modeling, and scoring data at any scale from the comfort of your favorite R environment. Now, imagine calling a simple R function to operationalize your predictive model as a scalable, cloud-based Web Services API. Come learn how to use the magic of the cloud to run your R code, thousands of open source R extension packages, and distributed implementations of the most popular machine learning algorithms at scale.