WEBVTT

00:00:00.000 --> 00:00:02.400
>> I was pretty excited when
I found out that Python

00:00:02.400 --> 00:00:05.100
is one of the most popular
languages for data science.

00:00:05.100 --> 00:00:07.380
Admittedly though, I was
a little bit nervous

00:00:07.380 --> 00:00:09.720
when Francesca mentioned
Jupyter Notebooks.

00:00:09.720 --> 00:00:11.865
I didn't have much
experience with those.

00:00:11.865 --> 00:00:16.245
I was thrilled when I found out
that VS Code, my favorite editor,

00:00:16.245 --> 00:00:18.750
has packages and
extensions for Python,

00:00:18.750 --> 00:00:21.450
Jupyter Notebooks, and
Azure Machine Learning.

00:00:21.450 --> 00:00:23.415
I think I'll feel right at home

00:00:23.415 --> 00:00:26.235
because I'll be in my Visual
Studio Code environment.

00:00:26.235 --> 00:00:29.410
So let's get our local
environment setup and hopefully,

00:00:29.410 --> 00:00:31.410
you'll feel comfortable
in VS Code too.

00:00:31.410 --> 00:00:34.470
As always, there's additional
documentation both on

00:00:34.470 --> 00:00:35.910
the screen and link
in the description

00:00:35.910 --> 00:00:38.245
down below. Let's get started.

00:00:38.245 --> 00:00:41.360
We need to download and
install three things:

00:00:41.360 --> 00:00:45.125
Visual Studio Code,
Python, and Anaconda.

00:00:45.125 --> 00:00:47.330
I'm going to verify
that I have Python

00:00:47.330 --> 00:00:49.660
installed in my Anaconda prompt.

00:00:49.660 --> 00:00:52.040
That's looking good. Next, I'm

00:00:52.040 --> 00:00:54.260
going to create an
Anaconda environment.

00:00:54.260 --> 00:00:56.495
I'm going to take the advice from

00:00:56.495 --> 00:00:58.760
the Visual Studio Code Team and

00:00:58.760 --> 00:01:01.925
create an environment
ready for data science.

00:01:01.925 --> 00:01:05.320
I'm going to call this
environment devintrods.

00:01:05.320 --> 00:01:08.710
I'm going to specify
version 3.7 for Python.

00:01:08.710 --> 00:01:14.915
I'm going to also install
Pandas, Jupyter, Seaborn,

00:01:14.915 --> 00:01:21.960
Scikit-learn, Keras, and TensorFlow,

00:01:22.230 --> 00:01:26.305
some of the most popular
tools for data science.

00:01:26.305 --> 00:01:29.320
Once the Anaconda environment
has been created,

00:01:29.320 --> 00:01:30.400
we can activate it with

00:01:30.400 --> 00:01:34.250
conda activate and the
name of our environment.

00:01:34.970 --> 00:01:39.075
Next, we want to
install Azure ML SDK.

00:01:39.075 --> 00:01:41.440
I'm going to specify
two parameters here,

00:01:41.440 --> 00:01:44.095
the AutoML and explain features,

00:01:44.095 --> 00:01:46.240
which will really help with some of

00:01:46.240 --> 00:01:48.160
those advanced topics that

00:01:48.160 --> 00:01:50.990
Francesca will lead
us through later on.

00:01:51.630 --> 00:01:54.790
We also want to make
sure that we have access

00:01:54.790 --> 00:01:57.175
to the AutoML training tools.

00:01:57.175 --> 00:01:59.960
So I'll go ahead and
install that as well.

00:02:01.340 --> 00:02:03.780
I think our environment is set up.

00:02:03.780 --> 00:02:06.465
Now we should set up
Visual Studio Code.

00:02:06.465 --> 00:02:09.000
There are three extensions we need.

00:02:09.000 --> 00:02:11.150
Head over to the
extension marketplace

00:02:11.150 --> 00:02:14.070
and install the Python extension,

00:02:19.580 --> 00:02:22.285
the Azure Account extension,

00:02:22.285 --> 00:02:26.300
which will allow us to sign in to
Azure within Visual Studio Code,

00:02:26.300 --> 00:02:29.970
and the Azure Machine
Learning Extension.

00:02:29.970 --> 00:02:33.139
Once we have all three
of those installed,

00:02:33.139 --> 00:02:34.790
we can verify that everything is

00:02:34.790 --> 00:02:36.835
working inside of Visual Studio Code.

00:02:36.835 --> 00:02:39.415
First, I want to sign in to Azure.

00:02:39.415 --> 00:02:45.760
I'll open up the command palette
and choose "Azure Sign In".

00:02:46.520 --> 00:02:49.910
This will open up a
new browser tab and

00:02:49.910 --> 00:02:53.170
allow me to sign in to
Azure through the browser.

00:02:53.170 --> 00:02:55.310
Notice, since I've
already signed into

00:02:55.310 --> 00:02:57.125
the Azure portal in another tab,

00:02:57.125 --> 00:02:59.640
it's just a quick click of a button.

00:03:00.460 --> 00:03:02.885
Now you can see that I'm signed in to

00:03:02.885 --> 00:03:05.790
Azure down here at the
bottom of VS Code.

00:03:05.870 --> 00:03:08.720
Before getting started,
I already created

00:03:08.720 --> 00:03:10.865
a folder where all of this
code is going to be stored.

00:03:10.865 --> 00:03:13.510
It's called dev intro data science.

00:03:13.510 --> 00:03:15.705
Now I'm going to create a new file,

00:03:15.705 --> 00:03:18.550
and we're going to call
this bike-share.ipynb.

00:03:21.410 --> 00:03:25.740
This will create a new
Jupyter Notebook file.

00:03:26.150 --> 00:03:29.385
Because we've installed
the Python extension,

00:03:29.385 --> 00:03:32.335
and we've installed Jupyter
through our Anaconda environment,

00:03:32.335 --> 00:03:34.540
VS Code will automatically convert

00:03:34.540 --> 00:03:38.015
this file into a Jupyter
Notebook interface.

00:03:38.015 --> 00:03:41.635
There's a few things to notice to
make sure that they're correct.

00:03:41.635 --> 00:03:45.235
One is that VS Code is
referring to the right Python.

00:03:45.235 --> 00:03:49.390
Down here at the bottom left
of your VS Code window,

00:03:49.390 --> 00:03:52.030
you should be able to see the
correct version of Python,

00:03:52.030 --> 00:03:53.545
and if you hover over it,

00:03:53.545 --> 00:03:55.300
you should be able to
see that this is using

00:03:55.300 --> 00:03:57.865
the Anaconda environment
that we just created.

00:03:57.865 --> 00:04:01.870
Next, you should make sure that
your Jupyter kernel is also

00:04:01.870 --> 00:04:06.130
set to that same Python version in
that same Anaconda environment.

00:04:06.130 --> 00:04:10.160
Now your local environment
is all set up.

