Automated machine learning picks an algorithm and hyperparameters for you and generates a model ready for deployment. There are several options that you can use to configure automated machine learning experiments.
Configuration options available in automated machine learning:
- Select your experiment type: Classification, Regression or Time Series Forecasting
- Data source, formats, and fetch data
- Choose your compute target
- Automated machine learning experiment settings
- Run an automated machine learning experiment
- Explore model metrics
- Register and deploy model
You can create and run automated machine learning experiments in code using the Azure ML Python SDK or if you prefer a no code experience, you can also create your automated machine learning experiments in Azure Machine Learning Studio.
In this lab, we will use Automated Machine Learning to find the best performing binary classification model for predicting customer churn. We will do all of this from the Azure Machine Learning Studio without writing a single line of code.
-
In Azure portal, open the available machine learning workspace.
-
Select Launch now under the Try the new Azure Machine Learning studio message.
-
When you first launch the studio, you may need to set the directory and subscription. If so, you will see this screen:
For the directory, select Udacity and for the subscription, select Azure Sponsorship. For the machine learning workspace, you may see multiple options listed. Select any of these (it doesn't matter which) and then click Get started.
-
From the studio, select Datasets, + Create dataset, From web files. This will open the
Create dataset from web files
dialog on the right. -
In the Web URL field provide the following URL for the training data file:
https://introtomlsampledata.blob.core.windows.net/data/crm-churn/crm-churn.csv
-
Provide
CRM-Churn
as the Name, leave the remaining values at their defaults and select Next.
-
On the Settings and preview panel, set the column headers drop down to
All files have same headers
. -
Review the dataset and then select Next
-
From the studio home, select Create new, Automated ML run
-
This will open a
Create a new automated machine learning experiment
page
-
Provide an experiment name: Churn-Predictor
-
Select target column: Col1
-
Select compute target: select the available compute
-
Select Next
-
Select task type: Classification, and then select View additional configuration settings
-
This will open the
Additional configurations
dialog. -
Provide the following information and then select Save
- Primary metric: AUC weighted
- Exit criteria, Metric score threshold:
0.707
Note that we are setting a metric score threshold to limit the training time. In practice, for initial experiments, you will typically only set the training job time to allow AutoML to discover the best algorithm to use for your specific data.
-
The experiment will run for about 5 min
-
In the Details tab, observe the run status of the job.
-
Wait till the run status becomes Completed.
-
While you wait for the model training to complete, you can learn to view and understand the charts and metrics for your automated machine learning run by selecting Understand automated machine learning classification results.
-
From the
Details
tab review the best model'sAlgorithm name
and its correspondingAUC weighted
score. Next, select the best model's Algorithm name -
Select View all other metrics to review the various
Run Metrics
to evaluate the model performance. Next, select Metrics -
Select accuracy_table, Chart to review the various model performance curves, such as Precision-Recall, ROC, Calibration curve, and Gain & Lift curves.
Congratulations! You have trained and evaluated a binary classification model using automated machine learning. You can continue to experiment in the environment but are free to close the lab environment tab and return to the Udacity portal to continue with the lesson.