[DOCS] Add Data Visualizer to the ML Getting Started tutorial (elastic/x-pack-elasticsearch#3171)

* [DOCS] Refreshed ML screenshots * [DOCS] Added screenshots for ML Data Visualizer * [DOCS] Addressed feedback about data visualizer * [DOCS] Fixed typo in ML tutorial Original commit: elastic/x-pack-elasticsearch@2603536a93
2017-12-04 13:11:45 -08:00 · 2017-12-04 13:11:45 -08:00 · 0def4dfbf8
parent 6487557e61
commit 0def4dfbf8
10 changed files with 102 additions and 18 deletions
--- a/docs/en/ml/getting-started-single.asciidoc
+++ b/docs/en/ml/getting-started-single.asciidoc
@ -1,24 +1,8 @@
 [[ml-gs-jobs]]
 === Creating Single Metric Jobs

-Machine learning jobs contain the configuration information and metadata
-necessary to perform an analytical task. They also contain the results of the
-analytical task.
-
-[NOTE]
--
-This tutorial uses {kib} to create jobs and view results, but you can
-alternatively use APIs to accomplish most tasks.
-For API reference information, see {ref}/ml-apis.html[Machine Learning APIs].
-
-The {xpackml} features in {kib} use pop-ups. You must configure your
-web browser so that it does not block pop-up windows or create an
-exception for your {kib} URL.
--
-
-You can choose to create single metric, multi-metric, population, or advanced
-jobs in {kib}. At this point in the tutorial, the goal is to detect anomalies in
-the total requests received by your applications and services. The sample data
+At this point in the tutorial, the goal is to detect anomalies in the
+total requests received by your applications and services. The sample data
 contains a single key performance indicator(KPI) to track this, which is the total
 requests over time. It is therefore logical to start by creating a single metric
 job for this KPI.
--- a/docs/en/ml/getting-started-wizards.asciidoc
+++ b/docs/en/ml/getting-started-wizards.asciidoc
@ -0,0 +1,99 @@
+[[ml-gs-wizards]]
+=== Creating Jobs in {kib}
++++
+<titleabbrev>Creating Jobs</titleabbrev>
++++
+
+Machine learning jobs contain the configuration information and metadata
+necessary to perform an analytical task. They also contain the results of the
+analytical task.
+
+[NOTE]
+--
+This tutorial uses {kib} to create jobs and view results, but you can
+alternatively use APIs to accomplish most tasks.
+For API reference information, see {ref}/ml-apis.html[Machine Learning APIs].
+
+The {xpackml} features in {kib} use pop-ups. You must configure your
+web browser so that it does not block pop-up windows or create an
+exception for your {kib} URL.
+--
+
+{kib} provides wizards that help you create typical {ml} jobs. For example, you
+can use wizards to create single metric, multi-metric, population, and advanced
+jobs.
+
+To see the job creation wizards:
+
+. Open {kib} in your web browser and log in. If you are running {kib} locally,
+go to `http://localhost:5601/`.
+
+. Click **Machine Learning** in the side navigation.
+
+. Click **Create new job**.
+
+. Click the `server-metrics*` index pattern.
+
+You can then choose from a list of job wizards. For example:
+
+[role="screenshot"]
+image::images/ml-create-job.jpg["Job creation wizards in {kib}"]
+
+If you are not certain which wizard to use, there is also a **Data Visualizer**
+that can help you explore the fields in your data.
+
+To learn more about the sample data:
+
+. Click **Data Visualizer**. +
+
+--
+[role="screenshot"]
+image::images/ml-data-visualizer.jpg["Data Visualizer in {kib}"]
+--
+
+. Select a time period that you're interested in exploring by using the time
+picker in the {kib} toolbar. Alternatively, click
+**Use full server-metrics* data** to view data over the full time range. In this
+sample data, the documents relate to March and April 2017.
+
+. Optional: Change the number of documents per shard that are used in the
+visualizations. There is a relatively small number of documents in the sample
+data, so you can choose a value of `all`. For larger data sets, keep in mind
+that using a large sample size increases query run times and increases the load
+on the cluster.
+
+[role="screenshot"]
+image::images/ml-data-metrics.jpg["Data Visualizer output for metrics in {kib}"]
+
+The fields in the indices are listed in two sections.  The first section contains
+the numeric ("metric") fields. The second section contains non-metric fields
+(such as `keyword`, `text`, `date`, `boolean`, `ip`, and `geo_point` data types).
+
+For metric fields, the **Data Visualizer** indicates how many documents contain
+the field in the selected time period. It also provides information about the
+minimum, median, and maximum values, the number of distinct values, and their
+distribution. You can use the distribution chart to get a better idea of how
+the values in the data are clustered. Alternatively, you can view the top values
+for metric fields. For example:
+
+[role="screenshot"]
+image::images/ml-data-topmetrics.jpg["Data Visualizer output for top values in {kib}"]
+
+For date fields, the **Data Visualizer** provides the earliest and latest field
+values and the number and percentage of documents that contain the field
+during the selected time period. For example:
+
+[role="screenshot"]
+image::images/ml-data-dates.jpg["Data Visualizer output for date fields in {kib}"]
+
+For keyword fields, the **Data Visualizer** provides the number of distinct
+values, a list of the top values, and the number and percentage of documents
+that contain the field during the selected time period. For example:
+
+[role="screenshot"]
+image::images/ml-data-keywords.jpg["Data Visualizer output for date fields in {kib}"]
+
+In this tutorial, you will create single and multi-metric jobs that use the
+`total`, `response`, `service`, and `host` fields. Though there is an option to
+create an advanced job directly from the **Data Visualizer**, we will use the
+single and multi-metric job creation wizards instead.
--- a/docs/en/ml/getting-started.asciidoc
+++ b/docs/en/ml/getting-started.asciidoc
@ -75,6 +75,7 @@ significant changes to the system. You can alternatively assign the
 For more information, see <<built-in-roles>> and <<privileges-list-cluster>>.

 include::getting-started-data.asciidoc[]
+include::getting-started-wizards.asciidoc[]
 include::getting-started-single.asciidoc[]
 include::getting-started-multi.asciidoc[]
 include::getting-started-next.asciidoc[]
--- a/docs/en/ml/images/ml-create-job.jpg
+++ b/docs/en/ml/images/ml-create-job.jpg
--- a/docs/en/ml/images/ml-data-dates.jpg
+++ b/docs/en/ml/images/ml-data-dates.jpg
--- a/docs/en/ml/images/ml-data-keywords.jpg
+++ b/docs/en/ml/images/ml-data-keywords.jpg
--- a/docs/en/ml/images/ml-data-metrics.jpg
+++ b/docs/en/ml/images/ml-data-metrics.jpg
--- a/docs/en/ml/images/ml-data-topmetrics.jpg
+++ b/docs/en/ml/images/ml-data-topmetrics.jpg
--- a/docs/en/ml/images/ml-data-visualizer.jpg
+++ b/docs/en/ml/images/ml-data-visualizer.jpg
--- a/docs/en/ml/images/ml-kibana.jpg
+++ b/docs/en/ml/images/ml-kibana.jpg