diff --git a/docs/modeling-01.html b/docs/modeling-01.html index e2e7ca9..cd51ee1 100644 --- a/docs/modeling-01.html +++ b/docs/modeling-01.html @@ -217,6 +217,7 @@

On this page

  • 6.1 Predict with a data frame
  • 6.2 Predict with rasters
  • +
  • 7 Thinking about AUC
  • @@ -244,7 +245,7 @@

    Basic modeling

    1 Load data

    Here we load the observation and background data points. We add a column identifying the month of the year.

    -
    +
    source("setup.R")
     
     obs = sf::read_sf(file.path("data", "obs", "obs-covariates.gpkg")) |>
    @@ -268,7 +269,7 @@ 

    2.1 The input table

    Simply strip the spatial information from obs and bkg, select just the environmental covariates, and then row bind them together.

    -
    +
    input_obs = obs |>
       sf::st_drop_geometry() |> 
       dplyr::select(dplyr::all_of(c("sst", "u_wind", "v_wind"))) |>
    @@ -285,7 +286,7 @@ 

    2.2 The input vector

    The input vector must have a 1 for each observation row, and a 0 for each background row. Since we arranged to have all of the observations come first, we can easily make the vector with two calls to rep().

    -
    +
    input_vector = c( rep(1, nrow(input_obs)), rep(0, nrow(input_bkg)) )
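    As a quick sanity check (a sketch, not part of the original page), we can tabulate the vector to confirm that the counts of 1s and 0s match the observation and background row counts.

    table(input_vector)  # counts for 0 and 1 should equal nrow(input_bkg) and nrow(input_obs)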

    @@ -293,7 +294,7 @@

    3 Build the model

    Here we pass our inputs to the maxnet() function, leaving all of the optional arguments at their default values. Be sure to look over the docs for model construction - try ?maxnet.

    -
    +
    model = maxnet::maxnet(input_vector, input_table)

    That’s it. The returned object is of maxnet class; it’s essentially a list with all of the pertinent information required for subsequent use.
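    Since it is essentially a named list, we can peek inside with base R tools; a minimal sketch (the exact element names depend on the maxnet version):

    class(model)               # should include "maxnet"
    names(model)               # the elements maxnet stores for later prediction
    str(model, max.level = 1)  # a compact one-level overview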

    @@ -302,7 +303,7 @@

    4 Assess the model

    So what do we know about the model? Is it any good?

    One thing we can do is to plot what are called response curves. These show, for each parameter, how the model responds along the typical range of parameter values. We plot below the responses with type cloglog, which transforms the response value into the 0-1 range.

    -
    +
    plot(model, type = "cloglog")

    @@ -320,7 +321,7 @@

    v3.0, ... for the split model(s)

    The maxnetic package provides some convenience functions for working with maxnet models, including file storage functions.

    -
    +
    v1_path = file.path("data", "model", "v1", "v1.0")
     ok = dir.create(v1_path, recursive = TRUE, showWarnings = FALSE)
     maxnetic::write_maxnet(model, file.path(v1_path, "model_v1.0.rds"))
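    Reading a stored model back is the mirror image; a sketch assuming maxnetic pairs write_maxnet() with a read_maxnet() counterpart:

    # reload the model from disk (assumes maxnetic::read_maxnet() exists)
    model = maxnetic::read_maxnet(file.path(v1_path, "model_v1.0.rds"))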
    @@ -332,7 +333,7 @@

    6.1 Predict with a data frame

    Here we provide a data frame, in our case the original input data, to the predict() function with type cloglog, which transforms the response value into the 0-1 range.

    -
    +
    prediction = predict(model, input_table, type = 'cloglog')
     hist(prediction, xlab = "prediction", main = "Basic Model")
    @@ -341,10 +342,11 @@

    6.1.1 How did it do?

    -

    We can use some utilities in the maxnetic package to help us assess the model. First, we need to create a table with two columns: label and pred. Label is the simple a vector of 0/1 indicating that the predicted value is known to be either background or presence. We already have that in our input_vector. Pred is simple the 0-1 scale predicted value. Once we have that we can craft a receiver operator characteristic curve and compute it’s AUC.

    -
    -
    x = dplyr::tibble(label = input_vector, pred = as.vector(prediction))
    -plot_ROC(x, title = "v1.0 Basic Model")
    +

    We can use some utilities in the maxnetic package to help us assess the model. The pAUC() function will compute statistics, including a presence-only AUC value. We need to pass it two items - the universe of predictions and the predictions for just the presence points.

    +
    +
    ix = input_vector > 0
    +pauc = maxnetic::pAUC(prediction, prediction[ix])
    +plot(pauc, title = "v1.0 Basic Model")
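    The returned pauc object also carries the AUC value itself, which can be extracted directly (this same pauc$area element is used later in this document):

    pauc$area  # the presence-only AUC as a plain number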

    @@ -354,52 +356,33 @@

    6.2 Predict with rasters

    -

    We can also predict using raster inputs using our basic model. Let’s read in rasters for each month of 2018, and then run a prediction for each month.

    -
    +

    We can also predict with raster inputs using our basic model. Let’s read in rasters for each month of 2019, and then run a prediction for each month.

    +

    We provide a function read_predictors() that will read and bind the rasters together for you given the filtered databases and paths. So, first we define the paths and filter the databases to point to just the months in 2019.

    +
    dates = as.Date(c("2019-01-01", "2019-12-31"))
     
     sst_path = "data/oisst"
     sst_db = oisster::read_database(sst_path) |>
       dplyr::arrange(date) |>
       dplyr::filter(dplyr::between(date, dates[1], dates[2]))
    -  
    -
    -sst = sst_db |>
    -  oisster::compose_filename(path = sst_path) |>
    -  stars::read_stars(along = list(time = sst_db$date)) |>
    -  rlang::set_names("sst")|>
    -  st_to_180()
    -
    -
    -wind_path = "data/nbs"
    -wind_db = nbs::read_database(wind_path) |>
    -  dplyr::arrange(date)|>
    +
    +wind_path = "data/nbs"
    +wind_db = nbs::read_database(wind_path) |>
    +  dplyr::arrange(date)|>
    +  dplyr::filter(dplyr::between(date, dates[1], dates[2]))
    +
    +u_wind_db = wind_db |>
    +  dplyr::filter(param == "u_wind")|>
    +  dplyr::filter(dplyr::between(date, dates[1], dates[2]))
    +
    +v_wind_db = wind_db |>
    +  dplyr::filter(param == "v_wind")|>
       dplyr::filter(dplyr::between(date, dates[1], dates[2]))
     
    -u_wind_db = wind_db |>
    -  dplyr::filter(param == "u_wind")|>
    -  dplyr::filter(dplyr::between(date, dates[1], dates[2]))
    -u_wind = u_wind_db |>
    -  nbs::compose_filename(path = wind_path) |>
    -  stars::read_stars(along = list(time = u_wind_db$date)) |>
    -  rlang::set_names("u_wind") |>
    -  st_to_180()
    -
    -v_wind_db = wind_db |>
    -  dplyr::filter(param == "v_wind")|>
    -  dplyr::filter(dplyr::between(date, dates[1], dates[2]))
    -v_wind = v_wind_db |>
    -  nbs::compose_filename(path = wind_path) |>
    -  stars::read_stars(along = list(time = v_wind_db$date)) |>
    -  rlang::set_names("v_wind") |>
    -  st_to_180()
    -
    -

    Once we have them in hand we need to bind them together. But we need to attend to a common but important issue. The sst rasters and windspeed rasters have different extents. We can’t bind them together until we warp one set to match the other. Let’s warp sst to match u_wind. And then we can bind them together.

    -
    -
    sst_warped = stars::st_warp(sst, u_wind)
    -x = list(sst_warped, u_wind, v_wind)
    -predictors = do.call(c, append(x, list(along = NA_integer_))) 
    -predictors
    +predictors = read_predictors(sst_db = sst_db,
    +                             u_wind_db = u_wind_db,
    +                             v_wind_db = v_wind_db)
    +predictors
    stars object with 3 dimensions and 3 attributes
     attribute(s):
    @@ -414,10 +397,10 @@ 

    -

    Now we can run the prediction.

    -
    -
    pred = predict(model, predictors, type = 'cloglog')
    -pred
    +

    You can see that we have the rasters in one object with three attributes (sst, u_wind and v_wind), each with 12 layers (Jan 2019 - Dec 2019). Now we can run the prediction.

    +
    +
    pred = predict(model, predictors, type = 'cloglog')
    +pred
    stars object with 3 dimensions and 1 attribute
     attribute(s):
    @@ -431,25 +414,25 @@ 

    Since we get a spatially mapped prediction back, we can plot it.

    -
    -
    coast = rnaturalearth::ne_coastline(scale = 'large', returnclass = 'sf') |>
    -  sf::st_crop(pred)
    +
    +
    coast = rnaturalearth::ne_coastline(scale = 'large', returnclass = 'sf') |>
    +  sf::st_crop(pred)
    Warning: attribute variables are assumed to be spatially constant throughout
     all geometries
    -
    plot_coast = function() {
    -  plot(sf::st_geometry(coast), col = 'green', add = TRUE)
    -}
    -plot(pred, hook = plot_coast)
    +
    plot_coast = function() {
    +  plot(sf::st_geometry(coast), col = 'green', add = TRUE)
    +}
    +plot(pred, hook = plot_coast)
    -

    +

    Well, that certainly looks appealing, with a higher likelihood of near-shore observations occurring during the warmer months.

    6.2.1 How did it do?

    -

    To compute an ROC and AUC for each month, we have a little bit of work to do. We need to extract the observations and background for each month from the prediction maps. These we can then pass to the plot_ROC() function.

    +

    To compute an ROC and AUC for each month, we have a little bit of work to do. We need to extract the observation locations for each month from the prediction maps. These we can then plot.

    @@ -463,26 +446,19 @@

    We have to modify the date for each point to be the first date of each month. That’s because our predictors are monthlies.

    -
    -
    test_obs = obs |>
    -  dplyr::filter(dplyr::between(date, dates[1], dates[2])) |>
    -  dplyr::select(dplyr::all_of("date")) |>
    -  dplyr::mutate(date = oisster::current_month(date))
    -
    -test_bkg = bkg |>
    -  dplyr::filter(dplyr::between(date, dates[1], dates[2])) |>
    -  dplyr::select(dplyr::all_of("date")) |>
    -  dplyr::mutate(date = oisster::current_month(date))
    -
    -test_input = dplyr::bind_rows(test_obs, test_bkg)
    -
    -x = stars::st_extract(pred, test_input, time_column = 'date') |>
    -  print()
    +
    +
    test_obs = obs |>
    +  dplyr::filter(dplyr::between(date, dates[1], dates[2])) |>
    +  dplyr::select(dplyr::all_of("date")) |>
    +  dplyr::mutate(date = oisster::current_month(date))
    +
    +x = stars::st_extract(pred, test_obs, time_column = 'date') |>
    +  print()
    -
    Simple feature collection with 1537 features and 3 fields
    +
    Simple feature collection with 612 features and 3 fields
     Geometry type: POINT
     Dimension:     XY
    -Bounding box:  xmin: -75.99915 ymin: 35.01635 xmax: -58.83057 ymax: 45.95233
    +Bounding box:  xmin: -75.7589 ymin: 35.1211 xmax: -65.48274 ymax: 43.83954
     Geodetic CRS:  WGS 84
     First 10 features:
             pred       time       date                   geometry
    @@ -499,66 +475,125 @@ 

    Finally we can build a table of the predictions. We are going to add the name of the month so that we can group by month.

    -
    -
    y = x |>
    -  dplyr::mutate(label = c(rep(1, nrow(test_obs)), rep(0, nrow(test_bkg))),
    -                month = factor(format(date, "%b"), levels = month.abb), 
    -                .before = 2) |>
    -  sf::st_drop_geometry() |>
    -  dplyr::select(dplyr::all_of(c("month", "label", "pred"))) |>
    -  dplyr::group_by(month) 
    -
    -dplyr::count(y, month, label) |>
    -  print(n = 24)
    +
    +
    y = x |>
    +  dplyr::mutate(month = factor(format(date, "%b"), levels = month.abb), 
    +                .before = 1) |>
    +  dplyr::select(dplyr::all_of(c("month", "pred", "date"))) |>
    +  dplyr::group_by(month) 
    +
    +dplyr::count(y, month) |>
    +  print(n = 12)
    -
    # A tibble: 24 × 3
    -# Groups:   month [12]
    -   month label     n
    -   <fct> <dbl> <int>
    - 1 Jan       0    36
    - 2 Jan       1    21
    - 3 Feb       0    15
    - 4 Feb       1     7
    - 5 Mar       0    46
    - 6 Mar       1    23
    - 7 Apr       0   259
    - 8 Apr       1   169
    - 9 May       0   182
    -10 May       1   119
    -11 Jun       0    73
    -12 Jun       1    53
    -13 Jul       0    76
    -14 Jul       1    48
    -15 Aug       0    46
    -16 Aug       1    39
    -17 Sep       0    48
    -18 Sep       1    21
    -19 Oct       0   102
    -20 Oct       1    79
    -21 Nov       0    27
    -22 Nov       1    19
    -23 Dec       0    15
    -24 Dec       1    14
    +
    Simple feature collection with 12 features and 2 fields
    +Geometry type: MULTIPOINT
    +Dimension:     XY
    +Bounding box:  xmin: -75.7589 ymin: 35.1211 xmax: -65.48274 ymax: 43.83954
    +Geodetic CRS:  WGS 84
    +# A tibble: 12 × 3
    +   month     n                                                          geometry
    + * <fct> <int>                                                  <MULTIPOINT [°]>
    + 1 Jan      21 ((-74.63902 36.26849), (-75.01758 36.49984), (-75.01801 36.72554…
    + 2 Feb       7 ((-74.52432 37.24967), (-74.45561 37.16891), (-74.74373 36.72355…
    + 3 Mar      23 ((-74.53117 36.26996), (-74.60195 36.72201), (-74.67127 36.72266…
    + 4 Apr     169 ((-72.924 38.6733), (-73.0165 38.591), (-73.0036 38.56), (-73.10…
    + 5 May     119 ((-74.56571 35.6059), (-75.2181 35.1934), (-75.3228 35.535), (-7…
    + 6 Jun      53 ((-73.10608 38.72575), (-74.86204 36.27105), (-75.04656 36.34824…
    + 7 Jul      48 ((-74.53554 36.19828), (-74.91756 36.27104), (-75.10905 36.27065…
    + 8 Aug      39 ((-72.78628 38.68677), (-72.98868 38.61241), (-74.9889 36.2911),…
    + 9 Sep      21 ((-75.3167 36.0439), (-75.5204 36.3294), (-75.5519 36.1854), (-7…
    +10 Oct      79 ((-67.06445 42.91399), (-68.43614 43.83954), (-69.14391 43.16967…
    +11 Nov      19 ((-72.52681 39.21286), (-71.54966 39.99385), (-67.79606 40.36107…
    +12 Dec      14 ((-75.242 35.2705), (-75.3335 35.3027), (-75.436 35.1211), (-75.…

    Now how about one ROC plot for each month? Yikes! This requires an iterative approach, using group_map(), to compute the ROC for each month. We then wrap the plots together using the patchwork package.

    -
    -
    rocs = dplyr::group_map(y, 
    -  function(tbl, key){
    -    maxnetic::plot_ROC(tbl, title = sprintf("%s, n = %i", key$month, nrow(tbl)), 
    -                                            xlab = "", ylab = "")
    -  })
    -
    -patchwork::wrap_plots(rocs, ncol = 4)
    +
    +
    paucs = dplyr::group_map(y, 
    +  function(tbl, key, pred_rasters = NULL){
    +    ix = month.abb %in% key$month
    +    x = dplyr::slice(pred_rasters, "time", ix)
    +    pauc = maxnetic::pAUC(x, tbl)
    +    plot(pauc,title = key$month, 
    +         xlab = "", ylab = "")
    +  }, pred_rasters = pred)
    +
    +patchwork::wrap_plots(paucs, ncol = 4)
    -

    +

    -

    Hmmm. That’s surprising, yes? Why during the summer months does our AUC go down. In fact, at times we are predicting the likelihood of not having an observation reported. It’s hard to know what to think, but consider that we are using a model generated across all months of multiple years and it might not predict a particular month and year very well. A step toward refinement, our next step is to make 12 models, one for each month.

    - - +

    Hmmm. That’s surprising, yes? Why during the summer months does our AUC go down when we have the most observations? That seems counterintuitive.

    +
    +
    +

    7 Thinking about AUC

    +

    AUC is a diagnostic that provides a peek into the predictive power of a model. But what is it? An analogy is fitting a straight line to a small set of observations versus a large set of observations and then comparing the correlation coefficients. Here’s a simple example using R’s built-in dataset cars, which is a data frame of 50 observations of speed and stopping distances of cars. We’ll compute a linear model for the entire data set, and then a second for a small subsample of the data. (Learn more about linear models in R here.)

    +
    +
    data("cars")
    +cars = dplyr::as_tibble(cars)
    +
    +all_fit = lm(dist ~ speed, data = cars)
    +summary(all_fit)
    +
    +
    
    +Call:
    +lm(formula = dist ~ speed, data = cars)
    +
    +Residuals:
    +    Min      1Q  Median      3Q     Max 
    +-29.069  -9.525  -2.272   9.215  43.201 
    +
    +Coefficients:
    +            Estimate Std. Error t value Pr(>|t|)    
    +(Intercept) -17.5791     6.7584  -2.601   0.0123 *  
    +speed         3.9324     0.4155   9.464 1.49e-12 ***
    +---
    +Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
    +
    +Residual standard error: 15.38 on 48 degrees of freedom
    +Multiple R-squared:  0.6511,    Adjusted R-squared:  0.6438 
    +F-statistic: 89.57 on 1 and 48 DF,  p-value: 1.49e-12
    +
    +
    +
    +
    set.seed(5)
    +sub_cars = dplyr::slice_sample(cars, n = 3)
    +sub_fit = lm(dist ~ speed, data = sub_cars)
    +summary(sub_fit)
    +
    +
    
    +Call:
    +lm(formula = dist ~ speed, data = sub_cars)
    +
    +Residuals:
    + 1  2  3 
    + 3  3 -6 
    +
    +Coefficients:
    +            Estimate Std. Error t value Pr(>|t|)
    +(Intercept)  -6.5000     8.8741  -0.732    0.598
    +speed         3.3750     0.6495   5.196    0.121
    +
    +Residual standard error: 7.348 on 1 degrees of freedom
    +Multiple R-squared:  0.9643,    Adjusted R-squared:  0.9286 
    +F-statistic:    27 on 1 and 1 DF,  p-value: 0.121
    +
    +
    +

    You can see that the r² value is quite high for the smaller data set, but the model may not be predictive over the full range of data. AUC is somewhat analogous to r² in that a relatively low score does not necessarily suggest a poor model.

    +
    +
    ggplot2::ggplot(data = cars, ggplot2::aes(x = speed, y = dist)) +
    +  ggplot2::geom_point(color = "blue") +
    +  ggplot2::geom_abline(slope = coef(all_fit)[2], intercept = coef(all_fit)[1], color = "blue") + 
    +  ggplot2::geom_point(data = sub_cars, ggplot2::aes(x = speed, y = dist), color = "orange") +
    +  ggplot2::geom_abline(slope = coef(sub_fit)[2], intercept = coef(sub_fit)[1], color = "orange")
    +
    +

    +
    +
    + +
    Back to top diff --git a/docs/modeling-01_files/figure-html/unnamed-chunk-12-1.png b/docs/modeling-01_files/figure-html/unnamed-chunk-11-1.png similarity index 100% rename from docs/modeling-01_files/figure-html/unnamed-chunk-12-1.png rename to docs/modeling-01_files/figure-html/unnamed-chunk-11-1.png diff --git a/docs/modeling-01_files/figure-html/unnamed-chunk-14-1.png b/docs/modeling-01_files/figure-html/unnamed-chunk-14-1.png new file mode 100644 index 0000000..6059e6c Binary files /dev/null and b/docs/modeling-01_files/figure-html/unnamed-chunk-14-1.png differ diff --git a/docs/modeling-01_files/figure-html/unnamed-chunk-15-1.png b/docs/modeling-01_files/figure-html/unnamed-chunk-15-1.png deleted file mode 100644 index c76ba3c..0000000 Binary files a/docs/modeling-01_files/figure-html/unnamed-chunk-15-1.png and /dev/null differ diff --git a/docs/modeling-01_files/figure-html/unnamed-chunk-17-1.png b/docs/modeling-01_files/figure-html/unnamed-chunk-17-1.png new file mode 100644 index 0000000..33eebe3 Binary files /dev/null and b/docs/modeling-01_files/figure-html/unnamed-chunk-17-1.png differ diff --git a/docs/modeling-01_files/figure-html/unnamed-chunk-8-1.png b/docs/modeling-01_files/figure-html/unnamed-chunk-8-1.png index 87068ba..1102229 100644 Binary files a/docs/modeling-01_files/figure-html/unnamed-chunk-8-1.png and b/docs/modeling-01_files/figure-html/unnamed-chunk-8-1.png differ diff --git a/docs/modeling-02.html b/docs/modeling-02.html index 2430785..17f7428 100644 --- a/docs/modeling-02.html +++ b/docs/modeling-02.html @@ -282,7 +282,8 @@

    So the colder months have fewer observations than the warmer months. We already knew that, but it will be interesting to see how that manifests itself in the models.

    2.1 Build the monthly models

    -
    +

    Since we are building 12 models (rather than one), it is useful to create a function that computes a model for any month, and then iterate through the months of the year.

    +
    # A function for making one month's model
     #
     # @param tbl a data frame of one month's observations
    @@ -324,6 +325,13 @@ 

    dplyr::group_map(model_month, bkg = bkg, path = path) |> rlang::set_names(levels(obs$month))

    +

    We can look at the response plots for every month, but for demonstration purposes, we’ll just show one month.

    +
    +
    plot(models[['Jun']], type = 'cloglog')
    +
    +

    +
    +
    @@ -332,116 +340,112 @@

    3.1 Load the raster databases (sst and u_wind and v_wind)

    We also make sure they are in date order and add a “month” variable to each.

    -
    -
    sst_path = "data/oisst"
    -sst_db = oisster::read_database(sst_path) |>
    -  dplyr::arrange(date) |>
    -  dplyr::mutate(month = format(date, "%b"))
    -  
    -
    -wind_path = "data/nbs"
    -wind_db = nbs::read_database(wind_path) |>
    -  dplyr::arrange(date)|>
    -  dplyr::mutate(month = format(date, "%b"))
    -
    -u_wind_db = wind_db |>
    -  dplyr::filter(param == "u_wind")
    -
    -v_wind_db = wind_db |>
    -  dplyr::filter(param == "v_wind")
    +
    +
    sst_path = "data/oisst"
    +sst_db = oisster::read_database(sst_path) |>
    +  dplyr::arrange(date) |>
    +  dplyr::mutate(month = format(date, "%b"))
    +  
    +
    +wind_path = "data/nbs"
    +wind_db = nbs::read_database(wind_path) |>
    +  dplyr::arrange(date)|>
    +  dplyr::mutate(month = format(date, "%b"))
    +
    +u_wind_db = wind_db |>
    +  dplyr::filter(param == "u_wind")
    +
    +v_wind_db = wind_db |>
    +  dplyr::filter(param == "v_wind")

    3.2 Iterate through the months making predictions

    Now we can build an iterator function that will make a prediction for each month. Let’s narrow our predictions to just those for a particular year, 2019, and read the rasters in all at once.

    -
    -
    dates = as.Date(c("2019-01-01", "2019-12-31"))
    -x = read_predictors(
    -  sst_db = dplyr::filter(sst_db, dplyr::between(date, dates[1], dates[2])),
    -  u_wind_db = dplyr::filter(u_wind_db, dplyr::between(date, dates[1], dates[2])),
    -  v_wind_db = dplyr::filter(v_wind_db, dplyr::between(date, dates[1], dates[2]))
    -)
    +
    +
    dates = as.Date(c("2019-01-01", "2019-12-31"))
    +x = read_predictors(
    +  sst_db = dplyr::filter(sst_db, dplyr::between(date, dates[1], dates[2])),
    +  u_wind_db = dplyr::filter(u_wind_db, dplyr::between(date, dates[1], dates[2])),
    +  v_wind_db = dplyr::filter(v_wind_db, dplyr::between(date, dates[1], dates[2]))
    +)

    Now we can iterate through the months.

    -
    -
    date_sequence = seq(from = dates[1], to = dates[2], by = "month")
    -pred_rasters = lapply(names(models),
    -  function(mon){
    -    ix = which(month.abb %in% mon)
    -    predict(models[[mon]], dplyr::slice(x, time, ix, drop), type = "cloglog")
    -  }) 
    -pred_rasters = do.call(c, append(pred_rasters, list(along = list(time = date_sequence))))
    +
    +
    date_sequence = seq(from = dates[1], to = dates[2], by = "month")
    +pred_rasters = lapply(names(models),
    +  function(mon){
    +    ix = which(month.abb %in% mon)
    +    predict(models[[mon]], dplyr::slice(x, time, ix, drop), type = "cloglog")
    +  }) 
    +pred_rasters = do.call(c, append(pred_rasters, list(along = list(time = date_sequence))))

    Let’s plot them.

    -
    -
    coast = rnaturalearth::ne_coastline(scale = 'large', returnclass = 'sf') |>
    -  sf::st_geometry() |>
    -  sf::st_crop(pred_rasters)
    -
    -plot_coast = function() {
    -  plot(coast, col = 'green', add = TRUE)
    -}
    -plot(pred_rasters |> st_to_180(), hook = plot_coast)
    +
    +
    coast = rnaturalearth::ne_coastline(scale = 'large', returnclass = 'sf') |>
    +  sf::st_geometry() |>
    +  sf::st_crop(pred_rasters)
    +
    +plot_coast = function() {
    +  plot(coast, col = 'green', add = TRUE)
    +}
    +plot(pred_rasters, hook = plot_coast)
    -

    +

    -

    Let’s see what we can discern from the predict abilities. We can extract the predicted values at the observed locations.

    -
    -
    pred_obs = stars::st_extract(pred_rasters, 
    -                             dplyr::filter(obs, dplyr::between(date, dates[1], dates[2])),
    -                             time_column = "month_id") |>
    -  sf::st_drop_geometry() 
    -pred_bkg = stars::st_extract(pred_rasters, 
    -                             dplyr::filter(bkg, dplyr::between(date, dates[1], dates[2])),
    -                             time_column = "month_id") |>
    -  sf::st_drop_geometry() 
    -
    -preds = dplyr::bind_rows(pred_obs, pred_bkg) |>
    -  dplyr::mutate(label = c(rep(1, nrow(pred_obs)), rep(0, nrow(pred_bkg))), .before = 1) |>
    -  dplyr::mutate(month = factor(format(time, "%b"), levels = month.abb)) |>
    -  dplyr::group_by(month)
    -
    -
    -aucs = dplyr::group_map(preds,
    -                        function(x, y) {
    -                          dplyr::tibble(month = y$month, auc = maxnetic::AUC(x))
    -                        }) |>
    -  dplyr::bind_rows() |>
    -  dplyr::right_join(counts, by = "month") |>
    -  print(n=12)
    +

    Let’s see what we can discern about the model’s predictive ability. We can extract the predicted values at the observed locations. Having those in hand allows us to compute pAUC for each month.

    +
    +
    pred_obs = stars::st_extract(pred_rasters, 
    +                             dplyr::filter(obs, dplyr::between(date, dates[1], dates[2])),
    +                             time_column = "month_id") |>
    +  dplyr::mutate(month = factor(format(month_id, "%b"), levels = month.abb)) |>
    +  dplyr::group_by(month)
    +
    +paucs = dplyr::group_map(pred_obs,
    +                        function(x, y) {
    +                          ix = month.abb %in% y$month
    +                          s = dplyr::slice(pred_rasters, "time", ix)
    +                          pauc = maxnetic::pAUC(s,x)
    +                          dplyr::tibble(month = y$month, 
    +                                        auc = pauc$area,
    +                                        pauc = list(pauc))
    +                        })|>
    +  dplyr::bind_rows() |>
    +  print(n = 12)
    -
    # A tibble: 12 × 4
    -   month   auc n_obs n_bkg
    -   <fct> <dbl> <int> <int>
    - 1 Jan   0.987    33    51
    - 2 Feb   0.876    40    57
    - 3 Mar   0.957    50    79
    - 4 Apr   0.897   341   528
    - 5 May   0.888   541   943
    - 6 Jun   0.547  2137  3471
    - 7 Jul   0.376  2108  3233
    - 8 Aug   0.588  1698  2597
    - 9 Sep   0.742   725  1205
    -10 Oct   0.797   328   485
    -11 Nov   0.873   494   739
    -12 Dec   0.995    66    90
    +
    # A tibble: 12 × 3
    +   month   auc pauc      
    +   <fct> <dbl> <list>    
    + 1 Jan   0.703 <pAUC [3]>
    + 2 Feb   0.689 <pAUC [3]>
    + 3 Mar   0.698 <pAUC [3]>
    + 4 Apr   0.677 <pAUC [3]>
    + 5 May   0.654 <pAUC [3]>
    + 6 Jun   0.662 <pAUC [3]>
    + 7 Jul   0.665 <pAUC [3]>
    + 8 Aug   0.696 <pAUC [3]>
    + 9 Sep   0.663 <pAUC [3]>
    +10 Oct   0.633 <pAUC [3]>
    +11 Nov   0.627 <pAUC [3]>
    +12 Dec   0.665 <pAUC [3]>
    -

    OK, that’s unexpected. The months with the lower counts of observations have relatively higher AUCs. Huh? Let’s look at that graphically.

    -
    -
    aucs_long = tidyr::pivot_longer(aucs, dplyr::all_of(c("n_obs", "n_bkg")),
    -                           names_to = "type", values_to = "count") |>
    -  dplyr::mutate(type = dplyr::recode(type, n_obs = "obs", n_bkg = "bkg"))
    -
    -ggplot2::ggplot(data = aucs_long, aes(x = count, y = auc, color = type)) +
    -  ggplot2::geom_point() + 
    -  ggplot2::geom_smooth(method='lm', formula= y~x)
    +

    Note that the last element, pauc, is the result returned by the maxnetic::pAUC() function, which we can plot.

    +
    +
    pp = paucs |>
    +  dplyr::group_by(month) |>
    +  dplyr::group_map(
    +    function(tbl, key){
    +      plot(tbl$pauc[[1]], title = key$month, xlab = "", ylab = "")
    +    }
    +  )
    +patchwork::wrap_plots(pp, ncol = 4)
    -

    +

    -

    Surprised? Could this be overfitting resulting from sampling background in time weighted to the months when we have observations? Hmmmm.

    +

    Well, it would be easy to become dispirited by this result. It would be reasonable to expect AUC values to improve if we built monthly models rather than a single model applied to any month. But it seems to not be so. Darn!

    diff --git a/docs/modeling-02_files/figure-html/unnamed-chunk-10-1.png b/docs/modeling-02_files/figure-html/unnamed-chunk-10-1.png new file mode 100644 index 0000000..fcdcd3e Binary files /dev/null and b/docs/modeling-02_files/figure-html/unnamed-chunk-10-1.png differ diff --git a/docs/modeling-02_files/figure-html/unnamed-chunk-4-1.png b/docs/modeling-02_files/figure-html/unnamed-chunk-4-1.png new file mode 100644 index 0000000..c1388b1 Binary files /dev/null and b/docs/modeling-02_files/figure-html/unnamed-chunk-4-1.png differ diff --git a/docs/modeling-02_files/figure-html/unnamed-chunk-7-1.png b/docs/modeling-02_files/figure-html/unnamed-chunk-7-1.png deleted file mode 100644 index df52a54..0000000 Binary files a/docs/modeling-02_files/figure-html/unnamed-chunk-7-1.png and /dev/null differ diff --git a/docs/modeling-02_files/figure-html/unnamed-chunk-8-1.png b/docs/modeling-02_files/figure-html/unnamed-chunk-8-1.png new file mode 100644 index 0000000..6b8c08d Binary files /dev/null and b/docs/modeling-02_files/figure-html/unnamed-chunk-8-1.png differ diff --git a/docs/modeling-02_files/figure-html/unnamed-chunk-9-1.png b/docs/modeling-02_files/figure-html/unnamed-chunk-9-1.png deleted file mode 100644 index 03d16aa..0000000 Binary files a/docs/modeling-02_files/figure-html/unnamed-chunk-9-1.png and /dev/null differ diff --git a/docs/search.json b/docs/search.json index 2f62830..cbd7a91 100644 --- a/docs/search.json +++ b/docs/search.json @@ -53,7 +53,14 @@ "href": "modeling-01.html#make-a-prediction", "title": "Basic modeling", "section": "6 Make a prediction", - "text": "6 Make a prediction\nNow we can make predictions with our basic model. We’ll do it two ways. First by simply feeding the input data used to create the model into the prediction. This might seems a bit circular, but it is perfectly reasonable to see how the model does on already labeled data. Second we’ll make a prediction for each month in 2020 using raster data.\n\n6.1 Predict with a data frame\nHere we provide a data frame, in our case the original input data, to the predict() function with type cloglog which transform the response value into the 0-1 range.\n\nprediction = predict(model, input_table, type = 'cloglog')\nhist(prediction, xlab = \"prediction\", main = \"Basic Model\")\n\n\n\n\n\n6.1.1 How did it do?\nWe can use some utilities in the maxnetic package to help us assess the model. First, we need to create a table with two columns: label and pred. Label is the simple a vector of 0/1 indicating that the predicted value is known to be either background or presence. We already have that in our input_vector. Pred is simple the 0-1 scale predicted value. Once we have that we can craft a receiver operator characteristic curve and compute it’s AUC.\n\nx = dplyr::tibble(label = input_vector, pred = as.vector(prediction))\nplot_ROC(x, title = \"v1.0 Basic Model\")\n\n\n\n\nOverall, this is telling us that the model isn’t especially strong as a prediction tool, but it is much better than a 50-50 guess (that’s when AUC is close to 0.5, and the curve follows the light grey line). Learn more about ROC and AUC here.\n\n\n\n6.2 Predict with rasters\nWe can also predict using raster inputs using our basic model. 
Let’s read in rasters for each month of 2018, and then run a prediction for each month.\n\ndates = as.Date(c(\"2019-01-01\", \"2019-12-31\"))\n\nsst_path = \"data/oisst\"\nsst_db = oisster::read_database(sst_path) |>\n dplyr::arrange(date) |>\n dplyr::filter(dplyr::between(date, dates[1], dates[2]))\n \n\nsst = sst_db |>\n oisster::compose_filename(path = sst_path) |>\n stars::read_stars(along = list(time = sst_db$date)) |>\n rlang::set_names(\"sst\")|>\n st_to_180()\n\n\nwind_path = \"data/nbs\"\nwind_db = nbs::read_database(wind_path) |>\n dplyr::arrange(date)|>\n dplyr::filter(dplyr::between(date, dates[1], dates[2]))\n\nu_wind_db = wind_db |>\n dplyr::filter(param == \"u_wind\")|>\n dplyr::filter(dplyr::between(date, dates[1], dates[2]))\nu_wind = u_wind_db |>\n nbs::compose_filename(path = wind_path) |>\n stars::read_stars(along = list(time = u_wind_db$date)) |>\n rlang::set_names(\"u_wind\") |>\n st_to_180()\n\nv_wind_db = wind_db |>\n dplyr::filter(param == \"v_wind\")|>\n dplyr::filter(dplyr::between(date, dates[1], dates[2]))\nv_wind = v_wind_db |>\n nbs::compose_filename(path = wind_path) |>\n stars::read_stars(along = list(time = v_wind_db$date)) |>\n rlang::set_names(\"v_wind\") |>\n st_to_180()\n\nOnce we have them in hand we need to bind them together. But we need to attend to common but important issue. The sst rasters and windspeed rasters have different extents. We can’t bind them together until we warp one set to match the other. Let’s warp sst to match u_wind. And then we can bind them together.\n\nsst_warped = stars::st_warp(sst, u_wind)\nx = list(sst_warped, u_wind, v_wind)\npredictors = do.call(c, append(x, list(along = NA_integer_))) \npredictors\n\nstars object with 3 dimensions and 3 attributes\nattribute(s):\n Min. 1st Qu. Median Mean 3rd Qu. Max. NA's\nsst -1.558928 12.528449 19.5220385 17.6005908 23.501083 29.216452 11352\nu_wind -2.692028 1.144244 2.7007004 2.7228278 4.115177 13.148120 7612\nv_wind -5.431324 -1.411349 -0.3202622 -0.1398384 1.106175 4.636874 7612\ndimension(s):\n from to offset delta refsys point values x/y\nx 1 74 -76.38 0.25 WGS 84 FALSE NULL [x]\ny 1 46 46.12 -0.25 WGS 84 FALSE NULL [y]\ntime 1 12 NA NA Date NA 2019-01-01,...,2019-12-01 \n\n\nNow we can run the prediction.\n\npred = predict(model, predictors, type = 'cloglog')\npred\n\nstars object with 3 dimensions and 1 attribute\nattribute(s):\n Min. 1st Qu. Median Mean 3rd Qu. Max. NA's\npred 0.0001196393 0.1200618 0.2675931 0.3033565 0.4398977 0.8816952 11487\ndimension(s):\n from to offset delta refsys point values x/y\nx 1 74 -76.38 0.25 WGS 84 FALSE NULL [x]\ny 1 46 46.12 -0.25 WGS 84 FALSE NULL [y]\ntime 1 12 NA NA Date NA 2019-01-01,...,2019-12-01 \n\n\nSince we get a spatially mapped prediction back, we can plot it.\n\ncoast = rnaturalearth::ne_coastline(scale = 'large', returnclass = 'sf') |>\n sf::st_crop(pred)\n\nWarning: attribute variables are assumed to be spatially constant throughout\nall geometries\n\nplot_coast = function() {\n plot(sf::st_geometry(coast), col = 'green', add = TRUE)\n}\nplot(pred, hook = plot_coast)\n\n\n\n\nWell, that certainly looks appealing with higher likelihood of near shore observations occurring during the warmer months.\n\n6.2.1 How did it do?\nTo compute an ROC and AUC for each month, we have a little bit of work to do. We need to extract the observations and background for each month from the prediction maps. 
These we can then pass to the plot_ROC() function.\n\n\n\n\n\n\nNote\n\n\n\nWe have to modify the date for each point to be the first date of each month. That’s because our predictors are monthlies.\n\n\n\ntest_obs = obs |>\n dplyr::filter(dplyr::between(date, dates[1], dates[2])) |>\n dplyr::select(dplyr::all_of(\"date\")) |>\n dplyr::mutate(date = oisster::current_month(date))\n\ntest_bkg = bkg |>\n dplyr::filter(dplyr::between(date, dates[1], dates[2])) |>\n dplyr::select(dplyr::all_of(\"date\")) |>\n dplyr::mutate(date = oisster::current_month(date))\n\ntest_input = dplyr::bind_rows(test_obs, test_bkg)\n\nx = stars::st_extract(pred, test_input, time_column = 'date') |>\n print()\n\nSimple feature collection with 1537 features and 3 fields\nGeometry type: POINT\nDimension: XY\nBounding box: xmin: -75.99915 ymin: 35.01635 xmax: -58.83057 ymax: 45.95233\nGeodetic CRS: WGS 84\nFirst 10 features:\n pred time date geometry\n1 0.2759255 2019-05-01 2019-05-01 POINT (-67.32935 40.42509)\n2 0.7245142 2019-03-01 2019-03-01 POINT (-74.41057 36.49908)\n3 0.6664676 2019-12-01 2019-12-01 POINT (-75.3994 35.9457)\n4 0.4536477 2019-06-01 2019-06-01 POINT (-75.10864 36.94806)\n5 0.6864945 2019-04-01 2019-04-01 POINT (-74.49892 36.57275)\n6 0.3105710 2019-09-01 2019-09-01 POINT (-75.5519 36.1854)\n7 0.3874695 2019-09-01 2019-09-01 POINT (-73.6245 40.3317)\n8 0.3785449 2019-04-01 2019-04-01 POINT (-69.04389 39.82132)\n9 0.7447747 2019-04-01 2019-04-01 POINT (-74.59436 36.87291)\n10 0.7447747 2019-04-01 2019-04-01 POINT (-74.45753 36.72279)\n\n\nFinally we can build a table that merges the prediction with the labels. We are going to add the name of the month to group by that.\n\ny = x |>\n dplyr::mutate(label = c(rep(1, nrow(test_obs)), rep(0, nrow(test_bkg))),\n month = factor(format(date, \"%b\"), levels = month.abb), \n .before = 2) |>\n sf::st_drop_geometry() |>\n dplyr::select(dplyr::all_of(c(\"month\", \"label\", \"pred\"))) |>\n dplyr::group_by(month) \n\ndplyr::count(y, month, label) |>\n print(n = 24)\n\n# A tibble: 24 × 3\n# Groups: month [12]\n month label n\n <fct> <dbl> <int>\n 1 Jan 0 36\n 2 Jan 1 21\n 3 Feb 0 15\n 4 Feb 1 7\n 5 Mar 0 46\n 6 Mar 1 23\n 7 Apr 0 259\n 8 Apr 1 169\n 9 May 0 182\n10 May 1 119\n11 Jun 0 73\n12 Jun 1 53\n13 Jul 0 76\n14 Jul 1 48\n15 Aug 0 46\n16 Aug 1 39\n17 Sep 0 48\n18 Sep 1 21\n19 Oct 0 102\n20 Oct 1 79\n21 Nov 0 27\n22 Nov 1 19\n23 Dec 0 15\n24 Dec 1 14\n\n\nNow how about one ROC plot for each month? Yikes! This requires a iterative approach, using group_map(), to compute the ROC for each month. We then follow with plot wrapping by the patchwork package.\n\nrocs = dplyr::group_map(y, \n function(tbl, key){\n maxnetic::plot_ROC(tbl, title = sprintf(\"%s, n = %i\", key$month, nrow(tbl)), \n xlab = \"\", ylab = \"\")\n })\n\npatchwork::wrap_plots(rocs, ncol = 4)\n\n\n\n\nHmmm. That’s surprising, yes? Why during the summer months does our AUC go down. In fact, at times we are predicting the likelihood of not having an observation reported. It’s hard to know what to think, but consider that we are using a model generated across all months of multiple years and it might not predict a particular month and year very well. A step toward refinement, our next step is to make 12 models, one for each month." + "text": "6 Make a prediction\nNow we can make predictions with our basic model. We’ll do it two ways. First by simply feeding the input data used to create the model into the prediction. 
This might seems a bit circular, but it is perfectly reasonable to see how the model does on already labeled data. Second we’ll make a prediction for each month in 2020 using raster data.\n\n6.1 Predict with a data frame\nHere we provide a data frame, in our case the original input data, to the predict() function with type cloglog which transform the response value into the 0-1 range.\n\nprediction = predict(model, input_table, type = 'cloglog')\nhist(prediction, xlab = \"prediction\", main = \"Basic Model\")\n\n\n\n\n\n6.1.1 How did it do?\nWe can use some utilities in the maxnetic package to help us assess the model. The pAUC() function will compute statistics, include a presence-only AUC value. We need to pass it two items - the universe of predictions and the predictions for just the presence points.\n\nix = input_vector > 0\npauc = maxnetic::pAUC(prediction, prediction[ix])\nplot(pauc, title = \"v1.0 Basic Model\")\n\n\n\n\nOverall, this is telling us that the model isn’t especially strong as a prediction tool, but it is much better than a 50-50 guess (that’s when AUC is close to 0.5, and the curve follows the light grey line). Learn more about ROC and AUC here.\n\n\n\n6.2 Predict with rasters\nWe can also predict using raster inputs using our basic model. Let’s read in rasters for each month of 2019, and then run a prediction for each month.\nWe provide a function read_predictors() that will read and bind the rasters together for you given the filtered databases and paths. So, first we define the paths and filter the databases to point to just the months in 2019.\n\ndates = as.Date(c(\"2019-01-01\", \"2019-12-31\"))\n\nsst_path = \"data/oisst\"\nsst_db = oisster::read_database(sst_path) |>\n dplyr::arrange(date) |>\n dplyr::filter(dplyr::between(date, dates[1], dates[2]))\n\nwind_path = \"data/nbs\"\nwind_db = nbs::read_database(wind_path) |>\n dplyr::arrange(date)|>\n dplyr::filter(dplyr::between(date, dates[1], dates[2]))\n\nu_wind_db = wind_db |>\n dplyr::filter(param == \"u_wind\")|>\n dplyr::filter(dplyr::between(date, dates[1], dates[2]))\n\nv_wind_db = wind_db |>\n dplyr::filter(param == \"v_wind\")|>\n dplyr::filter(dplyr::between(date, dates[1], dates[2]))\n\npredictors = read_predictors(sst_db = sst_db,\n u_wind_db = u_wind_db,\n v_wind_db = v_wind_db)\npredictors\n\nstars object with 3 dimensions and 3 attributes\nattribute(s):\n Min. 1st Qu. Median Mean 3rd Qu. Max. NA's\nsst -1.558928 12.528449 19.5220385 17.6005908 23.501083 29.216452 11352\nu_wind -2.692028 1.144244 2.7007004 2.7228278 4.115177 13.148120 7612\nv_wind -5.431324 -1.411349 -0.3202622 -0.1398384 1.106175 4.636874 7612\ndimension(s):\n from to offset delta refsys point values x/y\nx 1 74 -76.38 0.25 WGS 84 FALSE NULL [x]\ny 1 46 46.12 -0.25 WGS 84 FALSE NULL [y]\ntime 1 12 NA NA Date NA 2019-01-01,...,2019-12-01 \n\n\nYou can see that we have the rasters in one object of three attributes (sst, u_wind and v_wind) each with 12 layers (Jan 2019 - Dec 2019). Now we can run the prediction.\n\npred = predict(model, predictors, type = 'cloglog')\npred\n\nstars object with 3 dimensions and 1 attribute\nattribute(s):\n Min. 1st Qu. Median Mean 3rd Qu. Max. 
NA's\npred 0.0001196393 0.1200618 0.2675931 0.3033565 0.4398977 0.8816952 11487\ndimension(s):\n from to offset delta refsys point values x/y\nx 1 74 -76.38 0.25 WGS 84 FALSE NULL [x]\ny 1 46 46.12 -0.25 WGS 84 FALSE NULL [y]\ntime 1 12 NA NA Date NA 2019-01-01,...,2019-12-01 \n\n\nSince we get a spatially mapped prediction back, we can plot it.\n\ncoast = rnaturalearth::ne_coastline(scale = 'large', returnclass = 'sf') |>\n sf::st_crop(pred)\n\nWarning: attribute variables are assumed to be spatially constant throughout\nall geometries\n\nplot_coast = function() {\n plot(sf::st_geometry(coast), col = 'green', add = TRUE)\n}\nplot(pred, hook = plot_coast)\n\n\n\n\nWell, that certainly looks appealing with higher likelihood of near shore observations occurring during the warmer months.\n\n6.2.1 How did it do?\nTo compute an ROC and AUC for each month, we have a little bit of work to do. We need to extract the observations locations for each month from the prediction maps. These we can then plot.\n\n\n\n\n\n\nNote\n\n\n\nWe have to modify the date for each point to be the first date of each month. That’s because our predictors are monthlies.\n\n\n\ntest_obs = obs |>\n dplyr::filter(dplyr::between(date, dates[1], dates[2])) |>\n dplyr::select(dplyr::all_of(\"date\")) |>\n dplyr::mutate(date = oisster::current_month(date))\n\nx = stars::st_extract(pred, test_obs, time_column = 'date') |>\n print()\n\nSimple feature collection with 612 features and 3 fields\nGeometry type: POINT\nDimension: XY\nBounding box: xmin: -75.7589 ymin: 35.1211 xmax: -65.48274 ymax: 43.83954\nGeodetic CRS: WGS 84\nFirst 10 features:\n pred time date geometry\n1 0.2759255 2019-05-01 2019-05-01 POINT (-67.32935 40.42509)\n2 0.7245142 2019-03-01 2019-03-01 POINT (-74.41057 36.49908)\n3 0.6664676 2019-12-01 2019-12-01 POINT (-75.3994 35.9457)\n4 0.4536477 2019-06-01 2019-06-01 POINT (-75.10864 36.94806)\n5 0.6864945 2019-04-01 2019-04-01 POINT (-74.49892 36.57275)\n6 0.3105710 2019-09-01 2019-09-01 POINT (-75.5519 36.1854)\n7 0.3874695 2019-09-01 2019-09-01 POINT (-73.6245 40.3317)\n8 0.3785449 2019-04-01 2019-04-01 POINT (-69.04389 39.82132)\n9 0.7447747 2019-04-01 2019-04-01 POINT (-74.59436 36.87291)\n10 0.7447747 2019-04-01 2019-04-01 POINT (-74.45753 36.72279)\n\n\nFinally we can build a table that merges the prediction with the labels. 
We are going to add the name of the month to group by that.\n\ny = x |>\n dplyr::mutate(month = factor(format(date, \"%b\"), levels = month.abb), \n .before = 1) |>\n dplyr::select(dplyr::all_of(c(\"month\", \"pred\", \"date\"))) |>\n dplyr::group_by(month) \n\ndplyr::count(y, month) |>\n print(n = 12)\n\nSimple feature collection with 12 features and 2 fields\nGeometry type: MULTIPOINT\nDimension: XY\nBounding box: xmin: -75.7589 ymin: 35.1211 xmax: -65.48274 ymax: 43.83954\nGeodetic CRS: WGS 84\n# A tibble: 12 × 3\n month n geometry\n * <fct> <int> <MULTIPOINT [°]>\n 1 Jan 21 ((-74.63902 36.26849), (-75.01758 36.49984), (-75.01801 36.72554…\n 2 Feb 7 ((-74.52432 37.24967), (-74.45561 37.16891), (-74.74373 36.72355…\n 3 Mar 23 ((-74.53117 36.26996), (-74.60195 36.72201), (-74.67127 36.72266…\n 4 Apr 169 ((-72.924 38.6733), (-73.0165 38.591), (-73.0036 38.56), (-73.10…\n 5 May 119 ((-74.56571 35.6059), (-75.2181 35.1934), (-75.3228 35.535), (-7…\n 6 Jun 53 ((-73.10608 38.72575), (-74.86204 36.27105), (-75.04656 36.34824…\n 7 Jul 48 ((-74.53554 36.19828), (-74.91756 36.27104), (-75.10905 36.27065…\n 8 Aug 39 ((-72.78628 38.68677), (-72.98868 38.61241), (-74.9889 36.2911),…\n 9 Sep 21 ((-75.3167 36.0439), (-75.5204 36.3294), (-75.5519 36.1854), (-7…\n10 Oct 79 ((-67.06445 42.91399), (-68.43614 43.83954), (-69.14391 43.16967…\n11 Nov 19 ((-72.52681 39.21286), (-71.54966 39.99385), (-67.79606 40.36107…\n12 Dec 14 ((-75.242 35.2705), (-75.3335 35.3027), (-75.436 35.1211), (-75.…\n\n\nNow how about one ROC plot for each month? Yikes! This requires a iterative approach, using group_map(), to compute the ROC for each month. We then follow with plot wrapping by the patchwork package.\n\npaucs = dplyr::group_map(y, \n function(tbl, key, pred_rasters = NULL){\n ix = key$month %in% month.abb\n x = dplyr::slice(pred_rasters, \"time\", ix)\n pauc = maxnetic::pAUC(x, tbl)\n plot(pauc,title = key$month, \n xlab = \"\", ylab = \"\")\n }, pred_rasters = pred)\n\npatchwork::wrap_plots(paucs, ncol = 4)\n\n\n\n\nHmmm. That’s surprising, yes? Why during the summer months does our AUC go down when we have the most number of observations? That seems counter intuitive." + }, + { + "objectID": "modeling-01.html#thinking-about-auc", + "href": "modeling-01.html#thinking-about-auc", + "title": "Basic modeling", + "section": "7 Thinking about AUC", + "text": "7 Thinking about AUC\nAUC is a diagnostic that provides a peek into the predictive power of a model. But what is it? An analogy is fitting a straight line to a small set of observations verses a large set of observations and then comparing the correlation coefficients. Here’s a simple example using R’s built-in dataset cars which is a data frame of 50 observations of speed and stopping distances of cars. We’ll compute a linear model for the entire data set, and then a second for a small subsample of the data. (Learn more about linear models in R here.)\n\ndata(\"cars\")\ncars = dplyr::as_tibble(cars)\n\nall_fit = lm(dist ~ speed, data = cars)\nsummary(all_fit)\n\n\nCall:\nlm(formula = dist ~ speed, data = cars)\n\nResiduals:\n Min 1Q Median 3Q Max \n-29.069 -9.525 -2.272 9.215 43.201 \n\nCoefficients:\n Estimate Std. Error t value Pr(>|t|) \n(Intercept) -17.5791 6.7584 -2.601 0.0123 * \nspeed 3.9324 0.4155 9.464 1.49e-12 ***\n---\nSignif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 
0.1 ' ' 1\n\nResidual standard error: 15.38 on 48 degrees of freedom\nMultiple R-squared: 0.6511, Adjusted R-squared: 0.6438 \nF-statistic: 89.57 on 1 and 48 DF, p-value: 1.49e-12\n\n\n\nset.seed(5)\nsub_cars = dplyr::slice_sample(cars, n = 3)\nsub_fit = lm(dist ~ speed, data = sub_cars)\nsummary(sub_fit)\n\n\nCall:\nlm(formula = dist ~ speed, data = sub_cars)\n\nResiduals:\n 1 2 3 \n 3 3 -6 \n\nCoefficients:\n Estimate Std. Error t value Pr(>|t|)\n(Intercept) -6.5000 8.8741 -0.732 0.598\nspeed 3.3750 0.6495 5.196 0.121\n\nResidual standard error: 7.348 on 1 degrees of freedom\nMultiple R-squared: 0.9643, Adjusted R-squared: 0.9286 \nF-statistic: 27 on 1 and 1 DF, p-value: 0.121\n\n\nYou can see that the rU+00B2 value is quite high for the smaller data set, but the model may not be predictive over the full range of data. AUC is somewhat analogous to to rU+00B2 in that a relatively low score does not necessarily suggest a poor model.\n\nggplot2::ggplot(data = cars, ggplot2::aes(x = speed, y = dist)) +\n ggplot2::geom_point(color = \"blue\") +\n ggplot2::geom_abline(slope = coef(all_fit)[2], intercept = coef(all_fit)[1], color = \"blue\") + \n ggplot2::geom_point(data = sub_cars, ggplot2::aes(x = speed, y = dist), color = \"orange\") +\n ggplot2::geom_abline(slope = coef(sub_fit)[2], intercept = coef(sub_fit)[1], color = \"orange\")" }, { "objectID": "predictors.html", @@ -193,13 +200,13 @@ "href": "modeling-02.html#do-we-model-every-month", "title": "Modeling each month", "section": "2 Do we model every month?", - "text": "2 Do we model every month?\nLet’s do a quick check by counting each by month. Note that we drop the spatial info so that we can make simply tallies.\n\ncounts = sf::st_drop_geometry(obs) |> \n dplyr::count(month, name = \"n_obs\") |>\n dplyr::left_join(sf::st_drop_geometry(bkg) |> dplyr::count(month, name = \"n_bkg\"), \n by = 'month') |>\n print(n = 12)\n\n# A tibble: 12 × 3\n month n_obs n_bkg\n <fct> <int> <int>\n 1 Jan 33 51\n 2 Feb 40 57\n 3 Mar 50 79\n 4 Apr 341 528\n 5 May 541 943\n 6 Jun 2137 3471\n 7 Jul 2108 3233\n 8 Aug 1698 2597\n 9 Sep 725 1205\n10 Oct 328 485\n11 Nov 494 739\n12 Dec 66 90\n\n\nSo the colder months have fewer observations than the warmer months. 
We already knew that, but it will be interesting to see how that manifests itself in the models.\n\n2.1 Build the monthly models\n\n# A function for making one month's model\n#\n# @param tbl a data frame of one month's observations\n# @param key a data frame that holds the current iteration's month name\n# @param bkg a complete data frame of background data (which we filter for the given month)\n# @param path the path where the model is saved\n# @return a model, which is also saved in \"data/model/v2/v2.<monthname>\"\nmodel_month = function(tbl, key, bkg = NULL, path = \".\"){\n \n bkg = bkg |>\n dplyr::filter(month == key$month) |>\n sf::st_drop_geometry() |>\n dplyr::select(dplyr::all_of(c(\"sst\", \"u_wind\", \"v_wind\"))) |>\n na.omit()\n \n obs = tbl |>\n sf::st_drop_geometry() |>\n dplyr::select(dplyr::all_of(c(\"sst\", \"u_wind\", \"v_wind\"))) |>\n na.omit()\n \n # these are the predictor variables row bound\n x = dplyr::bind_rows(obs, bkg)\n \n # and the flag indicating presence/background\n flag = c(rep(1, nrow(obs)), rep(0, nrow(bkg)))\n \n model_path = file.path(path, paste0(\"v2.\", key$month, \".rds\"))\n\n model = maxnet::maxnet(flag, x) |>\n maxnetic::write_maxnet(model_path)\n \n model\n}\n\npath = file.path(\"data\", \"model\", \"v2\")\nok = dir.create(path, recursive = TRUE, showWarnings = FALSE)\nmodels = obs |>\n dplyr::group_by(month) |>\n dplyr::group_map(model_month, bkg = bkg, path = path) |>\n rlang::set_names(levels(obs$month))" + "text": "2 Do we model every month?\nLet’s do a quick check by counting each by month. Note that we drop the spatial info so that we can make simply tallies.\n\ncounts = sf::st_drop_geometry(obs) |> \n dplyr::count(month, name = \"n_obs\") |>\n dplyr::left_join(sf::st_drop_geometry(bkg) |> dplyr::count(month, name = \"n_bkg\"), \n by = 'month') |>\n print(n = 12)\n\n# A tibble: 12 × 3\n month n_obs n_bkg\n <fct> <int> <int>\n 1 Jan 33 51\n 2 Feb 40 57\n 3 Mar 50 79\n 4 Apr 341 528\n 5 May 541 943\n 6 Jun 2137 3471\n 7 Jul 2108 3233\n 8 Aug 1698 2597\n 9 Sep 725 1205\n10 Oct 328 485\n11 Nov 494 739\n12 Dec 66 90\n\n\nSo the colder months have fewer observations than the warmer months. 
We already knew that, but it will be interesting to see how that manifests itself in the models.\n\n2.1 Build the monthly models\nSince we are building 12 models (rather than one) it is useful to create a function that computes a model for any month, and then iterate through the months of the year.\n\n# A function for making one month's model\n#\n# @param tbl a data frame of one month's observations\n# @param key a data frame that holds the current iteration's month name\n# @param bkg a complete data frame of background data (which we filter for the given month)\n# @param path the path where the model is saved\n# @return a model, which is also saved in \"data/model/v2/v2.<monthname>\"\nmodel_month = function(tbl, key, bkg = NULL, path = \".\"){\n \n bkg = bkg |>\n dplyr::filter(month == key$month) |>\n sf::st_drop_geometry() |>\n dplyr::select(dplyr::all_of(c(\"sst\", \"u_wind\", \"v_wind\"))) |>\n na.omit()\n \n obs = tbl |>\n sf::st_drop_geometry() |>\n dplyr::select(dplyr::all_of(c(\"sst\", \"u_wind\", \"v_wind\"))) |>\n na.omit()\n \n # these are the predictor variables row bound\n x = dplyr::bind_rows(obs, bkg)\n \n # and the flag indicating presence/background\n flag = c(rep(1, nrow(obs)), rep(0, nrow(bkg)))\n \n model_path = file.path(path, paste0(\"v2.\", key$month, \".rds\"))\n\n model = maxnet::maxnet(flag, x) |>\n maxnetic::write_maxnet(model_path)\n \n model\n}\n\npath = file.path(\"data\", \"model\", \"v2\")\nok = dir.create(path, recursive = TRUE, showWarnings = FALSE)\nmodels = obs |>\n dplyr::group_by(month) |>\n dplyr::group_map(model_month, bkg = bkg, path = path) |>\n rlang::set_names(levels(obs$month))\n\nWe can look at the response plots for every month, but for demonstration purposes, we’ll just show one month.\n\nplot(models[['Jun']], type = 'cloglog')" }, { "objectID": "modeling-02.html#predict-with-rasters", "href": "modeling-02.html#predict-with-rasters", "title": "Modeling each month", "section": "3 Predict with rasters", - "text": "3 Predict with rasters\nFirst we load the raster databases as these are lightweight to pass into a function that iterates through the months.\n\n3.1 Load the raster databases (sst and u_wind and v_wind)\nWe also make sure they are in date order and add a “month” variable to each.\n\nsst_path = \"data/oisst\"\nsst_db = oisster::read_database(sst_path) |>\n dplyr::arrange(date) |>\n dplyr::mutate(month = format(date, \"%b\"))\n \n\nwind_path = \"data/nbs\"\nwind_db = nbs::read_database(wind_path) |>\n dplyr::arrange(date)|>\n dplyr::mutate(month = format(date, \"%b\"))\n\nu_wind_db = wind_db |>\n dplyr::filter(param == \"u_wind\")\n\nv_wind_db = wind_db |>\n dplyr::filter(param == \"v_wind\")\n\n\n\n3.2 Iterate through the months making predictions\nNow we can build an iterator function that will make a prediction for each month. 
Let’s narrow our predictions to just those for a particular year, 2019, and read the rasters in all at once.\n\ndates = as.Date(c(\"2019-01-01\", \"2019-12-31\"))\nx = read_predictors(\n sst_db = dplyr::filter(sst_db, dplyr::between(date, dates[1], dates[2])),\n u_wind_db = dplyr::filter(u_wind_db, dplyr::between(date, dates[1], dates[2])),\n v_wind_db = dplyr::filter(v_wind_db, dplyr::between(date, dates[1], dates[2]))\n)\n\nNow we can iterate through the months.\n\ndate_sequence = seq(from = dates[1], to = dates[2], by = \"month\")\npred_rasters = lapply(names(models),\n function(mon){\n ix = which(month.abb %in% mon)\n predict(models[[mon]], dplyr::slice(x, time, ix, drop), type = \"cloglog\")\n }) \npred_rasters = do.call(c, append(pred_rasters, list(along = list(time = date_sequence))))\n\nLet’s plot them.\n\ncoast = rnaturalearth::ne_coastline(scale = 'large', returnclass = 'sf') |>\n sf::st_geometry() |>\n sf::st_crop(pred_rasters)\n\nplot_coast = function() {\n plot(coast, col = 'green', add = TRUE)\n}\nplot(pred_rasters |> st_to_180(), hook = plot_coast)\n\n\n\n\nLet’s see what we can discern from the predict abilities. We can extract the predicted values at the observed locations.\n\npred_obs = stars::st_extract(pred_rasters, \n dplyr::filter(obs, dplyr::between(date, dates[1], dates[2])),\n time_column = \"month_id\") |>\n sf::st_drop_geometry() \npred_bkg = stars::st_extract(pred_rasters, \n dplyr::filter(bkg, dplyr::between(date, dates[1], dates[2])),\n time_column = \"month_id\") |>\n sf::st_drop_geometry() \n\npreds = dplyr::bind_rows(pred_obs, pred_bkg) |>\n dplyr::mutate(label = c(rep(1, nrow(pred_obs)), rep(0, nrow(pred_bkg))), .before = 1) |>\n dplyr::mutate(month = factor(format(time, \"%b\"), levels = month.abb)) |>\n dplyr::group_by(month)\n\n\naucs = dplyr::group_map(preds,\n function(x, y) {\n dplyr::tibble(month = y$month, auc = maxnetic::AUC(x))\n }) |>\n dplyr::bind_rows() |>\n dplyr::right_join(counts, by = \"month\") |>\n print(n=12)\n\n# A tibble: 12 × 4\n month auc n_obs n_bkg\n <fct> <dbl> <int> <int>\n 1 Jan 0.987 33 51\n 2 Feb 0.876 40 57\n 3 Mar 0.957 50 79\n 4 Apr 0.897 341 528\n 5 May 0.888 541 943\n 6 Jun 0.547 2137 3471\n 7 Jul 0.376 2108 3233\n 8 Aug 0.588 1698 2597\n 9 Sep 0.742 725 1205\n10 Oct 0.797 328 485\n11 Nov 0.873 494 739\n12 Dec 0.995 66 90\n\n\nOK, that’s unexpected. The months with the lower counts of observations have relatively higher AUCs. Huh? Let’s look at that graphically.\n\naucs_long = tidyr::pivot_longer(aucs, dplyr::all_of(c(\"n_obs\", \"n_bkg\")),\n names_to = \"type\", values_to = \"count\") |>\n dplyr::mutate(type = dplyr::recode(type, n_obs = \"obs\", n_bkg = \"bkg\"))\n\nggplot2::ggplot(data = aucs_long, aes(x = count, y = auc, color = type)) +\n ggplot2::geom_point() + \n ggplot2::geom_smooth(method='lm', formula= y~x)\n\n\n\n\nSurprised? Could this be overfitting resulting from sampling background in time weighted to the months when we have observations? Hmmmm." 
+ "text": "3 Predict with rasters\nFirst we load the raster databases as these are lightweight to pass into a function that iterates through the months.\n\n3.1 Load the raster databases (sst and u_wind and v_wind)\nWe also make sure they are in date order and add a “month” variable to each.\n\nsst_path = \"data/oisst\"\nsst_db = oisster::read_database(sst_path) |>\n dplyr::arrange(date) |>\n dplyr::mutate(month = format(date, \"%b\"))\n \n\nwind_path = \"data/nbs\"\nwind_db = nbs::read_database(wind_path) |>\n dplyr::arrange(date)|>\n dplyr::mutate(month = format(date, \"%b\"))\n\nu_wind_db = wind_db |>\n dplyr::filter(param == \"u_wind\")\n\nv_wind_db = wind_db |>\n dplyr::filter(param == \"v_wind\")\n\n\n\n3.2 Iterate through the months making predictions\nNow we can build an iterator function that will make a prediction for each month. Let’s narrow our predictions to just those for a particular year, 2019, and read the rasters in all at once.\n\ndates = as.Date(c(\"2019-01-01\", \"2019-12-31\"))\nx = read_predictors(\n sst_db = dplyr::filter(sst_db, dplyr::between(date, dates[1], dates[2])),\n u_wind_db = dplyr::filter(u_wind_db, dplyr::between(date, dates[1], dates[2])),\n v_wind_db = dplyr::filter(v_wind_db, dplyr::between(date, dates[1], dates[2]))\n)\n\nNow we can iterate through the months.\n\ndate_sequence = seq(from = dates[1], to = dates[2], by = \"month\")\npred_rasters = lapply(names(models),\n function(mon){\n ix = which(month.abb %in% mon)\n predict(models[[mon]], dplyr::slice(x, time, ix, drop), type = \"cloglog\")\n }) \npred_rasters = do.call(c, append(pred_rasters, list(along = list(time = date_sequence))))\n\nLet’s plot them.\n\ncoast = rnaturalearth::ne_coastline(scale = 'large', returnclass = 'sf') |>\n sf::st_geometry() |>\n sf::st_crop(pred_rasters)\n\nplot_coast = function() {\n plot(coast, col = 'green', add = TRUE)\n}\nplot(pred_rasters, hook = plot_coast)\n\n\n\n\nLet’s see what we can discern from the predict abilities. We can extract the predicted values at the observed locations. Having those in hand allows us to compute pAUC for each month.\n\npred_obs = stars::st_extract(pred_rasters, \n dplyr::filter(obs, dplyr::between(date, dates[1], dates[2])),\n time_column = \"month_id\") |>\n dplyr::mutate(month = factor(format(month_id, \"%b\"), levels = month.abb)) |>\n dplyr::group_by(month)\n\npaucs = dplyr::group_map(pred_obs,\n function(x, y) {\n ix = month.abb %in% y$month\n s = dplyr::slice(pred_rasters, \"time\", ix)\n pauc = maxnetic::pAUC(s,x)\n dplyr::tibble(month = y$month, \n auc = pauc$area,\n pauc = list(pauc))\n })|>\n dplyr::bind_rows() |>\n print(n = 12)\n\n# A tibble: 12 × 3\n month auc pauc \n <fct> <dbl> <list> \n 1 Jan 0.703 <pAUC [3]>\n 2 Feb 0.689 <pAUC [3]>\n 3 Mar 0.698 <pAUC [3]>\n 4 Apr 0.677 <pAUC [3]>\n 5 May 0.654 <pAUC [3]>\n 6 Jun 0.662 <pAUC [3]>\n 7 Jul 0.665 <pAUC [3]>\n 8 Aug 0.696 <pAUC [3]>\n 9 Sep 0.663 <pAUC [3]>\n10 Oct 0.633 <pAUC [3]>\n11 Nov 0.627 <pAUC [3]>\n12 Dec 0.665 <pAUC [3]>\n\n\nNote that last element, pauc, is the result returned by the maxnetic::pAUC() function which we can plot.\n\npp = paucs |>\n dplyr::group_by(month) |>\n dplyr::group_map(\n function(tbl, key){\n plot(tbl$pauc[[1]], title = key$month, xlab = \"\", ylab = \"\")\n }\n )\npatchwork::wrap_plots(pp, ncol = 4)\n\n\n\n\nWell, it would be easy to become dispirited by this result. It would be reasonable to expect AUC values to improve if we built monthly models rather than a single model applied to any month. But it seems to not be so. Darn!" 
} ] \ No newline at end of file diff --git a/functions/stars.R b/functions/stars.R index e224403..b8f13fa 100644 --- a/functions/stars.R +++ b/functions/stars.R @@ -46,5 +46,5 @@ read_predictors = function( } else { x = do.call(c, append(xx, list(along = NA_integer_))) } - x + st_to_180(x) } \ No newline at end of file diff --git a/modeling-01.qmd b/modeling-01.qmd index 2975284..aa96d43 100644 --- a/modeling-01.qmd +++ b/modeling-01.qmd @@ -1,6 +1,6 @@ --- title: "Basic modeling" -cache: true +cache: false --- So at this point we have point data for observation and background that have been joined with common environmental covariates (aka predictors). Here we show the basic steps taken to prepare, build and assess a model. Later, we'll try more sophisticated modeling, such as modeling by month or splitting the data into training-testing groups. @@ -116,18 +116,21 @@ hist(prediction, xlab = "prediction", main = "Basic Model") #### How did it do? -We can use some utilities in the [maxnetic](https://github.com/BigelowLab/maxnetic) package to help us assess the model. First, we need to create a table with two columns: `label` and `pred`. Label is the simple a vector of 0/1 indicating that the predicted value is known to be either background or presence. We already have that in our `input_vector`. Pred is simple the 0-1 scale predicted value. Once we have that we can craft a [receiver operator characteristic curve](https://en.wikipedia.org/wiki/Receiver_operating_characteristic) and compute it's [AUC](https://en.wikipedia.org/wiki/Receiver_operating_characteristic#Area_under_the_curve). +We can use some utilities in the [maxnetic](https://github.com/BigelowLab/maxnetic) package to help us assess the model. The `pAUC()` function will compute statistics, including a presence-only AUC value. We need to pass it two items: the universe of predictions and the predictions for just the presence points. ```{r} -x = dplyr::tibble(label = input_vector, pred = as.vector(prediction)) -plot_ROC(x, title = "v1.0 Basic Model") +ix = input_vector > 0 +pauc = maxnetic::pAUC(prediction, prediction[ix]) +plot(pauc, title = "v1.0 Basic Model") ``` Overall, this is telling us that the model isn't especially strong as a prediction tool, but it is much better than a 50-50 guess (that's when AUC is close to 0.5, and the curve follows the light grey line). Learn more about ROC and AUC [here](https://rviews.rstudio.com/2019/01/17/roc-curves/). ### Predict with rasters -We can also predict using raster inputs using our basic model. Let's read in rasters for each month of 2018, and then run a prediction for each month. +We can also predict with raster inputs using our basic model. Let's read in rasters for each month of 2019, and then run a prediction for each month. + +We provide a function `read_predictors()` that will read and bind the rasters together for you given the filtered databases and paths. So, first we define the paths and filter the databases to point to just the months in 2019.
```{r} dates = as.Date(c("2019-01-01", "2019-12-31")) @@ -136,14 +139,6 @@ sst_path = "data/oisst" sst_db = oisster::read_database(sst_path) |> dplyr::arrange(date) |> dplyr::filter(dplyr::between(date, dates[1], dates[2])) - - -sst = sst_db |> - oisster::compose_filename(path = sst_path) |> - stars::read_stars(along = list(time = sst_db$date)) |> - rlang::set_names("sst")|> - st_to_180() - wind_path = "data/nbs" wind_db = nbs::read_database(wind_path) |> @@ -153,33 +148,18 @@ wind_db = nbs::read_database(wind_path) |> u_wind_db = wind_db |> dplyr::filter(param == "u_wind")|> dplyr::filter(dplyr::between(date, dates[1], dates[2])) -u_wind = u_wind_db |> - nbs::compose_filename(path = wind_path) |> - stars::read_stars(along = list(time = u_wind_db$date)) |> - rlang::set_names("u_wind") |> - st_to_180() v_wind_db = wind_db |> dplyr::filter(param == "v_wind")|> dplyr::filter(dplyr::between(date, dates[1], dates[2])) -v_wind = v_wind_db |> - nbs::compose_filename(path = wind_path) |> - stars::read_stars(along = list(time = v_wind_db$date)) |> - rlang::set_names("v_wind") |> - st_to_180() -``` - - -Once we have them in hand we need to bind them together. But we need to attend to common but important issue. The `sst` rasters and `windspeed` rasters have different extents. We can't bind them together until we warp one set to match the other. Let's warp `sst` to match `u_wind`. And then we can bind them together. -```{r} -sst_warped = stars::st_warp(sst, u_wind) -x = list(sst_warped, u_wind, v_wind) -predictors = do.call(c, append(x, list(along = NA_integer_))) +predictors = read_predictors(sst_db = sst_db, + u_wind_db = u_wind_db, + v_wind_db = v_wind_db) predictors ``` -Now we can run the prediction. +You can see that we have the rasters in one object of three attributes (`sst`, `u_wind` and `v_wind`) each with 12 layers (Jan 2019 - Dec 2019). Now we can run the prediction. ```{r} pred = predict(model, predictors, type = 'cloglog') @@ -202,7 +182,7 @@ Well, that certainly looks appealing with higher likelihood of near shore observ #### How did it do? -To compute an ROC and AUC for each month, we have a little bit of work to do. We need to extract the observations and background for each month from the prediction maps. These we can then pass to the `plot_ROC()` function. +To compute an ROC and AUC for each month, we have a little bit of work to do. We need to extract the observation locations for each month from the prediction maps. These we can then plot. :::{.callout-note} We have to modify the date for each point to be the first date of each month. That's because our predictors are monthlies. @@ -214,14 +194,7 @@ test_obs = obs |> dplyr::select(dplyr::all_of("date")) |> dplyr::mutate(date = oisster::current_month(date)) -test_bkg = bkg |> - dplyr::filter(dplyr::between(date, dates[1], dates[2])) |> - dplyr::select(dplyr::all_of("date")) |> - dplyr::mutate(date = oisster::current_month(date)) - -test_input = dplyr::bind_rows(test_obs, test_bkg) - -x = stars::st_extract(pred, test_input, time_column = 'date') |> +x = stars::st_extract(pred, test_obs, time_column = 'date') |> print() ``` @@ -229,30 +202,63 @@ Finally we can build a table that merges the prediction with the labels.
We are ```{r} y = x |> - dplyr::mutate(label = c(rep(1, nrow(test_obs)), rep(0, nrow(test_bkg))), - month = factor(format(date, "%b"), levels = month.abb), - .before = 2) |> - sf::st_drop_geometry() |> - dplyr::select(dplyr::all_of(c("month", "label", "pred"))) |> + dplyr::mutate(month = factor(format(date, "%b"), levels = month.abb), + .before = 1) |> + dplyr::select(dplyr::all_of(c("month", "pred", "date"))) |> dplyr::group_by(month) -dplyr::count(y, month, label) |> - print(n = 24) +dplyr::count(y, month) |> + print(n = 12) ``` Now how about one ROC plot for each month? Yikes! This requires an iterative approach, using `group_map()`, to compute the ROC for each month. We then follow with plot wrapping by the [patchwork](https://patchwork.data-imaginist.com/articles/guides/assembly.html#functional-assembly) package. ```{r} #| width: "100%" -rocs = dplyr::group_map(y, - function(tbl, key){ - maxnetic::plot_ROC(tbl, title = sprintf("%s, n = %i", key$month, nrow(tbl)), - xlab = "", ylab = "") - }) +paucs = dplyr::group_map(y, + function(tbl, key, pred_rasters = NULL){ + ix = month.abb %in% key$month + x = dplyr::slice(pred_rasters, "time", ix) + pauc = maxnetic::pAUC(x, tbl) + plot(pauc, title = key$month, + xlab = "", ylab = "") + }, pred_rasters = pred) + +patchwork::wrap_plots(paucs, ncol = 4) ``` -patchwork::wrap_plots(rocs, ncol = 4) +Hmmm. That's surprising, yes? Why during the summer months does our AUC go down when we have the greatest number of observations? That seems counterintuitive. + +## Thinking about AUC + +AUC is a diagnostic that provides a peek into the predictive power of a model. But what is it? An analogy is fitting a straight line to a small set of observations versus a large set of observations and then comparing the correlation coefficients. Here's a simple example using R's built-in dataset `cars`, which is a data frame of 50 observations of speed and stopping distances of cars. We'll compute a linear model for the entire data set, and then a second for a small subsample of the data. (Learn more about linear models in R [here](https://rseek.org/?q=linear+models).) + +```{r} +data("cars") +cars = dplyr::as_tibble(cars) + +all_fit = lm(dist ~ speed, data = cars) +summary(all_fit) +``` + +```{r} +set.seed(5) +sub_cars = dplyr::slice_sample(cars, n = 3) +sub_fit = lm(dist ~ speed, data = sub_cars) +summary(sub_fit) ``` -Hmmm. That's surprising, yes? Why during the summer months does our AUC go down. In fact, at times we are predicting the likelihood of **not** having an observation reported. It's hard to know what to think, but consider that we are using a model generated across all months of multiple years and it might not predict a particular month and year very well. A step toward refinement, our next step is to make 12 models, one for each month. +You can see that the `r²` value is quite high for the smaller data set, but the model may not be predictive over the full range of data. AUC is somewhat analogous to `r²` in that a relatively low score does not necessarily suggest a poor model.
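+To see why a tiny sample can look deceptively good, here is a minimal sketch, assuming nothing beyond base R and the `cars` data from above: refit the 3-point model on many random subsamples and watch how much r² moves around.
+
+```{r}
+# refit lm() on 1000 random 3-point subsamples of cars and record r-squared;
+# tiny samples yield wildly variable, often inflated, goodness-of-fit
+set.seed(5)
+r2 = replicate(1000, {
+  i = sample(nrow(cars), 3)
+  summary(lm(dist ~ speed, data = cars[i, ]))$r.squared
+})
+summary(r2)
+hist(r2, xlab = "r-squared", main = "r-squared of 3-point fits")
+```
+
+The two fits are compared graphically below.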
+ +```{r} +ggplot2::ggplot(data = cars, ggplot2::aes(x = speed, y = dist)) + + ggplot2::geom_point(color = "blue") + + ggplot2::geom_abline(slope = coef(all_fit)[2], intercept = coef(all_fit)[1], color = "blue") + + ggplot2::geom_point(data = sub_cars, ggplot2::aes(x = speed, y = dist), color = "orange") + + ggplot2::geom_abline(slope = coef(sub_fit)[2], intercept = coef(sub_fit)[1], color = "orange") +``` + + + diff --git a/modeling-01_files/figure-html/unnamed-chunk-11-1.png b/modeling-01_files/figure-html/unnamed-chunk-11-1.png new file mode 100644 index 0000000..b605749 Binary files /dev/null and b/modeling-01_files/figure-html/unnamed-chunk-11-1.png differ diff --git a/modeling-01_files/figure-html/unnamed-chunk-14-1.png b/modeling-01_files/figure-html/unnamed-chunk-14-1.png new file mode 100644 index 0000000..6059e6c Binary files /dev/null and b/modeling-01_files/figure-html/unnamed-chunk-14-1.png differ diff --git a/modeling-01_files/figure-html/unnamed-chunk-17-1.png b/modeling-01_files/figure-html/unnamed-chunk-17-1.png new file mode 100644 index 0000000..33eebe3 Binary files /dev/null and b/modeling-01_files/figure-html/unnamed-chunk-17-1.png differ diff --git a/modeling-01_files/figure-html/unnamed-chunk-8-1.png b/modeling-01_files/figure-html/unnamed-chunk-8-1.png index 87068ba..1102229 100644 Binary files a/modeling-01_files/figure-html/unnamed-chunk-8-1.png and b/modeling-01_files/figure-html/unnamed-chunk-8-1.png differ diff --git a/modeling-02.qmd b/modeling-02.qmd index 5777100..2abc3ae 100644 --- a/modeling-02.qmd +++ b/modeling-02.qmd @@ -37,8 +37,9 @@ So the colder months have fewer observations than the warmer months. We already ### Build the monthly models -```{r} +Since we are building 12 models (rather than one) it is useful to create a function that computes a model for any month, and then iterate through the months of the year. +```{r} # A function for making one month's model # # @param tbl a data frame of one month's observations @@ -81,8 +82,13 @@ models = obs |> rlang::set_names(levels(obs$month)) ``` -## Predict with rasters +We can look at the response plots for every month, but for demonstration purposes, we'll just show one month. +```{r} +plot(models[['Jun']], type = 'cloglog') +``` + +## Predict with rasters First we load the raster databases as these are lightweight to pass into a function that iterates through the months. ### Load the raster databases (`sst` and `u_wind` and `v_wind`) @@ -142,10 +148,10 @@ coast = rnaturalearth::ne_coastline(scale = 'large', returnclass = 'sf') |> plot_coast = function() { plot(coast, col = 'green', add = TRUE) } -plot(pred_rasters |> st_to_180(), hook = plot_coast) +plot(pred_rasters, hook = plot_coast) ``` -Let's see what we can discern from the predict abilities. We can extract the predicted values at the observed locations. +Let's see what we can discern from the predictive abilities. We can extract the predicted values at the observed locations. Having those in hand allows us to compute pAUC for each month. @@ -153,36 +159,33 @@ Let's see what we can discern from the predict abilities. 
We can extract the pre pred_obs = stars::st_extract(pred_rasters, dplyr::filter(obs, dplyr::between(date, dates[1], dates[2])), time_column = "month_id") |> - sf::st_drop_geometry() -pred_bkg = stars::st_extract(pred_rasters, - dplyr::filter(bkg, dplyr::between(date, dates[1], dates[2])), - time_column = "month_id") |> - sf::st_drop_geometry() - -preds = dplyr::bind_rows(pred_obs, pred_bkg) |> - dplyr::mutate(label = c(rep(1, nrow(pred_obs)), rep(0, nrow(pred_bkg))), .before = 1) |> - dplyr::mutate(month = factor(format(time, "%b"), levels = month.abb)) |> + dplyr::mutate(month = factor(format(month_id, "%b"), levels = month.abb)) |> dplyr::group_by(month) - -aucs = dplyr::group_map(preds, +paucs = dplyr::group_map(pred_obs, function(x, y) { - dplyr::tibble(month = y$month, auc = maxnetic::AUC(x)) - }) |> + ix = month.abb %in% y$month + s = dplyr::slice(pred_rasters, "time", ix) + pauc = maxnetic::pAUC(s, x) + dplyr::tibble(month = y$month, + auc = pauc$area, + pauc = list(pauc)) + }) |> dplyr::bind_rows() |> - dplyr::right_join(counts, by = "month") |> - print(n=12) + print(n = 12) ``` -OK, that's unexpected. The months with the lower counts of observations have relatively higher AUCs. Huh? Let's look at that graphically. +Note that the last element, `pauc`, is the result returned by the `maxnetic::pAUC()` function, which we can plot. ```{r} -aucs_long = tidyr::pivot_longer(aucs, dplyr::all_of(c("n_obs", "n_bkg")), - names_to = "type", values_to = "count") |> - dplyr::mutate(type = dplyr::recode(type, n_obs = "obs", n_bkg = "bkg")) - -ggplot2::ggplot(data = aucs_long, aes(x = count, y = auc, color = type)) + - ggplot2::geom_point() + - ggplot2::geom_smooth(method='lm', formula= y~x) +pp = paucs |> + dplyr::group_by(month) |> + dplyr::group_map( + function(tbl, key){ + plot(tbl$pauc[[1]], title = key$month, xlab = "", ylab = "") + } + ) +patchwork::wrap_plots(pp, ncol = 4) ``` -Surprised? Could this be overfitting resulting from sampling background in time weighted to the months when we have observations? Hmmmm. + +Well, it would be easy to become dispirited by this result. It would be reasonable to expect AUC values to improve if we built monthly models rather than a single model applied to any month. But it seems not to be so. Darn!
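+It may help to recall just what that AUC number estimates: the probability that a randomly chosen presence point scores higher than a randomly chosen point from the rest of the predictions. Here is a minimal sketch of that rank-based identity using simulated labels and scores only; the `auc()` helper below is hypothetical, not a maxnetic function.
+
+```{r}
+# AUC via the Mann-Whitney rank identity:
+# P(score at a presence > score at a random background point)
+auc = function(label, pred) {
+  r = rank(pred)             # ranks of all predictions, ties averaged
+  n1 = sum(label == 1)       # number of presences
+  n0 = sum(label == 0)       # number of background points
+  (sum(r[label == 1]) - n1 * (n1 + 1) / 2) / (n1 * n0)
+}
+
+set.seed(1)
+lab = c(rep(1, 100), rep(0, 300))
+prd = c(rnorm(100, 0.6, 0.2), rnorm(300, 0.4, 0.2))  # overlapping scores
+auc(lab, prd)                # roughly 0.75: better than chance, far from perfect
+```
+
+Monthly values in the 0.6-0.7 range, as in the table above, sit in that same middle ground.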
\ No newline at end of file diff --git a/modeling-02_files/figure-html/unnamed-chunk-10-1.png b/modeling-02_files/figure-html/unnamed-chunk-10-1.png new file mode 100644 index 0000000..fcdcd3e Binary files /dev/null and b/modeling-02_files/figure-html/unnamed-chunk-10-1.png differ diff --git a/modeling-02_files/figure-html/unnamed-chunk-4-1.png b/modeling-02_files/figure-html/unnamed-chunk-4-1.png new file mode 100644 index 0000000..c1388b1 Binary files /dev/null and b/modeling-02_files/figure-html/unnamed-chunk-4-1.png differ diff --git a/modeling-02_files/figure-html/unnamed-chunk-7-1.png b/modeling-02_files/figure-html/unnamed-chunk-7-1.png index df52a54..6b8c08d 100644 Binary files a/modeling-02_files/figure-html/unnamed-chunk-7-1.png and b/modeling-02_files/figure-html/unnamed-chunk-7-1.png differ diff --git a/modeling-02_files/figure-html/unnamed-chunk-8-1.png b/modeling-02_files/figure-html/unnamed-chunk-8-1.png new file mode 100644 index 0000000..6b8c08d Binary files /dev/null and b/modeling-02_files/figure-html/unnamed-chunk-8-1.png differ diff --git a/modeling-02_files/figure-html/unnamed-chunk-9-1.png b/modeling-02_files/figure-html/unnamed-chunk-9-1.png index 03d16aa..fcdcd3e 100644 Binary files a/modeling-02_files/figure-html/unnamed-chunk-9-1.png and b/modeling-02_files/figure-html/unnamed-chunk-9-1.png differ
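One last aside on two idioms used repeatedly above: the `month.abb` selector and `dplyr::group_map()`. A self-contained toy sketch (the `toy` table is invented purely for illustration):

```{r}
# month.abb is base R's vector of month abbreviations; argument order matters:
"Jun" %in% month.abb    # length-1 TRUE - cannot select among 12 monthly layers
month.abb %in% "Jun"    # length-12 logical that picks out only the June slot

# group_map(): each group's rows arrive as `tbl`, the grouping value as `key`
set.seed(42)
toy = dplyr::tibble(
  month = factor(rep(month.abb[1:3], each = 4), levels = month.abb),
  pred = runif(12))
toy |>
  dplyr::group_by(month) |>
  dplyr::group_map(function(tbl, key) {
    dplyr::tibble(month = key$month, n = nrow(tbl), mean_pred = mean(tbl$pred))
  }) |>
  dplyr::bind_rows()
```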