diff --git a/_quarto.yml b/_quarto.yml index 6deded89..a88ebe7c 100644 --- a/_quarto.yml +++ b/_quarto.yml @@ -25,7 +25,7 @@ book: - visualization_1/visualization_1.qmd - visualization_2/visualization_2.qmd - sampling/sampling.qmd - # - intro_to_modeling/intro_to_modeling.qmd + - intro_to_modeling/intro_to_modeling.qmd # - constant_model_loss_transformations/loss_transformations.qmd # - ols/ols.qmd # - gradient_descent/gradient_descent.qmd diff --git a/docs/eda/eda.html b/docs/eda/eda.html index bf485fe7..246b651a 100644 --- a/docs/eda/eda.html +++ b/docs/eda/eda.html @@ -237,6 +237,12 @@ 9  Sampling + + @@ -325,7 +331,7 @@

Data Cleaning and EDA

-
+
Code
import numpy as np
@@ -390,7 +396,7 @@ 

5.1.1.1 CSV

CSVs, which stand for Comma-Separated Values, are a common tabular data format. In the past two pandas lectures, we briefly touched on the idea of file format: the way data is encoded in a file for storage. Specifically, our elections and babynames datasets were stored and loaded as CSVs:

-
+
pd.read_csv("data/elections.csv").head(5)
@@ -461,7 +467,7 @@