forked from data-8/data-8.github.io
-
Notifications
You must be signed in to change notification settings - Fork 0
/
resources.html
416 lines (388 loc) · 18.2 KB
/
resources.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
<!DOCTYPE html>
<html lang="en">
<head>
<title>Resources</title>
<!-- Latest compiled and minified Bootstrap CSS -->
<link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/3.3.7/css/bootstrap.min.css" integrity="sha384-BVYiiSIFeK1dGmJRAkycuHAHRg32OmUcww7on3RYdg4Va+PmSTsz/K68vbdEjh4u" crossorigin="anonymous">
<link rel="stylesheet" href="./theme/css/main.css" />
</head>
<body id="index" class="home">
<nav class="navbar navbar-default navbar-fixed-top">
<div class="container">
<div class="navbar-header">
<button type="button" class="navbar-toggle collapsed" data-toggle="collapse" data-target="#navbar" aria-expanded="false" aria-controls="navbar">
<span class="sr-only">Toggle navigation</span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
</button>
<a class="navbar-brand" href="./">
Data 8 Fall 2017
</a>
</div>
<div id="navbar" class="collapse navbar-collapse">
<ul class="nav navbar-nav">
<li><a href="./office-hours.html">Office Hours</a></li>
<li><a href="./policies.html">Policies</a></li>
<li class="active"><a href="./resources.html">Resources</a></li>
<li><a href="./schedule.html">Schedule</a></li>
<li><a href="./staff.html">Staff</a></li>
<li class="dropdown">
<a href="#" class="dropdown-toggle" data-toggle="dropdown" role="button" aria-haspopup="true" aria-expanded="false">Links <span class="caret"></span></a>
<ul class="dropdown-menu">
<li><a href="https://goo.gl/forms/zZVlZW6X2vxqCwrs1">Attendance</a></li>
<li><a href="http://data8.org/connector/">Connectors</a></li>
<li><a href="http://datahub.berkeley.edu/">DataHub</a></li>
<li><a href="http://data8.org/datascience/">Datascience Docs</a></li>
<li><a href="https://goo.gl/forms/Weh4GtljDPBmZ85r1">Feedback</a></li>
<li><a href="https://piazza.com/berkeley/fall2017/data8">Piazza</a></li>
</ul>
</li>
</ul>
</div><!--/.nav-collapse -->
</div>
</nav>
<div class="container">
<h2>Resources</h2>
<style>
th {
text-align: left
}
table > thead > tr > th, table > tbody > tr > th, table > tfoot > tr > th, table > thead > tr > td,
table > tbody > tr > td, table > tfoot > tr > td {
padding: 8px;
line-height: 1.42857143;
vertical-align: top;
border-top: 1px solid #ddd
}
table > thead > tr > th {
vertical-align: bottom;
border-bottom: 2px solid #ddd
}
table > caption + thead > tr:first-child > th, table > colgroup + thead > tr:first-child > th,
table > thead:first-child > tr:first-child > th, table > caption + thead > tr:first-child > td,
table > colgroup + thead > tr:first-child > td, table > thead:first-child > tr:first-child > td {
border-top: 0
}
table > tbody + tbody {
border-top: 2px solid #ddd
}
table {
border: 1px solid #ddd;
border-spacing: 0;
border-collapse: collapse;
background-color: #fff;
width: 100%;
max-width: 100%;
margin-bottom: 20px
}
td, th {
padding: 0
}
table > thead > tr > th, table > tbody > tr > th, table > tfoot > tr > th,
table > thead > tr > td, table > tbody > tr > td, table > tfoot > tr > td {
border: 1px solid #ddd
}
table > thead > tr > th, table > thead > tr > td {
border-bottom-width: 2px
}
table > tbody > tr:nth-child(odd) {
background-color: #f9f9f9
}
</style>
<h3>Exam Resources</h3>
<h4>Study Guides</h4>
<ul>
<li>The <a href="http://data8.org/data8assets/exam/data8-fa17-midterm-guide.pdf">midterm study guide</a> will be distributed with the midterm exam.</li>
<li>The <a href="http://data8.org/data8assets/exam/data8-fa17-final-guide.pdf">final study guide</a> will be distributed with the final exam, as will the <a href="http://data8.org/data8assets/exam/data8-fa17-midterm-guide.pdf">midterm study guide</a>.</li>
</ul>
<h4>Review Videos</h4>
<ul>
<li><a href="https://youtu.be/cUHGgEwMM8w">Fall 2017 Midterm Review Problems by Wilton Wu and Howard Ki</a></li>
<li>General reviews: <a href="https://www.youtube.com/watch?v=iDH9e82HOFs">day 1</a>.</li>
</ul>
<p>Content Specific Videos:</p>
<ul>
<li><a href="https://youtu.be/3VrXSWrPtmg">Tables and Programming Review by Carlos Ortega and Amir Shahatit</a></li>
<li><a href="https://youtu.be/7P6zJmMV7AE">Histograms Review by Suraj Rampure</a></li>
<li><a href="https://youtu.be/RF42s8dDgKg">Probability and Sampling Review by Vasilis Oikonomou and Tanay Nathan</a></li>
</ul>
<h4>Practice Exams</h4>
<p>Midterm:</p>
<p>Please note that Hypothesis Testing WILL NOT be on the Fall 2017 midterm. Some parts of the past midterms go over questions on hypothesis testing.</p>
<ul>
<li><a href="http://data8.org/data8assets/exam/data8-sp17-midterm.pdf">Spring 2017 midterm</a>,
<a href="http://data8.org/data8assets/exam/data8-sp17-midterm-solution.pdf">solution</a>, and
<a href="https://youtu.be/LPMzXlYf8n8">video walkthrough by Suraj Rampure</a>.</li>
<li><a href="http://data8.org/data8assets/exam/data8-sp17-practice.pdf">Spring 2017 practice midterm</a>,
<a href="http://data8.org/data8assets/exam/data8-sp17-practice-solution.pdf">solution</a>, and
<a href="https://youtu.be/i8dXfBnr1ng">video walkthrough by Erik Cheng</a>.
The practice midterm is the Spring 2016 midterm, but modified to only include topics covered in Spring 2017.</li>
<li><a href="http://data8.org/data8assets/exam/data8-su17-midterm.pdf">Summer 2017 midterm</a>,
<a href="http://data8.org/data8assets/exam/data8-su17-midterm-solution.pdf">solution</a>, and
<a href="https://youtu.be/mdHvHv_lqLk">video walkthrough by Suraj Rampure</a></li>
<li><a href="http://data8.org/data8assets/exam/data8-fa16-midterm.pdf">Fall 2016 midterm</a> and
<a href="https://youtu.be/LR8Zfa1JovA">video walkthrough by Suraj Rampure</a>.
This exam only includes questions on topics covered in Spring 2017.</li>
</ul>
<p>Finals:</p>
<ul>
<li><a href="http://data8.org/data8assets/exam/data8-sp17-final.pdf">Spring 2017 final</a>, <a href="http://data8.org/data8assets/exam/data8-sp17-final-solution.pdf">exam solution</a> and <a href="https://youtu.be/rZfGEibJ0Og">video walkthrough by Suraj Rampure</a>.</li>
<li><a href="http://data8.org/data8assets/exam/data8-sp17-practice-final.pdf">Spring 2017 practice final</a>,
<a href="http://data8.org/data8assets/exam/data8-sp17-practice-final-solution.pdf">solution</a>, and
<a href="https://youtu.be/oaNolUkZTL0">video walkthrough by Suraj Rampure</a>.
The practice final is the Spring 2016 final, but with modified solutions that use an updated version of the <code>datascience</code> module.</li>
<li><a href="http://data8.org/data8assets/exam/data8-fa16-final.pdf">Fall 2016 final</a> and
<a href="https://youtu.be/YeK-pRlZ5co">video walkthrough by Emma Jaeger</a>.</li>
</ul>
<h3>GSI-created Materials</h3>
<h4>Slides</h4>
<ul>
<li><a href="https://chengeaa.github.io/students">Discussion slides by Erik Cheng</a></li>
<li><a href="http://scottjlee.github.io/teaching.html">Discussion slides by Scott Lee</a></li>
<li><a href="http://surajrampure.com/data8.html">Discussion slides by Suraj Rampure</a></li>
<li><a href="https://drive.google.com/drive/folders/0B16mBAQVjBrucDVZUEszQ0pVXzg">Discussion slides by Vinitra Swamy</a></li>
<li><a href="https://drive.google.com/open?id=0B7pyDtabHBt_SFJlcmwwNExYTTQ">Discussion slides by Sahil Upadhyay</a></li>
<li><a href="http://emma-jaeger.com/resources.html">Discussion slides by Emma Jaeger</a></li>
</ul>
<h4>Other Guides</h4>
<ul>
<li><a href="http://fhdkmrn.github.io/resources/cs8/Histogram_Extra_Practice.pdf">Additional histogram practice</a> and
<a href="http://fhdkmrn.github.io/resources/cs8/Histogram_Extra_Practice_Sols.pdf">solutions</a> by Supreme Overlord Professor Fahad Kamran</li>
<li><a href="http://www.surajrampure.com/resources/data8/tvd-walkthrough.html">Total Variation Distance discussion problem walkthrough by Suraj Rampure</a></li>
<li><a href="http://data8.org/data8assets/exam/stats_final_review.pdf">Final exam comprehensive statistics review by Francie McQuarrie</a></li>
</ul>
<h3>Tutoring Worksheets</h3>
<ul>
<li><a href="https://drive.google.com/drive/folders/0Byk73pF3jtu9bDhYVUpodmV0RVE?usp=sharing">Worksheets from tutoring sections</a> are available for review.</li>
</ul>
<p><br /></p>
<h3>Table Functions and Methods</h3>
<table>
<thead>
<tr>
<th>Name</th>
<th align="center">Chapter</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>Table</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/tables.html">5</a></td>
<td>Create an empty table, usually to extend with data</td>
</tr>
<tr>
<td><code>Table.read_table</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/tables.html">5</a></td>
<td>Create a table from a data file</td>
</tr>
<tr>
<td><code>with_columns</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/tables.html">5</a></td>
<td>Create a copy of a table with more columns</td>
</tr>
<tr>
<td><code>column</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/tables.html">5</a></td>
<td>Create an array containing the elements of a column</td>
</tr>
<tr>
<td><code>num_rows</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/tables.html">5</a></td>
<td>Compute the number of rows in a table</td>
</tr>
<tr>
<td><code>num_columns</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/tables.html">5</a></td>
<td>Compute the number of columns in a table</td>
</tr>
<tr>
<td><code>labels</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/tables.html">5</a></td>
<td>Lists the column labels in a table</td>
</tr>
<tr>
<td><code>select</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/tables.html">5</a></td>
<td>Create a copy of a table with only some of the columns</td>
</tr>
<tr>
<td><code>drop</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/tables.html">5</a></td>
<td>Create a copy of a table without some of the columns</td>
</tr>
<tr>
<td><code>relabel</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/tables.html">5</a></td>
<td>Modifies the existing table <em>in place</em>, changing the column heading in the first argument to the second</td>
</tr>
<tr>
<td><code>relabeled</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/tables.html">5</a></td>
<td>Returns a new table with the column heading in the first argument changed to the second</td>
</tr>
<tr>
<td><code>sort</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/1/sorting-rows.html">5.1</a></td>
<td>Create a copy of a table sorted by the values in a column. Defaults to ascending order unless "descending = True" is included</td>
</tr>
<tr>
<td><code>where</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/2/selecting-rows.html">5.2</a></td>
<td>Create a copy of a table with only the rows that match some <em>predicate</em></td>
</tr>
<tr>
<td><code>take</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/05/2/selecting-rows.html">5.2</a></td>
<td>Create a copy of the table with only the rows whose indices are in the given array</td>
</tr>
<tr>
<td><code>scatter</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/06/visualization.html">6</a></td>
<td>Draw a scatter plot consisting of one point for each row of the table.</td>
</tr>
<tr>
<td><code>plot</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/06/visualization.html">6</a></td>
<td>Draw a line graph consisting of one point for each row of the table.</td>
</tr>
<tr>
<td><code>barh</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/06/1/visualizing-categorical-distributions.html">6.1</a></td>
<td>Draws a bar chart of the frequencies of a categorical distribution</td>
</tr>
<tr>
<td><code>hist</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/06/2/visualizing-numerical-distributions.html">6.2</a></td>
<td>Draws a histogram of a numerical distribution</td>
</tr>
<tr>
<td><code>apply</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/07/1/applying-a-function-to-a-column.html">7.1</a></td>
<td>Returns an array of values resulting from applying a function to some column in a table</td>
</tr>
<tr>
<td><code>group</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/07/2/classifying-by-one-variable.html">7.2</a></td>
<td>Create a copy of the table with all rows with the same values in a certain column aggregated into one row in the new table</td>
</tr>
<tr>
<td><code>groups</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/07/3/cross-classifying-by-more-than-one-variable.html">7.3</a></td>
<td>Create a copy of the table with all rows with the same value in a certain array of columns aggregated into one row in the new table</td>
</tr>
<tr>
<td><code>pivot</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/07/3/cross-classifying-by-more-than-one-variable.html">7.3</a></td>
<td>Create a copy of the table with a column for each element in the first argument and a row for each element in the second argument and aggregates values</td>
</tr>
<tr>
<td><code>join</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/07/4/joining-tables-by-columns.html">7.4</a></td>
<td>Create a copy of the table that is the result of joining the columns of two tables, with a row for each shared value in the two tables</td>
</tr>
<tr>
<td><code>sample</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/09/empirical-distributions.html">9</a></td>
<td>Draws some number of rows at random from a table. By default, with replacement.</td>
</tr>
<tr>
<td><code>sample_from_distribution</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/10/1/jury-selection.html">10.1</a></td>
<td>Returns a new table with an additional column whose values correspond to a random sample (of specified size) based on proportions in a specified column.</td>
</tr>
</tbody>
</table>
<p><br /></p>
<h3>Array Functions and Methods</h3>
<table>
<thead>
<tr>
<th>Name</th>
<th align="center">Chapter</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>max</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/03/3/call-expressions.html">3.3</a></td>
<td>Returns the maximum value of an array</td>
</tr>
<tr>
<td><code>min</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/03/3/call-expressions.html">3.3</a></td>
<td>Returns the minimum value of an array</td>
</tr>
<tr>
<td><code>sum</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/03/3/call-expressions.html">3.3</a></td>
<td>Returns the sum of the values in an array</td>
</tr>
<tr>
<td><code>len</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/03/3/call-expressions.html">3.3</a></td>
<td>Returns the length (number of elements) of an array</td>
</tr>
<tr>
<td><code>make_array</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/04/4/arrays.html">4.4</a></td>
<td>Makes a numpy array with the values passed in</td>
</tr>
<tr>
<td><code>np.average</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/04/4/arrays.html">4.4</a></td>
<td>Returns the mean value of an array</td>
</tr>
<tr>
<td><code>np.diff</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/04/4/arrays.html">4.4</a></td>
<td>Returns a new array of size len(arr)-1 with elements equal to the difference between adjacent elements</td>
</tr>
<tr>
<td><code>np.sqrt</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/04/4/arrays.html">4.4</a></td>
<td>Returns an array with the square root of each element</td>
</tr>
<tr>
<td><code>np.arange</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/04/5/ranges.html">4.5</a></td>
<td>Returns an array of an end-exclusive range of variable step size</td>
</tr>
<tr>
<td><code>arr.item</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/04/6/more-on-arrays.html">4.6</a></td>
<td>Returns the i-th item in an array (remember Python indices start at 0!)</td>
</tr>
<tr>
<td><code>np.random.choice</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/08/randomness.html">8</a></td>
<td>Picks one (by default) or some number 'n' of items from an array at random. By default, with replacement.</td>
</tr>
<tr>
<td><code>np.count_nonzero</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/08/randomness.html">8</a></td>
<td>Returns the number of non-zero (or True) elements in an array.</td>
</tr>
<tr>
<td><code>np.append</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/08/2/iteration.html">8.2</a></td>
<td>Returns a copy of the input array with some item (must be the same type as the other entries in the array) appended to the end.</td>
</tr>
<tr>
<td><code>percentile</code></td>
<td align="center"><a href="https://www.inferentialthinking.com/chapters/11/1/percentiles.html">11.1</a></td>
<td>Returns the corresponding percentile of an array.</td>
</tr>
</tbody>
</table>
</div>
<footer id="contentinfo" class="body">
</footer><!-- /#contentinfo -->
<!-- jQuery (necessary for Bootstrap's JavaScript plugins) -->
<script src="https://ajax.googleapis.com/ajax/libs/jquery/1.12.4/jquery.min.js"></script>
<!-- Latest compiled and minified Bootstrap JavaScript -->
<script src="https://maxcdn.bootstrapcdn.com/bootstrap/3.3.7/js/bootstrap.min.js" integrity="sha384-Tc5IQib027qvyjSMfHjOMaLkfuWVxZxUPnCJA7l2mCWNIpG9mGCD8wGNIcPD7Txa" crossorigin="anonymous"></script>
</body>
</html>